Submitted by nirnamous t3_11pq968 in deeplearning

Hey everyone,

I'm struggling with understanding mathematical proofs in research papers. I have a good grasp of basic concepts such as calculus (single variable calculus and basic knowledge of multi-variable calculus), linear algebra, and basic probability.

I was wondering if any of you could recommend some sources (preferably videos or lecture series) to help me become more familiar with advanced mathematical concepts found in research papers.

For example:(source)

https://preview.redd.it/m19pwqkwkdna1.png?width=1104&format=png&auto=webp&v=enabled&s=5cb83feec7e92d4e7f991f7c22cda8483c39c377

In papers, I have frequently encountered concepts like, KL divergence, mathematics in higher-dimensional space, hessian, topology, Random projections and many more;What are the subject/module names I need to study to confidently read and understand proofs in papers?

​

Thanks in advance!

16

Comments

You must log in or register to comment.

Nerveregenerator t1_jbzo0x3 wrote

Do problems involving the equations on paper and also read and copy down articles that are written on them.

0

RoboiosMut t1_jc00wmj wrote

I have an idea, maybe ask Chatgpt to make some stories to explain those abstract concepts

−3

deephugs t1_jc0lhog wrote

First try and understand every symbol in the equation, there are cheat sheets online. Second, most math concepts have a wikipedia page you can read, go down those rabbit holes and sooner or later you will find common threads and start to build an understanding. Finally, just put the time in, math is like everything else and just takes lots and lots of practice.

3

amhotw t1_jc0mf55 wrote

If you are serious, I would recommend working on Rudin's Principles of Math Analysis. It might take a day (or more...) to wrap your head around a single proof but at the end you'll be ready to read anything (of course you might need to check some definitions.)

For KL divergence, entropy etc., Info Theory book by Mackay is great.

For hessian, well it is just calculus; the second derivative of a multivariate function. To understand its uses, you would need some understanding of numerical analysis and concave programming. For the latter, Boyd's optimization book is a classic. I don't remember a good book on numerical analysis but some diff. eqn.s books have nice chapters on it.

8

amhotw t1_jc0r0nn wrote

I just meant this would take significant amount of time. I think it is impossible to do research in a quantitative field without understanding these so I would say it is well worth the investment. But most people are not concerned with research or even understanding the methods.

5

GufyTheLire t1_jc3evhf wrote

I've tried that once. Asked ChatGPT why L0, L1.. Ln norms, so seemingly different, were all named in a similar way. It correctly listed the norms' definitions and use cases, but failed to generalize the concept and made up some bullshit reason why they are named like that. Took me some time down the Wikipedia and Google rabbit hole to find out about Lp spaces and substitute different p values in the definition of p-norm to get the real reason

1