-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
2025/01/15 - Optimal transport intro #29
Comments
Here is a poor-quality version of the book: https://www.math.ucla.edu/~wgangbo/Cedric-Villani.pdf (The ebook is 70Euros, if someone got access to it please feel invited to share here) @agosztolai mentioned that if we want to prepare, we can read the introduction (correct @agosztolai?) |
It is in djvu format |
I built a PDF which allows copy-pasting (although mathematical symbols are not always correctly identified): cedric-villani-topics-in-optimal-transportation-2003_ocr.pdf |
This is a very nice article linking Flow Matching to Optimal Transport. I don't think I can cover this on Wednesday, but it suggests a few nice follow-up articles to round off our venture into probabilistic generative modelling. |
This website is rich in resources on OT. |
Hiya, this was a toughie, but I managed to compile a crash course on OT. I have uploaded my presentation and I highly recommend you take a quick look so that you can follow the discussion tomorrow. |
Found more slides on OT: |
One book I liked is "Optimal Transport for Applied Mathematicians" (https://math.univ-lyon1.fr/~santambrogio/OTAM-cvgmt.pdf). It is easier to understand than Villani, but not easy. |
Hi, sorry I just noticed that next week I need to be on a panel during the reading group. I can continue in two week’s time. Could someone present something instead of me? WGANs could be one paper, which uses the duality of Kantorovich and Rubinstein. I have not covered that, but that’s the next thing I wanted to talk about, so it’s a natural continuation. |
I can do it, I have slides on that topic from my time in Bonn anyways. I would add discrete optimal transport and solvers for that. |
Great, go ahead, please! You can also use my notes (if you want) to explain Kantorovich-Rubinstein duality. |
Hello, I wanted to remind you that when you learn new maths, the way you should do it is go through the main results and place them in your understanding. Knowing the proofs (details) comes only after. Please remember. Unless you want to do mathematical research in OT, I highly recommend you do not get into the technical details because you will make no progress (I guarantee) and will distract you from the big picture. Here's the big picture of Brenier's theorem: If you have a quadratic cost function c(x,y) = |x-y|^2, then the Monge and Kantorovich problems coincide. Additionally, you do not need to solve the OT problem - instead, you can just look for a convex potential that transports your density between the endpoints. This path will automatically be optimal. But because you were so curious, and because I wanted to show you how technical things get, I briefly looked into the question of the convexity assumption underlying Brenier's theorem. I refer you to Villani (AMS) to check the proofs if you want to understand this. The most standard setting requires the cost c(x,y) to be quadratic (Thm 2.32, so strictly convex and C_2(R)). However, there are various relaxations.
Of course, all of these convexity assumptions are taken in the c-sense, i.e., with respect to the cost function c in the sense of the Legendre-Fenchel transform. Therefore, the uniqueness of derivatives needs to be interpreted in terms of conjugacy with respect to the LF transform, which is called subdifferentiability. Brenier theorem states that under this condition there is a convex (not strictly) potential, whose gradient is the transport map. I hope this closes the discussion. I hope you appreciate that answering these questions is highly non-trivial (cannot be explained based on elementary calculus), so you just need to leave some details to pass to give yourself a chance to understand. |
Thank you again for the nice explanation. I agree that it is unnecessary to go into the proofs, however the conditions are nice to know, e.g. strict convexity in the cost function leads to an unique solution. For next week I will go over your slides concerning the Kantorovich-Rubinstein duality and then introduce the original Wasserstein GAN paper (https://proceedings.mlr.press/v70/arjovsky17a/arjovsky17a.pdf), Wasserstein GAN with Gradient Penalty (https://proceedings.neurips.cc/paper_files/paper/2017/file/892c3b1c6dccd52936e27cbd0ff683d6-Paper.pdf) and the discrete Wasserstein distance and solvers such as Sinkhorn (https://proceedings.neurips.cc/paper/2013/file/af21d0c97db2e27e13572cbf59eb343d-Paper.pdf). |
That's great @tpinetz, thanks a lot! I created a new issue for next week, based on your content. |
Thanks Adam for providing context. In general, I (and I think many others) am very happy to leave many details to pass. What's hard is to select them: When learning new concepts (which are all struggling to comprehend) there is no way to know, which of them will be important to understand underlying concepts vs. the ones that are just 'technical details'. Thereby, I do appreciate a lot if you (@agosztolai and other mathematicians) cut my (and other's) questions short as "not relevant". |
Video on Lagr. vs Eulerian flows (can't guarantee it's good): https://www.youtube.com/watch?v=DVB3hwfUr0w |
Hi GenAIers,
I will attempt to put together a crash course in OT theory for our January 15 session.
Foreseeably, the topics I will cover will be
Please note that this is only preliminary and I will likely reduce this list.
I will base most of the session on The Book, Topics in Optimal Transport, Villani AMS, but I will also consider various other sources.
The text was updated successfully, but these errors were encountered: