2025/01/15 - Optimal transport intro #29

agosztolai · 2024-12-29T10:13:00Z

Hi GenAIers,

I will attempt to put together a crash course in OT theory for our January 15 session.

Foreseeably, the topics I will cover will be

Monge and Kantorovich formulation of OT
Dual formulation
Dynamic formulation (Benamou-Brenier)
Variational formulation (Jordan-Kinderlehrer-Otto)
Entropic regularisation (Sinkhorn)
Displacement interpolation
Linking flow/score matching to OT

Please note that this is only preliminary and I will likely reduce this list.

I will base most of the session on The Book, Topics in Optimal Transport, Villani AMS, but I will also consider various other sources.

moritzschaefer · 2025-01-09T07:30:11Z

Here is a poor-quality version of the book: https://www.math.ucla.edu/~wgangbo/Cedric-Villani.pdf

(The ebook is 70Euros, if someone got access to it please feel invited to share here)

@agosztolai mentioned that if we want to prepare, we can read the introduction (correct @agosztolai?)

EmreTaha · 2025-01-09T17:01:00Z

It is in djvu format
cédric-villani-topics-in-optimal-transportation-2003.zip

moritzschaefer · 2025-01-10T08:01:48Z

I built a PDF which allows copy-pasting (although mathematical symbols are not always correctly identified):

cedric-villani-topics-in-optimal-transportation-2003_ocr.pdf

moritzschaefer · 2025-01-11T07:07:13Z

I particularly appreciated this statement in the book's overview:

If the reader is not familiar with the notion of Hausdorff dimension, we recommend that he/she just forget about it and skip those statements which involve this concept, since they are not of primary importance here.

(more of that please ^^)

agosztolai · 2025-01-12T14:45:03Z

This is a very nice article linking Flow Matching to Optimal Transport. I don't think I can cover this on Wednesday, but it suggests a few nice follow-up articles to round off our venture into probabilistic generative modelling.

agosztolai · 2025-01-14T08:16:59Z

This website is rich in resources on OT.

agosztolai · 2025-01-14T15:51:46Z

Hiya, this was a toughie, but I managed to compile a crash course on OT. I have uploaded my presentation and I highly recommend you take a quick look so that you can follow the discussion tomorrow.

animesh3008 · 2025-01-15T10:22:35Z

Found more slides on OT:
https://web.stanford.edu/~jblanche/presentations_slides/Tutorial_APS_2017.pdf

tpinetz · 2025-01-15T13:39:21Z

One book I liked is "Optimal Transport for Applied Mathematicians" (https://math.univ-lyon1.fr/~santambrogio/OTAM-cvgmt.pdf). It is easier to understand than Villani, but not easy.

agosztolai · 2025-01-15T14:24:51Z

Hi, sorry I just noticed that next week I need to be on a panel during the reading group. I can continue in two week’s time. Could someone present something instead of me?

WGANs could be one paper, which uses the duality of Kantorovich and Rubinstein. I have not covered that, but that’s the next thing I wanted to talk about, so it’s a natural continuation.

tpinetz · 2025-01-15T15:00:12Z

I can do it, I have slides on that topic from my time in Bonn anyways. I would add discrete optimal transport and solvers for that.

agosztolai · 2025-01-15T15:31:08Z

Great, go ahead, please! You can also use my notes (if you want) to explain Kantorovich-Rubinstein duality.

agosztolai · 2025-01-16T09:53:04Z

Hello,

I wanted to remind you that when you learn new maths, the way you should do it is go through the main results and place them in your understanding. Knowing the proofs (details) comes only after. Please remember. Unless you want to do mathematical research in OT, I highly recommend you do not get into the technical details because you will make no progress (I guarantee) and will distract you from the big picture.

Here's the big picture of Brenier's theorem: If you have a quadratic cost function c(x,y) = |x-y|^2, then the Monge and Kantorovich problems coincide. Additionally, you do not need to solve the OT problem - instead, you can just look for a convex potential that transports your density between the endpoints. This path will automatically be optimal.

But because you were so curious, and because I wanted to show you how technical things get, I briefly looked into the question of the convexity assumption underlying Brenier's theorem. I refer you to Villani (AMS) to check the proofs if you want to understand this.

The most standard setting requires the cost c(x,y) to be quadratic (Thm 2.32, so strictly convex and C_2(R)). However, there are various relaxations.

You can have strict convexity (Thm 2.44) and strict concavity (Thm 2.45).
You can have normal (non-strict) convexity as long as another condition is satisfied, as I mentioned. This is the twist condition, which is that the first derivates are injective. Thanks @shemlem for the input.

Of course, all of these convexity assumptions are taken in the c-sense, i.e., with respect to the cost function c in the sense of the Legendre-Fenchel transform. Therefore, the uniqueness of derivatives needs to be interpreted in terms of conjugacy with respect to the LF transform, which is called subdifferentiability.

Brenier theorem states that under this condition there is a convex (not strictly) potential, whose gradient is the transport map.

I hope this closes the discussion. I hope you appreciate that answering these questions is highly non-trivial (cannot be explained based on elementary calculus), so you just need to leave some details to pass to give yourself a chance to understand.

tpinetz · 2025-01-16T11:16:24Z

Thank you again for the nice explanation. I agree that it is unnecessary to go into the proofs, however the conditions are nice to know, e.g. strict convexity in the cost function leads to an unique solution.

For next week I will go over your slides concerning the Kantorovich-Rubinstein duality and then introduce the original Wasserstein GAN paper (https://proceedings.mlr.press/v70/arjovsky17a/arjovsky17a.pdf), Wasserstein GAN with Gradient Penalty (https://proceedings.neurips.cc/paper_files/paper/2017/file/892c3b1c6dccd52936e27cbd0ff683d6-Paper.pdf) and the discrete Wasserstein distance and solvers such as Sinkhorn (https://proceedings.neurips.cc/paper/2013/file/af21d0c97db2e27e13572cbf59eb343d-Paper.pdf).

moritzschaefer · 2025-01-17T15:41:31Z

That's great @tpinetz, thanks a lot! I created a new issue for next week, based on your content.

moritzschaefer · 2025-01-18T07:21:00Z

Thanks Adam for providing context. In general, I (and I think many others) am very happy to leave many details to pass.

What's hard is to select them: When learning new concepts (which are all struggling to comprehend) there is no way to know, which of them will be important to understand underlying concepts vs. the ones that are just 'technical details'.

Thereby, I do appreciate a lot if you (@agosztolai and other mathematicians) cut my (and other's) questions short as "not relevant".

dariarom94 · 2025-02-05T11:25:13Z

Video on Lagr. vs Eulerian flows (can't guarantee it's good): https://www.youtube.com/watch?v=DVB3hwfUr0w

moritzschaefer mentioned this issue Jan 17, 2025

2025/01/22 - OT Duality and Wasserstein GAN #30

Open

seasonsOfTheSun mentioned this issue Feb 12, 2025

Planning Future OT Sessions #38

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

2025/01/15 - Optimal transport intro #29

2025/01/15 - Optimal transport intro #29

agosztolai commented Dec 29, 2024

moritzschaefer commented Jan 9, 2025 •

edited

Loading

EmreTaha commented Jan 9, 2025

moritzschaefer commented Jan 10, 2025

moritzschaefer commented Jan 11, 2025 •

edited

Loading

agosztolai commented Jan 12, 2025

agosztolai commented Jan 14, 2025

agosztolai commented Jan 14, 2025 •

edited by moritzschaefer

Loading

animesh3008 commented Jan 15, 2025

tpinetz commented Jan 15, 2025

agosztolai commented Jan 15, 2025

tpinetz commented Jan 15, 2025

agosztolai commented Jan 15, 2025

agosztolai commented Jan 16, 2025

tpinetz commented Jan 16, 2025

moritzschaefer commented Jan 17, 2025

moritzschaefer commented Jan 18, 2025

dariarom94 commented Feb 5, 2025

2025/01/15 - Optimal transport intro #29

2025/01/15 - Optimal transport intro #29

Comments

agosztolai commented Dec 29, 2024

moritzschaefer commented Jan 9, 2025 • edited Loading

EmreTaha commented Jan 9, 2025

moritzschaefer commented Jan 10, 2025

moritzschaefer commented Jan 11, 2025 • edited Loading

agosztolai commented Jan 12, 2025

agosztolai commented Jan 14, 2025

agosztolai commented Jan 14, 2025 • edited by moritzschaefer Loading

animesh3008 commented Jan 15, 2025

tpinetz commented Jan 15, 2025

agosztolai commented Jan 15, 2025

tpinetz commented Jan 15, 2025

agosztolai commented Jan 15, 2025

agosztolai commented Jan 16, 2025

tpinetz commented Jan 16, 2025

moritzschaefer commented Jan 17, 2025

moritzschaefer commented Jan 18, 2025

dariarom94 commented Feb 5, 2025

moritzschaefer commented Jan 9, 2025 •

edited

Loading

moritzschaefer commented Jan 11, 2025 •

edited

Loading

agosztolai commented Jan 14, 2025 •

edited by moritzschaefer

Loading