We introduce a Transformer-based approach to computational enumerative geometry, specifically targeting the computation of ψ-class intersection numbers on the moduli space of curves.
To capture the recursive and hierarchical nature inherent in these intersection numbers, we propose the Dynamic Range Activator (DRA), a new activation function that enhances the Transformer's ability to model recursive patterns and handle severe heteroscedasticity. Given the precision requirements for computing these intersection numbers, we quantify the uncertainty of the predictions using conformal prediction with a dynamic sliding window.
Beyond simply computing intersection numbers, we explore the enumerative "world-model" of Transformers. Our interpretability analysis reveals that the network is implicitly modeling the Virasoro constraints in a purely data-driven manner. Moreover, through abductive hypothesis testing, probing, and causal inference, we uncover evidence of an emergent internal representation of the large-genus asymptotic of ψ-class intersection numbers.
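As a rough illustration of the probing component of such an analysis, the sketch below fits a linear probe on frozen hidden states to predict a scalar quantity of interest. The names hidden_states and targets are placeholders standing in for cached Transformer activations and the probed quantity; the actual probing targets and protocol are those described in the paper.

import torch
import torch.nn as nn

# Minimal linear-probe sketch: regress a quantity of interest from frozen
# hidden states. The tensors below are random placeholders; in practice they
# would be activations cached from the trained Transformer and the values
# being probed for (e.g. quantities tied to the large-genus behaviour).
hidden_states = torch.randn(1024, 512)   # (num_examples, hidden_dim), frozen
targets = torch.randn(1024, 1)           # (num_examples, 1), probe target

probe = nn.Linear(512, 1)                # the probe itself stays linear
opt = torch.optim.Adam(probe.parameters(), lr=1e-3)
loss_fn = nn.MSELoss()

for step in range(200):
    opt.zero_grad()
    loss = loss_fn(probe(hidden_states), targets)
    loss.backward()
    opt.step()

print(f"final probe MSE: {loss.item():.4f}")

A probe kept deliberately simple (linear, trained on frozen activations) is what lets low probe error be read as evidence that the information is already encoded in the representation rather than computed by the probe.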
This paper has been published at ICLR 2025. You can read it at:
https://openreview.net/forum?id=4X9RpKH4Ls
A learnable activation function, Dynamic Range Activator (DRA), designed for recursive and periodic data modalities.
pip install torch-dra
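For intuition, here is a minimal PyTorch sketch of a learnable activation in this spirit. The parameterization (the class name DRASketch, the per-channel parameters a and b, and the sinusoidal correction term) is illustrative only; it is neither the torch-dra package's API nor the exact DRA formula from the paper.

import torch
import torch.nn as nn

class DRASketch(nn.Module):
    # Illustrative learnable activation: an identity pass-through plus a
    # per-channel trainable sinusoidal correction, so the nonlinearity can
    # adapt its shape to recursive/oscillatory targets. This is a sketch,
    # not the exact DRA parameterization from the paper or torch-dra.
    def __init__(self, num_features: int):
        super().__init__()
        self.a = nn.Parameter(torch.ones(num_features))
        self.b = nn.Parameter(torch.ones(num_features))

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return x + self.a * torch.sin(self.b * x)

# Usage: drop-in replacement for a fixed activation in a Transformer MLP block.
act = DRASketch(num_features=512)
h = act(torch.randn(8, 16, 512))  # (batch, sequence, hidden)
print(h.shape)

The point of making the shape parameters learnable is that the activation's effective dynamic range is fitted to the data rather than fixed in advance, which is the motivation behind the DRA design.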
If you use this work, please cite it as:
@inproceedings{
hashemi2025can,
title={Can Transformers Do Enumerative Geometry?},
author={Baran Hashemi and Roderic Guigo Corominas and Alessandro Giacchetto},
booktitle={The Thirteenth International Conference on Learning Representations},
year={2025},
url={https://openreview.net/forum?id=4X9RpKH4Ls}
}