Entry point: OK Transformer website
A collection of self-contained notebooks on machine learning theory, engineering, and operations. I try to cover topics that frequently come up as building blocks for applications or further theory, as well as areas where I want to clarify my understanding or dig into details I simply find interesting.
The notebooks should ideally run end-to-end with reproducible results between runs. Exact output values may change due to external factors such as hardware differences and dataset versions, or implementation quirks like nondeterminism, but the conclusions should still generally hold. Please open an issue if you find otherwise 👀 (as I often do)!
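Within a notebook, run-to-run reproducibility mostly comes down to seeding every RNG in play. A minimal sketch, assuming the PyTorch stack listed below (the `seed_everything` name here is illustrative; Lightning ships a helper of the same name):

```python
import random

import numpy as np
import torch


def seed_everything(seed: int = 42) -> None:
    """Seed the stdlib, NumPy, and PyTorch RNGs (illustrative helper)."""
    random.seed(seed)        # Python stdlib RNG
    np.random.seed(seed)     # NumPy global RNG
    torch.manual_seed(seed)  # seeds the CPU and all CUDA RNGs


seed_everything(42)
```

Even with fixed seeds, some CUDA kernels remain nondeterministic, which is one reason exact output values can still drift between runs.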
To build the project locally:

```
git clone git@github.com:particle1331/ok-transformer.git && cd ok-transformer
pip install -U tox
tox -e build
```
The notebooks can be found in `docs/nb`.
To run them, create a venv using `pdm`:

```
pip install -U pdm
pdm install
```
Use the resulting `.venv` as the Jupyter kernel.
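To confirm a notebook is actually running on that kernel, check the interpreter path (the path in the comment is illustrative):

```python
import sys

# Should point inside the project's .venv, e.g.
# /path/to/ok-transformer/.venv/bin/python
print(sys.executable)
```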
The following libraries will be installed (see `pdm.lock`):
| Library              | Version     |
|----------------------|-------------|
| fastapi              | 0.111.0     |
| Flask                | 3.0.3       |
| keras                | 2.15.0      |
| lightning            | 2.3.0       |
| matplotlib           | 3.9.0       |
| mlflow               | 2.13.2      |
| numpy                | 1.26.4      |
| optuna               | 3.6.1       |
| pandas               | 2.2.2       |
| scikit-learn         | 1.5.0       |
| scipy                | 1.13.1      |
| seaborn              | 0.13.2     |
| tensorflow           | 2.15.1      |
| tensorflow-datasets  | 4.9.6       |
| tensorflow-estimator | 2.15.0      |
| torch                | 2.2.2       |
| torchaudio           | 2.2.2       |
| torchinfo            | 1.8.0       |
| torchmetrics         | 1.4.0.post0 |
| torchvision          | 0.17.2      |
| uvicorn              | 0.30.1      |
| xgboost              | 2.0.3       |
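A quick sanity check that the active environment matches these pins, sketched with the standard library's `importlib.metadata` (the package list here is just a sample):

```python
from importlib.metadata import version

for pkg in ["torch", "tensorflow", "scikit-learn", "numpy", "pandas"]:
    print(f"{pkg:<15} {version(pkg)}")
```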
Hardware used to run the notebooks:

```
GPU 0:            Tesla P100-PCIE-16GB
CPU:              Intel(R) Xeon(R) CPU @ 2.00GHz
Cores:            1
Threads per core: 2
L3 cache:         38.5 MiB
Memory:           15 GB
```
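The same information can be pulled from inside a notebook, e.g. (a sketch; the GPU lines assume a CUDA build of torch):

```python
import os

import torch

if torch.cuda.is_available():
    print(torch.cuda.get_device_name(0))  # e.g. Tesla P100-PCIE-16GB
    props = torch.cuda.get_device_properties(0)
    print(f"GPU memory: {props.total_memory / 2**30:.1f} GiB")

print(f"Logical CPUs: {os.cpu_count()}")
```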