Transformers Meet Relational Databases

This repository (jakubpeleska/deep-db-learning) contains the framework and experiments discussed in the article Transformers Meet Relational Databases.

A study on integrating transformer architectures with relational databases via a modular message-passing framework, demonstrating enhanced performance.

About

The end-to-end design of the system streamlines the integration of deep learning methods into relational database settings. Any relational database can be attached to the pipeline through a simple connection string (with SQLAlchemy). Special care is given to databases of the CTU Relational repository, which are currently being further integrated with RelBench into a new dataset library. Furthermore, the system loads data from the DB (with Pandas), automatically analyzes its schema structure and column semantics, and efficiently loads and embeds the data into learnable (PyTorch Frame) tensor representations.
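The schema-analysis step described above can be sketched in a few lines. This is a minimal illustration, not the repository's code: to stay dependency-free it uses Python's built-in `sqlite3` module instead of SQLAlchemy and Pandas, and the tables, the data, and the helpers `build_demo_db` and `inspect_schema` are hypothetical names made up for this example.

```python
import sqlite3


def build_demo_db() -> sqlite3.Connection:
    """Create a tiny in-memory database with one foreign key (hypothetical data)."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        CREATE TABLE customer (
            id INTEGER PRIMARY KEY,
            name TEXT,
            age INTEGER
        );
        CREATE TABLE purchase (
            id INTEGER PRIMARY KEY,
            customer_id INTEGER REFERENCES customer(id),
            amount REAL
        );
        INSERT INTO customer VALUES (1, 'Ada', 36), (2, 'Bob', 41);
        INSERT INTO purchase VALUES (10, 1, 9.5), (11, 1, 20.0), (12, 2, 3.25);
        """
    )
    return conn


def inspect_schema(conn: sqlite3.Connection) -> dict:
    """Collect declared column types and foreign keys for every table."""
    schema = {}
    tables = [
        row[0]
        for row in conn.execute(
            "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name"
        )
    ]
    for table in tables:
        # PRAGMA table_info rows: (cid, name, type, notnull, dflt_value, pk)
        columns = {
            row[1]: row[2] for row in conn.execute(f"PRAGMA table_info({table})")
        }
        # PRAGMA foreign_key_list rows: (id, seq, table, from, to, ...)
        foreign_keys = [
            (row[3], row[2], row[4])
            for row in conn.execute(f"PRAGMA foreign_key_list({table})")
        ]
        schema[table] = {"columns": columns, "foreign_keys": foreign_keys}
    return schema


if __name__ == "__main__":
    for table, info in inspect_schema(build_demo_db()).items():
        print(table, info)
```

With SQLAlchemy, comparable information is available from the inspector returned by `sqlalchemy.inspect(engine)` (for example `get_table_names()` and `get_foreign_keys()`). Whether the framework uses exactly these calls is not stated here.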

The subsequent modular neural message-passing scheme operates on top of the (two-level) multi-relational hypergraph representation. Building this representation with PyTorch Geometric makes any of its modules readily usable, and together with the tabular transformers of PyTorch Frame this yields a vast range of combinations for instantiating the presented deep learning blueprint. One such instantiation is the proposed model DBFormer, illustrated below:

For more information, please read the paper or reach out to us directly!
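To make the message-passing idea from the About section more concrete, here is a toy sketch in plain Python. It is not DBFormer and not the repository's implementation: it covers only the row-to-row level (rows linked by foreign keys), uses a fixed mean aggregation with no learnable parameters, and all names and numbers (`row_embeddings`, `fk_edges`, `message_passing_round`) are hypothetical. In the framework itself, these roles are played by PyTorch Geometric modules and PyTorch Frame tabular transformers.

```python
from collections import defaultdict

# Hypothetical row embeddings: (table, primary key) -> feature vector.
row_embeddings = {
    ("customer", 1): [1.0, 0.0],
    ("customer", 2): [0.0, 1.0],
    ("purchase", 10): [2.0, 2.0],
    ("purchase", 11): [4.0, 0.0],
    ("purchase", 12): [0.0, 6.0],
}

# Foreign-key links as typed edges: edge type -> list of (source row, target row).
fk_edges = {
    ("purchase", "customer_id", "customer"): [
        (("purchase", 10), ("customer", 1)),
        (("purchase", 11), ("customer", 1)),
        (("purchase", 12), ("customer", 2)),
    ],
}


def mean(vectors):
    """Element-wise mean of equally sized vectors."""
    count = len(vectors)
    return [sum(v[i] for v in vectors) / count for i in range(len(vectors[0]))]


def message_passing_round(embeddings, edges):
    """One round: average incoming messages per edge type, then add them to the row state."""
    inbox = defaultdict(lambda: defaultdict(list))
    for edge_type, pairs in edges.items():
        for src, dst in pairs:
            # Forward direction: referencing row -> referenced row.
            inbox[dst][edge_type].append(embeddings[src])
            # Reverse direction, treated as its own edge type.
            inbox[src][("rev",) + edge_type].append(embeddings[dst])

    updated = {}
    for node, state in embeddings.items():
        new_state = list(state)
        for messages in inbox.get(node, {}).values():
            aggregated = mean(messages)
            new_state = [s + a for s, a in zip(new_state, aggregated)]
        updated[node] = new_state
    return updated


if __name__ == "__main__":
    for node, vector in message_passing_round(row_embeddings, fk_edges).items():
        print(node, vector)
```

Keeping one inbox per edge type mirrors the multi-relational aspect: messages arriving over different foreign keys are aggregated separately before being combined, so a learnable model could apply a different transformation to each relation.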

Citation:

@misc{peleška2024transformersmeetrelationaldatabases,
      title={Transformers Meet Relational Databases}, 
      author={Jakub Peleška and Gustav Šír},
      year={2024},
      eprint={2412.05218},
      archivePrefix={arXiv},
      primaryClass={cs.LG},
      url={https://arxiv.org/abs/2412.05218}, 
}

Project Structure

db_transformer - the main module, containing:
- data - loading, analysis, conversion, and embedding
- db - connection, inspection, and schema detection
- nn - deep learning models, layers, training methods
experiments - the experiments presented in the paper, including baselines from:
- Tabular models
- Propositionalization
- Statistical Relational Learning
- Neural-symbolic integration
scripts - additional helper scripts
