Reteti - Thanks and Credits

My beloved Argeia, thank you so much!
Your patience with my computing studies is my greatest help!

Many thanks to Svilen Stanoev who introduced me to the concept of partitioned datasets in object storage some years ago!
Thank you, Svilen!

Many thanks to the contributors of Apache Arrow, DuckDB and Hugging Face Tokenizers!

Many thanks to the teams of Fly.io, Tigris Data, MinIO, Cloudflare R2 and Hugging Face Datasets!

Many thanks to the creators of HarperDB, Inc.! Their system introduced me to the "exploded data model".
This paradigm heavily influenced the partitioned index of Reteti, which is never read in its entirety during search.

Python Modules

https://huggingface.co/docs/tokenizers/index
https://arrow.apache.org/docs/python/api.html
https://duckdb.org/docs/
https://min.io/docs/minio/linux/developers/python/API.html
https://huggingface.co/docs/huggingface_hub/index
https://www.gradio.app/docs

Services

https://fly.io/docs/
https://www.tigrisdata.com/docs/overview/

Dataset

https://commoncrawl.org/blog/news-dataset-available
https://huggingface.co/datasets/CloverSearch/cc-news-mutlilingual
https://huggingface.co/datasets/CloverSearch/data_article_count

Stack Overflow

https://stackoverflow.com/questions/2564137/how-to-terminate-a-thread-when-main-program-ends

AI

https://claude.ai/

Icon

https://www.svgrepo.com/svg/405198/giraffe