GitHub - databricks-industry-solutions/semantic-caching: This project implements a caching system for Databricks, designed to improve response times and reduce cost for frequently asked questions or similar queries.

Business Problem

Generative AI systems are transforming industries, with methods like Retrieval Augmented Generation (RAG) and Compound AI systems at the forefront. These systems enhance tasks like information retrieval, decision-making, and content generation but can come with high computational costs.

Semantic caching is a technique widely used to reduce the computational demands of AI systems by storing previously processed queries and responses. This prevents redundant computations for similar queries, improving efficiency by reducing latency and server load. It is especially valuable for scaling agentic applications, where query variations are less common.

Databricks offers an ideal platform for building AI agents with semantic caching through its Mosaic AI solution. It provides integrated components like a vector database, agent framework / evaluation, all governed centrally. This solution accelerator implements semantic caching, optimizing response times and reducing computational overhead for similar queries.

Reference Architecture

Vector Lab - Optimise RAG applications with semantic caching on Databricks

Authors

ryuta.yoshimatsu@databricks.com, nehme.tohme@databricks.com, ellen.hirt@databricks.com

Project support

Please note the code in this project is provided for your exploration only, and are not formally supported by Databricks with Service Level Agreements (SLAs). They are provided AS-IS and we do not make any guarantees of any kind. Please do not submit a support ticket relating to any issues arising from the use of these projects. The source in this project is provided subject to the Databricks License. All included or referenced third party libraries are subject to the licenses set forth below.

Any issues discovered through the use of this project should be filed as GitHub Issues on the Repo. They will be reviewed as time permits, but there are no formal SLAs for support.

License

© 2024 Databricks, Inc. All rights reserved. The source in this notebook is provided subject to the Databricks License [https://databricks.com/db-license-source]. All included or referenced third party libraries are subject to the licenses set forth below.

Name		Name	Last commit message	Last commit date
Latest commit History 28 Commits
.github/workflows		.github/workflows
chain		chain
data		data
image		image
00_introduction.py		00_introduction.py
01_data_preparation.py		01_data_preparation.py
02_rag_chatbot.py		02_rag_chatbot.py
03_rag_chatbot_with_cache.py		03_rag_chatbot_with_cache.py
04_evaluate.py		04_evaluate.py
05_cache_eviction.py		05_cache_eviction.py
99_init.py		99_init.py
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE.md		LICENSE.md
NOTICE.md		NOTICE.md
README.md		README.md
RUNME.md		RUNME.md
SECURITY.md		SECURITY.md
cache.py		cache.py
config.py		config.py
requirements.txt		requirements.txt
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Business Problem

Reference Architecture

Vector Lab - Optimise RAG applications with semantic caching on Databricks

Authors

Project support

License

About

Releases

Packages

Contributors 3

Languages

License

databricks-industry-solutions/semantic-caching

Folders and files

Latest commit

History

Repository files navigation

Business Problem

Reference Architecture

Vector Lab - Optimise RAG applications with semantic caching on Databricks

Authors

Project support

License

About

Topics

Resources

License

Security policy

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages