initial commit

aws-samples · Mar 27, 2024 · da35c2e · da35c2e
1 parent 4c61d4c
commit da35c2e
Show file tree

Hide file tree

Showing 111 changed files with 24,375 additions and 18 deletions.
diff --git a/.DS_Store b/.DS_Store
diff --git a/CODE_OF_CONDUCT.md b/CODE_OF_CONDUCT.md
@@ -1,4 +1,5 @@
 ## Code of Conduct
+
 This project has adopted the [Amazon Open Source Code of Conduct](https://aws.github.io/code-of-conduct).
 For more information see the [Code of Conduct FAQ](https://aws.github.io/code-of-conduct-faq) or contact
 opensource-codeofconduct@amazon.com with any additional questions or comments.
diff --git a/CONTRIBUTING.md b/CONTRIBUTING.md
@@ -6,24 +6,23 @@ documentation, we greatly value feedback and contributions from our community.
 Please read through this document before submitting any issues or pull requests to ensure we have all the necessary
 information to effectively respond to your bug report or contribution.
 
-
 ## Reporting Bugs/Feature Requests
 
 We welcome you to use the GitHub issue tracker to report bugs or suggest features.
 
 When filing an issue, please check existing open, or recently closed, issues to make sure somebody else hasn't already
 reported the issue. Please try to include as much information as you can. Details like these are incredibly useful:
 
-* A reproducible test case or series of steps
-* The version of our code being used
-* Any modifications you've made relevant to the bug
-* Anything unusual about your environment or deployment
-
+- A reproducible test case or series of steps
+- The version of our code being used
+- Any modifications you've made relevant to the bug
+- Anything unusual about your environment or deployment
 
 ## Contributing via Pull Requests
+
 Contributions via pull requests are much appreciated. Before sending us a pull request, please ensure that:
 
-1. You are working against the latest source on the *main* branch.
+1. You are working against the latest source on the _main_ branch.
 2. You check existing open, and recently merged, pull requests to make sure someone else hasn't addressed the problem already.
 3. You open an issue to discuss any significant work - we would hate for your time to be wasted.
 
@@ -39,20 +38,19 @@ To send us a pull request, please:
 GitHub provides additional document on [forking a repository](https://help.github.com/articles/fork-a-repo/) and
 [creating a pull request](https://help.github.com/articles/creating-a-pull-request/).
 
-
 ## Finding contributions to work on
-Looking at the existing issues is a great way to find something to contribute on. As our projects, by default, use the default GitHub issue labels (enhancement/bug/duplicate/help wanted/invalid/question/wontfix), looking at any 'help wanted' issues is a great place to start.
 
+Looking at the existing issues is a great way to find something to contribute on. As our projects, by default, use the default GitHub issue labels (enhancement/bug/duplicate/help wanted/invalid/question/wontfix), looking at any 'help wanted' issues is a great place to start.
 
 ## Code of Conduct
+
 This project has adopted the [Amazon Open Source Code of Conduct](https://aws.github.io/code-of-conduct).
 For more information see the [Code of Conduct FAQ](https://aws.github.io/code-of-conduct-faq) or contact
 opensource-codeofconduct@amazon.com with any additional questions or comments.
 
-
 ## Security issue notifications
-If you discover a potential security issue in this project we ask that you notify AWS/Amazon Security via our [vulnerability reporting page](http://aws.amazon.com/security/vulnerability-reporting/). Please do **not** create a public github issue.
 
+If you discover a potential security issue in this project we ask that you notify AWS/Amazon Security via our [vulnerability reporting page](http://aws.amazon.com/security/vulnerability-reporting/). Please do **not** create a public github issue.
 
 ## Licensing
 

diff --git a/LICENSE b/LICENSE
@@ -14,4 +14,3 @@ FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR
 COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER
 IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN
 CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.
-
diff --git a/README.md b/README.md
@@ -1,11 +1,60 @@
-## My Project
+# Amazon Bedrock Samples
 
-TODO: Fill this README out!
+This repository contains sample code demonstrating various use cases leveraging Amazon Bedrock and Generative AI. Each sample is a separate project with its own directory, and includes a basic Streamlit frontend to help users quickly set up a proof of concept.
 
-Be sure to:
+## Samples
 
-* Change the title in this README
-* Edit your repository description on GitHub
+1. **Amazon-Bedrock-Summarization-Long-Document-POC**
+   This sample demonstrates using Amazon Bedrock and Generative AI to implement a long document summarization use case. Users can upload large PDF documents, which are chunked and summarized using Amazon Bedrock.
+
+2. **Amazon-Bedrock-RAG-OpenSearchServerless-POC**
+   This sample demonstrates creating custom embeddings stored in Amazon OpenSearch Serverless, and answering questions against the indexed embeddings using a Retrieval-Augmented Generation (RAG) architecture with Amazon Bedrock.
+
+3. **Amazon-Bedrock-RAG-Kendra-POC**
+   This sample implements a RAG-based architecture with Amazon Kendra, allowing users to ask questions against documents stored in an Amazon Kendra index using Amazon Bedrock.
+
+4. **Amazon-Bedrock-Image-Generation-POC**
+   This sample demonstrates using Amazon Bedrock and Generative AI to generate images based on text input requests.
+
+5. **Amazon-Bedrock-GenAI-Dynamic-Prompting-Explained-POC**
+   This sample provides a hands-on explanation of how dynamic prompting works in relation to Generative AI, using Amazon Bedrock.
+
+6. **Amazon-Bedrock-Document-Generator**
+   This sample demonstrates using Amazon Bedrock and Generative AI to perform document generation based on a document template and user-provided details.
+
+7. **Amazon-Bedrock-Document-Comparison-POC**
+   This sample allows users to upload two PDF documents and get a list of all changes between them using Amazon Bedrock and Generative AI.
+
+8. **Amazon-Bedrock-Claude3-Multi-Modal-Sample**
+   This sample showcases the multi-modal capabilities of Amazon Bedrock (specifically Anthropic Claude 3), allowing users to input text questions, images, or both to get comprehensive descriptions or answers.
+
+9. **Amazon-Bedrock-Chat-POC**
+   This sample provides a ChatGPT alternative using Amazon Bedrock and Generative AI, allowing users to ask zero-shot questions and receive responses.
+
+10. **Amazon-Bedrock-Amazon-Redshift-POC**
+    This sample demonstrates using Amazon Bedrock and Generative AI to ask natural language questions and transform them into SQL queries against Amazon Redshift databases.
+
+11. **Amazon-Bedrock-Amazon-RDS-POC**
+    This sample allows users to ask natural language questions and transform them into SQL queries against Amazon RDS databases using Amazon Bedrock and Generative AI.
+
+12. **Amazon-Bedrock-Amazon-Athena-POC**
+    This sample demonstrates using Amazon Bedrock and Generative AI to ask natural language questions and transform them into SQL queries against Amazon Athena databases.
+
+## Prerequisites
+
+- Amazon Bedrock Access and CLI Credentials
+- Python 3.9 or 3.10 installed on your machine
+- Additional prerequisites specific to each sample (e.g., Snowflake account, Amazon Kendra index, etc.)
+
+## Getting Started
+
+1. Clone the repository.
+2. Navigate to the desired sample directory.
+3. Set up a Python virtual environment and install the required dependencies.
+4. Configure the necessary environment variables (e.g., AWS credentials, database connections, etc.).
+5. Run the Streamlit application using the provided command.
+
+Detailed instructions for each sample are provided in their respective directories.
 
 ## Security
 
@@ -14,4 +63,3 @@ See [CONTRIBUTING](CONTRIBUTING.md#security-issue-notifications) for more inform
 ## License
 
 This library is licensed under the MIT-0 License. See the LICENSE file.
-
diff --git a/amazon-bedrock-amazon-athena-poc/LICENSE b/amazon-bedrock-amazon-athena-poc/LICENSE
@@ -0,0 +1,16 @@
+MIT No Attribution
+
+Copyright Amazon.com, Inc. or its affiliates. All Rights Reserved.
+
+Permission is hereby granted, free of charge, to any person obtaining a copy of
+this software and associated documentation files (the "Software"), to deal in
+the Software without restriction, including without limitation the rights to
+use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of
+the Software, and to permit persons to whom the Software is furnished to do so.
+
+THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
+IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS
+FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR
+COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER
+IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN
+CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.
diff --git a/amazon-bedrock-amazon-athena-poc/README.md b/amazon-bedrock-amazon-athena-poc/README.md
@@ -0,0 +1,111 @@
+# Amazon-Bedrock-Amazon-Athena-POC
+
+This is sample code demonstrating the use of Amazon Bedrock and Generative AI to use natural language questions to query relational data stores, specifically Amazon Athena. This example leverages the MOMA Open Source Database: https://github.com/MuseumofModernArt/collection.
+
+# **Goal of this Repo:**
+
+The goal of this repo is to provide users the ability to use Amazon Bedrock and generative AI to take natural language questions, and transform them into relational database queries against Amazon Athena.
+This repo comes with a basic frontend to help users stand up a proof of concept in just a few minutes.
+
+The architecture and flow of the sample application will be:
+
+![Alt text](images/architecture.png "POC Architecture")
+
+When a user interacts with the GenAI app, the flow is as follows:
+
+1. The user makes a request, asking a natural language question based on the database available in Amazon Athena to the GenAI app (app.py).
+2. This natural language question is passed into Amazon Bedrock, which takes the natural language question and creates a SQL query (amazon_athena_bedrock_query.py).
+3. The created SQL query is then executed against your Amazon Athena database to begin retrieving the data (amazon_athena_bedrock_query.py).
+4. The data is retrieved from your Amazon Athena Database and is passed back into Amazon Bedrock, to generate a natural language answer based on the retrieved data (amazon_athena_bedrock_query.py).
+5. The LLM returns a natural language response to the user through the streamlit frontend based on the retrieved data (app.py).
+
+# How to use this Repo:
+
+## Prerequisites:
+
+1. Amazon Bedrock Access and CLI Credentials.
+2. Amazon Athena Access and the ability to create a Amazon Athena database and tables.
+3. Ensure Python 3.10 installed on your machine, it is the most stable version of Python for the packages we will be using, it can be downloaded [here](https://www.python.org/downloads/release/python-3100/).
+
+## Step 1:
+
+The first step of utilizing this repo is performing a git clone of the repository.
+
+```
+git clone git@ssh.gitlab.aws.dev:gen-ai-field-playbook-pocs/amazon-bedrock-amazon-athena-poc.git
+```
+
+After cloning the repo onto your local machine, open it up in your favorite code editor.The file structure of this repo is broken into 4 key files,
+the app.py file, the amazon_athena_bedrock_query.py file, the moma_examples.yaml file, and the requirements.txt. The app.py file houses the frontend application (streamlit app).
+The amazon_athena_bedrock_query.py file contains connectors into your Amazon Athena database and the interaction with Amazon Bedrock through LangChains SQLDatabaseChain.
+The moma_examples.yaml file contains several samples prompts that will be used to implement a few-shot prompting technique. Last, the requirements.txt
+file has all the requirements needed to get the sample application up and running.
+
+## Step 2:
+
+Set up a python virtual environment in the root directory of the repository and ensure that you are using Python 3.9. This can be done by running the following commands:
+
+```
+pip install virtualenv
+python3.9 -m venv venv
+```
+
+The virtual environment will be extremely useful when you begin installing the requirements. If you need more clarification on the creation of the virtual environment please refer to this [blog](https://www.freecodecamp.org/news/how-to-setup-virtual-environments-in-python/).
+After the virtual environment is created, ensure that it is activated, following the activation steps of the virtual environment tool you are using. Likely:
+
+```
+cd venv
+cd bin
+source activate
+cd ../../
+```
+
+After your virtual environment has been created and activated, you can install all the requirements found in the requirements.txt file by running this command in the root of this repos directory in your terminal:
+
+```
+pip install -r requirements.txt
+```
+
+## Step 3:
+
+Now that all the requirements have been successfully installed in your virtual environment we can begin configuring environment variables.
+You will first need to create a .env file in the root of this repo. Within the .env file you just created you will need to configure the .env to contain:
+
+```
+profile_name=<AWS_CLI_PROFILE_NAME>
+aws_access_key_id=<AWS_ACCESS_KEY_ID>
+aws_secret_access_key=<AWS_SECRET_ACCESS_KEY>
+region_name=<AWS_REGION>
+database_name=<ATHENA_DATABASE_NAME>
+s3_staging_dir=<S3_STAGING_DIRECTORY_BUCKET_PATH>  example -> s3://sample-bucket/
+```
+
+Please ensure that your AWS CLI Profile has access to Amazon Bedrock!
+
+Depending on the region and model that you are planning to use Amazon Bedrock in, you may need to reconfigure lines 19-25 in the amazon_athena_bedrock_query.py file:
+
+```
+llm = Bedrock(
+    credentials_profile_name=os.getenv("profile_name"),
+    model_id="amazon.titan-text-express-v1",
+    endpoint_url="https://bedrock-runtime.us-east-1.amazonaws.com",
+    region_name="us-east-1",
+    verbose=True
+)
+```
+
+# Step 4
+
+If you would like to use this repo with the sample data, you will need to upload the two sample data files found in the sample data directory as two individual tables to your Amazon Athena Database.
+
+If you preferred to use your own database/tables in your Amazon Athena database, I would highly recommend reviewing the moma_examples.yaml file in the SampleData directory to see how prompts are constructed for this sample application and spend the time creating 5 - 10 prompts that resemble your dataset more closely.
+
+# Step 5
+
+At this point the application should be ready to go. To start up the application with its basic frontend you simply need to run the following command in your terminal while in the root of the repositories' directory:
+
+```
+streamlit run app.py
+```
+
+As soon as the application is up and running in your browser of choice you can begin asking natural language questions against your Amazon Athena Database.
Original file line number	Diff line number	Diff line change
Expand Up		@@ -14,4 +14,3 @@ FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR
		COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER
		IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN
		CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.