Artifact for "Prompt-to-SQL Injection Attacks in LLM-Integrated Web Applications: Risks and Defenses"

Folder structure

  • RQ1: contains all files related to the replication of the RQ1 attacks
    • code: code for the replication of the RQ1 attacks
      • app-backend-agent: the Langchain backend using SQLDatabaseAgent
      • app-backend-chain: the Langchain backend using SQLDatabaseChain (a minimal sketch of this style of backend is shown after this list)
      • app-frontend: used to launch the Gradio chatbot frontend
      • postgres: a docker-compose file to launch the PostgreSQL database
      • pgadmin: a docker-compose file to launch the pgAdmin database manager for easy database inspection
    • prompts: contains the prompts used in the RQ1 attacks
  • RQ2: contains the code, prompts, and list of models used in RQ2
    • code: the same code, but with support for more models
    • prompts: contains the prompts used in RQ2, grouped by model
  • RQ3: contains the red team application testing code used in RQ3
    • automated: contains the code for the automated generation of malicious prompts
      • finetune: finetuned Mistral-7B model (code, dataset, and model)
      • generated_prompts: contains successful malicious prompts generated by the model
    • red-team: contains the code for the red team testing
      • apps: contains the code for the red team applications
      • backend: contains the code to launch the red team server backend
      • dataset: contains the prompts created by the red team
      • frontend: contains the code to launch the red team frontend server
      • prompt-db: a docker-compose file to launch the PostgreSQL database to save tester prompts
  • RQ4: contains the code for RQ4
    • langshield-langchain: implementation of Langshield in Langchain's SQL chain and agent (v0.1.0)
    • llm_guard: contains the code and evaluation data for the LLM Guard mitigation, as well as our evaluation results
      • eval: contains the LLM Guard evaluation code and data
        • all_prompts: evaluation of final LLM Guard implementation over 1120 prompts
        • detections: evaluation of intermediate LLM Guard implementation over 60 prompts
        • false_positives: evaluation of false positives of deberta-v3-base-prompt-injection over 120 prompts
      • final_implementation_standalone: contains the final standalone implementation of the LLM Guard component of Langshield
        • langchain: generic implementation using Langchain
        • openai: generic implementation using the OpenAI API directly
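For orientation, the following is a minimal, hypothetical sketch of what a SQLDatabaseChain-style backend such as `app-backend-chain` typically looks like. It is not the code from this repository: the connection string, model name, and example question are placeholders, and it assumes LangChain 0.1.x with the `langchain-experimental` and `langchain-openai` packages installed.

```python
# Minimal sketch of a SQLDatabaseChain-backed question-answering backend.
# Placeholder credentials and model name; not the repository's actual configuration.
from langchain_community.utilities import SQLDatabase
from langchain_experimental.sql import SQLDatabaseChain
from langchain_openai import ChatOpenAI

# Connect to a PostgreSQL instance (e.g., the one launched via docker-compose).
db = SQLDatabase.from_uri(
    "postgresql+psycopg2://postgres:postgres@localhost:5432/postgres"
)

llm = ChatOpenAI(model="gpt-3.5-turbo", temperature=0)

# The chain translates a natural-language question into SQL, executes it against
# the database, and summarizes the result for the user.
chain = SQLDatabaseChain.from_llm(llm, db, verbose=True)

answer = chain.run("How many users are registered?")
print(answer)
```

Because the chain executes the model-generated SQL directly against the database, a crafted user prompt can steer the generated query, which is the attack surface studied in the paper.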
