ToxValDB is a large compilation of in vivo toxicology data and risk assessment values. The database originated in response to the need for consistently annotated and computable toxicology data for use in the development and validation of non-animal new approach methods (NAMs). The database has two major components. The first, ToxValDB Stage contains data that closely match data from each source, in both structure and terminology. The second (the main ToxValDB database) maps all source data to a consistent structure and set of vocabularies. The current version of the database (9.6.0) contains 238,617 records covering 39,669 chemicals from over 40 sources.
This repository contains the iput files and R scripts used to generate the files used for DCAP analysis using ToxValDB v9.6.0 data.
- CompTox Chemicals Dashboard
- Navigate to the "Hazard Data" sidebar tab from a chemical page
- Note: Current CCD version is ToxValDB v9.5.0, it will be updated in Fall 2024
- US EPA Clowder Repository
- Versioned releases of ToxValDB in XLSX and MySQL dump file format with associated documentation
- Clone repository
> git clone
- Set up a .Renviron file based on the "Example_Renviron.txt" file
datapath
- Directory path to where the "data" folder is being read from/written to. Default is the repository folder itself.db_user
- Username to an instance of the ToxValDB database - Note: At this time, the ToxValDB database is only available for internal EPA users or from setting up a local copy using the SQL Database Dump file in the linked Clowder repository above.db_pass
- Password to an instance of the ToxValDB databasedb_server
- Host/server for an instance of the ToxValDB databasedb_port
- Port for the host/servertoxval.db
- Name of the ToxValDB database (default of res_toxval_v96)
- Open an RStudio session using the
toxvaldbBMDh.Rproj
file - Load the package using
devtools::load_all()
- Install missing packages as needed. See
NAMESPACE
file
- Install missing packages as needed. See
- Open the
R/driver.R
script - Run the
driver()
function with desired parameters - Wait for export to run (~10-20 minutes depending on how many DTXSID values are included or filtered out)
- Review
data/results/*run_name*
subfolder that is produced. Notable files are:- /results/ToxValDB for BMDh res_toxval_v96.xlsx
- Full dataset pulled from ToxValDB as initially qualifying records
- /results/ToxValDB for BMDh res_toxval_v96 POD filtered.xlsx
- Full dataset filtered to a single POD per study_group
- /results/ToxValDB for BMDh res_toxval_v96 removed entries.xlsx
- List of records filtered out from the POD filtered file with removal reason
- /results/ToxValDB for BMDh res_toxval_v96.xlsx
Submit any questions to Taylor Wall (wall.taylor@epa.gov) or Chelsea Weitekamp (weitekamp.chelsea@epa.gov)