This project enables exploration of a large scale fragment-based chemoproteomics screening data.
Note
Browse web apps that we deployed in an external cloud server (or) run apps locally by forking the code from GitHub (or) download Screening and Competition data for your own research.
Important
web app will sleep if inactive for few days 😴; if a web app is inactive, a Restart this Space
button will appear within the web app. Click that button to reload the app 🤗
Web App |
Code Repo | DOI |
---|
Explore the interactions between ligands and proteins. This is the main navigator to the chemoproteomics data, containing profiles for 407 fragments. The app also showcases competition assays for a few selected fragments
Web App |
Code Repo | DOI |
---|
With this tool, you can in put a set of proteins and see how many fragments interact with them according to our chemoproteomics data. We categorize proteins by their promiscuity/specificity levels to easily detect reported effect such as labeling bias
Web App |
Code Repo | DOI |
---|
Explore fragment profiles from an enrichment perspective. We capture protein annotations of multiple scopes, from domains and families to molecular functions and cellular localization. We offer global and detailed views for each fragment
Web App |
Code Repo | DOI |
---|
Predict whether your fully-functionalized fragment of interest is likely to be promiscuous or associated with a specific interactome signature. We have predefined 10 interactome signatures capturing high-level biological processes that emerged from our chemoproteomics data
Web App |
Code Repo | DOI |
---|
Build a machine learning model on the fly to predict potential interactions between your fully-functionalized fragments and proteins of interest. Sets of proteins are accepted and organized in coherent subsets to maximize the chance of obtaining a good model
This project was led by Georg Winter's group at CeMM
Cite: Offensperger et al., Science (2024)
Additional code repositories associated with the project:
Data Analysis & Paper Figures
- primary-paper-figures: Executable notebooks to produce the main paper figures. DOI.
- interactome-signatures: Executable notebooks for discovering interactome signatures from the primary screening data. DOI
Model Training
- fragment-embedding: Train and predict fully-functionalized fragment descriptors. DOI
- mini-automl: Train small machine learning models using TabPFN. Includes scripts for promiscuity models and interactome signature models. DOI
- mini-xai: Notebook for the Shapley value analysis. DOI
- Interactions web app
Do i have to clear existing text to search for a new Protein?
No. To change selected protein, there is no need to select whole existing term or delete or type new. Just place the mouse cursor in the search box and just start to type new protein!; old text is automatically cleared
I search for a gene (example, DDB1) but did not find it in Protein search box
Names like caspase or ddb1 appear many times in description of proteins, therefore search engine is not able to pick the correct one. Type specific input such as DNA damage-binding protein 1 or DDB1 DNA dama.. to find DDB1. Search with UniProt Accession (Q16531 for DDB1) is better!
I am interested in a specific Fragment (C008) but it is not displayed in the Gen1 search box
Gen1 Fragment selection menu is dynamically updated based on the selected Protein and threshold filters applied. If you are interested in a specific Gen1 Fragment, then clear all filters, i.e., select 'no filter' option for P values, adjusted P alues and filterSet (fS) and/or select a promiscuous Protein such as TOMM22
Changing the threhold for _P_ values, adjusted _P_ values and filterSet (fS) does not change the plot
That is correct. Plots are static. They are not dynamically updated based on threshold values. Changing the threhold for P values, adjusted P values and filterSet (fS) only updates the Table. Pre- set (i.e. default) filterSet (fS or fS2) were used to create plots
What was the criteria to label a Protien or Fragment as Promiscuous ?
10% hit/regulated ratio for Proteins and 5% for Fragments")
Can i change the order of Gen1 Fragments displayed in the search box menu?
Order of Fragments displayed in search menu matches the original order in the Table. Re-adjusting the table, for example: sort the table based on Foldchange (Fc) values or number of Protein/Fragment hits will not change the order of the Fragments in the search menu
Changing the threhold for P values, adjusted P values and filterSet (fS) updates the original Table, consequently results in a change in the order of Gen1 Fragment displayed in search menu
How can i explore second generation (Gen2) Fragment data?
Gen2 Fragment data is available for only a small subset of Gen1 Fragments. Gen2 data is automatically displayed if the selected Gen1 Fragment has elaborates. For example, select C027 in Gen1 search box
- Explore Protein-sets app
How to resolve error message 'More than one input type has been provided!'?
Please only choose one of the options, i.e. input proteins manually, or select a pre-screened Ligand, or upload a file
- Fragment Predictor web app
Explain Promiscuity and Ontology Prediction models
- We built two types of Promiscuity models. Three Type 1 (Global) and nine Type 2 (Specific) models
- We built 10 Ontology models.
- Area Under Receiver Operating Characteristic (AUROC) of > 0.75 is considered a good model 👍
Global | num. Proteins | AUROC |
---|---|---|
Prom-0 | 100+ | 0.837 👍 |
Prom-1 | 200+ | 0.800 👍 |
Prom-2 | 300+ | 0.713 |
Specific | num. Proteins | num. Fragments | AUROC |
---|---|---|---|
Prom-0-0 | 5+ | < 4 | 0.818 👍 |
Prom-0-1 | 10+ | 4 -> 40 | 0.809 👍 |
Prom-0-2 | 50+ | > 40 | 0.857 👍 |
Prom-1-0 | 10+ | < 4 | 0.771 |
Prom-1-1 | 15+ | 4 -> 40 | 0.855 👍 |
Prom-1-2 | 100+ | > 40 | 0.858 👍 |
Prom-2-0 | 50+ | < 4 | 0.771 |
Prom-2-1 | 100+ | 4 -> 40 | 0.743 |
Prom-2-2 | 200+ | > 40 | 0.814 👍 |
Model name | Ontology | AUROC |
---|---|---|
Sign-0 | Endoplasmic reticulum | 0.872 👍 |
Sign-1 | RNA binding | 0.727 |
Sign-2 | Lysosome | 0.938 👍 |
Sign-3 | Proteasome complex | 0.616 |
Sign-4 | Mitochondria | 0.519 |
Sign-5 | Nucleous | 0.702 |
Sign-6 | Transmembrane transporter | 0.756 |
Sign-7 | Microtubule | 0.557 |
Sign-8 | Organic acid binding | 0.706 |
Sign-9 | Envelope | 0.806 👍 |