Skip to content

Commit

Permalink
Merge pull request #23 from JacksonBurns/paper/v3
Browse files Browse the repository at this point in the history
alphaXiv v2 Paper Revisions -> alphaXiv v3
  • Loading branch information
JacksonBurns authored Sep 23, 2024
2 parents 2a596ce + 3d0d1a6 commit 5d735e9
Show file tree
Hide file tree
Showing 3 changed files with 12 additions and 7 deletions.
10 changes: 7 additions & 3 deletions .github/workflows/paper.yml
Original file line number Diff line number Diff line change
Expand Up @@ -32,8 +32,12 @@ jobs:
- name: Build
run: |
cd paper
pandoc --citeproc --bibliography=paper.bib -s paper.md -o paper.pdf
- uses: actions/upload-artifact@v1
pandoc --citeproc -s paper.md -o paper.pdf --template default.latex --pdf-engine=pdflatex --pdf-engine-opt=-output-directory=foo
- uses: actions/upload-artifact@v4
with:
name: paper
path: paper/paper.pdf
path: paper/paper.pdf
- uses: actions/upload-artifact@v4
with:
name: build_files
path: paper/foo
9 changes: 5 additions & 4 deletions paper/paper.md
Original file line number Diff line number Diff line change
Expand Up @@ -89,9 +89,10 @@ All of this is not to say that DL has _never_ been applied to QSPR.
Applications of DL to QSPR, i.e. DeepQSPR, were attempted throughout this time period but focused on the use of molecular fingerprints rather than descriptors.
This may be at least partially attributed to knowledge overlap between deep learning experts and this sub-class of descriptors.
Molecular fingerprints are bit vectors which encode the presence or absence of human-chosen sub-structures in an analogous manner to the "bag of words" featurization strategy common to natural language processing.
It is reasonable to assume a DL expert may have bridged this gap to open this subdomain, and its effectiveness proved worthwhile.
In the review of DL for QSPR by Ma and coauthors [@ma_deep_qsar] claim that combinations of fingerprint descriptors are more effective than molecular-level descriptors, either matching our outperforming linear methods across a number of ADME-related datasets.
This study will later refute that suggestion.
Experts have bridged this gap to open this subdomain and proved its effectiveness.
In Ma and coauthors' review of DL for QSPR [@ma_deep_qsar], for example, it is claimed that DL with fingerprint descriptors is more effective than with molecular-level descriptors.
They also demonstrate that DL outperforms or at least matches classical machine learning methods across a number of ADME-related datasets.
The results of this study demonstrate that molecular-level descriptors actually _are_ effective and reaffirm that DL matches or outperforms baselines, in this case linear.

Despite their differences, both classical- and Deep-QSPR shared a lack of generalizability.
Beyond the domains of chemistry where many of the descriptors had been originally devised, models were either unsuccessful or more likely simply never evaluated.
Expand Down Expand Up @@ -270,7 +271,7 @@ Table: Summary of benchmark results. \label{results_table}
+---------------+--------------------+-------------+--------------+------------+-------------------------+------+
|Pgp |~1.3 |AUROC |0.94$^b$ |0.90 |0.89$^b$ | ~ |
+---------------+--------------------+-------------+--------------+------------+-------------------------+------+
|ARA |~0.8 |Accuracy |91$^c$ |0.88 |82* |0.083 |
|ARA |~0.8 |Accuracy |91$^c$ |88 |82* |0.083 |
+---------------+--------------------+-------------+--------------+------------+-------------------------+------+
|Flash |~0.6 |RMSE |13.2$^d$ |13.0 |21.2* |0.021 |
+---------------+--------------------+-------------+--------------+------------+-------------------------+------+
Expand Down
Binary file modified paper/paper.pdf
Binary file not shown.

0 comments on commit 5d735e9

Please sign in to comment.