Skip to content
This repository has been archived by the owner on Mar 13, 2024. It is now read-only.

Latest commit

 

History

History
171 lines (101 loc) · 11.7 KB

README.md

File metadata and controls

171 lines (101 loc) · 11.7 KB

THIS REPOSITORY IS NO LONGER MAINTAINED

psfgen is now replaced by pestifer

psfgen -- Advanced scripts for generating configurations and CHARMM36 topologies for use with NAMD

This repository contains psfgen scripts, TcL scripts for use in VMD, and some associated CHARMM36 topology and parameter files and NAMD config files used to generate initial conditions for production MD simulations of various systems. It should be helpful to anyone already familiar with using psfgen to build systems using CHARMM topologies who also needs access to some more advanced system-building capabilities than are available in the psfgen tutorial. It is strongly recommended that users have familiarity with the psfgen plugin via the excellent user guide. Some of the features this repository provides beyond what psfgen can easily do are the following:

  • support for rudimentary loop model-building to include residues missing in a PDB file but present in the crystallized protein sequence;
  • support for glycans and non-covalently linked sugars and other ligands, including grafting
  • integration between solvation and initial MD simulation config file (easy transfer of box size);
  • support for down-puckered prolines;
  • de-novo membrane-building using packmol

The src directory contains files that can be sourced by psfgen scripts. The charmm directory contains some custom topologies and parameters derived from the July, 2016 charmm36 parameter set. Other directory names indicate the PDB entry for which the files contained therein are applicable.

The repository is being updated continuously. Issue git pull in your local copy often to keep it up-to-date. The master branch should be functional.

Requirements

  1. Software:

    • NAMD v. 2.13 or higher.
    • VMD 1.9.3 or higher.
    • Python 3.x
    • packmol
    • swig, tcl, and tcl headers/devel
    • gcc
  2. CHARMM36 topologies and parameters

Instructions

  1. Clone this repository; if you do this from your home directory, then this creates the ~/psfgen directory.

  2. Set environment variables in ~/.bashrc (adjust pathnames as necessary):

export PSFGEN_BASEDIR=${HOME}/psfgen
export CHARMRUN=${HOME}/namd/NAMD_2.14_Source/Linux-x86_64-g++/charmrun
export NAMD2=${HOME}/namd/NAMD_2.14_Source/Linux-x86_64-g++/namd2

You can also set the variable VMD to point to a specific VMD executable if it is not in your path.

  1. Put CHARMM36 parameters in ~/charmm/toppar:
mkdir ~/charmm
cd ~/charmm
tar zxf toppar_c36_jul20.tgz
ln -s toppar_c36_jul20 toppar

If you put them somewhere else, use the environment variable TOPPARDIR in your ~/.bashrc to specify.

CHARMM topologies are used in the workflow steps in which PSF files are generated, and CHARMM parameters are used during NAMD simulations. At least two files are incompatible with VMD/psfgen: toppar_water_ions.str and top_all36_carb.rtf. Slightly modified versions of these two files appear in the $PSFGEN_BASEDIR/charmm directory. These are based on the July 2020 version of CHARMM force field.

  1. Compile the bondstruct.so module for the loop-minimizing Monte Carlo:
$ cd $PSFGEN_BASEDIR
$ mkdir lib
$ cd src
$ make bondstruct.so
  1. Add the following line to the end of your ~/.vmdrc file:
source $env(PSFGEN_BASEDIR)/scripts/vmdrc.tcl

Structures

  1. 3TGQ -- unliganded core monomeric HIV-1 gp120, with NAG's;

  2. 4ZMJ -- unliganded soluble trimeric HIV-1 Env gp140, with glycans;

  3. 5FUU -- soluble, cleaved JR-FL HIV-1 trimer, with glycans; option to build Man9's on several sites

  4. 1HHP -- apo, dimeric HIV-1 protease;

  5. 2MB5 -- carbonmonoxymyoglobin;

  6. 1F7A -- HIV-1 protease with bound substrate peptide;

  7. 1HIW -- HIV-1 matrix (MA) trimer;

  8. 4H8W -- HIV-1 gp120 core clade A/E, with options to implement any number of the following mutations: (1) S375H (to bring back to clade A/E WT), (2) H61Y, (3) Q105H, (4) V108I, or (5) NIK474-476DMR, all of which represent the clade-C sequence;

  9. 5VN3 -- HIV-1 gp140 SOSIP trimer with bound sCD4 and FAB 17b; option to dock small-molecule CD4 mimetic BNM-III-170

  10. 3G9R -- HIV-1 gp41 membrane proximal external region (MPER) in an engineered coiled-coil trimer

  11. 5GGR -- Nivolumab (``Opdivo'') Fab in complex with PD-1, with options for not including PD-1, and including either sucrose or D-mannitol exipients

  12. 1L2Y -- Trp cage miniprotein

  13. SUCR -- a single solvated sucrose molecule extracted from 5o8l.pdb

  14. 2JIU -- Human eGFR kinase, T790M mutant, ATPMg-bound

  15. ALAD -- a single solvated alanine dipeptide with neutral ends (RESI ALAD in charmm36)

  16. MTL -- a single solvated D-mannitol molecule extracted from 1m2w.pdb

  17. B529 -- a single solvated BMS-529 molecule extracted from 5u7o.pdb, using CGenFF

  18. 5U7O -- HIV-1 gp140 SOSIP trimer with bound BMS-529 entry inhibitor, including glycans but not including Fabs. There is also a version of this molecule with a representative BMS-derived DAVEI inhibitor model-built into the structure (5u7o-davei-l7).

  19. 3PTB -- Trypsin/benzamidine (latter parameterized using CGenFF)

  20. 2EZN -- Cyanovirin-N, with an option to create a CVN-(G4S)n-H6-MPER DAVEI

  21. 2YHH -- Microvirin, with options to create the MVN(Q81K/M83R)-(G4S)n-H6-[MPER|Trp3] DAVEI

  22. BNM -- a single molecule of BNM-III-170 extracted from 5f4p.pdb, using CGenFF, in a water box

  23. 5F4P -- HIV-1 gp210 core with small-molecule CD4 mimic BNM-III-170 bound

  24. 5JYN -- HIV-1 gp41 transmembrane domain triple-helix embedded in a DMPC bilayer

  25. 5VN8 -- HIV-1 gp140 SOSIP trimer in the b12-bound "open" conformation, with option to model-in MPER helices and the small-molecule entry-inhibitor BNM-III-170.

  26. 2K7W -- BAX proapoptotic protein with option to include bound BIM SAHB peptide.

  27. 3CP1 -- HIV-1 gp41 NHR/CHR six-helix bundle, with option to grow in MPER and TM, and membrane-embedding.

  28. EOH -- a single ethanol molecule extracted from 3TOD, using CGENFF

  29. GXG -- a single GXG tripeptide, where X can be any residue, with an option to make a mixture of any molarity of GXG with an ethanol/water cosolvent

  30. 6VSB -- Soluble, stabilized trimeric SARS-CoV-2 S spike protein complex, open

  31. 6VXX -- Soluble, stabilized trimeric SARS-CoV-2 S spike protein complex, closed

  32. 6VYB -- Soluble, stabilized trimeric SARS-CoV-2 S spike protein complex, open

  33. 6M0J -- Complex of SARS-CoV-2 S spike receptor binding domain and ACE2

  34. 6W41 -- Complex of SARS-CoV-2 S spike receptor binding domain and hAb CR3022

  35. 6WAQ -- Complex of SARS-CoV-1 S spike receptor binding domain and nanobody VHH-72

  36. 4BYH -- Sialylated IgG Fc

  37. 4B7I -- Human IgG Fc Bearing Hybrid-type Glycans

  38. 2WAH -- An IgG1 Fc Glycoform (MAN9GLCNAC2)

  39. 5X58 -- Soluble, stabilized trimeric SARS-CoV-1 S spike protein complex, closed

More to come...

Acknowledgments

  1. VMD and NAMD were developed at the Theoretical and Computational Biophysics Group at the NIH Center for Macromolecular Modeling and Bioinformatics at the University of Illinois at Urbana-Champaign. Please cite "W. Humphrey, A. Dalke, and K. Schulten. VMD -- Visual Molecular Dynamics. Journal of Molecular Graphics, 1996;14:33-38" (VMD) and "J. C. Phillips, R. Braun, W. Wang, J. Gumbart, E. Tajkhorshid, E. Villa, C. Chipot, R. D. Skeel, L. Kale, and K. Schulten. Scalable molecular dynamics with NAMD. Journal of Computational Chemistry, 2005;26:1781-1802" (NAMD).

  2. Packmol is a product of Leandro Martinez in the Institute of Chemistry at the University of Campinas. Please cite ``L. Martínez, R. Andrade, E. G. Birgin, J. M. Martínez. Packmol: A package for building initial configurations for molecular dynamics simulations. Journal of Computational Chemistry, 2009;30:2157-2164.''

  3. The CHARMM force field is used in building all topology files in this repository. Please cite: ``A. D. MacKerell, Jr., M. Feig, and C. L. Brooks, III. Extending the treatment of backbone energetics in protein force fields: limitations of gas-phase quantum mechanics in reproducing protein conformational distributions in molecular dynamics simulations. Journal of Computational Chemistry, 2004;25:1400-1415.''

  4. CGenFF is a web-based service for generating CHARMM General Forcefield parameters that are not already in the CHARMM36 force field. Please cite: ``K. Vanommeslaeghe, E. Hatcher, C. Acharya, S. Kundu, S. Zhong, J. Shim, E. Darian, O. Guvench, P. Lopes, I. Vorobyov, A. D. MacKerell Jr., CHARMM General Force Field: A Force field for Drug-Like Molecules Compatible with the CHARMM All-Atom Additive Biological Force Field, Journal of Computational Chemistry, 2010;31,671-690.''

  5. The RCSB is funded by grant DBI-1338415 from the National Science Foundation, the National Institutes of Health, and the US Department of Energy. Please cite: ``H.M. Berman, J. Westbrook, Z. Feng, G. Gilliland, T.N. Bhat, H. Weissig, I.N. Shindyalov, P.E. Bourne. The Protein Data Bank. Nucleic Acids Research, 2000;28:235-242. doi:10.1093/nar/28.1.235.''

  6. All codes and data in this repository have been made possible with partial support from NIH through grants AI084117, AI093248, GM115249, GM056550, and GM100472, the National Science Foundation through grants DMR-1207389 and MCB-1330205, and the US Army through grants W911NF-12-2-0022, W911-NF-13-1-0046, W911NF-12-R-0011, and W911NF-17-2-0227.

2017-2020, Cameron F Abrams

cfa22@drexel.edu