HANDS ON ANALYSIS SESSION

Introduction to Analysis of REMD simulations

Thomas E. Cheatham III, Daniel R. Roe, Christina Bergonzo & Rodrigo Galindo-Murillo

The goal here is to perform initial processing of a set of Replica Exchange Molecular Dynamics (REMD) simulations. CPPTRAJ can process multiple runs with the 'ensemble' command, which reads in and processes trajectories as an ensemble. It is similar to 'trajin remdtraj', except that instead of processing one frame at a target temperature, it processes all frames. This means that action and trajout commands apply to the entire ensemble; note, however, that not all actions currently function in 'ensemble' mode.

This example uses the same system as the clustering tutorial, the RNA GACC tetranucleotide. The simulations analyzed here are temperature REMD runs.

The files for this tutorial consist of the AMBER topology file and eight separate trajectory files that were generated at different temperatures.

Note: You can also get the files using wget:

wget http://www.amber.utah.edu/PRACE-BioExcel-Seasonal-School-2019/hremd/FILENAME

Note that although the input is provided in files, users are encouraged to use the interactive mode to become more familiar with the CPPTRAJ workflow and command options.

The following CPPTRAJ script will read the REMD set of trajectories and perform some analysis on the data:

parm rGACC.nowat.parm7
ensemble rGAAC.nowat.001
strip :Na+
rms RNA first :1-4&!@H= mass out rmsd.dat
average avg.pdb :1-4

The 'ensemble' command in the script above is given only the first trajectory, rGAAC.nowat.001. CPPTRAJ then locates and reads the rest of the series, provided that the file names end in a serial number. As with any ensemble, the strip, rms, and average commands that follow are applied to every trajectory in the set.
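Because the ensemble is located by file name, it can help to confirm that every member of the series is present before starting CPPTRAJ. The following is a minimal Python sketch of that check, not a description of how CPPTRAJ searches internally. The helper names replica_filenames and missing_replicas are hypothetical, and the sketch assumes that the series simply counts upward from the first file while keeping the same zero padding.

```python
import re
from pathlib import Path


def replica_filenames(first: str, count: int) -> list[str]:
    """Return `count` file names that continue the numeric extension of `first`."""
    match = re.fullmatch(r"(.*\.)(\d+)", first)
    if match is None:
        raise ValueError(f"{first!r} does not end in a numeric extension")
    prefix, digits = match.groups()
    start, width = int(digits), len(digits)
    # Assumption: the series counts upward and keeps the zero padding
    # of the first name (001 -> 002 -> ...).
    return [f"{prefix}{start + i:0{width}d}" for i in range(count)]


def missing_replicas(first: str, count: int, directory: str = ".") -> list[str]:
    """List the expected replica files that are not present in `directory`."""
    return [name for name in replica_filenames(first, count)
            if not (Path(directory) / name).is_file()]
```

For the eight trajectories of this tutorial, missing_replicas("rGAAC.nowat.001", 8) should return an empty list when run in the directory that holds the downloaded files.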

We will briefly look at each option in more detail:

Once your input has been read in, type run to begin trajectory processing and analysis. CPPTRAJ will create the files requested by the script. The file rmsd.dat contains 9 columns: the first holds the frame number, and the other 8 hold the RMSD values for each of the trajectories read by the ensemble command. There should also be 8 PDB files named avg.pdb.X (where X = 0 to 7), each containing the average structure of one individual trajectory.
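A quick way to compare the trajectories is to average each RMSD column of rmsd.dat. The following is a minimal Python sketch, assuming that the file is whitespace-delimited, that its header line starts with '#', and that the first column is the frame number as described above. The function name mean_rmsd_per_replica is hypothetical and is not part of CPPTRAJ.

```python
from statistics import mean


def mean_rmsd_per_replica(lines):
    """Return the mean of every data column that follows the frame column."""
    columns = None
    for line in lines:
        fields = line.split()
        # Skip blank lines and the header, which is assumed to start with '#'.
        if not fields or fields[0].startswith("#"):
            continue
        values = [float(x) for x in fields[1:]]  # fields[0] is the frame number
        if columns is None:
            columns = [[] for _ in values]
        if len(values) != len(columns):
            raise ValueError("inconsistent number of columns in input")
        for column, value in zip(columns, values):
            column.append(value)
    # An input with no data rows yields an empty list.
    return [mean(column) for column in (columns or [])]
```

The function accepts any iterable of lines, so an open file handle for rmsd.dat can be passed directly. The returned list follows the column order of the file, so with the output described above it should hold 8 values, one per trajectory.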

Copyright Thomas E. Cheatham III, Christina Bergonzo, Daniel Roe & Rodrigo Galindo-Murillo, 2015