Protein Refinement in Discovery Studio 1.7. Presented by: Francisco Hernandez-Guzman, PhD

Similar documents
Bioinformatics for molecular biology

Side-Chain Positioning CMSC 423

CS612 - Algorithms in Bioinformatics

H C. C α. Proteins perform a vast array of biological function including: Side chain

Introduction to proteins and protein structure

Accurate Reconstruction of All-Atom Protein Representations From Side-Chain-Based Low-Resolution Models

WHY MACROMOLECULAR STRUCTURE?

This exam consists of two parts. Part I is multiple choice. Each of these 25 questions is worth 2 points.

Transient β-hairpin Formation in α-synuclein Monomer Revealed by Coarse-grained Molecular Dynamics Simulation

Analysis of protein modeling for envelope glycoprotein GP120 for HIV via bioinformatics approaches

BBSI Lab Assignment for 6/15/2005. ** Turn in the answers to the questions on a separate piece of paper.

An Exact Algorithm for Side-Chain Placement in Protein Design

Strain in Protein Structures as Viewed Through Nonrotameric Side Chains: I. Their Position and Interaction

Protein contacts, inter-residue interactions and side-chain modelling.

Prediction of Protein Side-chain Rotamers from a Backbone-dependent Rotamer Library: A New Homology Modeling Tool

SUPPLEMENTARY INFORMATION. Computational Assay of H7N9 Influenza Neuraminidase Reveals R292K Mutation Reduces Drug Binding Affinity

Chem Lecture 2 Protein Structure

Accuracy of Side-Chain Prediction Upon Near-Native Protein Backbones Generated by Ab Initio Folding Methods

List of Figures. List of Tables

Trans-10,cis-12 Conjugated Linoleic Acid a Novel Inhibitor of Inflammatory Mediators

Minimizing and Learning Energy Functions for Side-Chain Prediction

Supplementary Figure 1 (previous page). EM analysis of full-length GCGR. (a) Exemplary tilt pair images of the GCGR mab23 complex acquired for Random

SUPPLEMENTARY INFORMATION

Chapter 3: Amino Acids and Peptides

Supplementary Table 1. Data collection and refinement statistics (molecular replacement).

Natural Flavonoids as CDK5 inhibitors: A Homology modeling and Molecular Docking Study

Comparing side chain packing in soluble proteins, protein-protein interfaces, and transmembrane proteins

Trilateral Project WM4

Table S1. X-ray data collection and refinement statistics

Amino Acids. Review I: Protein Structure. Amino Acids: Structures. Amino Acids (contd.) Rajan Munshi

Supplementary Information Janssen et al.

Molecular Dynamics Simulation Study Explaining Inhibitor Selectivity in Different Class of Histone Deacetylases

Minimizing and Learning Energy Functions for Side-Chain Prediction

Proteins and their structure

Protein Secondary Structure

PHARMACOPHORE MODELING

Identification of Structural Mechanisms of HIV-1 Protease Specificity Using Computational Peptide Docking: Implications for Drug Resistance

research papers Automated side-chain model building and sequence assignment by template matching 1. Introduction Thomas C.

Supplementary Figures

COMPARATIVE VEGF RECEPTOR TYROSINE KINASE MODELING FOR THE DEVELOPMENT OF HIGHLY SPECIFIC INHIBITORS OF TUMOR ANGIOGENESIS

Protein Primer, Lumry I, Chapter 10, Enzyme structure, middle

Qualitative and Quantitative analysis of 3D predicted arachidonate 15-lipoxygenase-B (15-LOX- 2) from Homo sapiens

Sialic acids and diabetes : impact of desialylation on insulin receptor s functionality

a) The statement is true for X = 400, but false for X = 300; b) The statement is true for X = 300, but false for X = 200;

Workshop on Analysis and prediction of contacts in proteins

Exploring Energy and Fitness Landscapes of Proteins

GPCR Structure-Based Virtual Screening Approach for CB2 Antagonist Search

Predicting Inactive Conformations of Protein Kinases Using Active Structures: Conformational Selection of Type-II Inhibitors

The role of Ca² + ions in the complex assembling of protein Z and Z-dependent protease inhibitor: A structure and dynamics investigation

Forward Looking Statements

7.014 Problem Set 2 Solutions

Improving amino-acid identification, fit and C ff prediction using the Simplex method in automated model building

Supporting material. Membrane permeation induced by aggregates of human islet amyloid polypeptides

BCH Graduate Survey of Biochemistry

Lecture 15. Membrane Proteins I

We are going to talk about two classifications of proteins: fibrous & globular.

EEG Features in Mental Tasks Recognition and Neurofeedback

Molecular Mechanics Studies of Enzyme Evolutionary Mechanisms

SUPPLEMENTARY INFORMATION

obtained for the simulations of the E2 conformation of SERCA in a pure POPC lipid bilayer (blue) and in a

ARTIFICIAL INTELLIGENCE FOR DIGITAL PATHOLOGY. Kyunghyun Paeng, Co-founder and Research Scientist, Lunit Inc.

Chapter 6. X-ray structure analysis of D30N tethered HIV-1 protease. dimer/saquinavir complex

MODELING AN SMT LINE TO IMPROVE THROUGHPUT

Ras and Cell Signaling Exercise

Prediction of temperature factors from protein sequence

Antigen Receptor Structures October 14, Ram Savan

Reviewers' Comments: Reviewer #2: Remarks to the Author:

Molecular docking analysis of UniProtKB nitrate reductase enzyme with known natural flavonoids

GPR55 is a newly de-orphanized cannabinoid receptor which belongs to the class A G-

Lecture 10 More about proteins

STATION 1 SCIENTIFIC INVESTIGATION VOCABULARY 2015 FALL BENCHMARK BIOLOGY

NIH Public Access Author Manuscript J Am Chem Soc. Author manuscript; available in PMC 2008 September 29.

Structural Analysis of TCRpMHC Complexes Using Computational Tools. Feroze Mohideen Briarcliff High School

VIRTUAL SCREENS WITH MULTIPLE RECEPTOR CONFORMATIONS. Rob Swift, UCI Wednesday, August 3, 2011

Molecular Graphics Perspective of Protein Structure and Function

Monitoring and optimising waste heat recovery for diesel-electric vessel propulsion. Eric Link LMS Imagine.Lab Amesim

Multiple-Choice Questions Answer ALL 20 multiple-choice questions on the Scantron Card in PENCIL

Amino Acids and Proteins Hamad Ali Yaseen, PhD MLS Department, FAHS, HSC, KU Biochemistry 210 Chapter 22

Premium solution for your health

Forward Looking Statements

Domain Flexibility in Retroviral Proteases: Structural Implications for Drug Resistant Mutations,

S system H system /T 0 ph = log([h + ]) E = mc 2 S = klnw G = H T S ph = pk a + log([a ]/[HA]) K a = [H + ][A ]/[HA] G = RTlnK eq e iπ + 1 = 0

PRC2 crystal clear. Matthieu Schapira

proteins Assessment of protein side-chain conformation prediction methods in different residue environments

Computational Modeling of the C-Terminal Glycan of Type IV Pilin

FCC2 5CY7 FCC1 5CY8. Actinonin 5CVQ

SUPPORTING INFORMATION FOR. A Computational Approach to Enzyme Design: Using Docking and MM- GBSA Scoring

Ch5: Macromolecules. Proteins

Chapter 7: Investigations into the Role of the Unusual Disulfide in the nachr Agonist Binding Site

SAM Teacher s Guide Four Levels of Protein Structure

Technical Note # TN-31 Redefining MALDI-TOF/TOF Performance

The Basics: A general review of molecular biology:

4. THE THREE-DIMENSIONAL STRUCTURE OF PROTEINS

Computer Simulation of T-Cell Activation

Detergent solubilised 5 TMD binds pregnanolone at the Q245 neurosteroid potentiation site.

APPLICATION OF MOLECULAR DYNAMICS SIMULATIONS IN THE STUDY OF PROTEIN-LIGAND INTERACTIONS

Lecture 4: 8/26. CHAPTER 4 Protein Three Dimensional Structure

Supporting Information

Mining Protein Structure Data

AutoLigand: A Tool for Finding Ligand Binding Sites. CUHK workshop Dec, 2011

Transcription:

Protein Refinement in Discovery Studio 1.7 Presented by: Francisco Hernandez-Guzman, PhD February 2007

Agenda Part I: Protein Refinement in Discovery Studio 1.7 (20mins) The problem Novel Algorithms at Accelrys Part II: DEMO (25mins) Conclusion 2

Importance and Difficulties Importance of side-chain and loop refinement X-ray Missing data Data is constrained crystal contacts Proteins are dynamics entities Homology modeling Inherent errors and bias from template(s) Missing data In-Silico Structure Based Drug Design Evidence indicates the need to consider flexibility to open active sites, and allow proper docking Difficulties in side-chain and loop refinement Intrinsic flexibility difficult combinatorial problem 1 peptide 2-3 rot. bonds. 3

Protein Refinement in Discovery Studio 1.7 Protein refinement tools can be used to: Correct errors in exp. structs. due to missing exp. data. Provide alternate starting conformations for docking Discovery Studio 1.7 provides two novel algorithms for: Side-chain refinement (rebuilding) ChiRotor 1 Loop refinement (rebuilding) Looper 2 ACCELRYS Results of loop refinement protocol CHARMm based 4 1. V. Spassov, et al. The dominant role of side-chain backbone interactions in structural realization of amino-acid code. ChiRotor: a side-chain prediction algorithm based on side-chain backbone interactions Protein Science 2007. 2. V. Spassov,, et al. LOOPER : A CHARMm based Algorithm for Loop prediction (in preparation)

Side-Chain Refinement using ChiRotor ChiRotor Optimizes: -side-chains conf. systematic CHARMm E search and minimization. Selects: -Best conf. based on CHARMm E. Fast ab intio approach independent on the starting structure Rebuilds the side-chains 5 Animated figure

ChiRotor Methodology Start loop for i from 1 to n Choose Residue i Construct complete structure using lowest energy conformer of each residue. 6 Protein 3D Structure Select a Set of n Residues For Refinement Remove all side chain atoms of selected residues Sample side chain conformations of residue i varying χ 1 Energy Minimize side chain atoms of residue i in CHARMm Save 2 Best Conformations for residue i End loop for i Output: 2n partial structures Energy minimize all selected side chains Start loop for i from 1 to n Replace side chain conformation i with the 2 nd conformer and energy minimize Accept the structure if energy is lower. End loop for i Output: 1 Lowest Energy structure

ChiRotor Results Compared ChiRotor to SCWRL and SCAP 1 Benchmark includes 24 high resolution X-ray crystal structures selected using PISCES 2 Resolution < 1.0Å Sequence identity < 20% ChiRotor More accurate in the core region Relatively fast Computation time is similar to SCAP (fast mode) and much faster (10 to 20 times) than SCAP (slow mode) Not based on a rotamer library Predicted conformations are already minimized by CHARMm thus no further refinement is necessary Method ChiRotor SCWRL SCAP (Fast mode) SCAP (Slow mode) Core RMSD 0.8 1.1 0.9 0.7 All RMSD 1.7 1.7 1.6 1.5 7 1. Xiang, Z. and Honig, B. 2000. Extending the accuracy limits of prediction for side-chain conformations. J. Mol. Biol. 311: 421-430. 2. R. L. Dunbrack Jr.* and M. Muquit http://dunbrack.fccc.edu/pisces.php

Slide hidden during ppt. ChiRotor Results The CHARMm polar force field is more accurate and faster CPU time scales close to O(N2) where N is the number of residues ChiRotor RMSD 2.5 80 ChiRotor CPU Time 2 CPU (min) 70 60 50 40 30 20 CHARMm-Polar-H CHARMm RMSD (Angstroms) 1.5 1 10 0 #res 53 58 64 81 107 123 129 224 233 269 321 24 proteins sorted by sequence length 363 0.5 0 #res 53 58 64 81 CHARMm-Polar-H, core residues CHARMm-Polar-H, all residues CHARMm, core residues CHARMm, all residues 107 123 129 224 233 269 321 363 24 proteins sorted by sequence length 8

Loop Refinement using Looper The Looper loop refinement algorithm Systematically searches for backbone conformation Uses CHARMm minimization Ranks the conformation using CHARMm energy, including solvation energy term Fast, ab intio approach independent of the starting structure Rebuilds the loop de novo Loop Refinement protocol and results in Discovery Studio 1.7 9

Looper Methodology Looper first constructs and optimizes the loop backbone Systematic search of loop conformation by sampling a minimum set of backbone dihedral angles φ and ψ The loop is divided into two halves and each half is constructed independently from the end of the loop and then combined without the side-chain atoms The loops are minimized by CHARMm and ranked by CHARMm energy Continued 10

Looper Methodology Next, Looper constructs the loop side chains and optimizes the loops Construct the side-chains using the ChiRotor and the loop conformations are ranked by CHARMm energies Finally, Looper re-ranks the conformations CHARMm energy minimizations in the first two steps are done without including the solvation energy term In this step, each top ranking loop conformation is re-scored by adding the solvation energy term calculated using Generalized Born approximation (in CHARMm) 11

Looper Results Compared Looper to Fiser 1 and Jacobson 2 publications Benchmark Test Data Sets were taken from Fiser s 1 work For each loop length, 40 loops from 40 different proteins are modeled RMSD were calculated between the model and experimental structure for backbone heavy atoms using the global RMSD definition by Fiser Jacobson s calculation are done in crystal environment which may result in better accuracy, but not practical for model building 12 1. Fiser et al. Prot.Sci., 9:1753-1773, (2000) 2. Jacobson et al. Protens., 55:351-367 (2004)

Looper Results (accuracy) Looper is more accurate than Fiser s Jacobson results are better for longer loops MODELER 9.0 loop refinement method (includes the DOPE function) is better than Fiser s RMSD* [Angstrom] 8 7 6 5 4 3 2 1 0 Looper 1 2 3 4 5 6 7 8 9 10 11 12 Loop Length [residues] * bb atom rmsd MODELER 6.0 (Fiser) Jacobson 13

Looper Results (speed) Looper results are similar to Jacobson s stage 1, but much faster (>100 times faster) than stages 2 and 3 Jacobson s stage 3 results are better for longer loops, but with a high computational cost (>100 times slower than looper) Fiser s timing is similar to Jacobson s methods 25 Looper 20 Time [min] 15 10 5 0 2 3 4 5 6 7 8 9 10 11 12 Number of Residues in Loop 14 Jacobson

15 DEMO

Conclusions One of the main obstacles to any method for protein structure prediction: Combinatorial problem In the case of loop and side chain optimizations: intelligent assumptions the difficulty is reduced, but still significant We demonstrated that it is possible to create efficient algorithms for loop (Looper) and side-chain (ChiRotor) optimizations by reducing the search to the minimal number of initial conformations of each aminoacid residue in combination with energy minimization Continued 16

Conclusions Continued The CHARMm scripts for side-chain and loop predictions are developed entirely on physical principles and do not use any bases of structural data or rotamer libraries The script realization of the algorithms makes it easy to include them as part of any CHARMm modeling protocol The accuracy of the methods is comparable to the accuracy of other known algorithms In the case of ChiRotor, we have achieved an extremely fast code without losing accuracy 17

Acknowledgements R&D Velin Spassov Lisa Yan Paul Flook Marc Fasnacht Jürgen Koska Marketing Dana Haley-Vicente Accelrys 18