Enhancing Sequence Coverage in Proteomics Studies by Using a Combination of Proteolytic Enzymes

Similar documents
Bioanalytical Quantitation of Biotherapeutics Using Intact Protein vs. Proteolytic Peptides by LC-HR/AM on a Q Exactive MS

Comparison of Full Scan MS2 and MS3 Linear Ion Trap Approaches for Quantitation of Vitamin D

The Investigation of Factors Contributing to Immunosuppressant Drugs Response Variability in LC-MS/MS Analysis

Quantitative Analysis of THC and Main Metabolites in Whole Blood Using Tandem Mass Spectrometry and Automated Online Sample Preparation

GC-MS/MS Analysis of Benzodiazepines Using Analyte Protectants

Dynamic Analysis of HIV-Human Protein-Protein Interactions During Infection

High-Throughput Quantitative LC-MS/MS Analysis of 6 Opiates and 14 Benzodiazepines in Urine

High-Throughput, Cost-Efficient LC-MS/MS Forensic Method for Measuring Buprenorphine and Norbuprenorphine in Urine

Integrated Targeted Quantitation Method for Insulin and its Therapeutic Analogs

Dayana Argoti, Kerry Hassell, Sarah J. Fair, and Joseph Herman Thermo Fisher Scientific, Franklin, MA, USA

MS/MS as an LC Detector for the Screening of Drugs and Their Metabolites in Race Horse Urine

PIF: Precursor Ion Fingerprinting Searching for a Structurally Diagnostic Fragment Using Combined Targeted and Data Dependent MS n

Nitrogen/Protein Determination in Starch by Flash Combustion using Large Sample Weight as an Alternative to the Kjeldahl Method

Using Multiple Mass Defect Filters and Higher Energy Collisional Dissociation on an LTQ Orbitrap XL for Fast, Sensitive and Accurate Metabolite ID

FT-Raman Surface Mapping of Remineralized Artificial Dental Caries

Thermo Fisher Scientific, Sunnyvale, CA, USA; 2 Thermo Fisher Scientific, San Jose, CA, USA

PTM Discovery Method for Automated Identification and Sequencing of Phosphopeptides Using the Q TRAP LC/MS/MS System

Figure S6. A-J) Annotated UVPD mass spectra for top ten peptides found among the peptides identified by Byonic but not SEQUEST + Percolator.

Thermo Scientific LipidSearch Software for Lipidomics Workflows. Automated Identification and Relative. Quantitation of Lipids by LC/MS

Evaluation of an LC-MS/MS Research Method for the Analysis of 33 Benzodiazepines and their Metabolites

A Fully Integrated Workflow for LC-MS/MS Analysis of Labeled and Native N-Linked Glycans Released From Proteins

Impurity Profiling of Carbamazepine by HPLC/UV

Enhanced LC-MS Sensitivity of Vitamin D Assay by Selection of Appropriate Mobile Phase

Multiplex Protein Quantitation using itraq Reagents in a Gel-Based Workflow

Learning Objectives. Overview of topics to be discussed 10/25/2013 HIGH RESOLUTION MASS SPECTROMETRY (HRMS) IN DISCOVERY PROTEOMICS

LC-MS/MS Method for the Determination of Tenofovir from Plasma

Measuring Phytosterols in Health Supplements by LC/MS. Marcus Miller and William Schnute Thermo Fisher Scientific, San Jose, CA, USA

Essential Lipidomics Experiments using the LTQ Orbitrap Hybrid Mass Spectrometer

LC-MS/MS Method for the Determination of 21 Opiates and Opiate Derivatives in Urine

Determination of Clinically Relevant Compounds using HPLC and Electrochemical Detection with a Boron-Doped Diamond Electrode

Quantification with Proteome Discoverer. Bernard Delanghe

Automating Mass Spectrometry-Based Quantitative Glycomics using Tandem Mass Tag (TMT) Reagents with SimGlycan

Identification and Quantitation of Microcystins by Targeted Full-Scan LC-MS/MS

Supporting Information. Lysine Propionylation to Boost Proteome Sequence. Coverage and Enable a Silent SILAC Strategy for

Improve Protein Analysis with the New, Mass Spectrometry- Compatible ProteasMAX Surfactant

NIH Public Access Author Manuscript J Proteome Res. Author manuscript; available in PMC 2014 July 05.

Babu Antharavally, Ryan Bomgarden, and John Rogers Thermo Fisher Scientific, Rockford, IL

Development of a Human Cell-Free Expression System to Generate Stable-Isotope-Labeled Protein Standards for Quantitative Mass Spectrometry

Don t miss a thing on your peptide mapping journey How to get full coverage peptide maps using high resolution accurate mass spectrometry

Increased Identification Coverage and Throughput for Complex Lipidomes

Proteomics of body liquids as a source for potential methods for medical diagnostics Prof. Dr. Evgeny Nikolaev

SPE-LC-MS/MS Method for the Determination of Nicotine, Cotinine, and Trans-3-hydroxycotinine in Urine

PosterREPRINT INTRODUCTION. 2-D PAGE of Mouse Liver Samples. 2-D PAGE of E.coli Samples. Digestion / Cleanup. EXPERIMENTAL 1-D PAGE of BSA Samples

Internal Calibration System of Thermo Scientific Varioskan Flash with Improved Sensitivity, Accuracy and Dynamic Range

Nature Biotechnology: doi: /nbt Supplementary Figure 1

Characterization of Disulfide Linkages in Proteins by 193 nm Ultraviolet Photodissociation (UVPD) Mass Spectrometry. Supporting Information

Mass Spectrometry and Proteomics - Lecture 4 - Matthias Trost Newcastle University

Biological Mass spectrometry in Protein Chemistry

Sequence Identification And Spatial Distribution of Rat Brain Tryptic Peptides Using MALDI Mass Spectrometric Imaging

Use of a Tandem Mass Spectrometry Research Method for the Analysis of Amino Acids and Acylcarnitines in Dried Blood Spots

Designer Fentanyls Drugs that kill and how to detect them. Cyclopropylfentanyl

Novel Glycan Column Technology for the LC-MS Analysis of Labeled and Native N-Glycans Released from Proteins and Antibodies

ApplicationNOTE EXACT MASS MEASUREMENT OF ACTIVE COMPONENTS OF TRADITIONAL HERBAL MEDICINES BY ORTHOGONAL ACCELERATION TIME-OF-FLIGHT.

for the Identification of Phosphorylated Peptides

Mass Spectrometry. Mass spectrometer MALDI-TOF ESI/MS/MS. Basic components. Ionization source Mass analyzer Detector

Unsupervised Identification of Isotope-Labeled Peptides

Advances in Hybrid Mass Spectrometry

Quantitative chromatin proteomics reveals a dynamic histone. post-translational modification landscape that defines asexual

Characterization of an Unknown Compound Using the LTQ Orbitrap

The distribution of log 2 ratio (H/L) for quantified peptides. cleavage sites in each bin of log 2 ratio of quantified. peptides

Carl Fisher, Terri Christison, Hua Yang, Monika Verma, and Linda Lopez Thermo Fisher Scientific, Sunnyvale, CA, USA

Detection and Quantification of Inorganic Arsenic in Fruit Juices by Capillary Ion Chromatography with Suppressed Conductivity Detection

[ Care and Use Manual ]

Nature Methods: doi: /nmeth.3177

Molecular Cell, Volume 46. Supplemental Information

Join the mass movement towards mass spectrometry

Solving One of Chromatography s Biggest Dilemmas Proper Sealing of Chromatography Autosampler Vials

Extended Mass Range Triple Quadrupole for Routine Analysis of High Mass-to-charge Peptide Ions

New Instruments and Services

Application Note # ET-17 / MT-99 Characterization of the N-glycosylation Pattern of Antibodies by ESI - and MALDI mass spectrometry

Improved Extraction and Analysis of Hexavalent Chromium from Soil and Water

2. Ionization Sources 3. Mass Analyzers 4. Tandem Mass Spectrometry

A Rapid UHPLC Method for the Analysis of Biogenic Amines and Metabolites in Microdialysis Samples

Introduction to Proteomics 1.0

Applying a Novel Glycan Tagging Reagent, RapiFluor-MS, and an Integrated UPLC-FLR/QTof MS System for Low Abundant N-Glycan Analysis

New Solvent Grade Targeted for Trace Analysis by UHPLC-MS

Peptide sequencing using chemically assisted fragmentation (CAF) and Ettan MALDI-ToF Pro mass spectrometry

Mercury Speciation Determinations in Asian Dietary Supplements

A New HILIC/RP Mixed-Mode Column and Its Applications in Surfactant Analysis

Supplementary Materials for

Shotgun metaproteomics of the human distal gut microbiota. Present by Lei Chen

Quantitation of Protein Phosphorylation Using Multiple Reaction Monitoring

Rapid and Direct Analysis of Free Phytosterols by Reversed Phase HPLC with Electrochemical Detection

Comparison of mass spectrometers performances

Protein Reports CPTAC Common Data Analysis Pipeline (CDAP)

Forensic and clinical products and services

Increased Efficiency of Biomolecule Identification by Optimization of Trypsin Digestion Buffers

DART MSI of drugs of abuse in hair

Shotgun Proteomics MS/MS. Protein Mixture. proteolysis. Peptide Mixture. Time. Abundance. Abundance. m/z. Abundance. m/z 2. Abundance.

Lecture 3. Tandem MS & Protein Sequencing

4-Plex itraq Based Quantitative Proteomic Analysis Using an Agilent Accurate -Mass Q-TOF

Double charge of 33kD peak A1 A2 B1 B2 M2+ M/z. ABRF Proteomics Research Group - Qualitative Proteomics Study Identifier Number 14146

An Alternative Approach: Top-Down Bioanalysis of Intact Large Molecules Can this be part of the future? Lecture 8, Page 27

Core-Shell Technology for Proteins and Peptides

New Solvent Grade Targeted for Trace Analysis by UHPLC-MS

Supporting information

Ultra Performance Liquid Chromatography Coupled to Orthogonal Quadrupole TOF MS(MS) for Metabolite Identification

Increasing Extraction Efficiency of Wet Samples Using a Novel New Polymer During Accelerated Solvent Extraction

Supporting Information Parsimonious Charge Deconvolution for Native Mass Spectrometry

User Guide. Protein Clpper. Statistical scoring of protease cleavage sites. 1. Introduction Protein Clpper Analysis Procedure...

Transcription:

Enhancing Sequence Coverage in Proteomics Studies by Using a Combination of Proteolytic Enzymes Dominic Baeumlisberger 2, Christopher Kurz 3, Tabiwang N. Arrey, Marion Rohmer 2, Carola Schiller 3, Thomas Moehring, Walter A. Möller 3, and Michael Karas 2 Thermo Fisher Scientific, Bremen, Germany, 2 Institute for Pharmaceutical Chemistry, Goethe-University, Frankfurt am Main, Germany, 3 Department of Pharmacology,Goethe-University, Frankfurt am Main, Germany

Overview Purpose: Increase sequence coverage and overall confidence of protein identification using a combination of datasets from three enzyme digests. Methods: Peptides generated by proteolytic digestion of mitochondrial membrane were analyzed using a hybrid quadrupole-orbitrap TM mass spectrometer. Results: Combination of datasets from multiple enzyme digests enabled improved sequence coverage of proteins, increased the total number of unique peptide and protein groups identified, and minimized false-positive discovery rates. Introduction Besides being the main site of adenosine triphosphate (ATP), mitochondria are associated with a range of other processes and diseases such as cell growth, cellular differentiation, mitochondrial disorder, aging processes and cardiac dysfunctions. To obtain a better understanding of these mitochondrial processes and diseases, we need to identify the proteins and proteins modifications involved. The ability to identify and characterize large numbers of proteins from medium- to high- complexity samples has made mass spectrometry (MS) coupled to reversedphase high-performance liquid chromatography (HPLC) a common analytical technique in proteomics. Usually, the extracted proteins are digested with a suitable protease and the resulting peptide mixture is separated and analyzed. Trypsin is the common enzyme of choice for proteomics experiments. Digestion with trypsin (or any single enzyme in general) often results in the identification of large numbers of proteins, but sequence coverage is frequently incomplete. If maximum sequence coverage is desired (e.g. when studying changes in protein modification or different isoforms), then signals covering all or most of the protein sequence are needed. Different approaches have been used to improve protein sequence coverage in proteomics. In this study, data obtained from individual trypsin, chymotrypsin and elastase digests were combined to significantly improve sequence coverage of proteins. Methods Sample Preparation Purified mitochondrial membrane proteins from mouse brain were dissolved in 25 mm triethylammonium bicarbonate buffer. Disulfide bridges were reduced in dithiothreitol, alkylated with iodoacetamide and digested over night with trypsin, chymotrypsin and elastase. Digestion was stopped by freezing at 20 C. Just before separation, each digest was labeled with the Thermo Scientific Amine-Reactive Tandem Mass Tag (TMT 0 ) Reagent, to improve fragmentation, especially of the elastase and chymotrypsin generated peptides. Liquid Chromatography Samples were loaded onto a Thermo Scientific Acclaim PepMap00 C8 pre-column (00 μm 2 cm, C8 5 μm, 00 Å), and separated on a reversed-phase Acclaim PepMap TM 00 C8 column (75 μm 5 cm, C8 3 μm, 20 Å) using the Thermo Scientific EASY-nLC 000 nanoflow HPLC. A 90 min gradient at a flow rate of 300 nl/min was used for the separation. Triplicate runs of individual enzyme digests were performed. Mass Spectrometry All MS and MS/MS spectra were acquired in positive ion mode using a Thermo Scientific Q Exactive hybrid quadrupole-orbitrap mass spectrometer. Full-scan data was obtained at a resolution of 70,000 (at m/z 200), demanding e 6 ions in the mass range 350 800 Da. For the tandem MS, e5 charges were required and the fragment ions were measured at a resolution of 7,500 (at m/z 200). The 0 most intensive ions in a spectrum were selected for fragmentation with a maximum injection time of 200ms. Data Analysis The raw data files were searched using Thermo Scientific Proteome Discoverer software v..3 with Mascot TM v. 2.2. search engine (Matrix Science Ltd, London UK). The peptide tolerance for MS was set at 5 ppm and for MS/MS 20 mmu. A highconfidence peptide filter with FDR of % was used. FIGURE 2. Peptides identified in tripli 2 Enhancing Sequence Coverage in Proteomics Studies by Using a Combination of Proteolytic Enzymes

Results The Q Exactive TM mass spectrometer provides not only rich fragmentation but also immonium ions, which are important for peptide correlation. Coupled with the high resolution and high mass accuracy in both MS and MS/MS, reliable identification is possible. This is especially very important for peptides generated using less-specific enzymes. Figure shows triplicate runs of individual enzyme digests. Reproducibility rates of 69.9%, 62.3 % and 58. 25 % were obtained for trypsin, elastase and chymotrypsin, respectively. However, at the peptide level, it decreased to 57%, 46.92 % and 42. 97 % (see Figure 2) respectively. FIGURE. Proteins identified in triplicate experiments of each enzyme digest. A common phenomenon which is observ enzymes such as elastase, is the absen terminus. Fragmentation of these peptid and an increase in internal fragment ion ions were generated. Figure 4 shows an IQGGVLAGDVTDVLLLDVTPL with mon FIGURE 2. Peptides identified in triplicate experiments of each enzyme digest. Thermo Scientific Poster Note PN63603_E 06/2S 3

In total 2,007 peptides from a combination of triplicate dataset of 3 enzyme digests were identified. As expected, no peptide common to all three enzyme digests was identified. Less than % of the total number of identified peptides were identified in two enzyme digests. As shown in Figure 3, mostly unique peptides were identified and common peptide sequences in most cases cover regions that could not be identified by one enzyme digest. While the shared peptides between trypsin /chymotrypsin and trypsin/elastase contained basically R and K amino acids at their C termini, 54.05 % of those shared between chymotrypsin and elastase were outside the define cleavage sites (Y, W, F, M, L) of chymotrypsin. Most of these peptides have A, V, L and S at their C-termini, typical cleavage sites for elastase. FIGURE 3. Venn diagram showing unique peptides identified from triplicates experiments in all 3 enzyme digest. As expected, no peptide identified was common to all three enzyme preparations. A common phenomenon which is observed with peptides generated by less-specific enzymes such as elastase, is the absence of charge localization at either the N- or C- terminus. Fragmentation of these peptides results in lack of extended b- or y-ion series and an increase in internal fragment ions. Due to the basic moiety (TMT 0 ), extended b- ions were generated. Figure 4 shows an example of a tandem MS of this peptide, IQGGVLAGDVTDVLLLDVTPL with monoisotopic mass of 2408.38506. h enzyme digest. FIGURE 4. Tandem MS and annotated spectrum of the peptide AIQGGVLAGDVTDVLLLDVTPL generated from elastase digest. b-/a-type ions are shown in red while y-type ions in blue colour. The mass deviation of this peptide was 0.0 ppm (IonScore: 36) in MS and below 0 ppm for fragment ions in MS/MS. A Intensity 0^6 2.0.5.0 0.5 0.0 00 90 80 y 2 + b 6 + b 7 + b b 5 + 0 + b 8 + b 2+ b 2+ 7 6 b 2+ 8 b b + 2 + b 3 + a 6 + y 3 + b 4 + b 9 + b 3 + b b 2+ + b 0 4 + a 7 +a 8 + b 2 + 300 400 500 600 700 800 900 000 00 200 300 400 500 600 700 m/z m/z FTMS + p NSI d Full ms2 204.69@hcd35.00 [20.00-2480.00] 755.4594 B 00.00 80.00 ΣCoverage Coverage (Trypsin) Relative Abundance 70 60 50 40 30 20 0 0 656.3893 229.54 434.7753 470.2938 542.3498 30.2065 868.5435 939.5805.6288 996.6036 20.6965 3.7445 426.775 525.847 200 400 600 800 000 200 400 600 m/z Sequence Coverage 60.00 40.00 20.00 0.00 2 4 6 8 Total number 4 Enhancing Sequence Coverage in Proteomics Studies by Using a Combination of Proteolytic Enzymes

In general, 992 protein groups were identified in all enzyme digests, of which 8.25% were mitochondrial membrane proteins. Approximately 33% of the total number of identified proteins were present in the combined dataset (Figure 5). This not only lead to a significant increase in the number of protein groups identified but also enhanced the overall sequence coverage. However, the sequence coverage varied from protein to protein. For example, 00% or close to 00% sequence coverage was achieved for the small proteins (>00 amino acid) NADH dehydrogenase [ubiquinone] alpha subcomplex subunit or cytochrome b-c complex subunit, while for larger proteins such as cytochrome b-c complex subunit 2 (> 400 amino acid) as shown in Figure 6, sequence coverage above 90% was obtained. FIGURE 5. Total number of protein groups identified from triplicate runs of all enzymes. The highest number of proteins were identified with trypsin. The use of multiple enzyme digests in p cleavages at sites further away from mo incomplete digestion caused by these p combination of datasets, peptides cove UniProt) from ATP synthase subunit be for all the identified proteins; neverthele were identified. This shows that to som simply inaccessible following digestion combination with technical replicate, mu improve sequence coverage of proteins degree in protein identification. In addit enzymes would have been missed, if o ed by less-specific t either the N- or C- ded b- or y-ion series (TMT 0 ), extended b- of this peptide, 506. FIGURE 6. A) Sequence coverage achieved using different enzymes for a amino acid protein Cytochrome b-c complex subunit 2. Green represents sections of the protein that were identified and white, the sections that were not covered by any of the identified peptides. The sequence coverage increased by 7.3 %, 45.2 %, and 56.4% for trypsin, elastase and chymotrypsin respectively. Combining all datasets, a net increase of 32.8 % is obtained. B) Comparison of sequence coverage from a single enzyme digest (trypsin) to that of the combined dataset for identified membrane proteins. Dark blue bars represent coverage obtain with trypsin alone and red bars from the sum of all enzymes used. A Trypsin 87.86% 5 0 5 20 25 30 35 40 Elastase 64.90% 5 0 5 20 25 30 35 40 Chymotrypsin 60.26% 5 0 5 20 25 30 35 40 + b 2 + b 3 + b 4 + 5 0 5 All 3 enzymes 94.26% 20 25 30 35 40 00 300 400 500 600 700 /z B 00.00 80.00 ΣCoverage Coverage (Trypsin) Sequence Coverage 60.00 40.00 20.00 426.775.7445 525.847 400 600 0.00 2 4 6 8 0 2 4 6 8 20 Total number of identifed membrane proteins Thermo Scientific Poster Note PN63603_E 06/2S 5

The use of multiple enzyme digests in proteomic studies might enable proteolytic cleavages at sites further away from modified peptides, thereby overcoming incomplete digestion caused by these protein modifications. For example, with a combination of datasets, peptides covering almost all known modifications (present in UniProt) from ATP synthase subunit beta were identified (figure 7). This was not true for all the identified proteins; nevertheless, a reasonable number of modified peptides were identified. This shows that to some extent, some portions of the proteome are simply inaccessible following digestion with a single protease. Therefore, in combination with technical replicate, multiple proteases can be used to significantly improve sequence coverage of proteins from a proteome and increase the confidence degree in protein identification. In addition, proteins that were identified by individual enzymes would have been missed, if only this enzyme was used in this experiment. FIGURE 7. Amino acid sequence of ATP synthase subunit beta showing sections of the protein that was identified with annotated known modification (from UniProt). Acetylation is represented by A and phosphorylation by P. Conclusion The use of three different enzymes in proteomics studies enabled an average increase in total number of peptides of approximately 227.5 % and protein groups of about 68.8 % identified. The use of three different enzymes led to an average increase in protein sequence coverage of about 3 %. The use of three different enzymes improved overall confidence in protein identification The use of three different enzymes aided the study of changes in protein sequences and post-translational modifications. The high mass accuracy in both MS and MS/MS minimized false discovery rate (FDR). In spite of the increase in sequence coverage with multiple enzyme digests, the highest number of protein and peptide identification for single proteolytic digest was obtained with trypsin. References. G. Choudhary et al., JPR, 2003, 2 (), 59 67. 2. A. Gardner and G. R. Boles, Curr. Psychiatry Rev., 2005, (3): 255 27. 3. A. E. Speers and C. C. Wu, Chem Rev., 2007, 07(8):3687 374. 4. B. Rietschel et al. MCP, 2009, 8(5):029-43. 5. D. Baeumlisberger et al. Proteomics, 200, 0(2):3905-9. 8 20 Mascot is a registered product of Matrix Science Ltd. All other trademarks are the property of Thermo Fisher Scientific an its subsidiaries. This information is not intended to encourage use of these products in any manners that might infringe the intellectual property rights of others. 6 Enhancing Sequence Coverage in Proteomics Studies by Using a Combination of Proteolytic Enzymes

www.thermoscientific.com 202 Thermo Fisher Scientific Inc. All rights reserved. ISO is a trademark of the International Standards Organization. All other trademarks are the property of Thermo Fisher Scientific Inc. and its subsidiaries. This information is presented as an example of the capabilities of Thermo Fisher Scientific Inc. products. It is not intended to encourage use of these products in any manners that might infringe the intellectual property rights of others. Specifications, terms and pricing are subject to change. Not all products are available in all countries. Please consult your local sales representative for details. Thermo Fisher Scientific, San Jose, CA USA is ISO Certified. Africa-Other +27 570 840 Australia +6 3 9757 4300 Austria +43 333 50 34 0 Belgium +32 53 73 42 4 Canada + 800 530 8447 China +86 0 849 3588 Denmark +45 70 23 62 60 Europe-Other +43 333 50 34 0 Finland/Norway/Sweden +46 8 556 468 00 France +33 60 92 48 00 Germany +49 603 408 04 India +9 22 6742 9434 Italy +39 02 950 59 Japan +8 45 900 Latin America + 56 688 8700 Middle East +43 333 50 34 0 Netherlands +3 76 579 55 55 New Zealand +64 9 980 6700 Russia/CIS +43 333 50 34 0 South Africa +27 570 840 Spain +34 94 845 965 Switzerland +4 6 76 77 00 UK +44 442 233555 USA + 800 532 4752 PN63603_E 06/2S