PosterREPRINT AN AUTOMATED METHOD TO SELF-CALIBRATE AND REJECT NOISE FROM MALDI PEPTIDE MASS FINGERPRINT SPECTRA

Similar documents
PosterREPRINT A NOVEL APPROACH TO MALDI-TOF-MS SAMPLE PREPARATION. Presented at ABRF 2002, Austin, Texas, USA, 9th - 12th March 2002.

PosterREPRINT INTRODUCTION. 2-D PAGE of Mouse Liver Samples. 2-D PAGE of E.coli Samples. Digestion / Cleanup. EXPERIMENTAL 1-D PAGE of BSA Samples

PosterReprint OVERVIEW

[ Care and Use Manual ]

Probability-Based Protein Identification for Post-Translational Modifications and Amino Acid Variants Using Peptide Mass Fingerprint Data

Mass Spectrometry. - Introduction - Ion sources & sample introduction - Mass analyzers - Basics of biomolecule MS - Applications

INTRODUCTION TO MALDI IMAGING

[application note] DIRECT TISSUE IMAGING AND CHARACTERIZATION OF PHOSPHOLIPIDS USING A MALDI SYNAPT HDMS SYSTEM

PosterREPRINT EVALUATION OF A RF-ONLY STACKED RING BASED COLLISION CELL WITH AXIAL FIELD FOR THE LC-MS-MS ANALYSIS OF THE METABOLITES OF RABEPRAZOLE

2. Ionization Sources 3. Mass Analyzers 4. Tandem Mass Spectrometry

Data Independent MALDI Imaging HDMS E for Visualization and Identification of Lipids Directly from a Single Tissue Section

Comparison of mass spectrometers performances

Currently, a crucial stage in the field of proteomics

Impurity Identification using a Quadrupole - Time of Flight Mass Spectrometer QTOF

SYNAPT G2-S High Definition MS (HDMS) System

Double charge of 33kD peak A1 A2 B1 B2 M2+ M/z. ABRF Proteomics Research Group - Qualitative Proteomics Study Identifier Number 14146

PosterREPRINT SIMULTANEOUS QUANTIFICATION OF PSYCHOTHERAPEUTIC DRUGS IN HUMAN PLASMA BY TANDEM MASS SPECTROMETRY

Unsupervised Identification of Isotope-Labeled Peptides

PHOSPHOPEPTIDE ANALYSIS USING IMAC SAMPLE PREPARATION FOLLOWED BY MALDI-MS and MALDI PSD MX

Biological Mass Spectrometry. April 30, 2014

How to Use TOF and Q-TOF Mass Spectrometers

PTM Discovery Method for Automated Identification and Sequencing of Phosphopeptides Using the Q TRAP LC/MS/MS System

Multiplex Protein Quantitation using itraq Reagents in a Gel-Based Workflow

Technical Note # TN-31 Redefining MALDI-TOF/TOF Performance

A NOVEL METHOD OF M/Z DRIFT CORRECTION FOR OA-TOF MASS SPECTROMETERS BASED ON CONSTRUCTION OF LIBRARIES OF MATRIX COMPONENTS.

ENHANCED IDENTIFICATION OF 2D-GEL ISOLATED PROTEINS FROM ESCHERICHIA COLI USING PSD MX TM

MALDI-TOF. Introduction. Schematic and Theory of MALDI

AB Sciex QStar XL. AIMS Instrumentation & Sample Report Documentation. chemistry

CHARACTERIZATION AND DETECTION OF OLIVE OIL ADULTERATIONS USING CHEMOMETRICS

Chemical Analysis Business Operations Waters Corporation Milford MA

Analysis of Peptides via Capillary HPLC and Fraction Collection Directly onto a MALDI Plate for Off-line Analysis by MALDI-TOF

Peptide sequencing using chemically assisted fragmentation (CAF) and Ettan MALDI-ToF Pro mass spectrometry

Bioanalytical Quantitation of Biotherapeutics Using Intact Protein vs. Proteolytic Peptides by LC-HR/AM on a Q Exactive MS

Operating Instructions

Methods in Mass Spectrometry. Dr. Noam Tal Laboratory of Mass Spectrometry School of Chemistry, Tel Aviv University

LABORATÓRIUMI GYAKORLAT SILLABUSZ SYLLABUS OF A PRACTICAL DEMONSTRATION. financed by the program

[ APPLICATION NOTE ] High Sensitivity Intact Monoclonal Antibody (mab) HRMS Quantification APPLICATION BENEFITS INTRODUCTION WATERS SOLUTIONS KEYWORDS

Automated Sample Preparation/Concentration of Biological Samples Prior to Analysis via MALDI-TOF Mass Spectroscopy Application Note 222

Sequence Identification And Spatial Distribution of Rat Brain Tryptic Peptides Using MALDI Mass Spectrometric Imaging

Databehandling. 3. Mark e.g. the first fraction (1: 0-45 min, 2: min, 3; min, 4: min, 5: min, 6: min).

A Fully Automated, Bottom-up Approach for MALDI-TOF MS Based Discovery Workflows

Ultra Performance Liquid Chromatography Coupled to Orthogonal Quadrupole TOF MS(MS) for Metabolite Identification

Improve Protein Analysis with the New, Mass Spectrometry- Compatible ProteasMAX Surfactant

Ion Source. Mass Analyzer. Detector. intensity. mass/charge

1. Sample Introduction to MS Systems:

Small Molecule Drug Imaging of Mouse Tissue by MALDI-TOF/TOF Mass Spectrometry and FTMS

Mass Spectrometry. Mass spectrometer MALDI-TOF ESI/MS/MS. Basic components. Ionization source Mass analyzer Detector

ApplicationNOTE EXACT MASS MEASUREMENT OF ACTIVE COMPONENTS OF TRADITIONAL HERBAL MEDICINES BY ORTHOGONAL ACCELERATION TIME-OF-FLIGHT.

Electronic Supplementary Information

Matrix Assisted Laser Desorption Ionization Time-of-flight Mass Spectrometry

Solving practical problems. Maria Kuhtinskaja

Lecture 3. Tandem MS & Protein Sequencing

Quantitative Analysis of Drugs of Abuse in Urine using UHPLC Coupled to Accurate Mass AxION 2 TOF Mass Spectrometer

5 Identification of Binding Partners of the Annexin A2 / P11 Complex by Chemical Cross-Linking

High-Throughput Analysis of Oligonucleotides using Automated Electrospray Ionization Mass Spectrometry

Nature Methods: doi: /nmeth.3177

Quadrupole and Ion Trap Mass Analysers and an introduction to Resolution

Biological Mass spectrometry in Protein Chemistry

Characterization of Noncovalent Complexes of Polyelectrolytes by Mass Spectrometry

Primary Structure Analysis. Automated Evaluation. LC-MS Data Sets

Supporting information for: Memory Efficient. Principal Component Analysis for the Dimensionality. Reduction of Large Mass Spectrometry Imaging

PosterREPRINT RAPID, SELECTIVE SCREENING OF URINE SAMPLES FOR GLUCURONIDES BY LC/MS/MS INTRODUCTION ABSTRACT

Analysis of Testosterone, Androstenedione, and Dehydroepiandrosterone Sulfate in Serum for Clinical Research

Introduction to Proteomics 1.0

Bruker Daltonics. autoflex III smartbeam. The Standard in MALDI-TOF Performance MALDI-TOF/TOF. think forward

Mass Spectrometry. Actual Instrumentation

Characterization of Disulfide Linkages in Proteins by 193 nm Ultraviolet Photodissociation (UVPD) Mass Spectrometry. Supporting Information

[ APPLICATION NOTE ] APPLICATION BENEFITS INTRODUCTION WATERS SOLUTIONS KEYWORDS

ASMS Analysis of a cyanobacterial bloom by Matrix-Assisted Laser Desorption/Ionisation Time of Flight Mass Spectrometry. 3rd - 6th June 2002

Supplementary Information. Top-down/bottom-up mass spectrometry workflow using dissolvable polyacrylamide gels

Advancing your Forensic Toxicology Analyses; Adopting the Latest in Mass Spectrometry Innovations

Mass spectra of peptides and proteins - and LC analysis of proteomes Stephen Barnes, PhD

Application of a new capillary HPLC- ICP-MS interface to the identification of selenium-containing proteins in selenized yeast

Agilent Protein In-Gel Tryptic Digestion Kit

Accurate determination of protein methionine oxidation by stable isotope labeling and LC-MS analysis

The Analysis of Proteomic Spectra from Serum Samples. Keith Baggerly Biostatistics & Applied Mathematics MD Anderson Cancer Center

Top 10 Tips for Successful Searching ASMS 2003

Learning Objectives. Overview of topics to be discussed 10/25/2013 HIGH RESOLUTION MASS SPECTROMETRY (HRMS) IN DISCOVERY PROTEOMICS

Automation of Matrix-Assisted Laser Desorption/ Ionization Mass Spectrometry Using Fuzzy Logic Feedback Control

Protein Identification and Phosphorylation Site Determination by de novo sequencing using PepFrag TM MALDI-Sequencing kit

Advances in Hybrid Mass Spectrometry

Moving from targeted towards non-targeted approaches

Mass Spectrometry at the Laboratory of Food Chemistry. Edwin Bakx Laboratory of Food Chemistry Wageningen University

The Detection of Allergens in Food Products with LC-MS

Systematic analysis of protein-detergent complexes applying dynamic light scattering to optimize solutions for crystallization trials

Characterization of an Unknown Compound Using the LTQ Orbitrap

Lab 1 - Klinisk Kemisk Diagnos9k

Ultra High Definition Optimizing all Analytical Dimensions

Quantitative Analysis of -Hydroxybutyrate in Hair Using Target Analyte Finding Processing of Comprehensive GC-HRT Data

INTRODUCTION CH 3 CH CH 3 3. C 37 H 48 N 6 O 5 S 2, molecular weight Figure 1. The Xevo QTof MS System.

Supporting information

for the Identification of Phosphorylated Peptides

NIH Public Access Author Manuscript J Proteome Res. Author manuscript; available in PMC 2014 July 05.

Rapid, Simple Impurity Characterization with the Xevo TQ Mass Spectrometer

The Analysis of Allergens in Raw and Roasted Peanuts Using nanoacquity UPLC and Xevo Q-Tof MS

Fundamental Studies of Matrix-Assisted Laser Desorption/Ionization, Using Time-of-Flight Mass Spectrometry to Identify Biological Molecules

Proteomics of body liquids as a source for potential methods for medical diagnostics Prof. Dr. Evgeny Nikolaev

John Haselden 1, Gordon Dear 1, Jennifer H. Granger 2, and Robert S. Plumb 2. 1GlaxoSmithKline, Ware, UK; 2 Waters Corporation, Milford, MA, USA

ABSTRACT. Dr. Catherine Fenselau, Department of Chemistry and Biochemistry. Presented in this work is a novel bottom up proteomics approach to

CAMAG TLC-MS INTERFACE

Transcription:

Overview AN AUTOMATED METHOD TO SELF-CALIBRATE AND REJECT NOISE FROM MALDI PEPTIDE MASS FINGERPRINT SPECTRA Jeffery M Brown, Neil Swainston, Dominic O. Gostick, Keith Richardson, Richard Denny, Steven Leicester and Phillip Young Purpose Investigation of a new algorithm for self calibration and determination of noise levels of MALDI-TOF peptide mass fingerprint spectra. Methods The algorithm which uses the known mass sufficiency function of peptides is applied to data from several standard protein digests. Waters Corporation, Floats Road, Wythenshawe, Manchester, M23 9LZ United Kingdom Presented at ASMS 2002, Orlando, Florida, 3 rd -6 th June, 2002 may be submitted to the data base search which may "confuse" the search algorithm resulting in random incorrect protein matches. We have developed a new algorithm based on the mass sufficiency function of peptides that rejects noise from PMF spectra. Furthermore, the mass sufficiency function may be used to "self calibrate" the peptide masses of MALDI PMF spectra without the use of an internal reference. Improvements in mass measurement accuracy and noise level determination are measured using a new data base search engine. Results Mass measurement accuracy for data acquired without internal calibration improved from 28.9 ppm RMS to 13.1 ppm RMS. Optimal noise level determination provides more confident data base identifications for low signal:noise data and minor components of mixtures. Introduction MALDI peptide mass fingerprinting (PMF) is an established method for identification of proteins extracted from 2D electrophoresis gels. Successful identification by matching of observed peptides against those theoretically derived from protein data banks depends on the mass measurement accuracy and signal to noise of the peak lists considered. Usually an intensity threshold is applied to the spectral data to remove noise peaks from the real peptide peaks. If the threshold is set too high, small components of peptide mixtures are often ignored and remain unidentified. If the threshold is too low a large number of noise peaks Using this new algorithm peptide peak lists are optimally filtered for noise and the differences between the theoretical and matched peptide masses are corrected to better than 20 ppm RMS without using internal references. Methods and Instrumentation All data were acquired automatically using a Micromass M@LDI-R time of flight mass spectrometer in reflectron mode operating at approximately 12,000 (FWHM) resolution between m/z 1000 and m/z 3000. Data were initially near point lock mass corrected using a single monoisotopic tryptic peptide peak (MH+ 1618.844, VLGIDGGEGKEELFR) acquired from an alcohol dehydrogenase digest loaded near to each sample well. A variety of protein standards were separated by 1D SDS-PAGE (BioRad) and the resulting bands were digested by porcine trypsin (Promega). 1 micro-liter of each solution (approximately 100fmole) were mixed on the sample plate with 1 micro-liter of matrix alpha-cyano-4- hydroxycinnamic acid (concentration of 10 g/l in 50:50 water acetonitrile).

Two mixtures were also loaded - 100 fmole:10 fmole mix of BSA and Beta-galactosidase and a 100 fmole:100 fmole mix of Phosphorylase B and Beta Galactosidase.10 of 10 laser shot spectra were acquired and summed from each sample well ("strong data") and the weakest spectra of the 10 spectra were also recorded ("weak data"). Each spectrum was automatically post processed and data base searched using ProteinLynx Global Server software (Micromass, PLGS 2.0). This software smoothes, background subtracts, centroids, de-isotopes, lock mass corrects, creates peak lists and searches against protein digest databanks. Figure 1. Mass Sufficiency and Mass Scale Zones for Typical and Non-typical Peptides for the Nominal Peptide Masses 1000 Da and 1001 Da The self calibration algorithm determined the optimal noise threshold and calibration factor. These parameters were applied to the data prior to data base searching. Description of Self Calibration Algorithm Peptide masses have been previously shown to have well defined fractional mass values and occupy relatively narrow distribution zones on the mass scale 1. The fractional mass sufficiency (sufficiency) may be calculated from equation (1) where mass is the nominal value of the peptide mass and suff_fact is the mass sufficiency factor (previously reported as 0.00048). sufficiency = suff_fact. mass (eq1) The mass distribution zone or window (mass_window) of 95% of all amino acid combinations may be calculated from equation (2). mass_window = 0.19 + mass/10000 (eq2) Figure 1 depicts the zones of all typical and nontypical peptides for the nominal peptide masses of 1000 Da and 1001 Da. The function deviation(gain) shown in equation (3) represents the overall mass deviation between the expected and observed mass sufficiency and is minimised by varying the spectrum gain factor (gain) over a +/-200 ppm window, massn is the mass value of the n'th peak of the uncorrected mass spectrum, (Int is a rounding function to the nearest integer). deviation(gain) = Σ{Int [mass n.(1-sufficiency).gain] - mass n.(1-sufficiency).gain} 2 (eq3) Noise Rejection Threshold It is assumed that peaks occurring in the zone of non-typical peptide masses are noise and that the noise intensity level extends into the zone of typical peptides. The ratio of the sum of the intensities of non-typical peptides to the sum of the intensities of typical peptides may be used as a measure of "data quality" or signal:noise. The % intensity threshold (or cut off threshold) may be optimised for each individual spectrum by fixing the required "data quality" at a nominal value of 10%.

Results Figures 2a to 8b show the "strong" and "weak" spectra for the 7 tryptic digest samples. For each spectrum both the near point correction and self calibration algorithm mass measurement accuracies are reported in Table 1. The RMS mass accuracy of each spectrum was determined by submitting each peak list against SwissProt (40.6) database and calculating the RMS errors for the differences between the calibrated data and the significant matching peptides for the correct protein match. Searches were performed with a 100 ppm allowable mass error. Fixed carbomidomethyl cysteines and variable oxidised methionine modifications were considered. Figure 4. Ovalbumin a) 100 shot spectra summed, b) 10 shot spectra Table 1 also shows the noise threshold for each spectrum as determined by the algorithm. Figure 5. Chicken Lysozyme a) 100 shot spectra summed, b) 10 shot spectra Figure 2. Glycogen Phosphorylase a) 100 shot spectra summed, b) 10 shot spectra Figure 6. Carbonic Anhydrase a) 100 shot spectra summed, b) 10 shot spectra Figure 3. Beta Galactosidase a) 100 shot spectra summed, b) 10 shot spectra

Figure 7. Serum Albumin and Beta Galactosidase (10:1 mix) a) 100 shot spectra summed, b) 10 shot spectra Figure 8. Phosphorylase B and Beta Galactosidase (1:1 mix) a) 100 shot spectra summed, b) 10 shot spectra Thresholding Method 40 most intense Peaks Peaks used above used above threshold Figure 5% threshold peaks used determined by algorithm Glycogen Phosphorylase as top hit 2 (b) NO YES YES from weak spectrum (Serum Albumin) as top hit 7 (a) YES NO YES Major component of mixture (Beta Galactosidase) Identified as 2nd hit 7 (a) NO NO YES Minor component of mixture Table 2. Success of data base searching using various noise threshold methods Figures 9 to 13 show PGS 2.0 database search results for 2 sets of data. The weak spectrum Figure 2 (b) Glycogen Phosphorylase was identified correctly when the noise threshold applied was determined by the algorithm (at 29%) or if the 40 most intense peaks were searched. (searching NRDB, +/-50ppm tolerance). The sample was not identified when using a 5% noise threshold. For the stronger spectrum Figure 7(a) (mix of 10:1) BSA and Beta-Galatosidase, both components were identified correctly (1st and 2nd hits) when the applied noise threshold was determined by the algorithm at 0.25%, using peaks above a 5% noise threshold or 40 most intense peaks failed to give the correct results. (searching SwissProt, +/-50ppm tolerance). Mass Accuracy (RMS ppm matching peptides) Near Point Lock Mass Calibration Self Calibration Predicted Noise Threshold % Stronger (100 shot) data Spectrum Glycogen Phosphorylase 2 (a) 46 17 3 Beta Galactosidase 3 (a) 39 15 3 Ovalbumin 4 (a) 47 14 1 Chicken Lysozyme 5 (a) 15 20 0.5 Carbonic Anhydrase 6 (a) 30 7 0.5 Serum Albumin+Beta Galactosidase 7 (a) 18 8 0.25 Glycogen Phosphorylase+Beta Galactosidase 8 (a) 8 11 1 Mean 29.0 13.1 Weaker (10 shot) data Glycogen Phosphorylase 2 (b) 57 32 29 Beta Galactosidase 3 (b) 47 14 14 Ovalbumin 4 (b) 36 29 6 Chicken Lysozyme 5 (b) 11 16 1.5 Carbonic Anhydrase 6 (b) 18 12 2 Serum Albumin+Beta Galactosidase 7 (b) 26 24 1 Glycogen Phosphorylase+Beta Galactosidase 8 (b) 29 28 10 Mean 32.0 22.1 Table 1. Mass measurement accuracy, near point calibration and self calibration algorithm and predicted noise thresholds Figure 9. Weak data from Figure 2b - 5% noise threshold - Glycogen Phosphorylase - 2nd hit (1st hit incorrect random match)

PosterREPRINT Figure 10. Weak data from figure 2b - 29% noise Figure 13. Binary Mixture 0.25% threshold (as threshold (determined by algorithm) matching determined by algorithm) - Serum Albumin top hit, peptides from Glycogen phosphorylase as correct matching peptides from correct 2nd hit - Beta top hit Galactosidase shown Conclusion Improvements in mass measurement accuracy using an axial MALDI-TOF mass spectrometer have been observed by the application of a new self calibration algorithm to peptide mass fingerprint spectra. Mass measurement accuracy improved from 29.0ppm RMS (external calibration) to 13.1ppm Figure 11. Binary Mixture 5% noise threshold matching peptides from Serum Albumin top hit, 2nd hit incorrect RMS for higher signal:noise data. For lower signal:noise data the mass accuracy improved from 32.0ppm RMS to 22.1ppm RMS. This improvement enables data base searching using a mass error window of +/- 50ppm rather than +/100ppm thereby improving specificity and reducing the probability of erroneously matching random noise peaks. The algorithm also predicts the optimum noise threshold for each spectrum. Peaks above this threshold were used to identify proteins from poor signal:noise data. Low level components of a binary mixture were also identified with greater Figure 12. Binary Mixture 0.25% noise threshold (determined by algorithm) - matching peptides Serum Albumin top hit, (2nd hit Beta Galactosidase -see Figure 13) confidence than if a fixed threshold or 40 most intense peaks had been submitted to the database. References [1] Mann, Possible peptide masses. Proceedings of the 43rd ASMS, Atlanta, 1995

Author to whom all correspondence should be addressed: Jeff Brown Waters Corporation (Micromass UK Limited) Floats Road, Wythenshawe Manchester, M23 9LZ Tel: + 44 (0) 161 946 2400 Fax: + 44 (0) 161 946 2480 e-mail: jeff.brown@micromass.co.uk WATERS CORPORATION 34 Maple St. Milford, MA 01757 U.S.A. T: 508 478 2000 F: 508 872 1990 www.waters.com Made in the United Kingdom 2002 Waters Corporation October 2002 / WMP218 For research use only. Not for use in diagnostic procedures. Certificate No: 951387