Double charge of 33kD peak A1 A2 B1 B2 M2+ M/z. ABRF Proteomics Research Group - Qualitative Proteomics Study Identifier Number 14146

Similar documents
Mass Spectrometry. Mass spectrometer MALDI-TOF ESI/MS/MS. Basic components. Ionization source Mass analyzer Detector

Sequence Identification And Spatial Distribution of Rat Brain Tryptic Peptides Using MALDI Mass Spectrometric Imaging

Biological Mass Spectrometry. April 30, 2014

PosterREPRINT INTRODUCTION. 2-D PAGE of Mouse Liver Samples. 2-D PAGE of E.coli Samples. Digestion / Cleanup. EXPERIMENTAL 1-D PAGE of BSA Samples

Biological Mass spectrometry in Protein Chemistry

Mass Spectrometry. - Introduction - Ion sources & sample introduction - Mass analyzers - Basics of biomolecule MS - Applications

PTM Discovery Method for Automated Identification and Sequencing of Phosphopeptides Using the Q TRAP LC/MS/MS System

Lecture 3. Tandem MS & Protein Sequencing

PosterREPRINT A NOVEL APPROACH TO MALDI-TOF-MS SAMPLE PREPARATION. Presented at ABRF 2002, Austin, Texas, USA, 9th - 12th March 2002.

Structural Characterization of Prion-like Conformational Changes of the Neuronal Isoform of Aplysia CPEB

Trypsin Mass Spectrometry Grade

TECHNICAL BULLETIN. R 2 GlcNAcβ1 4GlcNAcβ1 Asn

Improve Protein Analysis with the New, Mass Spectrometry- Compatible ProteasMAX Surfactant

Multiplex Protein Quantitation using itraq Reagents in a Gel-Based Workflow

5 Identification of Binding Partners of the Annexin A2 / P11 Complex by Chemical Cross-Linking

Supporting Information. Lysine Propionylation to Boost Proteome Sequence. Coverage and Enable a Silent SILAC Strategy for

Comparison of mass spectrometers performances

MALDI-TOF. Introduction. Schematic and Theory of MALDI

Microalbuminuric Diabetic patients N=18

The N-terminal loop of IRAK-4 death domain regulates ordered assembly of the Myddosome signalling scaffold

Extended Mass Range Triple Quadrupole for Routine Analysis of High Mass-to-charge Peptide Ions

In-Gel Tryptic Digestion Kit

Nature Methods: doi: /nmeth Supplementary Figure 1

PosterREPRINT AN AUTOMATED METHOD TO SELF-CALIBRATE AND REJECT NOISE FROM MALDI PEPTIDE MASS FINGERPRINT SPECTRA

Overview of the Expressway Cell-Free Expression Systems. Expressway Mini Cell-Free Expression System

Protein Identification and Phosphorylation Site Determination by de novo sequencing using PepFrag TM MALDI-Sequencing kit

Phosphorylation of proteins Steve Barnes Feb 19th, 2002 in some cases, proteins are found in a stable, hyperphosphorylated state, e.g.

PHOSPHOPEPTIDE ANALYSIS USING IMAC SAMPLE PREPARATION FOLLOWED BY MALDI-MS and MALDI PSD MX

MALDI Imaging Mass Spectrometry

Characterization of Disulfide Linkages in Proteins by 193 nm Ultraviolet Photodissociation (UVPD) Mass Spectrometry. Supporting Information

Types of Modifications

Agilent Protein In-Gel Tryptic Digestion Kit

Mass Spectrometry and Proteomics - Lecture 4 - Matthias Trost Newcastle University

Peptide sequencing using chemically assisted fragmentation (CAF) and Ettan MALDI-ToF Pro mass spectrometry

SUPPLEMENTAL INFORMATION

Biomolecular Mass Spectrometry

Top 10 Tips for Successful Searching ASMS 2003

Proteomics of body liquids as a source for potential methods for medical diagnostics Prof. Dr. Evgeny Nikolaev

2. Ionization Sources 3. Mass Analyzers 4. Tandem Mass Spectrometry

Nature Biotechnology: doi: /nbt Supplementary Figure 1

A Fully Automated, Bottom-up Approach for MALDI-TOF MS Based Discovery Workflows

Rapid Detection and Identification of Bacteria in Milk Products by Matrix-Assisted Laser Desorption/Ionization Tandem Mass Spectrometry (MALDI-MS/MS)

1. Sample Introduction to MS Systems:

N α -Acetylation of yeast ribosomal proteins and its effect on protein synthesis

Trypsin Digestion Mix

Probability-Based Protein Identification for Post-Translational Modifications and Amino Acid Variants Using Peptide Mass Fingerprint Data

SUNY UPSTATE MEDICAL UNIVERSITY PROTEOMICS CORE

LABORATÓRIUMI GYAKORLAT SILLABUSZ SYLLABUS OF A PRACTICAL DEMONSTRATION. financed by the program

The Detection of Allergens in Food Products with LC-MS

Tivadar Orban, Beata Jastrzebska, Sayan Gupta, Benlian Wang, Masaru Miyagi, Mark R. Chance, and Krzysztof Palczewski

Technical Note # TN-31 Redefining MALDI-TOF/TOF Performance

Supporting Information Parsimonious Charge Deconvolution for Native Mass Spectrometry

Practice Exam 2 MCBII

Supplementary Information. Top-down/bottom-up mass spectrometry workflow using dissolvable polyacrylamide gels

Nature Methods: doi: /nmeth.3177

Lab 1 - Klinisk Kemisk Diagnos9k

Proteomic Biomarker Discovery in Breast Cancer

Enhancing Sequence Coverage in Proteomics Studies by Using a Combination of Proteolytic Enzymes

Systematic analysis of protein-detergent complexes applying dynamic light scattering to optimize solutions for crystallization trials

Quantitation of Protein Phosphorylation Using Multiple Reaction Monitoring

New Instruments and Services

Application of a new capillary HPLC- ICP-MS interface to the identification of selenium-containing proteins in selenized yeast

Structural Elucidation of N-glycans Originating From Ovarian Cancer Cells Using High-Vacuum MALDI Mass Spectrometry

Tissue and Fluid Proteomics Chao-Cheng (Sam) Wang

Advances in Hybrid Mass Spectrometry

Supporting Information. Evolution of atomically precise silver clusters to superlattices

Edman Sample Preparation Part 2. Bill Henzel Genentech Inc.

MCB130 Midterm. GSI s Name:

1. to understand how proteins find their destination in prokaryotic and eukaryotic cells 2. to know how proteins are bio-recycled

Chapter 10apter 9. Chapter 10. Summary

Nature Structural & Molecular Biology: doi: /nsmb Supplementary Figure 1

PNGase F Instruction Manual

Data Independent MALDI Imaging HDMS E for Visualization and Identification of Lipids Directly from a Single Tissue Section

Taiwan. University, Hualien 970, Taiwan

Proteins: Proteomics & Protein-Protein Interactions Part I

Sequence Coverage (%) Profilin-1 P UD 2

Proteomics Grade. Protocol. Catalog # Agilent Technologies. Research Use Only. Not for use in Diagnostic Procedures. Version A, January 2010

Databehandling. 3. Mark e.g. the first fraction (1: 0-45 min, 2: min, 3; min, 4: min, 5: min, 6: min).

Nature Biotechnology: doi: /nbt Supplementary Figure 1. RNAseq expression profiling of selected glycosyltransferase genes in CHO.

MALDI-TOF analysis of whole blood: its usefulness and potential in the assessment of HbA1c levels

Supplementary Figure 1 (previous page). EM analysis of full-length GCGR. (a) Exemplary tilt pair images of the GCGR mab23 complex acquired for Random

Chapter 3. Protein Structure and Function

Development of a Human Cell-Free Expression System to Generate Stable-Isotope-Labeled Protein Standards for Quantitative Mass Spectrometry

MALDI-TOF Mass Spectrometry: A New Rapid ID Method in Clinical Microbiology

Protein sequence mapping is commonly used to

Section 1 Proteins and Proteomics

Problem-solving Test: The Mechanism of Protein Synthesis

Glycosylation analysis of blood plasma proteins

Supplementary Figure 1

Strategies for Characterizing Proteins in Proteomics Research

AB Sciex QStar XL. AIMS Instrumentation & Sample Report Documentation. chemistry

Significance and Functions of Carbohydrates. Bacterial Cell Walls

LECTURE-15. itraq Clinical Applications HANDOUT. Isobaric Tagging for Relative and Absolute quantitation (itraq) is a quantitative MS

Mass Spectrometry at the Laboratory of Food Chemistry. Edwin Bakx Laboratory of Food Chemistry Wageningen University

- 1 - Cell types Monocytes THP-1 cells Macrophages. LPS Treatment time (Hour) IL-6 level (pg/ml)

Introduction to Proteomics 1.0

Supplementary Materials for

New Developments in LC-IMS-MS Proteomic Measurements and Informatic Analyses

Application Note # ET-17 / MT-99 Characterization of the N-glycosylation Pattern of Antibodies by ESI - and MALDI mass spectrometry

Figure S6. A-J) Annotated UVPD mass spectra for top ten peptides found among the peptides identified by Byonic but not SEQUEST + Percolator.

Transcription:

Abstract The 2008 ABRF Proteomics Research Group Study offers participants the chance to participate in an anonymous study to identify qualitative differences between two protein preparations. We used a series of proteomics methods to identify the major protein components of the two samples. Analysis by SELDI MS and by SDS-PAGE both showed the presence of two major protein components in Sample A and four in Sample B. Peptide Mass Fingerprinting (PMF), using the SELDI reader, gave putative protein identification as human Advanced Glycation (or Glycosylation, see Discussion) Endproduct-specific Receptor (RAGE). MS/MS analysis corroborated this result. Sequence comparisons show that both Sample A and B contain two identical fragments of the RAGE protein, with Sample B containing two extra unique fragments. In-solution digests allowed for an increase of sequence coverage from 79.29% to 91.6%. Methods & Results SDS-PAGE ANALYSIS The first set of Samples A and B were run on 1-D SDS-PAGE. From Sample A two protein bands can be seen and from Sample B four bands are present. Their approximate masses are 70kDa, 33kDa, 26kDa, 15kDa. The two upper bands of Sample B matched the masses of the Sample A protein bands. From Sample A two protein bands can be seen and from Sample B four bands are present. Their approximate masses are 70kDa, 33kDa, 26kDa, 15kDa. The two upper bands of Sample B matched the masses of the Sample A protein bands. A1 A2 250 150 100 75 50 37 25 20 15 B1 B2 B3 B4 Figure 1. SDS-PAGE of Sample A and Sample B. Samples were diluted straight into 20ul of XT Sample Buffer (Bio-Rad), heated for 10 minutes at 80 C and run alongside Precision Plus All Blue Protein Standards (Bio-Rad), on a 4-12% Bis-Tris XT Criterion gel with MES buffer (Bio-Rad). The gel was stained with Coomassie Safe Blue Stain (Bio-Rad) and destained with water. 10 SELDI MS ANALYSIS The second set of samples were analyzed by SELDI MS, spectra are shown in Figure 2. Peaks corresponding to the masses of the bands on the SDS-PAGE are shown in Figure 2. Although MALDI typically results in singlecharged ions, one double-charged peak for the 33kD peak can be seen. The masses do not match those of the bands from the gel precisely, however this shift is commonly seen when comparing MS data to SDS-PAGE. Double charge of 33kD peak A2 A1 Intensity B4 B3 B2 B1 M2+ M/z Figure 2. SELDI MS of Sample A and Sample B

Figure 2. SELDI MS of Sample A and Sample B Samples A and B were dissolved in 20ul of water, then 1ul was spotted onto an NP20 ProteinChip array. The spot was allowed to dry, then matrix was added. The ProteinChip was read in the PBSIIc SELDI reader. Peptide Mass Fingerprinting Following in-gel digestion, peptide mass fingerprint (PMF) data was collected from the two major SDS-PAGE bands A2 and B2 using SELDI MS, see Figure 3. From the Mascot database, peptides were identified as belonging to the human RAGE protein, with 41% sequence coverage. PMF is a relatively low confidence technique requiring further validation, therefore we confirmed this identification by MS/MS analysis. 40 30 A 1526.17 2243.33 2283.46 ua 20 10 0 1185.56 1396.00 1648.98 1878.43 A 2299.42 2512.35 2127.62 2273.86 ua 15 10 5 0 1185.10 1322.73 1201.381300.07 1525.97 1877.80 2242.82 2283.06 2298.95 2273.41 2127.43 2512.18 B B 1500 2000 2500 Q5JP24_HUMAN Mass: 25586 Score: 68 Expect: 0.021 Queries matched: 5 Advanced Glycation Endproductspecific Receptor (Fragment).- Homo sapiens (Human). Number of mass values searched: 7 Number of mass values matched: 5 Sequence Coverage: 41% Matched peptides shown in Bold 1 MAAGTAVGAW VLVLSLWGAV VGAQNITARI GEPLVLKCKG APKKPPQRLE 51 WKLNTGRTEA WKVLSPQGGG PWDSVARVLP NGSLFLPAVG IQDEGIFRCQ 101 AMNRNGKETK SNYRVRVYQI PGKPEIVDSA SELTAGVPNK VVEESRRSRK 151 RPCEQEVGTC VSEGSYPAGT LSWHLDGKPL VPNEKGVSVK EQTRRHPETG 201 LFTLQSELMV TPARGGDPRP TFSCSFSPGL PRHRAL C Figure 3. SELDI PMF results from in-gel digest of bands A2 and B2. 2ul of digests from excised bands A2 and B2 were applied to a Normal Phase (NP20) ProteinChip. After allowing the digest to dry, 2x1ul of 20% CHCA matrix was applied to each spot. The samples were analyzed by the SELDI PCS 4000 instrument with the following instrument settings: Mass range 0-10kDa, focus mass 1.5kDa, laser setting 300nJ. Following data acquisition, each spectrum was internally calibrated and then the unique peptide masses were used to search the Matrix Science Mascot database. Screen shot of the Mascot search is shown above: A) SELDI spectra of PMF analysis of bands A2 and B2, B) Sequence information data, C). Probability scores for the Mascot search.

TOF MS Analysis of samples A and B, dissolved in water and spotted onto NP20 ProteinChips, as described above, were then run in MS mode on the QSTAR XL fitted with the ProteinChip Interface revealing identical spectra, suggesting that these are the same protein (protein fragment). Sample B Sample A Figure 4. TOF-MS of Sample A and Sample B Samples A and B were dissolved in 20ul of water. 1ul was then spotted onto an NP20 ProteinChip array, the spot was allowed to dry, and matrix was then added. The ProteinChip was read in the QSTAR XL instrument fitted with a ProteinChip interface. MS/MS The in-gel trypsin digests from each band on the SDS-PAGE, Figure 1, were spotted onto NP20 ProteinChips. MS/MS analysis was performed on a QSTAR XL Quadrupole MS instrument, fitted with a ProteinChip Interface. Sample A exhibited two bands on non-reducing SDS-PAGE. Band A2 was identified as the 33,168.85 Da extracellular domain of Advanced Glycation Endproduct-specific Receptor (309 amino acids, 79.29% coverage by MS/MS), see figures 5(a) and 6. The first four amino acids for all the bands were from a plasmid for expression in E.coli (see Discussion). Band A1 showed the same tryptic fragments as the Band A2. MW of the band A1 is approximately 2-fold higher than the MW of the band A2. Most likely, the band A1 is a storage artifact of the bacterially-expressed 33.2 kda protein (spontaneous cross-linking of free cysteines under aerobic conditions). This is also seen in sample B, bands B2 and B1. Sample B exhibited four bands on non-reducing SDS-PAGE. The tryptic map of band B2 was found to be identical to the tryptic map of the band A2. Most likely, this is the same protein, i.e., the 33,168.85 Da extracellular domain of Advanced Glycation Endproduct-specific Receptor (309 amino acids, 79.29% coverage by MS/MS), as previously shown by TOF-MS, see figure 5(b).Proteins in the bands A2 and B2 have identical N-termini and identical C-termini, and show no difference in their peptide maps. When in-gel and in-solution digest data is combined, there is an increase of overall amino acid sequence coverage from 79.29% to 91.26 % and thus an increase in confidence for protein identification. Band B3 was unique for the sample B. It was identified as a 24,585.18 Da truncated extracellular domain of Advanced Glycation Endproduct-specific Receptor (225 amino acids, 71.56% coverage by MS/MS), see figure 7.

Band B4 was unique for the sample B. It was identified as a 12,524.39 Da truncated extracellular domain of Advanced Glycation Endproduct-specific Receptor (114 amino acids, 62.28% coverage by MS/MS), see figure 8. A B Figure 5. Screen shots of in-solution digest peptide analysis, 0.8-3kDa. Samples A and B were dissolved in 20ul of water and 1ul spotted onto an NP20 ProteinChip array, then the spot was allowed to dry and matrix added. The ProteinChip was read in the QSTAR XL instrument fitted with a ProteinChip interface. A) Sample A and B) Sample B. A box with a dashed line indicates that this peak is exactly the same as one sequenced in the in-gel digest and matches the theoretical digest of human RAGE. Only new ions were sequenced, i.e., those that were absent in the in-gel digest but could be detected in the in-solution digest. We believe they may bee fragments with missed trypsin cleavages. They are marked with solid rectangles.

10 20 30 40 50 60 70 GSHMAQNITA RIGEPLVLKc kgapkkppqr LEWKLNTGRT EAWKVLSPQG GGPWDSVARV LPNGSLFLPA 80 90 100 110 120 130 140 VGIQDEGIFR CQAMNRNGKE TKSNYRVRVY QIPGKPEIVD SASELTAGVP NKVGTCVSEG SYPAGTLSWH 150 160 170 180 190 200 210 LDGKPLVPNE KGVSVKEQTR RHPETGLFTL QSELMVTPAR GGDPRPTFSC SFSPGLPRhr alrtapiqpr 220 230 240 250 260 270 280 VWEPVPLEEV QLVVEPEGGA VAPGGTVTLT CEVPAQPSPQ IHWMKDGVPL PLPPSPVLIL PEIGPQDQGT 290 300 310 320 330 340 350 YSCVATHSSH GPQESRAVSI SIIEPGEEGP tagsvggsgl gtlalalgil gglgtaalli gvilwqrrqr 360 370 380 rgeerkapen qeeeeerael nqseepeage sstggp Figure 6. Peptide sequence coverage of 79.29% from band A2. Sample A was dissolved in 20ul of water and 1ul spotted onto an NP20 ProteinChip array, then the spot was allowed to dry and matrix added. The ProteinChip was read in the QSTAR XL instrument fitted with a ProteinChip interface. Sequence of bold underlined peptide is predicted from de novo sequencing. Biologically relevant amino acid sequence (AQNITA) starts from Ala5. The protein was identified in the Swiss-Prot database as Q15109, Advanced Glycosylation Endproduct-specific Receptor. Tryptic peptides identified by MS/MS with high confidence are bold red (in-gel digestion) or bold green (insolution digestion). AspN peptide identified by MS/MS with high confidence is bold pink (in-solution digestion) 10 20 30 40 50 60 70 GSHMAQNITA RIGEPLVLKC KGAPKKPPQR LEWKLNTGRT EAWKVLSPQG GGPWDSVARV LPNGSLFLPA 80 90 100 110 120 130 140 VGIQDEGIFR CQAMNRNGKE TKSNYRVRVY QIPGKPEIVD SASELTAGVP NKVGTCVSEG SYPAGTLSWH 150 160 170 180 190 200 210 LDGKPLVPNE KGVSVKEQTR RHPETGLFTL QSELMVTPAR GGDPRPTFSC SFSPGLPRHR ALRTAPIQPR 220 230 240 250 260 270 280 VWEPVPLEEV QLVVEPEGGA VAPGGTVTLT CEVPAQPSPQ IHWMKDGVPL PLPPSPVLIL PEIGPQDQGT 290 300 310 YSCVATHSSH GPQESRAVSI SIIEPGEEGP C terminus Figure 7. Peptide sequence coverage of 71.56% from band B3. Sample B was dissolved in 20ul of water and 1ul spotted onto an NP20 ProteinChip array, then the spot was allowed to dry and matrix added. The ProteinChip was read in the QSTAR XL instrument fitted with a ProteinChip interface. The first four amino acids come from an expression plasmid (see RAGE Protein note before Conclusions). Sequence of bold underlined peptide is predicted from de novo sequencing. Biologically relevant amino acid sequence (AQNITA) starts from Ala5. Protein was identified in the SwissProt database as Q15109, Advanced Glycation Endproduct-specific Receptor. Tryptic peptides identified by MS/MS with high confidence are bold red (in-gel digestion) or bold green (in-solution digestion). 10 20 30 40 50 60 70 GSHMAQNITA RIGEPLVLKC KGAPKKPPQR LEWKLNTGRT EAWKVLSPQG GGPWDSVARV LPNGSLFLPA 80 90 100 110 120 250 260 VGIQDEGIFR CQAMNRNGKE TKSNYRVRVY QIPGKPEIVD SASELTAGVP CEVPAQPSPQ IHWMKDGVPL 270 280 290 300 310 PLPPSPVLIL PEIGPQDQGT YSCVATHSSH GPQESRAVSI SIIEPGEEGP C terminus Figure 8. Peptide sequence coverage of 62.28% from band B4. Sample B was dissolved in 20ul of water and 1ul spotted onto an NP20 ProteinChip array, then the spot was allowed to dry and matrix added. The ProteinChip was read in the QSTAR XL instrument fitted with a ProteinChip interface. The first four amino acids come from an expression plasmid (see RAGE Protein note before Conclusions). Sequence of bold underlined peptide is predicted from de novo sequencing. Biologically relevant amino acid sequence (AQNITA) starts from Ala5. Protein was identified in the SwissProt database as Q15109, Advanced Glycation Endproduct-specific Receptor. Tryptic peptides identified by MS/MS with high confidence are bold red (in-gel digestion) or bold green (in-solution digestion).

DISCUSSION We suspect that the source protein, RAGE, or Receptor for Advanced Glycation Endproducts, was expressed in E.Coli with a His-tag in pet28 or similar plasmid, purified using Ni-NTA or a similar resin, then the His tag was cleaved by thrombin, leaving the tetrapeptide GSHM upstream of the RAGE coding sequence. NB: SwissProt identifies the protein Advanced Glycosylation Endproduct-specific Receptor, while the literature typically refers to this protein as Advanced Glycation Endproduct-specific Receptor. The distinction lies in whether the native protein receives sugar moieties via enzymatic reactions in the endoplasmic reticulum or Golgi apparatus (glycosylation), or from simple chemical reactions based on encountering reducing sugars (glycation). In any event, no post-translational modification would have occurred with prokaryotic expression of the protein. RAGE is a cell surface protein with a single membrane-spanning region containing a small, ~40 residue, cytosolic domain and a large extracellular region composed of 3 Ig-like domains. Because of its link to the development of diseases such as rheumatoid arthritis and diabetes, this protein is of great interest to the medical field. It is proposed that the interaction of RAGE with its ligands results in pro-inflammatory gene activation. srage, a soluble form of the receptor lacking the transmembrane and signaling domain, is thought to counteract the affects of the full length receptor and is being investigated as a potential therapeutic agent. CONCLUSION We performed multiple types of analysis serially, with the findings of each analysis confirming the next. Although multiple bands were seen on the gel, with corresponding multiple peaks on the SELDI spectra, we consider that the actual number of proteins identified was one, RAGE. Combinations of multiple methods in proteomics research not only allow for a wider coverage of the information to be determined (in this case combining the different methods increased the overall sequence coverage from 79.29% to 91.6%), but also lend validity to the findings of each individual method.