Peptide and Protein Analysis Primary (1 ) structure of a peptide or protein is the amino acid sequence

Similar documents
Introduction to Peptide Sequencing

2. Ionization Sources 3. Mass Analyzers 4. Tandem Mass Spectrometry

Biological Mass Spectrometry. April 30, 2014

Amino Acids. Amino Acids. Fundamentals. While their name implies that amino acids are compounds that contain an NH. 3 and CO NH 3

BIOCHEMISTRY REVIEW. Overview of Biomolecules. Chapter 4 Protein Sequence

CS612 - Algorithms in Bioinformatics

Proteins are sometimes only produced in one cell type or cell compartment (brain has 15,000 expressed proteins, gut has 2,000).

AA s are the building blocks of proteins

Biomolecules: amino acids

Properties of amino acids in proteins

Gentilucci, Amino Acids, Peptides, and Proteins. Peptides and proteins are polymers of amino acids linked together by amide bonds CH 3

1. Describe the relationship of dietary protein and the health of major body systems.

Chemical Nature of the Amino Acids. Table of a-amino Acids Found in Proteins

1-To know what is protein 2-To identify Types of protein 3- To Know amino acids 4- To be differentiate between essential and nonessential amino acids

Amino acids-incorporated nanoflowers with an

LAB#23: Biochemical Evidence of Evolution Name: Period Date :

A Chemical Look at Proteins: Workhorses of the Cell

2. Which of the following amino acids is most likely to be found on the outer surface of a properly folded protein?

Page 8/6: The cell. Where to start: Proteins (control a cell) (start/end products)

Molecular Biology. general transfer: occurs normally in cells. special transfer: occurs only in the laboratory in specific conditions.

Chapter 3: Amino Acids and Peptides

PROTEINS. Amino acids are the building blocks of proteins. Acid L-form * * Lecture 6 Macromolecules #2 O = N -C -C-O.

Objective: You will be able to explain how the subcomponents of

MS/MS Scan Modes. Eötvös University, Budapest April 16, MS/MS Scan Modes. Árpád Somogyi. Product Ion Scan Select. Scan. Precursor Ion Scan Scan

9/6/2011. Amino Acids. C α. Nonpolar, aliphatic R groups

Practice Problems 3. a. What is the name of the bond formed between two amino acids? Are these bonds free to rotate?

Classification of amino acids: -

Reactions and amino acids structure & properties

PROTEINS. Building blocks, structure and function. Aim: You will have a clear picture of protein construction and their general properties

Biological systems interact, and these systems and their interactions possess complex properties. STOP at enduring understanding 4A

Lecture 3. Tandem MS & Protein Sequencing

Mass Spectrometry. Mass spectrometer MALDI-TOF ESI/MS/MS. Basic components. Ionization source Mass analyzer Detector

CHAPTER 21: Amino Acids, Proteins, & Enzymes. General, Organic, & Biological Chemistry Janice Gorzynski Smith

For questions 1-4, match the carbohydrate with its size/functional group name:

CHM333 LECTURE 6: 1/25/12 SPRING 2012 Professor Christine Hrycyna AMINO ACIDS II: CLASSIFICATION AND CHEMICAL CHARACTERISTICS OF EACH AMINO ACID:

Biomolecular Mass Spectrometry

Authors. Abstract. Introduction. Food

Lecture 4. Grouping Amino Acid 7/1/10. Proteins. Amino Acids. Where Are Proteins Located. Nonpolar Amino Acids

This exam consists of two parts. Part I is multiple choice. Each of these 25 questions is worth 2 points.

Chemistry 121 Winter 17

Moorpark College Chemistry 11 Fall Instructor: Professor Gopal. Examination # 5: Section Five May 7, Name: (print)

For questions 1-4, match the carbohydrate with its size/functional group name:

Methionine (Met or M)

AMINO ACIDS STRUCTURE, CLASSIFICATION, PROPERTIES. PRIMARY STRUCTURE OF PROTEINS

Introduction to proteins and protein structure

Copyright 2008 Pearson Education, Inc., publishing as Pearson Benjamin Cummings

AP Bio. Protiens Chapter 5 1

Section 1 Proteins and Proteomics

Biochemistry - I. Prof. S. Dasgupta Department of Chemistry Indian Institute of Technology, Kharagpur Lecture 1 Amino Acids I

The Structure and Function of Large Biological Molecules Part 4: Proteins Chapter 5

Amino Acids. Review I: Protein Structure. Amino Acids: Structures. Amino Acids (contd.) Rajan Munshi

Lipids: diverse group of hydrophobic molecules

If you like us, please share us on social media. The latest UCD Hyperlibrary newsletter is now complete, check it out.

Amino Acids. Lecture 4: Margaret A. Daugherty. Fall Swiss-prot database: How many proteins? From where?

Biomolecules Amino Acids & Protein Chemistry

Midterm 1 Last, First

LC-MS Analysis of Amino Acids on a Novel Mixed-Mode HPLC Column

Introduction to Protein Structure Collection

(30 pts.) 16. (24 pts.) 17. (20 pts.) 18. (16 pts.) 19. (5 pts.) 20. (5 pts.) TOTAL (100 points)

Level and activity of D-amino acids in mouse brain tissue and blood

Operating Instructions

Enzyme Catalytic Mechanisms. Dr. Kevin Ahern

1. (38 pts.) 2. (25 pts.) 3. (15 pts.) 4. (12 pts.) 5. (10 pts.) Bonus (12 pts.) TOTAL (100 points)

Towards a New Paradigm in Scientific Notation Patterns of Periodicity among Proteinogenic Amino Acids [Abridged Version]

(65 pts.) 27. (10 pts.) 28. (15 pts.) 29. (10 pts.) TOTAL (100 points) Moorpark College Chemistry 11 Spring Instructor: Professor Gopal

Biological Mass spectrometry in Protein Chemistry

BCMB Chapter 3 (part1)

Analysis of L- and D-Amino Acids Using UPLC Yuta Mutaguchi 1 and Toshihisa Ohshima 2*

Date: EXERCISE 4. Figure 1. Amino acid structure.

Proteins consist in whole or large part of amino acids. Simple proteins consist only of amino acids.

Biology. Lectures winter term st year of Pharmacy study

Cahn - Ingold - Prelog system. Proteins: Evolution, and Analysis Lecture 7 9/15/2009. The Fischer Convention (1) G (2) (3)

PAPER No. : 16, Bioorganic and biophysical chemistry MODULE No. : 22, Mechanism of enzyme catalyst reaction (I) Chymotrypsin

استاذ الكيمياءالحيوية

Fundamentals of Organic Chemistry CHEM 109 For Students of Health Colleges

BCHS 3304/ Exam I. September 26,

Human Biochemistry Option B

Macromolecules of Life -3 Amino Acids & Proteins

Study of Amino Acids in DDGS

BCMB 3100 Chapter 3 (part 1)

Lecture 5- DNA Arrays and Introduction to Protein Structure and Function. Light directed synthesis of oligonucleotides NVTR O O O O O BPG BPG BPG BPG

The Structure and Function of Macromolecules

Short polymer. Dehydration removes a water molecule, forming a new bond. Longer polymer (a) Dehydration reaction in the synthesis of a polymer

Cells N5 Homework book

Analysis of free amino acids in tobacco with a new LC/MS/MS. procedure. S. C. Moldoveanu R.J. Reynolds Tobacco Co.

Chemical Mechanism of Enzymes

Name. The following exam contains 44 questions, valued at 2.6 points/question. 2. Which of the following is not a principal use of proteins?

GL Science Inertsearch for LC Inertsil Applications - Acids. Data No. Column Data Title Solutes Eluent Detection Data No.

Maha AbuAjamieh. Tamara Wahbeh. Mamoon Ahram

Levels of Protein Structure:

AMINO ACIDS AND PEPTIDES Proteins/3 rd class of pharmacy

Moorpark College Chemistry 11 Fall Instructor: Professor Gopal. Examination #5: Section Five December 7, Name: (print) Section:

Application of a new capillary HPLC- ICP-MS interface to the identification of selenium-containing proteins in selenized yeast

Introduction. Basic Structural Principles PDB

CH 2 CO 2. The structures of the naturally occurring amino acids are given on page 3 of this chapter.

Fundamentals of Soft Ionization and MS Instrumentation

1 Digestion and absorption. Lecture #14 Lecturer: PhD Alexander N. Koval

Metabolomics: quantifying the phenotype

Third Exam Practice Test

Transcription:

n-suppport cyclization 2 3 3 FM R n 1) Pd(0), Et 3 i 2) piperidine R 2 2 3 3 2 R n PyBP R 2 2 3 R n F 3 2, 3, F,! anisole anisole! R n 3 R 2 R 2 49 Peptide and Protein Analysis Primary (1 ) structure of a peptide or protein is the amino acid sequence Amino acid analyzer- automated instrument to determine the amino acid content of a peptide or protein. Individual amino acids are separated by hplc, then detected by post-column derivatization peptide -orprotein [] reduce any disulfide bonds liquid chromatography Enzymatic digestion -or- 3, Δ derivatize w/ ninhydrin 3 R 2 individual amino acids Detected w/ UV-vis Different amino acids have different chromatographic mobilities (retention times) 1972 obel Prize in hemistry William tein tanford Moore 50 25

Reaction of primary amines with ninhydrin 3 R 2 Intense purple color Amino Acid Analysis hromatogram o, why is it necessary to use a post- rather than pre-column derivatization protocol? Why are there are only 17 AA s in the chromatogram? 51 Fluorescence Detection- less background, greater sensitivity, lower detection limits Absorption spectroscopy- wavelength that light absorbs, moloecules are in an electronically excited state Emission spectroscopy- the excited molecules relax by emission of a photon. Fluorescence- excitation wavelength and emission wavelength are different. Molecule will emit light at longer (lower energy) wavelength than is absorbs. 52 26

Fluorescent tags Dansyl- detected by UV or fluorescence l 3 3 3 R 2 3 3 Dansyl chloride R 2 PA (o-phthalaldehyde)- detected by fluorescence 3 R 2 R 2 highly fluorescent 53 Reversed-phase (-18) PL Trace 5 pmols amino acids w/ PA, 2 2 54 27

Attomol detection w/ laser induced fluorescence 2 2 R 2 excitation: 488 nm emission: 560 nm 10-3 milli 10-6 micro 10-9 nano 10-12 pico 10-15 fempto 10-18 atto 10-21 zepto Avagadro s number 10 23 55 ysteine vs. ystine 3 [] 3 2 2 2 [] 3 2 2 2 2 Ellman's Reagent p 8 2 2 2 2 λ max = 412 nm ε ~ 13,600 mol -1 ml -1 cm -1 56 28

Peptide and Protein equences: primary (1 ) structure- amino acid sequence -labeling with anger s reagent: anger s (2,4-dinitrofluorobenzene) reagent reacts with the -terminal amino group and has a diagnostic UV absorbance that is detected after enzymatic digestion and amino acid analysis 2 2 F 3 2 Δ nucleophilic aromatic substitution 2 2 2 enzymatic digestion -or- 3, Δ 2 2 R n 3 plus other unlabeled amino acids -terminal amino acid is specifically labeled with a unique UV chromophore 57 -terminal sequencing: arboxypeptidase- enzyme that hydrolyzed amide bonds of a peptide or protein starting from the -termial end (exopeptidase) R 1 arboxypeptidase 3 3 R 2 Zn 2, 2 R 2 peptide has a new -terminal AA 3 derivatize and identify by PL ydrolyze peptide with hydazine ( 2-2 ) 3 R 2 2 2 3 -terminal AA is still an amino acid 2 R n 2 All other AA's are converted to the hydrazides 58 29

Edman Degradation: chemical method for the sequential cleavage and identification of the amino acids of a peptide, one at a time starting from the -terminus. Reagent: Ph-==, phenylisothiocyanate Ph p 9.0 2 2 then Ph 2 2 -phenylthiohydantoin:! separated by PL,! detected by UV-vis! -1 peptide with a new! -terminal amino acid! (repeat degradation cycle)! 59 Peptide sequencing by Edman degradation: Monitor the appearance of -phenylthiohydantoin over time to get the peptide sequence. Good for peptides up to ~ 25 amino acids long. Longer peptides and proteins must be cut into smaller fragments before Edman sequencing Fluorescent Edman sequencing reagent 2 2 Fluorescein (a common fluorescent dye) Fluorescein Isothiocyanate (a fluorescent Edman reagent) 60 30

Enzymatic and chemical cleavage of peptides and proteins at defined sites 3 Enzymatic trypsin: cleaves at the -terminal side of basic residues, Arg, Lys but not is 2 trypsin 2 3 3 2 3 3 chymotrypsin: cleaves at the -terminal side of aromatic residues Phe, Tyr, Trp 2 chymotrypsin 2 3 3 3 2 61 thermolysin: cleaves at the -terminal side of hydrophobic residues Phe, Trp, Leu 3 R 2 thermolysin 3 3 3 2 2 hymotrypsin cleavage products Trypsin cleavage products Tyr Arg Asp-Asn-Gln Leu-Lys Gly-Gly-Phe Ile-Arg-Pro-Lys Leu-Arg-Arg-Ile-Arg-Pro-Lys-Leu-Lys-Trp Tyr-Gly-Gly-Phe-Leu-Arg Trp-Asp-Asn-Gln Trypsin: Tyr-Gly-Gly-Phe-Leu-Arg Arg Ile-Arg-Pro-Lys Leu-Lys Trp-Asp-Asn-Gln hymotrypsin: Tyr Gly-Gly-Phe Leu-Arg-Arg-Ile-Arg-Pro-Lys-Leu-Lys-Trp Asp-Asn-Gln 62 31

ther ommonly Used Protein Digest Reagents Glu- cleaves to the -terminal side of Glu residues (cleavage at Asp is 100-300 times slower) Asp- cleaves to the -terminal side of Asp residues and cysteic acid Lys- cleaves to the -terminal side of Lys residues Lys- cleaves to the -terminal side of Lys residues Arg- cleaves to the -terminal side of Arg residues on-specific proteases: pepsin, proteinase K, subtilysin 63 hemical cleavage of peptides and proteins at defined sites yanogen bromide (Br-): cleaves to the -terminal side of methionine residues 3 2 Br- 3 2 3 3-3 3 2 2 3 R 1 2 3 3 2 2 3 2 -terminal homo-serine 64 32

Lawton s Reagent: cleavage at -terminal side of cysteine residues 2Ph Br Lawton's Reagent 3 3 2 Lawton's Reagent 3 Ph 2 R 3 _ 2 2 Ph 2 3 Br 2 2 Ph 2 3 2 2 3 2 3 3 3 2 ys is converted to er 65 EPIDERMAL GRWT FATR (EGF)! 2 -A1 ER2 TYR3 PR4 GLY5 Y6 PR7 ER8 ER9 TYR10! AP11 GLY12 TYR13 Y14 LEU15 A16 GLY17 GLY18 VAL19! Y20 MET21 I22 ILE23 GLU24 ER25 LEU26 AP27 ER28! TYR29 TR30 Y31 A32 Y33 VAL34 ILE35 GLY36 TYR37! ER38 GLY39 AP40 ARG41 Y42 GL43 TR44 ARG45 AP46! LEU47 ARG48 TRP49 TRP50 GLU51 LEU52 ARG53-2!! Trypsin hymotrypsin yanogen Bromide Disulfides bridges at:!ys6 - ys20! ys14 - ys31! ys33 - ys42!!!!. ohen et al. J. Biol. hem. 1972, 247, 5928-5934!!!!! 1972, 247, 7612-7621! 1973, 248, 7669-7672! 66 33

Peptide sequencing by tandem mass spectrometry Ionization: IM (secondary ion mass spectrometry) Time-of-flight (TF) mass spectrometer Methods to get large, polar molecules into the gas phase for M analysis FAB: Fast Atom Bombardment MALDI: Matrix-Assisted Laser Desorption Ionization EI: Electrospray Ionization Mass spectrometry gives mass/charge (m/z) ratio Introduction to Proteomics: Tools for the ew Biology, Liebler, D.., umana Press: 2002 67 Mass spectrometry is a gas phase technique. Peptides (and proteins) are charged, polar, high molecular weight molecules (ions). ow can peptides and proteins be coaxed into the gas phase? Electrospray ionization (EI): analyte is introduced into the mass spectrometer as an aerosol. liquid chromatography or capillary electrophoresis (separate the analytes)! oulombic fission to the mass analyzer - 68 34

MALDI ionization (matrix-assisted laser desorption): analyte is co-crystallized with an organic molecule that has an intense UV absorption. A laser that is tuned to the absorption of the matrix, is pulsed at the MALDI matrix and energy is indirectly transferred to the analyte. to the mass analyzer 2002 obel Prize in hemistry John Fenn (EI) Koichi Tanaka (MALDI) Laser pulse 3 2 3 inapinic acid 2 α-cyano-4-hydroxycinnamic acid (A) 2 2,5-dihydroxybenzoic acid (DB) 69 Mass pectrometry (M): measures the mass to charge ratio (m/z) Dalton (Da) or mass unit (u) = units for measuring molecular masses. ne Da. = 1/12 the mass of the 12 atom Monoisotopic mass sum of the exact masses of the most abundant isotope of each element in a molecule Average mass sum of the averaged masses of each element in a molecules, weighted according to isotopic abundance. ominal mass mass calculated using the integer mass of the most abundant isotope for each element (=1, =12, =16, =14, etc.) Isotope Mass atural Abundance 1 1.0078 99.99% 2 2.0141 0.015 12 12 98.89 13 13.0034 1.11 14 14.0031 99.64 15 15.0001 0.36 16 15.9949 99.76 17 16.9991 0.04 18 17.9992 0.2 Isotope Mass atural Abundance 31 P 30.9737 100 32 31.9721 95 33 32.9715 0.76 34 33.9679 4.22 36 35.9671 0.02 70 35

Glu-Gly-Val-Asn-Asp-Asn-Glu-Glu-Gly-Phe-Phe-er-Ala-Arg (EGVDEEGFFAR) 66 95 19 26 12 661 95 14 19 16 26 Monoisotopic Mass 1569.66956 (100%) Average Mass 1570.5722 12 65 13 1 1 95 14 19 16 26, etc. (80.5%) 12 64 13 2 1 95 14 19 16 26, etc. (37.3%) 12 63 13 3 1 95 14 19 16 26, etc. (12.7%) 12 63 13 4 1 95 14 19 16 26, etc. (3.5%) lide adapted from lecture material developed by A. Burlingame,. Guan, and M. Baldwin (UF) entitled Mass pectrometry and Proteomics 1569 1570 1571 1572 1573 1574 1575 1576 m/z http://www-personal.umich.edu/~junhuay/pattern.htm 71 As the number of atoms in the molecule increases, the pattern of masses due to the presence of isotopes will change 1295 2095 2465 3655 3660 5730 1000 2000 3000 m/z 4000 5000 6000 lide adapted from lecture material developed by A. Burlingame,. Guan, and M. Baldwin (UF) entitled Mass pectrometry and Proteomics 72 36

What does the isotopic distributions tell us? 100 90 524.3 100 90 262.6 80 80 70 70 Relative Abundance 60 50 40 30 20 525.3 Relative Abundance 60 50 40 30 20 263.1 10 526.2 10 263.6 0 520 521 522 523 524 525 526 527 528 529 m/z 0 258 259 260 261 262 263 264 265 266 267 m/z lide adapted from lecture material developed at the University of Lund 73 Resolution and Resolving Power (RP): terms used interchangably igh resolution Low resolution 6130 6140 6150 6160 6170 m/z The smallest mass difference (ΔM) between peaks such that the valley between them is a specified fraction of the peak height Full Width alf Maximum (FWM): Width of a single peak measured at 50% peak apex. ΔM lide adapted from lecture material developed by A. Burlingame,. Guan, and M. Baldwin (UF) entitled Mass pectrometry and Proteomics 10% 50% ΔM 74 37

Mass Accuracy (MA) - the difference between the experimental mass (M exp ) and the theoretical value (M calc ), calculated from elemental composition. MA = M exp M calc M calc M exp = 1569.684, M calc = 1569.6696 (ppm for high resolution M) accuracy = 9.2 ppm igh resolution means better mass accuracy m/z: 784.775 784.860 784.848 784.830 M: 1569.549 1569.720 1569.695 1569.661 RP=400 (76.8 ppm) RP=800 (32.1 ppm) RP=1600 (16.2 ppm) RP=3200 (5.5 ppm) lide adapted from lecture material developed by A. Burlingame,. Guan, and M. Baldwin (UF) entitled Mass pectrometry and Proteomics 783 785 787 783 785 787 783 785 787 783 785 787 m/z m/z m/z m/z 75 Peptide Mass Fingerprinting : Proteins (or peptides) are digested in a predictable way and the masses of the resulting peptide fragments are unique enough to identify the protein. Requires a database of known sequences and search software to compare (score) the experimentally observed masses with the calculated masses in the database. m/z = 1529 ± 1 Da 478 peptide fragments from 1529.7 ± 0.1 164 mouse/human genome 1529.73 ± 0.01 25 1529.7340 ± 0.001 4 1529.7348 ± 0.0001 2 76 38

Many peptides and proteins give multiply charged ions ID: collision induced dissociation [M ] accelerated into M collision cell (e, Ar, Xe) analyze fragments to get sequence ollision of the [M] ion with the gas causes it to fragment, analysis of these fragments ions gives sequence information charge to -terminus 2 a 1 b 1 c 1 a 2 b 2 c 2 a 3 b 3 c 3 x 1 y 1 z 1 R 2 x 2 y 2 z 2 R 4 x 3 y 3 z 3 -terminus charge to R 2 -terminal fragment R 3 2 -terminal fragment 77 Peptide sequencing by tandem mass spectrometry anospray apillary Electrospray Ion ource Q1 ollision ell (Q2) Q3 to the detector elect peptide to be analyzed fragment the peptide Analyze the peptide fragments 1116.67 1247.70 1287.73 1424.85 1375.76 1574.20 1505.77 1665.89 1811.85 1849.12 elect m/z 1505.8 for Q2 2005.07 2550.52 2476.21 2719.48 Peptides fragment in a predictable manner charge to b 1 b 2 b -terminus 3 2 R 2 R 4 y 1 y 2 y 3 charge to -terminus 1000 1500 2000 2500 3000 m/z 78 39

Amino Acids orted by Mass b 1 b 2 b 3 2 R 2 R 4 y 1 y 2 y 3 average exact - -R- Glycine G 75.07 75.03 57.1 Alanine A 89.10 89.05 71.1 erine 105.09 105.04 87.1 Proline P 115.13 115.05 97.1 Valine V 117.15 117.08 99.1 Threonine T 119.12 119.06 101.1 ysteine 121.16 121.02 103.1 Isoleucine I 131.18 131.09 113.2 Leucine L 131.18 131.09 113.2 Asparagine 132.12 132.05 114.1 Aspartic Acid D 133.11 133.04 115.1 Glutamine Q 146.15 146.07 128.2 Lysine K 146.19 146.11 128.1 Glutamic Acid E 147.13 147.13 129.1 Methionine M 149.21 149.05 131.2 istidine 155.16 155.02 137.1 Phenylalanine F 165.19 165.19 147.2 Arginine 74.20 174.11 156.2 Tyrosine Y 181.19 181.07 163.2 Tryptophan W 204.23 204.09 186.2 79 ome ambiguities with M sequencing leucine (L) vs isoleucine (I): difficult to distinguish, must look at fragmentation of the sidechain 3 2 3 2 lysine (K, m/z=128.09) vs glutamine (Q, m/z = 128.06) 2 2 3 Ac 2 3 2 42 amu's 2 2 3 Ac 2 no reaction gly (G) gly (G) = 114.04 = asn ()= 114.04 ala (A) gly (G) = 128.06 = gln (Q) = 128.06 = lys (K)= 128.09 gly (G) val (V) = 156.09 = arg (R) = 156.10 ala (A) asp (D) = glu (E) gly (G) = 186.06 = trp (W) = 186.08 ser () val (V) = 186.1 = trp (W) = 186.08 80 40

2 2 2 2 3 3 3 3 3 2 3 2 2 3 3 ID 3 3 2 3 3 2 2 2 2 2 2 2 2 2 2 2 2 3 3 3 3 3 2 3 b 5 fragment: m/z = 542.1 2 -or- 3 2 3 3 3 2 3 3 3 2 2 y 6 fragment: m/z = 688.3 2 2 2 2 2 2 81 2 2 2 b 1 y 10 2 3 3 b 2 y 9 3 b 3 3 y 8 2 3 b 4 3 y 7 2 b 5 y 6 2 3 3 2 3 3 3 3 2 2 2 Glu Leu Val Ile er Leu Ile Val Glu er Lys 129 113 99 113 87 113 113 99 129 87 145 b 6 y 5 b 7 y 4 b 8 y 3 b 9 y 2 b 10 y 1 2 2 2 2 2 82 41

83 entral Dogma DA mra protein post-translational modifications genome transcriptome proteome 84 42