Bovine factor X1 (Stuart factor): Amino-acid sequence of heavy chain (blood coagulation/homology to serine protease)

Similar documents
BIOCHEMISTRY REVIEW. Overview of Biomolecules. Chapter 4 Protein Sequence

specificity." Whereas trypsin acts almost exclusively on peptide bonds properties.1 These include molecular weights (approximately 25,000 and 24,000,

2. Which of the following amino acids is most likely to be found on the outer surface of a properly folded protein?

LAB#23: Biochemical Evidence of Evolution Name: Period Date :

AA s are the building blocks of proteins

Molecular Biology. general transfer: occurs normally in cells. special transfer: occurs only in the laboratory in specific conditions.

PAPER No. : 16, Bioorganic and biophysical chemistry MODULE No. : 22, Mechanism of enzyme catalyst reaction (I) Chymotrypsin

CHAPTER 21: Amino Acids, Proteins, & Enzymes. General, Organic, & Biological Chemistry Janice Gorzynski Smith

Objective: You will be able to explain how the subcomponents of

Page 8/6: The cell. Where to start: Proteins (control a cell) (start/end products)

CS612 - Algorithms in Bioinformatics

Gentilucci, Amino Acids, Peptides, and Proteins. Peptides and proteins are polymers of amino acids linked together by amide bonds CH 3

Biological systems interact, and these systems and their interactions possess complex properties. STOP at enduring understanding 4A

1. Describe the relationship of dietary protein and the health of major body systems.

Chemical Nature of the Amino Acids. Table of a-amino Acids Found in Proteins

Introduction to Peptide Sequencing

Chemistry 121 Winter 17

This exam consists of two parts. Part I is multiple choice. Each of these 25 questions is worth 2 points.

Chemical Mechanism of Enzymes

Biomolecules: amino acids

Amino acids. Side chain. -Carbon atom. Carboxyl group. Amino group

Reactions and amino acids structure & properties

Classification of amino acids: -

Levels of Protein Structure:

Lipids: diverse group of hydrophobic molecules

1-To know what is protein 2-To identify Types of protein 3- To Know amino acids 4- To be differentiate between essential and nonessential amino acids

Amino Acids. Amino Acids. Fundamentals. While their name implies that amino acids are compounds that contain an NH. 3 and CO NH 3

Proteins are sometimes only produced in one cell type or cell compartment (brain has 15,000 expressed proteins, gut has 2,000).

Amino Acids. Review I: Protein Structure. Amino Acids: Structures. Amino Acids (contd.) Rajan Munshi

(30 pts.) 16. (24 pts.) 17. (20 pts.) 18. (16 pts.) 19. (5 pts.) 20. (5 pts.) TOTAL (100 points)

Properties of amino acids in proteins

Four melanocyte-stimulating hormones have the following amino acid sequences:

If you like us, please share us on social media. The latest UCD Hyperlibrary newsletter is now complete, check it out.

BIO th September, 1997

Amino Acid Sequence of Chicken Heart Cytochrome c

Cahn - Ingold - Prelog system. Proteins: Evolution, and Analysis Lecture 7 9/15/2009. The Fischer Convention (1) G (2) (3)

Moorpark College Chemistry 11 Fall Instructor: Professor Gopal. Examination # 5: Section Five May 7, Name: (print)

Enzyme Catalytic Mechanisms. Dr. Kevin Ahern

Amino acids-incorporated nanoflowers with an

The Structure and Function of Large Biological Molecules Part 4: Proteins Chapter 5

The Structure and Function of Macromolecules

Amino acid metabolism

Biomolecules Amino Acids & Protein Chemistry

Copyright 2008 Pearson Education, Inc., publishing as Pearson Benjamin Cummings

Short polymer. Dehydration removes a water molecule, forming a new bond. Longer polymer (a) Dehydration reaction in the synthesis of a polymer

Methionine (Met or M)

9/6/2011. Amino Acids. C α. Nonpolar, aliphatic R groups

Chapter 3: Amino Acids and Peptides

A Chemical Look at Proteins: Workhorses of the Cell

BCHS 3304/ Exam I. September 26,

Studies on the Primary Structure of the Influenza Virus Hemagglutinin

Four Classes of Biological Macromolecules. Biological Macromolecules. Lipids

SIMPLE BASIC METABOLISM

1. (38 pts.) 2. (25 pts.) 3. (15 pts.) 4. (12 pts.) 5. (10 pts.) Bonus (12 pts.) TOTAL (100 points)

Macromolecules of Life -3 Amino Acids & Proteins

PROTEINS. Building blocks, structure and function. Aim: You will have a clear picture of protein construction and their general properties

Chapter 10. Regulatory Strategy

Third Exam Practice Test

of Androctonus australis Hector

Arginine side chain interactions and the role of arginine as a mobile charge carrier in voltage sensitive ion channels. Supplementary Information

AP Bio. Protiens Chapter 5 1

Nature Methods: doi: /nmeth Supplementary Figure 1

The Basics: A general review of molecular biology:

Quantitative LC-MS/MS Analysis of Glucagon. Veniamin Lapko, Ph.D June 21, 2011

PROTEIN METABOLISM DEPT OF BIOCHEMISTRY ACS MEDICAL COLLEGE CHENNAI - 77

Can we learn something new about peptide separations after 40 years of RP and HILIC chromatography? Martin Gilar April 12, MASSEP 2016

For questions 1-4, match the carbohydrate with its size/functional group name:

Phenylketonuria (PKU) Structure of Phenylalanine Hydroxylase. Biol 405 Molecular Medicine

Cells. Variation and Function of Cells

AMINO ACIDS STRUCTURE, CLASSIFICATION, PROPERTIES. PRIMARY STRUCTURE OF PROTEINS

2. Ionization Sources 3. Mass Analyzers 4. Tandem Mass Spectrometry

Biochemistry 2 Recita0on Amino Acid Metabolism

paper and beads don t fall off. Then, place the beads in the following order on the pipe cleaner:

Protein Modification Overview DEFINITION The modification of selected residues in a protein and not as a component of synthesis

Biology. Lectures winter term st year of Pharmacy study

Computer-generated Models of Blood Coagulation Factor Xa, Factor IXa, and Thrombin Based upon Structural Homology with Other Serine Proteases*

Saccharomyces cerevisiae*

CHM333 LECTURE 6: 1/25/12 SPRING 2012 Professor Christine Hrycyna AMINO ACIDS II: CLASSIFICATION AND CHEMICAL CHARACTERISTICS OF EACH AMINO ACID:

Chapter 21 Lecture Outline

Cells N5 Homework book

BIO 311C Spring Lecture 15 Friday 26 Feb. 1

PHRM 836 September 1, 2015

Localization of Methylated Arginine in the Al Protein from Myelin

Metabolism of amino acids. Vladimíra Kvasnicová

Bio 100 Serine Proteases 9/26/11

Student number. University of Guelph Department of Chemistry and Biochemistry Structure and Function In Biochemistry

Introduction to proteins and protein structure

endopeptidases aminopeptidases carboxypeptidases hydrolyzes a peptide bond somewhere in the middle of the polypeptide

Anfinsen,'0 and the cystines were then reduced and converted to analogues of

PROTEINS. Amino acids are the building blocks of proteins. Acid L-form * * Lecture 6 Macromolecules #2 O = N -C -C-O.

Chymotrypsin Lecture. Aims: to understand (1) the catalytic strategies used by enzymes and (2) the mechanism of chymotrypsin

Catabolism of Carbon skeletons of Amino acids. Amino acid metabolism

استاذ الكيمياءالحيوية

Chapter 11: Enzyme Catalysis

Macromolecules Structure and Function

Practice Problems 3. a. What is the name of the bond formed between two amino acids? Are these bonds free to rotate?

Chapter 23 Enzymes 1

THE AMINO ACID SEQUENCE OF HYPERTENSIN II

Review II: The Molecules of Life

Transcription:

Proc. Nat. Acad. Sci. USA Vol. 72, No. 8, pp. 3082-3086, August 1975 Biochemistry Bovine factor X1 (Stuart factor): Amino-acid sequence of heavy chain (blood coagulation/homology to serine protease) K. TITANI, K. FUjIKAWA, DAVID L. ENFIELD, LOWELL H. ERICSSON, KENNETH A. WALSH, AND HANS NEURATH Department of Biochemistry, University of Washington, Seattle, Wash. 98195 Contibuted by Hans Neurath, June 9, 1975 ABSTRACT The amino-acid sequence of the heavy chain of bovine blood coagulation factor X1 (Stuart factor) isolated before and after activation has been determined. Sequence analysis was performed on fragments obtained by cleavage with cyanogen bromide and by tryptic digestion. Comparison of the complete sequence with those of other hepatic and pancreatic serine proteases demonstrates homology of the heavy chain of activated factor X1 (factor XI.) with the B chain of bovine thrombin as well as with bovine trypsin, chymotrsins A and B, and porcine elastase. The activation peptide cleaved near the amino terminus by a protease from Russell's viper venom differs in both size and sequence from those of other serine proteases. With three exceptions, all of the residues which are important in the catalytic functions of trypsin and chymotrypsin occur in corresponding loci in the heavy chain of factor Xa. These findings suggest that the three-dimensional structure of the heavy chain is similar to that of the pancreatic serine proteases and that these enzymes have evolved from a common ancestral gene. Factor X (Stuart factor) is the zymogen of a protease which upon activation by a complex of activated factor IX (factor IXa) and factor VIII catalyzes the conversion of prothrombin to thrombin (1). Bovine factor X is a glycoprotein with a molecular weight of 55,100. It is composed of two polypeptide chains ("heavy" and "light") held together by one or more disulfide bonds. The intact protein can be separated chromatographically into two components (factor Xi and X2) which are identical in molecular weight and amino-acid composition (2-4). Upon activation by the complex of factors IXa and VIII in the presence of calcium and phospholipid, by factor VII and tissue factor in the presence of calcium, by a protease from Russell's viper venom, or by trypsin, glycopeptides are split, either near the amino or both the amino and carboxyl ends of the heavy chain, and an enzymatically active serine protease, factor Xa, is formed (5-8). The heavy chain of factor Xa is homologous to pancreatic and certain hepatic serine proteases, as evidenced by similarities of the amino-terminal sequence and the sequence surrounding the serine and histidine residues of the active site (9). The sequence of the light chain of bovine factor X1 (10), for which the pancreatic proteases have no counterpart, is homologous to the amino-terminal sequences of bovine prothrombin and factor IX (11). All three of these factors are vitamin K dependent, and this dependency appears to be expressed in the post-transcriptional synthesis of y-carboxyglutamyl residues (12, 13) in the amino-terminal portions of the zymogens. This communication presents the amino-acid sequence of the heavy chain of bovine factor X1, and an examination of the structural homology of this chain with those of other serine proteases. Abbreviation:, carbohydrate. 3082 METHODS Factor X1 was isolated from bovine plasma and converted to the active form (factor Xia) by incubation with a protease from Russell's viper venom (4, 5). Factors X1 and XIa were reduced with dithioerythritol, and S-alkylated with iodoacetic acid or 4-vinylpyridine, and the chains were then separated as described (4, 5). Cyanogen bromide fragments were prepared and separated as described (9). For clarity, fractions I through VI and A through E of the previous paper (9) are designated herein as CB I through CB VI and CB A through CB E. Fraction III was further separated into two fragments, CB III-1 and CB III-2, by recycling gel filtration on a Sephadex -S50 column in 9% formic acid. Acid cleavage of an aspartyl-proline bond in the activation peptide was carried out by the method of Fraser et al. (14). Tryptic digests of the whole heavy chain were prepared before and after succinylation of a- and E-amino groups (15), and the peptides were separated by combinations of gel filtration and ion exchange chromatography. Amino acid analyses were performed on Beckman (model 120C) and Durrum (model D-500) amino-acid analyzers according to the method of Spackman et al. (16). Automatic sequence analyses were performed on the Beckman Sequencer (model 890B) by the method of Edman and Begg (17) as modified by Hermodson et al. (18), and on the Sequemat by the method of Laursen (19). Column chromatography of glycopeptides was monitored by reaction with both ninhydrin and phenol-sulfuric acid (20). Hexose, hexosamine, and N-acetyl-neuraminic acid were analyzed by the methods of Dubois et al. (21), Elson and Morgan as described by Gardell (22), and Warren (23), respectively. RESULTS The amino-acid sequences of seven cyanogen bromide fragments of the heavy chains (H1 and Hia) of bovine factors X1 and Xia, and of several smaller fragments produced by chemical or enzymatic subfragmentation were determined largely by automatic sequence analysis as shown diagrammatically in Fig. 1. The sequences of the remaining portions of the fragments were determined by conventional methods on small peptides. The cyanogen bromide fragments were aligned with the aid of four tryptic peptides containing methionine (residues 109-126, 193-201, 206-240, and 279-281), which were isolated from the whole heavy chain, H1. Tryptic digestion of fragment CB II (370, ph 8.0 for 1 hr) yielded two large peptides, T-1 and T-2. Peptide T-1 was identical to the activation peptide (residues 1-51) except for

Biochemistry: Titani et al. Proc. Nat. Acad. Sci. USA 72 (1975) 3083 Trp Asn Arg Ile Met Met Met Met Ser 365152 117 123 193 216 233 Activation with a proteose from Russell's viper venom Trp Arg = Met Thr Lou.- (HI) 280 300 307 36 51 IlIe EisM'lk'%lS&lk--IM CJleovoge at Met a I1 Ser Leu il 52 Ala 233 300 307 Trp li CBIII-I.eu Lys Leu (H10) CBII 194 liethr PS SI Il Th 281 300 307 p s Ph Serysi 52 CB C-2 124 CB III-2 217233 Trypsin CGB IV IF CB VI (CB C-3) GI eu Trp Arg Asp L s G~~~~~ B V-T 36 5157 99 CB II-T-I CB II-T-2 FIG. 1. Diagrammatic summary of fragments generated from the heavy chains of bovine factors X1 and Xia for sequenator analyses. The top bar represents the heavy chain (H1) of factor X1 and the residues that are important for its fragmentation, in addition to those at the amino- and carboxyl-termini, at the active site, and those to which carbohydrate () is attached. The hatched sections of horizontal bars indicate the portions of sequence determined by sequenator analyses. The sequence of the section in the interior of CB III-2 was determined on a tryptic peptide (residues 152-171) isolated from the whole heavy chain. a slightly lower content of N-acetyl-neuraminic acid which appeared to be caused by partial loss under the acidic conditions of cleavage with cyanogen bromide. Most of the carbohydrate of the heavy chain H1 (4) was recovered in the activation peptide (30), and was entirely associated with a tetrapeptide, Gly-Leu-Asn-Arg (residues 34-37). This peptide was obtained by digestion of the activation peptide with chymotrypsin followed by prolonged digestion with trypsin (370, ph 8.0 for 2 days). The attachment of carbohydrate to Asn-36 was assumed from the results of sequence analysis. The sequence of the activation peptide (or of CB II T-1) was largely determined on the intact peptide. The remainder of the sequence was derived from a peptide (residues 20-51) produced by cleavage of an aspartyl-proline bond, and from a chymotryptic peptide (residues 34-51). The remaining sequence (residues 52-117) of CB II was elucidated, first by sequenator analysis of fragments CB C-2 (9) and CB II T-2 (Fig. 1), which yielded the first 44 amino-terminal residues, and then by conventional analysis of peptides representing residues 96-117. Methionyl-threonine bonds are known to resist cleavage with cyanogen bromide (24). Hence incomplete cleavage at residues 123-124 yielded two overlapping fragments, CB 111-1 (residues 118-193) and CB III-2 (residues 124-193), which were of similar size and charge. They were separated by recycling gel filtration (five cycles) on a Sephadex G-50 column, prior to sequenator analysis. The remainder of the sequence of CB III was determined on tryptic and chymotryptic peptides isolated from a mixture of the two fragments. Sequenator analysis yielded the amino-terminal sequence of 13 residues of fragment CB VI (residues 194-216). The sequence of the remaining portion of the fragment was determined conventionally on chymotryptic peptides. Fragment CB IV comprises residues 217-280. Twentyfour amino-terminal residues were previously reported (9) for an identical fragment, CB C-3, of the heavy chain (HIa). The remainder of the sequence of fragment CB IV was determined by conventional methods on nine chymotryptic peptides and 11 tryptic peptides isolated before and after succinylation of fragment CB IV. Fragment CB V (residues 281-307), which represents the carboxyl-terminal portion of the heavy chain, contained a small amount of carbohydrate [hexosamine 0.92, hexose 2.7, and neuraminic acid (Neu) 0.48 mol/mol of peptide]. The sequence of the fragment through Pro-302 was determined with the sequencer on the intact fragment and on a tryptic peptide (residues 291-307). The carbohydrate was recovered in two chymotryptic peptides (residues 300-307), which were separated on Dowex 50-X2. These two peptides had the same amino-acid composition, but different carbohydrate content, i.e., CB V C-1 (Thro.6, Pro4.09, Val."o, Leui.5o, hexosamineo.92, hexose2.9, and Neuo.94) and CB V C-2 (Thro.94, Pro4.21, Vali.oo Leu1.59, hexosamineo.91, hexose2.4, and Neuo.2o). Subtractive Edman degradations established Thr-300 at their amino termini. Sequence analyses on the Sequemat of the peptides, attached to triethylenetetramine resin, established the remainder of the sequence but yielded no phenylthiohydantoin in the first turn. The absence of threonine phenylthiohydantoin is explained by the fact that only Thr-300 could be the attachment site for carbohydrate in these octapeptides. The carboxyl-terminal leucine was confirmed by carboxypeptidase A digestion. However, the low leucine content both in the peptides and upon digestion with carboxypeptidase A for 24 hr, 370, ph 8.0 in 4 M urea (0.4 mol/mol of peptide) indicate that the carboxyterminus is heterogeneous and ends with either Pro-306 or Leu-307. The complete amino-acid sequence of 307 residues of the heavy chain (Hi) of factor X1 is shown in Fig. 2. Carbohydrate units are attached to at least two residues, Asn-36 and Thr-300, but the possibility of a third site cannot be entirely excluded. The sequences around Asn-36 and Thr-300 are consistent with those, Asn()-X-Thr (or Ser) and Thr (or

3084 Biochemistry: Titani et al. Proc. Nat. Acad. Sci. USA 72 (1975) 5 10 15 20 Trp-Ala-Ile-His-Thr-Ser-Glu-Asp-Ala-Leu-Asp-Ala-Ser-Glu-Leu-Glu-His-Tyr-Asp-Pro- 25 30 35 1 40 Ala-Asp-Leu-Ser-Pro-Thr-Glu-Ser-Ser-Leu-Asp-Leu-Leu-Gly-Leu-Asn-Arg-Thr-Glu-Pro- 45 50 J 55 60 Ser-Ala-Gly-Glu-Asp-Gly-Ser-Gln-Val-Val-Arg-Ile-Val-Gly-Gly-Arg-Asp-Cys-Ala-Glu- 65 70 75 80 Gly-Glu-Cys-Pro-Trp-Gln-Ala-Leu-Leu-Val-Asn-Glu-Glu-Asn-Glu-Gly-Phe-Cys-Gly-Gly- 85 90 95 100 Thr-Ile-Leu-Asn-Glu-Phe-Tyr-Val-Leu-Thr-Ala-Ala-His-Cys-Leu-His-Gln-Ala-Lys-Arg- 105 110 115 120 Phe-Thr-Val-Arg-Val-Gly-Asp-Arg-Asn-Thr-Gln-Glu-Gly-Asp-Glu-Glu-Met-Ala-His-Glu- 125 130 135 140 Val-Glu-Met-Thr-Val-Lys-His-Ser-Arg-Phe-Val-Lys-Glu-Thr-Tyr-Asp-Phe-Asp-Ile-Ala- 145 150 155 160 Val-Leu-Arg-Leu-Lys-Thr-Pro-Ile-Arg-Phe-Arg-Asn-Val-Ala-Pro-Ala-Cys-Leu-Pro-Glu- 165 170 175 180 Lys-Asp-Trp-Ala-Ala-Glu-Thr-Leu-Gln-Thr-Lys-Thr-Gly-Ile-Val-Ser-Gly-Phe-Gly-Arg- 185 190 195 200 Thr-His-Glu-Lys-Gly-Arg-Leu-Ser-Ser-Thr-Leu-Lys-Met-Leu-Glu-Val-Pro-Tyr-Val-Asp- 205 210 215 220 Arg-Ser-Thr-Cys-Lys-Leu-Ser-Ser-Ser-Phe-Thr-Ile-Thr-Pro-Asn-Met-Phe-Cys-Ala-Gly- 225 230 235 240 Tyr-Asp-Thr-Gln-Pro-Glu-Asp-Ala-Cys-Gln-Gly*-Asp-Ser-Gly-Gly -Pro-His-Val-Thr-Arg- 245 250 255* 260 Phe-Lys-A.sp-Thr-Tyr-Phe-Val-Thr-Gly-Ile-Val-Ser-Trp-Gly-Glu-Gly-Cys-Ala-Arg-Lys- 265 270 275 280 Gly-Lys-Phe-Gly-VW.l-Tyr-Thr-Lys-Val-Ser-Asn-Phe-Leu-Lys-Trp-Ile-Asp-Lys-Ile-Met- CflO 285 290 295 1 Lys-Ala-Arg-Ala-Gly-Ala-Ala-Gly-Ser-Arg--Gly-His-Ser-Glu-Ala-Pro-Ala-Thr-Trp-Thr- 305 Val-Pro-Pro-Pro-Leu-Pro-Leu FIG. 2. The amino-acid sequence of the heavy chain of bovine factor X1. The vertical arrow indicates the bond cleaved upon activation by a protease of Russell's viper venom. represents carbohydrate units. Asterisk (*) indicates tentative identification. Overlap of Arg- 151 to Asn-152 is also tentative. Ser) ()-X-X-Pro, proposed by Winzler (25) and Jolles et al. (26), respectively. The total amino-acid composition of the heavy chain (H1) calculated from the sequence (Asp18, Asn8, Thr2s, Sermo, Gluas, Gln7, Proj& Gly27, Ala2s, Cys9, Val2l, Met5, Ilelo, Leu23, Tyr7, Phel3, Trp&, Lysl6, Hiss, and Argl7) is in good agreement with that obtained by analysis (4). The calculated molecular weight of the peptide portion of the heavy chain (H1) is 33,773. With carbohydrate added (approximately 10% of factor X1 by weight) the heavy chain has a molecular weight of 39,300 which is in good agreement with the value (38,000) obtained by ultracentrifugal analysis (4). DISCUSSION The amino-acid sequence of the heavy chain of factor X1 reported herein is complete except for the disulfide pairing of the nine half-cystine residues. Preliminary experiments indicate the presence of four intra-chain disulfide bonds between residues 58 and 63, 78 and 94, 204 and 218, 229 and 257, and one inter-chain bond between residue 157 and the light chain. Of these four intra-chain pairs, three are present in corresponding positions in all of the pancreatic serine proteases (27). The sequence of factor X2 has not yet been completed, but partial sequence data (approximately 50% of the heavy chain) give no indications of differences. Jackson et al. (3) suggested that the two proteins differ in carbohydrate content, while Fujikawa et al. (4) found no difference in either amino-acid or carbohydrate composition of the two proteins. As already discussed (11), the peptide portion of bovine factor X1 (447 residues) is apparently smaller than that of prothrombin. Precursor proteins of the three vitamin K-de-

Biochemistry: Titani et al. Table 1. Functional residues in hepatic and pancreatic serine proteases* Residue number Proc. Nat. Acad. Sci. USA 72 (1975) 3085 Enzyme Residuet Chymo- Factor Xia Factor Xiat Thrombin Chymo- Chymo- Function trypsin heavy chain heavy chain B-chain Trypsin trypsin A trypsin B Elastase "Charge-relay" system 57 93 His His His His His His 102 138 Asp Asp Asp Asp Asp Asp 195 233 Ser Ser Ser Ser Ser Ser 214 252 Ser Ser Ser Ser Ser Ser Ion-pair 16 52 Be Ile Ile Ile Ile Val 194 232 Asp Asp Asp Asp Asp Asp H-bond to Gly-193 32 68 Leu Met Ser Ser Ser Ser 40 76 Gly Leu His His His His Orientation of Ser-195 193 231 Gly Gly Gly Gly Gly Gly 196 234 Gly Gly Gly Gly Gly Gly Substrate binding 189 227 Asp Asp Asp Ser Ser Ser pocket 190 228 Ala Ala Ser Ser Ser Gly 191 229 Cys Cys Cys Cys Cys Cys 192 230 Gln Glu Gln Met Met Gln H-bonded structure 214 252 Ser Ser Ser Ser Ser Ser anti-parallel to 215 253 Trp Trp Trp Trp Trp Phe substrate 216 254 Gly Gly Gly Gly Gly Val 226 264 Gly Gly Gly Gly Ala Thr P2-site 99 135 Tyr Leu Leu Ile Val Val Identity with factor Xia heavy chain (%) 47 40 34 33 33 * All enzymes are of bovine origin, except porcine elastase. t All sequences except that of factor Xia heavy chain are numbered according to bovine chymotrypsinogen A. $ Residues which differ in factor Xia heavy chain from the corresponding residues in trypsin are printed in italics. pendent blood coagulation factors, prothrombin and factors IX and X, are homologous and appear to have evolved from a common ancestral gene; however, only factor XI is composed of two peptide chains. Mattock and Esnouf (28) have claimed that bovine factor X may occur as a single polypeptide chain, which is subsequently cleaved to yield the two chains commonly isolated. These facts together suggest that bovine factor X may be synthesized as a single polypeptide and that a peptide segment is cleaved from the middle of the chain prior to or during isolation. The heavy chain of factor Xa contains almost all of the functional residues identified in pancreatic serine proteases and hence, as discussed below, is the carrier of catalytic activity. The light chain strongly binds calcium, and probably phospholipid (which enhance the activity of factor Xa in the presence of factor V) and thus appears to regulate the activity of the heavy chain. As described by Fujikawa et al. (29, 30), a heptadecapeptide containing carbohydrate (residues 291-307) is cleaved from the carboxyl-terminal portion of the heavy chain of bovine factor X during activation by factor IXa in the presence of phospholipid. Of the various plasma and pancreatic serine proteases, only bovine factor X has this extended carboxylterminal portion (residues 284-307). This peptide, having an unusual content of proline and carbohydrate, appears to be unnecessary for the activity of factor Xa, since an alternative method of activation, by a protease from Russell's viper venom, leaves the peptide attached. The activation peptide released from the amino terminus of the heavy chain (residues 1-51) differs in size and sequence from those of bovine trypsinogen and chymotrypsinogen, and appears to be only distantly related to the A- chain of bovine thrombin (31). The functional or physiological role of the two carbohydrate-containing activation and degradation peptides of factor X, if any, is not yet known. In analogous systems, the activation peptides of bovine trypsinogen (32) and complement C 3 (33) are reported to have pharmacological activity. The sequence of the heavy chain of bovine factor X1a (residues 52-283) is homologous to those of other plasma (31) and pancreatic serine proteases (27). Although a complete comparison among the plasma enzymes is not yet possible, the degrees of identity can be calculated by aligning the available sequences so as to yield maximum homology. As shown in Table 1, the heavy chain of factor Xia and the B-chain of thrombin (31) show the highest degree of identity (47%) as might be expected from the similar functional role of the two proteins (34). This is of the same order as the identity between bovine trypsin and chymotrypsin A (46%). Among the pancreatic serine proteases, trypsin shows the highest degree of identity with the heavy chain of factor Xia (40%), as might be expected from the common specificity of these two enzymes toward small substrates (35). Homology is less evident at the carboxyl-terminus, since the heavy chain of factor Xia contains approximately 24 extra residues when oompared to the pancreatic enzymes. Table 1 also provides convincing evidence of homology when functional residues (36-38) are compared. Components of the "charge-relay system" of pancreatic serine proteases are all present in the corresponding loci of the heavy chain of bovine factor Xia (His-93, Asp-138, Ser-233). A few changes occur among residues which form the substrate binding pocket of bovine trypsin and chymotrypsin A. These types of changes are also observed among pancreatic serine proteases and explain the different substrate specificities of these enzymes. Histidine- 40 in chymotrypsinogen, which may have a special significance among the pancreatic enzymes, is replaced by glycine-76 in the heavy chain of factor Xia. Nevertheless, these

3086 Biochemistry: Titani et al. results suggest that the three-dimensional structure of the heavy chain of factorxia is similar to those of the pancreatic serine proteases, and that its active site is similar to that of pancreatic trypsin. If, indeed, the conformation of the heavy chain of factor XIa is similar to that of bovine trypsin, approximately 63% of the internal residues would occupy identical positions in these two proteins. Verification of these assumptions must await an x-ray analysis of the structure of bovine factor Xa. The authors wish to thank Dr. E. W. Davie for valuable discussion and Dr. M. A. Hermodson for contribution to the initial phase of this work. Thanks are also due to Santosh Kumar, Richard Granberg, Douglas Marshall, and Judy Lane for excellent technical a&tance. This work was supported in part by the National Institutes of Health (GM 15731 and HL 11857) and by the Ambrican Cancer Society (BC 91P). D.L.E. is an Associate Investigator of the Hughes Medical Institute. 1. Davie, E. W. & Fujikawa, K. (1975) Annu. Rev. Biochem. 44, 799-829. 2. Jackson, C. M. & Hanahan, D. J. (1968) Biochemistry 7, 4506-4517. 3. Jackson, C. M. (1972) Biochemistry 11, 4873-4882. 4. Fujikawa, K., Legaz, M. E. & Davie, E. W. (1972) Biochemistry 11, 4882-4891. 5. Fujikawa, K., Legaz, M. E. & Davie, E. W. (1972) Biochemistry 11, 4892-4899. 6. Fujikawa, K., Legaz, M. E. & Davie, E. W. (1974) Biochemistry 13, 4508-4516. 7. Radcliffe, R. D. & Barton, P. G. (1973) J. Biol. Chem. 248, 6788-6795. 8. Jesty, J. & Nemerson, Y. (1974) J. Biol. Chem. 249,509-515. 9. Titani, K., Hermodson, M. A., Fujikawa, K., Ericsson, L. H., Walsh, K. A., Neurath, H. & Davie, E. W. (1972) Biochemistry 11, 4899-4902. 10. Enfield, D. L., Ericsson, L. H., Walsh, K. A., Neurath, H. & Titani, K. (1975) Proc. Nat. Acad. Sci. USA 72, 16-19. 11. Enfield, D. L., Ericsson, L. H., Fujikawa, K., Titani, K., Walsh, K. A. & Neurath, H. (1974) FEBS Lett. 47, 132-135. 12. Stenflo, J. (1974) J. Biol. Chem. 249,5527-5535. 13. Stenflo, J., Fernlund, P., Egan, W. & Roepstorff, P. (1974) Proc. Nat. Acad. Sci. USA 71, 2730-2733. Proc. Nat. Acad. Sci. USA 72 (1975) 14. Fraser, K. L., Poulson, K. & Haber, E. (1972) Biochemistry 11,4974-4977. 15. Yaoi, Y., Titani, K. & Narita, K. (1964) J. Biochem. Jap. 56, 222-229. 16. Spackman, D. H., Stein, W. H. & Moore, S. (1958) Anal. Chem. 30,1190-1206. 17. Edman, P. & Begg, G. (1967) Eur. J. Biochem. 1, 80-91. 18. Hermodson, M. A., Ericsson, L. H., Titani, K., Neurath, H. & Walsh, K. A. (1972) Biochemistry 11, 4493-4502. 19. Laursen, R. A. (1971) Eur. J. Biochem. 20,89-102. 20. Hirs, C. H. W. (1967) in Methods in Enzymology, ed. Hirs, C. H. W. (Academic Press, New York), Vol. 11, pp. 411-413. 21. Dubois, M., Gilles, K. A., Hamilton, J. K., Rebers, P. A. & Smith, F. (1956) Anal. Chem. 28,350-356. 22. Gardell, S. (1957) in Methods of Biochemical Analysis, ed. Glick D. (Interscience Publishers, New York), Vol. 6, pp. 289-317. 23. Warren, L. (1959) J. Biol. Chem. 234,1971-1975. 24. Schroeder, W. A., Shelton, J. B. & Shelton, J. R. (1969) Arch. Biochem. Biophys. 130,551-556. 25. Winzler, R. J. (1973) in Hormonal Proteins and Peptides, ed. Li, C. H. (Academic Press, New York), Vol. 1, pp. 1-15. 26. Jolles, J., Schentgen, F., Alais, C., Fiat, A. M. & Jolles, P. (1972) Helv. Chim. Acta 55, 2872-2883. 27. Shotten, D. M. & Hartley, B. S. (1970) Nature 225,802-806. 28. Mattock, P. & Esnouf, M. P. (1973) Nature New Biol. 242, 90-92. 29. Fujikawa, K., Coan, M. H., Legaz, M. E. & Davie, E. W. (1974) Biochemistry 13,5290-5299. 30. Fujikawa, K., Titani, K. & Davie, E. W. (1975) Proc. Nat. Acad. Sci. USA, in press for September. 31. Magnusson, S. (1971) The Enzymes, ed. Boyer, P. D. (Academic Press, New York), 3rd Ed., Vol. 3, pp. 277-321. 32. Abita, J. P., Moulin, A. & Lazdunski, M. (1973) FEBS Lett. 34,251-255. 33. Hugh, T. E., Vallota, E. H. & Miller-Eberhard, H. J. (1975) J. Biol. Chem. 250, 1472-1478. 34. Fujikawa, K., Coan, M. H., Enfield, D. L., Titani, K., Ericsson, L. H. & Davie, E. W. (1974) Proc. Nat. Acad. Sci. USA 71, 427-430. 35. Esnouf, M. P. & Williams, W. J. (1962) Biochem. J. 84,62-71. 36. Stroud, R. M., Kay, L. M. & Dickerson, R. E. (1971) Cold Spring Harbor Symp. Quant. Biol. 36, 125-140. 37. Hartley, B. S. (1968) Phil. Trans. R. Soc. Ser B 257, 77-87. 38. Segal, D. M., Powers, J. C., Cohen, G. H., Davies, D. R. & Wilcox, P. E. (1971) Biochemistry 10, 3728-3738.