Research Article 853. *Corresponding author.

Similar documents
Article No. mb J. Mol. Biol. (1998) 284, 1095±1111

Molecular Dynamics of HIV-1 Reverse Transcriptase

The structure of HIV-1 reverse transcriptase complexed with 9-chloro-TIBO: lessons for inhibitor design

SUPPLEMENTARY INFORMATION

(B D) Three views of the final refined 2Fo-Fc electron density map of the Vpr (red)-ung2 (green) interacting region, contoured at 1.4σ.

Supplementary Table 1. Data collection and refinement statistics (molecular replacement).

CS612 - Algorithms in Bioinformatics

Steered Molecular Dynamics Simulation on the Binding of NNRTI to HIV-1 RT

Department of Microbiology, School of Medicine, Box , University of Washington, Seattle, WA 98195, USA

Supplementary Information Janssen et al.

Structure of the measles virus hemagglutinin bound to the CD46 receptor. César Santiago, María L. Celma, Thilo Stehle and José M.

Supplementary Material

Table S1. X-ray data collection and refinement statistics

RNase H Cleavage of the 5 End of the Human Immunodeficiency Virus Type 1 Genome

Phenylketonuria (PKU) Structure of Phenylalanine Hydroxylase. Biol 405 Molecular Medicine

Detergent solubilised 5 TMD binds pregnanolone at the Q245 neurosteroid potentiation site.

Introduction to proteins and protein structure

Domain Flexibility in Retroviral Proteases: Structural Implications for Drug Resistant Mutations,

JOURNAL OF VIROLOGY, Oct. 2000, p Vol. 74, No. 20. Copyright 2000, American Society for Microbiology. All Rights Reserved.

Excerpt from J. Mol. Biol. (2002) 320, :

Amprenavir complexes with HIV-1 protease and its drug-resistant mutants altering hydrophobic clusters

FCC2 5CY7 FCC1 5CY8. Actinonin 5CVQ

172R 172K TAM-2/172R TAM-2/172K. AZT concentration [nm] AZT concentration [nm] MgCl 2 2.5K 2.5K 5K 2.5K 5K 2.5K K 5K 2.5K 5K 2.5K 50 2.

Supplementary Figure 1 Preparation, crystallization and structure determination of EpEX. (a), Purified EpEX and EpEX analyzed on homogenous 12.

Structural Aspects of Drug Resistance and Inhibition of HIV-1 Reverse Transcriptase

Supporting Information

Amino Acids. Review I: Protein Structure. Amino Acids: Structures. Amino Acids (contd.) Rajan Munshi

Nature Structural & Molecular Biology: doi: /nsmb Supplementary Figure 1

Chapter 6. X-ray structure analysis of D30N tethered HIV-1 protease. dimer/saquinavir complex

SUPPLEMENTARY INFORMATION

Translation Activity Guide

Supplementary Figure 1 (previous page). EM analysis of full-length GCGR. (a) Exemplary tilt pair images of the GCGR mab23 complex acquired for Random

Received 4 December 2002/Accepted 25 November 2003

Supplementary Materials for

Protein Secondary Structure

2. Which of the following amino acids is most likely to be found on the outer surface of a properly folded protein?

KDM2A. Reactions. containing. Reactions

Crystal structure of fructose-1,6-bisphosphatase complexed with

Biological Mass Spectrometry. April 30, 2014

Chemical Mechanism of Enzymes

This exam consists of two parts. Part I is multiple choice. Each of these 25 questions is worth 2 points.

Copyright 2008 Pearson Education, Inc., publishing as Pearson Benjamin Cummings

WHY MACROMOLECULAR STRUCTURE?

Protein Primer, Lumry I, Chapter 10, Enzyme structure, middle

Objective: You will be able to explain how the subcomponents of

Structural biology of viruses

BIO 311C Spring Lecture 15 Friday 26 Feb. 1

Biomolecules: amino acids

The Basics: A general review of molecular biology:

Supplementary Figures

SUPPORTING INFORMATION FOR. A Computational Approach to Enzyme Design: Using Docking and MM- GBSA Scoring

Antigen Receptor Structures October 14, Ram Savan

Review II: The Molecules of Life

Human Immunodeficiency Virus Type 2 Reverse Transcriptase Activity in Model Systems That Mimic Steps in Reverse Transcription

SUPPLEMENTARY INFORMATION (SI) FIGURES AND TABLES

Probing the Potential Glycoprotein Binding Site of Sindbis Virus Capsid Protein With Dioxane and Model Building

Secondary Structure North 72nd Street, Wauwatosa, WI Phone: (414) Fax: (414) dmoleculardesigns.com

Secondary Structure. by hydrogen bonds

Crystal structure of the neutralizing antibody HK20 in complex with its gp41 antigen

Biological systems interact, and these systems and their interactions possess complex properties. STOP at enduring understanding 4A

PRC2 crystal clear. Matthieu Schapira

Chemistry 135, First Exam. September 23, Chem 135, Exam 1 SID:

Practice Problems 3. a. What is the name of the bond formed between two amino acids? Are these bonds free to rotate?

List of Figures. List of Tables

NIH Public Access Author Manuscript J Am Chem Soc. Author manuscript; available in PMC 2008 September 29.

Common Core Structure of Amyloid Fibrils by Synchrotron X-Ray Diffraction

VIRTUAL SCREENS WITH MULTIPLE RECEPTOR CONFORMATIONS. Rob Swift, UCI Wednesday, August 3, 2011

Nature Structural & Molecular Biology: doi: /nsmb Supplementary Figure 1

Advances in membrane-protein crystallization: From detergent-free crystallization to in situ approaches. Dr. Jana Broecker

Cbl ubiquitin ligase: Lord of the RINGs

SUMMARY STATEMENT ( Privileged Communication )

Bioinformatics for molecular biology

Proteins. Amino acids, structure and function. The Nobel Prize in Chemistry 2012 Robert J. Lefkowitz Brian K. Kobilka

Levels of Protein Structure:

Crystal structures of HIV-2 protease in complex with inhibitors containing the hydroxyethylamine dipeptide isostere

Point total. Page # Exam Total (out of 90) The number next to each intermediate represents the total # of C-C and C-H bonds in that molecule.

Statin inhibition of HMG-CoA reductase: a 3-dimensional view

The role of Ca² + ions in the complex assembling of protein Z and Z-dependent protease inhibitor: A structure and dynamics investigation

Data collection. Refinement. R[F 2 >2(F 2 )] = wr(f 2 ) = S = reflections 326 parameters 2 restraints

SUPPLEMENTARY INFORMATION FOR. (R)-Profens Are Substrate-Selective Inhibitors of Endocannabinoid Oxygenation. by COX-2

Genetic information flows from mrna to protein through the process of translation

Cheiron School BL Practice of BL38B1 (Protein Crystallography)

Substrate variations that affect the nucleic acid clamp activity of reverse transcriptases

Modeling the HIV-1 Intasome: A Prototype View of the Target of Integrase Inhibitors

Structure of the RNA-dependent RNA polymerase of poliovirus Jeffrey L Hansen 1, Alexander M Long 2 and Steve C Schultz 1 *

3.2 Ligand-Binding at Nicotinic Acid Receptor Subtypes GPR109A/B

CHAPTER 21: Amino Acids, Proteins, & Enzymes. General, Organic, & Biological Chemistry Janice Gorzynski Smith

CHAPTER 9: CATALYTIC STRATEGIES. Chess vs Enzymes King vs Substrate

SUPPLEMENTAL MATERIAL. UNC119 is required for G protein trafficking in sensory neurons

HOMEWORK II and Swiss-PDB Viewer Tutorial DUE 9/26/03 62 points total. The ph at which a peptide has no net charge is its isoelectric point.

Arginine side chain interactions and the role of arginine as a mobile charge carrier in voltage sensitive ion channels. Supplementary Information

X-ray crystallographic analysis of 72 Data collection A colorless prism crystal of C Br 4 having approximate dimensions of mm was

Probing the catalytic mechanism of an antifibrotic copper metallodrug

Table S1: Kinetic parameters of drug and substrate binding to wild type and HIV-1 protease variants. Data adapted from Ref. 6 in main text.

Crystal structure of haemoglobin from donkey (Equus asinus) at 3Å resolution

brought to you by and REFERENCES

Amino acids. Side chain. -Carbon atom. Carboxyl group. Amino group

SUPPLEMENTARY INFORMATION

Life Sciences 1A Midterm Exam 2. November 13, 2006

Transient β-hairpin Formation in α-synuclein Monomer Revealed by Coarse-grained Molecular Dynamics Simulation

Transcription:

Research Article 853 Structure of unliganded HIV-1 reverse transcriptase at 2.7 Å resolution: implications of conformational changes for polymerization and inhibition mechanisms Y Hsiou 1, J Ding 1, K Das 1, AD Clark, Jr 1, SH Hughes 2 and E Arnold 1 * Background: HIV-1 reverse transcriptase (RT) is a major target for anti-hiv drugs. A considerable amount of information about the structure of RT is available, both unliganded and in complex with template-primer or nonnucleoside RT inhibitors (NNRTIs). But significant conformational differences in the p66 polymerase domain among the unliganded structures have complicated the interpretation of these data, leading to different proposals for the mechanisms of polymerization and inhibition. Results: We report the structure of an unliganded RT at 2.7 Å resolution, crystallized in space group C2 with a crystal packing similar to that of the RT NNRTI complexes. The p66 thumb subdomain is folded into the DNAbinding cleft. Comparison of the unliganded RT structures with the DNA-bound RT and the NNRTI-bound RT structures reveals that the p66 thumb subdomain can exhibit two different upright conformations. In the DNA-bound RT, the p66 thumb subdomain adopts an upright position that can be described as resulting from a rigid-body rotation of the p66 thumb along the thumb s knuckle located near residues Trp239 (in strand 14) and Val317 (in 15) compared with the thumb position in the unliganded RT structure. NNRTI binding induces an additional hinge movement of the p66 thumb near the thumb s knuckle, causing the p66 thumb to adopt a configuration that is even more extended than in the DNA-bound RT structure. Addresses: 1 Center for Advanced Biotechnology and Medicine (CABM) and Rutgers University Chemistry Department, 679 Hoes Lane, Piscataway, NJ 08854-5638, USA and 2 ABL- Basic Research Program, NCI-Frederick Cancer Research and Development Center, PO Box B, Frederick, MD 21702-1201, USA. *Corresponding author. E-mail: arnold@lion.cabm.rutgers.edu Key words: AIDS, drug design, mechanism of polymerization, non-nucleoside inhibition mechanism, polymerase structure Received: 4 April 1996 Revisions requested: 2 May 1996 Revisions received: 28 May 1996 Accepted: 30 May 1996 Structure 15 July 1996, 4:853 860 Current Biology Ltd ISSN 0969-2126 Conclusions: The p66 thumb subdomain is extremely flexible. NNRTI binding induces both short-range and long-range structural distortions in several domains of RT, which are expected to alter the position and conformation of the template-primer. These changes may account for the inhibition of polymerization and the alteration of the cleavage specificity of RNase H by NNRTI binding. Introduction The reverse transcriptase (RT) of human immunodeficiency virus type 1 (HIV-1) is an attractive target for antiviral drugs (see recent reviews in [1 4]). HIV-1 RT is a heterodimer that consists of a 66 kda subunit (p66) and a 51 kda subunit (p51) (Fig. 1). The p66 subunit is composed of a polymerase domain and an RNase H domain. The p51 subunit contains only the polymerase domain. Both the p66 and p51 polymerase domains contain four subdomains (fingers, palm, thumb and connection) which are arranged differently in the two subunits. Only the p66 polymerase domain has a DNA-binding cleft, a functional polymerase active site, and a site for binding non-nucleoside inhibitors (NNRTIs). Structures of HIV-1 RT in complexes with various nonnucleoside RT inhibitors (NNRTIs) [5 10], and in complex with a 19-mer/18-mer double-stranded DNA and an antibody Fab fragment [11] have been reported. Structures of unliganded HIV-1 RT have been solved in three different crystal forms ([12,13], EA and coworkers, unpublished data). It is reasonable to expect that comparing the structures of unliganded HIV-1 RT with those of HIV-1 RT complexed with NNRTIs or nucleic acid should yield valuable insights into the mechanisms of polymerization and inhibition. This information should be useful in the development of new or improved inhibitors of HIV-1 RT. However, in the unliganded HIV-1 RT structures already reported, there are significant conformational differences in the p66 polymerase domain, particularly in the conformation of the 6 10 9 sheet and the position of the p66 thumb subdomain ([12,13], EA and coworkers, unpublished data). This has led to controversy regarding the role of the p66 thumb in the polymerization reaction, and different interpretations of the inhibition mechanism of HIV-1 RT by NNRTIs. Here, we report the structure of an unliganded HIV-1 RT crystallized in space group C2, a crystal form closely related to, but not isomorphous with, that of a number of

854 Structure 1996, Vol 4 No 7 Figure 1 Stereo C trace of the unliganded HIV-1 RT, drawn using the program RIBBONS [32]. The heterodimer is colored by subdomains: fingers, blue; palm, red; thumb, green; connection, yellow; and RNase H, orange. Every 50th residue is labeled: p66, 1 556 and p51, 1001 1427. HIV-1 RT NNRTI complexes reported previously. This unliganded HIV-1 RT structure has a molecular packing in the crystal lattice similar to that of the NNRTI-bound HIV-1 RT structures crystallized in the C2 crystal form. Comparison of these HIV-1 RT structures should therefore help to distinguish conformational changes induced by NNRTI binding from changes caused by a different crystal packing arrangement. Results and discussion Overview of HIV-1 RT structure and flexibility Although the unliganded HIV-1 RT structure reported here was determined in a crystal form similar to that present in crystals of the HIV-1 RT NNRTI complexes, the cell parameters differ considerably. The unit cell of the unliganded RT crystals is approximately 4% (10 Å) longer along the a axis and 10% (10 Å) shorter along the c axis relative to that of the NNRTI-bound RT. This suggests that the repositioning of the p66 thumb (see later discussion) causes a considerable change in crystal packing. In the structures of RT NNRTI complexes with the C2 crystal form, two symmetry-related molecules juxtapose in such a way that the p66 fingers subdomain of one molecule contacts both the RNase H domain and the tip of the p66 thumb subdomain of the other. In the crystal lattice of the unliganded RT reported here, the RT molecules are arranged in a manner similar to that in the RT NNRTI crystals. Even though the p66 thumb subdomain is folded into the DNA-binding cleft, the same regions of the p66 thumb subdomain and the fingers subdomain of the symmetry-related molecule interact with each other, although with different specific contacts. Compared with the relative positions of symmetry-related molecules in the RT NNRTI structures, the symmetry-related molecules in this unliganded RT structure have a 20 Å translation in the xz plane relative to each other. This translation permits the RT molecules in this structure to pack more closely along the c axis, and to expand slightly along the a axis, which would account for the changes in the observed cell parameters. The structure of unliganded HIV-1 RT has been determined in several crystal forms. Rodgers et al. [12] reported the structure of an unliganded HIV-1 RT in the space group C2 with 4 molecules per asymmetric unit. We have solved the structure of an unliganded HIV-1 RT complexed with an antibody Fab fragment, which was crystallized in space group P3 2 12 with unit cell dimensions isomorphous with those of the RT DNA Fab complex (EA and coworkers, unpublished data). In these unliganded HIV-1 RT structures, the p66 thumb subdomain is folded down into the DNA-binding cleft. In the structure of unliganded RT reported here, the p66 thumb subdomain is also folded down into the DNA-binding cleft. Therefore, with dramatically different crystal packing arrangements, the RT molecules in these crystal lattices exhibits strikingly similar folding. The rms deviations between the structures of RT reported here and by Rodgers et al. are 1.17 Å, 1.13 Å, and 0.97 Å based on the superposition of C atoms for the whole RT molecule (925 C atoms), the p66 subunit (475 C atoms), and the p51 subunit (393 C atoms), respectively. The largest difference between subdomains is that the p66 thumb subdomain in the unliganded RT structure reported here is rotated further into the DNA-binding cleft by 7 relative to the structure reported by Rodgers et al. This displacement could be the result of a relatively closer crystal packing arrangement in the current crystal form. Esnouf et al. [13] also described the structure of an unliganded HIV- 1 RT that was crystallized in space group P2 1 2 1 2 1. In this structure, the p66 thumb subdomain is in an upright conformation similar to that seen in the structures of RT NNRTI complexes and the RT DNA complex. Moreover, the 6 10 9 sheet of the p66 palm subdomain moves toward the p66 thumb subdomain in a manner different from that seen in the other unliganded

Research Article Structure of unliganded HIV-1 reverse transcriptase Hsiou et al. 855 RT structures ([12], EA and coworkers, unpublished data). In the structure of unliganded HIV-1 RT reported here, the position and conformation of the 6 10 9 sheet agree very well with that found in the unliganded RT structures reported earlier ([12], EA and coworkers, unpublished data). The different conformation of the p66 thumb subdomain and the repositioning of the 6 10 9 sheet in the unliganded RT reported by Esnouf et al. [13] may be a consequence of the method they used to prepare the unliganded HIV-1 RT crystals. In that study, crystals were obtained by soaking out a weakly-bound NNRTI inhibitor (HEPT) from crystals of the HIV-1 RT HEPT complex [13]. Examination of the molecular packing in the structure of unliganded HIV-1 RT reported by Esnouf et al. indicates that the p66 thumb subdomain is interlocked between the p66 thumb and RNase H subdomains of the symmetryrelated RT molecule. It appears that the p66 thumb subdomain in this structure of unliganded RT is restricted by crystal packing in such a way that it cannot fold down into the DNA-binding cleft despite the NNRTI HEPT having been soaked out. At any rate, the differences in the position of the p66 thumb subdomain in different HIV-1 RT structures provides ample evidence for high flexibility of the p66 thumb subdomain; a flexibility that may play a vital role in the binding and translocation of the nucleic acid substrate. Polymerase active site and NNRTI-binding pocket (NNIBP) The polymerase active site is located at the 6 10 9 sheet of the p66 palm subdomain. The position of the 9 10 hairpin turn in the unliganded form is nearly identical to that in the NNRTI-bound HIV-1 RT, suggesting that the position of the 6 10 9 sheet is not significantly affected by the binding of an NNRTI. The Tyr Met Asp Asp motif (residues 183 186) in both the p66 and p51 subunits is in an unusual II turn conformation that has been observed in several other HIV-1 RT structures [7 10,12,13]. Clear density peaks were found to have good coordination geometry with O 1 of Asp185 (2.8 Å) and O 1 of Asp186 (2.8 Å) at the p66 polymerase active site, and with the O 1 of Asp443 (3.1 Å), the Oε2 of Glu478 (2.4 Å), and the O 1 and O 2 of Asp498 (2.5 Å and 2.4 Å, respectively), at the RNase H active site. Due to the current resolution limit, we cannot distinguish whether these peaks correspond to water or magnesium ions. Structures of the Moloney leukemia virus (MMLV) RT fragment [14] and the fingers and palm subdomains of the p66 subunit in this unliganded HIV-1 RT (rms deviation of 1.4 Å based on 57 C atoms in the fingers and 56 C atoms of the palm subdomain) were superposed. The results indicate that the position of the electron density peak at the polymerase active site in this unliganded HIV-1 RT is quite close to that of the Mn 2+ ion seen in the MMLV RT structure. The structures of RT complexed with the NNRTIs show that the NNIBP is formed primarily by amino-acid residues of the 5 6 loop, 6, the 9 10 hairpin, the 12 13 hairpin, and 15 of p66, and the 7 8 connecting loop of p51 [8]. In the structure of the unliganded RT reported here and that reported by Rodgers et al., and in the structure of the RT DNA Fab complex, the region corresponding to the NNIBP in the RT NNRTI structures is occupied by the side chains of Tyr181, Tyr188, Phe227, and Trp229. When an NNRTI binds, the NNIBP is created by the reorientation of the side chains of Tyr181 and Tyr188, and by the repositioning of the 12 13 14 sheet that contains the primer grip. In the unliganded RT structure reported by Esnouf et al., a small cavity is present at the NNIBP region and the 6 10 9 and 12 13 14 sheets (using our nomenclature for secondary structural elements [11]) are shifted, along with the side chains of residues Tyr181, Tyr183, and Tyr188, toward this small residual cavity [13]. This reorientation of the side chain of residue Tyr183 has not been seen in other unliganded RTs ([12], EA and coworkers, unpublished data). Because the p66 thumb subdomain in the Esnouf et al. structure is apparently constrained in a conformation similar to that in the RT NNRTI complex, the displacement of the side chain of Tyr183 and the altered positions of the 6 10 9 and 12 13 14 sheets may be caused by an incomplete collapse of the empty drugbinding pocket relative to the other HIV-1 RT structures not containing NNRTIs. Calculation of the solvent-accessible surface area based on a 1.6 Å probe for the current unliganded HIV-1 RT structure reveals two surface depressions at the base of the NNRTI-binding pocket that are plausible candidates for the entrance to the pocket (Fig. 2). These two surface depressions were also observed in the RT DNA Fab complex. One surface depression is located at the proposed entrance near Lys101, Lys103, Val179 (p66), and Glu138 (p51) (site 1) [9]. The other one is near the base of the thumb subdomain between two adjacent polypeptide segments: the 5 6 connecting loop (containing Lys101 and Lys103) and the 13 14 hairpin (containing Pro236 and Lys238) (site 2). Both sites are exposed to solvent. It is possible that an NNRTI can approach the binding pocket via either site and displace the Tyr181 and Tyr188 side chains. Once the NNRTI is bound, however, only the putative entrance near site 1 remains accessible [9]. Site 2 disappears in the NNRTI-bound HIV-1 RT structure as a result of a hinge-like movement of the 12 13 14 sheet relative to the 6 10 9 sheet. The exact inhibitor entrance and the mechanism(s) of entry are still unclear. Conformational distortions induced by NNRTI binding As noted by Rodgers et al., the displacement of the p66 thumb subdomain is the most dramatic change between the structures of the unliganded RT, the RT DNA Fab

856 Structure 1996, Vol 4 No 7 Figure 2 Stereoview of the solvent-accessible surface of the p66 palm subdomain in the unliganded RT structure showing the two surface depressions (indicated by arrows) near the NNIBP that are potential entrances to the inhibitor-binding pocket. One (site 1) is located at the proposed entrance near residues Lys101, Lys103, Val179 (p66) and Glu138 (p51). The other (site 2) is between Lys101/Lys103 and Pro236/Lys238, that is, at the base of the p66 thumb subdomain. The latter one disappears after binding of an NNRTI. Water molecules (cyan) are seen in both surface depressions. complex, and the RT NNRTI complexes. The structures of RT in this unliganded form and in the DNA-bound form superimposed quite well with an rms deviation of 1.3 Å based on all the C atoms, excluding the p66 thumb subdomain. Superposition of the structure of the RT DNA Fab complex and the unliganded form of HIV-1 RT, based on the C atoms of the 6 10 9 sheet, reveals that the p66 thumb subdomain, in the RT DNA Fab complex, is rotated 30 away from the p66 fingers and the polymerase active site (Fig. 3a), similar to that noted by Rodgers et al. This rotation functions like a thumb s knuckle and the axis of the rotation is located at the base of thumb (around residue Trp239 and Val317). The other subdomains of the polymerase domain and the RNase H domain in p66 remain in similar positions in the unliganded RT and the RT DNA Fab complex. The structures of the unliganded RT and the NNRTIbound RT agree quite nicely within individual subdomains (rms deviations are within 1.5 Å, based on the C atoms in each subdomain). However, superposition of the whole molecules does not show as good agreement (rms deviation of 2.1 Å). If the structures are superimposed based on the C atoms of the 6 10 9 sheet, binding an NNRTI causes a hinge-like movement between the 6 10 9 sheet and the 12 13 14 sheet. The hinging is near, but not coincident with the axis of the thumb s knuckle motion (Fig. 3b). This hinge-like movement, which is similar to the internal swivel motion suggested by Jäger et al. [15], extends the position of the p66 thumb subdomain beyond the upright position seen in the RT DNA complex. In the inhibitor-bound HIV-1 RT structures, the p66 thumb subdomain is rotated approximately 40 relative to the p66 fingers subdomain compared with its position in the unliganded RT. This rotation is significantly larger than the corresponding 30 rotation in the RT DNA Fab complex. Comparing the unliganded RT and the DNA-bound RT shows that the hinge movement is coupled with the binding of NNRTIs. Associated with the hinge movement of the p66 thumb, the p66 connection and RNase H subdomains move away relative to the fingers subdomain with 13 and 15 rotations, respectively, compared with the corresponding positions in the unliganded RT (Fig. 3b). The position of the p66 fingers subdomain in the HIV-1 RT NNRTI complexes is relatively unchanged (less than 4 rotation when compared with the unliganded RT). The subdomains in the p51 subunit of the NNRTI-bound RT also move (10, 12, 15, and 17 for fingers, palm, thumb, and connection subdomains, respectively). As a result, the intersubunit interactions at the heterodimer interfaces are maintained in both the unliganded RT and the RT NNRTI complex, suggesting that the heterodimer interface is relatively stable. It is interesting to note that the binding of a relatively small inhibitor molecule is accompanied by such dramatic rearrangements of the subdomains. Mechanistic implications for the inhibition by NNRTIs Comparison of the available structures of HIV-1 RT indicates that the binding of an NNRTI causes both shortrange and long-range distortions of the HIV-1 RT structure. The short-range distortions include conformational changes of the amino acid residues and/or structural elements that form the NNIBP, such as reorientation of the side chains of Tyr181 and Tyr188, and displacement of the 12 13 14 sheet. These distortions affect the precise geometry of the polymerase active site. The long-range distortions involve a hinging movement of the p66 thumb subdomain that results in the displacement of the p66 connection subdomain, the RNase H domain, and the p51 subunit relative to the polymerase active site. The observation that the chemical step of DNA polymerization is slowed by NNRTI binding [16,17] may be explained by the short-range structural distortions induced by NNRTI binding, including the conformational changes of the primer grip. Both short-range and long-range structural distortions induced by NNRTI binding alter the interactions between the nucleic acid substrates and HIV-1 RT, and affect the precise positioning of the template-primer relative to the polymerase and the RNase H active sites. NNRTI-induced changes in RT

Research Article Structure of unliganded HIV-1 reverse transcriptase Hsiou et al. 857 Figure 3 Superposition of (a) unliganded RT and RT DNA Fab complex and (b) unliganded RT and RT -APA ( -anilinophenylacetamide) complex based on 89 C atoms in the p66 palm subdomain, including the 6 10 9 region. The unliganded RT is shown in red, RT -APA in blue, and RT DNA Fab in green. A comparison of the two superpositions reveals that NNRTI binding appears to be accompanied by a long-range distortion that is coupled with a hinge motion (indicated by curved arrows) between the 6 10 9 and 12 13 14 sheets at the p66 palm subdomain (within the circle). The different positions of the thumb in different HIV-1 RT structures supports the idea that this subdomain could play a role during polymerization. interactions with nucleic acid substrates may partly account for the observations that NNRTI binding can alter the affinity of HIV-1 RT for the template-primer [16,18] as well as the specificity of RNase H cleavage [19]. HIV-1 RT mutations near the polymerase active site that had a greater effect on RNase H activity than on polymerase activity [20 22] have been described, supporting the idea that communication between distant sites on HIV-1 RT (e.g., the polymerase and RNase H active sites, the NNIBP, and other sites that interact with template-primer) can occur via the template-primer [8,11,22,23]. It is difficult to be certain, however, that repositioning of the template-primer would be solely responsible for the effect of NNRTI binding on RNase H cleavage specificity, as binding does cause other long-range movements of subdomains of both p51 and p66 relative to each other. The long-range distortions caused by NNRTIs have the potential to affect other RT activities such as strand transfer, strand displacement, and recognition of the trna used for priming reverse transcription. Biological implications The cumulative biological, genetic, and structural information about HIV-1 reverse transcriptase (RT) has considerably enhanced our understanding of the mechanisms of polymerization and inhibition. However, the structures of unliganded HIV-1 RT previously reported differ significantly in the conformation of the polymerase domain of the p66 subunit. This has led to controversy concerning the involvement of the p66 thumb subdomain in the mechanism of DNA polymerization and the mechanism of inhibition by non-nucleoside inhibitors (NNRTIs). We report here the crystal structure of an unliganded HIV-1 RT crystallized in space group C2, with molecular packing similar to that of the NNRTI-bound HIV-1 RT complexes. The overall folding of this enzyme is similar to that determined by Rodgers et al. [12]. In these two structures, the p66 thumb subdomain is folded into the DNA-binding cleft. The differences observed among the unliganded HIV-1 RTs in different crystal forms demonstrates the considerable flexibility of the p66 thumb subdomain, which may play an important role in the recognition of nucleic acid substrates and the translocation of the enzyme along the nucleic acid after incorporation of each new nucleotide. It is likely that the flexibility of the p66 thumb allows it to facilitate the repositioning (and reloading ) of RT during polymerization. A comparison of the structures of the NNRTI-bound RT complexes with those of unliganded RT and the HIV-1 RT DNA Fab complex [11] shows that the binding of

858 Structure 1996, Vol 4 No 7 an NNRTI causes both short-range and long-range structural distortions. A hinge movement at the base of the p66 thumb subdomain (near the thumb s knuckle ) appears to constrain the thumb in a wide-open configuration in the HIV-1 RT NNRTI complexes, which is extended beyond the upright configuration seen in the HIV-1 RT DNA Fab structure. Biochemical and structural data suggest that the conformational changes accompanying NNRTI binding alter the positioning of the template-primer relative to the polymerase and RNase H active sites. This could potentially account both for the inhibition of RT activity (short-range effects) and the alteration of RNase H cleavage specificity (longrange effects) by NNRTIs. Materials and methods Crystallization and data collection The unliganded HIV-1 RT was crystallized ([24] and references therein) using modified conditions reported by Kohlstaedt et al. for crystallizing the HIV-1 RT/nevirapine complex [5]. The initial crystals were grown in the absence of NNRTIs by streak-seeding with crushed RT/ -APA crystals. After crystals of unliganded HIV-1 RT were obtained, they were used for further seeding. Crystals grew to an average size of 0.1 mm 0.2 mm 0.4 mm after two weeks and had the symmetry of space group C2, which is the same as that of the HIV-1 RT NNRTI complexes. The cooled diffraction data were collected at the F1 beam line ( = 0.91 Å) of the Cornell High Energy Synchrotron Source (CHESS) from seven crystals cooled to 10 C. Data were recorded on Fuji imaging plates with an overall R merge of 11.8% and a completeness of 76.6% to 3.3 Å resolution. Another diffraction dataset was collected at the CHESS F1 beam line from a single crystal at 165 C using the flash-cooled technique [25,26]. This dataset, which was used for structural determination and refinement, has an overall R merge of 5.5%, a completeness of 99.5% to 2.7 Å resolution (no cutoff was used, and the average measurement redundancy was 3.5). The HKL package [27,28] was initially used to process, reduce, scale, and postrefine the data. Final data were scaled using SCALA (CCP4) [29]. A TAMM derivative diffraction dataset was collected at the CHESS A1 beam line and was scaled to the native dataset using FHSCAL of CCP4. The statistics of all diffraction data are summarized in Table 1. Structure determination and refinement The structure of the unliganded HIV-1 RT was determined with the molecular replacement (MR) technique as implemented in X-PLOR [30], using the diffraction data from the frozen crystal and the coordinates of the unliganded RT provided by Rodgers et al. as the starting model [12]. The rotation function search and translation function search yielded one distinct solution. The correctness of the MR solution was confirmed by the positions of mercury atoms in a difference Fourier map computed between the TAMM derivative dataset and the native dataset phased with the initial MR model. A subset of 6% of the frozen dataset was separated for calculating the free R-factor to monitor the progress of refinement. No subset of data was generated for the free R-factor calculation for the cooled diffraction dataset due to the relative incompleteness of the dataset. Iterative refinement and model building were performed using the programs X-PLOR [30] and O [31]. Omit maps were calculated for both the frozen and the cooled datasets and then averaged between the frozen and cooled datasets to improve the map quality and reduce model bias (KD et al. unpublished data). The final refinement converged to an R-factor of 24.9% and a free R-factor of 33.6% for the frozen crystal dataset, and an R- factor of 23.0% for the cooled dataset. The fit of the atomic model into the final 2mF obs F calc electron density map in the region of the p66 connection subdomain is shown in Figure 4. A total of 267 water Table 1 Summary of crystallographic data and refinement statistics. Datasets Native (cooled) Native (frozen) TAMM (frozen) Diffraction data statistics No. of crystals 7 1 1 No. of images 79 112 92 Temperature ( C) 10 165 165 a,b,c (Å) 240.3,71.5,95.5 235.5,70.3,93.3 234.0,69.7,93.0 ( ) 104.4 106.1 106.2 Resolution (Å) 3.3 2.7 3.7 Observed reflections (%) 24 727 142 770 46 901 Unique reflections (%) 18 345 41 049 15 521 Completeness (%) 76.6 99.5 99.4 R merge * (%) 11.8 5.5 11.2 R deriv (%) 21.8 Refinement statistics Resolution range (Å) 8.0 2.7 No. of reflections 38 310 R factor # (%) 24.9% Free R factor (%) 33.6% No. of protein atoms 7692 No. of water molecules 267 Average B factor (Å 2 ) 44 Rms deviations Bond (Å) 0.01 Angles ( ) 2.0 *R merge = I obs <I> / <I>. R deriv = I PH I P / I P. No. of reflections used in refinement have F 2 (F). # R= F obs F calc / F obs.

Research Article Structure of unliganded HIV-1 reverse transcriptase Hsiou et al. 859 Figure 4 Stereoview of a portion of a (2mF obs F calc ) difference Fourier map at the p66 connection subdomain, at 2.7 Å resolution. The phases were computed from the current atomic model and the map is contoured at 1.4. The side chain and the carboxyl groups are well defined in the electron density map. molecules were found that have good stereochemistry. Some residues are apparently disordered and have very weak electron density. These residues have been modeled as alanines, including residues 22, 28, 64, 70 71, 134 136, 139, 169, 177, 194, 203, 218, 224, 242, 278, 281, 291, 297, 305, 311 312, 334, 336, 356 359, 404, 407, 413 414, 424, 448, 451, 512, 514, 527, 549 551, and 556 in the p66 subunit, and residues 36, 67 69, 88 89, 173, 197, 238, 281 282, 295, 297, 305 306, 308, 356 358, 361 362, 407, 424 425, and 427 in the p51 subunit. A part of the connecting loop between F and 13 in the p51 subunit (residues 219-231) has weak electron density and thus has not been modeled. Final refinement statistics are listed in Table 1. Accession numbers Coordinates for the final model of the native structure refined using the frozen dataset have been deposited with the Brookhaven Protein Data Bank (entry code1dlo). Acknowledgements We thank David Rodgers, Steve Gamblin, and Steve Harrison for providing the coordinates of their unliganded HIV-1 RT structure prior to publication. We thank the other members of the Arnold and Hughes laboratory, including Pat Clark, Peter Frank, Andrew Holmes, Wade Huber, Karen Lentz, and Dave Miller, Dawn Resnick, Chris Tantillo and Wanyi Zhang for their assistance and helpful discussion; the members of CHESS staff for their help with X-ray data collection; the Frederick Biomedical Supercomputing Center for providing Cray-YMP computing time; the Keck Structural Biology Computing Center at CABM; and Andrew Holmes for critical reading of the manuscript. The research in EA s laboratory has been supported by NIH grants (AI 27690 and AI 36144), Janssen Research Foundation, and Hoechst General Pharma Research, a grant from NCI-ABL and the Keck Foundation, and SHH s laboratory is sponsored in part by the National Cancer Institute, DHHS, under contract with ABL and by NIGMS. References 1. Larder, B.A. (1993). Inhibitors of HIV reverse transcriptase as antiviral agents and drug resistance. In Reverse Transcriptase. (Skalka, A.M. & Goff, S.P., eds), pp. 205 222, Cold Spring Harbor Laboratory Press, New York. 2. Tantillo, C., et al., & Arnold, E. (1994). Locations of anti-aids drug binding sites and resistance mutations in the three-dimensional structure of HIV-1 reverse transcriptase: implications for mechanisms of drug inhibition and resistance. J. Mol. Biol. 243, 369 387. 3. De Clercq, E. (1995). Toward improved anti-hiv chemotherapy: Therapeutic strategies for intervention with HIV infections. J. Med. Chem. 38, 2491 2517. 4. Arnold, E., Das, K., Ding, J., Yadav, P.N.S., Hsiou, Y., Boyer, P.L. & Hughes, S.H. (1996). Targeting HIV reverse transcriptase for anti- AIDS drug design. Drug Design and Discovery. 13, 29 47. 5. Kohlstaedt, L.A., Wang, J., Friedman, J.M., Rice, P.A. & Steitz, T.A. (1992). Crystal structure at 3.5 Å resolution of HIV-1 reverse transcriptase complexed with an inhibitor. Science 256, 1783 1790. 6. Smerdon, S.J., et al., & Steitz, T.A. (1994). Structure of the binding site for non-nucleoside inhibitors of the reverse transcriptase of human immunodeficiency virus type 1. Proc. Natl. Acad. Sci. USA 91, 3911 3915. 7. Ren, J., et al., & Stammers, D. (1995). High resolution structures of HIV-1 RT from four RT-inhibitor complexes. Nat. Struct. Biol. 2, 293 302. 8. Ding, J., et al., & Arnold, E. (1995). Structure of HIV-1 reverse transcriptase in a complex with the non-nucleoside inhibitor -APA R 95845 at 2.8 Å resolution. Structure 3, 365 379. 9. Ding, J., et al., & Arnold, E. (1995). Structure of HIV-1 RT/TIBO R 86183 reveals similarity in the binding of diverse non-nucleoside inhibitors. Nat. Struct. Biol. 2, 407 415. 10. Ren, J., et al., & Stuart, D. (1995). The structure of HIV-1 reverse transcriptase complexed with 9-chloro-TIBO: lessons for inhibitor design. Structure 3, 915 926. 11. Jacobo-Molina, A., et al., & Arnold, E. (1993). Crystal structure of human immunodeficiency virus type 1 reverse transcriptase complexed with double-stranded DNA at 3.0 Å resolution shows bent DNA. Proc. Natl. Acad. Sci. USA 90, 6320 6324. 12. Rodgers, D.W., et al., & Harrison, S.C. (1995). The structure of unliganded reverse transcriptase from the human immunodeficiency virus type 1. Proc. Natl. Acad. Sci. USA 92, 1222 1226. 13. Esnouf, R., Ren, J., Ross, R., Jones, Y., Stammers, D. & Stuart, D. (1995). Mechanism of inhibition of HIV-1 reverse transcriptase by nonnucleoside inhibitors. Nat. Struct. Biol. 2, 303 308. 14. Georgiadis, M.M., Jessen, S.M., Ogata, C.M., Telesnitsky, A., Goff, S.P. & Hendrickson, W.A. (1995). Mechanistic implications from the structure of a catalytic fragment of Moloney murine leukemia virus reverse transcriptase. Structure 3, 879 892. 15. Jäger, J., Smerdon, S.J., Wang, J., Boisvert, D.C. & Steitz, T.A. (1994). Comparison of three different crystal forms shows HIV-1 reverse transcriptase displays an internal swivel motion. Structure 2, 869 876. 16. Spence, R.A., Kati, W.M., Anderson, K.S. & Johnson, K.A. (1995). Mechanism of inhibition of HIV-1 reverse transcriptase by nonnucleoside inhibitors. Science 267, 988 993. 17. Rittinger, K., Divita, G. & Goody, R. (1995). Human immunodeficiency virus reverse transcriptase substrate-induced conformational changes and the mechanism of inhibition by non-nucleoside inhibitors. Proc. Natl. Acad. Sci. USA 92, 8046 8049. 18. Debyser, Z., et al., & De Clercq, E. (1991). An antiviral target on reverse transcriptase of human immunodeficiency virus type 1 revealed by tetrahydroimidazo-[4,5,1-jk][1,4]benzodiazepinone-2(1h)- one and -thione derivatives. Proc. Natl. Acad. Sci. USA 88, 1451 1455. 19. Palaniappan, C., Fay, P.J. & Bambara, R.A. (1995). Nevirapine alters the cleavage specificity of ribonuclease H of human immunodeficiency virus 1 reverse transcriptase. J. Biol. Chem. 270, 4861 4869.

860 Structure 1996, Vol 4 No 7 20. Boyer, P.L., Ferris, A.L. & Hughes, S.H. (1992). Cassette mutagenesis of the reverse transcriptase of human immunodeficiency virus type 1. J. Virol. 66, 1031 1039. 21. Boyer, P.L., Ferris, A.L. & Hughes, S.H. (1992). Mutational analysis of the fingers domain of human immunodeficiency virus type-1 reverse transcriptase. J. Virol. 66, 7533 7537. 22. Boyer, P.L., et al., & Hughes, S.H. (1994). Mutational analysis of the fingers and palm subdomains of human immunodeficiency virus type-1 (HIV-1) reverse transcriptase. J. Mol. Biol. 243, 472 483. 23. Kleim, J.-P., et al., & Riess, G. (1996). Selective pressure of a quinoxaline class non-nucleoside inhibitor of human immunodeficiency virus type 1 (HIV-1) reverse transcriptase (RT) on HIV-1 replication results in the emergence of nucleoside RT inhibitor specific (RT L74 V/I, V75 L/I) HIV-1 mutants. Proc. Natl. Acad. Sci. USA 93, 34 38. 24. Clark, A.D., Jr., Jacobo-Molina, A., Clark, P., Hughes, S.H. & Arnold, E. (1995). Crystallization of human immunodeficiency virus type-1 reverse transcriptase with and without nucleic acid substrates, inhibitors and an antibody Fab fragment. In Methods in Enzymology. (Campbell, J.L., ed.), pp. 171 185, Academic Press, New York. 25. Teng, T.Y. (1990). Mounting of crystals for macromolecular crystallography in a free-standing thin film. J. Appl. Cryst. 24, 387 391. 26. Rodgers, D.W. (1994). Cryocrystallography. Structure 2, 1135 1140. 27. Otwinowski, Z. (1993). Oscillation Data Reduction Program. In Proceeding of the CCP4 Study Weekend: Data Collection and Processing. (Sawyer, L., Issacs N., Bailey, S., eds), pp 55 62, SERC Daresbury Laboratory, UK. 28. Miner, W. (1993). XDISPLAYF Program, Purdue University. 29. Collaborative Computational Project, No. 4. (1994). The CCP4 Suite: Programs for Protein Crystallography. Acta Cryst. D 50, 760 763. 30. Brünger, A.T. (1993). X-PLOR Manual Version 3.1: A system for X- ray crystallography and NMR. Yale University Press, New Haven and London. 31. Jones, T.A., Zou, J.Y., Cowan, S.W. & Kjeldgaard, M. (1991). Improved methods for building protein models in electron density maps and the location of errors in these models. Acta Cryst. A 47, 110 119. 32. Carson, M. (1987). Ribbon models of macromolecules. J. Mol. Graphics 5, 103 106.