Biophysical Characterization of Engineered HIV-1 Immunogen. Derived from gp120 and a Lightweight Scaffolding Protein

Similar documents
Identification of Mutation(s) in. Associated with Neutralization Resistance. Miah Blomquist

Human Immunodeficiency Virus type 1 (HIV-1) gp120 / Glycoprotein 120 ELISA Pair Set

Characterization of the DNA-mediated Oxidation of Dps, a Bacterial Ferritin

Influenza B Hemagglutinin / HA ELISA Pair Set

Low ds/dn Does Not Correlate With High Variation of Amino Acid Sequences Along the gp120 Protein Structure

EMERGING ISSUES IN THE HUMORAL IMMUNE RESPONSE TO HIV. (Summary of the recommendations from an Enterprise Working Group)

Influenza A H7N9 (A/Anhui/1/2013) Hemagglutinin / HA ELISA Pair Set

Luminescent platforms for monitoring changes in the solubility of amylin and huntingtin in living cells

Supplementary Information

Lane: 1. Spectra BR protein ladder 2. PFD 3. TERM 4. 3-way connector 5. 2-way connector

Human Immunodeficiency Virus type 1 (HIV-1) p24 / Capsid Protein p24 ELISA Pair Set

From Antibody to Vaccine a Tale of Structural Biology and Epitope Scaffolds

Chapter 8. Interaction between the phosphatidylinositol 3- kinase SH3 domain and a photocleavable cyclic peptide

Crystal structure of the neutralizing antibody HK20 in complex with its gp41 antigen

HIV-1 p24 ELISA Pair Set Cat#: orb54951 (ELISA Manual)

New Technology in Vaccine Engineering

Week 5 Section. Junaid Malek, M.D.

Supporting Information

<Supplemental information>

Recombinant Protein Expression Retroviral system

N-Glycosidase F Deglycosylation Kit

Crystallization-grade After D After V3 cocktail. Time (s) Time (s) Time (s) Time (s) Time (s) Time (s)

Supplementary Material

Affinity Purification of Photosystem I from Chlamydomonas reinhardtii using a Polyhistidine Tag

Influenza A H1N1 HA ELISA Pair Set

Influenza A H1N1 (Swine Flu 2009) Hemagglutinin / HA ELISA Pair Set

EpiQuik Total Histone H3 Acetylation Detection Fast Kit (Colorimetric)

Transient β-hairpin Formation in α-synuclein Monomer Revealed by Coarse-grained Molecular Dynamics Simulation

Total Histone H3 Acetylation Detection Fast Kit (Colorimetric)

Supplementary Figure-1. SDS PAGE analysis of purified designed carbonic anhydrase enzymes. M1-M4 shown in lanes 1-4, respectively, with molecular

Multiple-Choice Questions Answer ALL 20 multiple-choice questions on the Scantron Card in PENCIL

EPIGENTEK. EpiQuik Global Acetyl Histone H3K27 Quantification Kit (Colorimetric) Base Catalog # P-4059 PLEASE READ THIS ENTIRE USER GUIDE BEFORE USE

Western Immunoblotting Preparation of Samples:

HIV Anti-HIV Neutralizing Antibodies

Structural biology of viruses

Protocol for purification of recombinant protein from 300 ml yeast culture

Boosts Following Priming with gp120 DNA

ARV Mode of Action. Mode of Action. Mode of Action NRTI. Immunopaedia.org.za

Protocol for Gene Transfection & Western Blotting

2009 H1N1 Influenza ( Swine Flu ) Hemagglutinin ELISA kit

Supplementary Figures

Cell Lysis Buffer. Catalog number: AR0103

Bacterial expression of a VLP Sub-unit for rapid and cheap influenza vaccination

Chapter 9. Protein Folding, Dynamics, and Structural Evolution

Characterization of Influenza Hemagglutinin Mutants for the Elucidation of Key Residues Effect on Activation

MagCapture Exosome Isolation Kit PS Q&A

Antigen Receptor Structures October 14, Ram Savan

Aperto Cell Lysis and Protein Solubilization Users Manual

Detergent solubilised 5 TMD binds pregnanolone at the Q245 neurosteroid potentiation site.

HIV: RECEPTORS AND CO-RECEPTORS

Supporting Information for:

Mitochondrial Trifunctional Protein (TFP) Protein Quantity Microplate Assay Kit

Trident Membrane Protein Extraction Kit

Broad Spectrum Protease Inhibitor Cocktail

SUPPLEMENTARY MATERIAL

Structural Insights into HIV-1 Neutralization by Broadly Neutralizing Antibodies PG9 and PG16

Application of μmacs Streptavidin MicroBeads for the analysis of HIV-1 directly from patient plasma

Supplementary Figure 1 Preparation, crystallization and structure determination of EpEX. (a), Purified EpEX and EpEX analyzed on homogenous 12.

Spike Trimer RNA. dsdna

Isolation of a Broadly Neutralizing Antibody with Low Somatic Mutation from a Chronically Infected HIV-1 Patient

Nature Methods: doi: /nmeth Supplementary Figure 1

Title: Column Chromatography of Green Fluorescent Protein

Overview of the Expressway Cell-Free Expression Systems. Expressway Mini Cell-Free Expression System

GOVX-B11: A Clade B HIV Vaccine for the Developed World

Chromatin IP (Isw2) Fix soln: 11% formaldehyde, 0.1 M NaCl, 1 mm EDTA, 50 mm Hepes-KOH ph 7.6. Freshly prepared. Do not store in glass bottles.

7.06 Cell Biology EXAM #3 April 24, 2003

CD4 T Cell Decline Is Not Associated With Amino Acid Changes in HIV-1 gp120

CS612 - Algorithms in Bioinformatics

Supporting information (protein purification, kinetic characterization, product isolation, and characterization by NMR and mass spectrometry):

Antibodies: LB1 buffer For 50 ml For 10ml For 30 ml Final 1 M HEPES, ph 2.5 ml 0.5 ml 1.5 ml 50mM. 5 M NaCl 1.4 ml 280 µl 0.

Introduction to proteins and protein structure

Supplementary Figure 1. ALVAC-protein vaccines and macaque immunization. (A) Maximum likelihood

Supplementary Materials for

PRODUCT INFORMATION & MANUAL

Final Report. Identification of Residue Mutations that Increase the Binding Affinity of LSTc to HA RBD. Wendy Fong ( 方美珍 ) CNIC; Beijing, China

HUMAN IMMUNODEFICIENCY VIRUS

PHAR3316 Pharmacy biochemistry Exam #2 Fall 2010 KEY

7.013 Spring 2005 Problem Set 7

H5N1 ( Avian Flu ) Hemagglutinin ELISA Pair Set

Tel: ; Fax: ;

The Schedule and the Manual of Basic Techniques for Cell Culture

In Silico Vaccine Design Based on Molecular Simulations of Rhinovirus Chimeras Presenting HIV-1 gp41 Epitopes

BabyBio IMAC columns DATA SHEET DS

Recipes for Media and Solution Preparation SC-ura/Glucose Agar Dishes (20mL/dish, enough for 8 clones)

Europium Labeling Kit

Supporting Information. Controlled Nucleation and Growth of DNA Tile Arrays within Prescribed DNA Origami Frames and Their Dynamics

Biological Sciences 4087 Exam I 9/20/11

Chromatin Immunoprecipitation (ChIPs) Protocol (Mirmira Lab)

SUPPLEMENTAL INFORMATION

Catalysis & specificity: Proteins at work

Note: During 30 minute incubation; proceed thru appropriate sections below (e.g. sections II, III and V).

Section 6. Junaid Malek, M.D.

Instructions. Fuse-It-mRNA easy. Shipping and Storage. Overview. Kit Contents. Specifications. Note: Important Guidelines

Molecular Graphics Perspective of Protein Structure and Function

Broad and Potent Neutralizing Antibodies from an African Donor Reveal a New HIV-1 Vaccine Target

HIV Vaccines: Basic Science

Exosome ELISA Complete Kits

MECHANISMS OF VIRAL MEMBRANE FUSION AND ITS INHIBITION

Lecture 2: Virology. I. Background

Transcription:

Biophysical Characterization of Engineered HIV-1 Immunogen Derived from gp120 and a Lightweight Scaffolding Protein Edgar Faison April 24, 2018 A Thesis Presented in Partial Fulfillment for the Honors Bachelor s Degree in Chemistry with a concentration in Biochemistry at the University of North Carolina at Chapel Hill. Research Was Conducted in the Laboratory of Dr. Nikolay Dokholyan under the Supervision of Dr. Cheng Zhu. Approved: Nikolay Dokholyan, Ph.D. Thesis Advisor Andrew Moran, Ph.D. Reader Yosuke Kanai, Ph.D. Reader 1

Abstract An effective vaccine for the Human Immunodeficiency Virus (HIV) has been a global health priority for decades. The HIV-1 surface protein Env is the sole envelope protein on the surface of the virus, presenting a number of obstacles such as a carbohydrate coat on the surface of the protein and high variability in numerous regions of the protein thereby rendering many traditional vaccine strategies ineffective. Here, a strategy to exploit the variability in glycosylation among a limited number of non-essential yet conserved sites of Env has yielded a hybrid protein consisting of an epitope from Env and a light-weight, thermally stable scaffold protein that may function as an immunogen for HIV. The biophysical characterization of this engineered protein was attempted computationally via molecular dynamics simulations and experimentally via circular dichroism and x-ray crystallography to confirm the computational model and to evaluate the in vitro structure of the protein as well as the fidelity of the epitope. Introduction The Human Immunodeficiency Virus (HIV) remains a global epidemic for decades, evading many attempts at a working and lasting vaccine. HIV is a retrovirus which utilizes a membrane coated with a trimeric envelope protein (Env) that functions in docking and entry of the host cell. The three subunits are heterodimers consisting of gp41, a transmembrane protein embedded in the viral membrane that assists in trimerization and acts as a membrane anchor, and gp120, which recognizes the transmembrane glycoprotein CD4 in conjunction with the chemokine receptors CCR4 and CXCR5 on the host cell to mediate membrane fusion. As a result of this intimate interaction with the host cell, the external surface of gp120 is the primary target for eliciting an immune response such as neutralizing antibodies (NAbs), as well as the most conducive target in deriving man-made immunogens. 1 Neutralizing antibodies are effective in fending off viral infection by negating the virus s ability to infect the host cell; for example, a neutralizing antibody for HIV may bind to the the CD4 binding domain on gp120, thereby preventing the virus from docking and fusing to the cell membrane. However, the virus has several strategies to mitigate the host immune response and likewise pose challenges in vaccine development. First, the surface of Env is covered in a carbohydrate coat via N-linked glycosylation; this coat occludes most of the protein s surface and shields the majority of antigenic sites. 2 Oftentimes, regions that would be ideal targets for conventional NAbs are protected by heavy glycosylation in a pocket domain on gp120, such as the aformentioned CD4 binding domain. 3 Env is also incredibly plastic and ever-changing, owing to its high variability in amino acid sequence among HIV clades and a resulting variation in linked carbohydrates in a given population of viruses. 4,5 The virus accomplishes this with variable loops (referred to as V1-V5) that can withstand a high degree of change while still retaining the protein s original functionality. While NAbs may be produced in response to the initial infection, changes in these variable regions will render these antibodies ineffective in the same time scale as producing those antibodies. 6 2

Lastly, Env exists in multiple metastable conformations that can only be stabilized by CD4 or other binders, further complicating exploitation to elicit NAbs. Despite these obstacles, there are several strategies being employed to create a vaccine. A popular strategy is to induce broadly neutralizing antibodies (bnabs) which recognize over 80 to 90% of viral strains. These bnabs are rare, however, occurring in a small percentage of individuals after 2-4 years of viral infection. 7 These bnabs only target a select few antigen site on the Env complex, including the CD4 binding site and a conserved glycan site among others. These bnabs have some structural peculiarities that may grant them broadly neutralizing properties including larger complementarity-determining regions and somatic modifications. 8 A special method must be employed to promote these sort of antibodies by promoting the correct B-cell pathway with the correct type of immunogen at specific time points along the B cell maturation timeline. Figuring out the correct combination and timing of booster immunogens may present another daunting obstacle in determining an effective vaccine method comparable to those stated above. 9 Seeking these bnabs may be an extraneous task: instead of seeking to elicit a singular, broadly effective immune response that could potentially be made ineffective by a singular or relatively few mutations in a viral strain, a cocktail of immunogens which exploit numerous conserved epitopes on the gp120 could prove to be more effective and resistant to mutation. The Dokholyan lab has made such a panel of engineered prototype immunogens. These proteins consist of a light-weight, thermostable scaffolding protein with an antigen site from HIV type 1 clade C gp120. Unlike many other approaches which target exposed regions of gp120, these antigen site were sampled from the few regions which usually have a non-functional carbohydrate; in other words they are Carboydrate Occluded Neutralizing Epitopes (CONEs). The key reason for choosing these counter-intuitive targets stems from an analysis of a diverse panel of gp120 sequences by McCoy et al, suggesting 89% of HIV strains are missing at least one conserved glycosylation site which would normally protect a specific antigenic site. 10 Moreover, it appears HIV in acutely infected individuals tend to be underglycosylated by about 4 positions which were located in both variable and conserved regions. 11 This further evidences a potential target in regions that are usually glycosylated since these regions tend to be more conserved than normally unprotected regions. Thus, should a cocktail of these proteins be used as immunogens for clade C HIV, the likelihood of being rendered ineffective by viral mutation is lower due to the non-universal selective pressure to do so per site. Here, one such protein, named C1S1 because it consists of CONE number 1 on scaffolding protein number 1, is characterized structurally and thermodynamically both computationally and experimentally through a variety of means. Since this protein was rationally designed for the purpose of being used as a potential vaccine, it is important to check the structure experimentally and ensure that the design of the protein matches what was intended and that the integrity of the antigenic site is kept in vitro. Moreover, it is just as important to examine the stability and behavior of the protein in silico and in vitro to make sure the protein will not unfold at body temperature or exhibit any sort of properties or peculiarities than can cause any aberrant interactions. 3

Materials and Methods Molecular Dynamics Molecular dynamics simulations were ran on the C1S1 design structure three times using the Gromacs molecular simulation package with the Charmm27 force field. Simulation boxes were solvated with the TIP3P model of water and NaCl was added to a concentration of 150 mm. Simulation boxes were relaxed and equilibrated at constant volume and temperature at 310 K for 100 ps in 50,000 step sizes, then constant pressure and temperature at 1 atm and 310 K. Once equilibrated, the production molecular dynamics simulation were ran for 100 ns in 5 10 7 step sizes. RMSF data and b factor data were extracted from these trajectories and compared. Protein Expression and Purification The expression gene for C1S1 was inserted into a bacterial expression vector (pet 14b) and transformed in the BL21 strain of Escherichia coli. A single colony was selected to inoculate 100 ml of LB medium containing 10 µg/ml ampicillin to grow overnight at 37 C. From this culture, approximately 25 ml were aliquoted into two to four 1L LB media containing 10 µg/ml ampicillin and grown to an OD 600 of at least 0.60. Protein expression was induced with IPTG to a final concentration of 1 mm for 3 hr at 37 C. Cells were pelleted by centrifugation at 5,000 g for 20 minutes. Pellets were frozen in -80 C for storage. For each round of purification via affinity chromatography, 2 bacterial pellets were thawed and resuspended to a total volume of 60 ml using buffer A (20 mm PB, 100 mm NaCl, 40 mm imidizole, ph 7.4) with 20 µg PMSF, 60 µg pepstain A, and 10 µl β-mercaptoethanol at approximately 4 C. Cells were lysed via sonication (40 minutes, 5 second bursts followed by 5 second rest) and centrifuged at 50,000 g for 20 minutes. Supernatant was collected and passed through a 0.2 µm filter prior to purification via His-Tag nickel affinity chromatography. The column was equilibrated with 30 ml of buffer A, loaded with the protein lysate, washed with buffer A, and finally eluted on a gradient with buffer B (20 mm PB, 100 mm NaCl, and 500 mm imidizole at ph 7.4) ramping to 90% in 40 minutes. Fractions were collected and stored in 4 C for use in thermal denaturing studies or for crystal screening. Following affinity chromatography, samples were changed to SEC buffer (20 mm Tris and 500 mm NaCl at ph 7.4). An analytical grade size exclusion column was equilibrated with 120 ml of SEC buffer. The sample was injected and the peak corresponding to an unaggregated protein was collected. The fractions were combined and the sample was concentrated down to a final volume of roughly 200 µl. Samples were flash frozen with liquid nitrogen and stored in -80 C. 4

Thermally Denaturing Circular Dichroism Protein samples were concentrated and changed to a buffer 60 mm PB buffer ph 7.4. Samples mean residue ellipticity (MRE) from 190 to 260 nm were analyzed on a Chirascan Plus on a temperature ramp from 25 C to 95 C. A melting curve was constructed by comparing the evolution of the MRE signal at 208 nm as the temperature was increased. Dialysis and Histidine Tag Cleavage Following purification via His-Tag affinity chromatography, fractions containing C1S1 were concatenated and dialyzed in TEV reaction buffer (50 mm Tris, 200 mm NaCl in 4 L at ph 8.0) for at least 4 hours prior to cleaving at either 4 C or 25 C. Varying concentrations of TEV were added to samples at approximately a 1:100 TEV:C1S1 mass ratio and left to incubate for up to 4 days. Time point samples were taken for analysis via SDS-PAGE (running at 1 hour at 200 V on 18% polyacrylamide gels). Crystal Screening A thawed protein sample following SEC at a concentration of 40 ng/µl was spun down at approximately 10,000 g to remove aggregates. The first round of crystal screening was conducted with a set of commercial screening solutions via a Rigaku Phoenix crystal screening robot to set up a set of four 96 multiwell sitting drop trays. Drops were monitored for up to 30 days at 20 C for crystal hits. Two crystal hit conditions were selected for 24 well optimization: 1. 20% w/v PEG 3050 and 100 mm NaOAc at ph 4.5; 2. 2 M Tris and 100 mm NaCl at ph 8.5. Results Protein Design The hybrid protein C1S1 was designed by members of the Dokholyan laboratory using the in-house program Eris. A motif mining algorithm developed in the Dokholyan laboratory was employed to search the Protein Data Bank for lightweight, thermally stable scaffolding proteins that had structures that were similar to the selected antigen region from gp120. The matched proteins had the corresponding structures mutated to the antigen site with another in-house program, and the G mut were calculated. Largely destabilizing structures were thrown out and a panel of stabilized or weakly destabilizing mutants were kept. A 10 amino acid antigen site originating from the triple β-sheet on gp120 (named CONE 1) was grafted onto a similarly structured motif on the heterogeneous nuclear ribonucleoprotein D0, or hnrnp (named Scaffold 1), to form C1S1. 5

Figure 1: Side by Side Comparison of C1S1 and its Parent Scaffold The parent scaffold is shown on the left and the engineered immunogen on the right.12 Residues shown in purple were replaced with residues shown in cyan. Molecular Dynamics RMSF data and b factor visualization structures are shown in Figure 2. Overall, the scaffold appears to have reasonably low fluctuations at around 1-2 A for all three simulations and structural variability with different grafted epitopes including CONE 1. Note the frequent spike in RMSF at residues 35-40. Figure 2: C1S1 MD Simulation Data Generated from Gromacs A. RMSF data generated from 3 Gromacs simulations, shown in green, blue, and red. B. C1S1 structure with b factor spectrum representation; red denotes residues with relatively high b factors, white for intermediate, and blue for relatively low. Protein Expression The target protein has been successfully expressed and purified numerous times. Typically for each batch of C1S1 purified from 2L of induced E. coli, nickel column elution occurred at around 32% buffer B for an 6

absorbance reading of about 800-1000 mau; size exclusion purification for crystallization purposes typically involved the concatenation of two batches of nickel column purified C1S1, and eluted at 103.433 ml for a peak absorbance reading of about 800-1000 mau. Size exclusion peaks suggest that C1S1 may exist in a two conformations that cannot be separated by the column. Histidine Tag Cleavage TEV cleavage results do not appear to be conclusive. Multiple trials have been executed with different TEV stocks for extended amounts of time to result in either partially cleaved protein (Figure 3A) or uncleaved protein (Figure 3B). Interestingly, the partially cleaved protein from Figure 4A formed a hydrogel when concentrated. This is a phenomenon commonly seen in nuclear RNA binding proteins which the scaffold for C1S1 belongs to. Figure 3: SDS-PAGE Results for different TEV Protease Cleavage Experiments A. First TEV cleavage result, with 4,6, and 9 hour time points at room temperature show partial cleavage. B. Another, later TEV cleavage, with 1, 2, and 4 day time points followed by a control for cold (left) and room temperature (right) experiments show inconclusive results. Note 50% cleavage in A as opposed to what appears to be 0% cleavage in B. Circular Dichroism CD data (shown in Figure 4A) for both C1S1 and its parent scaffold protein display very similar curves, suggesting close structural similarities between the two proteins in vitro. A melting curve from thermal denaturing CD data is presented in Figure 4B. The melting point of the protein is about 57 C. Results suggest C1S1 would be stable at human body temperature at 37 C with only about 5% of protein unfolded at this temperature in vitro. 7

Figure 4: CD Spectra for C1S1 and Parent Protein with Melting Curves A. C1S1 and its parent protein share similar structures in solution given the similarity of CD spectra. B. Melting curves built from CD spectra from 25 to 95 C show C1S1 is slightly destabilized but still retains a high melting point. Crystal Screening Several hits (Figure 6) for C1S1 were found in 96 well screens using commercial reagents, summarized in Table 1. Crystal hits, however, were often numerous, small, and irregularly formed, suggesting crystallization would be a problem in the 24 well optimization by hanging drop. The two chosen conditions were expanded with partially cleaved C1S1. Results showed at best very small, singular crystals at 10% w/v PEG 3050, 100 mm NaOAc at ph 4.5 grown at 4 C. Other wells showed aggregates forming almost immediately after adding protein if a high concentration of crystallizing agent or salt was present in the drop. Figure 5: Representative Crystals from Screening Conditions Screening crystals for 10% w/v PEG 3050, 100 mm NaOAc at ph 4.5. Screening well for scale, drop is approximately 1µL. Note small size and malformed shape. Table 1: Crystal Screening Conditions The first two entries were expanded in 24 well hanging drop experiments Crystallizing Agent/Organic Solvent Buffer (ph) Salt 20% w/v PEG 3350 100 mm NaOAC (ph 4.5) 2 M Tris (ph 8.5) 100 mm NaCl 30 % w/v glycerol; 14% v/v 2-propanol 140 mm CaCl 2 ; 70 mm NaCl 2% w/v PEG 8,000 500 mm Li 2 SO 4 8

Discussion C1S1 Design Is Thermostable and Retains Shape at Body Temperature RMSF data presented in Figure 2 shows that epitope sites have a RMSF of around 1 Å. However, the b factor around some sites are rather high b factor values, which are a measure of uncertainty in electron density: this high disorder can be attributed to the rather long side chains with displaced electrons at these positions, such as histidine and asparagine at positions 28 and 30, respectively. CD spectra for C1S1 and the parent scaffold are rather similar, suggesting that they share structural characteristics and likely have the same general shape. The melting curve for C1S1 built from the CD signal at 208 nm (α helix) show that C1S1 is slightly destabilized compared to the parent protein but still retains structure at 37. Both computational and experimental data suggest the native structure that C1S1 adopts is thermostable, has low disorder at epitope sites, and assumes the overall shape of its parent scaffold. Designing a thermostable immunogen that retains shape at elevated temperatures is important in ensuring that the epitope of interest is being introduced to the immune system. If the protein readily unfolds at 37 C then any sort of immune response elicited by introducing the defective immunogen will not be able to recognize any version of gp120. C1S1 Contains Disordered Regions That May Inhibit Crystallization Although some crystal hits were identified during crystal screening, no harvestable crystals could be collected for x-ray diffraction. A possible explanation for the poor and unreliable crystal formation can be found in b factor and RMSF data from molecular dynamic simulations, which suggest that the protein contains some regions with large fluctuations. These unstructured regions on the surface of globular proteins can impede the protein s ability to crystallize due to a higher entropic barrier requirement in immobilizing these disordered side chains for the sake of packing with other proteins. According to Price et al, there are certain residues which exhibit high side chain entropy that strongly anti-correlate with successful crystallization should they reside on the surface of the protein, namely glutamine, tyrosine, asparagine, glutamic acid, and lysine. C1S1 contains many of these residues on its surface which also tend to have the largest B factors in the protein, particularly lysine. 13 It may be possible to edit some of these high disorder residues to one with low entropy side chains for crystallization, although this may completely change the structure of the protein in unpredictable ways. C1S1 May Retain Properties Associated hnrnps Such as Forming Hydrogels Interestingly, upon partially cleaving a concentrated sample (approx. 2L worth) of C1S1 with TEV, protein aggregates started to form while dialyzing for an extended period of time at room temperature. This behavior is an abnormality for C1S1, as uncut samples of C1S1 have been concentrated to greater extents and dialyzed for just as long without showing any signs of aggregation; thus, this may be a behavior exclusive to C1S1 where a 9

significant portion ( 50%) has the His-Tag cleaved. Concentrating the sample even further and centrifuging at 10,000 g resulted in a hydrogel phase at the bottom of the tube. This phase has not been extensively studied for C1S1, however it is important to note that C1S1 s parent scaffold is a type of heterogeneous nuclear RNA binding (hnrnp) protein. According to Hughes, these proteins among others contain low-complexity, aromatic rich kinked segments (LARKS) which tend to stack to form fibril structures. LARKS have three shared characteristics among them: high solubility due to the presence of asparagine, glutamine, and serine; backbone flexibility afforded by glycine; and interaction motifs, particularly due to aromatic residues. 14 C1S1 has several such motifs, namely residues 19-24, YFSKFG, and 42-49, GEGVVTFK the former of which resides on an α helix and the latter on a β strand according to the design model. These motifs present on C1S1 may explain the described behavior if cutting the His-Tag off the protein allowed for interactions between these LARKS to take place, stack into fibrils, and form a hydrogel. The conditions must first be repeated and examined with, for example, electron microscopy in order to confirm that such is actually taking place. Nevertheless, while attaining the required concentration is unlikely to happen if C1S1 were to be used as an immunogen, it would be interesting to see if these properties result in C1S1 nucleating a phase separation event with other proteins. Acknowledgements I would like to thank Nikolay Dokholyan for allowing me into his lab and for giving me the opportunity to explore biochemical and biophysical research. I would also like to thank Cheng Zhu for being a valued mentor, role model, and friend both inside the lab and out. Finally, I would like to thank the Chancellor s Science Scholars for their support outside of the lab and throughout my years at UNC. References 1. Wyatt, R.,Sodroski, J. The HIV-1 Envelope Glycoproteins: Fusogens, Antigens, and Immunogens. Science. 1998. 280(5371):1884-8. 2. Cao, L., et al, Paulson, J.C. Global site-specific N-glycosylation analysis of HIV envelope glycoprotein. Nature Communications. 2017. 10.1038 3. Wyatt R., Kwong P.D., Desjardins E., Sweet R.W., Robinson J., Hendrickson W.A., Sodroski J.G. The antigenic structure of the HIV gp120 envelope glycoprotein. Nature. 1998. 393(6686):705-11. 4. Wei, X. et al, Shaw G.M. Antibody neutralization and escape by HIV-1. Nature. 2003. 10.1038 5. Soldemo, M., Karlsson Hedestam G.B. Env-Specific Antibodies in Chronic Infection. Front. Immunol. 2017. 0.3389/fimmu.2017.01057 6. Seaman, M.S., et al, Mascola, J.R. Tiered Categorization of a Diverse Panel of HIV-1 Env Pseudoviruses for Assessment of Neutralizing Antibodies. Journal of Virology. 2010. 10.1128. 1439-1452. 7. Cohen, Y.Z., Dolin, R. Novel HIV vaccine strategies: overview and perspective. Ther. Adv. Vaccines. 2013. 10.1177/2051013613494535. 99-112. 8. Kwong, P.D., Mascola, P.D. Human Antibodies that Neutralize HIV-1: Identification, Structures, and B Cell Ontogenies. Immunity. 2012. 0.1016/j.immuni.2012.08.012. 9. Haynes, B.F., Kelsoe G., Harrison S.C., Kepler T.B. B-cell lineage immunogen design in vaccine development with HIV-1 as a case study. Nat. Biotechnol.. 2012. 10.1038/nbt.2197. 10

10. McCoy, L.E., et al, Burton, D.R. Holes in the Glycan Shield of the Native HIV Envelope Are a Target of Trimer-Elicited Neutralizing Antibodies. Cell Rep. 2016. 10.1016/j.celrep.2016.07.074. 11. Ping, L.H., et al, Swanstrom, R. Comparison of Viral Env Proteins from Acute and Chronic Infections with Subtype C Human Immunodeficiency Virus Type 1 Identifies Differences in Glycosylation and CCR5 Utilization and Suggests a New Strategy for Immunogen Design. Journal of Virology. 2013. 10.1128/JVI.03577-12 12. Nagata, T. et al, Katashira, M. Structure and interactions with RNA of the N-terminal UUAG-specific RNAbinding domain of hnrnp D0. J. Mol. Biol. 1999. 10.2210/pdb1HD0/pdb 13. Price, W.N. 2nd. et al, hunt, J.F. Understanding the physical properties that control protein crystallization by analysis of large-scale experimental data. Nat. Biotechnol. 2009. 10.1038/nbt.1514. 14. Hughes, M.P. et al, Eisenberg, D. Atomic structures of low-complexity protein segments reveal kinked β sheets that assemble networks. Science. 2018. 10.1126/science.aan6398 11