METHODS Study subjects. Genotyping quality control. Statistical analysis. SNP selection

Similar documents
Genome-wide Association Analysis Applied to Asthma-Susceptibility Gene. McCaw, Z., Wu, W., Hsiao, S., McKhann, A., Tracy, S.

Genetic variants on 17q21 are associated with asthma in a Han Chinese population

During the hyperinsulinemic-euglycemic clamp [1], a priming dose of human insulin (Novolin,

Mutation Detection and CNV Analysis for Illumina Sequencing data from HaloPlex Target Enrichment Panels using NextGENe Software for Clinical Research

Title: PGx250: An Exploratory Pharmacogenetic Analysis of Asthma Exacerbations in African-Americans treated

Supplementary Methods

Supplementary Figure 1. Principal components analysis of European ancestry in the African American, Native Hawaiian and Latino populations.

Genomewide Linkage of Forced Mid-Expiratory Flow in Chronic Obstructive Pulmonary Disease

Association between interleukin-17a polymorphism and coronary artery disease susceptibility in the Chinese Han population

A total of 2,822 Mexican dyslipidemic cases and controls were recruited at INCMNSZ in

BST227 Introduction to Statistical Genetics. Lecture 4: Introduction to linkage and association analysis

Chapter 4 INSIG2 Polymorphism and BMI in Indian Population

Introduction to the Genetics of Complex Disease

Supplementary information for: A functional variation in BRAP confers risk of myocardial infarction in Asian populations

Mapping susceptibility genes for asthma and allergy

Association between the -77T>C polymorphism in the DNA repair gene XRCC1 and lung cancer risk

Comprehensive Evaluation of the Association of APOE Genetic Variation with Plasma Lipoprotein Traits in U.S. Whites and African Blacks

Genetic variations in the Wnt signaling pathway affect lung function in asthma patients

AMERICAN INDIAN CHILDREN WITH ASTHMA: IMMUNOLOGIC AND GENETIC FACTORS 6/10/15

T-bet polymorphisms are associated with asthma and airway hyperresponsiveness

Supplementary Figure 1: Attenuation of association signals after conditioning for the lead SNP. a) attenuation of association signal at the 9p22.

Supplementary Figure 1. (a) A Q-Q plot of all the 310,520 uncorrected (red crosses) and corrected (blue star) chi-square statistics from the

Supplementary Figures

Genetics and Pharmacogenetics in Human Complex Disorders (Example of Bipolar Disorder)

Effect of Variation in CHI3L1 on Serum YKL-40 Level, Risk of Asthma, and Lung Function

Deliverable 2.1 List of relevant genetic variants for pre-emptive PGx testing

The genetic architecture of type 2 diabetes appears

Whole-genome detection of disease-associated deletions or excess homozygosity in a case control study of rheumatoid arthritis

NIH Public Access Author Manuscript N Engl J Med. Author manuscript; available in PMC 2009 January 21.

Drug Metabolism Disposition

Original Article. C18orf1 located on chromosome 18p11.2 may confer susceptibility to schizophrenia

CS2220 Introduction to Computational Biology

SUPPLEMENTARY DATA. 1. Characteristics of individual studies

QTL detection for traits of interest for the dairy goat industry

LTA Analysis of HapMap Genotype Data

DOES THE BRCAX GENE EXIST? FUTURE OUTLOOK

Variant Classification. Author: Mike Thiesen, Golden Helix, Inc.

Title:The effect of CD14 and TLR4 gene polimorphisms on asthma phenotypes in adult Turkish asthma patients: a genetic study

Statistical Tests for X Chromosome Association Study. with Simulations. Jian Wang July 10, 2012

Comprehensive Testing of Positionally Cloned Asthma Genes in Two Populations

New Enhancements: GWAS Workflows with SVS

Single Nucleotide Polymorphisms in the NOS Genes as Predictors of the Levels of Exhaled Nitric Oxide

Lack of Association between Endoplasmic Reticulum Stress Response Genes and Suicidal Victims

Mendelian & Complex Traits. Quantitative Imaging Genomics. Genetics Terminology 2. Genetics Terminology 1. Human Genome. Genetics Terminology 3

University of Groningen. Metabolic risk in people with psychotic disorders Bruins, Jojanneke

Global variation in copy number in the human genome

Abstract. Optimization strategy of Copy Number Variant calling using Multiplicom solutions APPLICATION NOTE. Introduction

Supplementary note: Comparison of deletion variants identified in this study and four earlier studies

Identification of regions with common copy-number variations using SNP array

Advance Your Genomic Research Using Targeted Resequencing with SeqCap EZ Library

Histocompatibility Evaluations for HSCT at JHMI. M. Sue Leffell, PhD. Professor of Medicine Laboratory Director

CONTRACTING ORGANIZATION: University of California, Irvine Irvine CA

Step up if needed (first, check adherence, environmental control and comorbid conditions) Patients ASSESS CONTROL. Step down if possible

Genetics and Genomics in Medicine Chapter 8 Questions

Epigenetics. Jenny van Dongen Vrije Universiteit (VU) Amsterdam Boulder, Friday march 10, 2017

Supplementary Figure 1 Forest plots of genetic variants in GDM with all included studies. (A) IGF2BP2

Genetic Testing and Analysis. (858) MRN: Specimen: Saliva Received: 07/26/2016 GENETIC ANALYSIS REPORT

Assessing Accuracy of Genotype Imputation in American Indians

Identifying Mutations Responsible for Rare Disorders Using New Technologies

November 9, Johns Hopkins School of Medicine, Baltimore, MD,

Variants in the Plasmacytoma Variant Translocation Gene (PVT1) Are Associated With End-Stage Renal Disease Attributed to Type 1 Diabetes

IL10 rs polymorphism is associated with liver cirrhosis and chronic hepatitis B

Lack of association of IL-2RA and IL-2RB polymorphisms with rheumatoid arthritis in a Han Chinese population

Study on G-protein-coupled receptor polymorphisms reveals close relationship with asthma in a population of China.

ANALYSIS OF IL17 AND IL17RA POLYMORPHISMS IN SPANISH PSORIASIS PATIENTS: ASSOCIATION WITH RISK FOR DISEASE.

Polymorphisms in the vascular endothelial growth factor (VEGF) gene associated with asthma

The PTGDR gene is not associated with asthma in 3 ethnically diverse populations

PREPARED FOR: U.S. Army Medical Research and Materiel Command Fort Detrick, Maryland

Rare Variant Burden Tests. Biostatistics 666

Supplementary information

Results. Introduction

Supplementary Figures

Quality Control Analysis of Add Health GWAS Data

Received February 24, 2011; accepted March 23, 2011; Epub April 1, 2011; published May 15, 2011

Multiplex target enrichment using DNA indexing for ultra-high throughput variant detection

Association mapping (qualitative) Association scan, quantitative. Office hours Wednesday 3-4pm 304A Stanley Hall. Association scan, qualitative

Supplementary webappendix

Additional Disclosure

Association of XRCC5 polymorphisms with COPD and COPD-related phenotypes in the Han Chinese population: a case-control cohort study

Performing. linkage analysis using MERLIN

White Paper Guidelines on Vetting Genetic Associations

Association Between F9 Malmö, Factor IX And Deep Vein Thrombosis

White Paper Estimating Complex Phenotype Prevalence Using Predictive Models

Supporting Online Material for

Imaging Genetics: Heritability, Linkage & Association

Genomic structural variation

UNIVERSITY OF CALIFORNIA, LOS ANGELES

Outline FEF Reduced FEF25-75 in asthma. What does it mean and what are the clinical implications?

ARTICLE Fine Mapping on Chromosome 10q22-q23 Implicates Neuregulin 3 in Schizophrenia

Dan Koller, Ph.D. Medical and Molecular Genetics

Compound heterozygosity Yurii S. Aulchenko yurii [dot] aulchenko [at] gmail [dot] com. Thursday, April 11, 13

Handling Immunogenetic Data Managing and Validating HLA Data

Genome-wide association studies for human narcolepsy and other complex diseases

Laboratory of Chronic Kidney Disease Prevention and Treatment (Peking University), Ministry of Education; Beijing, , People's Republic of China

Supplementary Online Content

Supplementary Online Content

Genome-wide association study identifies variants in TMPRSS6 associated with hemoglobin levels.

Statistical power and significance testing in large-scale genetic studies

Integrated detection and population-genetic analysis of SNPs and copy number variation

Association between atopic dermatitis-related single nucleotide polymorphisms rs and psoriasis vulgaris in a southern Chinese cohort

Transcription:

METHODS Study subjects Among the African Caribbean families recruited in Barbados, asthma was defined as both a reported history of asthma and a history of physician-diagnosed asthma (past/current), plus a history of wheezing without an upper respiratory infection (URI) for 2 out of 4 hallmark symptoms (wheezing with a URI, cough without a URI, shortness of breath, and tightness in the chest). E1,E2 Among the African American families recruited through the Collaborative Study on the Genetics of Asthma (CSGA), E3,E4 asthma was defined by using criteria including at least 2 symptoms consistent with asthma, <3 pack-years of cigarette exposure, and a physician s diagnosis of asthma, in addition to either (1) bronchial hyperresponsiveness defined by a fall in baseline FEV 1 20% at 25 mg/ml methacholine or (2) in subjects whose FEV 1 was reduced, reversibility defined as a 15% increase in baseline FEV 1 after an inhaled bronchodilator (albuterol). Spirometry was performed according to American Thoracic Society guidelines E5 by using a KOKO (Pulmonary Data Services, Inc, Louisville, Colo) pneumotach connected to a laptop computer. African American patients and controls participating in the ALI and sepsis study were defined in accordance with the American College of Chest Physicians E6 and Society of Critical Care Medicine Consensus statements, E7 and subjects were recruited after local Institutional Review Board approval at Johns Hopkins University and the University of Maryland in Baltimore, Md, Emory University in Atlanta, Ga, and the Medical College of Wisconsin in Milwaukee, Wis. SNP selection In the initial study, we assessed the full length of the gene encoding MYLK (GenBank Accession number: U48959; 217.6 kb, 32 exons) by direct sequencing of PCR amplicons in all exons, exonintron boundaries (including 100 bases of intronic sequence on either side), 59 and 39-untranslated regions (UTRs) and 2 kb upstream of 59 UTR, and identified 51 SNPs (10 exonic, 31 intronic, 9 59 UTR, and 1 SNP located in the noncoding region of exon 1), 4 of which were novel nonsynonymous coding SNPs (csnps) using individual DNA samples from 36 subjects equally divided into European Americans and African Americans as previously described. E1 Subsequently, a panel of 25 MYLK SNPs, including 4 novel SNPs previously identified by sequencing, was selected to genotype in an African American population selected for either severe sepsis or sepsis-associated ALI on the basis of (1) sufficient MAF (10%) in a population of African descent, (2) inter-snp distance (>0.5 kb), and (3) ready availability using the ABI TaqMan SNP Genotyping Assay platform. In the current study, we used both Clayton s htsnp E8 (available at: http://www-gene.cimr.cam.ac.uk/clayton/software/stata/htsnp/) and Stram s TagSNPs E9,E10 (available at: http://www-rcf.usc.edu/ ;stram/tagsnps.html) methods for the selection of haplotype tagging SNPs with genotypes for the 25 SNPs from 61 African American healthy controls participating in the ALI study described. E11 We selected these methods because, although both algorithms essentially rely on LD, they have slightly different approaches of acquiring this information. Briefly, htsnp relies on 2 measurements (proportion of diversity explained and r 2 ), which represent the ability of any SNP subset to maintain the overall SNP diversity of the entire region in question. TagSNP was written to circumvent problems that arise when there are multiple rare haplotypes in the haplotype estimation step, by calculating a haplotype dosage for the full set of SNPs, and then essentially performing a regression analysis on the selected subsets of SNPs and the full set, to maximize the squared correlation between the true haplotype dosage and the predicted dosage in the haplotype tagging set of SNPs. Because haplotype tagging methods are still under much debate and development, and in part because of our recent experience in selecting tagging SNPs for populations of African descent, E12 we chose both of these methods in combination, with the aim of obtaining a more convincing view of which SNPs we may want to discard. Only SNPs retained by both programs using an R 2 cutoff of 0.8 were retained as tagging SNPs, resulting in the removal of the 6 most redundant markers from the previous panel, for a final panel of 19 MYLK SNPs (Table E2). Genotyping quality control Approximately 8% of samples were repeated for quality control with an average error rate of 1.3%. Mendelian inconsistencies identified using Sib-Pair (1.00a17) with 3.5% discordant genotype pairs in the Barbados family dataset. Genotypes were made missing as appropriate for members of nuclear families involved, which rendered 177 genotypes (1.2 %) missing for further analyses. Statistical analysis Association testing using FBAT. For haplotype analyses in the 2 family-based asthma populations, an expectation-maximization algorithm, which maximizes the likelihood of phased haplotype frequencies based on all observed genotypes in the nuclear family, was used. Adjusting of confounding caused by multiple comparisons. Empiric P values were calculated through permutation analysis by using Monte Carlo methods under the null hypothesis of complete independence between haplotypes and the observed phenotypes. FBAT s sampling procedure stops when at least 100 Monte Carlo based test statistic values are generated that are less than the observed value or after 100,000 replicate samples are generated, whichever comes first. Haplotype association testing in the severe sepsis and ALI population For re-evaluation of the 4-SNP haplotype in the case-control designed severe sepsis/ali population, haplotype frequency was estimated via a maximum likelihood method using the expectationmaximization algorithm under the assumption of Hardy-Weinberg equilibrium implemented in SNPem. E11,E13 The omnibus likelihood ratio test P value from SNPem was used as the global P value. RNA preparation and gene expression in human PBMCs Total RNA was extracted from human PBMCs of 28 unrelated subjects from Barbados (15 patients with asthma and 13 controls) by using the Trizol Reagent (catalog no. 15596-026; Invitrogen, Carlsbad, Calif). Subjects were also selected on the basis of their rs936170 genotypes (GG, GC, CC); the CC genotype was rare (6.9% in the Barbados population), and only 2 subjects with asthma with the CC genotype were available for the expression study. The result was a group divided with 6 or 7 in each subgroup (6 GG asthmatic, 7 GG nonasthmatic, 7 GC and 2 CC asthmatic, 6 GC nonasthmatic). Additional purification was performed on RNAeasy columns (catalog no. 74104; Qiagen, Valencia, Calif). The quality of total RNA samples was assessed by using an Agilent 2100 Bioanalyzer (Agilent Technologies, Palo Alto, Calif). RNA samples were labeled according to Illumina-recommended protocols. In brief, 0.5 mg total RNA from each sample was labeled by using the Illumina TotalPrep RNA Amplification Kit (catalog no. IL1791; Ambion, Austin, Tex) in a process of cdna synthesis and in vitro transcription similar to that of other chip manufacturers. Single-stranded RNA (crna) was generated and labeled by incorporating biotin-16-uridine-59-triphosphate (catalog no. 11388908910; Roche Diagnostics GmbH, Mannheim, Germany). Biotin-labeled crna 0.85 ug was hybridized (16 hours) to Illumina s Sentrix

HumanRef-8 Expression BeadChips (catalog no. 11201828; Illumina, San Diego, Calif). The hybridized biotinylated crna was detected with streptavidin-cy3 and quantitated by using Illumina s BeadStation 500GX Genetic Analysis Systems scanner. Array data analysis Preliminary analysis of the scanned data was performed by using Illumina BeadStudio software. The basic Illumina data are returned from the scanner in the form of an.idat file that contains single intensity data values/gene after the computation of a trimmed mean average for each probe type represented by a variable number of bead probes/gene on the array. The BeadStudio software returns information on the number and SD of all the bead measurements per probe/ gene as well as a detection call based on a comparison between the measured intensity calculated for a single probe/gene and the intensities measured for a large number of negative control beads built-in to the BeadChip arrays (D 5 % above negative/100, 1 5 perfect, ie, the intensity value of a gene is greater than all the intensities for every negative control tested). The background filtering (any gene below D 5 0.95 for 75% of samples was eliminated from further analysis) was performed. Then the data were log-transformed and normalized to a chip-wise mean by using the CLUSTER tool (available at: http:// rana.stanford.edu/clustering). E14 Significance testing was performed by comparing expression levels among patients with asthma and healthy controls stratified by SNP rs936170 G>C genotypes (GG or GC/CC) by using the unpaired t test. Statistical significance (P values) was defined at the 5% level for all analyses. RESULTS Study populations Among the 848 African Caribbean family members in Barbados and 368 African American family members, distributions of tige concentrations were similar to those described previously. E1,E15 Families from both populations were originally selected through an asthmatic proband. Total serum IgE was significantly higher (P <.0001-.03) among patients with asthma compared with subjects without asthma in both populations (Table E1). Both populations were highly atopic, especially the Caribbean families (mean total serum IgE was 622 ng/ml and 378 ng/ml for the Caribbean and American populations, respectively). Association among SNPs Frequencies of the 19 MYLK SNPs in both populations are summarized in Table E2. All SNPs were in Hardy- Weinberg equilibrium among both sets of founders. Moderate to high LD (according to the definition of Gabrieletal E16 ) was observed for 8 out of 18 pairings of contiguous SNPs (D9 ranging from 0.88 to 1.0) in the African Caribbean population. Because of an average inter-snp distance of 11 kb, the resulting blocks were small (2 blocks observed in the 39 smmylk isoform, composed of markers rs6800971 and rs820463 in block 1, and rs3845915 and ss65824112 in block 2, with D9 50.96 for both blocks), and overall patterns across the region revealed low LD (Fig E1). A similar pattern was observed in the African American founders except that there was relatively higher LD among markers in the border of the nmmylk and smmylk isoforms of the gene, resulting in a 7kb block composed of markers rs820336 and rs11717184. Genetic association analyses in the African Caribbean population All individual SNP analyses between the 19 markers across the 3 MYLK regions and asthma and log[tige] were first performed under the robust genotype model (data not shown) with results giving the strongest signals suggesting the additive model. Analyses performed under dominant and recessive models (data not shown) supported the additive model results and did not identify any additional associations. In addition to the negative association among the Caribbean families between both asthma and log[tige] (P 5.009 and P 5.05, respectively) and a promoter SNP (rs936170; Fig 2, A and B) of the smmylk isoform and the 3-SNP and 4-SNP haplotype associations in this region, a negative association for log[tige] (P 5.03) was also observed for a nonsynonymous SNP that creates a proline to histidine amino acid change in the second exon of the nmmylk isoform (rs28497577, referred to as MYLK_007 in Gao et al E11 ; Table E3); this SNP had previously been shown to be part of a 3-SNP haplotype that was significantly associated with risk of ALI, but only among African American patients. E11 Additional associations (single-marker and haplotype) observed for markers in the nmmylk isoform included significant evidence for linkage and association in the 59 region of MYLK (nmmylk) between the 2 phenotypes (asthma and log[tige]) and 2-SNP, 3-SNP, and/or 4-SNP haplotypes (Table E4). The strongest associations were for a 4-SNP haplotype (rs11707609-rs9840993; GTAC) in the 59 region (region 1, Fig 1) for asthma and log[tige] (P 5.001 and.001, respectively), and a 2-SNP haplotype beginning with the second SNP (rs28497577-rs11707609; CG) for log[tige] (P 5.013). In the third region, near the 39 end of MYLK (smmylk), a 4-SNP haplotype (rs6800971-ss65824112; CTCC) provided the strongest signal for asthma and log[tige] (P 5.007 and P 5.01, respectively). An intronic SNP (rs3845915) at the 39 end of MYLK contributed significantly in determining the direction of this association, and similar to the other associations, this SNP had previously been implicated in a haplotype conferring risk of sepsis. E11 DISCUSSION Another limitation in our study design is the ability to directly test for association between causal variants because we did not use a large panel of SNPs that could fully represent variability in the MYLK gene. Therefore, we cannot be certain the causal variant is included in our panel of markers. Because the primary objective of this study was to compare associations previously observed for a sepsis/ali study to associations between the same set of SNPs and asthma, the SNPs used here was selected from our previous resequencing project of the 32 MYLK exons, exon-intron boundaries, and 59 and 39-UTRs of MYLK. At the time of this publication, there are 388 SNPs

reported for the MYLK gene in the West African Yoruban International HapMap Project population, which most closely matches our 2 populations of African descent, of which 53 SNPs are tagging SNPs (R 2 > 0.80; http:// www.hapmap.org/; International HapMap Consortium, HapMap data Rel 21a/phaseII Jan07, on NCBI B35 assembly, dbsnp b125). There are 4 coding, nonsynonymous MYLK SNPs, and 7 known 59 UTR and three 39 UTR SNPs. On the basis of our previous sequencing in African Americans, E11 we identified 10 SNPs in 59 UTR and 8 SNPs in coding region of MYLK with 4 of them conferring an amino acid change, and 3 out of the 4 nonsynonymous SNPs were included in these studies. The sliding window haplotype analysis presented here does provide a more comprehensive search for evidence of association than could be attained by analysis of individual SNPs, but we acknowledge the limitations of our approach. Intronic SNPs gave our strongest statistical evidence of linkage in the presence of disequilibrium, but our SNPs are limited to the first 100 base pairs on either side of each exon. Further effort is clearly warranted to identify the likely causal variants associated with atopic asthma. REFERENCES E1. Barnes KC, Neely JD, Duffy DL, Freidhoff LR, Breazeale DR, Schou C, et al. Linkage of asthma and total serum IgE concentration to markers on chromosome 12q: evidence from Afro-Caribbean and Caucasian populations. Genomics 1996;37:41-50. E2. Barnes K, Hizawa N, Neely J, Duffy D, Freidhoff L, Ehrlich E, et al. Linkage analysis of atopy to chromosome 12q markers in an Afro- Caribbean population [abstract]. J Allergy Clin Immunol 1997;99:S476. E3. CSGA. The Collaborative Study on the Genetics of Asthma: a genomewide search for asthma susceptibility loci in ethnically diverse populations. Nat Genet 1997;15:389-92. E4. Xu J, Meyers DA, Ober C, Blumenthal MN, Mellen B, Barnes KC, et al. Genomewide screen and identification of gene-gene interactions for asthma-susceptibility loci in three U.S. populations: collaborative study on the genetics of asthma. Am J Hum Genet 2001;68:1437-46. E5. Standardization of spirometry, 1994 update. American Thoracic Society. Am J Respir Crit Care Med 1995;152:1107-36. E6. Bernard GR, Artigas A, Brigham KL, Carlet J, Falke K, Hudson L, et al. Report of the American-European Consensus conference on acute respiratory distress syndrome: definitions, mechanisms, relevant outcomes, and clinical trial coordination. Consensus Committee. J Crit Care 1994;9:72-81. E7. American College of Chest Physicians/Society of Critical Care Medicine Consensus Conference: definitions for sepsis and organ failure and guidelines for the use of innovative therapies in sepsis. Crit Care Med 1992;20:864-74. E8. Clayton D. Choosing a set of haplotype tagging SNPs from a larger set of diallelic loci. Available at: http://www-gene.cimr.cam.ac.uk/clayton/ software/stata/htsnp/htsnp.pdf. Accessed April 9, 2007. E9. Stram DO, Haiman CA, Hirschhorn JN, Altshuler D, Kolonel LN, Henderson BE, et al. Choosing haplotype-tagging SNPS based on unphased genotype data using a preliminary sample of unrelated subjects with an example from the Multiethnic Cohort Study. Hum Hered 2003;55: 27-36. E10. Stram DO. Tag SNP selection for association studies. Genet Epidemiol 2004;27:365-74. E11. Gao L, Grant A, Halder I, Brower R, Sevransky J, Maloney JP, et al. Novel polymorphisms in the myosin light chain kinase gene confer risk for acute lung injury. Am J Respir Cell Mol Biol 2006;34: 487-95. E12. Chi PB, Duggal P, Kao WH, Mathias RA, Grant AV, Stockton ML, et al. Comparison of SNP tagging methods using empirical data: association study of 713 SNPs on chromosome 12q14.3-12q24.21 for asthma and total serum IgE in an African Caribbean population. Genet Epidemiol 2006;30:609-19. E13. Fallin D, Cohen A, Essioux L, Chumakov I, Blumenfeld M, Cohen D, et al. Genetic analysis of case/control data using estimated haplotype frequencies: application to APOE locus variation and Alzheimer s disease. Genome Res 2001;11:143-51. E14. Eisen MB, Spellman PT, Brown PO, Botstein D. Cluster analysis and display of genome-wide expression patterns. Proc Natl Acad Sci U S A 1998;95:14863-8. E15. Zambelli-Weiner A, Ehrlich A, Stockton ML, Grant AV, Zhang S, Levett PN, et al. Evaluation of the CD14/-260 polymorphism and house dust endotoxin exposure in the Barbados asthma genetics study. J Allergy Clin Immunol 2005;115:1203-9. E16. Gabriel SB, Schaffner SF, Nguyen H, Moore JM, Roy J, Blumenstiel B, et al. The structure of haplotype blocks in the human genome. Science 2002;296:2225-9.

TABLE E1. Demographic characteristics of the 2 asthma populations* African Caribbean African American Characteristics Founders Total Founders Total No. of subjects 342 848 165 368 Percent male sex (%) 49.7% 49.1% 44.2% 48.5% Age (y) 45.8 6 10.5 28.8 6 16.6 42.7 6 10.6 28.0 6 16.5 Percent asthmatic (%) 10.4% 41.6% 37.7% 70.4% Geometric mean IgE concentration 563.6 621.8 354.0 378.1 (95% CI) (473.8-670.4) (561.2-689.0) (276.3, 453.5) (318.9, 448.2) P value (patients with asthma vs.001 <.0001.888.030 subjects without asthma) *Plus-minus values are means 6 SDs. Serum total IgE concentrations were expressed as geometric mean in ng/ml, adjusted for age and sex, centered around the predicted mean for 20-year-old men.

TABLE E2. Chromosome 3 location, MAF, and type of MYLK SNPs genotyped in 342 African Caribbean founders from Barbados and 165 African American founders from Baltimore Washington, DC* SNP Chromosome position (build 125) Inter-SNP distance (bp) Type Nucleotide/amino acid change Minor allele frequency (N) Caribbean founders American founders rs36025624 124862305 0 Intron G/C 0.109 (288) 0.084 (154) rs28497577 124833528 28,777 C.62 C/A, Pro>His 0.404 (282) 0.366 (146) rs11707609 124824325 9203 Intron G/A 0.085 (271) 0.096 (151) rs1869863 124818557 5768 Intron G/T 0.236 (288) 0.215 (156) rs34261801 124800686 17,871 Intron A/G 0.077 (287) 0.097 (150) rs9840993 124778794 21,892 C.439 T/C, Pro>Ser 0.329 (207) 0.377 (159) rs3796164 124773962 4832 C.782 T/C, Val>Ala 0.437 (287) 0.43 (158) rs820332 124766385 7577 Intron C/A 0.171 (249) 0.167 (150) rs820349 124760229 6156 Intron C/T 0.475 (281) 0.484 (154) rs936170 124743718 16,511 Promoter, 23269à G/C 0.282 (284) 0.283 (152) rs820336 124736682 7036 Intron G/A 0.138 (289) 0.173 (153) ss65824100 124732006 4676 IVS171477 C/T 0.241 (292) 0.272 (156) rs11717814 124729452 2554 Intron G/C 0.145 (289) 0.199 (156) rs820364 124720506 8946 Intron T/C 0.290 (284) 0.389 (153) rs820325 124706578 13,928 Intron G/A 0.172 (267) 0.275 (142) rs6800971 124693470 13,108 Intron C/T 0.216 (287) 0.189 (151) rs820463 124677938 15,532 C.4842 T/C 0.474 (269) 0.428 (153) rs3845915 124660735 17,203 Intron C/T 0.376 (278) 0.434 (158) ss65824112 124658305 2430 IVS31172 C/G 0.190 (281) 0.274 (155) *SNPs were named relative to the representative mrna (Genbank accession U48959) of full-length MYLK (32 exons) in reference genomic sequence NT_005543. Since submission of the original report, E11 information on MYLK SNPs identified by sequencing was uploaded into National Center for Biotechnology Information dbsnp (http://www.ncbi.nlm.nih.gov/snp). Marker rs36025624 was previously referred to as hcv1602689, rs28497577 was MYLK_007, rs3426180 was hcv1602709, rs9840993 was MYLK_002, rs3796164 was MYLK_003, ss65824100 was MYLK_025, and ss65824112 was MYLK_037. àcounted from the transcriptional initiation site of the smmylk form of the gene.

TABLE E3. FBAT in 2 independent population of African descent (Barbados and CSGA)* Marker Barbados CSGA Asthma log[tige] Asthma log[tige] rs36025624 G 1.460 (0.144) 1.201 (0.230) 1.313 (0.189044) 1.295 (0.195304) 0.895 (45) 0.895 (61) 0.914 (22) 0.914 (22) C 21.460 (0.144) 21.201 (0.230) 21.313 (0.189044) 21.295 (0.195304) 0.105 (45) 0.105 (61) 0.086 (22) 0.086 (22) rs28497577 C 1.670 (0.095) 2.086 (0.037) 21.216 (0.223811) 21.126 (0.260157) 0.613 (86) 0.613 (115) 0.653 (40) 0.653 (45) A 21.670 (0.095) 22.086 (0.037) 1.216 (0.223811) 1.126 (0.260157) 0.387 (86) 0.387 (115) 0.347 (40) 0.347 (45) rs11707609 G 0.630 (0.529) 1.297 (0.195) 0.812 (0.416976) 0.517 (0.605112) 0.902 (30) 0.902 (39) 0.898 (19) 0.898 (23) A 20.630 (0.529) 21.297 (0.195) 20.812 (0.416976) 20.517 (0.605112) 0.098 (30) 0.098 (39) 0.102 (19) 0.102 (23) rs1869863 G 20.268 (0.789) 20.199 (0.842) 20.039 (0.968596) 0.403 (0.687246) 0.768 (72) 0.768 (104) 0.777 (44) 0.777 (48) T 0.268 (0.789) 0.199 (0.842) 0.039 (0.968596) 20.403 (0.687246) 0.232 (72) 0.232 (104) 0.223 (44) 0.223 (48) rs34261801 A 0.945 (0.344) 0.621 (0.534) 0.920 (0.357455) 0.537 (0.591090) 0.908 (32) 0.908 (38) 0.892 (14) 0.892 (18) G 20.945 (0.344) 20.621 (0.534) 20.920 (0.357455) 20.537 (0.591090) 0.092 (32) 0.092 (38) 0.108 (14) 0.108 (18) rs9840993 T 0.291 (0.771) 1.035 (0.301) 20.538 (0.590857) 20.072 (0.942399) 0.677 (65) 0.677 (79) 0.620 (43) 0.620 (48) C 20.291 (0.771) 21.035 (0.301) 0.538 (0.590857) 0.072 (0.942399) 0.323 (65) 0.323 (79) 0.380 (43) 0.380 (48) rs3796164 T 0.128 (0.899) 0.066 (0.947) 0.530 (0.595799) 0.617 (0.536932) 0.560 (94) 0.560 (128) 0.567 (48) 0.567 (56) C 20.128 (0.899) 20.066 (0.947) 20.530 (0.595799) 20.617 (0.536932) 0.440 (94) 0.440 (128) 0.433 (48) 0.433 (56) rs820332 C 0.243 (0.808) 0.572 (0.567) 20.327 (0.743421) 20.185 (0.853474) 0.876 (47) 0.876 (72) 0.840 (29) 0.840 (31) A 20.243 (0.808) 20.572 (0.567) 0.327 (0.743421) 0.185 (0.853474) 0.124 (47) 0.124 (72) 0.160 (29) 0.160 (31) rs820349 C 0.216 (0.829) 20.665 (0.506) 0.000 (1.000) 0.727 (0.467242) 0.531 (97) 0.531 (131) 0.509 (44) 0.509 (53) T 20.216 (0.829) 20.665 (0.506) 0.000 (1.000) 20.727 (0.467242) 0.469 (97) 0.469 (131) 0.491 (44) 0.491 (53) rs936170 G 2.587 (0.0097) 1.940 (0.052) 20.453 (0.650613) 20.437 (0.661944) 0.711 (83) 0.711 (119) 0.729 (45) 0.729 (54) C 22.587 (0.0097) 21.940 (0.052) 0.453 (0.650613) 0.437 (0.661944) 0.289 (83) 0.289 (119) 0.271 (45) 0.271 (54) rs820336 G 20.631 (0.528) 20.691 (0.489) 20.336 (0.736522) 20.690 (0.489994) 0.822 (52) 0.822 (69) 0.810 (22) 0.810 (28) A 0.631 (0.528) 0.691 (0.489) 0.336 (0.736522) 0.690 (0.489994) 0.178 (52) 0.178 (69) 0.190 (22) 0.190 (28) ss65824100 C 20.033 (0.974) 1.072 (0.284) 0.164 (0.869417) 20.104 (0.917072) 0.756 (73) 0.756 (105) 0.722 (46) 0.722 (51) T 0.033 (0.974) 21.072 (0.284) 20.164 (0.869417) 0.104 (0.917072) 0.244 (73) 0.244 (105) 0.278 (46) 0.278 (51)

TABLE E3. Continued Barbados Marker Asthma log[tige] Asthma log[tige] rs11717814 G 0.216 (0.829) 0.583 (0.560) 21.508 (0.131668) 21.437 (0.150591) 0.813 (55) 0.813 (74) 0.805 (28) 0.805 (32) C 20.216 (0.829) 20.583 (0.560) 1.508 (0.131668) 1.437 (0.150591) 0.187 (55) 0.187 (74) 0.195 (28) 0.195 (32) rs820364 T 0.973 (0.331) 0.428 (0.668) 20.443 (0.657508) 20.703 (0.482326) 0.697 (92) 0.697 (123) 0.608 (39) 0.608 (50) C 20.973 (0.331) 20.428 (0.668) 0.443 (0.657508) 0.703 (0.482326) 0.303 (92) 0.303 (123) 0.392 (39) 0.392 (50) rs820325 G 0.363 (0.716) 0.414 (0.679) 20.644 (0.519567) 20.644 (0.519765) 0.807 (61) 0.807 (78) 0.721 (33) 0.721 (40) A 20.363 (0.716) 20.414 (0.679) 0.644 (0.519567) 0.644 (0.519765) 0.193 (61) 0.193 (78) 0.279 (33) 0.279 (40) rs6800971 C 0.245 (0.806) 0.074 (0.941) 20.905 (0.365561) 21.418 (0.156153) 0.767 (56) 0.767 (83) 0.812 (26) 0.812 (33) T 20.245 (0.806) 20.074 (0.941) 0.905 (0.365561) 1.418 (0.156153) 0.233 (56) 0.233 (83) 0.188 (26) 0.188 (33) rs820463 T 20.191 (0.848) 1.060 (0.289) 20.038 (0.969342) 20.548 (0.583465) 0.535 (77) 0.535 (115) 0.588 (42) 0.588 (52) C 0.191 (0.848) 21.060 (0.289) 0.038 (0.969342) 0.548 (0.583465) 0.465 (77) 0.465 (115) 0.412 (42) 0.412 (52) rs3845915 C 0.655 (0.512) 0.517 (0.605) 0.364 (0.716187) 20.488 (0.625666) 0.609 (77) 0.609 (105) 0.559 (52) 0.559 (60) T 20.655 (0.512) 20.517 (0.605) 20.364 (0.716187) 0.488 (0.625666) 0.391 (77) 0.391 (105) 0.441 (52) 0.441 (60) ss65824112 C 0.797 (0.425) 0.753 (0.451) 0.217 (0.827903) 20.107 (0.914715) 0.789 (58) 0.789 (77) 0.727 (34) 0.727 (40) G 20.797 (0.425) 20.753 (0.451) 20.217 (0.827903) 0.107 (0.914715) 0.211 (58) 0.211 (77) 0.273 (34) 0.273 (40) *z Scores (P value) and allele frequencies (number of families) were tested for the 19 MYLK SNPs and asthma-associated phenotypes under an additive model (significant P values in boldface). CSGA

TABLE E4. Association testing results for 2-SNP to 4-SNP haplotype windowing of 19 MYLK SNPs in 3 populations of African descent for asthma, log[total IgE], severe sepsis, and ALI *Global P value generated by the Omnibus overall haplotype profile testing. Frequencies in combined case groups versus controls. àx 2 values generated by permutation in SNPem.

FIG E1. LD estimation for 19 variants in MYLK spanning 217 kb on chromosome 3. Pairwise LD estimated using D9 values in 342 African Caribbean founders in Barbados (right panel) and 165 African American founders (left panel) is represented as red squares for strong LD (LD 2, D9 51), pink squares for weak LD (LD 2, D9 < 1), blue squares for nonsignificant LD (LD < 2, D9 51), and white squares for little or no LD (LD < 2, D9 < 1).

FIG E2. Distributions of 2log 10 (P value) for single MYLK SNP or haplotypes from SNP windowing across the MYLK gene in the African American families. Results of 2-SNP to 4-SNP haplotype windowing for asthma (A) and log[total serum IgE concentrations] (log[tige]; B) are displayed. The y axis indicates the value of 2log 10 (P); the x axis indicates the relative position for each SNP marker locus at the MYLK gene region on human chromosome 3 in the 59 to 39 direction. The vertical line in black represents the position of each SNP marker on x axis and its height on y axis, indicating the value of 2log 10 (P). The horizontal lines in red represent the 2-SNP windowing, and the lines in blue represent the 3-SNP windowing.