SUPPLEMENTARY INFORMATION

Size: px
Start display at page:

Download "SUPPLEMENTARY INFORMATION"

Transcription

1 Supplementary materials: Table of Contents (Shi et al., Common variants on chromosomes 6p22.1 are associated with schizophrenia) A. Full Methods S2 1. Recruitment and assessment of subjects S2 2. Selection of subjects for GWAS (GAIN and NONGAIN genotyping phases). S6 3. Genotyping S7 4. Quality control of genotype data S7 5. Analysis of population substructure S11 6. Statistical analysis of association S15 7. Imputation analysis of untyped SNPs S18 8. Statistical method for combining p values from MGS, ISC and SGENE S18 9. Functional relationships among genes (MGS GWAS) S Power analyses S20 B. Supplementary Results 1 MGS results: LD in MHC; combined EA/AA analysis S22 Supplementary Results 2 Polygenic Score Analysis S25 Supplementary Results 3 Candidate genes S26 C. Supplementary References S27 D. Supplementary Acknowledgements Grant Support S28 Supplementary Tables Table S1: Sources of clinical information for MGS subjects S3 Table S2a b: Demographics, diagnoses, recruitment site and clinical features of MGS cases S4 Table S3: Control demographics S6 Table S4: Summary of genotyped MGS samples S7 Table S5: Genotyping QC summary (final dataset) S7 Table S6: Number of subjects excluded by QC filters for autosomal SNP analysis S8 Table S7: Number of additional EA subjects excluded by QC filters for X chr analysis only S9 Table S8: Number of subjects passing QC filters S10 Table S9: SNP exclusion criteria and number of excluded SNPs for European ancestry samples S10 Table S10: SNP exclusion criteria for African American samples S11 Table S11: Correlation of EA PCs computed with (ALLPCs) vs. without (EAPCs) HapMap subjects S12 Table S12: Correlation (EA subjects): NPCs with (ALLNPCs) vs. without (EANPCs) HapMap subj S13 Table S13: Correlation (AA subjects): NPCs with (ALLNPCs) vs. without (EANPCs) HapMap subj S14 Table S14: MGS results for significant MHC SNPs with/without correction for HLA specific PCs S16 Table S15: Weights and Ns for combining Z scores from three GWAS: MGS, ISC, SGENE S19 Table S16: Power analysis for meta analysis of MGS, ISC and SGENE S20 Table S17: Power analysis for MGS EA+AA, EA and AA analyses S21 Table S18: MGS combined European ancestry + African American analysis S24 Table S19: Candidate polymorphisms based on meta analyses of previous studies S26 Supplementary Figures Figure S1: Overview of Procedure and Results S2 Figure S2: PC1 and Heterozyogosity for AA subjects S9 Figure S3: Continental Principal Components S11 Figure S4: Correlations of EA PCs with SNPs in chromosomes 8p and 6p S12 Figure S5: Chr 8p specific PC3 S13 Figure S6: North South and East West within Europe gradients S14 Figure S7: EANPC5 separates Ashkenazi ancestry S15 Figure S8: US EAs vs. Australians S15 Figure S9: Autosomal and X chromosome GC lambda values S17 Figure S10: Linkage disequilibrium across the extended MHC region (MGS EA sample) S22 Figure S11: MHC region LD and association test results in the MGS African American dataset S23 1

2 Supplementary Datafiles (each Excel file has multiple worksheets as listed) Supplementary Datafile 1.xls (detailed data from MGS GWAS and additional data from meta analysis) Includes worksheets: README (contents); EA, AA, EA&AA (MGS results, P<0.001, for each analysis); Chr6_Fig1 Data (data for Figure 1 plus HLA DQA1); Chr4 Meta Results (SNPs with meta analytic P<1E 06) Supplementary Datafile 2.xls (detailed data from MGS GWAS for SNPs in 75 schizophrenia candidate genes and for polymorphisms reported as positive in a recent candidate gene meta analysis SZGENE) Includes README (contents); Gene list ; GWAS EA; GWAS AA; GWAS EA&AA; results for SZGENE SNPs Supplementary Datafile 3.pdf (contains MGS genotyping cluster plots for critical chr6 SNPs) Note that qualified scientists can obtain all raw and filtered MGS results from dbgap. Figure S1: Overview of procedure and results: Molecular Genetics of Schizophrenia GWAS and Meta analysis MGS Samples MGS GWAS European ancestry (EA) African American (AA) 2,681 cases / 2,653 controls 1,286 cases / 971 controls Affymetrix 6.0 GWAS array MGS GWAS analyses EA (primary analysis; Table 1), AA (Table 1).EA+AA (Table S19) Top results are presented in the tables listed above. Results with P < are in Supplementary Datafile 1. META ANALYSIS MGS EA International Schizophrenia SGENE Consortium (ISC) 2,681 cases 3,322 cases 2,005 cases 2,653 controls 3,587 controls 12,837 controls Combined dataset: 8,008 cases / 19,077 controls P value meta analysis weighted for case/control sample sizes MAIN RESULT A. Genome wide Full Methods significant association 7 SNPs on chromosome 6p22.1 with P < 5 x 10 8 (Table 2, Figure 1) Lowest P = 9.54E 09 (rs , 27,251,862 bp) FULL METHODS 1. Recruitment and assessment of subjects. a. Cases. Self reported European ancestry (EA) and African American (AA) unrelated adult cases with DSM IIIR (SGI study) or DSM IV (MGS1, MGS2 studies) schizophrenia (SZ) or schizoaffective disorder (SA) were collected under institutional review board approved protocols in three studies, Schizophrenia 2

3 Genetics Initiative (SGI) 1, Molecular Genetics of Schizophrenia Part 1 (MGS1) 2, and MGS2 3, as previously described in detail in those references. Briefly: SGI subjects (2.4%) were recruited by three research groups in the United States (see Acknowledgments) in a study designed to collect families with affected sibling pairs (and other affected members when available), ascertaining through probands with schizophrenia recruited through clinical settings. One member per family (proband) SGI family was selected for the GWAS sample. MGS1 subjects (10.4%) were collected by ten sites (see acknowledgements) in the United States and Australia in a study designed to collect affected sibling pairs (and other affected members when available), ascertaining through probands with schizophrenia recruited through clinical settings and community residences. One member per MGS1 family (proband) was selected for the GWAS sample. MGS2 (87.2%) subjects were collected by the same ten sites as MGS1. Single cases were selected (and both parents were asked to provide blood samples to collect case parent trios where possible, families were not studied in the GWAS except for quality control purposes). Probands had DSM IV schizophrenia, or DSM IV schizoaffective disorder but with a history of meeting Criteria A for schizophrenia for at least six months of the course of illness. These two disorders co aggregate in families with similar relative risks. 4 Source of information % of subjects Table S1: Sources of clinical information for MGS subjects Number of sources % of subjects Family history of psychosis % of subjects DIGS % 1 1.5% None known 69.3% FIGS % % Likely 13.8% Med Records 89.8% % Definite 16.8% The three studies used the same clinical assessments (with minor modifications for MGS1 and 2 compared with SGI) and diagnostic procedures. Across all cases regardless of study, Table S1 summarizes the sources of information available, as well as the results of screening for family history of psychotic disorders. Interviews were conducted in person by trained research interviewers using the Diagnostic Interview for Genetic Studies v2.0 (DIGS) 5 ; the Family Interview for Genetic Studies (FIGS) 5 was completed with an informant where possible; and medical records were obtained with the subject s written consent. The DIGS versions used in SGI and MGS are available at Almost all cases had at least two sources of information; in the 1.5% of cases where only one source was available, this was medical records in all but two, in situations where the subject was willing to participate and sign consent for the study, but was unable to give reliable historical information (or preferred not to do so), and informants were not available, but there were extensive clinical records that permitted a confident diagnosis. In all three studies, two expert diagnosticians then independently reviewed all available information about each case, made all applicable diagnoses (DSM IIIR for SGI which was completed earlier; DSM IV for MGS), and they arrived at a primary consensus best estimate final diagnosis 6 of SZ or SA for included cases (in MGS2 this required a consensus confidence level of "definite" or "likely"). For MGS1 and 2 (97.6% of cases), the consensus ratings included each DSM schizophrenia criterion, a confidence level for each diagnosis, ratings of family history, and dimensional ratings of symptoms and course. 3

4 Table S2a: Demographic characteristics and diagnoses MGS cases Analyzed sample EA cases EA controls AA cases AA controls Sex Count % count % count % count % Male 1, , Female , ,681 2, Age Mean SD mean SD mean SD mean SD at evaluation at onset na na na na Diagnosis Count % count % count % count % SCZ 2, na na 1, na na SADT na na na na SAMT na na na na AU CO NUH ATL WU STAN NY LSU UCSF IW SGI (AU=Australia; CO=Colorado; NUH=Northshore University Health System; ATL=Emory Univ.; STAN=Stanford; NY=Mt. Sinai; LSU=LSU New Orleans; UCSF=U California San Francisco; IW=Univ Iowa; SGI=NIMH Schizophrenia Genetics Initiative.) Table S2b: Clinical differences between schizophrenia and schizoaffective cases SCZ SA SCZ SA P Mean SD Mean SD P % % AgeRx Sex (female) <1E 08 AAO A criteria SCZ (present/absent) PosSx Delusions BizPosSx Bizarre Delusions Neg/Disorg E 08 Hallucinations MoodSx <1E 08 SCZ hallucinations Deterioration Disorg Speech E 04 SumCritA E 05 Disorg Behavior E 05 Negative Symptoms AgeRx = age at first treatment for psychosis); AAO = age at onset; SumCritA = sum of schizophrenia A criteria (column 2). Symptom factors (M plus) include PosSx (positive symptoms; delusions, hallucinations and paranoia), BizPosSx (bizarre positive symptoms; control delusions, other bizarre delusions, commenting/conversing hallucinations, thought broadcasting/audible thoughts/thought echo); Neg/Disorg (negative and disorganized symptoms; blunted affect, poverty of speech, formal thought disorder, disorganized behavior) and MoodSx (mood symptoms; depression, mania, and moodcongruent psychotic symptoms). Table S2a summarizes demographic, diagnostic and recruitment site information about post QC MGS2 cases and controls. Table S2b summarizes contrasts of clinical variables in schizophrenia vs. schizoaffective cases (note that the analyses of symptom factors include 3,552 subjects with valid and complete ratings). Data are final consensus diagnostician ratings based on all sources of information. Analyses were t tests for continuous variables (left column) and χ 2 for categorical variables (right column. Symptom factors are based on diagnostician ratings of duration and severity using the Lifetime Dimensions of Psychosis Scale (LDPS). 7 Family studies consistently demonstrate familial co aggregation of schizophrenia and schizoaffective disorder. 4,8,9 Diagnostic criteria separating these two disorders are subjective, and inter rater reliability is often low across research groups. 10 One large family study found the highest familial risk of 4

5 schizophrenia in relatives of schizophrenia probands with severe, chronic illness and manic features. 11 However, because some DSM IV schizoaffective patients have only brief periods of persistence of psychosis after long mood disorder episodes, which bears little resemblance to schizophrenia, the MGS investigators decided in advance to include schizoaffective cases, but to require at least six months of illness consistent with schizophrenia (and mood disorder during 30% of the clinical course). Table 2b shows that schizoaffective and schizophrenia cases had similar AAO, positive symptoms, bizarre positive symptoms (characteristic of schizophrenia) and deterioration of functioning. Schizophrenia cases had more negative symptoms, disorganized thinking and behavior; schizoaffective cases were more likely to be female. This is indeed one of the problems with this distinction: female patients have on average a later age at onset (P=1E 04 here for schizophrenia cases); later onset cases tend to have greater preservation of affect and communication, which makes it easier for clinicians to obtain descriptions of mood symptoms. Note for example that in this sample, the ratio of males to females was 2.27:1, but for both subtypes of schizoaffective disorder (depressed and bipolar types), the ratio was approximately 1:1. There is no evidence from family or twin studies that schizoaffective disorder breeds true, and much greater evidence for a genetic relationship to schizophrenia. Here, we achieved our goal of recruiting severely ill schizoaffective cases with illnesses that are clinically consistent with schizophrenia, thereby recruiting the entire range of schizophrenia cases, with and without concurrent mood symptoms. The primary analysis has been carried out based on this a priori ascertainment rule. b. Controls. EA and AA unrelated adult control subjects were collected under MGS2 as previously described 3 by Knowledge Networks (Menlo Park, CA). Briefly, KN uses random digit dialing to area codes selected to represent the national population, to recruit individuals to join a nationwide survey panel (approximately 60,000 individuals at any one time, with constant turnover). Communication with panel members is by and web interaction, but initial recruitment does not require internet connection, and those who agree to join but have no internet access are given a web TV to facilitate participation. Panel members are then contacted to invite participation in specific surveys. All EA control subjects and 41% of AA controls were recruited by KN using these methods. There was an insufficient number of AA individuals in the panel who were interested in participating to complete the collection, thus KN contracted with a second survey research company (SSI Opt In). SSI/Opt in uses banner ads on websites to recruit participants. In total, 3,364 EA and 1,301 AA control subjects were recruited, of which 772 AA subjects (59%) were recruited by SSI/Opt in. KN recruited a control subject from 21.7% of targeted EA and 15.7% of targeted AA households consented, completed the clinical questionnaire and provided a blood specimen, which is a high rate for population based studies requiring a blood sample. SSI/Optcompleted only 2.1% of targeted individuals (i.e., who initially responded to ads). Table S3 compares demographic characteristics of individuals targeted for recruitment by KN vs. the final sample. There is an excellent match except for an upward shift in education which is larger for AA than for EA subjects. Using an IRB approved procedure, individuals who responded to the initial invitation to participate were provided with an explanation of the study online (with an opportunity to phone for more information), and provided preliminary consent online. Participation required completion of an online clinical questionnaire. For those who completed the questionnaire, a national phlebotomy company (EMSI) 5

6 Table S3: Control demographics EA EA AA AA Male 48% 47% 37% 38% Female 52% 53% 63% 62% % 14% 13% 13% % 27% 32% 32% % 29% 38% 40% % 30% 18% 15% < HS 10% 8% 10% 3% HS 30% 27% 23% 15% Some coll. 31% 31% 41% 51% Bachelor 29% 34% 27% 30% Northeast 22% 18% 17% 14% Midwest 27% 27% 23% 26% South 35% 37% 55% 54% West 16% 18% 5% 6% Non Metro 18% 15% 7% 8% Metro 82% 85% 93% 92% then was asked to contact the individual and obtain a blood specimen. Participants signed written informed consent at that time. All control participants gave written consent for their biological materials and clinical questionnaire responses to be used for any medical research at the discretion of NIMH. As described in the consent form, the link between ID and name of control participants was destroyed after one year, so that the control sample is fully anonymized for medical research purposes. The clinical questionnaire was completed online. Volunteers were not included or excluded from the panel based on questionnaire responses, rather, all consenting subjects were recruited, and clinical questionnaire responses collected for use by investigators to include or exclude based on the needs of the study. The questionnaire included: The Composite International Diagnostic Interview Short Form 12, modified for lifetime (rather than 12 month) common mood, anxiety, and substance use disorders; Items for lifetime diagnosis or treatment of schizophrenia or schizoaffective disorder, or hallucinations or delusions, or of bipolar disorder; Ethnic origins of each grandparent (permitting multiple responses per grandparent). 12 item Eysenck neuroticism and extraversion scales. 13 Additional items for sexual orientation, height, and maximum adult weight outside of pregnancy, and a nicotine dependence screen. For this study, we excluded control individuals who endorsed any item for a psychotic or bipolar disorder, or who failed to answer any of those questions, or who were outliers in the number of missing items or of yes items in the questionnaire as a whole (slightly less than 8% were excluded). 2. Selection of subjects for GWAS (GAIN and NONGAIN genotyping phases). Case and control subjects were genotyped in two phases. This project was selected for participation in the Genetic Association Information Network (GAIN) ( which provided GWAS genotyping for selected studies. (Note that GAIN was not involved with the collection of subjects or the scientific design of the study i.e., there was a GAIN genotyping phase for MGS, but not a GAIN schizophrenia study.) Because of limitations in GAIN s sample size capacity, only part of the sample could be genotyped under GAIN s auspices. The remainder of the sample (called nongain here) was genotyped in the same laboratory several months later as described below, under U01 funding from NIMH. Table S4 summarizes the samples that were submitted for genotyping to each phase of the study. Approximately half of the EA and 95% of the AA subjects were genotyped within GAIN, and the remainder (at the same lab, see below) under U01 funding from NIMH. Note that not all potentially eligible MGS cases and controls were submitted for GWAS genotyping. Many of the samples had previously studied in the Gejman lab in previous experiments. Some samples in the NIMH repository database for MGS either performed poorly in those experiments, or were ancestry outliers based on a set of ancestry informative SNPs genotyped for those experiments. 6

7 Table S4: Summary of genotyped MGS samples European ancestry subjects African American subjects case control Duplicate Trio Case Control duplicate trio GAIN NonGAIN Total Note: The same 31 EA trios (probands and their parents) were genotyped in GAIN and in NonGAIN. 3. Genotyping. Genotyping was carried out at the Broad Institute National Center for Genotyping and Analysis (funded by the U.S. National Center for Research Table S5: Genotyping QC (final dataset) Resources), under the directions of Drs. Stacey Gabriel Post QC dataset EA AA and Daniel Mirel. (Note that the GAIN genotyping was Samples Total N: carried in this lab in the Fall of 2007, and the nongain Avg Call Rate 99.7% 99.6% SNPs Total N: 696, ,468 genotyping in the Spring of 2008, using the same chip and Call Rate Autosomes 99.7% 99.6% procedures.) Genotyping was carried using the X 99.8% 99.5% Affymetrix 6.0 array, which assays 906,600 SNPs and Dup. concordance Aut 99.8% 99.7% X 99.8% 99.85% 946,000 copy number probes (for non polymorphic sites for analysis of copy number in gaps between SNPs. 14 Whereas on the older Affymetrix 500K array perfect match and mismatch probes for each SNP were tiled onto the array, on the 6.0 array, identical optimal PM probes are tiled for each allele. Genotype calls were made the BIRDSEED program, a module of the BIRDSUITE package. 15 A summary of QC parameter values for the final, post QC dataset is shown Table S5, and details are provided below. Genotypes are called based on normalized intensity values for the colors labeling the two alleles. BIRDSEED sets expected locations for the three genotype cluster centers for each SNP, based on HapMap sample data, and then searches for the true centers. Because these centers can shift in different genotyping runs, calls are usually made on batches of arrays run at approximately the same time (e.g., a 96 well plate), which maximizes call rate (as we confirmed with our own data). Genotypes with confidence > 0.1 score (as defined by Korn et al.) were considered missing. 4. Quality control of genotype data. QC of the GAIN EA and AA datasets was carried out in close collaboration with Justin Paschall of NCBI in collaboration with the GAIN analysis consultant, Dr. Gonçalo Abecasis (University of Michigan). One of the benefits of the GAIN project was the opportunity that it provided for investigators to learn from each other about GWAS analysis issues. We are indebted to these individual for their contributions to the QC of our data. Quality control and association analyses were carried out with the software package PLINK 16, supplemented where needed by custom scripts and programs. 4a. Exclusion criteria for DNA samples European ancestry (EA) subjects. EA subjects were excluded if they had: (1) genotyping call rate less than 97%; (2) genotypic gender inconsistent with the gender reported by the site or control data file; 7

8 (3) a proportion of heterozygous genotypes > 28.5% or < 26.0%. Unusually high or low heterozygosity values are thought to occur primarily because of genotyping/dna quality problems (or ancestry differences as discussed below). Note that these thresholds were based on analyses using all SNPs for which data were reported by the lab, prior to other QC. The appropriate threshold will vary according to the range of minor allele frequencies in the data (which also varies with ancestry). (4) one or more ancestry related principal component scores (see below) that were considered outliers to the distribution of that score, indicating greater African, Asian or Native American ancestry. For each pair of genotyped duplicates, one sample was excluded if it was a heterozygosity outlier or had a lower genotyping call rate. We performed IBD analysis to confirm known family relationships and to detect unexpected ( cryptic, unsuspected) relationships. Genome wide kinship was estimated for each pair of subjects based on the IBS of 167,302 SNPs (one from each consecutive four post QC SNPs as described below). Eighteen subjects were excluded because they had estimated kinship coefficients larger than 0.1 with at least 50 other subjects. Six unexpected duplicates, nineteen unexpected parent child pairs, three unexpected sibling pairs and two unexpected grandparent child pairs were detected according to the IBD sharing pattern. For each relative pair, we excluded the subject with lower genotyping call rate or extreme heterozygosity proportion. When two related controls had similar QC metrics, we excluded the younger control subject (because the subject had passed through less of the period of risk for schizophrenia). When two related cases had similar QC metrics, we examined the clinical data (blind to genotype or association results) and chose the case with the best documentation of clinical features. For the six cryptic duplicate pairs, we excluded one subject of lower call rate from four pairs and excluded both subjects for the other two pairs because the DNA identification/contamination. Table S6 summarizes how many EA subjects were excluded by each criterion (some failed multiple criteria); 321 cases/controls were excluded from the 5,655 submitted for genotyping (5.68%). Table S6: Number of subjects excluded by QC filters for autosomal SNP analysis Criterion European ancestry African American call rate heterozygosity proportion inconsistent gender unexpected duplicate unexpected relatedness 23 0 principal component outliers subjects with kinship > 0.1 with more than 50 subjects 18 3 clinical data review 2 2 Total subjects excluded Note: 1 Among six cryptic duplicate pairs, both subjects excluded for 2 pairs and one subject excluded for 4 pairs. For the EA subjects passing sample QC criteria, we performed additional QC for X chromosome analysis. Because males always have homozygous genotypes while females may have three possible genotypes, males have higher call rates than females. We excluded any male subject if his X chromosome call rate was less than 98.8% (based on the observed distribution of male genotyping call rate). We excluded any female if the X chromosome call rate was less than 97.0%, or if the X chromosome proportion of 8

9 heterozygous SNPs was larger than 40% or less than 24% (thresholds chosen based on observed distributions). In total, sixty five additional EA subjects were excluded for X chromosome SNP association analysis (Table S7). Table S7: Number of additional EA subjects excluded by QC filters for X chr analysis only Criterion European ancestry African American call rate male 11 2 call rate female 50 0 heterozygosity proportion female 4 0 Total subjects excluded b. Exclusion criteria for DNA samples African American subjects. We applied the same filters to African American (AA) samples except that we used a different approach to detect the outliers of the distribution of the genotype heterozygosity proportion. Because African Americans are an admixed population (mixing of European and African ancestors within the last 1 20 generations in most individuals, and some admixture of Native Americans), the proportion of heterozygous SNPs for each subject will be correlated with the admixture proportion that ranges from 0% African ancestry to 100% African ancestry. (This is because there is a generally higher heterozygosity in African populations, and because SNPs have been included on the array specifically for African ancestry subjects, so that admixed subjects will have a larger proportion of heterozygous genotypes both for the SNPs that are more commonly seen in Europeans and for those more commonly seen in Africans). AA Samples with (for example) 50% European ancestry have a larger heterozygosity proportion than samples with 0% or 90% European ancestry. Thus, an outlier of the distribution of the genotype heterozygosity proportion should be defined relative to samples with similar admixture proportion. We carried out Principal Component Analysis using EIGENSTRAT 17 based on SNPs passing SNP filters. Then we used the first principal component (PC1) as a surrogate for admixture proportion, because these PC clearly separates European from African HapMap subjects (and EA from AA subjects, see below). For samples with PC1 > 0.02 (i.e., the main cluster in Figure S2), a LOESS curve (lower panel) was fitted using R software package 18, the SD of the distribution around this curve was estimated, and samples with heterozygosity proportion > 4 SD above or below the curve were excluded as outliers for heterozygosity proportion given their PC1 score. Figure S2: PC1 and Heterozyogosity for AA subjects CASE 1 2 HET PC

10 Table S8 summarizes the number of AA cases and controls excluded by each criterion. In total, 109 cases/controls were removed from analysis. Two additional AA cases/controls were removed from X chromosome analysis because the X chromosome call rate was low. Table S8: Number of subjects passing QC filters European ancestry subjects African American subjects Autosomal analysis X chr analysis Autosomal analysis X chr analysis Controls Cases Controls Cases Controls Cases Controls Cases GAIN NonGAIN Total c. SNP exclusion criteria for European ancestry subjects. GAIN EA and NonGAIN EA subjects were genotyped by the same lab using the same genotyping platform, at different times (November December 2007 for GAIN; May July 2008 for nongain). To avoid false associations due to technical variation in assays over time, we performed separate SNP QC for GAIN EA, nongain EA, and the combined datasets, and excluded any SNP that failed any criterion in any of of these three QC analyses. Also, because 31 trios (table S1) were genotyped in both GAIN and NonGAIN, we utilized the SNP genotyping concordance rate of 93 duplicates (=3 31 trios) as an additional SNP QC filter. We removed SNPs that failed plate effect tests as follows: to test if genotypes of SNP j on plate k (k=1,,k) had a different frequency from the remaining plates, we designated plate k as cases and all other plates as controls, and performed a trend test for SNP j to obtain P jk (k=1,,k) for this contrast. A SNP was excluded if at least one plate produced P < 10 8 (~1 /N tests) or at least two plates produced P < 10 4 (based on experience rather than any specific statistical justification). Table S9 summarizes the criteria for filtering bad SNPs, the number of SNPs excluded by each filter and the number of SNPs passing all filters. One SNP may fail multiple filter criteria. Most of the SNPs that showed strong evidence of plate effect also failed other criteria. For example, for 62,749 autosomal SNPs with plate effect, 77.5% had MAF < 1%, 25.0% had call rate less than 95% and 20.6% failed concordance requirement. Particularly, rare SNPs were more likely to fail the plate effect test. Table S9: SNP exclusion criteria and number of excluded SNPs for European ancestry samples Number of excluded SNPs SNP filtering criteria Chr 1 22 Chr X Pseudo autosomal SNP call rate < 95% in either GAIN or NonGAIN or combined HWE p value < 10 6 based on control samples in either GAIN or NonGAIN or combined, (using only females for chr X SNPs) MAF < 1% in combined dataset Failed plate effect test in GAIN, NonGAIN or combined dataset MENDEL errors > 2 in either GAIN or NonGAIN >1 disconcordant genotypes in either GAIN 29 duplicates or NonGAIN duplicates >2 disconcordant genotypes for 93 (=3 31 trios) samples genotyped in both GAIN and NonGAIN Total genotyped SNPs Excluded SNPs SNPs passing QC filters

11 4d. SNP exclusion criteria for African American subjects. Because there were only 102 cases and 20 controls in the NonGAIN AA dataset, compared with 1256 cases and 990 controls in the GAIN AA data, we performed SNP QC by combining all AA subjects. The SNP QC metrics were computed based on subjects passing sample QC criteria. Table S10 summarizes the criteria for excluding SNPs, the number of SNPs excluded by each filter and the number of SNPs passing all filters. Note that there were only 16,824 autosomal SNPs with MAF < 1% in African American samples compared to 134,325 in European ancestry samples. This difference also explained the fact that the number of SNPs with evidence of plate effect in African American sample was much less than that in European ancestry samples because rare SNPs were more likely to fail plate effect test. Table S10: SNP exclusion criteria for African American samples Number of excluded SNPs SNP filtering criteria Chr 1 22 Chr X Pseudo autosomal call rate < 95% HWE p value < MAF < 1% failed plate effect test MENDEL errors > disconcordant genotypes > Total genotyped SNPs Excluded SNPs SNPs passing QC filters Analysis of population substructure. Principal Components Analyses were carried out with EIGENSTRAT 17 to evaluate population substructure and to determine optimal corrections. Data for one out of every four SNPs (as described above) were included and PC scores included for each subject (typically 20 scores). It has been previously shown that some of these will reflect ancestry differences among subjects, as indexed by allele frequency differences among the ancestral populations that are represented. Two main types of ancestral variation are of interest here: the major continental Figure S3: Continental Principal Components ALLNPC ALLNPC1 ANC AA CEU CHBJPT EA YOR ancestries (European, African and Asian with Native American resembling Asian at this scale); and within Europe variation. In order to understand the within Europe ancestries that are indexed by different PCs, we select ~ 1840 MGS cases and controls who reported a single ancestry on both sides of their families (with no ancestry reported as don t know ) these data were collected in both the DIGS interview for cases and in the clinical questionnaire administered to controls. These relationships are summarized below. We considered whether to include HapMap subjects in the analysis to anchor continental effects. Figure S3 shows MGS AA cases/controls (red) and EA cases/controls (purple) with HapMap subjects from Chinese Japanese populations (green), Yoruban (light blue) and European/CEU (dark blue, hidden by the EA subjects). This shows that our EA subjects are slightly more diverse than the CEU HapMap 11

12 subjects, with one projection toward the Asian pole (probably due to small Native American admixtures in some subjects), and one toward the African pole (which we show below is due to Mediterranean and Askhkenazi Jewish ancestry). The AA subjects have highly variablle degrees of European admixture. However, one captures these variations with or without HapMap subjects in the PCA. Table S11 shows correlations of EAPCs (PCs for EA cases/controls without HapMap subjects) vs. ALLPCs (computed with EA, AA and HapMap subjects). The major PCs that were detected in the analysis with HapMap subjects included were also detected in the analysis that included only the EA subjects. Note that in this table, we have indicated which PCs show a geographical gradient, based on the analyses discussed below. N S indicates North South, E W is East West, Ashk is Ashkenazi. Another effect illustrated here is that when SNPs throughout the genome are included in the analysis, there are components which are entirely due to local effects. For example, in Table S10, EAPC3 and ALLPC4 are due almost entirely to variation within the chromosome 8p common inversion region (~ 8 12 Mb), and to variation within the chromosome 6 MHC region (~ Mb), as well as several smaller contributions from other regions. Table S11: Correlation of EA PCs computed with (ALLPCs) vs. without (EAPCs) HapMap subjects Geog EAPCs ALLPCs N S N S N S (8p) E W none Ashk Ashk none none Geog N S sl N S (8p) E W none Ashk/ MHC Ashk/ MHC none none none Figure S4: Correlations of EA PCs with SNPs in chromosomes 8p (left) and 6p (right) 12

13 These local effects can be demonstrated by computing the linear correlation between SNP genotypes (coded as the number of one of the alleles present in each subject, 0 2) and each PC score. Alternatively, in PLINK one can simply perform a quantitative association analysis for each PC score for EAPC3 Figure S5: Chr 8p specific PC EAPC CASE 1 2 the SNPs included in the PCA (S. Purcell, personal communication). Figure S4 illustrates a genome wide Manhattan plot where each dot is the Pearson correlation (varying above and below 0 in the middle of the Y axis) between genotypes for one SNP and the PC score being tested. The left plot shows that EAPC 3 is driven almost entirely by SNPs from the 8p inversion, and the right plot shows that EAPC6 is driven by a combination of SNPs throughout the genome and, more strongly, by SNPs on chromosome 6p which are within the MHC region. Figure S5 shows that EAPC3 (chr 8p), on the Y axis, produces a characteristic pattern of 3 clusters which do not correlate with geographical ancestry in our experience, 8p always produced one of the larger PCs in Europeanancestry data, and always produces this pattern. Consistent with approaches suggested by others (G. Abecasis, personal communication to DFL; N. Patterson, personal communication to DFL), we were concerned about obscuring evidence for association in a local region by correcting association tests for a PC reflecting variation only in that region. This might be particularly true for MHC associated diseases. Therefore, we removed chr8:8 12Mb and Chr6:24 35Mb SNPs and re computed all PCs, now designated as EANPCs (MGS EA subjects only, no chr6 or chr8), AANPCs (AA subjects only) and ALLPCs (EA and AA subjects plus HapMap subjects). Table S12 shows correlations between EANPCs and ALLNPCs, and again, both continental and within Europe dimensions are detected either way, with a high correlation between specific pairs of components. Also shown are the P values for t tests for EA case vs. control values for each PC. Table S12: Correlation (EA subjects): NPCs with (ALLNPCs) vs. without (EANPCs) HapMap subjects Case cont P Geog EANPCs ALLNPCs 1E 7 N S E 7 sl N S E 5 N S E 3 E W None Ashk None None None None Geography N S sl N S E W None Ashk None None sl. Ashk None None Case Cont P 2XE E

14 Table S13 shows similar data for AA subjects, showing that AA subjects show primarily North South variation (as expected), indexed by AANPCs 1 and 2, and almost entirely by ALLNPC 1. Table S13: Correlation (AA subjects): NPCs with (ALLNPCs) vs. without (EANPCs) HapMap subjects AANPCs Case Cont P ALLPCs Case Cont P Based on these data, we corrected all association tests for PCs computed with the subjects who were actually included in the analysis, but only for PCs that demonstrated either a geographical gradient, and/or which demonstrated a case control difference. The selected components are listed in the section on Statistical analysis of association, and the use of these components in the tests is described there. (Note that for AANPCs, we did not correct for AANPC7 which produced a small case control difference, because the Eigenvalue for this component is very small and not localized in the genome. In the course of these analyses, we also observed that there was a modest correlation between PCs computed with autosomal SNPs vs. those computed with chromosome X SNPs. This could reflect differences in the ancestral history of X chromosome SNPs based on very recent ancestry of father and Figure S6: North South and East West within Europe gradients mother. Therefore, we computed PCs separately for the X chromosome (EAXPCs, AAXPCs, ALLXPCs) and considered their geographical gradients and case control differences as for autosomal PCs, selecting appropriate XPCs to correct in analyses (see below). The relationships of PCs to geographical gradients were determined by plotting pairs of PCs forsubjects who reported a single family ancestry, labeling each ancestry. ANOVAs were carried out in preliminary analyses to confirm relationships. As shown in Figure S6, subjects (cases or controls) who reported one ancestry are shown by specific symbols (Anglo Saxon, Mediterranean, Ashkenazi, Russian/Slavic, Northern European (Scandinavian) and Western European (French, German, etc.); all other cases and controls are 14

15 shown in red or blue respectively. PC1 indexes a North South gradient, with Anglo Saxon and Northern European at the top, then Mediterranean, and then Ashkenazi. Figure S3 (above) shows that African ancestry is far below Ashkenazi. PC3 indexes East West, with Russian/Slavic to the right here. Note that case interviews queried childhood religion, so we could distinguish Jews who self reported as Russian or Polish (countries of origin for most U.S. Ashkenazi immigrants ) non Jewish Slavic families. In the Figure, symbols for Slavic/Russian ancestry appear in both the Slavic and the Ashkenazi circle, but the latter are Ashkenazi individuals. EANPC5 (=ALLNPC6) further separates the Ashkenazi cluster from all others (Figure S7). Not all PCs with a Figure S7: EANPC5 separates Ashkenazi ancestry EANPC EANPC1 EURANC AngloSax Medit Ashk Russ/Slav NorthEur WestEur EAPC1 Figure S8: US EAs vs. Australians EAPC COUNTRY AU US geographical distribution differed between cases and controls, but it seemed prudent to include all geographic PCs as covariates. Finally, Figure S8 shows that the within Europe origins of Australians is similar to that in Americans, except that there are far fewer Ashkenazi individuals in Australia. 6. Statistical analysis of association. Logistic regression analysis was carried out to evaluate the evidence of association for each single marker, correcting for potential population stratification using principal component scores that (1) reflected the population substructure consistent with self reported ancestry; or (2) have statistically significant differences between cases and controls; or (3) corresponded to a statistically significant eigenvalue. (Also, the scores were computed without SNPs in the MHC and 8p inversion regions as discussed above.) For X chromosome SNPs, the homozygous genotypes were coded as 0 and 2 for male. For X chromosome SNPs and pseudo autosomal SNPs, we also corrected for sex indicator to eliminate potential bias due to different genotype distribution and strong call rate difference between males and females. Top EA and AA results are shown and discussed in the main text; EA+AA results are shown below in Supplementary Results 1b. The covariates used in the logistic regression models for association analysis were: EA analyses: EANPCs 1 5; EAXPCs 1 and 5 (and a covariate for sex) AA analyses: AANPCs 1 2; AAXPCs 1 2 (and a covariate for sex) Combined EA+AA analyses (Suppl Results 1b): ALLNPCs 1, 2, 3, 4, 6; ALLXPCs 1,2,5 (and sex) In response to a reviewer comment about the possibility of unique population structure in the MHC region, we computed principal components (using EIGENSTRAT) using 416 SNPs (out of 2357 post QC 15

16 SNPs) on chromosome 6p between Mb, filtered so that no marker marker pair had r 2 <0.4. We then re computed association tests for MHC SNPs in the MGS EA sample with both the global PCs used in the planned analysis and also the top 10 MHC PCs as covariates. While the effect on P values was variable, for all 7 SNPs with genome wide significant P values in the meta analysis, MGS P values were essentially unchanged (and were slightly smaller for 5/7 SNPs) with MHC correction. We conclude that the significant results obtained in the meta analysis are not due to failure to correct for population substructure that is specific to MHC, and that our PCs based on genome wide components (i.e., those not primarily due to local effects) are adequate for population substructure correction in this sample. Table S14: MGS results for significant MHC SNPs with and without correction for MHC specific PCs MGS P MGS P Rs_ID BP COMB P No MHC corr MHC corr rs E E E 02 rs E E E 02 rs E E E 02 rs E E E 02 rs E E E 02 rs E E E 02 rs E E E 03 Genomic control lambda values 19 are shown below in Figure S9. We have considered P values < 5 X 10 8 as evidence of genome wide significant association between a SNP and disease status. Three recent papers have applied different approaches to estimating a genome wide threshold for GWAS analyses, based on the assumption that the threshold should take into account tests of all common SNPs. 20,21,22 The estimated thresholds have ranged from 1 7 X 10 8, with 5 X 10 8 representing a reasonable choice based on these data. 16

17 Figure S9: Autosomal and X chromosome GC lambda values 17

18 7. Imputation analysis of untyped SNPs. We selected 192 regions that had at least one genotyped SNP with P < in one of the three analyses (EA, AA or combined). Hapmap SNPs in the 192 regions were imputed using MACH 1.0 ( Phased haplotypes for 60 CEU subjects (120 haplotypes) were used as the reference for imputing EA sample genotypes; these haplotypes plus 120 phased YRI haplotypes were used to impute genotypes of AA subjects, who represent admixtures of these two ancestries. Haung et al. 23 concluded that for MACH 1.o imputation in 29 world populations, in most populations, mixtures including individuals from at least two HapMap panels produced the highest imputation accuracy, consistent with our approach here. Any SNP imputed with information content r 2 <0.3 was excluded from association analysis because of lack of power. This threshold was recommended in an unpublished earlier paper on MACH 1.0 by the Abecasis group, as providing a good balance between power (retaining well imputed data) and accuracy (excluding poorly imputed data). For untyped SNPs or missing genotypes, MACH estimated genotypes conditioning on the flanking markers, as a dosage (i.e., the estimate is non integer, reflecting the weighted probabilities of each possible genotype). We then used logistic regression to test association between disease status and the estimated genotypic dosage, correcting the population stratification as before (but this was done with a custom program, because non integer genotype dosages cannot be submittedto PLINK in an external file). We did not observe larger signals from this analysis except for the SNP rs that is located in the MHC region. To perform P value meta analysis for MGS, ISC and SGENE, we imputed Hapmap SNPs in the six regions (MAD1L1, FXR1, MHC, TCF4, NEK1, PTBP2) and performed logistic regression analysis for EA samples and EA+AA combined samples. For these six regions, we reported the p values based on observed genotypic values for genotyped SNPs and the p values based on imputed dosage for imputed SNPs. 8. Statistical method for combining p values from MGS, ISC and SGENE. For a given SNP, MGS, ISC and SGENE shared P values, direction of allelic effects and the number of cases/controls (Table S14). Because overlapping samples from the three datasets were removed (i.e., the Aberdeen sample was included here only in analyses for ISC and not SGENE), the P values from the three studies were independent. We converted the p values into Z scores taking the allelic direction of the effect into consideration. We then used the following linear combination of Z scores to summarize the association signals from three studies: Z = w1 Z1 + w1z 2 + w3z 3 where w1 + w2 + w3 = 1. For any given combination of weights ( w 1, w2, w3 ), Z ~ N(0,1 ) if each Z k follows N(0,1). Let ξk denote the non centrality parameter of Z score Z, i.e. ξ = E Z ) for k=1, 2, 3 under an alternative k k hypothesis. Then, it is straightforward to show that weights w, w, w ) ( ξ, ξ, ) maximize the ( k ( ξ3 non centrality parameter of the combined Z score, and hence maximize the power of the combined test. On the other hand, the non centrality parameter for a trend test 24 is 18

19 1/ 2 1/ 2 nk, + 1 nk, 1 ( Rk 1)[ pk (1 pk )] ξ k = 2nk. nk nk 1+ pk ( Rk 1) Here, is the number of cases, is the number of controls, and n nk,+1 n = + k, 1 k nk, + 1 nk, 1 is the total number of samples of study k. R is the relative risk and p is the allele frequency of the controls in k study k. We assumed that the relative risks are consistent across three studies (a fixed effect model) and the allele frequencies were identical across three studies (all from the EA population), thenξ = E Z ) is asymptotically proportional to k ( k k 1/ 2, 1, 1 2 nk + nk nk nk nk η k =. Hence, the asymptotically most powerful test is given by taking the following weights: η w. k k = / 2 [ η1 + η2 + η3 ] Table S15 gives the sample sizes and optimal weights for MGS, ISC, SGENE for meta analysis. Table S15: Weights and Ns for combining Z scores from three GWAS: MGS, ISC, SGENE # case # control # total weight for EA analysis MGS EA ISC SGENE TOTAL An exploratory meta analysis was also carried out that substituted the MGS EA+AA dataset for the MGS EA dataset. As there are no other large AA GWAS datasets available for meta analysis, results will be reported elsewhere. 9. Functional relationships among genes (MGS GWAS). We explored several bioinformatic tools to quantify the functional relationships among genes, and found that they defined different relationships among genes containing SNPs with evidence for association in the MGS GWAS analyses. Improvements in genome annotation will be needed to make such analyses more stable. We annotated gene relationships of SNPs using SNPPER ( Genome Variation Scanner ( and the RefSeq gene list We explored functional properties of 93 genes with one SNP with P < within 10 kb (European ancestry, African American, or combined analyses) using: GeneCards ( AceView ( and 19

20 EntrezGene ( We supplemented these sources by searching for information on gene relationships using the webbased tool GRAIL (Gene Relationships Across Implicated Loci; Raychaudhuri, S., et al., under review; and 25 ). GRAIL uses a text search method to search for subsets of genes that are mentioned in PubMed abstracts using similar non common words. GRAIL results are available upon request, or can be reanalyzed by the reader with the current GRAIL version based on any list of genes of interest from this study. 10a. Power analysis for meta analysis of MGS, ISC and SGENE (Table S16). For each combination of allele frequency (p) and allele relative risk (R), we computed the power of detecting additive genetic effect assuming that relative risks and allele frequencies are consistent across three studies. The power was computed based on the number of cases and the number of controls for MGS, ISC and SGENE. We also assumed that the SNP was genotyped in three studies which is a reasonable approximation because the information content of imputation for the top SNPs was very high. Given p, R and number of cases/controls, the asymptotic non centrality parameter of the trend test (Z score) is given by (1) 1/ 2 1/ 2 nk, + 1 nk, 1 ( R 1)[ p(1 p)] ξ k = 2nk. nk nk 1+ p( R 1) Because we combine Z scores with linear weights w, w, w ) ( ξ, ξ, ) (Refer to Table S15 for ( ξ3 sample size and weights for meta analysis of EA samples), the non centrality parameter of the combined Z score test is give byξ = w 1ξ1 + w2ξ 2 + w3ξ 3. The power is then computed at significance 8 levelα = 5 10 : P ( Z > q α / 2 = 5.45). Table S16 shows the results of power analyses for metaanalysis of the EA samples. ξ Table S16: Power analysis for meta analyses of MGS (EA), ISC and SGENE Risk allele frq GRR b. Power analysis for MGS. For each combination of allele frequency (p) and allele relative risk (R), we computed the power of detecting additive genetic effect. The power was computed based on the number of cases and the number of controls for MGS. The power analysis was for genotyped SNPs. Given p, R and number of cases/controls, the asymptotic non centrality parameter ξ of the trend test 20

21 8 (Z score) was computed given by (1). The power was then computed at significance levelα = 5 10 : P ξ ( Z > q α / 2 = 5.45). Table S17 summarizes power calculations for the AA, EA and EA+AA samples. Table S17: Power analysis for MGS EA, AA and EA+AA analyses Risk allele frq GRR EA analysis AA analysis EA+AA analysis

22 B. Supplementary Results 1 MGS GWAS (LD in MHC region; EA+AA analysis) B1a. Linkage disequilibrium (LD) values across the extended MHC region are illustrated for the MGS European ancestry sample Figure S10 and for the MGS African American dataset in Figure S11. Figure S10: Linkage disequilibrium across the extended MHC region (MGS European ancestry sample). Shown are LD relationships (r squared, color coded) for the 198 SNPs in the extended MHC region on chromosome 6 that yielded P values < 10 4 in the meta analysis of MGS (EA), ISC and SGENE data, computed from MGS EA subjects (genotyped and imputed SNPs). The location is shown for the peak evidence for association at Mb in the extended Class I region. Locations of the classical HLA Class I, II and III regions are also shown. 26 HLA DQA1, which showed nearly significant evidence for association, is in the Class II region, which is not in LD with the extended Class I region where significant evidence for association was observed. 22

23 Figure S11: MHC region LD and association test results in the MGS African American dataset. Shown are LD data (computed from MGS AA data) for 23 of the 26 SNPs in the LD portion of Figure 1 (3 of the SNPs were not part of the AA dataset), with AA association test results for all genotyped and imputed SNPs in the region. Some evidence for association was observed near TRIM38 and in the FKSG83/POM121L2 region, but support for association is too modest to assume that these results can localize the stronger signal observed in the European ancestry meta analysis. 23

Tutorial on Genome-Wide Association Studies

Tutorial on Genome-Wide Association Studies Tutorial on Genome-Wide Association Studies Assistant Professor Institute for Computational Biology Department of Epidemiology and Biostatistics Case Western Reserve University Acknowledgements Dana Crawford

More information

Quality Control Analysis of Add Health GWAS Data

Quality Control Analysis of Add Health GWAS Data 2018 Add Health Documentation Report prepared by Heather M. Highland Quality Control Analysis of Add Health GWAS Data Christy L. Avery Qing Duan Yun Li Kathleen Mullan Harris CAROLINA POPULATION CENTER

More information

Supplementary Figure 1. Principal components analysis of European ancestry in the African American, Native Hawaiian and Latino populations.

Supplementary Figure 1. Principal components analysis of European ancestry in the African American, Native Hawaiian and Latino populations. Supplementary Figure. Principal components analysis of European ancestry in the African American, Native Hawaiian and Latino populations. a Eigenvector 2.5..5.5. African Americans European Americans e

More information

New Enhancements: GWAS Workflows with SVS

New Enhancements: GWAS Workflows with SVS New Enhancements: GWAS Workflows with SVS August 9 th, 2017 Gabe Rudy VP Product & Engineering 20 most promising Biotech Technology Providers Top 10 Analytics Solution Providers Hype Cycle for Life sciences

More information

Nature Genetics: doi: /ng Supplementary Figure 1

Nature Genetics: doi: /ng Supplementary Figure 1 Supplementary Figure 1 Illustrative example of ptdt using height The expected value of a child s polygenic risk score (PRS) for a trait is the average of maternal and paternal PRS values. For example,

More information

Supplementary Figure 1 Dosage correlation between imputed and genotyped alleles Imputed dosages (0 to 2) of 2-digit alleles (red) and 4-digit alleles

Supplementary Figure 1 Dosage correlation between imputed and genotyped alleles Imputed dosages (0 to 2) of 2-digit alleles (red) and 4-digit alleles Supplementary Figure 1 Dosage correlation between imputed and genotyped alleles Imputed dosages (0 to 2) of 2-digit alleles (red) and 4-digit alleles (green) of (A) HLA-A, HLA-B, (C) HLA-C, (D) HLA-DQA1,

More information

Nature Genetics: doi: /ng Supplementary Figure 1. Country distribution of GME samples and designation of geographical subregions.

Nature Genetics: doi: /ng Supplementary Figure 1. Country distribution of GME samples and designation of geographical subregions. Supplementary Figure 1 Country distribution of GME samples and designation of geographical subregions. GME samples collected across 20 countries and territories from the GME. Pie size corresponds to the

More information

GENOME-WIDE ASSOCIATION STUDIES

GENOME-WIDE ASSOCIATION STUDIES GENOME-WIDE ASSOCIATION STUDIES SUCCESSES AND PITFALLS IBT 2012 Human Genetics & Molecular Medicine Zané Lombard IDENTIFYING DISEASE GENES??? Nature, 15 Feb 2001 Science, 16 Feb 2001 IDENTIFYING DISEASE

More information

Human population sub-structure and genetic association studies

Human population sub-structure and genetic association studies Human population sub-structure and genetic association studies Stephanie A. Santorico, Ph.D. Department of Mathematical & Statistical Sciences Stephanie.Santorico@ucdenver.edu Global Similarity Map from

More information

Supplementary Figure 1: Attenuation of association signals after conditioning for the lead SNP. a) attenuation of association signal at the 9p22.

Supplementary Figure 1: Attenuation of association signals after conditioning for the lead SNP. a) attenuation of association signal at the 9p22. Supplementary Figure 1: Attenuation of association signals after conditioning for the lead SNP. a) attenuation of association signal at the 9p22.32 PCOS locus after conditioning for the lead SNP rs10993397;

More information

During the hyperinsulinemic-euglycemic clamp [1], a priming dose of human insulin (Novolin,

During the hyperinsulinemic-euglycemic clamp [1], a priming dose of human insulin (Novolin, ESM Methods Hyperinsulinemic-euglycemic clamp procedure During the hyperinsulinemic-euglycemic clamp [1], a priming dose of human insulin (Novolin, Clayton, NC) was followed by a constant rate (60 mu m

More information

2) Cases and controls were genotyped on different platforms. The comparability of the platforms should be discussed.

2) Cases and controls were genotyped on different platforms. The comparability of the platforms should be discussed. Reviewers' Comments: Reviewer #1 (Remarks to the Author) The manuscript titled 'Association of variations in HLA-class II and other loci with susceptibility to lung adenocarcinoma with EGFR mutation' evaluated

More information

BADDS Appendix A: The Bipolar Affective Disorder Dimensional Scale, version 3.0 (BADDS 3.0)

BADDS Appendix A: The Bipolar Affective Disorder Dimensional Scale, version 3.0 (BADDS 3.0) BADDS Appendix A: The Bipolar Affective Disorder Dimensional Scale, version 3.0 (BADDS 3.0) General information The Bipolar Affective Disorder Dimension Scale (BADDS) has been developed in order to address

More information

Genome-wide Association Analysis Applied to Asthma-Susceptibility Gene. McCaw, Z., Wu, W., Hsiao, S., McKhann, A., Tracy, S.

Genome-wide Association Analysis Applied to Asthma-Susceptibility Gene. McCaw, Z., Wu, W., Hsiao, S., McKhann, A., Tracy, S. Genome-wide Association Analysis Applied to Asthma-Susceptibility Gene McCaw, Z., Wu, W., Hsiao, S., McKhann, A., Tracy, S. December 17, 2014 1 Introduction Asthma is a chronic respiratory disease affecting

More information

The Loss of Heterozygosity (LOH) Algorithm in Genotyping Console 2.0

The Loss of Heterozygosity (LOH) Algorithm in Genotyping Console 2.0 The Loss of Heterozygosity (LOH) Algorithm in Genotyping Console 2.0 Introduction Loss of erozygosity (LOH) represents the loss of allelic differences. The SNP markers on the SNP Array 6.0 can be used

More information

Identification of regions with common copy-number variations using SNP array

Identification of regions with common copy-number variations using SNP array Identification of regions with common copy-number variations using SNP array Agus Salim Epidemiology and Public Health National University of Singapore Copy Number Variation (CNV) Copy number alteration

More information

LTA Analysis of HapMap Genotype Data

LTA Analysis of HapMap Genotype Data LTA Analysis of HapMap Genotype Data Introduction. This supplement to Global variation in copy number in the human genome, by Redon et al., describes the details of the LTA analysis used to screen HapMap

More information

Supplementary information for: Exome sequencing and the genetic basis of complex traits

Supplementary information for: Exome sequencing and the genetic basis of complex traits Supplementary information for: Exome sequencing and the genetic basis of complex traits Adam Kiezun 1,2,14, Kiran Garimella 2,14, Ron Do 2,3,14, Nathan O. Stitziel 4,2,14, Benjamin M. Neale 2,3,13, Paul

More information

Dan Koller, Ph.D. Medical and Molecular Genetics

Dan Koller, Ph.D. Medical and Molecular Genetics Design of Genetic Studies Dan Koller, Ph.D. Research Assistant Professor Medical and Molecular Genetics Genetics and Medicine Over the past decade, advances from genetics have permeated medicine Identification

More information

Global variation in copy number in the human genome

Global variation in copy number in the human genome Global variation in copy number in the human genome Redon et. al. Nature 444:444-454 (2006) 12.03.2007 Tarmo Puurand Study 270 individuals (HapMap collection) Affymetrix 500K Whole Genome TilePath (WGTP)

More information

CS2220 Introduction to Computational Biology

CS2220 Introduction to Computational Biology CS2220 Introduction to Computational Biology WEEK 8: GENOME-WIDE ASSOCIATION STUDIES (GWAS) 1 Dr. Mengling FENG Institute for Infocomm Research Massachusetts Institute of Technology mfeng@mit.edu PLANS

More information

Genome-wide association study identifies variants in TMPRSS6 associated with hemoglobin levels.

Genome-wide association study identifies variants in TMPRSS6 associated with hemoglobin levels. Supplementary Online Material Genome-wide association study identifies variants in TMPRSS6 associated with hemoglobin levels. John C Chambers, Weihua Zhang, Yun Li, Joban Sehmi, Mark N Wass, Delilah Zabaneh,

More information

Genome-wide association studies (case/control and family-based) Heather J. Cordell, Institute of Genetic Medicine Newcastle University, UK

Genome-wide association studies (case/control and family-based) Heather J. Cordell, Institute of Genetic Medicine Newcastle University, UK Genome-wide association studies (case/control and family-based) Heather J. Cordell, Institute of Genetic Medicine Newcastle University, UK GWAS For the last 8 years, genome-wide association studies (GWAS)

More information

Pirna Sequence Variants Associated With Prostate Cancer In African Americans And Caucasians

Pirna Sequence Variants Associated With Prostate Cancer In African Americans And Caucasians Yale University EliScholar A Digital Platform for Scholarly Publishing at Yale Public Health Theses School of Public Health January 2015 Pirna Sequence Variants Associated With Prostate Cancer In African

More information

Introduction to LOH and Allele Specific Copy Number User Forum

Introduction to LOH and Allele Specific Copy Number User Forum Introduction to LOH and Allele Specific Copy Number User Forum Jonathan Gerstenhaber Introduction to LOH and ASCN User Forum Contents 1. Loss of heterozygosity Analysis procedure Types of baselines 2.

More information

SUPPLEMENTARY DATA. 1. Characteristics of individual studies

SUPPLEMENTARY DATA. 1. Characteristics of individual studies 1. Characteristics of individual studies 1.1. RISC (Relationship between Insulin Sensitivity and Cardiovascular disease) The RISC study is based on unrelated individuals of European descent, aged 30 60

More information

Introduction to the Genetics of Complex Disease

Introduction to the Genetics of Complex Disease Introduction to the Genetics of Complex Disease Jeremiah M. Scharf, MD, PhD Departments of Neurology, Psychiatry and Center for Human Genetic Research Massachusetts General Hospital Breakthroughs in Genome

More information

For more information about how to cite these materials visit

For more information about how to cite these materials visit Author(s): Kerby Shedden, Ph.D., 2010 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution Share Alike 3.0 License: http://creativecommons.org/licenses/by-sa/3.0/

More information

University of Groningen. Metabolic risk in people with psychotic disorders Bruins, Jojanneke

University of Groningen. Metabolic risk in people with psychotic disorders Bruins, Jojanneke University of Groningen Metabolic risk in people with psychotic disorders Bruins, Jojanneke IMPORTANT NOTE: You are advised to consult the publisher's version (publisher's PDF) if you wish to cite from

More information

UNIVERSITY OF CALIFORNIA, LOS ANGELES

UNIVERSITY OF CALIFORNIA, LOS ANGELES UNIVERSITY OF CALIFORNIA, LOS ANGELES BERKELEY DAVIS IRVINE LOS ANGELES MERCED RIVERSIDE SAN DIEGO SAN FRANCISCO UCLA SANTA BARBARA SANTA CRUZ DEPARTMENT OF EPIDEMIOLOGY SCHOOL OF PUBLIC HEALTH CAMPUS

More information

White Paper Estimating Complex Phenotype Prevalence Using Predictive Models

White Paper Estimating Complex Phenotype Prevalence Using Predictive Models White Paper 23-12 Estimating Complex Phenotype Prevalence Using Predictive Models Authors: Nicholas A. Furlotte Aaron Kleinman Robin Smith David Hinds Created: September 25 th, 2015 September 25th, 2015

More information

Nature Methods: doi: /nmeth.3115

Nature Methods: doi: /nmeth.3115 Supplementary Figure 1 Analysis of DNA methylation in a cancer cohort based on Infinium 450K data. RnBeads was used to rediscover a clinically distinct subgroup of glioblastoma patients characterized by

More information

Supplementary Figures

Supplementary Figures Supplementary Figures Supplementary Fig 1. Comparison of sub-samples on the first two principal components of genetic variation. TheBritishsampleisplottedwithredpoints.The sub-samples of the diverse sample

More information

Nature Neuroscience: doi: /nn Supplementary Figure 1. Missense damaging predictions as a function of allele frequency

Nature Neuroscience: doi: /nn Supplementary Figure 1. Missense damaging predictions as a function of allele frequency Supplementary Figure 1 Missense damaging predictions as a function of allele frequency Percentage of missense variants classified as damaging by eight different classifiers and a classifier consisting

More information

Supplementary information. Supplementary figure 1. Flow chart of study design

Supplementary information. Supplementary figure 1. Flow chart of study design Supplementary information Supplementary figure 1. Flow chart of study design Supplementary Figure 2. Quantile-quantile plot of stage 1 results QQ plot of the observed -log10 P-values (y axis) versus the

More information

Association-heterogeneity mapping identifies an Asian-specific association of the GTF2I locus with rheumatoid arthritis

Association-heterogeneity mapping identifies an Asian-specific association of the GTF2I locus with rheumatoid arthritis Supplementary Material Association-heterogeneity mapping identifies an Asian-specific association of the GTF2I locus with rheumatoid arthritis Kwangwoo Kim 1,, So-Young Bang 1,, Katsunori Ikari 2,3, Dae

More information

From Biostatistics Using JMP: A Practical Guide. Full book available for purchase here. Chapter 1: Introduction... 1

From Biostatistics Using JMP: A Practical Guide. Full book available for purchase here. Chapter 1: Introduction... 1 From Biostatistics Using JMP: A Practical Guide. Full book available for purchase here. Contents Dedication... iii Acknowledgments... xi About This Book... xiii About the Author... xvii Chapter 1: Introduction...

More information

Whole-genome detection of disease-associated deletions or excess homozygosity in a case control study of rheumatoid arthritis

Whole-genome detection of disease-associated deletions or excess homozygosity in a case control study of rheumatoid arthritis HMG Advance Access published December 21, 2012 Human Molecular Genetics, 2012 1 13 doi:10.1093/hmg/dds512 Whole-genome detection of disease-associated deletions or excess homozygosity in a case control

More information

indicated in shaded lowercase letters (hg19, Chr2: 217,955, ,957,266).

indicated in shaded lowercase letters (hg19, Chr2: 217,955, ,957,266). Legend for Supplementary Figures Figure S1: Sequence of 2q35 encnv. The DNA sequence of the 1,375bp 2q35 encnv is indicated in shaded lowercase letters (hg19, Chr2: 217,955,892-217,957,266). Figure S2:

More information

New methods for discovering common and rare genetic variants in human disease

New methods for discovering common and rare genetic variants in human disease Washington University in St. Louis Washington University Open Scholarship All Theses and Dissertations (ETDs) 1-1-2011 New methods for discovering common and rare genetic variants in human disease Peng

More information

Problem 3: Simulated Rheumatoid Arthritis Data

Problem 3: Simulated Rheumatoid Arthritis Data Problem 3: Simulated Rheumatoid Arthritis Data Michael B Miller Michael Li Gregg Lind Soon-Young Jang The plan

More information

Week #1 Classification & Diagnosis

Week #1 Classification & Diagnosis Week #1 Classification & Diagnosis 3 Categories in the Conceptualisation of Abnormality Psychological Dysfunction: Refers to a breakdown in cognitive, emotional or behavioural functioning. Knowing where

More information

Assessing Accuracy of Genotype Imputation in American Indians

Assessing Accuracy of Genotype Imputation in American Indians Assessing Accuracy of Genotype Imputation in American Indians Alka Malhotra*, Sayuko Kobes, Clifton Bogardus, William C. Knowler, Leslie J. Baier, Robert L. Hanson Phoenix Epidemiology and Clinical Research

More information

Supplemental Methods: GWA for MDD. MDD-etiology-supplement.doc 5/21/ :33 PM. Table of Contents

Supplemental Methods: GWA for MDD. MDD-etiology-supplement.doc 5/21/ :33 PM. Table of Contents Supplemental Methods: GWA for MDD Authors Document: Date: Sullivan, et al. MDD-etiology-supplement.doc 5/21/2010 12:33 PM Table of Contents Genomewide Linkage Studies of MDD or Neuroticism...2 Supplemental

More information

Nature Structural & Molecular Biology: doi: /nsmb.2419

Nature Structural & Molecular Biology: doi: /nsmb.2419 Supplementary Figure 1 Mapped sequence reads and nucleosome occupancies. (a) Distribution of sequencing reads on the mouse reference genome for chromosome 14 as an example. The number of reads in a 1 Mb

More information

Big Data Training for Translational Omics Research. Session 1, Day 3, Liu. Case Study #2. PLOS Genetics DOI: /journal.pgen.

Big Data Training for Translational Omics Research. Session 1, Day 3, Liu. Case Study #2. PLOS Genetics DOI: /journal.pgen. Session 1, Day 3, Liu Case Study #2 PLOS Genetics DOI:10.1371/journal.pgen.1005910 Enantiomer Mirror image Methadone Methadone Kreek, 1973, 1976 Methadone Maintenance Therapy Long-term use of Methadone

More information

CHROMOSOMAL MICROARRAY (CGH+SNP)

CHROMOSOMAL MICROARRAY (CGH+SNP) Chromosome imbalances are a significant cause of developmental delay, mental retardation, autism spectrum disorders, dysmorphic features and/or birth defects. The imbalance of genetic material may be due

More information

NIH Public Access Author Manuscript Nat Genet. Author manuscript; available in PMC 2012 September 01.

NIH Public Access Author Manuscript Nat Genet. Author manuscript; available in PMC 2012 September 01. NIH Public Access Author Manuscript Published in final edited form as: Nat Genet. ; 44(3): 247 250. doi:10.1038/ng.1108. Estimating the proportion of variation in susceptibility to schizophrenia captured

More information

CNV PCA Search Tutorial

CNV PCA Search Tutorial CNV PCA Search Tutorial Release 8.1 Golden Helix, Inc. March 18, 2014 Contents 1. Data Preparation 2 A. Join Log Ratio Data with Phenotype Information.............................. 2 B. Activate only

More information

A Genome-wide Association Study in Han Chinese Identifies Multiple. Susceptibility loci for IgA Nephropathy. Supplementary Material

A Genome-wide Association Study in Han Chinese Identifies Multiple. Susceptibility loci for IgA Nephropathy. Supplementary Material A Genome-wide Association Study in Han Chinese Identifies Multiple Susceptibility loci for IgA Nephropathy Supplementary Material Xue-Qing Yu 1,13, Ming Li 1,13, Hong Zhang 2,13, Hui-Qi Low 3, Xin Wei

More information

Reviewers' comments: Reviewer #1 (Remarks to the Author):

Reviewers' comments: Reviewer #1 (Remarks to the Author): Reviewers' comments: Reviewer #1 (Remarks to the Author): Major claims of the paper A well-designed and well-executed large-scale GWAS is presented for male patternbaldness, identifying 71 associated loci,

More information

5/2/18. After this class students should be able to: Stephanie Moon, Ph.D. - GWAS. How do we distinguish Mendelian from non-mendelian traits?

5/2/18. After this class students should be able to: Stephanie Moon, Ph.D. - GWAS. How do we distinguish Mendelian from non-mendelian traits? corebio II - genetics: WED 25 April 2018. 2018 Stephanie Moon, Ph.D. - GWAS After this class students should be able to: 1. Compare and contrast methods used to discover the genetic basis of traits or

More information

Systems of Mating: Systems of Mating:

Systems of Mating: Systems of Mating: 8/29/2 Systems of Mating: the rules by which pairs of gametes are chosen from the local gene pool to be united in a zygote with respect to a particular locus or genetic system. Systems of Mating: A deme

More information

Nature Genetics: doi: /ng Supplementary Figure 1. PCA for ancestry in SNV data.

Nature Genetics: doi: /ng Supplementary Figure 1. PCA for ancestry in SNV data. Supplementary Figure 1 PCA for ancestry in SNV data. (a) EIGENSTRAT principal-component analysis (PCA) of SNV genotype data on all samples. (b) PCA of only proband SNV genotype data. (c) PCA of SNV genotype

More information

Open Translational Science in Schizophrenia. Harvard Catalyst Workshop March 24, 2015

Open Translational Science in Schizophrenia. Harvard Catalyst Workshop March 24, 2015 Open Translational Science in Schizophrenia Harvard Catalyst Workshop March 24, 2015 1 Fostering Use of the Janssen Clinical Trial Data The goal of this project is to foster collaborations around Janssen

More information

SUPPLEMENTARY INFORMATION

SUPPLEMENTARY INFORMATION SUPPLEMENTRY INFORMTION Supplementary Figure 1. Q-Q plot of RE/NIMH autism family TDT results. The quantile-quantile plot of the expected and observed P-values is shown. The blue circles represent the

More information

Statistical Analysis of Single Nucleotide Polymorphism Microarrays in Cancer Studies

Statistical Analysis of Single Nucleotide Polymorphism Microarrays in Cancer Studies Statistical Analysis of Single Nucleotide Polymorphism Microarrays in Cancer Studies Stanford Biostatistics Workshop Pierre Neuvial with Henrik Bengtsson and Terry Speed Department of Statistics, UC Berkeley

More information

Statistical Genetics : Gene Mappin g through Linkag e and Associatio n

Statistical Genetics : Gene Mappin g through Linkag e and Associatio n Statistical Genetics : Gene Mappin g through Linkag e and Associatio n Benjamin M Neale Manuel AR Ferreira Sarah E Medlan d Danielle Posthuma About the editors List of contributors Preface Acknowledgements

More information

Nature Genetics: doi: /ng Supplementary Figure 1

Nature Genetics: doi: /ng Supplementary Figure 1 Supplementary Figure 1 Relationship of the Icelandic psychosis family to five individuals (X1 X5) who carry the haplotype harboring RBM12 c.2377g>t but not RBM12 c.2377g>t. An asterisk marks whole-genome-sequenced

More information

DOES THE BRCAX GENE EXIST? FUTURE OUTLOOK

DOES THE BRCAX GENE EXIST? FUTURE OUTLOOK CHAPTER 6 DOES THE BRCAX GENE EXIST? FUTURE OUTLOOK Genetic research aimed at the identification of new breast cancer susceptibility genes is at an interesting crossroad. On the one hand, the existence

More information

Rare Variant Burden Tests. Biostatistics 666

Rare Variant Burden Tests. Biostatistics 666 Rare Variant Burden Tests Biostatistics 666 Last Lecture Analysis of Short Read Sequence Data Low pass sequencing approaches Modeling haplotype sharing between individuals allows accurate variant calls

More information

Dr Rick Tearle Senior Applications Specialist, EMEA Complete Genomics Complete Genomics, Inc.

Dr Rick Tearle Senior Applications Specialist, EMEA Complete Genomics Complete Genomics, Inc. Dr Rick Tearle Senior Applications Specialist, EMEA Complete Genomics Topics Overview of Data Processing Pipeline Overview of Data Files 2 DNA Nano-Ball (DNB) Read Structure Genome : acgtacatgcattcacacatgcttagctatctctcgccag

More information

Our Stage 1 genotype scan was performed using Illumina Human1 Beadarrays, which have a

Our Stage 1 genotype scan was performed using Illumina Human1 Beadarrays, which have a Supplementary Note Analysis of Stage 1 GWAS and design of the Stage 2 iselect array Our Stage 1 genotype scan was performed using Illumina Human1 Beadarrays, which have a gene-centric design, and Illumina

More information

Variation in PNPLA3 is associated with outcomes. in alcoholic liver disease

Variation in PNPLA3 is associated with outcomes. in alcoholic liver disease Variation in PNPLA3 is associated with outcomes in alcoholic liver disease Chao Tian 1, Renee P. Stokowski 1, David Kershenobich 2, Dennis G. Ballinger 1,3, David A. Hinds 1 1. Perlegen, 2021 Stierlin

More information

Analysis with SureCall 2.1

Analysis with SureCall 2.1 Analysis with SureCall 2.1 Danielle Fletcher Field Application Scientist July 2014 1 Stages of NGS Analysis Primary analysis, base calling Control Software FASTQ file reads + quality 2 Stages of NGS Analysis

More information

MOLECULAR EPIDEMIOLOGY Afiono Agung Prasetyo Faculty of Medicine Sebelas Maret University Indonesia

MOLECULAR EPIDEMIOLOGY Afiono Agung Prasetyo Faculty of Medicine Sebelas Maret University Indonesia MOLECULAR EPIDEMIOLOGY GENERAL EPIDEMIOLOGY General epidemiology is the scientific basis of public health Descriptive epidemiology: distribution of disease in populations Incidence and prevalence rates

More information

Supplementary Figure 1

Supplementary Figure 1 Supplementary Figure 1 Full inter-omic cross-sectional correlation network Statistically-significant inter-omic cross-sectional Spearman correlations (p adj

More information

BST227 Introduction to Statistical Genetics. Lecture 4: Introduction to linkage and association analysis

BST227 Introduction to Statistical Genetics. Lecture 4: Introduction to linkage and association analysis BST227 Introduction to Statistical Genetics Lecture 4: Introduction to linkage and association analysis 1 Housekeeping Homework #1 due today Homework #2 posted (due Monday) Lab at 5:30PM today (FXB G13)

More information

Title: Pinpointing resilience in Bipolar Disorder

Title: Pinpointing resilience in Bipolar Disorder Title: Pinpointing resilience in Bipolar Disorder 1. AIM OF THE RESEARCH AND BRIEF BACKGROUND Bipolar disorder (BD) is a mood disorder characterised by episodes of depression and mania. It ranks as one

More information

Title:Validation study of candidate single nucleotide polymorphisms associated with left ventricular hypertrophy in the Korean population

Title:Validation study of candidate single nucleotide polymorphisms associated with left ventricular hypertrophy in the Korean population Author's response to reviews Title:Validation study of candidate single nucleotide polymorphisms associated with left ventricular hypertrophy in the Korean population Authors: Jin-Kyu Park (cardiohy@gmail.com)

More information

Supplementary figures

Supplementary figures Supplementary figures Supplementary Figure 1: Quantile-quantile plots of the expected versus observed -logp values for all studies participating in the first stage metaanalysis. P-values were generated

More information

Office Practice Coding Assistance - Overview

Office Practice Coding Assistance - Overview Office Practice Coding Assistance - Overview Three office coding assistance resources are provided in the STABLE Resource Toolkit. Depression & Bipolar Coding Reference: n Provides ICD9CM and DSM-IV-TR

More information

Lab 5: Testing Hypotheses about Patterns of Inheritance

Lab 5: Testing Hypotheses about Patterns of Inheritance Lab 5: Testing Hypotheses about Patterns of Inheritance How do we talk about genetic information? Each cell in living organisms contains DNA. DNA is made of nucleotide subunits arranged in very long strands.

More information

An Introduction to Quantitative Genetics I. Heather A Lawson Advanced Genetics Spring2018

An Introduction to Quantitative Genetics I. Heather A Lawson Advanced Genetics Spring2018 An Introduction to Quantitative Genetics I Heather A Lawson Advanced Genetics Spring2018 Outline What is Quantitative Genetics? Genotypic Values and Genetic Effects Heritability Linkage Disequilibrium

More information

Drug Metabolism Disposition

Drug Metabolism Disposition Drug Metabolism Disposition The CYP2C19 intron 2 branch point SNP is the ancestral polymorphism contributing to the poor metabolizer phenotype in livers with CYP2C19*35 and CYP2C19*2 alleles Amarjit S.

More information

2007: Alcohol Use Across the Lifespan

2007: Alcohol Use Across the Lifespan Washington University School of Medicine Digital Commons@Becker Posters 2007: Alcohol Use Across the Lifespan 2007 DSM-IV nicotine withdrawal and alcohol dependence: Association findings with the Nicotinic

More information

The Impact of Relative Standards on the Propensity to Disclose. Alessandro Acquisti, Leslie K. John, George Loewenstein WEB APPENDIX

The Impact of Relative Standards on the Propensity to Disclose. Alessandro Acquisti, Leslie K. John, George Loewenstein WEB APPENDIX The Impact of Relative Standards on the Propensity to Disclose Alessandro Acquisti, Leslie K. John, George Loewenstein WEB APPENDIX 2 Web Appendix A: Panel data estimation approach As noted in the main

More information

ARIC Manuscript Proposal #2426. PC Reviewed: 9/9/14 Status: A Priority: 2 SC Reviewed: Status: Priority:

ARIC Manuscript Proposal #2426. PC Reviewed: 9/9/14 Status: A Priority: 2 SC Reviewed: Status: Priority: ARIC Manuscript Proposal #2426 PC Reviewed: 9/9/14 Status: A Priority: 2 SC Reviewed: Status: Priority: 1a. Full Title: Statin drug-gene interactions and MI b. Abbreviated Title: CHARGE statin-gene GWAS

More information

Rating Mental Impairment with AMA Guides 6 th edition:

Rating Mental Impairment with AMA Guides 6 th edition: Rating Mental Impairment with AMA Guides 6 th edition: Practical Considerations and Strategies CSME/CAPDA C-CAT Course, March 24, 2018 William H. Gnam, PhD, MD, FRCPC (william.gnam@gmail.com) Consultant

More information

MEDICAL GENOMICS LABORATORY. Next-Gen Sequencing and Deletion/Duplication Analysis of NF1 Only (NF1-NG)

MEDICAL GENOMICS LABORATORY. Next-Gen Sequencing and Deletion/Duplication Analysis of NF1 Only (NF1-NG) Next-Gen Sequencing and Deletion/Duplication Analysis of NF1 Only (NF1-NG) Ordering Information Acceptable specimen types: Fresh blood sample (3-6 ml EDTA; no time limitations associated with receipt)

More information

Warfarin Pharmacogenetics Reevaluated Subgroup Analysis Reveals a Likely Underestimation of the Maximum Pharmacogenetic Benefit by Clinical Trials

Warfarin Pharmacogenetics Reevaluated Subgroup Analysis Reveals a Likely Underestimation of the Maximum Pharmacogenetic Benefit by Clinical Trials AJCP /ORIGINAL ARTICLE Warfarin Pharmacogenetics Reevaluated Subgroup Analysis Reveals a Likely Underestimation of the Maximum Pharmacogenetic Benefit by Clinical Trials Gary Stack, MD, PhD, 1,2 and Carleta

More information

Effects of age-at-diagnosis and duration of diabetes on GADA and IA-2A positivity

Effects of age-at-diagnosis and duration of diabetes on GADA and IA-2A positivity Effects of age-at-diagnosis and duration of diabetes on GADA and IA-2A positivity Duration of diabetes was inversely correlated with age-at-diagnosis (ρ=-0.13). However, as backward stepwise regression

More information

Predicting Country of Origin from Genetic Data G. David Poznik

Predicting Country of Origin from Genetic Data G. David Poznik Predicting Country of Origin from Genetic Data G. David Poznik Introduction Genetic variation in Europe is spatially structured; similarity decays with geographic distance. The most striking visual manifestation

More information

RATING MENTAL WHOLE PERSON IMPAIRMENT UNDER THE NEW SABS: New Methods, New Challenges. CSME/CAPDA Conference, April 1, 2017

RATING MENTAL WHOLE PERSON IMPAIRMENT UNDER THE NEW SABS: New Methods, New Challenges. CSME/CAPDA Conference, April 1, 2017 RATING MENTAL WHOLE PERSON IMPAIRMENT UNDER THE NEW SABS: New Methods, New Challenges CSME/CAPDA Conference, April 1, 2017 William H. Gnam, PhD, MD, FRCPC (william.gnam@gmail.com) Consultant Psychiatrist

More information

# For the GWAS stage, B-cell NHL cases which small numbers (N<20) were excluded from analysis.

# For the GWAS stage, B-cell NHL cases which small numbers (N<20) were excluded from analysis. Supplementary Table 1a. Subtype Breakdown of all analyzed samples Stage GWAS Singapore Validation 1 Guangzhou Validation 2 Guangzhou Validation 3 Beijing Total No. of B-Cell Cases 253 # 168^ 294^ 713^

More information

Self reported ethnicity

Self reported ethnicity Self reported ethnicity Supplementary Figure 1 Ancestry stratifies patterns of human genetic variations. PCA plots (1 st, 2 nd and 3 rd components) estimated from human genotypes. Individuals are coloured

More information

Nature Genetics: doi: /ng Supplementary Figure 1. Study design.

Nature Genetics: doi: /ng Supplementary Figure 1. Study design. Supplementary Figure 1 Study design. Leukopenia was classified as early when it occurred within the first 8 weeks of thiopurine therapy and as late when it occurred more than 8 weeks after the start of

More information

Introduction to genetic variation. He Zhang Bioinformatics Core Facility 6/22/2016

Introduction to genetic variation. He Zhang Bioinformatics Core Facility 6/22/2016 Introduction to genetic variation He Zhang Bioinformatics Core Facility 6/22/2016 Outline Basic concepts of genetic variation Genetic variation in human populations Variation and genetic disorders Databases

More information

Handling Immunogenetic Data Managing and Validating HLA Data

Handling Immunogenetic Data Managing and Validating HLA Data Handling Immunogenetic Data Managing and Validating HLA Data Steven J. Mack PhD Children s Hospital Oakland Research Institute 16 th IHIW & Joint Conference Sunday 3 June, 2012 Overview 1. Master Analytical

More information

Supplementary Methods

Supplementary Methods Supplementary Methods Populations ascertainment and characterization Our genotyping strategy included 3 stages of SNP selection, with individuals from 3 populations (Europeans, Indian Asians and Mexicans).

More information

Summary ID#7029. Clinical Study Summary: Study F1D-MC-HGKQ

Summary ID#7029. Clinical Study Summary: Study F1D-MC-HGKQ CT Registry ID# 7029 Page 1 Summary ID#7029 Clinical Study Summary: Study F1D-MC-HGKQ Clinical Study Report: Versus Divalproex and Placebo in the Treatment of Mild to Moderate Mania Associated with Bipolar

More information

Accurate Diagnosis of Primary Psychotic Disorders

Accurate Diagnosis of Primary Psychotic Disorders Accurate Diagnosis of Primary Psychotic Disorders The Care Transitions Network National Council for Behavioral Health Montefiore Medical Center Northwell Health New York State Office of Mental Health Netsmart

More information

The genetics of complex traits Amazing progress (much by ppl in this room)

The genetics of complex traits Amazing progress (much by ppl in this room) The genetics of complex traits Amazing progress (much by ppl in this room) Nick Martin Queensland Institute of Medical Research Brisbane Boulder workshop March 11, 2016 Genetic Epidemiology: Stages of

More information

Association Between F9 Malmö, Factor IX And Deep Vein Thrombosis

Association Between F9 Malmö, Factor IX And Deep Vein Thrombosis G T G A G A T G A T A T T T C G A A G A A T A A A G A T G C C C T G G C T T T G G C T T G A T C T C T G G T A C C T T A T G T T T A A A G A A G G A T G G G A A Association Between F9 Malmö, Factor IX And

More information

Identifying Youth at Clinical High Risk for Psychosis

Identifying Youth at Clinical High Risk for Psychosis Identifying Youth at Clinical High Risk for Psychosis Jean Addington PhD University of Calgary Department of Psychiatry 1 Identifying Youth at Clinical High Risk for Psychosis Part 1: What do we know about

More information

Supplementary Online Content

Supplementary Online Content Supplementary Online Content Jones HJ, Stergiakouli E, Tansey KE, et al. Phenotypic manifestation of genetic risk for schizophrenia during adolescence in the general population. JAMA Psychiatry. Published

More information

ARTICLE Fine Mapping on Chromosome 10q22-q23 Implicates Neuregulin 3 in Schizophrenia

ARTICLE Fine Mapping on Chromosome 10q22-q23 Implicates Neuregulin 3 in Schizophrenia ARTICLE Fine Mapping on Chromosome 10q22-q23 Implicates Neuregulin 3 in Schizophrenia Pei-Lung Chen, 1,2,6 Dimitrios Avramopoulos, 1,3,6 Virginia K. Lasseter, 3 John A. McGrath, 3 M. Daniele Fallin, 4,5

More information

DEFINITIONS OF HISTOCOMPATIBILITY TYPING TERMS

DEFINITIONS OF HISTOCOMPATIBILITY TYPING TERMS DEFINITIONS OF HISTOCOMPATIBILITY TYPING TERMS The definitions below are intended as general concepts. There will be exceptions to these general definitions. These definitions do not imply any specific

More information

On Missing Data and Genotyping Errors in Association Studies

On Missing Data and Genotyping Errors in Association Studies On Missing Data and Genotyping Errors in Association Studies Department of Biostatistics Johns Hopkins Bloomberg School of Public Health May 16, 2008 Specific Aims of our R01 1 Develop and evaluate new

More information

BST227: Introduction to Statistical Genetics

BST227: Introduction to Statistical Genetics BST227: Introduction to Statistical Genetics Lecture 11: Heritability from summary statistics & epigenetic enrichments Guest Lecturer: Caleb Lareau Success of GWAS EBI Human GWAS Catalog As of this morning

More information