FUNCTIONAL GENE SETS IN POST-TRAUMATIC STRESS DISORDER

Similar documents
Comparison of Gene Set Analysis with Various Score Transformations to Test the Significance of Sets of Genes

Whole liver transcriptome analysis for the metabolic adaptation of dairy cows

How To Use MiRSEA. Junwei Han. July 1, Overview 1. 2 Get the pathway-mirna correlation profile(pmset) and a weighting matrix 2

Package GANPAdata. February 19, 2015

PCxN: The pathway co-activity map: a resource for the unification of functional biology

chapter 1 - fig. 2 Mechanism of transcriptional control by ppar agonists.

Package inote. June 8, 2017

Gene Ontology 2 Function/Pathway Enrichment. Biol4559 Thurs, April 12, 2018 Bill Pearson Pinn 6-057

Single SNP/Gene Analysis. Typical Results of GWAS Analysis (Single SNP Approach) Typical Results of GWAS Analysis (Single SNP Approach)

Title: Human breast cancer associated fibroblasts exhibit subtype specific gene expression profiles

Supplementary Methods

PathAct: a novel method for pathway analysis using gene expression profiles

NIH Public Access Author Manuscript Int J Cardiol. Author manuscript; available in PMC 2014 November 20.

Efficacy of the Extended Principal Orthogonal Decomposition Method on DNA Microarray Data in Cancer Detection

Journal Club Semmler Lorenz

Comparison of open chromatin regions between dentate granule cells and other tissues and neural cell types.

Package AbsFilterGSEA

Gene and Pathway Analysis of Metabolic Traits in Dairy Cows

Comparison of Invariant NKT Cells with Conventional T Cells by Using Gene Set Enrichment Analysis (GSEA)

Predication-based Bayesian network analysis of gene sets and knowledge-based SNP abstractions

Cell subset prediction for blood genomic studies

Gene Set Enrichment Analysis: A Knowledge-Based Approach for Interpreting Genome-wide Expression Profiles

Nature Immunology: doi: /ni Supplementary Figure 1. RNA-Seq analysis of CD8 + TILs and N-TILs.

A COMBINATORY ALGORITHM OF UNIVARIATE AND MULTIVARIATE GENE SELECTION

Patnaik SK, et al. MicroRNAs to accurately histotype NSCLC biopsies

HHS Public Access Author manuscript IJC Metab Endocr. Author manuscript; available in PMC 2017 October 20.

Integrative Omics for The Systems Biology of Complex Phenotypes

fasting blood glucose [mg/dl]

TITLE: TREATMENT OF ENDOCRINE-RESISTANT BREAST CANCER WITH A SMALL MOLECULE C-MYC INHIBITOR

Package propoverlap. R topics documented: February 20, Type Package

SUPPLEMENTARY INFORMATION

High AU content: a signature of upregulated mirna in cardiac diseases

SUPPLEMENTARY APPENDIX

Supporting Information

Expanded View Figures

Digitizing the Proteomes From Big Tissue Biobanks

SCIENCE CHINA Life Sciences. Screening and regulatory network analysis of survival-related genes of patients with colorectal cancer

Using dualks. Eric J. Kort and Yarong Yang. October 30, 2018

Stress and Disease. Chapter 8. Elsevier items and derived items 2008 by Mosby, Inc., an affiliate of Elsevier Inc.

Predictive Biomarkers

A quick review. The clustering problem: Hierarchical clustering algorithm: Many possible distance metrics K-mean clustering algorithm:

Detecting gene signature activation in breast cancer in an absolute, single-patient manner

SUPPLEMENTARY FIGURE LEGENDS

38 Int'l Conf. Bioinformatics and Computational Biology BIOCOMP'16

Nature Methods: doi: /nmeth.3115

Multiplexed Cancer Pathway Analysis

Post-Traumatic Stress Disorder (PTSD) Among People Living with HIV

SubLasso:a feature selection and classification R package with a. fixed feature subset

Pathways of changes in hypermethylation in F1 offspring exposed to intrauterine hyperglycemia

Breast cancer. Risk factors you cannot change include: Treatment Plan Selection. Inferring Transcriptional Module from Breast Cancer Profile Data

Nature Genetics: doi: /ng Supplementary Figure 1. SEER data for male and female cancer incidence from

ONCOLOGY REPORTS 29: , 2013

Introduction to Gene Sets Analysis

Data complexity measures for analyzing the effect of SMOTE over microarrays

silent epidemic,. (WHO),

Discovery of Novel Human Gene Regulatory Modules from Gene Co-expression and

MASTER REGULATORS USED AS BREAST CANCER METASTASIS CLASSIFIER *

OncoPPi Portal A Cancer Protein Interaction Network to Inform Therapeutic Strategies

The use of random projections for the analysis of mass spectrometry imaging data Palmer, Andrew; Bunch, Josephine; Styles, Iain

NIH Public Access Author Manuscript Best Pract Res Clin Haematol. Author manuscript; available in PMC 2010 June 1.

Classification of cancer profiles. ABDBM Ron Shamir

Comments on Significance of candidate cancer genes as assessed by the CaMP score by Parmigiani et al.

Nature Immunology: doi: /ni Supplementary Figure 1. Huwe1 has high expression in HSCs and is necessary for quiescence.

Micro-RNA web tools. Introduction. UBio Training Courses. mirnas, target prediction, biology. Gonzalo

7SK ChIRP-seq is specifically RNA dependent and conserved between mice and humans.

Curriculum Vitae. Curriculum Vitae

Gene Selection for Tumor Classification Using Microarray Gene Expression Data

Gene Ontology and Functional Enrichment. Genome 559: Introduction to Statistical and Computational Genomics Elhanan Borenstein

Interpreting Diverse Genomic Data Using Gene Sets

Data analysis in microarray experiment

The Immune System: The Mind Body Connection. Presented by Margaret Kemeny, Ph.D. Department of Psychiatry, University of California, San Francisco

fl/+ KRas;Atg5 fl/+ KRas;Atg5 fl/fl KRas;Atg5 fl/fl KRas;Atg5 Supplementary Figure 1. Gene set enrichment analyses. (a) (b)

Examination of Mechanisms of Hepatotoxicity of Anti-diabetic PPARγ Agonists Using Applied Biosystems Rat Whole Genome Microarrays

Reactions to Trauma and Clinical Treatment for PTSD

WHATEVER IT TAKES! UNDERSTANDING ADVERSE CHILD EXPERIENCES. Nadine Burke Harris, MD, MPH CEO, Center for Youth Wellness March 1, 2013

DeSigN: connecting gene expression with therapeutics for drug repurposing and development. Bernard lee GIW 2016, Shanghai 8 October 2016

The epigenetic landscape of T cell subsets in SLE identifies known and potential novel drivers of the autoimmune response

SUPPLEMENTARY FIGURES

Practical Experience in the Analysis of Gene Expression Data

Nature Immunology: doi: /ni Supplementary Figure 1. Transcriptional program of the TE and MP CD8 + T cell subsets.

Supplemental Figure S1. RANK expression on human lung cancer cells.

Title: Pathway-Based Classification of Cancer Subtypes

Statistical Assessment of the Global Regulatory Role of Histone. Acetylation in Saccharomyces cerevisiae. (Support Information)

A Strategy for Identifying Putative Causes of Gene Expression Variation in Human Cancer

* To whom correspondence should be addressed.

Supplementary. properties of. network types. randomly sampled. subsets (75%

OVARIAN CANCER POSSIBLE TUMOR-SUPPRESSIVE ROLE OF BATF2 IN OVARIAN CANCER

Trauma and Stress: Neurobiology and the Impact on Development

SERUM CYTOKINES ARE ELEVATED IN PATIENTS WITH IRRITABLE BOWEL SYNDROME (IBS) BUT LARGELY UNRELATED TO SYMPTOM CHARACTERISTICS

Supplementary data Table S3. GO terms, pathways and networks enriched among the significantly correlating genes using Tox-Profiler

Supplementary Online Content

Application of Artificial Neural Networks in Classification of Autism Diagnosis Based on Gene Expression Signatures

PREPARED FOR: U.S. Army Medical Research and Materiel Command Fort Detrick, Maryland

Study Guide Unit 3 Psych 2022, Fall 2003

Copyright 2015 American College of Cardiology Foundation

Clinical Relevance of Biological Alterations in PTSD. Rachel Yehuda, PhD Mount Sinai School of Medicine New York, NY

Estimating the Number of Clusters in DNA Microarray Data

Molecular Heterogeneity of High Gleason Prostate Cancer

GENERAL SUMMARY Corpus luteum is a transient endocrine structure formed from the ruptured ovarian follicle. Its main function is to secrete P 4, a pro

Karthik Ravi¹ and Inhan Lee². Journal of

Transcription:

PROCEEDINGS OF THE YEREVAN STATE UNIVERSITY C h e m i s t r y a n d B i o l o g y 2016, 1, p. 43 48 B i o l o g y FUNCTIONAL GENE SETS IN POST-TRAUMATIC STRESS DISORDER A. A. ARAKELYAN Institute of Molecular Biology NAS Republic of Armenia In the study principal component analysis and gene set enrichment analysis was carried out to evaluate genome wide expression and identify functional gene sets affected in peripheral blood mononuclear cells of patients with post-traumatic stress disorder and animal models of the disease. The results obtained indicate the significance of the genetic component in the pathogenesis of post-traumatic stress disorder, and provide strong evidence for the involvement of chronic inflammation, neuronal and cytokine/growth factor signaling pathways, in disease development and progression. In addition, the results obtained demonstrated that combination of principal component analysis with gene set enrichment analysis is an efficient strategy for discrimination of phenotype related gene expression variability from other factors. Keywords: gene principal component, gene set enrichment, post-traumatic stress disorder. Introduction. Post traumatic stress disorder (PTSD) is a complex, severe and chronic psychiatric illness, an anxiety disorder (DSM-IV-TR code: 309.81) that may develop in a person, who has experienced, witnessed, or learned about a terrifying event or ordeal, in which grave physical harm occurred or was threatened. As estimated 70% of adults have experienced a traumatic event at least once in their life and up to 20% of these people go on to develop PTSD. It is predicted that in the near future 8 10% of the population will suffer PTSD at some time in their lives [1, 2]. The molecular pathomechanisms of PTSD are not well defined and only beginning to be understood and this lack of knowledge hampers our ability to find superior therapeutic approaches to the treatment of this disorder. Promising findings suggest that both environment and genetic factors are involved in PTSD-generation mechanisms, and that neuronal, endocrine, and immune alterations might be in a sufficient degree responsible for disease progression [2, 3]. However, due to insufficiency of relevant data, a molecular picture of generation and development of PTSD is yet unclear. Genome wide expression analysis with DNA microarrays has become a mainstay of genomics research [4, 5]. The challenge no longer lies in obtaining gene expression profiles, but rather in interpreting the results to gain insight the E-mail: aarakelyan@sci.am

44 Proc. of the Yerevan State Univ. Chemistry and Biology, 2016, 1, p. 43 48. molecular basis of diseased conditions. The common approach is to find a number of differentially and highly expressed genes responsible for the generation and progression of pathological changes. However, this kind of analysis has several limitations, from which the important one is that the information on diseaseassociated alterations in metabolic, regulatory or signaling pathways may be missed. Recently introduced Gene Set Enrichment Analysis (GSEA), which is designed to evaluate functional gene sets, overcomes these limitations [6]. Further extension of this approach may be its combination with the Principal Component Analysis (PCA). It is obvious, that variation in gene expression may be the result of combined action of several factors, often not connected to the phenotype of interest. PCA allows discrimination between these effects, as it is a linear dimensionality reduction technique, which identifies statistically uncorrelated orthogonal directions Principal Components (PC) of maximum variance in the original data. The component loadings of each gene may show to what extent differences in gene expression contribute to phenotypic diversity. Here we applied combined PCA and GSEA techniques to evaluate the functional gene sets affected in peripheral blood mononuclear cells (PBMC) of patients with PTSD at both early and advanced stages of the disease. In addition, we conducted comparative analysis of PTSD-related gene expression patterns in human and mouse model. Materials and Methods. Human and Mouse Datasets. We use two datasets (GDS1020 and GSE8870) available in the Gene Expression Omnibus repository (www.ncbi.nlm.nih.gov/geo/). Description of samples is available in the dataset summaries. From human dataset probes that had at least 50% of present calls were chosen. For the mouse dataset probes containing less that 50% of missing values were chosen. The probe identifiers from each platform were converted to the HUGO-approved gene symbols, averaging log transformed expression values of multiple probes targeting the same gene. This resulted in 4545 and 32205 genes for human and mouse datasets respectively. Principal Component Analysis and Phenotype Matching for Human Dataset. PCA was performed using MATLAB 7.13 software (Mathworks Inc., USA). We identified components, PCs that most accurately classify samples according to their phenotypes (PTSD vs. Controls). For that purpose, scores and phenotypes were assigned binary values (scores: positive 1, negative 0; phenotypes: PTSD 1, Control 0) to obtain 2 2 table. Further, the significance of phenotype differences in gene expression for each component was tested by Fisher s exact test. p < 0.05 was considered as significant. For PCs showing significant association with phenotypes we generated gene lists ranked from most positive to most negative factor loadings. Gene Set Enrichment Analysis. GSEA analysis was performed using GSEA 2.0 software [6]. As data source pre-ranked list of human genes and mouse gene expression dataset were used. This application scores a sorted list of genes with respect to their enrichment of selected functional categories (KEGG, Biocarta and GO). The significance of the enrichment score was assessed using 1000 permutations. For multiple testing adjustments, Benjamini and Hochberg s false discovery rate (FDR) < 25% was used. p < 0.05 (after FDR correction) were considered as significant.

Arakelyan A. Functional Gene Sets in Post-Traumatic Stress Disorder. 45 Results and Discussion. First, we analyzed the gene expression profiles of PBMCs in PTSD affected subjects and controls. PCA reduced sample complexity of 4545 patterns of gene expression are available for PBMCs in the dataset, by grouping them into the sets of 32 orthogonal factors or PCs (Fig. 1, A). Fig. 1. The percentage of gene expression variability explained by PCs (A) and classification of phenotypes (PTSD vs. Controls) according to the first PC (B). Next, we identified PCs associated with disease phenotype using Fisher s exact test. The results showed that the first PC associate with disease phenotype (Fig. 1, B; Tab. 1). Positive PC scores were associated with PTSD samples, whereas negative PC scores were associated with Control samples. Table 1 Phenotype association with PC1 score and accuracy of classification Phenotype Control PTSD Note: * ** Component score * negative 14 (87.5) 5 (29.4) * positive 2 (12.5) ** 12 (70.6) data presented as count (percentage); p = 0.001 (two tailed Fisher s exact test). In order to evaluate the biological significance of the identified PC, we performed GSEA using GSEA v 2.0 software [6]. Since the ranked gene lists were sorted in descending order, the top-enriched categories can be considered as upregulated in PTSD and the opposite is true in case of down-regulated categories. The analysis revealed the 11 up-regulated functional gene sets at padjusted < 0.05 (Tab. 2) involved in inflammation, extracellular signaling and metabolism. For identification of genes, contributing to several pathways, we have performed leading edge analysis. Overall 356 genes contributed to the leading edge of up- and down-regulated datasets of which 29 genes appeared in two or more gene sets (Fig. 2). HRAS and MAPK1 were the most frequent genes and appeared in 11 and 10 gene sets respectively.

46 Proc. of the Yerevan State Univ. Chemistry and Biology, 2016, 1, p. 43 48. T a b l e 2 The biological significance of PCs related to PTSD Category Number of genes in gene set p adjusted Condition Cell adhesion molecules 42 0.005 up-regulated Hematopoietic cell lineage 47 0.018 up-regulated Cytokine cytokine receptor interaction 65 0.050 up-regulated ECM receptor interaction 20 0.076 up-regulated Type I diabetes mellitus 20 0.078 up-regulated Cell communication 17 0.076 up-regulated EPO pathway 16 0.043 up-regulated NGF pathway 16 0.079 up-regulated IL2 pathway 19 0.041 up-regulated Type II diabetes mellitus 17 0.075 up-regulated Ribosome 46 0.004 up-regulated The analysis of brain gene expression patterns in mouse models of PTSD revealed 74 significantly up-regulated gene sets (KEGG and Biocarta) involved in neurotransmission, extracellular signaling pathways, inflammation and general metabolic processes. The identified functional gene sets included the majority of categories up-regulated in PMBCs isolated from blood of PTSD subjects (Tab. 3). Fig. 2. Genes appearing in two or more gene sets. The results obtained demonstrated that PBMCs of PTSD affected subjects bear specific hallmark of gene expression compared with non-affected subjects. Importantly, our data showed significant overlap of functional sets between human and animal model data, which indicates their importance in the development and progression of PTSD. Our findings are in line with previous data indicating the involvement of neuro immune endocrine system in pathogenesis of this disease. Characteristic suppression of the hypothalamic-pituitary adrenal axis in sustained PTSD suppresses the action of endogenous cortisol on proinflammatory cytokine production [7, 8]. Recently, it was shown that the levels of several inflammatory cytokines, CRP and other inflammatory molecules are increased in the blood of PTSD patients, as well as changes in WBC subpopulations have been noticed

Arakelyan A. Functional Gene Sets in Post-Traumatic Stress Disorder. 47 [9 11]. This persisting chronic inflammation may, in turn, affect the function of central nervous system. It is known that the pro-inflammatory cytokines IL-6 and TNF-, together with IL-1 are the most important mediators of the so-called sickness response, which is described as a series of depression like symptoms that occur during peripheral immune system activation [12]. Functional gene sets up-regulated in the mouse model of PTSD similar to gene sets identified in human T a b l e 3 Category Number of genes in gene set p adjusted Condition Cell adhesion molecules 101 0.000 up-regulated Cytokine cytokine receptor interaction 216 0.000 up-regulated Type I diabetes mellitus 42 0.000 up-regulated Neuroactive ligand receptor interaction 224 0.004 up-regulated Lysine degradation 42 0.004 up-regulated ECM receptor interaction 81 0.006 up-regulated Cell communication 78 0.012 up-regulated IL2 RB pathway 34 0.018 up-regulated VEGF signaling pathway 66 0.037 up-regulated The PCA technique allows reducing the overall variability of microarray data to a relatively small number of components with some biological meanings [13, 14]. We demonstrated that with this approach it is possible to identify components that are responsible for disease-related changes in gene expression. In this study, we have found that only 30% of gene expression variability was related to the development of PTSD. The remaining variability may be defined by a number of other factors, including heterogeneity of cell populations, age, gender, ethnic origin, etc. GSEA directly scores pre-defined gene sets for differential expression and especially aims to identify gene sets with subtle but coordinated expression changes that cannot be detected by cutoff methods (for review see 15). The key principle is that even weak expression changes in individual genes gathered to a large gene set can show a significant pattern. By changing the focus from individual genes to a set of genes or pathways, the GSEA approach enables understanding of cellular processes as an intricate network of functionally related components [15]. Conclusion. In this paper we used two-step approach for evaluation of genome wide expression associated with PTSD in human subjects and animal models. In the first step PCA was applied to identify the contribution of gene expression changes to PTSD phenotype, and at the second step GSEA was used to target functional gene sets related to disease phenotype. The results of the present study clearly indicate the significance of the genetic component in the pathogenesis of PTSD, and provide strong evidence for involving alterations of neuronal, endocrine and immune system at the gene expression level. In addition, the results of the present study demonstrate that the combination of PCA with GSEA is an efficient tool to discriminate phenotype related gene expression variability from other influencing factors and to have a more complex outlook into molecular mechanisms underlying a pathological condition.

48 Proc. of the Yerevan State Univ. Chemistry and Biology, 2016, 1, p. 43 48. In conclusion, our results give biological insights into the molecular processes occurring in PTSD and introduce a combined approach to discriminate phenotype effects on gene expression from influencing factors. Received 02.02.2016 R E F E R E N C E S 1. Jacobs W.J., Dalenberg C. Subtle Presentations of Post-Traumatic Stress Disorder: Diagnostic Issues. // Psychiatr. Clin. North Am., 1988, v. 21, p. 835 845. 2. Connor M.D., Butterfield M.I. Post-Traumatic Stress Disorder. // FOCUS, 2003, v. 1, p. 247 262. 3. Gitlin J.M. Study Identifies Gene X Environment Link to PTSD. // JAMA, 2008, v. 299, p. 1291 1305. 4. Schena M., Shalon D., Davis R.W., Brown P.O. Quantitative Monitoring of Gene Expression Patterns with a Complementary DNA Microarray. // Science, 1995, 270, p. 467 470. 5. Ito T., Sakaki Y. Toward Genome-Wide Scanning of Gene Expression: A Functional Aspect of the Genome Project. // Essays Biochem., 1996, v. 31, p. 11 21. 6. Subramanian A., Tamayo P., Mootha V.K., Mukherjee S., Ebert B.L., Gillette M.A., Paulovich A., Pomeroy S.L., Golub T.R., Lander E.S., Mesirov J.P. Gene Set Enrichment Analysis: A Knowledge-Based Approach for Interpreting Genome-Wide Expression Profiles. // Proc. Natl Acad. Sci. USA., 2005, v. 102, p. 15545 15550. 7. Yehuda R., Giller E.L., Southwick S.M., Lowy M.T., Mason J.W. Hypothalamic-Pituitary- Adrenal Dysfunction in Posttraumatic Stress Disorder. // Biol. Psychiatry, 1991, v. 30, p. 1031 1048. 8. Gander M.L., von Känel R. Myocardial Infarction and Post-Traumatic Stress Disorder: Frequency, Outcome, and Atherosclerotic Mechanisms. // Eur. J. Cardiovasc. Prev. Rehabil., 2006, v. 13, p. 165 172. 9. von Känel R., Hepp U., Kraemer B., Traber R., Keel M., Mica L., Schnyder U. Evidence For Low-Grade Systemic Proinflammatory Activity in Patients with Posttraumatic Stress Disorder. // J. Psychiatr. Res., 2007, v. 41, p. 744 752. 10. Spitzer C., Barnow S., Völzke H., Wallaschofski H., John U., Freyberger H.J., Löwe B., Grabe H.J. Association of Posttraumatic Stress Disorder with Low-Grade Elevation of C-Reactive Protein: Evidence from the General Population. // J. Psychiatr. Res., 2010, v. 44, p. 15 21. 11. Wilson S.N., van der Kolk B., Burbridge J., Fisler R., Kradin R. Phenotype of Blood Lymphocytes in PTSD Suggests Chronic Immune Activation. // Psychosomatics, 1999, v. 40, p. 222 225. 12. Watkins L.R., Maier S.F., Goehler L.E. Immune Activation: The Role of Pro-Inflammatory Cytokines in Inflammation, Illness Responses and Pathological Pain States. // Pain., 1995, v. 63, p. 289 302. 13. Yeung K.Y., Ruzzo W.L. Principal Component Analysis for Clustering Gene Expression Data. // Bioinformatics, 2001, v. 17, p. 763 774. 14. Fehrmann R.S., de Jonge H.J., Ter Elst A., de Vries A., Crijns A.G., Weidenaar A.C., Gerbens F., de Jong S., van der Zee A.G., de Vries E.G., Kamps W.A., Hofstra R.M., Te Meerman G.J., de Bont E.S. A New Perspective on Transcriptional System Regulation (TSR): Towards TSR Profiling. // PLoS One, 2008, v. 3, p. e1656. 15. Nam D., Kim S.Y. Gene-Set Approach for Expression Pattern Analysis. // Brief Bioinform., 2008, v. 9, p. 189 197.