The Cancer Genome Atlas Project Overview

Similar documents
Knowledge networks of biological and medical data An exhaustive and flexible solution to model life sciences domains

Session 6: Integration of epigenetic data. Peter J Park Department of Biomedical Informatics Harvard Medical School July 18-19, 2016

Computer Science, Biology, and Biomedical Informatics (CoSBBI) Outline. Molecular Biology of Cancer AND. Goals/Expectations. David Boone 7/1/2015

New Opportunities in Cancer Research National Academy of Sciences Advancing Health Research in Poland and the United States December 3, 2009

ChromHMM Tutorial. Jason Ernst Assistant Professor University of California, Los Angeles

Chromatin marks identify critical cell-types for fine-mapping complex trait variants

Genes, Aging and Skin. Helen Knaggs Vice President, Nu Skin Global R&D

The Cancer Genome Atlas & International Cancer Genome Consortium

BST227: Introduction to Statistical Genetics

Session 4 Rebecca Poulos

Inner Life of a Cell. Harvard Fall Video. ml

Session 2: Biomarkers of epigenetic changes and their applicability to genetic toxicology

Nature vs nurture: Epigenetics

Session 4 Rebecca Poulos

The Curing AI for Precision Medicine. Hoifung Poon

Epigenetics. Jenny van Dongen Vrije Universiteit (VU) Amsterdam Boulder, Friday march 10, 2017

Cancer Immunotherapy Artificial Intelligence: Market Shares, Strategies, and Forecasts, 2017 to 2023

Understanding DNA Copy Number Data

LESSON 3.2 WORKBOOK. How do normal cells become cancer cells? Workbook Lesson 3.2

Transformation of Normal HMECs (Human Mammary Epithelial Cells) into Metastatic Breast Cancer Cells: Introduction - The Broad Picture:

Gene Regulation Part 2

Epigenetic drift in aging twins

Analysis with SureCall 2.1

Latest Press Release. pancreatic cancer icd10

Building the Ohio Innovation Economy

PERSONALIZED GENETIC REPORT CLIENT-REPORTED DATA PURPOSE OF THE X-SCREEN TEST

Developing New Therapies to Treat Glioblastoma

WHEN DO MUTATIONS OCCUR?

Computational Analysis of UHT Sequences Histone modifications, CAGE, RNA-Seq

Liposarcoma*Genome*Project*

Deploying the full transcriptome using RNA sequencing. Jo Vandesompele, CSO and co-founder The Non-Coding Genome May 12, 2016, Leuven

The Cancer Genome Atlas

Cancer Epigenetics: Methods And Protocols (Methods In Molecular Biology) READ ONLINE

Introduction to Systems Biology of Cancer Lecture 2

Introduction to Cancer Bioinformatics and cancer biology. Anthony Gitter Cancer Bioinformatics (BMI 826/CS 838) January 20, 2015

The Cancer Genome Atlas Pan-cancer analysis Katherine A. Hoadley

Advances in Breast Cancer Research: Genetics, Biology, and Clinical Applications

Identifying Novel Targets for Non-Small Cell Lung Cancer Just How Novel Are They?

September 20, Submitted electronically to: Cc: To Whom It May Concern:

Temple University Department of Biology College of Science & Technology. Biology 3368

An epigenetic approach to understanding (and predicting?) environmental effects on gene expression

Mutations in epigenetic regulators including SETD2 are gained during relapse in pediatric acute lymphoblastic leukemia

TUMOR-SUPPRESSOR GENES. Molecular Oncology Michael Lea

Einführung in die Genetik

HST.161 Molecular Biology and Genetics in Modern Medicine Fall 2007

Take-Home Final Exam: Mining Regulatory Modules from Gene Expression Data

The BRCA Exchange: Global Data Sharing and Knowledge Exchange to Enable Accurate Clinical Care

UNIVERSITY OF CALIFORNIA, LOS ANGELES

The 16th KJC Bioinformatics Symposium Integrative analysis identifies potential DNA methylation biomarkers for pan-cancer diagnosis and prognosis

Fragile X Syndrome. Genetics, Epigenetics & the Role of Unprogrammed Events in the expression of a Phenotype

Blast Searcher Formative Evaluation. March 02, Adam Klinger and Josh Gutwill

Day 0 Sunday, July 8: Arrivals. Day 1 Monday, July 9: Introduction, Fundamentals, and Germline Variation. *As of 6/29/18

Transcriptional control in Eukaryotes: (chapter 13 pp276) Chromatin structure affects gene expression. Chromatin Array of nuc

EPIGENOMICS PROFILING SERVICES

BIO360 Fall 2013 Quiz 1

Modeling gene expression using five histone modifications

Chapter 1 : Genetics 101

Genetics and Genomics in Medicine Chapter 6 Questions

Nature Methods: doi: /nmeth.3115

StratomeX Visual Analysis of Large-Scale Heterogeneous Genomics Data for Cancer Subtype Characterization

Integrated genomic analyses of ovarian carcinoma

Mapping by recurrence and modelling the mutation rate

Precision medicine: How to exploit the growing knowledge on the evolving genomes of cells to improve cancer prevention and therapy.

Surveillance and SEER Where are we going? NAACCR Meeting June 23, 2017 Lynne Penberthy MD, MPH

Epigenetic programming in chronic lymphocytic leukemia

Cancer Informatics Lecture

Title:DNA Methylation Subgroups and the CpG Island Methylator Phenotype in Gastric Cancer: A Comprehensive Profiling Approach

Regulation of Gene Expression in Eukaryotes

Plenary Session 1: Genomics Session Chair: David Malkin, The Hospital for Sick Children, Toronto, Ontario, Canada 8:00 a.m. 10:00 a.m.

p53 and Apoptosis: Master Guardian and Executioner Part 2

Making_It_Personal Using_DNA_to_Tailor_Cancer_Treatments English mp4_

Facts from text: Automated gene annotation with ontologies and text-mining

R. Piazza (MD, PhD), Dept. of Medicine and Surgery, University of Milano-Bicocca EPIGENETICS

GENETIC TESTING: WHAT DOES IT REALLY TELL YOU? Lori L. Ballinger, MS, CGC Licensed Genetic Counselor University of New Mexico Cancer Center

PTM Discovery Method for Automated Identification and Sequencing of Phosphopeptides Using the Q TRAP LC/MS/MS System

Clinical NLP, PubGene Clinical trials in Coremine Oncology Text processing and information extraction for surgery planning form

Understanding Signaling Pathways by Modifying Sensitivity to PLX4720 in B-RAF V600E Melanoma

RNA-Seq Atlas of Glycine max: A guide to the Soybean Transcriptome

PRECISION INSIGHTS. GPS Cancer. Molecular Insights You Can Rely On. Tumor-normal sequencing of DNA + RNA expression.

Rett Syndrome Research Trust Awards $6.2 Million in 2017 to Speed a Cure for Rett Syndrome

Confronting the Complexity of Cancer

EPIGENETICS AND MENTAL HEALTH. William J. Walsh, Ph.D.

Epigenetics: Why you don t have teeth in your eyeballs

ccfdna Webinar Series: The Basics and Beyond

High resolution melting for methylation analysis

Clinical and Pathological Significance of Epigenomic Changes in Colorectal Cancer

10/27/2013. Biological Insights from Genetics of Rheumatoid Arthritis Contribute to Drug Discovery. Yukinori Okada, MD, PhD.

A Versatile Algorithm for Finding Patterns in Large Cancer Cell Line Data Sets

Epigenetics: Basic Principals and role in health and disease

Not IN Our Genes - A Different Kind of Inheritance.! Christopher Phiel, Ph.D. University of Colorado Denver Mini-STEM School February 4, 2014

2/21/2016. Cancer Precision Medicine: A Primer. Ovarian Cancer Statistics and Standard of Care in 2015 OUTLINE. Background

You Are What Your Mother Ate: The Science of Epigenetics

Ovarian Cancers: Evolving Paradigms in Research and Care Report Release Wednesday, March 2, 2016

Urine Biomarkers for Prostate Cancer Detection

Integrating Genome and Functional Genomics Data to Reveal Perturbed Signaling Pathways in Ovarian Cancers

Karyotypes Detect Chromosome Mutations

Biomedical resources for text mining

Beads-on-a- The 30nm Fibre Active Chromosome The Metaphase Chromosome. Less active genes During interphase During cell division

STRUCTURAL CHROMOSOMAL ABERRATIONS

Title: Systematic review of lung function and COPD with peripheral blood DNA methylation in population based studies

Transcription:

TheCancerGenomeAtlasProjectOverview InSeptember2006,TheCancerGenomeAtlas(TCGA)programannouncedplanstomap genomicchangesoflung,brainandovariancancertumorsusinglargescalesequencing technologies. Seven institutions in five states were selected as Cancer Genome Characterization Centers (CGCCs). CGCCs included the Broad Institute of MIT and Harvard; Harvard Medical School and Brigham and Women's Hospital, Lawrence Berkeley National Laboratory, Memorial Sloan Kettering Cancer Center, The Sidney KimmelComprehensiveCancerCenteratJohnsHopkinsUniversity,StanfordUniversity School of Medicine, and the University of North Carolina Lineberger Comprehensive CancerCenter. The analysis of sequence information generated by the CGCCs would entail trying to identify genes that were involved in the mutation or metastasis process for the lung, brainorovariantumors.tosupporttheidentificationprocess,inapril2007,thesophic Biomax teamed with NCI to annotate the 3,168 Brain, Lung and Ovarian cancer genes previously identified in the Cancer Gene Index Project. This project entailed a deeper review of both abstracts and full text papers to find only those genes involved in mutationandmetastasisprocesses.evidencefortheabnormalityandmutationprocess wouldbeannotatedbyphdscientistsbyreviewingbothmedlineabstractsandfulltext papers.belowisabreakdownofthe3,168genesbydiseasetypetobereviewedinthe project. Brain 1,143 Lung 1,182 Ovarian 843 Total 3,168 TheMedlineabstractcorpuswasupdatedtoincludeallpublicationsthroughJanuary1, 2007andNCImadefulltextpapersavailableforreview.Linguisticanalysisofthenew materialwasperformedtoupdateandrefreshthegenename,cancerdiseasetermsand newcategorieswereaddedforabnormalityandmetastasisprocesses.phdscientists followed the roadmap below to identify Brain (B), Lung (L) and Ovarian (O) cancer genesinvolvedintheabnormalityandmetastasisprocess.

Theannotatorsorganizedthetargetannotatedgenesintothreecategories: CategoryA:Geneswhichrelatetoallthreetypesofcancerdisease(B&L&O) CategoryB:Geneswhichrelatetotwoofthethreetypesofcancerdiseases (B&L,B&O,L&O). CategoryC:Geneswhichrelatetoonlyasingletypeofcancerdisease(B,O,L). Figure1:Thecategorizationprocesswasdesignedtostructurethetargetedgenesset andidentifythemolecularabnormalityabstractsandfulltextpapers.nlpanalysiswas performedonabstractsandfulltextpapersusingbiomaxbioltlinguistictoolforeach category gene to detect molecular abnormality information. Custom dictionaries with expandedabnormalityandmetastasistermswereusedinthenlpanalysis. RulesfortheManualAnnotationProcess: 1. Abnormalitiesrelatedtoexpression a. Forabnormalitiesrelatedtogeneorproteinexpressionitissufficienttogive informationaboutthetypeofexpressionchange(expressionup regulated,

expression down regulated, expression changed). It is not required to give detailednumericalinformationabouttherespectivechangeinexpression. b. Usually the information about the expression change can be found in the abstract.otherwisetheannotatorhastocheckthefulltext. 2. Abnormalitiesrelatedtoepigeneticchanges a. For abnormalities related to epigenetic changes it is sufficient to give information about the type of change (hyper methylation, hypomethylation). It is not required to give information about the respective nucleotide(s)affected. b. Usually the information about the epigenetic change can be found in the abstract.otherwisetheannotatorhastocheckthefulltext. 3. Abnormalitiesrelatedtoposttranslationalmodifications a. Posttranslational modifications include changes of phosphorylation, glycosylation(notinversion1oftheabnormalitydictionary,willbeincluded inversion2) b. Forabnormalitiesrelatedtoposttranslationalmodificationsitissufficientto give information about the type of change (hyper phosphorylation, hypophosphorylation). It is not required to give detailed information on the amino acid(s)thatisaffected. c. Usually the information about the posttranslational modification can be foundintheabstract.otherwisetheannotatorhastocheckthefulltext. 4. Abnormalitiesrelatedtomutations a. Mutations include: gene mutations, chromosomal changes, SNPs, polymorphisms,rearrangements b. For abnormalities related to mutations a detailed description of the respective mutation is to be given, e.g. detailed information about the respectivebase pairchange,amino acidchange. c. Usuallythisdetailedinformationcannotbededucedfromtheabstract.Thus thefulltexthastobeusedtogetthedetailedinformation. AtlasProjectResults InSeptember2007,themanuallyannotatedgenesweredeliveredtoNCI.Ofthe3,168 cancerbrain,lungandovariangenesreviewed,577wereidentifiedasgenesrelatedto mutationandmetastasisprocess.

InsilcoResearchusingtheCancerGenomeAtlasGenes The Atlas Cancer Genes have been integrated into various use cases in the BioXM Knowledge Management System to support Insilco research. The system allows scientists to explore brain, ovarian and lung cancer genes involved in the abnormality processandfindrelationshipswithotherscientificelementssuchaspathways,chemical compoundsandinformationcontainedinclinicaltrialsdatabases. Thescreenimagebelow(figure?)shows11braincancergeneswithincreasesingene expressionexperimentswhichcontributedtotheabnormalityprocess.figure?showsa sample report the CNTF cancer gene with the sentence, NCI thesaurus role code and evidence code in the specific sentence from the PubMed abstract annotated by PhD curators. Figure2:11BrainNeoplasmgeneswithincreasedgeneexpressioninvolvedinmutation andabnormalityprocess.

Figure3:AgenereportonCNTFbraincancergeneprovidesthePubMedsentence,a linktotheabstractandncirolecodesandevidencecodesfortheabnormality process.allatlasgeneshavesimilarlevelsofinformationmanuallyannotatedbyphd curators.