COSMIC - Catalogue of Somatic Mutations in Cancer http://cancer.sanger.ac.uk/cosmic https://academic.oup.com/nar/articl e-lookup/doi/10.1093/nar/gkw1121
Data In Large-scale systematic screens Detailed genic analyses ICGC/TCGA/CGP QC (data input format, details & noise) Publication datasheets Data Extraction GRCh37 co-ords & allele seqs ATRAC Sample Details QC (standardise) Genic annotation (HGVS syntax) QC (sequence and annotation checks) Broad genomic data QC (reject ~30%) Deep genic data COSMIC
Search for Disease, Gene, Variant, Sample, Study in either search bar Much information is open to public, but some features require a subscription/login. News window in upper right. Tools Cancer Browser Data Curation information and details Release Notes inform of new additions/advancements to COSMIC ***This is no longer the Beta version as of 8/1/17.
Select Data Source - COSMIC vs. Cell Lines Cell Lines Project and COSMIC house different data. To select a cell line to look into, look at the Cell Line Browser, or NCI-60 Browser if applicable
Cell Lines Data Once you select a cell line to view, sample page opens. Menu on left click on any item in menu to navigate to that item or scroll to desired information. Overview cell line information: sample type, demographics, institute, microsatellite stability, STR Profile, Sequence stats, and Downloadable files Mutation Spectrum tab information can be matched to mutation profiles through COSMIC Home Page Sequence Context tab bases flanking mutations Heatmap tab shows frequent mutations
Cell Lines Variants tab Can view by variant type tabs in window Breakpoints: search by hg38 coordinates Exportable in.tsv and.csv Click on Gene name to go to gene page Click on mutation nomenclature to go to mutation page can determine mutation recurrence in cell lines dataset and pathway involvement
Mutation Filters Scroll right to see expandable mutation filters.
Under/over expression Z-Score CN Type CNV and Expression Data Minor allele Copy number observed Hg38 coordinates Average Ploidy
Searching in COSMIC Search by disease type brings you to Cancer Browser see slides 24-30 Search by gene brings you to search results page that can link to gene/mutation overview (same format as Cell Lines Project) Search by mutation bring you to mutation overview page (same format as Cell Lines Project) All data are derived directly from a cited data source Usually a PubMed ID
Gene Overview Page Several Items on Menu on left panel Gene view view domains within gene and where mutations reside Genome browser - view genomic context Overview general information Tissue expression/mutations in each tissue type Distribution what kind of mutations observed? Drug Resistance do mutations confer drug resistance Variants table of observed variants in gene References curated references for gene
Gene Overview Page Navigate the gene page by selecting an item from the left menu or scrolling through page.
p53 an inactivating mutation profile p53 is oncogenic when its activity is destroyed by inactivating mutations: Loss - of - function. including substitutions, insertions and deletions which cause an early STOP codon Nonsense subs Frameshift Indels
p53 dominant missense mutations However, specific p53 mutations also show Gain of function - key missense substitutions cause an increase in malignancy - and cause resistance to chemotherapies (eg cisplatin) p.r175h reduces apoptosis p.r248q & p.r273h/c increase proliferation & invasiveness Missense (R273H) GTG CGT GTT Val Arg Val GTG CAT GTT Val His Val p53 is a DNA binding protein, affecting the work of many other genes GOF mutations alter how p53 binds to DNA, changing its checkpoint function
Mutation Overview Overview general mutation information Tissue distribution mutation frequency in tissue types Samples table of affected samples Pathways Affected table of affected pathways Click link to go to WikiPathways References links to relevant publications
Mutation Overview Page
Cancer Gene Census Main Links under Projects Section on COSMIC home page. List is in table format and (with license) is exportable in.csv or.tsv format. Genes marked as Hallmarks of Cancer have more functional descriptions available Click on
Cancer Gene Census Click on buttons near the top of the browser or scroll down to see the Breakdown of the types of gene abnormalities in the Census or abbreviations used in the table.
Hallmarks of Cancer
Drug Resistance Main Link under Data Curation section on COSMIC home page. Resistance Data are available for 12 genes: ABL1, ALK, BRAF, EGFR, ESR1, FLT3, KIT, MAP2K1, MAP2K2, PDGFRA, SMO, MET Table to view drug and resistance mutation frequency
Drug Resistance Manually curated list of genes that can carry resistance mutations to particular drugs. Click on gene name in table to see Gene Page. Scroll down to to the drug resistance section
Mutational Signatures Selection under Data in top banner. Describes mutational signatures in detail and specifies diseases that carry each mutational signature.
GRCh37 Cancer Archive If your lab has not converted to hg38, there is an option to view GRCh37/hg19 information on COSMIC. Under Projects in top banner, select GRCh37 archive to see legacy site.
Data Curation Other links under Expert Curation section on home page Gene Curation: list of all genes with deep curation completed. Curation details included on Curated Genes page. Click on gene to go to gene page. Gene Fusion Curation: list of all gene fusions with deep curation completed for solid tumors. Click on pair of gene to go to fusion overview page. Fusion Overview page information about in what kind of tissue fusion was observed and PMID. Genome Annotation: Information and parameters used to analyze data. Drug Resistance: See slides 19-20.
Cancer Browser Main link under Tools on home page Select tumor of interest by tissue and histology Press Go in lower right corner of window
Cancer Browser Opens to bar graph of top 20 mutated genes in your selected type of cancer. In table format you can view and filter genes with and without mutations by mutation frequency or number of samples tested. Exportable in.csv or.tsv format (with license)
Cancer Browser Can view genome browser in cancer browser window. Mutation matrix top 20 mutations by sample
Cancer Browser: Mutation Matrix Can redraw image using drop-down menu on top of window to view data based on a certain mutation type.
Cancer Browser: Distribution Distribution of mutations within your selected cancer type. Substitutions are described similarly base>base (not shown) Indels described by size in bps (Insertions displayed similar to deletions)
Cancer Browser: Variants Observed Fusions, Mutations, Methylation status, and CNV/Expression data given in table format. Exportable in.csv and.tsv format (with license) Click on mutations for mutation overview Click on fusion pairing for Fusion overview Click on gene for gene overview.
Cancer Browser: Samples Table of mutated and non-mutated samples Exportable in.tsv or.csv (with license) Click on sample ID for sample overview page
Sample Overview Page Click on menu on left side of page or scroll to desired information Primary reference(s) and/or large study name are available in the reference section
GA4GH Beacon Main link under Tools on COSMIC home page GA4GH = Global Alliance for Genomics and Health see Information section Shared URL format for query http://cancer.sanger.ac.uk/api/ga4gh/beacon/query?chrom=?;pos=?;allele=? Can query if COSMIC or Cell Lines Project has observed a specific mutation at a designated genomic locus.
COSMIC-3D
COSMIC 3D SNV frequency depicted by yellow/orange segments Predicted small molecule binding sites in blue. Can take screenshot, pop screen out, drag mouse or click button to rotate protein structure for better view. Several viewing and toggle options Protein property information UniProt sequence information and mutation information integrated. There are so many features within this application. Check them out!
COSMIC BigQuery If you have a COSMIC account, register at no charge for access to BigQuery. Research based queries of large datasets at minimal cost per query. Supports Standard SQL programming language and can be accessed by R http://isb-cancer-genomicscloud.readthedocs.io/en/latest/sections/ COSMIC.html
CONAN CONAN = COpy Number ANalysis under Tools on Home page. Search by gene (HGNC or Ensembl) or genomic region (hg38)
CONAN
Data Downloads Main Link under Data on Home Page Need license to download data Depending on the data you would like to download, you may need to download off of a SFTP site using a FTP tool like WinSCP or Filezilla
Data Submission Under Help Data Submission Using COSMIC data or describing mutation data in results Submitting Data to COSMIC
Other Links Under Data Genome Annotation Datasheets details about COSMIC versions Help links to helpful overview resources FAQ Frequently Asked Questions
Scenario #1 You are a laboratory professional trying to identify genes to put on your lab s new breast carcinoma NGS panel. Cancer Browser Search Cancer Gene Census
Cancer Browser Query
Search Cancer Gene Census
Results from Queries Cancer Genome Browser Query (Criteria 3% mutated) = 16 genes PIK3CA TP53 CDH1 GATA3 KMT2C ESR1 PTEN RB1 SPEN ARID1A NCOR1 KMT2D MAP2K4 NF1 TBX3 LRP1B Cancer Gene Census Query = 32 genes CCND1 KEAP1 NOTCH1 GATA3 SMARCD1 ESR1 MAP3K13 NCOR1 TP53 AKT1 ARID1B TBX3 ERBB2 BRCA1 BRCA2 FOXA1 CDKN1B ARID1A PBRM1 SALL4 EP300 PIK3CA ETV6 NTRK3 CTCF PPM1D CASP8 CDH1 MAP3K1 MAP3K4 RB1 BAP1 9 genes overlap between two lists = 39 total genes
Scenario #2 A whole genome sequencing (WGS) case in your lab has a mutation that you haven t observed before. Does this mutant allele exist in COSMIC? GA4GH Beacon Chr7:140753333 (BRAF) Mutated Allele = A
Yes this is in COSMIC Mutation can be found in Genome browser on BRAF gene page or by entering the AA mutation or CDS Mutation Nomenclature
Scenario #3 Your lab is building a Pan Cancer List based on multiple resources. Cancer Gene Census
Using Data from Cancer Gene Census If you have a license, download as.csv or.tsv This is useful as Excel if you d like to sort data in different ways to stratify genes based on other criteria in COSMIC s table. Cancer Gene Census is updated with most version releases of COSMIC.
Scenario #4 Mate Pair sequencing has picked up a translocation your lab has not yet reported. NUP98/KDM5A Search in Gene fusion curation under Data Curation
Curated Fusions Click on fusion gene pairing to open fusion overview. Link to PMID and/or large scale studies on Reference tab.
Scenario #5 Your lab has a significant pool of data that you would like to contribute to COSMIC to enhance the knowledgebase. COSMIC is exploring collaborative relationships with institutions aiming to release anonymized somatic mutation calls in COSMIC COSMIC will highlight these laboratories in their webpages and strongly cite them as sources. Contact COSMIC directly Simon Forbes (Head of COSMIC): saf@sanger.ac.uk COSMIC Helpdesk: cosmic@sanger.ac.uk
Future Features: Coming Soon Guide to Actionability Organoids Data release for the Human Cancer Model Initiative Better laboratory models in preparation Cancer Mutation Census identify driver mutations across all diseases
Online Tutorials COSMIC Site Overview: https://www.youtube.com/watch?v=whxil86gnka Sample Data Tutorial: https://www.youtube.com/watch?v=5dqinh8_lao Cancer Browser Tutorial: https://www.youtube.com/watch?v=k477uaikx74 Gene Pages Tutorial: https://www.youtube.com/watch?v=2fd5rabgk6o Fusions Tutorial: https://www.youtube.com/watch?v=m9ilszjsuaw COSMIC Search Tutorial: https://www.youtube.com/watch?v=svfloi4hfnm DISCLAIMER: These tutorials all provide instruction on how to use older versions of COSMIC. The latest version of COSMIC looks different and has different features.
Citation Forbes et al. COSMIC: somatic cancer genetics at high-resolution. Nucleic Acids Research (2017)45(D1):D777- D783. Available online: https://doi.org/10.1093/nar/gkw1121
Contacts Twitter Handle: @cosmic_sanger Facebook: cosmic.sanger Email: cosmic@sanger.ac.uk Simon Forbes (Head of COSMIC): saf@sanger.ac.uk