Package cancer. July 10, 2018

Size: px
Start display at page:

Download "Package cancer. July 10, 2018"

Transcription

1 Type Package Package cancer July 10, 2018 Title A Graphical User Interface for accessing and modeling the Cancer Genomics Data of MSKCC. Version Date Author Karim Mezhoud. Nuclear Safety & Security Department. Nuclear Science Center of Tunisia. Maintainer Karim Mezhoud <kmezhoud@gmail.com> The package is user friendly interface based on the cgdsr and other modeling packages to explore, compare, and analyse all available Cancer Data (Clinical data, Gene Mutation, Gene Methylation, Gene Expression, Protein Phosphorylation, Copy Number Alteration) hosted by the Computational Biology Center at Memorial-Sloan-Kettering Cancer Center (MSKCC). License GPL-2 LazyLoad yes Depends R (>= 3.3), tcltk, tcltk2, cgdsr Imports GSEABase, GSEAlm, tkrplot, genetclassifier, RUnit, Formula, rpart, survival, Biobase, phenotest, circlize, plyr, graphics, stats, utils Suggests testthat (>= ), R.rsp VignetteBuilder R.rsp biocviews GUI, GeneExpression, Software RoxygenNote git_url git_branch RELEASE_3_7 git_last_commit dc44485 git_last_commit_date Date/Publication

2 2 R topics documented: R topics documented: about cancer cancerhelp cancer_vignette cbind.na ClinicalData dialoggeneclassifier dialoggetgenelistmsigdb dialogmetoption dialogmut dialogoptioncircos dialogoptiongsealm dialogoptionphenotest dialogplotoption_skincor dialogsamplinggsea dialogselectfiles_gsea dialogspecificmut dialogsummary_gsea displayintable GeneExpMatrix getcases getcasesgenprofs getcircos getclinicaldatamatrix getclinicdata_multiplecases getcor_expcnamet geteset getgctclsexample getgct_clsfiles getgeneexpmatrix getgenelist getgenelistexample getgenelistfrommsigdb getgenesclassifier getgenestree_multiplecases getgenestree_singlecase getgenprofs getgsealm_diseases getgsealm_variables getintable getlistprofdata getmegaprofdata getmetdatamultiplegenes getmsigdb getmsigdbexample getmsigdbfile getmutdata getphenotest getprofilesdatamultiplegenes getprofilesdatasinglegene

3 about 3 getspecificmut getsummarygsea getsurvival gettextwin GSEA GSEA.Analyze.Sets GSEA.ConsPlot GSEA.EnrichmentScore GSEA.EnrichmentScore GSEA.Gct2Frame GSEA.Gct2Frame GSEA.GeneRanking GSEA.HeatMapPlot GSEA.HeatMapPlot GSEA.NormalizeCols GSEA.NormalizeRows GSEA.ReadClsFile GSEA.Res2Frame GSEA.Threshold GSEA.VarFilter GSEA.write.gct Match_GeneList_MSigDB modaldialog myglobalenv OLD.GSEA.EnrichmentScore plotmodel plot_1gene_2genprofs plot_2genes_1genprof rbind.na Run.GSEA setworkspace testcheckedcasegenprof Index 50 about about cancer about cancer about() dialig box with text

4 4 cancerhelp about() cancer main function main function cancer() open the starting windows with cancer studies myglobalenv <- new.env(parent = emptyenv()) cancer() cancerhelp cancer Help cancer Help cancerhelp() html file with tutorial cancerhelp()

5 cancer_vignette 5 cancer_vignette open pdf vignette open pdf vignette cancer_vignette() open pdf vignette cancer_vignette() cbind.na bind non equal colunm bind non equal colunm cbind.na(..., deparse.level = 1) deparse.level 1 a data frame with merged columns col1 <- c("a","b","c","d") col2 <- c("a", "B", "C") col3 <- cbind.na(col1, col2)

6 6 dialoggeneclassifier ClinicalData ClinicalData Example of Clinical Data data("clinicaldata") Format A data frame with 770 observations on the following 4 variables. DFS_MONTHS a numeric vector DFS_STATUS a factor with levels DiseaseFree Recurred/Progressed OS_MONTHS a numeric vector OS_STATUS a factor with levels DECEASED LIVING a dataframe with clinical data Source cbioportal data("clinicaldata") dialoggeneclassifier Dialogue Box for gene classifier setting: sample size and postprob threshold Dialogue Box for gene classifier setting: sample size and postprob threshold dialoggeneclassifier(lchecked_cases,entrywidth = 10,returnValOnCancel = "ID_CANCEL")

7 dialoggetgenelistmsigdb 7 Lchecked_Cases integer with a number of checked cases entrywidth integer default 10 returnvaloncancel "ID_CANCEL" a dataframe with genes classes load(paste(path.package("cancer"),"/data/gbm_tcgaplottwogenprof.rdata", sep="")) getgenesclassifier() dialoggeneclassifier(1,10,returnvaloncancel = "ID_CANCEL") dialoggetgenelistmsigdb Multi-select choice of gene sets from loaded MSigDB Multi-select choice of gene sets from loaded MSigDB dialoggetgenelistmsigdb(msigdb) MSigDB object with MSigDB. A list of genesets a dataframe with genes classes z <- 7 MSigDB <- readlines(paste(.libpaths(),"/cancer/extdata/msigdb/c5.bp.v4.0.symbols.gmt", sep="")) dialoggetgenelistmsigdb(msigdb)

8 8 dialogmut dialogmetoption Dialog Box to set methylation options Dialog Box to set methylation options dialogmetoption(profdata, k) ProfData adataframe with methylation data k threshold of silencing gene 0:1 a dialog box to set methylation option (threshold of silencing gene) getmetdatamultiplegenes() #dialogmetoption(profdata,0.7) dialogmut Dialog bos to set returned Mutation information Dialog bos to set returned Mutation information dialogmut(title, question, entryinit, entrywidth = 40,returnValOnCancel = "ID_CANCEL") title question entryinit title of the table question entryinit entrywidth 40 returnvaloncancel "ID_CANCEL"

9 dialogoptioncircos 9 a check box with mutations variables dialogmut("title", "question", "entryinit", entrywidth = 40, returnvaloncancel = "ID_CANCEL") dialogoptioncircos Checkbox to select dimensions Checkbox to select dimensions dialogoptioncircos() a checkbox with all dimensions load(paste(path.package("cancer"),"/data/circos.rdata", sep="")) dialogoptioncircos() #getcircos(dimension ="All") dialogoptiongsealm Dialogbox to select variables from Clinical data Dialogbox to select variables from Clinical data dialogoptiongsealm(k,clinicaldata) k integer 1 ClinicalData dataframe with clinical variables

10 10 dialogplotoption_skincor permutaion value, p-value, covariables data(clinicaldata) getoptiongsealm() dialogoptionphenotest Checkbox to select variables from clinical data Checkbox to select variables from clinical data dialogoptionphenotest(eset) eset Expression Set vectors: variables to test Survival status, AGE, p-value load(paste(path.package("cancer"),"/data/prad_michphenotest1021.rdata", sep="")) dialogoptionphenotest(myglobalenv$eset) dialogplotoption_skincor Checkbox to select variables for plotting Checkbox to select variables for plotting dialogplotoption_skincor(s)

11 dialogsamplinggsea 11 s integer number of Studies Dialog box with setting of correlation method load(paste(path.package("cancer"),"/data/gbm_tcgaplottwogenprof.rdata", sep="")) dialogplotoption_skincor(1) dialogsamplinggsea Dialog Box for Sampling patients from expression profile data used for GSEA-R (Broad Institute) Dialog Box for Sampling patients from expression profile data used for GSEA-R (Broad Institute) dialogsamplinggsea( Lchecked_Cases,entryWidth = 10,returnValOnCancel = "ID_CANCEL") Lchecked_Cases Number of checked Cases entrywidth 10 returnvaloncancel "ID_CANCEL" A vector with sampling size Run.GSEA() #dialogsamplinggsea(1,entrywidth=10,returnvaloncancel = "ID_CANCEL")

12 12 dialogspecificmut dialogselectfiles_gsea Dialog Box to Select GCT, CLS, GMT and output Files for GSEA-R (Broad Institute) Dialog Box to Select GCT, CLS, GMT and output Files for GSEA-R (Broad Institute) dialogselectfiles_gsea() A vector with files paths dialogselectfiles_gsea() dialogspecificmut dialog box to Specify Mutation using Regular Expression. Search specific mutation using regular expression. dialog box to Specify Mutation using Regular Expression. Search specific mutation using regular expression. getspecificmut() a a dataframe with specific mutation informations getspecificmut()

13 dialogsummary_gsea 13 dialogsummary_gsea Dialog Box to specify phenotype (variable) used in last GSEA-R to get Summary Results. This function ask the user to specify the phenotype (variable). Dialog Box to specify phenotype (variable) used in last GSEA-R to get Summary Results. This function ask the user to specify the phenotype (variable). dialogsummary_gsea(variable,returnvaloncancel ="ID_CANCEL") Variable phenotype returnvaloncancel "ID_CANCEL" variables #Run.GSEA() #getsummarygsea() displayintable Display matrix in tcltk table Display matrix in tcltk table displayintable(tclarray,title="",height=-1,width=-1,nrow=-1,ncol=-1) tclarray a dataframe title title of the table height -1 width -1 nrow -1 ncol -1

14 14 GeneExpMatrix display a Table data(clinicaldata) getintable(table= ClinicalData, title= "Clinical Data") GeneExpMatrix GeneExpMatrix Example of GeneExpMatrix data("geneexpmatrix") Format A data frame with 958 observations on the following 18 variables. BEGAIN a numeric vector CD83 a numeric vector CD93 a numeric vector CEP164 a numeric vector FOXN2 a numeric vector IGFBP2 a numeric vector IL18 a numeric vector KDELR1 a numeric vector NCSTN a numeric vector NOTCH2 a numeric vector NPY a numeric vector NT5E a numeric vector PARP4 a numeric vector SIGLEC1 a numeric vector SLC16A2 a numeric vector SLC35B1 a numeric vector SLC9A2 a numeric vector VPS16 a numeric vector

15 getcases 15 Details Source example of gene expression a dataframe of gene expression cbioportal References cbioportal data(geneexpmatrix) ## maybe str(geneexpmatrix) ; plot(geneexpmatrix)... getcases Get cases for selected Studies. The Cases are the descrption of the samples from patients. The samples can be subdivided by the type of assays as, sequencing, CNA, Mutation, Methylation. Get cases for selected Studies. The Cases are the descrption of the samples from patients. The samples can be subdivided by the type of assays as, sequencing, CNA, Mutation, Methylation. getcases() a dataframe with cases # Create CGDS object cgds<-cgds(" # Get list of cancer studies at server Studies <- getcancerstudies(cgds)[,2] # Get available case lists (collection of samples) for a given cancer study mycancerstudy <- getcancerstudies(cgds)[2,1] mycaselist <- getcaselists(cgds,mycancerstudy)[1,1] ##getcases()

16 16 getcircos getcasesgenprofs get Cases and Genetic Profiles of selected Studies. get Cases and Genetic Profiles of selected Studies. getcasesgenprofs() This function is run by the "Get Cases and Genetic Profiles for selected Studies in starting window. This function needs to select at least one study and display Cases and genetic profiles in the main window. ##Load Session load(paste(path.package("cancer"),"/data/brca_tcga73genes.rdata", sep="")) ## load Cases and Genetic Profiles getcasesgenprofs() getcircos get Circos Layout for selected studies and selected dimensions get Circos Layout for selected studies and selected dimensions getcircos(dimension) dimension string (All,mRNA, CNA, Met,RPPA, mirna, Mut) a plot with Circos style load(paste(path.package("cancer"),"/data/circos.rdata", sep="")) getcircos(dimension ="All")

17 getclinicaldatamatrix 17 getclinicaldatamatrix get matrix with clinical from file get matrix with clinical from file getclinicaldatamatrix() dataframe of clinicaldata load(paste(path.package("cancer"),"/data/brca_tcga73genes.rdata", sep="")) getclinicaldatamatrix() getclinicdata_multiplecases get Clinical Data for Multiple Cases. User needs to select at least one case to run this function. Get clinical data for more one or multiple cases. get Clinical Data for Multiple Cases. User needs to select at least one case to run this function. Get clinical data for more one or multiple cases. getclinicdata_multiplecases(getsummarygseaexists) getsummarygseaexists if equal to 0, the clinical data is displayed in table. if the argument is equal to 1, the clinical data is used to summarise GSEA analysis results. dataframe with clinical data

18 18 getcor_expcnamet ##Load Session load(paste(path.package("cancer"),"/data/brca_tcga73genes.rdata", sep="")) ## Select Case myglobalenv$curselectcases <- 2 ## get Clinical data getclinicdata_multiplecases(getsummarygseaexists = 0) getcor_expcnamet Get gene correlation for multiple dimensions. Get gene correlation for multiple dimensions. getcor_expcnamet(listmatrix, dimension) ListMatrix dimension is a List of numeric matrices Exp,CNA, Met, mirna, RPPA correlation matrix load(paste(path.package("cancer"),"/data/circos.rdata", sep="")) getlistprofdata() getcor_expcnamet(myglobalenv$listprofdata$expression, dimension="mrna") head(myglobalenv$cor_exp)

19 geteset 19 geteset Built Expression Set (eset) from profile data. Built Expression Set (eset) from profile data. geteset() ExpressionSet f <- 9 load(paste(.libpaths(),"cancer/data/prad_michphenotest1021", sep="")) geteset() getgctclsexample get GCT and CLS example files. get GCT and CLS example files. getgctclsexample() GCT and CLS files ## Load workspace getgctclsexample()

20 20 getgeneexpmatrix getgct_clsfiles get Profile (GCT file) and Phenotype (CLS file) Data from Disease. get Profile (GCT file) and Phenotype (CLS file) Data from Disease. getgct_clsfiles() GCT and CLS files paths getgct_clsfiles() getgeneexpmatrix get matrix with gene expression from file get matrix with gene expression from file getgeneexpmatrix() dataframe of gene expression load(paste(path.package("cancer"),"/data/brca_tcga73genes.rdata", sep="")) getgeneexpmatrix()

21 getgenelist 21 getgenelist User needs to specify which gene is interesting to get genomic cancer data. The gene must be with Symbol and one gene by line. User needs to specify which gene is interesting to get genomic cancer data. The gene must be with Symbol and one gene by line. getgenelist() Gene list path of file myglobalenv <- new.env(parent = emptyenv()) getgenelist() getgenelistexample get Gene List from examples. User can select one from available gene list get Gene List from examples. User can select one from available gene list getgenelistexample() Gene list path of file myglobalenv <- new.env(parent = emptyenv()) getgenelistexample()

22 22 getgenesclassifier getgenelistfrommsigdb get gene list from MSigDB get gene list from MSigDB getgenelistfrommsigdb() a vector with gene list load(paste(path.package("cancer"),"/data/brca_tcgagsealm1021.rdata", sep="")) getgenelistfrommsigdb() getgenesclassifier get Genes Classifier get Genes Classifier getgenesclassifier() a data frma with genes classes x <- 0 load(paste(.libpaths(),"/cancer/data/brca_tcga73genes.rdata", sep="")) getgenesclassifier()

23 getgenestree_multiplecases 23 getgenestree_multiplecases Get successively trees of genes list for multiple cases Get successively trees of genes list for multiple cases getgenestree_multiplecases(entrywidth = 10) entrywidth 10 plot tree q <- load(paste(path.package("cancer"),"/data/brca_tcga73genes.rdata", sep="")) load(paste(.libpaths(),"/cancer/data/brca_tcga73genes.rdata", sep="")) getgenestree_multiplecases(entrywidth = 10) getgenestree_singlecase classify genes in tree for two phenotypes in the same case(disease). classify genes in tree for two phenotypes in the same case(disease). getgenestree_singlecase() tree plot load(paste(path.package("cancer"),"/data/prad_michphenotest1021.rdata", sep="")) getgenestree_singlecase()

24 24 getgsealm_diseases getgenprofs Get Genetic Profile from selected Studies Get Genetic Profile from selected Studies getgenprofs() dataframe with genetic profil cgds<-cgds(" # Get list of cancer studies at server Studies <- getcancerstudies(cgds)[,2] # Get available case lists (collection of samples) for a given cancer study mycancerstudy <- getcancerstudies(cgds)[2,1] mycaselist <- getcaselists(cgds,mycancerstudy)[1,1] # Get available genetic profiles mygeneticprofile <- getgeneticprofiles(cgds,mycancerstudy)[4,1] getgenprofs() getgsealm_diseases get GSEA linear modeling by studies (diseases) get GSEA linear modeling by studies (diseases) getgsealm_diseases() a dataframe with annotation (GO, BP) load(paste(.libpaths(),"/cancer/data/ucec_tcga_pubgsea1021.rdata", sep="")) getgsealm_diseases

25 getgsealm_variables 25 getgsealm_variables get GSEA linear modeling by variables (phenotype) get GSEA linear modeling by variables (phenotype) getgsealm_variables() a dataframe with annotation (GO, BP) x <- 3 load(paste(.libpaths(),"/cancer/data/ucec_tcga_pubgsea1021.rdata", sep="")) getgsealm_variables() getintable get dataframe in TK/TCL table get dataframe in TK/TCL table getintable(table,title) table title Dataframe string a title of the table display a Table data(clinicaldata) getintable(table= ClinicalData, title= "Clinical Data")

26 26 getmegaprofdata getlistprofdata get a list of Profile Data of every available dimensions. This function load matrices of every dimension (Exp, CNA, Met, RPPA,miRNA,Mut) and save them in a list for every disease. get a list of Profile Data of every available dimensions. This function load matrices of every dimension (Exp, CNA, Met, RPPA,miRNA,Mut) and save them in a list for every disease. getlistprofdata() a list of data frame with Profiles Data load(paste(path.package("cancer"),"/data/brca_tcga73genes.rdata", sep="")) getlistprofdata() head(myglobalenv$profdata$expression) getmegaprofdata Get profile data for more than 500 genes list. Get profile data for more than 500 genes list. getmegaprofdata(megagenelist,k) MegaGeneList Genelist >500 k integer number of studies dataframewith profile data

27 getmetdatamultiplegenes 27 myglobalenv <- new.env(parent = emptyenv()) load(paste(path.package("cancer"),"/data/brca_tcgagsealm1021.rdata", sep="")) getmegaprofdata(myglobaenv$megagenelist,1) getmetdatamultiplegenes get Methylation data for multiple genes get Methylation data for multiple genes getmetdatamultiplegenes() a a dataframe with mean and median of methylation rate (threshold of silencing gene) getmetdatamultiplegenes() getmsigdb Reduce MSigDB size for only gene list Reduce MSigDB size for only gene list getmsigdb(eset, k) eset k Expression Set integer Number of studies

28 28 getmsigdbfile MSigDB for user gene List d <- 7 setworkspace() getmsigdb(eset = myglobalenv$esetclassifier,k = 1) getmsigdbexample get example of.gmt file from MSigDB (Broad Institute) get example of.gmt file from MSigDB (Broad Institute) getmsigdbexample() path of GMT file load(paste(path.package("cancer"),"/data/brca_tcga73genes.rdata", sep="")) getmsigdbexample() getmsigdbfile Dialog Box to Select MSigDB Files from drive Dialog Box to Select MSigDB Files from drive getmsigdbfile() A path of MSigDB file

29 getmutdata 29 f <- 5+2 load(paste(path.package("cancer"),"/data/prad_michphenotest1021", sep="")) geteset() getmsigdbfile() getmutdata get Mutation data for multiple genes get Mutation data for multiple genes getmutdata() a a dataframe with mutation informations getmutdata() getphenotest Associate phenotype to Studies (cancers) Associate phenotype to Studies (cancers) getphenotest() a dataframe with disease/ variables association

30 30 getprofilesdatasinglegene load(paste(path.package("cancer"),"/data/prad_michphenotest1021.rdata", sep="")) getphenotest(myglobalenv$eset) getprofilesdatamultiplegenes get Profles Data of multiple genes get Profles Data of multiple genes getprofilesdatamultiplegenes(getsummarygseaexists) getsummarygseaexists if equal to 0, the clinical data is displayed in table. if the argument is equal to 1, the clinical data is used to summarise GSEA analysis results. a file with a dataframe of profle data load(paste(path.package("cancer"),"/data/prad_michphenotest1021.rdata", sep="")) getprofilesdatamultiplegenes(getsummarygseaexists = 0) getprofilesdatasinglegene get Profiles Data for a Single Gene. get Profiles Data for a Single Gene. getprofilesdatasinglegene()

31 getspecificmut 31 dataframe with profiles data for a single gene load(paste(path.package("cancer"),"/data/brca_tcga73genes.rdata", sep="")) ## Select Case from Breast Cancer myglobalenv$curselectcases <- 9 ##Select Genetic Profile from Breast Cancer myglobalenv$curselectgenprofs <- 4 ## get Specific Mutation data for 73 Genes list getprofilesdatasinglegene() getspecificmut get specific Mutation data for multiple genes get specific Mutation data for multiple genes getspecificmut() a a dataframe with specific mutation informations getspecificmut() getsummarygsea get Summary results from GSEA-R (Broad Institute) get Summary results from GSEA-R (Broad Institute) getsummarygsea()

32 32 getsurvival Dataframe with summary results Run.GSEA() getsummarygsea() getsurvival Survival plot Survival plot getsurvival(coxph) Coxph if Coxph = 0 : plot Kaplan-Meier curves else Coxph= 1 : plot Cox Proportional Hazard Model Survival plot surv <- 11 load(paste(.libpaths(),"/cancer/data/gbm_tcgaplottwogenprof.rdata", sep="")) getsurvival(coxph = 1)

33 gettextwin 33 gettextwin get text in tcltk windows get text in tcltk windows gettextwin(text) text string tcltk windows with text text <- "mytext" gettextwin(text) GSEA GSEA-R (Broad Institute) See GSEA Author(s) Subramanian, Tamayo, et al. (2005, PNAS 102, ) and Mootha, Lindgren, et al. (2003, Nat Genet 34, ) library(cancer) ## Load workspace ##Run.GSEA()

34 34 GSEA.Analyze.Sets GSEA.Analyze.Sets GSEA.Analyze.Sets GSEA.Analyze.Sets(directory,topgs="",non.interactive.run= FALSE,height=12,width=17) directory topgs topgs = 20 directory= fname.output non.interactive.run non.interactive.run= FALSE height width height=16 width=16 GSEA.Analyze.Sets Author(s) Subramanian, Tamayo, et al. (2005, PNAS 102, ) and Mootha, Lindgren, et al. (2003, Nat Genet 34, ) References ## Load workspace ##Run.GSEA()

35 GSEA.ConsPlot 35 GSEA.ConsPlot GSEA.ConsPlot GSEA.ConsPlot GSEA.ConsPlot(V, col.names, main = " ", sub = " ", xlab = " ", ylab = " ") V V="Itable" col.names col.names = colnames main main= " " sub sub = " " xlab xlab= " " ylab ylab = " " GSEA.ConsPlot library(cancer) ## Load workspace ##Run.GSEA() GSEA.EnrichmentScore GSEA.EnrichmentScore GSEA.EnrichmentScore GSEA.EnrichmentScore(gene.list, gene.set, weighted.score.type = 1, correl.vector = NULL)

36 36 GSEA.EnrichmentScore2 gene.list gene.set weighted.score.type correl.vector GSEA.EnrichmentScore library(cancer) ## Load workspace ##Run.GSEA() GSEA.EnrichmentScore2 GSEA.EnrichmentScore2 GSEA.EnrichmentScore2 GSEA.EnrichmentScore2(gene.list, gene.set, weighted.score.type = 1, correl.vector = NULL) gene.list gene.set weighted.score.type correl.vector GSEA.EnrichmentScore2 library(cancer) ## Load workspace ##Run.GSEA()

37 GSEA.Gct2Frame 37 GSEA.Gct2Frame GSEA.Gct2Frame GSEA.Gct2Frame GSEA.Gct2Frame(filename = "NULL") filename GSEA.GCT2Frame library(cancer) ## Load workspace ##Run.GSEA() GSEA.Gct2Frame2 GSEA.Gct2Frame2 GSEA.Gct2Frame2 GSEA.Gct2Frame2(filename = "NULL") filename GSEA.GCT2Frame2

38 38 GSEA.GeneRanking library(cancer) ## Load workspace ##Run.GSEA() GSEA.GeneRanking GSEA.GeneRanking GSEA.GeneRanking A class.labels gene.labels nperm permutation.type sigma.correction fraction replace reverse.sign GSEA.GeneRanking library(cancer) ## Load workspace ##Run.GSEA()

39 GSEA.HeatMapPlot 39 GSEA.HeatMapPlot GSEA.HeatMapPlot GSEA.HeatMapPlot GSEA.HeatMapPlot library(cancer) ## Load workspace ##Run.GSEA() GSEA.HeatMapPlot2 GSEA.HeatMapPlot2 GSEA.HeatMapPlot2 GSEA.HeatMapPlot2 library(cancer) ## Load workspace ##Run.GSEA()

40 40 GSEA.NormalizeRows GSEA.NormalizeCols GSEA.NormalizeCols GSEA.NormalizeCols GSEA.NormalizeCols(V) V GSEA.NormalizeCols ## Load workspace ##Run.GSEA() GSEA.NormalizeRows GSEA.NormalizeRows GSEA.NormalizeRows GSEA.NormalizeRows(V) V GSEA.NormalizeRows

41 GSEA.ReadClsFile 41 library(cancer) ## Load workspace ##Run.GSEA() GSEA.ReadClsFile GSEA.ReadClsFile GSEA.ReadClsFile GSEA.ReadClsFile(file = "NULL") file GSEA.ReadClsFile ##Run.GSEA() GSEA.Res2Frame GSEA.Res2Frame GSEA.Res2Frame GSEA.Res2Frame(filename = "NULL") filename

42 42 GSEA.Threshold GSEA.NormalizeCols ##Run.GSEA() GSEA.Threshold GSEA.Threshold GSEA.Threshold GSEA.Threshold(V, thres, ceil) V thres ceil GSEA.Threshold ## Load workspace ##Run.GSEA()

43 GSEA.VarFilter 43 GSEA.VarFilter GSEA.VarFilter GSEA.VarFilter GSEA.VarFilter(V, fold, delta, gene.names = "NULL") V fold delta gene.names GSEA.VarFilter ## Load workspace ##Run.GSEA() GSEA.write.gct GSEA.write.gct GSEA.write.gct GSEA.write.gct(gct, filename) gct filename GSEA.Write.gct

44 44 modaldialog ## Load workspace ##Run.GSEA() Match_GeneList_MSigDB Search MSigDb that overlap gene list Search MSigDb that overlap gene list Match_GeneList_MSigDB GeneList load(paste(path.package("cancer"),"/data/prad_michphenotest1021.rdata", sep="")) Match_GeneList_MSigDB() modaldialog Dialog box to specify Gene Symbol. Dialog box to specify Gene Symbol. modaldialog(title, question, entryinit, entrywidth = 40,returnValOnCancel = "ID_CANCEL") title question entryinit string string entryinit entrywidth 40 returnvaloncancel "ID_CANCEL"

45 myglobalenv 45 dialog box load(paste(path.package("cancer"),"/data/brca_tcga73genes.rdata", sep="")) ## Select Case from Breast Cancer myglobalenv$curselectcases <- 9 ##Select Genetic Profile from Breast Cancer myglobalenv$curselectgenprofs <- 4 ## get Specific Mutation data for 73 Genes list getprofilesdatasinglegene() myglobalenv myglobalenv Global environment to store cancer variables. Format The format is: <environment: 0xb3eb240> myglobalenv <- new.env(parent = emptyenv()) OLD.GSEA.EnrichmentScore OLD.GSEA.EnrichmentScore OLD.GSEA.EnrichmentScore gene.list gene.set OLD.GSEA.EnchmentScore

46 46 plot_1gene_2genprofs ##Run.GSEA() plotmodel model plotting with tcltk model plotting with tcltk plotmodel(plotcommand, title= "TITLE",hscale=1, vscale=1 ) plotcommand title hscale vscale plotcommand title of plot horizintal scale vertical scale plot load(paste(path.package("cancer"),"/data/gbm_tcgaplottwogenprof.rdata", sep="")) plot_1gene_2genprofs() plot_1gene_2genprofs Plotting two genetic profiles for one Gene Plotting two genetic profiles for one Gene plot_1gene_2genprofs()

47 plot_2genes_1genprof 47 plot load(paste(path.package("cancer"),"/data/gbm_tcgaplottwogenprof.rdata", sep="")) plot_1gene_2genprofs() plot_2genes_1genprof plot correlation of two genes expressions. plot correlation of two genes expressions. plot_2genes_1genprof() plot plot_2genes_1genprof() rbind.na bind non equal row bind non equal row rbind.na(..., deparse.level = 1) deparse.level 1

48 48 setworkspace a data frame with merged rows row1 <- c("a","b","c","d") row2 <- c("a", "B", "C") row3 <- rbind.na(row1, row2) Run.GSEA The main function to run GSEA-R from Broad Institute The main function to run GSEA-R from Broad Institute Run.GSEA() A vector with sampling size Run.GSEA() setworkspace Setting work Directory and output folders.at starting window, user needs to set work directory for output data. The function is foud in File menu. Setting work Directory and output folders.at starting window, user needs to set work directory for output data. The function is foud in File menu. setworkspace()

49 testcheckedcasegenprof 49 paths of output files load(paste(path.package("cancer"),"/data/brca_tcga73genes.rdata", sep="")) setworkspace() testcheckedcasegenprof Testing checked appropriate Cases for appropriate Genetic profiles. Testing checked appropriate Cases for appropriate Genetic profiles. testcheckedcasegenprof() dialog box with warning message load(paste(path.package("cancer"),"/data/brca_tcga73genes.rdata", sep="")) testcheckedcasegenprof()

50 Index Topic datasets ClinicalData, 6 GeneExpMatrix, 14 myglobalenv, 45 about, 3 cancer, 4 cancer_vignette, 5 cancerhelp, 4 cbind.na, 5 ClinicalData, 6 dialoggeneclassifier, 6 dialoggetgenelistmsigdb, 7 dialogmetoption, 8 dialogmut, 8 dialogoptioncircos, 9 dialogoptiongsealm, 9 dialogoptionphenotest, 10 dialogplotoption_skincor, 10 dialogsamplinggsea, 11 dialogselectfiles_gsea, 12 dialogspecificmut, 12 dialogsummary_gsea, 13 displayintable, 13 GeneExpMatrix, 14 getcases, 15 getcasesgenprofs, 16 getcircos, 16 getclinicaldatamatrix, 17 getclinicdata_multiplecases, 17 getcor_expcnamet, 18 geteset, 19 getgct_clsfiles, 20 getgctclsexample, 19 getgeneexpmatrix, 20 getgenelist, 21 getgenelistexample, 21 getgenelistfrommsigdb, 22 getgenesclassifier, 22 getgenestree_multiplecases, 23 getgenestree_singlecase, 23 getgenprofs, 24 getgsealm_diseases, 24 getgsealm_variables, 25 getintable, 25 getlistprofdata, 26 getmegaprofdata, 26 getmetdatamultiplegenes, 27 getmsigdb, 27 getmsigdbexample, 28 getmsigdbfile, 28 getmutdata, 29 getphenotest, 29 getprofilesdatamultiplegenes, 30 getprofilesdatasinglegene, 30 getspecificmut, 31 getsummarygsea, 31 getsurvival, 32 gettextwin, 33 GSEA, 33 GSEA.Analyze.Sets, 34 GSEA.ConsPlot, 35 GSEA.EnrichmentScore, 35 GSEA.EnrichmentScore2, 36 GSEA.Gct2Frame, 37 GSEA.Gct2Frame2, 37 GSEA.GeneRanking, 38 GSEA.HeatMapPlot, 39 GSEA.HeatMapPlot2, 39 GSEA.NormalizeCols, 40 GSEA.NormalizeRows, 40 GSEA.ReadClsFile, 41 GSEA.Res2Frame, 41 GSEA.Threshold, 42 GSEA.VarFilter, 43 GSEA.write.gct, 43 Match_GeneList_MSigDB, 44 modaldialog, 44 myglobalenv, 45 OLD.GSEA.EnrichmentScore, 45 plot_1gene_2genprofs, 46 plot_2genes_1genprof, 47 50

51 INDEX 51 plotmodel, 46 rbind.na, 47 Run.GSEA, 48 setworkspace, 48 testcheckedcasegenprof, 49

Package biocancer. August 29, 2018

Package biocancer. August 29, 2018 Package biocancer August 29, 2018 Title Interactive Multi-Omics Cancers Data Visualization and Analysis Version 1.8.0 Date 2018-04-12 biocancer is a Shiny App to visualize and analyse interactively Multi-Assays

More information

Package AbsFilterGSEA

Package AbsFilterGSEA Type Package Package AbsFilterGSEA September 21, 2017 Title Improved False Positive Control of Gene-Permuting GSEA with Absolute Filtering Version 1.5.1 Author Sora Yoon Maintainer

More information

Package msgbsr. January 17, 2019

Package msgbsr. January 17, 2019 Type Package Package msgbsr January 17, 2019 Title msgbsr: methylation sensitive genotyping by sequencing (MS-GBS) R functions Version 1.6.1 Date 2017-04-24 Author Benjamin Mayne Maintainer Benjamin Mayne

More information

Package MSstatsTMT. February 26, Title Protein Significance Analysis in shotgun mass spectrometry-based

Package MSstatsTMT. February 26, Title Protein Significance Analysis in shotgun mass spectrometry-based Package MSstatsTMT February 26, 2019 Title Protein Significance Analysis in shotgun mass spectrometry-based proteomic experiments with tandem mass tag (TMT) labeling Version 1.1.2 Date 2019-02-25 Tools

More information

Package p3state.msm. R topics documented: February 20, 2015

Package p3state.msm. R topics documented: February 20, 2015 Package p3state.msm February 20, 2015 Type Package Depends R (>= 2.8.1),survival,base Title Analyzing survival data Version 1.3 Date 2012-06-03 Author Luis Meira-Machado and Javier Roca-Pardinas

More information

Package diggitdata. April 11, 2019

Package diggitdata. April 11, 2019 Type Package Title Example data for the diggit package Version 1.14.0 Date 2014-08-29 Author Mariano Javier Alvarez Package diggitdata April 11, 2019 Maintainer Mariano Javier Alvarez

More information

Package inote. June 8, 2017

Package inote. June 8, 2017 Type Package Package inote June 8, 2017 Title Integrative Network Omnibus Total Effect Test Version 1.0 Date 2017-06-05 Author Su H. Chu Yen-Tsung Huang

More information

Package GANPAdata. February 19, 2015

Package GANPAdata. February 19, 2015 Type Package Title The GANPA Datasets Package Version 1.0 Date 2011-05-26 Package GANPAdata February 19, 2015 Author Zhaoyuan Fang, Weidong Tian and Hongbin Ji Maintainer Zhaoyuan Fang

More information

cancer: A Graphical User Interface for accessing and modeling the Cancer Genomics Data of MSKCC

cancer: A Graphical User Interface for accessing and modeling the Cancer Genomics Data of MSKCC cancer: A Graphical User Interface for accessing and modeling the Cancer Genomics Data of MSKCC Karim Mezhoud Mars 24, 2015 Contents 1 Introduction 2 2 Installation 2 3 Starting Window 3 3.1 Setting Workspace........................................

More information

Package MethPed. September 1, 2018

Package MethPed. September 1, 2018 Type Package Version 1.8.0 Date 2016-01-01 Package MethPed September 1, 2018 Title A DNA methylation classifier tool for the identification of pediatric brain tumor subtypes Depends R (>= 3.0.0), Biobase

More information

Package AIMS. June 29, 2018

Package AIMS. June 29, 2018 Type Package Package AIMS June 29, 2018 Title AIMS : Absolute Assignment of Breast Cancer Intrinsic Molecular Subtype Version 1.12.0 Date 2014-06-25 Description This package contains the AIMS implementation.

More information

Package xseq. R topics documented: September 11, 2015

Package xseq. R topics documented: September 11, 2015 Package xseq September 11, 2015 Title Assessing Functional Impact on Gene Expression of Mutations in Cancer Version 0.2.1 Date 2015-08-25 Author Jiarui Ding, Sohrab Shah Maintainer Jiarui Ding

More information

Package IMAS. March 10, 2019

Package IMAS. March 10, 2019 Type Package Package IMAS March 10, 2019 Title Integrative analysis of Multi-omics data for Alternative Splicing Version 1.6.1 Author Seonggyun Han, Younghee Lee Maintainer Seonggyun Han

More information

Package Actigraphy. R topics documented: January 15, Type Package Title Actigraphy Data Analysis Version 1.3.

Package Actigraphy. R topics documented: January 15, Type Package Title Actigraphy Data Analysis Version 1.3. Type Package Title Actigraphy Data Analysis Version 1.3.2 Date 2016-01-14 Package Actigraphy January 15, 2016 Author William Shannon, Tao Li, Hong Xian, Jia Wang, Elena Deych, Carlos Gonzalez Maintainer

More information

Package CLL. April 19, 2018

Package CLL. April 19, 2018 Type Package Title A Package for CLL Gene Expression Data Version 1.19.0 Author Elizabeth Whalen Package CLL April 19, 2018 Maintainer Robert Gentleman The CLL package contains the

More information

Package BUScorrect. September 16, 2018

Package BUScorrect. September 16, 2018 Type Package Package BUScorrect September 16, 2018 Title Batch Effects Correction with Unknown Subtypes Version 0.99.12 Date 2018-06-07 Author , Yingying Wei Maintainer

More information

How To Use MiRSEA. Junwei Han. July 1, Overview 1. 2 Get the pathway-mirna correlation profile(pmset) and a weighting matrix 2

How To Use MiRSEA. Junwei Han. July 1, Overview 1. 2 Get the pathway-mirna correlation profile(pmset) and a weighting matrix 2 How To Use MiRSEA Junwei Han July 1, 2015 Contents 1 Overview 1 2 Get the pathway-mirna correlation profile(pmset) and a weighting matrix 2 3 Discovering the dysregulated pathways(or prior gene sets) based

More information

Package inctools. January 3, 2018

Package inctools. January 3, 2018 Encoding UTF-8 Type Package Title Incidence Estimation Tools Version 1.0.11 Date 2018-01-03 Package inctools January 3, 2018 Tools for estimating incidence from biomarker data in crosssectional surveys,

More information

Package ega. March 21, 2017

Package ega. March 21, 2017 Title Error Grid Analysis Version 2.0.0 Package ega March 21, 2017 Maintainer Daniel Schmolze Functions for assigning Clarke or Parkes (Consensus) error grid zones to blood glucose values,

More information

ivlewbel: Uses heteroscedasticity to estimate mismeasured and endogenous regressor models

ivlewbel: Uses heteroscedasticity to estimate mismeasured and endogenous regressor models ivlewbel: Uses heteroscedasticity to estimate mismeasured and endogenous regressor models Fernihough, A. ivlewbel: Uses heteroscedasticity to estimate mismeasured and endogenous regressor models Queen's

More information

Package flowtype. R topics documented: July 18, Type Package. Title Phenotyping Flow Cytometry Assays. Version

Package flowtype. R topics documented: July 18, Type Package. Title Phenotyping Flow Cytometry Assays. Version Package flowtype July 18, 2013 Type Package Title Phenotyping Flow Cytometry Assays Version 1.6.0 Date 2011-04-27 Author Nima Aghaeepour Maintainer Nima Aghaeepour Phenotyping Flow

More information

Backcalculating HIV incidence and predicting AIDS in Australia, Cambodia and Vietnam. Australia

Backcalculating HIV incidence and predicting AIDS in Australia, Cambodia and Vietnam. Australia Backcalculating HIV incidence and predicting AIDS in Australia, Cambodia and Vietnam The aim of today s practical is to give you some hands-on experience with a nonparametric method for backcalculating

More information

Package prognosticroc

Package prognosticroc Type Package Package prognosticroc February 20, 2015 Title Prognostic ROC curves for evaluating the predictive capacity of a binary test Version 0.7 Date 2013-11-27 Author Y. Foucher

More information

Package ORIClust. February 19, 2015

Package ORIClust. February 19, 2015 Type Package Package ORIClust February 19, 2015 Title Order-restricted Information Criterion-based Clustering Algorithm Version 1.0-1 Date 2009-09-10 Author Maintainer Tianqing Liu

More information

Package TFEA.ChIP. R topics documented: January 28, 2018

Package TFEA.ChIP. R topics documented: January 28, 2018 Type Package Title Analyze Transcription Factor Enrichment Version 0.99.8 Author Laura Puente Santamaría, Luis del Peso Package TFEA.ChIP January 28, 2018 Maintainer Laura Puente Santamaría

More information

Package propoverlap. R topics documented: February 20, Type Package

Package propoverlap. R topics documented: February 20, Type Package Type Package Package propoverlap February 20, 2015 Title Feature (gene) selection based on the Proportional Overlapping Scores Version 1.0 Date 2014-09-15 Author Osama Mahmoud, Andrew Harrison, Aris Perperoglou,

More information

Package ordibreadth. R topics documented: December 4, Type Package Title Ordinated Diet Breadth Version 1.0 Date Author James Fordyce

Package ordibreadth. R topics documented: December 4, Type Package Title Ordinated Diet Breadth Version 1.0 Date Author James Fordyce Type Package Title Ordinated Diet Breadth Version 1.0 Date 2015-11-11 Author Package ordibreadth December 4, 2015 Maintainer James A. Fordyce Calculates ordinated diet breadth with some

More information

Introduction to SPSS S0

Introduction to SPSS S0 Basic medical statistics for clinical and experimental research Introduction to SPSS S0 Katarzyna Jóźwiak k.jozwiak@nki.nl November 10, 2017 1/55 Introduction SPSS = Statistical Package for the Social

More information

Package cssam. February 19, 2015

Package cssam. February 19, 2015 Type Package Package cssam February 19, 2015 Title cssam - cell-specific Significance Analysis of Microarrays Version 1.2.4 Date 2011-10-08 Author Shai Shen-Orr, Rob Tibshirani, Narasimhan Balasubramanian,

More information

Package seqcombo. R topics documented: March 10, 2019

Package seqcombo. R topics documented: March 10, 2019 Package seqcombo March 10, 2019 Title Visualization Tool for Sequence Recombination and Reassortment Version 1.5.1 Provides useful functions for visualizing sequence recombination and virus reassortment

More information

Package nopaco. R topics documented: April 13, Type Package Title Non-Parametric Concordance Coefficient Version 1.0.

Package nopaco. R topics documented: April 13, Type Package Title Non-Parametric Concordance Coefficient Version 1.0. Type Package Title Non-Parametric Concordance Coefficient Version 1.0.3 Date 2017-04-10 Package nopaco April 13, 2017 A non-parametric test for multi-observer concordance and differences between concordances

More information

IMPaLA tutorial.

IMPaLA tutorial. IMPaLA tutorial http://impala.molgen.mpg.de/ 1. Introduction IMPaLA is a web tool, developed for integrated pathway analysis of metabolomics data alongside gene expression or protein abundance data. It

More information

Package citccmst. February 19, 2015

Package citccmst. February 19, 2015 Version 1.0.2 Date 2014-01-07 Package citccmst February 19, 2015 Title CIT Colon Cancer Molecular SubTypes Prediction Description This package implements the approach to assign tumor gene expression dataset

More information

Package wally. May 25, Type Package

Package wally. May 25, Type Package Type Package Package wally May 25, 2017 Title The Wally Calibration Plot for Risk Prediction Models Version 1.0.9 Date 2017-04-28 Author Paul F Blanche , Thomas A. Gerds

More information

R documentation. of GSCA/man/GSCA-package.Rd etc. June 8, GSCA-package. LungCancer metadi... 3 plotmnw... 5 plotnw... 6 singledc...

R documentation. of GSCA/man/GSCA-package.Rd etc. June 8, GSCA-package. LungCancer metadi... 3 plotmnw... 5 plotnw... 6 singledc... R topics documented: R documentation of GSCA/man/GSCA-package.Rd etc. June 8, 2009 GSCA-package....................................... 1 LungCancer3........................................ 2 metadi...........................................

More information

Package leukemiaseset

Package leukemiaseset Package leukemiaseset August 14, 2018 Type Package Title Leukemia's microarray gene expression data (expressionset). Version 1.16.0 Date 2013-03-20 Author Sara Aibar, Celia Fontanillo and Javier De Las

More information

CSDplotter user guide Klas H. Pettersen

CSDplotter user guide Klas H. Pettersen CSDplotter user guide Klas H. Pettersen [CSDplotter user guide] [0.1.1] [version: 23/05-2006] 1 Table of Contents Copyright...3 Feedback... 3 Overview... 3 Downloading and installation...3 Pre-processing

More information

Package frailtyhl. February 19, 2015

Package frailtyhl. February 19, 2015 Type Package Title Frailty Models via H-likelihood Version 1.1 Date 2012-05-04 Author Il Do HA, Maengseok Noh, Youngjo Lee Package frailtyhl February 19, 2015 Maintainer Maengseok Noh

More information

Package pathifier. August 17, 2018

Package pathifier. August 17, 2018 Type Package Title Quantify deregulation of pathways in cancer Version 1.19.0 Date 2013-06-27 Author Yotam Drier Package pathifier August 17, 2018 Maintainer Assif Yitzhaky

More information

Package HAP.ROR. R topics documented: February 19, 2015

Package HAP.ROR. R topics documented: February 19, 2015 Type Package Title Recursive Organizer (ROR) Version 1.0 Date 2013-03-23 Author Lue Ping Zhao and Xin Huang Package HAP.ROR February 19, 2015 Maintainer Xin Huang Depends R (>=

More information

Gene Expression Analysis Web Forum. Jonathan Gerstenhaber Field Application Specialist

Gene Expression Analysis Web Forum. Jonathan Gerstenhaber Field Application Specialist Gene Expression Analysis Web Forum Jonathan Gerstenhaber Field Application Specialist Our plan today: Import Preliminary Analysis Statistical Analysis Additional Analysis Downstream Analysis 2 Copyright

More information

Package sbpiper. April 20, 2018

Package sbpiper. April 20, 2018 Version 1.7.0 Date 2018-04-20 Package sbpiper April 20, 2018 Title Data Analysis Functions for 'SBpipe' Package Depends R (>= 3.2.0) Imports colorramps, data.table, factoextra, FactoMineR, ggplot2 (>=

More information

Package DeconRNASeq. November 19, 2017

Package DeconRNASeq. November 19, 2017 Type Package Package DeconRNASeq November 19, 2017 Title Deconvolution of Heterogeneous Tissue Samples for mrna-seq data Version 1.20.0 Date 2013-01-22 Author Ting Gong Joseph D. Szustakowski

More information

Package PELVIS. R topics documented: December 18, Type Package

Package PELVIS. R topics documented: December 18, Type Package Type Package Package PELVIS December 18, 2018 Title Probabilistic Sex Estimate using Logistic Regression, Based on VISual Traits of the Human Os Coxae Version 1.0.5 Date 2018-12-18 Depends R (>= 3.4.0),

More information

Package fabiadata. July 19, 2018

Package fabiadata. July 19, 2018 Package fabiadata July 19, 2018 Title Data sets for FABIA (Factor Analysis for Bicluster Acquisition) Version 1.18.0 Date 2012-08-15 Author Sepp Hochreiter Maintainer Sepp Hochreiter

More information

Package metaseq. R topics documented: August 17, Type Package

Package metaseq. R topics documented: August 17, Type Package Type Package Package metaseq August 17, 2018 Title Meta-analysis of RNA-Seq count data in multiple studies Version 1.20.0 Date 2013-12-2 Author Maintainer Koki Tsuyuzaki

More information

MethylMix An R package for identifying DNA methylation driven genes

MethylMix An R package for identifying DNA methylation driven genes MethylMix An R package for identifying DNA methylation driven genes Olivier Gevaert May 3, 2016 Stanford Center for Biomedical Informatics Department of Medicine 1265 Welch Road Stanford CA, 94305-5479

More information

Package CorMut. January 21, 2019

Package CorMut. January 21, 2019 Type Package Package CorMut January 21, 2019 Title Detect the correlated mutations based on selection pressure Version 1.24.0 Date 2018-02-21 Author Maintainer Depends methods,seqinr,igraph

More information

Using SPSS for Correlation

Using SPSS for Correlation Using SPSS for Correlation This tutorial will show you how to use SPSS version 12.0 to perform bivariate correlations. You will use SPSS to calculate Pearson's r. This tutorial assumes that you have: Downloaded

More information

1. To review research methods and the principles of experimental design that are typically used in an experiment.

1. To review research methods and the principles of experimental design that are typically used in an experiment. Your Name: Section: 36-201 INTRODUCTION TO STATISTICAL REASONING Computer Lab Exercise Lab #7 (there was no Lab #6) Treatment for Depression: A Randomized Controlled Clinical Trial Objectives: 1. To review

More information

Package seqdesign. April 27, Version 1.1 Date

Package seqdesign. April 27, Version 1.1 Date Version 1.1 Date 2015-04-23 Package seqdesign April 27, 2015 Title Simulation and Group Sequential Monitoring of Randomized Two-Stage Treatment Efficacy Trials with Time-to-Event Endpoints Author Michal

More information

dataset1 <- read.delim("c:\dca_example_dataset1.txt", header = TRUE, sep = "\t") attach(dataset1)

dataset1 <- read.delim(c:\dca_example_dataset1.txt, header = TRUE, sep = \t) attach(dataset1) Worked examples of decision curve analysis using R A note about R versions The R script files to implement decision curve analysis were developed using R version 2.3.1, and were tested last using R version

More information

Assignment 5: Integrative epigenomics analysis

Assignment 5: Integrative epigenomics analysis Assignment 5: Integrative epigenomics analysis Due date: Friday, 2/24 10am. Note: no late assignments will be accepted. Introduction CpG islands (CGIs) are important regulatory regions in the genome. What

More information

Hour 2: lm (regression), plot (scatterplots), cooks.distance and resid (diagnostics) Stat 302, Winter 2016 SFU, Week 3, Hour 1, Page 1

Hour 2: lm (regression), plot (scatterplots), cooks.distance and resid (diagnostics) Stat 302, Winter 2016 SFU, Week 3, Hour 1, Page 1 Agenda for Week 3, Hr 1 (Tuesday, Jan 19) Hour 1: - Installing R and inputting data. - Different tools for R: Notepad++ and RStudio. - Basic commands:?,??, mean(), sd(), t.test(), lm(), plot() - t.test()

More information

Package PROcess. R topics documented: June 15, Title Ciphergen SELDI-TOF Processing Version Date Author Xiaochun Li

Package PROcess. R topics documented: June 15, Title Ciphergen SELDI-TOF Processing Version Date Author Xiaochun Li Title Ciphergen SELDI-TOF Processing Version 1.56.0 Date 2005-11-06 Author Package PROcess June 15, 2018 A package for processing protein mass spectrometry data. Depends Icens Imports graphics, grdevices,

More information

Package metaseq. R topics documented: November 1, Type Package. Title Meta-analysis of RNA-Seq count data in multiple studies. Version 1.1.

Package metaseq. R topics documented: November 1, Type Package. Title Meta-analysis of RNA-Seq count data in multiple studies. Version 1.1. Package metaseq November 1, 2013 Type Package Title Meta-analysis of RNA-Seq count data in multiple studies Version 1.1.1 Date 2013-09-25 Author Koki Tsuyuzaki, Itoshi Nikaido Maintainer Koki Tsuyuzaki

More information

Benchmark Dose Modeling Cancer Models. Allen Davis, MSPH Jeff Gift, Ph.D. Jay Zhao, Ph.D. National Center for Environmental Assessment, U.S.

Benchmark Dose Modeling Cancer Models. Allen Davis, MSPH Jeff Gift, Ph.D. Jay Zhao, Ph.D. National Center for Environmental Assessment, U.S. Benchmark Dose Modeling Cancer Models Allen Davis, MSPH Jeff Gift, Ph.D. Jay Zhao, Ph.D. National Center for Environmental Assessment, U.S. EPA Disclaimer The views expressed in this presentation are those

More information

CNV PCA Search Tutorial

CNV PCA Search Tutorial CNV PCA Search Tutorial Release 8.1 Golden Helix, Inc. March 18, 2014 Contents 1. Data Preparation 2 A. Join Log Ratio Data with Phenotype Information.............................. 2 B. Activate only

More information

Variant Classification. Author: Mike Thiesen, Golden Helix, Inc.

Variant Classification. Author: Mike Thiesen, Golden Helix, Inc. Variant Classification Author: Mike Thiesen, Golden Helix, Inc. Overview Sequencing pipelines are able to identify rare variants not found in catalogs such as dbsnp. As a result, variants in these datasets

More information

Quality assessment of TCGA Agilent gene expression data for ovary cancer

Quality assessment of TCGA Agilent gene expression data for ovary cancer Quality assessment of TCGA Agilent gene expression data for ovary cancer Nianxiang Zhang & Keith A. Baggerly Dept of Bioinformatics and Computational Biology MD Anderson Cancer Center Oct 1, 2010 Contents

More information

User Guide. Association analysis. Input

User Guide. Association analysis. Input User Guide TFEA.ChIP is a tool to estimate transcription factor enrichment in a set of differentially expressed genes using data from ChIP-Seq experiments performed in different tissues and conditions.

More information

Package speff2trial. February 20, 2015

Package speff2trial. February 20, 2015 Version 1.0.4 Date 2012-10-30 Package speff2trial February 20, 2015 Title Semiparametric efficient estimation for a two-sample treatment effect Author Michal Juraska , with contributions

More information

Module 3: Pathway and Drug Development

Module 3: Pathway and Drug Development Module 3: Pathway and Drug Development Table of Contents 1.1 Getting Started... 6 1.2 Identifying a Dasatinib sensitive cancer signature... 7 1.2.1 Identifying and validating a Dasatinib Signature... 7

More information

Vega: Variational Segmentation for Copy Number Detection

Vega: Variational Segmentation for Copy Number Detection Vega: Variational Segmentation for Copy Number Detection Sandro Morganella Luigi Cerulo Giuseppe Viglietto Michele Ceccarelli Contents 1 Overview 1 2 Installation 1 3 Vega.RData Description 2 4 Run Vega

More information

Multilevel data analysis

Multilevel data analysis Multilevel data analysis TUTORIAL 1 1 1 Multilevel PLS-DA 1 PLS-DA 1 1 Frequency Frequency 1 1 3 5 number of misclassifications (NMC) 1 3 5 number of misclassifications (NMC) Copyright Biosystems Data

More information

Data mining with Ensembl Biomart. Stéphanie Le Gras

Data mining with Ensembl Biomart. Stéphanie Le Gras Data mining with Ensembl Biomart Stéphanie Le Gras (slegras@igbmc.fr) Guidelines Genome data Genome browsers Getting access to genomic data: Ensembl/BioMart 2 Genome Sequencing Example: Human genome 2000:

More information

ECDC HIV Modelling Tool User Manual

ECDC HIV Modelling Tool User Manual ECDC HIV Modelling Tool User Manual Version 1.3.0 European Centre for Disease Prevention and Control 20 December 2017 1 Table of Contents 2 Introduction... 3 2.1 Incidence Method... 3 2.2 London Method...

More information

Meta-analysis using RevMan. Yemisi Takwoingi October 2015

Meta-analysis using RevMan. Yemisi Takwoingi October 2015 Yemisi Takwoingi October 2015 Contents 1 Introduction... 1 2 Dataset 1 PART I..2 3 Starting RevMan... 2 4 Data and analyses in RevMan... 2 5 RevMan calculator tool... 2 Table 1. Data for derivation of

More information

Package CompareTests

Package CompareTests Type Package Package CompareTests February 6, 2017 Title Correct for Verification Bias in Diagnostic Accuracy & Agreement Version 1.2 Date 2016-2-6 Author Hormuzd A. Katki and David W. Edelstein Maintainer

More information

TCGA. The Cancer Genome Atlas

TCGA. The Cancer Genome Atlas TCGA The Cancer Genome Atlas TCGA: History and Goal History: Started in 2005 by the National Cancer Institute (NCI) and the National Human Genome Research Institute (NHGRI) with $110 Million to catalogue

More information

A Quick-Start Guide for rseqdiff

A Quick-Start Guide for rseqdiff A Quick-Start Guide for rseqdiff Yang Shi (email: shyboy@umich.edu) and Hui Jiang (email: jianghui@umich.edu) 09/05/2013 Introduction rseqdiff is an R package that can detect differential gene and isoform

More information

Exercise Pro Getting Started Guide

Exercise Pro Getting Started Guide Exercise Pro Getting Started Guide Table Of Contents Installation... 1 Overview... 1 Tutorial... 1 The Exercise Pro 6 Interface... 1 Searching and Selecting Exercises... 2 Printing the Exercise Program...

More information

Charts Worksheet using Excel Obesity Can a New Drug Help?

Charts Worksheet using Excel Obesity Can a New Drug Help? Worksheet using Excel 2000 Obesity Can a New Drug Help? Introduction Obesity is known to be a major health risk. The data here arise from a study which aimed to investigate whether or not a new drug, used

More information

Package pollen. October 7, 2018

Package pollen. October 7, 2018 Type Package Title Analysis of Aerobiological Data Version 0.71.0 Package pollen October 7, 2018 Supports analysis of aerobiological data. Available features include determination of pollen season limits,

More information

Package QualInt. February 19, 2015

Package QualInt. February 19, 2015 Title Test for Qualitative Interactions Version 1.0.0 Package QualInt February 19, 2015 Author Lixi Yu, Eun-Young Suh, Guohua (James) Pan Maintainer Lixi Yu Description Used for testing

More information

Managing and Taking Notes

Managing and Taking Notes When scheduling a meeting, the host can specify the default note-taking options that take effect once the meeting starts. During a meeting, the presenter can change the default note-taking options at any

More information

One-Way Independent ANOVA

One-Way Independent ANOVA One-Way Independent ANOVA Analysis of Variance (ANOVA) is a common and robust statistical test that you can use to compare the mean scores collected from different conditions or groups in an experiment.

More information

Q: How do I get the protein concentration in mg/ml from the standard curve if the X-axis is in units of µg.

Q: How do I get the protein concentration in mg/ml from the standard curve if the X-axis is in units of µg. Photometry Frequently Asked Questions Q: How do I get the protein concentration in mg/ml from the standard curve if the X-axis is in units of µg. Protein standard curves are traditionally presented as

More information

Two-Way Independent ANOVA

Two-Way Independent ANOVA Two-Way Independent ANOVA Analysis of Variance (ANOVA) a common and robust statistical test that you can use to compare the mean scores collected from different conditions or groups in an experiment. There

More information

AudioConsole. User Guide. Doc. No EN/01 Part No EN

AudioConsole. User Guide. Doc. No EN/01 Part No EN AudioConsole Doc. No. 7-50-2180-EN/01 Part No. 7-50-21800-EN Copyright notice [2003], 2018 Inmedico A/S. All rights reserved. Oscilla is aregistered trademark of Inmedico A/S in the U.S.A. and/or other

More information

Table of Contents Foreword 9 Stay Informed 9 Introduction to Visual Steps 10 What You Will Need 10 Basic Knowledge 11 How to Use This Book

Table of Contents Foreword 9 Stay Informed 9 Introduction to Visual Steps 10 What You Will Need 10 Basic Knowledge 11 How to Use This Book Table of Contents Foreword... 9 Stay Informed... 9 Introduction to Visual Steps... 10 What You Will Need... 10 Basic Knowledge... 11 How to Use This Book... 11 The Screenshots... 12 The Website and Supplementary

More information

Package splicer. R topics documented: August 19, Version Date 2014/04/29

Package splicer. R topics documented: August 19, Version Date 2014/04/29 Package splicer August 19, 2014 Version 1.6.0 Date 2014/04/29 Title Classification of alternative splicing and prediction of coding potential from RNA-seq data. Author Johannes Waage ,

More information

Managing and Taking Notes

Managing and Taking Notes When scheduling a meeting, the host can specify the default note-taking options that take effect once the meeting starts. During a meeting, the presenter can change the default note-taking options at any

More information

Enter a file or directory path and press the Enter key

Enter a file or directory path and press the Enter key Untitled Flow? Assistance Routine Description " importfiles Import file(s) into H O # getframes Get a list of frames in H O $ splitframe Split a frame into two or more frames % mergeframes Merge two frames

More information

Package NarrowPeaks. August 3, Version Date Type Package

Package NarrowPeaks. August 3, Version Date Type Package Package NarrowPeaks August 3, 2013 Version 1.5.0 Date 2013-02-13 Type Package Title Analysis of Variation in ChIP-seq using Functional PCA Statistics Author Pedro Madrigal , with contributions

More information

SUPPLEMENTARY INFORMATION

SUPPLEMENTARY INFORMATION doi:10.1038/nature10866 a b 1 2 3 4 5 6 7 Match No Match 1 2 3 4 5 6 7 Turcan et al. Supplementary Fig.1 Concepts mapping H3K27 targets in EF CBX8 targets in EF H3K27 targets in ES SUZ12 targets in ES

More information

PROcess. June 1, Align peaks for a specified precision. A user specified precision of peak position.

PROcess. June 1, Align peaks for a specified precision. A user specified precision of peak position. PROcess June 1, 2010 align Align peaks for a specified precision. A vector of peaks are expanded to a collection of intervals, eps * m/z of the peak to the left and right of the peak position. They are

More information

Voxar 3D CardiaMetrix. Reference Guide

Voxar 3D CardiaMetrix. Reference Guide Voxar 3D CardiaMetrix Reference Guide The software described in this document is furnished under a license, and may be used or copied only according to the terms of such license. Toshiba means, Toshiba

More information

SubLasso:a feature selection and classification R package with a. fixed feature subset

SubLasso:a feature selection and classification R package with a. fixed feature subset SubLasso:a feature selection and classification R package with a fixed feature subset Youxi Luo,3,*, Qinghan Meng,2,*, Ruiquan Ge,2, Guoqin Mai, Jikui Liu, Fengfeng Zhou,#. Shenzhen Institutes of Advanced

More information

Section 6: Analysing Relationships Between Variables

Section 6: Analysing Relationships Between Variables 6. 1 Analysing Relationships Between Variables Section 6: Analysing Relationships Between Variables Choosing a Technique The Crosstabs Procedure The Chi Square Test The Means Procedure The Correlations

More information

Package medicalrisk. August 29, 2016

Package medicalrisk. August 29, 2016 Type Package Package medicalrisk August 29, 2016 Title Medical Risk and Comorbidity Tools for ICD-9-CM Data Version 1.2 Date 2016-01-23 Generates risk estimates and comorbidity flags from ICD-9-CM codes

More information

Package mdsdt. March 12, 2016

Package mdsdt. March 12, 2016 Version 1.2 Date 2016-03-11 Package mdsdt March 12, 2016 Title Functions for Analysis of Data with General Recognition Theory Author Robert X.D. Hawkins , Joe Houpt ,

More information

Bangor University Laboratory Exercise 1, June 2008

Bangor University Laboratory Exercise 1, June 2008 Laboratory Exercise, June 2008 Classroom Exercise A forest land owner measures the outside bark diameters at.30 m above ground (called diameter at breast height or dbh) and total tree height from ground

More information

Package bionetdata. R topics documented: February 19, Type Package Title Biological and chemical data networks Version 1.0.

Package bionetdata. R topics documented: February 19, Type Package Title Biological and chemical data networks Version 1.0. Type Package Title Biological and chemical data networks Version 1.0.1 Date 2014-09-07 Package bionetdata February 19, 2015 Author Matteo Re, Giorgio Valentini -- Universita' degli Studi di Milano Maintainer

More information

EXPression ANalyzer and DisplayER

EXPression ANalyzer and DisplayER EXPression ANalyzer and DisplayER Tom Hait Aviv Steiner Igor Ulitsky Chaim Linhart Amos Tanay Seagull Shavit Rani Elkon Adi Maron-Katz Dorit Sagir Eyal David Roded Sharan Israel Steinfeld Yossi Shiloh

More information

OECD QSAR Toolbox v.4.2. An example illustrating RAAF scenario 6 and related assessment elements

OECD QSAR Toolbox v.4.2. An example illustrating RAAF scenario 6 and related assessment elements OECD QSAR Toolbox v.4.2 An example illustrating RAAF scenario 6 and related assessment elements Outlook Background Objectives Specific Aims Read Across Assessment Framework (RAAF) The exercise Workflow

More information

The FunCluster Package

The FunCluster Package The FunCluster Package October 23, 2007 Version 1.07 Date 2007-10-23 Title Functional Profiling of Microarray Expression Data Author Corneliu Henegar Maintainer Corneliu Henegar

More information

# Assessment of gene expression levels between several cell group types is a common application of the unsupervised technique.

# Assessment of gene expression levels between several cell group types is a common application of the unsupervised technique. # Aleksey Morozov # Microarray Data Analysis Using Hierarchical Clustering. # The "unsupervised learning" approach deals with data that has the features X1,X2...Xp, but does not have an associated response

More information