Visualization of Human Disorders and Genes Network

Size: px
Start display at page:

Download "Visualization of Human Disorders and Genes Network"

Transcription

1 Visualization of Human Disorders and Genes Network Bryan (Dung) Ta Dept. of Computer Science University of Maryland, College Park I. INTRODUCTION Goh et al. [4] has provided an useful analysis of the human disease network. The work used Gephi [1] to visualize the network and discussed insight information about the relations between human disease and human genes. The main purpose of this assigment is to use another visualization tool called NodeXL to reveal the insights obtained directly from the data. Since the topic is very field-specific and requires a good domain knowledge about genes and diseases, in order to explain reasons behind the insights, this work cited some expert explainations from [4]. II. DATA DESCRIPTION The raw data for this work is collected from [2] [4]. It contains three major data sets described as follows: - The Disorder dataset with 867 nodes and 1527 edges. Nodes represent disorders (i.e. diseases). Two disorders are connected to each other if they share at least one gene in which mutations are associated with both disorders. - The Gene dataset with 1378 nodes and 7491 edges. Nodes represent disease genes. Two genes are connected if they are associated with the same disorder. - The Disorder-Gene bipartite dataset with 3290 nodes and 2673 edges. A disorder and a gene are then connected by a link if mutations in that gene are implicated in that disorder. Contribution of this work includes visualizing the data to highlight interesting information about the connections between human disorders and genes some of which have been discovered by [4], and converting the raw data into the excel format that works with NodeXL (e.g. manually creating 22 groups, removing/modifying 2000 same IDs in the bipartite dataset etc) which took a considerable amount of time. I also made the processed data available [3] to NodeXL community. III. INSIGHT 1: LIFE WOULD BE EASY IF EACH HUMAN DISORDER DID NOT RELATE TO EACH OTHER. Sad as it might be, in the reality, many disorders closely relate to each other, which makes it hard to explain the cause of diseases and complicates treatment processes. From the figure 1, we can see that each disorder has at least one link to the other disorder, and most of them (516 out of 867) altogether form a giant component. This suggests that most diseases share the same genetic origins [4]. Not only individual disorders but classes of disorders also exhibit strong interconnections both among classes and within classes themselves. In figure 5, the concentration of grey links demonstrates significant connections between 22 disorder classes. 1

2 Fig. 1. Human disorder (disease) network. Each node corresponds to a distinct disorder, and is shaped and colored based on the disorder class to which it belongs (the names of 22 disorder classes shown on Figure 5). There is a link between 2 node if they share at least one gene. The size of each node is proportional to its number of links. Only the names of a small number of disorders are shown as examples in order to reduce the graph density. Figure 2 and Figure 3 present interesting observations on connections between disorders. In figure 2, the degree (i.e. the number of distinct disorders that a particular disorder connects to) distribution of the network indicates that the majority of the disorders are linked to only a few of the other disorders, whereas some disorders are linked to many other disorders. For example, colon cancer are linked to 50 other disorders and breast cancer are linked to 30 disorders. Figure 3, which diagrams disorders based on their degree and number of related genes, highlights a fascinating fact that colon cancer is the disorder that connects to a significant number of disorders (50 other disorders) whereas deafness connects to the highest number of genes yet to a very small number of disorders. This suggests that the degree of disorders connection with genes does not determine the degree of their connections to other diseases. This insight information has also been discovered by [4] which used gephi. In this assigment, I would like to use a different tool (NodeXL) to see if we can find out the same insight. As the result shows, we can get the same useful information using NodeXL. Fig. 2. Histogram of degree of the network. 2

3 Fig. 3. Disorder ranking based on their degree and number of related genes. IV. INSIGHT 2: WATCH OUT FOR CANCER - THE MOST CONNECTED DISORDER CLASS In addition to interconnections among different disorders and among different disorder classes in general, there are different trends of connections within a certain disorder class. In Figure 4, whereas the large cancer cluster (blue solid diamonds) is tightly interconnected, metabolic disorders (red solid triangles) only form scattered small connected components. Note that all the other clusters, which are not analyzed, are colored in grey to make cancer and metabolic clusters more visible. This interesting insight is also highlighted in cancer cluster and metabolic cluster shown Figure 5 when I clustered disorders based on their disorder classes. The reason for cancer s prominence among the most connected disorder classes comes from the fact that many cancer subtypes share common tumor genes such as TP53 and PTEN which make the connection tighter [4]. Fig. 4. Cancer cluster (blue solid diamonds) is better connected than metabolic cluster (red solid triangles). Other clusters are colored in grey to facilitate better contrast. 3

4 Fig. 5. Twenty-two disorder classes. To quantify the difference in degree of cluster interconnection, for each disorder class, based on the graph metrics for the groups, I measured the following metrics: - The ratio of single vertices to vertices (the lower the ratio is, the better the cluster connects) - The ratio of maximum vertices in a connected component to the total vertices (the higher the ratio is, the better the cluster connects) - The ratio of maximum edges in a connected component to total Edges (the higher the ratio is, the better the cluster connects) - Graph density (The higher the density value is, the better the cluster connects) I noticed that cancer and neurological disorders show better values for those metrics and also represent the most connected disease classes, in contrast with metabolic, and multiple disorders which have worse values and consequently are the least connected. V. INSIGHT 3: TOP 3 MOST ACTIVE GENES. The visualization in Figure 6 is a bipartite graph between disorders and genes. From this graph, I use NodeXL s filter function to get rid of the genes which have low degree. The Figure 7 presents the result of the above filtering process and shows the top 3 most active genes in terms of the number of related disorders including PAX6 which relates to 11 disorders, PTEN which relates to 10 disorders, FGFR2 and TP53, each of which relates to 9 disorders. Interestingly, most of these genes relate to most of the diseases in the cancer class. Note that in order to show the most active genes in terms of the number of related disorders, we can employ a similar visualization as done in Figure 3. In order to test various features of the tool, I employed filter feature of NodeXL. 4

5 Fig. 6. Bipartite graph between genes (green solid rectangle) and diseases (red solid disk). A link is placed between a disorder and a disease gene if mutations in that gene lead to the specific disorder.the size of each node is proportional to its number of links. Fig. 7. The top 3 active genes (blue solid squares). VI. TOOL CRITIQUE Overall, NodeXL is a very useful tool for network visualization. It provides users with many well-developed features to explore data sets. I am very impressed with NodeXL s ability of displaying insights almost instantly or with several quick processing steps after the data is loaded. However, there is still room for improvements which are suggested as folllows. - When visualizing data with X, Y axes (i.e. Graph Elements/Axes), the axes disappear when the visualization is saved to an image file. This is a significant weakness as these axes contain important information such as axes labels, axes unit and values which are key to interpret diagrams. The image below is the corresponding output to the visualization in Figure 3 yet without axes labels. Undeniably, users have much more difficulty in extracting information from this image due to the lack of axes labels. 5

6 Fig. 8. Diagram without axes lables. - Legends are not easy to be edited/created by users. In some contexts, the tool does not seem to have a supporting feature to create legends for group attributes. Particularly in this assignment, although I want to create legends to explain the 22 groups that are symbolized in different colors and shapes, I found no feature supporting this task. - There is no undo feature which makes it impossible to reverse any changes to the visualization. REFERENCES [1] [2] Publications/ PNAS-HumanDisease/Suppl/index.htm. [3] f12/images/f/fd/bryanta NodeXL Data.zip. [4] K.I. Goh, M.E. Cusick, D. Valle, B. Childs, M. Vidal, and A.L. Barabási. The human disease network. Proceedings of the National Academy of Sciences, 104(21): ,

Application Project for CMSC734: Information Visualization

Application Project for CMSC734: Information Visualization Application Project for CMSC734: Information Visualization Tibco s Spotfire software was used to analyze Prof. Vaughn-Cooke s survey data on diabetes treatment [1]. The dataset consist of diabetes blood-glucose

More information

Nature Neuroscience: doi: /nn Supplementary Figure 1

Nature Neuroscience: doi: /nn Supplementary Figure 1 Supplementary Figure 1 Illustration of the working of network-based SVM to confidently predict a new (and now confirmed) ASD gene. Gene CTNND2 s brain network neighborhood that enabled its prediction by

More information

The North Carolina Health Data Explorer

The North Carolina Health Data Explorer The North Carolina Health Data Explorer The Health Data Explorer provides access to health data for North Carolina counties in an interactive, user-friendly atlas of maps, tables, and charts. It allows

More information

User Manual. RaySafe i2 dose viewer

User Manual. RaySafe i2 dose viewer User Manual RaySafe i2 dose viewer 2012.03 Unfors RaySafe 5001048-A All rights are reserved. Reproduction or transmission in whole or in part, in any form or by any means, electronic, mechanical or otherwise,

More information

User s Manual Version 1.0

User s Manual Version 1.0 User s Manual Version 1.0 #639 Longmian Avenue, Jiangning District, Nanjing,211198,P.R.China. http://tcoa.cpu.edu.cn/ Contact us at xiaosheng.wang@cpu.edu.cn for technical issue and questions Catalogue

More information

LESSON: Working with line graphs

LESSON: Working with line graphs LESSON: Working with line graphs FOCUS QUESTION: How do I display trends in data? This lesson shows you different ways to plot the rows and columns of a table or array using line graphs. It introduces

More information

Unit 7 Comparisons and Relationships

Unit 7 Comparisons and Relationships Unit 7 Comparisons and Relationships Objectives: To understand the distinction between making a comparison and describing a relationship To select appropriate graphical displays for making comparisons

More information

Main Study: Summer Methods. Design

Main Study: Summer Methods. Design Main Study: Summer 2000 Methods Design The experimental design is within-subject each participant experiences five different trials for each of the ten levels of Display Condition and for each of the three

More information

Students will understand the definition of mean, median, mode and standard deviation and be able to calculate these functions with given set of

Students will understand the definition of mean, median, mode and standard deviation and be able to calculate these functions with given set of Students will understand the definition of mean, median, mode and standard deviation and be able to calculate these functions with given set of numbers. Also, students will understand why some measures

More information

Visualizing Temporal Patterns by Clustering Patients

Visualizing Temporal Patterns by Clustering Patients Visualizing Temporal Patterns by Clustering Patients Grace Shin, MS 1 ; Samuel McLean, MD 2 ; June Hu, MS 2 ; David Gotz, PhD 1 1 School of Information and Library Science; 2 Department of Anesthesiology

More information

v Feature Stamping SMS 12.0 Tutorial Prerequisites Requirements TABS model Map Module Mesh Module Scatter Module Time minutes

v Feature Stamping SMS 12.0 Tutorial Prerequisites Requirements TABS model Map Module Mesh Module Scatter Module Time minutes v. 12.0 SMS 12.0 Tutorial Objectives In this lesson will teach how to use conceptual modeling techniques to create numerical models that incorporate flow control structures into existing bathymetry. The

More information

OncoPPi Portal A Cancer Protein Interaction Network to Inform Therapeutic Strategies

OncoPPi Portal A Cancer Protein Interaction Network to Inform Therapeutic Strategies OncoPPi Portal A Cancer Protein Interaction Network to Inform Therapeutic Strategies 2017 Contents Datasets... 2 Protein-protein interaction dataset... 2 Set of known PPIs... 3 Domain-domain interactions...

More information

GLOOKO REPORT REFERENCE GUIDE

GLOOKO REPORT REFERENCE GUIDE GLOOKO REPORT REFERENCE GUIDE November 2018 Version IFU-0010 02 Contents Intended Use... 2 Warnings... 2 Introduction... 3 Reports... 4 Report Criteria...4 Date Range... 4 Glucose Data Source... 4 Exercise

More information

smk72+ Handbook Prof. Dr. Andreas Frey Dr. Lars Balzer Stephan Spuhler smk72+ Handbook Page 1

smk72+ Handbook Prof. Dr. Andreas Frey Dr. Lars Balzer Stephan Spuhler smk72+ Handbook Page 1 smk72+ Handbook Prof. Dr. Andreas Frey Dr. Lars Balzer Stephan Spuhler Email: support@kompetenzscreening.de Page 1 Table of Contents HOME... 3 BASIC INFORMATION ON LOGGING IN... 4 LOGIN PROCESS FOR PARTICIPANTS...

More information

How to interpret scientific & statistical graphs

How to interpret scientific & statistical graphs How to interpret scientific & statistical graphs Theresa A Scott, MS Department of Biostatistics theresa.scott@vanderbilt.edu http://biostat.mc.vanderbilt.edu/theresascott 1 A brief introduction Graphics:

More information

Nature Methods: doi: /nmeth.3115

Nature Methods: doi: /nmeth.3115 Supplementary Figure 1 Analysis of DNA methylation in a cancer cohort based on Infinium 450K data. RnBeads was used to rediscover a clinically distinct subgroup of glioblastoma patients characterized by

More information

Brain Tumor Detection using Watershed Algorithm

Brain Tumor Detection using Watershed Algorithm Brain Tumor Detection using Watershed Algorithm Dawood Dilber 1, Jasleen 2 P.G. Student, Department of Electronics and Communication Engineering, Amity University, Noida, U.P, India 1 P.G. Student, Department

More information

Feature Stamping SURFACE WATER MODELING SYSTEM. 1 Introduction. 2 Opening a Background Image

Feature Stamping SURFACE WATER MODELING SYSTEM. 1 Introduction. 2 Opening a Background Image SURFACE WATER MODELING SYSTEM Feature Stamping 1 Introduction In this lesson you will learn how to use conceptual modeling techniques to create numerical models that incorporate flow control structures

More information

chapter 1 - fig. 2 Mechanism of transcriptional control by ppar agonists.

chapter 1 - fig. 2 Mechanism of transcriptional control by ppar agonists. chapter 1 - fig. 1 The -omics subdisciplines. chapter 1 - fig. 2 Mechanism of transcriptional control by ppar agonists. 201 figures chapter 1 chapter 2 - fig. 1 Schematic overview of the different steps

More information

EXTRACT THE BREAST CANCER IN MAMMOGRAM IMAGES

EXTRACT THE BREAST CANCER IN MAMMOGRAM IMAGES International Journal of Civil Engineering and Technology (IJCIET) Volume 10, Issue 02, February 2019, pp. 96-105, Article ID: IJCIET_10_02_012 Available online at http://www.iaeme.com/ijciet/issues.asp?jtype=ijciet&vtype=10&itype=02

More information

Journal: Nature Methods

Journal: Nature Methods Journal: Nature Methods Article Title: Network-based stratification of tumor mutations Corresponding Author: Trey Ideker Supplementary Item Supplementary Figure 1 Supplementary Figure 2 Supplementary Figure

More information

Identification of Tissue Independent Cancer Driver Genes

Identification of Tissue Independent Cancer Driver Genes Identification of Tissue Independent Cancer Driver Genes Alexandros Manolakos, Idoia Ochoa, Kartik Venkat Supervisor: Olivier Gevaert Abstract Identification of genomic patterns in tumors is an important

More information

Warfarin Help Documentation

Warfarin Help Documentation Warfarin Help Documentation Table Of Contents Warfarin Management... 1 iii Warfarin Management Warfarin Management The Warfarin Management module is a powerful tool for monitoring INR results and advising

More information

ECG-PS ECG SIGNAL MEASUREMENT MODULE. Operation Manual. March 2005

ECG-PS ECG SIGNAL MEASUREMENT MODULE. Operation Manual. March 2005 ECG-PS ECG SIGNAL MEASUREMENT MODULE Operation Manual March 2005 2555 Collins Avenue, Suite C-5 - Miami Beach FL - 33140 - U.S.A. Phone #: (305) 534-5905 Fax: (305) 534-8222 e-mail: info@galix-gbi.com

More information

Six Sigma Glossary Lean 6 Society

Six Sigma Glossary Lean 6 Society Six Sigma Glossary Lean 6 Society ABSCISSA ACCEPTANCE REGION ALPHA RISK ALTERNATIVE HYPOTHESIS ASSIGNABLE CAUSE ASSIGNABLE VARIATIONS The horizontal axis of a graph The region of values for which the null

More information

Non Muscle Invasive Bladder Cancer (NMIBC) Experts Discuss Treatment Options. Part II: The Future Treatment of NMIBC

Non Muscle Invasive Bladder Cancer (NMIBC) Experts Discuss Treatment Options. Part II: The Future Treatment of NMIBC Non Muscle Invasive Bladder Cancer (NMIBC) Experts Discuss Treatment Options Wednesday, June 14, 2017 Part II: The Future Treatment of NMIBC Presented by Dr. Arlene Siefer-Radtke is an Associate Professor

More information

v Feature Stamping SMS 13.0 Tutorial Prerequisites Requirements Map Module Mesh Module Scatter Module Time minutes

v Feature Stamping SMS 13.0 Tutorial Prerequisites Requirements Map Module Mesh Module Scatter Module Time minutes v. 13.0 SMS 13.0 Tutorial Objectives Learn how to use conceptual modeling techniques to create numerical models which incorporate flow control structures into existing bathymetry. The flow control structures

More information

CHAPTER ONE CORRELATION

CHAPTER ONE CORRELATION CHAPTER ONE CORRELATION 1.0 Introduction The first chapter focuses on the nature of statistical data of correlation. The aim of the series of exercises is to ensure the students are able to use SPSS to

More information

Making charts in Excel

Making charts in Excel Making charts in Excel Use Excel file MakingChartsInExcel_data We ll start with the worksheet called treatment This shows the number of admissions (not necessarily people) to Twin Cities treatment programs

More information

Mobile Phone Enabled Social Community Extraction for Controlling of Disease Propagation in Healthcare

Mobile Phone Enabled Social Community Extraction for Controlling of Disease Propagation in Healthcare 2 Eighth IEEE International Conference on Mobile Ad-Hoc and Sensor Systems Mobile Phone Enabled Social Community Extraction for Controlling of Disease Propagation in Healthcare Yanzhi Ren, Jie Yang, Mooi

More information

Pathway Exercises Metabolism and Pathways

Pathway Exercises Metabolism and Pathways 1. Find the metabolic pathway for glycolysis. For this exercise use PlasmoDB.org Pathway Exercises Metabolism and Pathways a. Navigate to the search page for Identify Metabolic Pathways based on Pathway

More information

Introduction to VideometerLab 2 for Malting

Introduction to VideometerLab 2 for Malting New vision sensors for lab and in-line quality assessment Introduction to VideometerLab 2 for Malting For more information contact: analytikltd (UK and Ireland Distributor) Barn B, 2 Cygnus Business Park,

More information

Supplementary Material. Functional connectivity in multiple cortical networks is associated with performance. across cognitive domains in older adults

Supplementary Material. Functional connectivity in multiple cortical networks is associated with performance. across cognitive domains in older adults Supplementary Material Functional connectivity in multiple cortical networks is associated with performance across cognitive domains in older adults Emily E. Shaw 1,2, Aaron P. Schultz 1,2,3, Reisa A.

More information

Your Guide to the Breast Cancer Pathology. Report. Key Questions. Here are important questions to be sure you understand, with your doctor s help:

Your Guide to the Breast Cancer Pathology. Report. Key Questions. Here are important questions to be sure you understand, with your doctor s help: Your Guide to the Breast Cancer Pathology Report Key Questions Here are important questions to be sure you understand, with your doctor s help: Your Guide to the Breast Cancer Pathology Report 1. Is this

More information

EMOTIONAL INTELLIGENCE skills assessment: technical report

EMOTIONAL INTELLIGENCE skills assessment: technical report OnlineAssessments EISA EMOTIONAL INTELLIGENCE skills assessment: technical report [ Abridged Derek Mann ] To accompany the Emotional Intelligence Skills Assessment (EISA) by Steven J. Stein, Derek Mann,

More information

Extraction of Blood Vessels and Recognition of Bifurcation Points in Retinal Fundus Image

Extraction of Blood Vessels and Recognition of Bifurcation Points in Retinal Fundus Image International Journal of Research Studies in Science, Engineering and Technology Volume 1, Issue 5, August 2014, PP 1-7 ISSN 2349-4751 (Print) & ISSN 2349-476X (Online) Extraction of Blood Vessels and

More information

Assignment 2 - Static Visualization Critique

Assignment 2 - Static Visualization Critique Kalen Wong HCDE 511 - Fall 2014 Oct 7, 2014 Assignment 2 - Static Visualization Critique 1. List the source of the visualization (a bibliographic citation is preferred, including the date if you can determine

More information

Agilent GeneSpring/MPP Metadata Analysis Framework

Agilent GeneSpring/MPP Metadata Analysis Framework Agilent GeneSpring/MPP Metadata Analysis Framework Technical Overview Authors Srikanthi R., Pritha Aggarwal, Durairaj R., Maria Kammerer, and Pramila Tata Strand Life Sciences Bangalore, India Michael

More information

Module 3: Pathway and Drug Development

Module 3: Pathway and Drug Development Module 3: Pathway and Drug Development Table of Contents 1.1 Getting Started... 6 1.2 Identifying a Dasatinib sensitive cancer signature... 7 1.2.1 Identifying and validating a Dasatinib Signature... 7

More information

diasend mobile app for patients

diasend mobile app for patients diasend mobile app for patients Contents About the diasend mobile app... 3 Download the app... 4 Log in or create new account... 4 After logging in... 5 Scorecard... 6 About your Scorecard... 7 Reports...

More information

Identifying Novel Targets for Non-Small Cell Lung Cancer Just How Novel Are They?

Identifying Novel Targets for Non-Small Cell Lung Cancer Just How Novel Are They? Identifying Novel Targets for Non-Small Cell Lung Cancer Just How Novel Are They? Dubovenko Alexey Discovery Product Manager Sonia Novikova Solution Scientist September 2018 2 Non-Small Cell Lung Cancer

More information

Hands-On Ten The BRCA1 Gene and Protein

Hands-On Ten The BRCA1 Gene and Protein Hands-On Ten The BRCA1 Gene and Protein Objective: To review transcription, translation, reading frames, mutations, and reading files from GenBank, and to review some of the bioinformatics tools, such

More information

Defect Removal Metrics. SE 350 Software Process & Product Quality 1

Defect Removal Metrics. SE 350 Software Process & Product Quality 1 Defect Removal Metrics 1 Objectives Understand some basic defect metrics and the concepts behind them Defect density metrics Defect detection and removal effectiveness etc. Look at the uses and limitations

More information

Analyzing Spammers Social Networks for Fun and Profit

Analyzing Spammers Social Networks for Fun and Profit Chao Yang Robert Harkreader Jialong Zhang Seungwon Shin Guofei Gu Texas A&M University Analyzing Spammers Social Networks for Fun and Profit A Case Study of Cyber Criminal Ecosystem on Twitter Presentation:

More information

Nature Biotechnology: doi: /nbt Supplementary Figure 1. Experimental design and workflow utilized to generate the WMG Protein Atlas.

Nature Biotechnology: doi: /nbt Supplementary Figure 1. Experimental design and workflow utilized to generate the WMG Protein Atlas. Supplementary Figure 1 Experimental design and workflow utilized to generate the WMG Protein Atlas. (a) Illustration of the plant organs and nodule infection time points analyzed. (b) Proteomic workflow

More information

MYGLOOKO USER GUIDE. June 2017 IM GL+ A0003 REV J

MYGLOOKO USER GUIDE. June 2017 IM GL+ A0003 REV J MYGLOOKO USER GUIDE June 2017 IM GL+ A0003 REV J TABLE OF CONTENTS TABLE OF CONTENTS GENERAL INFORMATION...1 Product Description...1 Intended Use...1 Supported Software...1 Warnings...2 Contraindications...2

More information

Network Analysis of Toxic Chemicals and Symptoms: Implications for Designing First-Responder Systems

Network Analysis of Toxic Chemicals and Symptoms: Implications for Designing First-Responder Systems Network Analysis of Toxic Chemicals and Symptoms: Implications for Designing First-Responder Systems Suresh K. Bhavnani 1 PhD, Annie Abraham 1, Christopher Demeniuk 1, Messeret Gebrekristos 1 Abe Gong

More information

BRCA1 & BRCA2: CANCER RISK & GENETIC TESTING IAP ID 2013 NAIR HOSPITAL, MUMBAI

BRCA1 & BRCA2: CANCER RISK & GENETIC TESTING IAP ID 2013 NAIR HOSPITAL, MUMBAI BRCA1 & BRCA2: CANCER RISK & GENETIC TESTING IAP ID 2013 NAIR HOSPITAL, MUMBAI DR KIRTI CHADHA MD (Path), PDCC (Oncopath & Oncohemat) CONSULTANT SURGICAL PATHOLOGIST NATIONAL COORDINATOR,SURGICAL PATHOLOGY

More information

Coat Color in Labrador Retrievers

Coat Color in Labrador Retrievers Coat Color in Labrador Retrievers Sunny Cocoa Midnight Metabolic Networks Introduction On your note sheet, write a definition for what you think metabolism means. HINT: my mom used to tell me when I was

More information

SubLasso:a feature selection and classification R package with a. fixed feature subset

SubLasso:a feature selection and classification R package with a. fixed feature subset SubLasso:a feature selection and classification R package with a fixed feature subset Youxi Luo,3,*, Qinghan Meng,2,*, Ruiquan Ge,2, Guoqin Mai, Jikui Liu, Fengfeng Zhou,#. Shenzhen Institutes of Advanced

More information

Modeling Sentiment with Ridge Regression

Modeling Sentiment with Ridge Regression Modeling Sentiment with Ridge Regression Luke Segars 2/20/2012 The goal of this project was to generate a linear sentiment model for classifying Amazon book reviews according to their star rank. More generally,

More information

Segmentation of Tumor Region from Brain Mri Images Using Fuzzy C-Means Clustering And Seeded Region Growing

Segmentation of Tumor Region from Brain Mri Images Using Fuzzy C-Means Clustering And Seeded Region Growing IOSR Journal of Computer Engineering (IOSR-JCE) e-issn: 2278-0661,p-ISSN: 2278-8727, Volume 18, Issue 5, Ver. I (Sept - Oct. 2016), PP 20-24 www.iosrjournals.org Segmentation of Tumor Region from Brain

More information

Eating and Sleeping Habits of Different Countries

Eating and Sleeping Habits of Different Countries 9.2 Analyzing Scatter Plots Now that we know how to draw scatter plots, we need to know how to interpret them. A scatter plot graph can give us lots of important information about how data sets are related

More information

Conserved properties of dentate gyrus neurogenesis across postnatal development revealed by single-cell RNA sequencing

Conserved properties of dentate gyrus neurogenesis across postnatal development revealed by single-cell RNA sequencing SUPPLEMENTARY INFORMATION Resource https://doi.org/10.1038/s41593-017-0056-2 In the format provided by the authors and unedited. Conserved properties of dentate gyrus neurogenesis across postnatal development

More information

Appendix B Statistical Methods

Appendix B Statistical Methods Appendix B Statistical Methods Figure B. Graphing data. (a) The raw data are tallied into a frequency distribution. (b) The same data are portrayed in a bar graph called a histogram. (c) A frequency polygon

More information

VISION. Software for the colour blind. INFORMATION PACK Version 3 08 January Product of Vision Technology Ltd. All rights reserved.

VISION. Software for the colour blind. INFORMATION PACK Version 3 08 January Product of Vision Technology Ltd. All rights reserved. VISION Software for the colour blind INFORMATION PACK Version 3 08 January 2018 INFORMATION PACK Version 3, 08 January 2018 CONTENTS 1. What does Vision do?...1 2. What is colour blindness and which are

More information

IAT 355 Visual Analytics. Encoding Information: Design. Lyn Bartram

IAT 355 Visual Analytics. Encoding Information: Design. Lyn Bartram IAT 355 Visual Analytics Encoding Information: Design Lyn Bartram 4 stages of visualization design 2 Recall: Data Abstraction Tables Data item (row) with attributes (columns) : row=key, cells = values

More information

TWO HANDED SIGN LANGUAGE RECOGNITION SYSTEM USING IMAGE PROCESSING

TWO HANDED SIGN LANGUAGE RECOGNITION SYSTEM USING IMAGE PROCESSING 134 TWO HANDED SIGN LANGUAGE RECOGNITION SYSTEM USING IMAGE PROCESSING H.F.S.M.Fonseka 1, J.T.Jonathan 2, P.Sabeshan 3 and M.B.Dissanayaka 4 1 Department of Electrical And Electronic Engineering, Faculty

More information

Medical Statistics 1. Basic Concepts Farhad Pishgar. Defining the data. Alive after 6 months?

Medical Statistics 1. Basic Concepts Farhad Pishgar. Defining the data. Alive after 6 months? Medical Statistics 1 Basic Concepts Farhad Pishgar Defining the data Population and samples Except when a full census is taken, we collect data on a sample from a much larger group called the population.

More information

Quick-Start Guide TeamUnify, LLC

Quick-Start Guide TeamUnify, LLC Quick-Start Guide Setup Basics System Settings 1 When you initially sign up for MainSet, you need to set up Roster Group colors and designate coaches. first, Click settings 1. Navigate to http://mainset.com

More information

Chapter 7: Non-parametric models

Chapter 7: Non-parametric models Chapter 7: Non-parametric models Labcoat Leni s Real Research Having a Quail of a Time? Problem Matthews, R. C. et al. (2007). Psychological Science, 18(9), 758 762. We encountered some research in Chapter

More information

ANXIETY. A brief guide to the PROMIS Anxiety instruments:

ANXIETY. A brief guide to the PROMIS Anxiety instruments: ANXIETY A brief guide to the PROMIS Anxiety instruments: ADULT ADULT CANCER PEDIATRIC PARENT PROXY PROMIS Bank v1.0 Anxiety PROMIS Short Form v1.0 Anxiety 4a PROMIS Short Form v1.0 Anxiety 6a PROMIS Short

More information

Lab 5: Testing Hypotheses about Patterns of Inheritance

Lab 5: Testing Hypotheses about Patterns of Inheritance Lab 5: Testing Hypotheses about Patterns of Inheritance How do we talk about genetic information? Each cell in living organisms contains DNA. DNA is made of nucleotide subunits arranged in very long strands.

More information

PROTOCOL GUIDE FOR NEUROFEEDBACK CLINICIANS BY SUSAN F. OTHMER DOWNLOAD EBOOK : PROTOCOL GUIDE FOR NEUROFEEDBACK CLINICIANS BY SUSAN F.

PROTOCOL GUIDE FOR NEUROFEEDBACK CLINICIANS BY SUSAN F. OTHMER DOWNLOAD EBOOK : PROTOCOL GUIDE FOR NEUROFEEDBACK CLINICIANS BY SUSAN F. Read Online and Download Ebook PROTOCOL GUIDE FOR NEUROFEEDBACK CLINICIANS BY SUSAN F. OTHMER DOWNLOAD EBOOK : PROTOCOL GUIDE FOR NEUROFEEDBACK CLINICIANS BY Click link bellow and free register to download

More information

Using Perceptual Grouping for Object Group Selection

Using Perceptual Grouping for Object Group Selection Using Perceptual Grouping for Object Group Selection Hoda Dehmeshki Department of Computer Science and Engineering, York University, 4700 Keele Street Toronto, Ontario, M3J 1P3 Canada hoda@cs.yorku.ca

More information

Outline. What s inside this paper? My expectation. Software Defect Prediction. Traditional Method. What s inside this paper?

Outline. What s inside this paper? My expectation. Software Defect Prediction. Traditional Method. What s inside this paper? Outline A Critique of Software Defect Prediction Models Norman E. Fenton Dongfeng Zhu What s inside this paper? What kind of new technique was developed in this paper? Research area of this technique?

More information

The Effect of Code Coverage on Fault Detection under Different Testing Profiles

The Effect of Code Coverage on Fault Detection under Different Testing Profiles The Effect of Code Coverage on Fault Detection under Different Testing Profiles ABSTRACT Xia Cai Dept. of Computer Science and Engineering The Chinese University of Hong Kong xcai@cse.cuhk.edu.hk Software

More information

LAB ASSIGNMENT 4 INFERENCES FOR NUMERICAL DATA. Comparison of Cancer Survival*

LAB ASSIGNMENT 4 INFERENCES FOR NUMERICAL DATA. Comparison of Cancer Survival* LAB ASSIGNMENT 4 1 INFERENCES FOR NUMERICAL DATA In this lab assignment, you will analyze the data from a study to compare survival times of patients of both genders with different primary cancers. First,

More information

Characterizing health informatics journals by subject-level dependencies: a citation network analysis

Characterizing health informatics journals by subject-level dependencies: a citation network analysis Characterizing health informatics journals by subject-level dependencies: a citation network analysis Arezo Bodaghi School of Industrial and Systems Engineering, Tarbiat Modares University, Iran arezo.bodaghi@modares.ac.ir

More information

Simple Linear Regression the model, estimation and testing

Simple Linear Regression the model, estimation and testing Simple Linear Regression the model, estimation and testing Lecture No. 05 Example 1 A production manager has compared the dexterity test scores of five assembly-line employees with their hourly productivity.

More information

A Learning Method of Directly Optimizing Classifier Performance at Local Operating Range

A Learning Method of Directly Optimizing Classifier Performance at Local Operating Range A Learning Method of Directly Optimizing Classifier Performance at Local Operating Range Lae-Jeong Park and Jung-Ho Moon Department of Electrical Engineering, Kangnung National University Kangnung, Gangwon-Do,

More information

COMPUTER AIDED DIAGNOSTIC SYSTEM FOR BRAIN TUMOR DETECTION USING K-MEANS CLUSTERING

COMPUTER AIDED DIAGNOSTIC SYSTEM FOR BRAIN TUMOR DETECTION USING K-MEANS CLUSTERING COMPUTER AIDED DIAGNOSTIC SYSTEM FOR BRAIN TUMOR DETECTION USING K-MEANS CLUSTERING Urmila Ravindra Patil Tatyasaheb Kore Institute of Engineering and Technology, Warananagar Prof. R. T. Patil Tatyasaheb

More information

Framework for Comparative Research on Relational Information Displays

Framework for Comparative Research on Relational Information Displays Framework for Comparative Research on Relational Information Displays Sung Park and Richard Catrambone 2 School of Psychology & Graphics, Visualization, and Usability Center (GVU) Georgia Institute of

More information

Tutorial: RNA-Seq Analysis Part II: Non-Specific Matches and Expression Measures

Tutorial: RNA-Seq Analysis Part II: Non-Specific Matches and Expression Measures : RNA-Seq Analysis Part II: Non-Specific Matches and Expression Measures March 15, 2013 CLC bio Finlandsgade 10-12 8200 Aarhus N Denmark Telephone: +45 70 22 55 09 Fax: +45 70 22 55 19 www.clcbio.com support@clcbio.com

More information

Lecture #4: Overabundance Analysis and Class Discovery

Lecture #4: Overabundance Analysis and Class Discovery 236632 Topics in Microarray Data nalysis Winter 2004-5 November 15, 2004 Lecture #4: Overabundance nalysis and Class Discovery Lecturer: Doron Lipson Scribes: Itai Sharon & Tomer Shiran 1 Differentially

More information

LifelogExplorer: A Tool for Visual Exploration of Ambulatory Skin Conductance Measurements in Context

LifelogExplorer: A Tool for Visual Exploration of Ambulatory Skin Conductance Measurements in Context LifelogExplorer: A Tool for Visual Exploration of Ambulatory Skin Conductance Measurements in Context R.D. Kocielnik 1 1 Department of Mathematics & Computer Science, Eindhoven University of Technology,

More information

Impact of Group Rejections on Self Esteem

Impact of Group Rejections on Self Esteem Impact of Group Rejections on Self Esteem In 2005, researchers from the School of Physical Education at the University of Otago carried out a study to assess the impact of rejection on physical self esteem.

More information

Display of Glucose Distributions by Date, Time of Day, and Day of Week: New and Improved Methods

Display of Glucose Distributions by Date, Time of Day, and Day of Week: New and Improved Methods Journal of Diabetes Science and Technology Volume 3, Issue 6, November 2009 Diabetes Technology Society ORIGINAL ARTICLES Display of Glucose Distributions by Date, Time of Day, and Day of Week: New and

More information

Comments on Significance of candidate cancer genes as assessed by the CaMP score by Parmigiani et al.

Comments on Significance of candidate cancer genes as assessed by the CaMP score by Parmigiani et al. Comments on Significance of candidate cancer genes as assessed by the CaMP score by Parmigiani et al. Holger Höfling Gad Getz Robert Tibshirani June 26, 2007 1 Introduction Identifying genes that are involved

More information

Figure S2. Distribution of acgh probes on all ten chromosomes of the RIL M0022

Figure S2. Distribution of acgh probes on all ten chromosomes of the RIL M0022 96 APPENDIX B. Supporting Information for chapter 4 "changes in genome content generated via segregation of non-allelic homologs" Figure S1. Potential de novo CNV probes and sizes of apparently de novo

More information

Visualizing Sports Injuries in Athletes

Visualizing Sports Injuries in Athletes Visualizing Sports Injuries in Athletes Arlene Swan College of Information Science and Technology, Drexel University The ischool, Distance Learning Program INFO633: Information Visualization Professor

More information

Predicting the Effect of Diabetes on Kidney using Classification in Tanagra

Predicting the Effect of Diabetes on Kidney using Classification in Tanagra Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 3, Issue. 4, April 2014,

More information

INTRODUCTION TO MACHINE LEARNING. Decision tree learning

INTRODUCTION TO MACHINE LEARNING. Decision tree learning INTRODUCTION TO MACHINE LEARNING Decision tree learning Task of classification Automatically assign class to observations with features Observation: vector of features, with a class Automatically assign

More information

You can use this app to build a causal Bayesian network and experiment with inferences. We hope you ll find it interesting and helpful.

You can use this app to build a causal Bayesian network and experiment with inferences. We hope you ll find it interesting and helpful. icausalbayes USER MANUAL INTRODUCTION You can use this app to build a causal Bayesian network and experiment with inferences. We hope you ll find it interesting and helpful. We expect most of our users

More information

Cancer Informatics Lecture

Cancer Informatics Lecture Cancer Informatics Lecture Mayo-UIUC Computational Genomics Course June 22, 2018 Krishna Rani Kalari Ph.D. Associate Professor 2017 MFMER 3702274-1 Outline The Cancer Genome Atlas (TCGA) Genomic Data Commons

More information

How Lertap and Iteman Flag Items

How Lertap and Iteman Flag Items How Lertap and Iteman Flag Items Last updated on 24 June 2012 Larry Nelson, Curtin University This little paper has to do with two widely-used programs for classical item analysis, Iteman and Lertap. I

More information

Data processing software for TGI/TGE series

Data processing software for TGI/TGE series 1/19 1. Overview Used with TGI or TGE series tensile and compression testing machines, the software enables efficient static strength testing in single tests, cyclical tests, or controlled (customized)

More information

OUTLIER SUBJECTS PROTOCOL (art_groupoutlier)

OUTLIER SUBJECTS PROTOCOL (art_groupoutlier) OUTLIER SUBJECTS PROTOCOL (art_groupoutlier) Paul K. Mazaika 2/23/2009 Outlier subjects are a problem in fmri data sets for clinical populations. This protocol and program are a method to identify outlier

More information

A walk in the park with Probabilites and Stats. it::unimi::sps::webcomm

A walk in the park with Probabilites and Stats. it::unimi::sps::webcomm A walk in the park with Probabilites and Stats it::unimi::sps::webcomm S Data presentation: Spreadsheet S A spreadsheet is a collection of data orginized as row of cells: The cell "A1" S Each cell can

More information

CoINcIDE: A framework for discovery of patient subtypes across multiple datasets

CoINcIDE: A framework for discovery of patient subtypes across multiple datasets Planey and Gevaert Genome Medicine (2016) 8:27 DOI 10.1186/s13073-016-0281-4 METHOD CoINcIDE: A framework for discovery of patient subtypes across multiple datasets Catherine R. Planey and Olivier Gevaert

More information

Describe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo

Describe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo Please note the page numbers listed for the Lind book may vary by a page or two depending on which version of the textbook you have. Readings: Lind 1 11 (with emphasis on chapters 10, 11) Please note chapter

More information

Myers Psychology for AP* David G. Myers PowerPoint Presentation Slides by Kent Korek Germantown High School Worth Publishers, 2010

Myers Psychology for AP* David G. Myers PowerPoint Presentation Slides by Kent Korek Germantown High School Worth Publishers, 2010 Myers Psychology for AP* David G. Myers PowerPoint Presentation Slides by Kent Korek Germantown High School Worth Publishers, 2010 *AP is a trademark registered and/or owned by the College Board, which

More information

Table S1. Relative abundance of AGO1/4 proteins in different organs. Table S2. Summary of smrna datasets from various samples.

Table S1. Relative abundance of AGO1/4 proteins in different organs. Table S2. Summary of smrna datasets from various samples. Supplementary files Table S1. Relative abundance of AGO1/4 proteins in different organs. Table S2. Summary of smrna datasets from various samples. Table S3. Specificity of AGO1- and AGO4-preferred 24-nt

More information

A Comparison of Collaborative Filtering Methods for Medication Reconciliation

A Comparison of Collaborative Filtering Methods for Medication Reconciliation A Comparison of Collaborative Filtering Methods for Medication Reconciliation Huanian Zheng, Rema Padman, Daniel B. Neill The H. John Heinz III College, Carnegie Mellon University, Pittsburgh, PA, 15213,

More information

Expanded View Figures

Expanded View Figures Molecular Systems iology Tumor CNs reflect metabolic selection Nicholas Graham et al Expanded View Figures Human primary tumors CN CN characterization by unsupervised PC Human Signature Human Signature

More information

Discovering Meaningful Cut-points to Predict High HbA1c Variation

Discovering Meaningful Cut-points to Predict High HbA1c Variation Proceedings of the 7th INFORMS Workshop on Data Mining and Health Informatics (DM-HI 202) H. Yang, D. Zeng, O. E. Kundakcioglu, eds. Discovering Meaningful Cut-points to Predict High HbAc Variation Si-Chi

More information

AUTOMATIC MEASUREMENT ON CT IMAGES FOR PATELLA DISLOCATION DIAGNOSIS

AUTOMATIC MEASUREMENT ON CT IMAGES FOR PATELLA DISLOCATION DIAGNOSIS AUTOMATIC MEASUREMENT ON CT IMAGES FOR PATELLA DISLOCATION DIAGNOSIS Qi Kong 1, Shaoshan Wang 2, Jiushan Yang 2,Ruiqi Zou 3, Yan Huang 1, Yilong Yin 1, Jingliang Peng 1 1 School of Computer Science and

More information