IMPROVING THE EFFICIENCY OF BIOMARKER IDENTIFICATION USING BIOLOGICAL KNOWLEDGE

Size: px
Start display at page:

Download "IMPROVING THE EFFICIENCY OF BIOMARKER IDENTIFICATION USING BIOLOGICAL KNOWLEDGE"

Transcription

1 IMPROVING THE EFFICIENCY OF BIOMARKER IDENTIFICATION USING BIOLOGICAL KNOWLEDGE JOHN H. PHAN The Wallace H. Coulter Department of Bomedcal Engneerng, Georga Insttute of Technology, 313 Ferst Drve Atlanta, GA 30332, USA QIQIN YIN-GOEN ANDREW N. YOUNG Department of Pathology and Laboratory Medcne, Emory Unversty Atlanta, GA 30322, USA MAY D. WANG The Wallace H. Coulter Department of Bomedcal Engneerng, Georga Insttute of Technology, 313 Ferst Drve Atlanta, GA 30332, USA Identfyng and valdatng bomarkers from hgh-throughput gene expresson data s mportant for understandng and treatng cancer. Typcally, we dentfy canddate bomarkers as features that are dfferentally expressed between two or more classes of samples. Many feature selecton metrcs rely on rankng by some measure of dfferental expresson. However, nterpretng these results s dffcult due to the large varety of exstng algorthms and metrcs, each of whch may produce dfferent results. Consequently, a feature rankng metrc may work well on some datasets but perform consderably worse on others. We propose a method to choose an optmal feature rankng metrc on an ndvdual dataset bass. A metrc s optmal f, for a partcular dataset, t favorably ranks features that are known to be relevant bomarkers. Extensve knowledge of bomarker canddates s avalable n publc databases and lterature. Usng ths knowledge, we can choose a rankng metrc that produces the most bologcally meanngful results. In ths paper, we frst descrbe a framework for assessng the ablty of a rankng metrc to detect known relevant bomarkers. We then apply ths method to clncal renal cancer mcroarray data to choose an optmal metrc and dentfy several canddate bomarkers. 1. Introducton The subjectve nature of tradtonal medcal technques lmts the accuracy of cancer subtype classfcaton and, subsequently, the effectveness of therapy. Clncans vsually examne cancer specmens to determne ther subtypes before proposng treatment regmens. However, cancers wth smlar characterstcs may behave very dfferently despte smlar treatment condtons [1]. Because cancer s the result of genetc anomales, emergng dagnostc research has

2 prmarly focused on genetc and proteomc expresson. Ths research generally nvolves the use of hgh throughput technology (e.g. mcroarrays and mass spectrometry) to generate large amounts of genetc and proteomc expresson data. We typcally reduce ths data usng one of many analyss algorthms wth the goal of dentfyng a subset of features (correspondng to genes or protens) wth hgh predctve accuracy [2-4]. We hope that these feature subsets wll both enhance our understandng of the bologcal mechansms as well as provde us wth an accurate dagnostc system. When valdated, we call these dfferentally expressed features bomarkers. Unfortunately, even the selecton of a rankng metrc s subjectve, as dfferent metrcs may dentfy dfferent subsets of features [5]. Feature rankng affects both the effcency of dentfyng relevant genes and the accuracy of subsequent predctve models. We address ths ssue by presentng a method that uses exstng bologcal knowledge to dentfy the best feature rankng metrc for a partcular gene expresson dataset. The optmal metrc maxmzes the probablty of correctly rankng dfferentally expressed and prevously valdated genes. Despte numerous feature selecton studes, there s stll a lack of clncally valdated and proven bomarkers for most cancers. Thus, the use of correct genes as knowledge for algorthm selecton s subjectve and we should choose these genes carefully. Sources of bologcal knowledge are abundant, but vary n terms of relablty. We consder a knowledge source to be relable f genes (or the correspondng expressed protens) from that source have been clncally valdated as dfferentally expressed. The majorty of knowledge s contaned n the lterature and roughly falls nto four levels of relablty, adapted from a revew of post-analyss valdaton methods by Chuaqu et al. [6]: 1. No bologcal valdaton. As the lowest level of relablty, ths ncludes studes that develop feature selecton algorthms and present the selected lst of genes wthout a strngent nterpretaton of the bologcal results. 2. In slco valdaton. Also known as computatonal valdaton, these studes compare ther feature selecton results to the results of other studes. They may also dentfy Gene Ontology (GO) categores that are statstcally overrepresented as a result of feature selecton. 3. Same-sample valdaton. These studes valdate ther mcroarray experments by performng addtonal assays on the same samples from whch ther mcroarrays were derved. These assays typcally nclude quanttatve real-tme PCR (qrt-pcr) or northern analyss and serve to valdate the techncal relablty of the mcroarrays. 4. Independent or clncal valdaton. As the hghest level of relablty, these studes valdate the results of ther mcroarray experments usng ndependent bologcal samples, usually from a clncal source. Independent

3 valdaton ensures that the selected features are not a result of over-fttng. These valdatons often take the form of qrt-pcr and n stu hybrdzaton (ISH) for RNA products, or mmunohstochemstry (IHC) and western analyss for proten products. Despte frequent dsagreement between qrt-pcr and mcroarray results, qrt- PCR s the most common method for valdaton of dfferentally expressed genes. Genes wth large fold-change n mcroarray data are consstently correlated wth qrt-pcr whle those wth smaller fold change are more susceptble to techncal varablty [7]. The detecton of dfferentally expressed genes s generally reproducble across several mcroarray platforms [8]. However, n lght of a recent study llustratng the pervasveness of techncal artfacts n mcroarray data [9], we only consder a knowledge source relable f t falls nto category three or four. Investgators have attempted to mprove feature selecton by usng bologcal knowledge. Ther knowledge sources often fall nto category two of relablty, n slco valdaton, and nclude Gene Ontology and pathway databases, publshed lterature, mcroarray repostores, and sequence nformaton. Generally, these studes dentfy genes that cluster or correlate wth genes from the knowledge sources [10-12]. Another study developed a theoretcal framework to compare feature rankng metrcs n the presence of control features [13]. However, ths study also neglected to focus on the relablty of the control features. Indeed, the wealth of avalable nformaton n the form of gene and proten nteractons, functonal annotaton, and genetc and pathways can mprove the results of data analyss [14]. Furthermore, mcroarray data analyss has shfted from purely data drven methods to methods that use addtonal knowledge, even n the feature selecton process [14]. We develop a method to quantfy the effcency of detectng bomarkers by feature rankng. Ths method maxmzes the bologcal relevance of feature rankng by choosng the best metrc from a populaton of metrcs. The chosen rankng metrc s optmal wth respect to knowledge obtaned from relable sources. We test the effectveness of our method usng clncal gene expresson data. Results ndcate that the choce of rankng metrc sgnfcantly affects feature rankng, whch, n turn, affects the effcency of dscoverng and valdatng novel bomarkers.

4 2. Methods 2.1. Modelng Knowledge n Feature Selecton Throughout ths paper, the term feature set denotes a group of one or more features or genes that act n concert. A sample refers to measurements of a feature set from a sngle mcroarray or molecular profle. The entre mcroarray sample contans l features whle a feature set may contan p features (where p << l ). We r represent samples for feature set as jontly dstrbuted p random vectors, X R, and labels, Y {0,1 }. The class label, Y, ndcates the clncal source of the mcroarray sample. In most cancer problems, Y = 1 ndcates, for example, samples measured from patents wth cancer and Y = 0 ndcates samples from patents wth no cancer. For a mcroarray dataset wth N samples, feature set for a partcular dataset s the vector d r r r r = (( y1, x1 ),( y2, x2), K,( yn, xn )) r from the random varable D, whch represents all feature sets n a dataset. Each feature set s assocated wth a relevance varable, r, from the random varable R {0,1 }. r represents the bologcal relevance of the feature set and the relablty of the knowledge source. D r and R are jontly dstrbuted. For each feature set, we assgn a score that represents the predctve ablty of that feature set: r A = h( D, θ ) (1) where A R s a random varable and θ s a meta-parameter that characterzes the scorng functon, or rankng metrc. Although θ may represent the space of all rankng methods, we use a reduced set of wrapperbased methods n our smulatons. Specfcally, we use a support vector machne (SVM) classfer wth the lnear and radal bass kernels and estmate the classfcaton accuracy of bomarkers usng the bootstrap [5, 15]. The SVM classfer depends on a cost parameter, C, whch determnes the penalty of msclassfcaton. The radal bass kernel depends on γ, whch s proportonal to the complexty of the classfer. For the radal bass kernel, the par of parameters, ( C, γ ), represents θ. We dscretely vary C and γ over the log scale range of 0.1 to 10 3 and 0.01 to 10 5, respectvely. For the lnear kernel, only the sngle parameter,c, represents θ. We vary ths parameter over the log scale range of 0.01 to 10 2.

5 In practce, a gene expresson dataset wll have N samples, each wth l features. We separately examne m (m can be dfferent from l and nclude, for example, all pars, trplets, or a subset of feature combnatons) feature sets, r r r correspondng to { d1, d2, K, dm} and { r1, r2, K, rm }. From the mappng defned n eq. 1, we compute the set of values { α1, α2, K, αm} where each α s an observaton from A. Usng a smple selecton method, we can then conclude that the best feature sets and potental bomarkers are n the set G = { : α < τ} (2) where τ s a threshold. We want to choose a θ that produces the most bologcally relevant r r r rankng of the m feature sets, { d1, d2, K, dm}, wth respect to a gven set of knowledge. Assumng that lower scores are better, the best θ assgns scores such that α < α j for r = 1 and r j = 0,.e., feature set s known to be more relevant than feature set j for ths partcular dataset. Although we may never know the relevance of all features n a dataset, we may nfer from lterature that the k feature sets, Gk = { g1, g2, K, gk}, are relevant, where k << m. Ths mples that the elements of the set { α : Gk} should generally be smaller than those of { α j : j Gk}. If the knowledge s relable, we want to choose a θ that maxmzes the probablty that the score of a feature set from G k s less than that of a feature set that s not fromg k. Explctly, ths probablty s P ( α < α j θ ) (3) for Gk and j Gk. The estmated optmal rankng method s ˆ = arg max P ( α < α θ ), (4) θ θ j keepng n mnd that θˆ s only optmal, or maxmzes the probablty, wth respect to the gven knowledge set. For m feature sets, k of whch are n our knowledge set, G k, we can emprcally approxmate the probablty of eq. 3 wth P 1 ( < α j θ ) = I( α < α j k( m k) ) α (5) G k j G

6 where I (x) evaluates to one when x s true and zero when x s false. Eq. 5 s equvalent to computng the area under an ROC curve (AUC) for classfyng feature sets as ether relevant or rrelevant [13] Iteratvely Updatng Knowledge It may be dffcult to comple a comprehensve lst of knowledge from lterature and ndependent valdaton. Consequently, we can expect that some feature sets that are not n our knowledge set, j Gk, are, n fact, relevant bomarkers. If V s the set of all relevant bomarkers, regardless of whether ther relevance s known, we defne the knowledge update functon, S, as θˆ Gk + 1 = S ˆ ( Gk ) = {{ Gk,arg mn α }: V, Gk }. (6) θ Ths functon adds to G k a relevant bomarker wth the best rank accordng to the estmated optmal metrc,θˆ. Of course, a feature set s known to be n the set V only after performng a valdaton procedure such as qrt-pcr. If we know all feature sets n V, we can quantfy any mprovement n effcency due to optmzaton of the rankng metrc. Usng bootstrap resamplng, we randomly and repeatedly partton the feature sets n V nto a group of known relevant feature sets (tranng) and a group of unknown relevant feature sets (testng). If there are K elements n V, we randomly select * * K elements wth replacement, resultng n K ( K < K) unque elements for * the testng set. We use the group of K K known relevant feature sets to optmze the rankng metrc, then teratvely detect feature sets from the * unknown set of K features and update our knowledge usng eq. 6. Every valdaton test requres a fnte amount of tme and resources. Plottng the fracton of correctly valdated bomarkers (y-axs) vs. total valdaton tme (xaxs), reveals that hgher detecton effcency corresponds to a larger area under ths curve. Ths curve s smlar to a ROC curve, so we also call the area under ths curve the AUC. We repeat ths bootstrap samplng of feature sets 100 tmes n order to compute the sgnfcance of the dfferences among three condtons: optmal metrc selecton, sub-optmal metrc selecton, and sub-optmal ntal knowledge. For the sub-optmal metrc selecton condton, we use correct ntal knowledge selected from V va bootstrap, but use a modfed equaton to choose θˆ wth medan AUC: ˆ = arg medan P( α < α θ ). (7) θ θ j

7 Selecton of a rankng metrc wth medan AUC represents the common practce of arbtrarly selectng a metrc wth no regard for bologcal relevance and effcency. Ths medan AUC algorthm also serves as a reference pont for assessng the potental mprovement of effcency when usng the optmal algorthm. For the sub-optmal ntal knowledge condton, we begn the smulaton wth ncorrect knowledge selected va bootstrap and use eq. 4 to optmze the rankng algorthm before updatng the current knowledge set. We expect the average AUC of the optmal selecton condton to be hgher than that of both of the sub-optmal condtons. Fgure 1 llustrates ths process. To determne whether the optmzaton procedure s over-fttng to the knowledge set, we conduct addtonal tests usng randomly selected knowledge sets. If over-fttng s occurrng, results of the optmal, suboptmal, and suboptmal knowledge tests for randomly selected knowledge should be smlar to those of the true knowledge set. Fgure 1. Quantfyng the effcency of detectng relevant feature sets. For clncal data, we defne V as the set of K known dfferentally expressed feature sets. Usng bootstrap cross valdaton, we partton V nto K * and K-K * samples. K * s the number of unque samples after samplng from V K tmes wth replacement. We optmze the rankng algorthm usng K-K * feature sets and assess the algorthm s effcency n detectng the remanng K * feature sets. For each of the three condtons optmal metrc selecton, sub-optmal metrc selecton, and sub-optmal ntal knowledge we perform ths bootstrap samplng 100 tmes n order to compute the sgnfcance of any dfferences between mean AUC values.

8 2.3. Mcroarray Data Analyss and qrt-pcr Valdaton We examne two clncal case studes usng renal tumor mcroarray datasets. The frst dataset, from a study by Schuetz et al., uses Affymetrx mcroarrays (HG-Focus, 8793 probesets) to profle samples from three subtypes of renal tumors: 13 clear cell (CC) renal cell carcnoma (RCC), 4 chromophobe (CHR) RCC, and 3 oncocytoma (ONC, bengn) [2]. The second dataset, from a study by Jones et al., uses a dfferent model of Affymetrx mcroarrays (HG-U133A, probesets reduced to 8793 that are common to HG-Focus) to examne smlar renal tumor subtypes wth 32 CC, 6 CHR, and 12 ONC samples [16]. We are nterested n bomarkers that dfferentate the CC class from the combned group of ONC and CHR. Usng lterature, we dentfy genes that have been valdated (va qrt-pcr or IHC) as dfferentally expressed between the CC and ONC/CHR subtypes. We then valdate an addtonal 94 genes usng qrt-pcr (usng RNA from 34 CC and 18 CHR tssue samples). These 94 genes were selected by a renal cancer pathologst based on hs knowledge and prevous research. Only some of the 94 genes assayed wth qrt-pcr are dfferentally expressed as assessed by a lnear SVM wth classfcaton error estmated usng bootstrap. Genes measured wth qrt-pcr are categorzed as dfferentally expressed f the estmated classfcaton error s less than 10%. Usng the set of knowledge from both lterature and qrt-pcr valdaton, we examne the effcency of detectng these bomarkers by optmzng the rankng metrc under varous condtons, as llustrated n fgure Results and Dscusson As descrbed n the methods, we dentfy fve genes from lterature that are dfferentally expressed between the CC and ONC/CHR renal tumor subtypes (table 1). Each of these genes had been valdated usng ether qrt-pcr or IHC. Addtonally, we valdate several other potental bomarkers usng qrt-pcr and select genes wth estmated classfcaton errors of less than 10% (table 2). Combnng all knowledge from both lterature and qrt-pcr valdaton, we examne the effect of optmzng the feature rankng metrc usng the method llustrated n fgure 1. Box plots of the 100 teratons for each of the three tests ndcate that optmal selecton outperforms sub-optmal selecton (fgure 2, left column). The comparson of optmal to suboptmal metrcs may seem to always favor the optmal metrc. However, the optmal metrc s not always a smple lnear classfer. In fact, durng the teratve gene detecton process, θ changes frequently as V s updated. Moreover, suboptmal selecton may represent the common practce of arbtrarly selectng rankng metrcs wth no regard to ther

9 potental dsadvantages for partcular datasets. The box plots represent the medan and quartles of the AUC values for each of the 100 teratons. Correspondngly, the ROC curves also ndcate that the optmal selecton method mproves the effcency of bomarker detecton (fgure 2, rght column). For the Schuetz data (fgure 2, top row), the performance dfference between the optmal and suboptmal rankng metrcs seems small accordng to the box plots. However, the ROC curve of the optmal metrc ntally rses much more quckly compared to that of the suboptmal. The regon of low specfcty boosts the performance of the suboptmal metrc. However, ths regon should be neglected when assessng performance snce the number of false postves at ths pont s very hgh. Valdaton procedures would lkely consder only the bomarkers detected n the hgh specfcty regon. Results are smlar for the Jones data (fgure 2, bottom row). The hgh varance of the suboptmal ntal knowledge condton ndcates that optmzaton of the rankng metrc s senstve to the ntal condtons. Some of the randomly selected ntal knowledge may, n fact, be dfferentally expressed, resultng n good performance. However, these random ntal knowledge sets are more lkely to be rrelevant. Thus, box plots for ths condton llustrate ths mxture of knowledge qualty. These results stress the mportance of the qualty of bomarker knowledge. The control tests usng random knowledge sets for V show that our method does not over-ft to the knowledge (fgure 2, box plots CO, CSO, and CSK). None of the algorthms consdered n our space of θ are able to favorably rank these randomly selected genes. AUCs of these control tests are close to 0.5 as expected for random classfcaton. Usng all knowledge from lterature and the frst round of qrt-pcr, we optmze the rankng metrc and select the top genes that have not been prevously valdated and that have estmated classfcaton errors of less than 5% (table 3). We can lnk a few of these genes drectly to prevous lterature pertanng to renal cancer. For example, CXCR4 has been lnked to kdney cancer. Usng qrt-pcr, Schrader et al. shows that ths gene s over-expressed n kdney cancer tssue compared to normal kdney tssue [17]. IGFBP3 and KLF10 has also been lnked to renal cell carcnoma [18, 19]. Valdaton of these genes usng qrt-pcr may yeld addtonal knowledge to teratvely refne the bomarker selecton process. However, snce we want to prmarly focus on the methodology here, we reserve the actual valdaton of these results for a future study.

10 Table 1. Genes valdated as dfferentally expressed between CC and ONC/CHR renal tumor subtypes from varous knowledge sources. Gene Symbol Knowledge Source Valdaton Method CA9 Chen et al., Cln Cancer Res, 2005 qrt-pcr CLCNKB Chen et al., Cln Cancer Res, 2005 qrt-pcr DEFB1 Schuetz et al., J Mol Dagn, 2004 qrt-pcr, IHC LRP2 Schuetz et al., J Mol Dagn, 2004 qrt-pcr, IHC PVALB Chen et al., Cln Cancer Res, 2005 qrt-pcr Table 2. Genes that we valdated wth qrt-pcr. These genes have estmated classfcaton errors of less than 10% as assessed by a lnear SVM classfer usng bootstrap estmaton. Gene Symbol Error Gene Symbol Error STC1 2.43E-05 COX5A SLC25A BAG CFTR LY6E PDHA CD PFKM AKAP NNMT ACAT CP SPTBN CFB GOT Fgure 2. Box plots of AUC areas over 100 teratons for each test (left). AUCs for the optmal test (O) are hgher than both the sub-optmal (SO) and sub-optmal knowledge (SK) tests (dfferences are statstcally sgnfcant wth p-values very close to 0). The control tests, usng randomly selected knowledge ndcate that optmzng the rankng metrc does not over-ft (CO=control optmal, CSO=control suboptmal, CSK=control suboptmal knowledge). Average ROC curves for each test, llustrate the dfferences n bomarker detecton effcency (rght). The ROC for the optmal metrc test (sold lne) ndcates more accurate bomarker detecton for both the Schuetz (top row) and Jones (bottom row) renal cancer datasets.

11 4. Concluson Table 3. Proposed lst of genes for further qrt-pcr valdaton. Gene Symbol Error Gene Symbol Error ACLY 0 PCCB CXCR TMSB C4A /// C4B HCLS FLNA ACTA PMP IGFBP PFKFB NFKBIA KLF CD PRG IER LGALS We have shown that bomarker dentfcaton by feature rankng benefts from knowledge ntegraton at key ponts. Usng ths knowledge whether from clncal observatons, laboratory experments, or exstng lterature we can ntellgently choose an optmal rankng metrc for a specfc gene expresson dataset. The use of an optmal metrc for rankng and dentfyng novel bomarkers reduces the number of false dscoveres, ncreases the number of true dscoveres, reduces the requred tme for valdaton, and ncreases the overall effcency of the process. The results of our smulatons ndcate that knowledge ntegraton mproves bomarker selecton for clncal mcroarray data. Although ths study assumes ndependent gene expresson, the method s general and we can use t to rank combnatoral gene expresson data as well. Furthermore, we test ths method usng only a lmted set of wrapper-based feature rankng metrcs. However, t s easly expandable to encompass a varety of metrcs, ncludng the commonly used flter methods such as t-tests and fold change. We hope that the proposed method wll mpact bomarker dentfcaton practces and mprove the effectveness of resultng clncal applcatons. Acknowledgments Ths research has been supported by grants from Natonal Insttutes of Health (R01CA108468, P20GM072069, U54CA119338), Mcrosoft Research Fundng, and Georga Cancer Coalton (Dstngushed Cancer Scholar Award to MDW). References 1. Golub, T., et al., Molecular Classfcaton of Cancer: Class Dscovery and Class Predcton by Gene Expresson Montorng. Scence, : p Schuetz, A., et al., Molecular classfcaton of renal tumors by gene expresson proflng. J Mol Dagn, 2004.

12 3. Sngh, D., et al., Gene expresson correlates of clncal prostate cancer behavor. Cancer Cell, : p van't Veer, L., et al., Gene expresson proflng predcts clncal outcome of breast cancer. Nature, : p Braga-Neto, U. and E. Dougherty, Is cross-valdaton vald for smallsample mcroarray classfcaton? Bonformatcs, : p Chuaqu, R., et al., Post-analyss follow-up and valdaton of mcroarray experments. Nature Genetcs, : p Morey, J., J. Ryna, and F. Van Dolah, Mcroarray valdaton: factors nfluencng correlaton between olgonucleotde mcroarrays and real-tme PCR. Bol. Proced. Onlne, (1): p Sh, L., et al., The McroArray Qualty Control (MAQC) project shows nter- and ntraplatform reproducblty of gene expresson measurements. Nat Botechnol, (9): p Stokes, T., et al., chp artfact CORRECTon (cacorrect): A Bonformatcs System for Qualty Assurance of Genomcs and Proteomcs Array Data. Annals of Bomedcal Engneerng, : p Aerts, S., et al., Gene prortzaton through genomc data fuson. Nature Botechnology, (5): p Kuffner, R., K. Fundel, and R. Zmmer, Expert knowledge wthout the expert: ntegrated analyss of gene expresson and lterature to derve actve functonal contexts. Bonformatcs, : p Kong, S., W. Pu, and P. Park, A multvarate approach for ntegratng genome-wde expresson data and bologcal knowledge. Bonformatcs, (19): p Mukherjee, S. and S. Roberts, A theoretcal analyss of the selecton of dfferentally expressed genes. J Bonformatcs Comput Bol, : p Bellazz, R. and B. Zupan, Towards knowledge-based gene expresson data mnng. Journal of Bomedcal Informatcs, : p Efron, B. and R. Tbshran, Improvements on Cross-Valdaton: The.632+ Bootstrap Method. Journal of the Amercan Statstcal Assocaton, (438): p Jones, J., et al., Gene sgnatures of progresson and metastass n renal cell cancer. Cln Cancer Res, (16): p Schrader, A., et al., CXCR4/CXCL12 expresson and sgnallng n kdney cancer. Brtsh Journal of Cancer, : p Rosendahl, A. and G. Forseberg, IGF-I and IGFBP-3 augment transformng growth factor-beta actons n human renal carcnoma cells. Kdney Internatonal, : p Ivanov, S., et al., Two novel VHL targets, TGFBI (BIGH3) and ts transactvator KLF10, are up-regulated n renal clear cell carcnoma and other tumors. Bochem Bophys Res Commun, 2008.

Reconstruction of gene regulatory network of colon cancer using information theoretic approach

Reconstruction of gene regulatory network of colon cancer using information theoretic approach Reconstructon of gene regulatory network of colon cancer usng nformaton theoretc approach Khald Raza #1, Rafat Parveen * # Department of Computer Scence Jama Mlla Islama (Central Unverst, New Delh-11005,

More information

Study and Comparison of Various Techniques of Image Edge Detection

Study and Comparison of Various Techniques of Image Edge Detection Gureet Sngh et al Int. Journal of Engneerng Research Applcatons RESEARCH ARTICLE OPEN ACCESS Study Comparson of Varous Technques of Image Edge Detecton Gureet Sngh*, Er. Harnder sngh** *(Department of

More information

INTEGRATIVE NETWORK ANALYSIS TO IDENTIFY ABERRANT PATHWAY NETWORKS IN OVARIAN CANCER

INTEGRATIVE NETWORK ANALYSIS TO IDENTIFY ABERRANT PATHWAY NETWORKS IN OVARIAN CANCER INTEGRATIVE NETWORK ANALYSIS TO IDENTIFY ABERRANT PATHWAY NETWORKS IN OVARIAN CANCER LI CHEN 1,2, JIANHUA XUAN 1,*, JINGHUA GU 1, YUE WANG 1, ZHEN ZHANG 2, TIAN LI WANG 2, IE MING SHIH 2 1The Bradley Department

More information

Statistically Weighted Voting Analysis of Microarrays for Molecular Pattern Selection and Discovery Cancer Genotypes

Statistically Weighted Voting Analysis of Microarrays for Molecular Pattern Selection and Discovery Cancer Genotypes IJCSNS Internatonal Journal of Computer Scence and Network Securty, VOL.6 No.2, December 26 73 Statstcally Weghted Votng Analyss of Mcroarrays for Molecular Pattern Selecton and Dscovery Cancer Genotypes

More information

Introduction ORIGINAL RESEARCH

Introduction ORIGINAL RESEARCH ORIGINAL RESEARCH Assessng the Statstcal Sgnfcance of the Acheved Classfcaton Error of Classfers Constructed usng Serum Peptde Profles, and a Prescrpton for Random Samplng Repeated Studes for Massve Hgh-Throughput

More information

AN ENHANCED GAGS BASED MTSVSL LEARNING TECHNIQUE FOR CANCER MOLECULAR PATTERN PREDICTION OF CANCER CLASSIFICATION

AN ENHANCED GAGS BASED MTSVSL LEARNING TECHNIQUE FOR CANCER MOLECULAR PATTERN PREDICTION OF CANCER CLASSIFICATION www.arpapress.com/volumes/vol8issue2/ijrras_8_2_02.pdf AN ENHANCED GAGS BASED MTSVSL LEARNING TECHNIQUE FOR CANCER MOLECULAR PATTERN PREDICTION OF CANCER CLASSIFICATION I. Jule 1 & E. Krubakaran 2 1 Department

More information

International Journal of Emerging Technologies in Computational and Applied Sciences (IJETCAS)

International Journal of Emerging Technologies in Computational and Applied Sciences (IJETCAS) Internatonal Assocaton of Scentfc Innovaton and Research (IASIR (An Assocaton Unfyng the Scences, Engneerng, and Appled Research Internatonal Journal of Emergng Technologes n Computatonal and Appled Scences

More information

Gene Selection Based on Mutual Information for the Classification of Multi-class Cancer

Gene Selection Based on Mutual Information for the Classification of Multi-class Cancer Gene Selecton Based on Mutual Informaton for the Classfcaton of Mult-class Cancer Sheng-Bo Guo,, Mchael R. Lyu 3, and Tat-Mng Lok 4 Department of Automaton, Unversty of Scence and Technology of Chna, Hefe,

More information

Joint Modelling Approaches in diabetes research. Francisco Gude Clinical Epidemiology Unit, Hospital Clínico Universitario de Santiago

Joint Modelling Approaches in diabetes research. Francisco Gude Clinical Epidemiology Unit, Hospital Clínico Universitario de Santiago Jont Modellng Approaches n dabetes research Clncal Epdemology Unt, Hosptal Clínco Unverstaro de Santago Outlne 1 Dabetes 2 Our research 3 Some applcatons Dabetes melltus Is a serous lfe-long health condton

More information

Copy Number Variation Methods and Data

Copy Number Variation Methods and Data Copy Number Varaton Methods and Data Copy number varaton (CNV) Reference Sequence ACCTGCAATGAT TAAGCCCGGG TTGCAACGTTAGGCA Populaton ACCTGCAATGAT TAAGCCCGGG TTGCAACGTTAGGCA ACCTGCAATGAT TTGCAACGTTAGGCA

More information

Optimal Planning of Charging Station for Phased Electric Vehicle *

Optimal Planning of Charging Station for Phased Electric Vehicle * Energy and Power Engneerng, 2013, 5, 1393-1397 do:10.4236/epe.2013.54b264 Publshed Onlne July 2013 (http://www.scrp.org/ournal/epe) Optmal Plannng of Chargng Staton for Phased Electrc Vehcle * Yang Gao,

More information

Feature Selection for Predicting Tumor Metastases in Microarray Experiments using Paired Design

Feature Selection for Predicting Tumor Metastases in Microarray Experiments using Paired Design Feature Selecton for Predctng Tumor Metastases n Mcroarray Experments usng Pared Desgn Qhua Tan 1,2, Mads Thomassen 1 and Torben A. Kruse 1 ORIGINAL RESEARCH 1 Department of Bochemstry, Pharmacology and

More information

Using Past Queries for Resource Selection in Distributed Information Retrieval

Using Past Queries for Resource Selection in Distributed Information Retrieval Purdue Unversty Purdue e-pubs Department of Computer Scence Techncal Reports Department of Computer Scence 2011 Usng Past Queres for Resource Selecton n Dstrbuted Informaton Retreval Sulleyman Cetntas

More information

This article appeared in a journal published by Elsevier. The attached copy is furnished to the author for internal non-commercial research and

This article appeared in a journal published by Elsevier. The attached copy is furnished to the author for internal non-commercial research and Ths artcle appeared n a journal publshed by Elsever. The attached copy s furnshed to the author for nternal non-commercal research and educaton use, ncludng for nstructon at the authors nsttuton and sharng

More information

AUTOMATED DETECTION OF HARD EXUDATES IN FUNDUS IMAGES USING IMPROVED OTSU THRESHOLDING AND SVM

AUTOMATED DETECTION OF HARD EXUDATES IN FUNDUS IMAGES USING IMPROVED OTSU THRESHOLDING AND SVM AUTOMATED DETECTION OF HARD EXUDATES IN FUNDUS IMAGES USING IMPROVED OTSU THRESHOLDING AND SVM Wewe Gao 1 and Jng Zuo 2 1 College of Mechancal Engneerng, Shangha Unversty of Engneerng Scence, Shangha,

More information

Prediction of Total Pressure Drop in Stenotic Coronary Arteries with Their Geometric Parameters

Prediction of Total Pressure Drop in Stenotic Coronary Arteries with Their Geometric Parameters Tenth Internatonal Conference on Computatonal Flud Dynamcs (ICCFD10), Barcelona, Span, July 9-13, 2018 ICCFD10-227 Predcton of Total Pressure Drop n Stenotc Coronary Arteres wth Ther Geometrc Parameters

More information

Parameter Estimates of a Random Regression Test Day Model for First Three Lactation Somatic Cell Scores

Parameter Estimates of a Random Regression Test Day Model for First Three Lactation Somatic Cell Scores Parameter Estmates of a Random Regresson Test Day Model for Frst Three actaton Somatc Cell Scores Z. u, F. Renhardt and R. Reents Unted Datasystems for Anmal Producton (VIT), Hedeweg 1, D-27280 Verden,

More information

310 Int'l Conf. Par. and Dist. Proc. Tech. and Appl. PDPTA'16

310 Int'l Conf. Par. and Dist. Proc. Tech. and Appl. PDPTA'16 310 Int'l Conf. Par. and Dst. Proc. Tech. and Appl. PDPTA'16 Akra Sasatan and Hrosh Ish Graduate School of Informaton and Telecommuncaton Engneerng, Toka Unversty, Mnato, Tokyo, Japan Abstract The end-to-end

More information

Lymphoma Cancer Classification Using Genetic Programming with SNR Features

Lymphoma Cancer Classification Using Genetic Programming with SNR Features Lymphoma Cancer Classfcaton Usng Genetc Programmng wth SNR Features Jn-Hyuk Hong and Sung-Bae Cho Dept. of Computer Scence, Yonse Unversty, 134 Shnchon-dong, Sudaemoon-ku, Seoul 120-749, Korea hjnh@candy.yonse.ac.kr,

More information

Insights in Genetics and Genomics

Insights in Genetics and Genomics Insghts n Genetcs and Genomcs Research Artcle Open Access New Score Tests for Equalty of Varances n the Applcaton of DNA Methylaton Data Analyss [Verson ] Welang Qu Xuan L Jarrett Morrow Dawn L DeMeo Scott

More information

The Limits of Individual Identification from Sample Allele Frequencies: Theory and Statistical Analysis

The Limits of Individual Identification from Sample Allele Frequencies: Theory and Statistical Analysis The Lmts of Indvdual Identfcaton from Sample Allele Frequences: Theory and Statstcal Analyss Peter M. Vsscher 1 *, Wllam G. Hll 2 1 Queensland Insttute of Medcal Research, Brsbane, Australa, 2 Insttute

More information

The Influence of the Isomerization Reactions on the Soybean Oil Hydrogenation Process

The Influence of the Isomerization Reactions on the Soybean Oil Hydrogenation Process Unversty of Belgrade From the SelectedWorks of Zeljko D Cupc 2000 The Influence of the Isomerzaton Reactons on the Soybean Ol Hydrogenaton Process Zeljko D Cupc, Insttute of Chemstry, Technology and Metallurgy

More information

Journal of Engineering Science and Technology Review 11 (2) (2018) Research Article

Journal of Engineering Science and Technology Review 11 (2) (2018) Research Article Jestr Journal of Engneerng Scence and Technology Revew () (08) 5 - Research Artcle Prognoss Evaluaton of Ovaran Granulosa Cell Tumor Based on Co-forest ntellgence Model Xn Lao Xn Zheng Juan Zou Mn Feng

More information

Using the Perpendicular Distance to the Nearest Fracture as a Proxy for Conventional Fracture Spacing Measures

Using the Perpendicular Distance to the Nearest Fracture as a Proxy for Conventional Fracture Spacing Measures Usng the Perpendcular Dstance to the Nearest Fracture as a Proxy for Conventonal Fracture Spacng Measures Erc B. Nven and Clayton V. Deutsch Dscrete fracture network smulaton ams to reproduce dstrbutons

More information

Incorporating prior biological knowledge for network-based differential gene expression analysis using differentially weighted graphical LASSO

Incorporating prior biological knowledge for network-based differential gene expression analysis using differentially weighted graphical LASSO Zuo et al. BMC Bonformatcs (2017) 18:99 DOI 10.1186/s12859-017-1515-1 METHODOLOGY ARTICLE Open Access Incorporatng pror bologcal knowledge for network-based dfferental gene expresson analyss usng dfferentally

More information

Modeling Multi Layer Feed-forward Neural. Network Model on the Influence of Hypertension. and Diabetes Mellitus on Family History of

Modeling Multi Layer Feed-forward Neural. Network Model on the Influence of Hypertension. and Diabetes Mellitus on Family History of Appled Mathematcal Scences, Vol. 7, 2013, no. 41, 2047-2053 HIKARI Ltd, www.m-hkar.com Modelng Mult Layer Feed-forward Neural Network Model on the Influence of Hypertenson and Dabetes Melltus on Famly

More information

A MIXTURE OF EXPERTS FOR CATARACT DIAGNOSIS IN HOSPITAL SCREENING DATA

A MIXTURE OF EXPERTS FOR CATARACT DIAGNOSIS IN HOSPITAL SCREENING DATA Journal of Theoretcal and Appled Informaton Technology 2005 ongong JATIT & LLS ISSN: 1992-8645 www.jatt.org E-ISSN: 1817-3195 A MIXTURE OF EXPERTS FOR CATARACT DIAGNOSIS IN HOSPITAL SCREENING DATA 1 SUNGMIN

More information

Project title: Mathematical Models of Fish Populations in Marine Reserves

Project title: Mathematical Models of Fish Populations in Marine Reserves Applcaton for Fundng (Malaspna Research Fund) Date: November 0, 2005 Project ttle: Mathematcal Models of Fsh Populatons n Marne Reserves Dr. Lev V. Idels Unversty College Professor Mathematcs Department

More information

THE NATURAL HISTORY AND THE EFFECT OF PIVMECILLINAM IN LOWER URINARY TRACT INFECTION.

THE NATURAL HISTORY AND THE EFFECT OF PIVMECILLINAM IN LOWER URINARY TRACT INFECTION. MET9401 SE 10May 2000 Page 13 of 154 2 SYNOPSS MET9401 SE THE NATURAL HSTORY AND THE EFFECT OF PVMECLLNAM N LOWER URNARY TRACT NFECTON. L A study of the natural hstory and the treatment effect wth pvmecllnam

More information

Modeling the Survival of Retrospective Clinical Data from Prostate Cancer Patients in Komfo Anokye Teaching Hospital, Ghana

Modeling the Survival of Retrospective Clinical Data from Prostate Cancer Patients in Komfo Anokye Teaching Hospital, Ghana Internatonal Journal of Appled Scence and Technology Vol. 5, No. 6; December 2015 Modelng the Survval of Retrospectve Clncal Data from Prostate Cancer Patents n Komfo Anokye Teachng Hosptal, Ghana Asedu-Addo,

More information

Physical Model for the Evolution of the Genetic Code

Physical Model for the Evolution of the Genetic Code Physcal Model for the Evoluton of the Genetc Code Tatsuro Yamashta Osamu Narkyo Department of Physcs, Kyushu Unversty, Fukuoka 8-856, Japan Abstract We propose a physcal model to descrbe the mechansms

More information

Biomarker Selection from Gene Expression Data for Tumour Categorization Using Bat Algorithm

Biomarker Selection from Gene Expression Data for Tumour Categorization Using Bat Algorithm Receved: March 20, 2017 401 Bomarker Selecton from Gene Expresson Data for Tumour Categorzaton Usng Bat Algorthm Gunavath Chellamuthu 1 *, Premalatha Kandasamy 2, Svasubramanan Kanagaraj 3 1 School of

More information

Survival Rate of Patients of Ovarian Cancer: Rough Set Approach

Survival Rate of Patients of Ovarian Cancer: Rough Set Approach Internatonal OEN ACCESS Journal Of Modern Engneerng esearch (IJME) Survval ate of atents of Ovaran Cancer: ough Set Approach Kamn Agrawal 1, ragat Jan 1 Department of Appled Mathematcs, IET, Indore, Inda

More information

A Support Vector Machine Classifier based on Recursive Feature Elimination for Microarray Data in Breast Cancer Characterization. Abstract.

A Support Vector Machine Classifier based on Recursive Feature Elimination for Microarray Data in Breast Cancer Characterization. Abstract. A Support Vector Machne Classfer based on Recursve Feature Elmnaton for Mcroarray Data n Breast Cancer Characterzaton. R.Campann, D. Dongovann, N. Lanconell, G. Palermo, A. Rccard, M. Roffll Dpartmento

More information

THIS IS AN OFFICIAL NH DHHS HEALTH ALERT

THIS IS AN OFFICIAL NH DHHS HEALTH ALERT THIS IS AN OFFICIAL NH DHHS HEALTH ALERT Dstrbuted by the NH Health Alert Network Health.Alert@dhhs.nh.gov August 26, 2016 1430 EDT (2:30 PM EDT) NH-HAN 20160826 Recommendatons for Accurate Dagnoss of

More information

An Introduction to Modern Measurement Theory

An Introduction to Modern Measurement Theory An Introducton to Modern Measurement Theory Ths tutoral was wrtten as an ntroducton to the bascs of tem response theory (IRT) modelng and ts applcatons to health outcomes measurement for the Natonal Cancer

More information

A Computer-aided System for Discriminating Normal from Cancerous Regions in IHC Liver Cancer Tissue Images Using K-means Clustering*

A Computer-aided System for Discriminating Normal from Cancerous Regions in IHC Liver Cancer Tissue Images Using K-means Clustering* A Computer-aded System for Dscrmnatng Normal from Cancerous Regons n IHC Lver Cancer Tssue Images Usng K-means Clusterng* R. M. CHEN 1, Y. J. WU, S. R. JHUANG, M. H. HSIEH, C. L. KUO, Y. L. MA Department

More information

ARTICLE IN PRESS Neuropsychologia xxx (2010) xxx xxx

ARTICLE IN PRESS Neuropsychologia xxx (2010) xxx xxx Neuropsychologa xxx (200) xxx xxx Contents lsts avalable at ScenceDrect Neuropsychologa journal homepage: www.elsever.com/locate/neuropsychologa Storage and bndng of object features n vsual workng memory

More information

Price linkages in value chains: methodology

Price linkages in value chains: methodology Prce lnkages n value chans: methodology Prof. Trond Bjorndal, CEMARE. Unversty of Portsmouth, UK. and Prof. José Fernández-Polanco Unversty of Cantabra, Span. FAO INFOSAMAK Tangers, Morocco 14 March 2012

More information

N-back Training Task Performance: Analysis and Model

N-back Training Task Performance: Analysis and Model N-back Tranng Task Performance: Analyss and Model J. Isaah Harbson (jharb@umd.edu) Center for Advanced Study of Language and Department of Psychology, Unversty of Maryland 7005 52 nd Avenue, College Park,

More information

A Support Vector Machine Classifier based on Recursive Feature Elimination for Microarray Data in Breast Cancer Characterization. Abstract.

A Support Vector Machine Classifier based on Recursive Feature Elimination for Microarray Data in Breast Cancer Characterization. Abstract. A Support Vector Machne Classfer based on Recursve Feature Elmnaton for Mcroarray Data n Breast Cancer Characterzaton. R.Campann, D. Dongovann, E. Iamper, N. Lanconell, G. Palermo, M. Roffll, A. Rccard

More information

BINNING SOMATIC MUTATIONS BASED ON BIOLOGICAL KNOWLEDGE FOR PREDICTING SURVIVAL: AN APPLICATION IN RENAL CELL CARCINOMA

BINNING SOMATIC MUTATIONS BASED ON BIOLOGICAL KNOWLEDGE FOR PREDICTING SURVIVAL: AN APPLICATION IN RENAL CELL CARCINOMA BINNING SOMATIC MUTATIONS BASED ON BIOLOGICAL KNOWLEDGE FOR PREDICTING SURVIVAL: AN APPLICATION IN RENAL CELL CARCINOMA DOKYOON KIM, RUOWANG LI, SCOTT M. DUDEK, JOHN R. WALLACE, MARYLYN D. RITCHIE Center

More information

Cancer Classification Based on Support Vector Machine Optimized by Particle Swarm Optimization and Artificial Bee Colony

Cancer Classification Based on Support Vector Machine Optimized by Particle Swarm Optimization and Artificial Bee Colony molecules Artcle Cancer Classfcaton Based on Support Vector Machne Optmzed by Partcle Swarm Optmzaton and Artfcal Bee Colony Lngyun Gao 1 ID, Mngquan Ye 1, * and Changrong Wu 2 1 School of Medcal Informaton,

More information

Journal of Engineering Science and Technology Review 11 (2) (2018) Research Article

Journal of Engineering Science and Technology Review 11 (2) (2018) Research Article Jestr Journal of Engneerng Scence and Technology Revew 11 (2) (2018) 8-12 Research Artcle Detecton Lung Cancer Usng Gray Level Co-Occurrence Matrx (GLCM) and Back Propagaton Neural Network Classfcaton

More information

NUMERICAL COMPARISONS OF BIOASSAY METHODS IN ESTIMATING LC50 TIANHONG ZHOU

NUMERICAL COMPARISONS OF BIOASSAY METHODS IN ESTIMATING LC50 TIANHONG ZHOU NUMERICAL COMPARISONS OF BIOASSAY METHODS IN ESTIMATING LC50 by TIANHONG ZHOU B.S., Chna Agrcultural Unversty, 2003 M.S., Chna Agrcultural Unversty, 2006 A THESIS submtted n partal fulfllment of the requrements

More information

Economic crisis and follow-up of the conditions that define metabolic syndrome in a cohort of Catalonia,

Economic crisis and follow-up of the conditions that define metabolic syndrome in a cohort of Catalonia, Economc crss and follow-up of the condtons that defne metabolc syndrome n a cohort of Catalona, 2005-2012 Laa Maynou 1,2,3, Joan Gl 4, Gabrel Coll-de-Tuero 5,2, Ton Mora 6, Carme Saurna 1,2, Anton Scras

More information

Evaluation of Literature-based Discovery Systems

Evaluation of Literature-based Discovery Systems Evaluaton of Lterature-based Dscovery Systems Melha Yetsgen-Yldz 1 and Wanda Pratt 1,2 1 The Informaton School, Unversty of Washngton, Seattle, USA. 2 Bomedcal and Health Informatcs, School of Medcne,

More information

Drug Prescription Behavior and Decision Support Systems

Drug Prescription Behavior and Decision Support Systems Drug Prescrpton Behavor and Decson Support Systems ABSTRACT Adverse drug events plague the outcomes of health care servces. In ths research, we propose a clncal learnng model that ncorporates the use of

More information

Boosting for tumor classification with gene expression data. Seminar für Statistik, ETH Zürich, CH-8092, Switzerland

Boosting for tumor classification with gene expression data. Seminar für Statistik, ETH Zürich, CH-8092, Switzerland BIOINFORMATICS Vol. 19 no. 9 2003, pages 1061 1069 DOI: 10.1093/bonformatcs/btf867 Boostng for tumor classfcaton wth gene expresson data Marcel Dettlng and Peter Bühlmann Semnar für Statstk, ETH Zürch,

More information

Estimation for Pavement Performance Curve based on Kyoto Model : A Case Study for Highway in the State of Sao Paulo

Estimation for Pavement Performance Curve based on Kyoto Model : A Case Study for Highway in the State of Sao Paulo Estmaton for Pavement Performance Curve based on Kyoto Model : A Case Study for Kazuya AOKI, PASCO CORPORATION, Yokohama, JAPAN, Emal : kakzo603@pasco.co.jp Octávo de Souza Campos, Publc Servces Regulatory

More information

(From the Gastroenterology Division, Cornell University Medical College, New York 10021)

(From the Gastroenterology Division, Cornell University Medical College, New York 10021) ROLE OF HEPATIC ANION-BINDING PROTEIN IN BROMSULPHTHALEIN CONJUGATION* BY N. KAPLOWITZ, I. W. PERC -ROBB,~ ANn N. B. JAVITT (From the Gastroenterology Dvson, Cornell Unversty Medcal College, New York 10021)

More information

A comparison of statistical methods in interrupted time series analysis to estimate an intervention effect

A comparison of statistical methods in interrupted time series analysis to estimate an intervention effect Peer revew stream A comparson of statstcal methods n nterrupted tme seres analyss to estmate an nterventon effect a,b, J.J.J., Walter c, S., Grzebeta a, R. & Olver b, J. a Transport and Road Safety, Unversty

More information

RENAL FUNCTION AND ACE INHIBITORS IN RENAL ARTERY STENOSISA/adbon et al. 651

RENAL FUNCTION AND ACE INHIBITORS IN RENAL ARTERY STENOSISA/adbon et al. 651 Downloaded from http://ahajournals.org by on January, 209 RENAL FUNCTION AND INHIBITORS IN RENAL ARTERY STENOSISA/adbon et al. 65 Downloaded from http://ahajournals.org by on January, 209 Patents and Methods

More information

A Linear Regression Model to Detect User Emotion for Touch Input Interactive Systems

A Linear Regression Model to Detect User Emotion for Touch Input Interactive Systems 2015 Internatonal Conference on Affectve Computng and Intellgent Interacton (ACII) A Lnear Regresson Model to Detect User Emoton for Touch Input Interactve Systems Samt Bhattacharya Dept of Computer Scence

More information

Sparse Representation of HCP Grayordinate Data Reveals. Novel Functional Architecture of Cerebral Cortex

Sparse Representation of HCP Grayordinate Data Reveals. Novel Functional Architecture of Cerebral Cortex 1 Sparse Representaton of HCP Grayordnate Data Reveals Novel Functonal Archtecture of Cerebral Cortex X Jang 1, Xang L 1, Jngle Lv 2,1, Tuo Zhang 2,1, Shu Zhang 1, Le Guo 2, Tanmng Lu 1* 1 Cortcal Archtecture

More information

Subject-Adaptive Real-Time Sleep Stage Classification Based on Conditional Random Field

Subject-Adaptive Real-Time Sleep Stage Classification Based on Conditional Random Field Subject-Adaptve Real-Tme Sleep Stage Classfcaton Based on Condtonal Random Feld Gang Luo, PhD, Wanl Mn, PhD IBM TJ Watson Research Center, Hawthorne, NY {luog, wanlmn}@usbmcom Abstract Sleep stagng s the

More information

Saeed Ghanbari, Seyyed Mohammad Taghi Ayatollahi*, Najaf Zare

Saeed Ghanbari, Seyyed Mohammad Taghi Ayatollahi*, Najaf Zare DOI:http://dx.do.org/10.7314/APJCP.2015.16.14.5655 and Anthracyclne- Breast Cancer Treatment and Survval n the Eastern Medterranean and Asa: a Meta-analyss RESEARCH ARTICLE Comparng Role of Two Chemotherapy

More information

ALMALAUREA WORKING PAPERS no. 9

ALMALAUREA WORKING PAPERS no. 9 Snce 1994 Inter-Unversty Consortum Connectng Unverstes, the Labour Market and Professonals AlmaLaurea Workng Papers ISSN 2239-9453 ALMALAUREA WORKING PAPERS no. 9 September 211 Propensty Score Methods

More information

Integration of sensory information within touch and across modalities

Integration of sensory information within touch and across modalities Integraton of sensory nformaton wthn touch and across modaltes Marc O. Ernst, Jean-Perre Brescan, Knut Drewng & Henrch H. Bülthoff Max Planck Insttute for Bologcal Cybernetcs 72076 Tübngen, Germany marc.ernst@tuebngen.mpg.de

More information

Monte Carlo Analysis of a Subcutaneous Absorption Insulin Glargine Model: Variability in Plasma Insulin Concentrations

Monte Carlo Analysis of a Subcutaneous Absorption Insulin Glargine Model: Variability in Plasma Insulin Concentrations 2012 2nd Internatonal Conference on Bomedcal Engneerng and Technology IPCBEE vol. 34 (2012) (2012) IACSIT Press, Sngapore Monte Carlo Analyss of a Subcutaneous Absorpton Insuln Glargne Model: Varablty

More information

Evaluation of the generalized gamma as a tool for treatment planning optimization

Evaluation of the generalized gamma as a tool for treatment planning optimization Internatonal Journal of Cancer Therapy and Oncology www.jcto.org Evaluaton of the generalzed gamma as a tool for treatment plannng optmzaton Emmanoul I Petrou 1,, Ganesh Narayanasamy 3, Eleftheros Lavdas

More information

Prediction of Human Disease-Related Gene Clusters by Clustering Analysis

Prediction of Human Disease-Related Gene Clusters by Clustering Analysis Int. J. Bol. Sc. 2011, 7 61 Research Paper Internatonal Journal of Bologcal Scences 2011; 7(1):61-73 Ivysprng Internatonal Publsher. All rghts reserved Predcton of Human Dsease-Related Gene Clusters by

More information

Towards Prediction of Radiation Pneumonitis Arising from Lung Cancer Patients Using Machine Learning Approaches

Towards Prediction of Radiation Pneumonitis Arising from Lung Cancer Patients Using Machine Learning Approaches Towards Predcton of Radaton Pneumonts Arsng from Lung Cancer Patents Usng Machne Learnng Approaches Jung Hun Oh, Adtya Apte, Rawan Al-Loz, Jeffrey Bradley, Issam El Naqa * Dvson of Bonformatcs and Outcomes

More information

Fast Algorithm for Vectorcardiogram and Interbeat Intervals Analysis: Application for Premature Ventricular Contractions Classification

Fast Algorithm for Vectorcardiogram and Interbeat Intervals Analysis: Application for Premature Ventricular Contractions Classification Fast Algorthm for Vectorcardogram and Interbeat Intervals Analyss: Applcaton for Premature Ventrcular Contractons Classfcaton Irena Jekova, Vessela Krasteva Centre of Bomedcal Engneerng Prof. Ivan Daskalov

More information

CONSTRUCTION OF STOCHASTIC MODEL FOR TIME TO DENGUE VIRUS TRANSMISSION WITH EXPONENTIAL DISTRIBUTION

CONSTRUCTION OF STOCHASTIC MODEL FOR TIME TO DENGUE VIRUS TRANSMISSION WITH EXPONENTIAL DISTRIBUTION Internatonal Journal of Pure and Appled Mathematcal Scences. ISSN 97-988 Volume, Number (7), pp. 3- Research Inda Publcatons http://www.rpublcaton.com ONSTRUTION OF STOHASTI MODEL FOR TIME TO DENGUE VIRUS

More information

What Determines Attitude Improvements? Does Religiosity Help?

What Determines Attitude Improvements? Does Religiosity Help? Internatonal Journal of Busness and Socal Scence Vol. 4 No. 9; August 2013 What Determnes Atttude Improvements? Does Relgosty Help? Madhu S. Mohanty Calforna State Unversty-Los Angeles Los Angeles, 5151

More information

FAST DETECTION OF MASSES IN MAMMOGRAMS WITH DIFFICULT CASE EXCLUSION

FAST DETECTION OF MASSES IN MAMMOGRAMS WITH DIFFICULT CASE EXCLUSION computng@tanet.edu.te.ua www.tanet.edu.te.ua/computng ISSN 727-6209 Internatonal Scentfc Journal of Computng FAST DETECTION OF MASSES IN MAMMOGRAMS WITH DIFFICULT CASE EXCLUSION Gábor Takács ), Béla Patak

More information

The Effect of Fish Farmers Association on Technical Efficiency: An Application of Propensity Score Matching Analysis

The Effect of Fish Farmers Association on Technical Efficiency: An Application of Propensity Score Matching Analysis The Effect of Fsh Farmers Assocaton on Techncal Effcency: An Applcaton of Propensty Score Matchng Analyss Onumah E. E, Esslfe F. L, and Asumng-Brempong, S 15 th July, 2016 Background and Motvaton Outlne

More information

Appendix F: The Grant Impact for SBIR Mills

Appendix F: The Grant Impact for SBIR Mills Appendx F: The Grant Impact for SBIR Mlls Asmallsubsetofthefrmsnmydataapplymorethanonce.Ofthe7,436applcant frms, 71% appled only once, and a further 14% appled twce. Wthn my data, seven companes each submtted

More information

Unobserved Heterogeneity and the Statistical Analysis of Highway Accident Data

Unobserved Heterogeneity and the Statistical Analysis of Highway Accident Data Unobserved Heterogenety and the Statstcal Analyss of Hghway Accdent Data Fred L. Mannerng Professor of Cvl and Envronmental Engneerng Courtesy Department of Economcs Unversty of South Florda 4202 E. Fowler

More information

Active Affective State Detection and User Assistance with Dynamic Bayesian Networks. Xiangyang Li, Qiang Ji

Active Affective State Detection and User Assistance with Dynamic Bayesian Networks. Xiangyang Li, Qiang Ji Actve Affectve State Detecton and User Assstance wth Dynamc Bayesan Networks Xangyang L, Qang J Electrcal, Computer, and Systems Engneerng Department Rensselaer Polytechnc Insttute, 110 8th Street, Troy,

More information

Incorrect Beliefs. Overconfidence. Types of Overconfidence. Outline. Overprecision 4/22/2015. Econ 1820: Behavioral Economics Mark Dean Spring 2015

Incorrect Beliefs. Overconfidence. Types of Overconfidence. Outline. Overprecision 4/22/2015. Econ 1820: Behavioral Economics Mark Dean Spring 2015 Incorrect Belefs Overconfdence Econ 1820: Behavoral Economcs Mark Dean Sprng 2015 In objectve EU we assumed that everyone agreed on what the probabltes of dfferent events were In subjectve expected utlty

More information

A New Machine Learning Algorithm for Breast and Pectoral Muscle Segmentation

A New Machine Learning Algorithm for Breast and Pectoral Muscle Segmentation Avalable onlne www.ejaet.com European Journal of Advances n Engneerng and Technology, 2015, 2(1): 21-29 Research Artcle ISSN: 2394-658X A New Machne Learnng Algorthm for Breast and Pectoral Muscle Segmentaton

More information

Estimating the distribution of the window period for recent HIV infections: A comparison of statistical methods

Estimating the distribution of the window period for recent HIV infections: A comparison of statistical methods Research Artcle Receved 30 September 2009, Accepted 15 March 2010 Publshed onlne n Wley Onlne Lbrary (wleyonlnelbrary.com) DOI: 10.1002/sm.3941 Estmatng the dstrbuton of the wndow perod for recent HIV

More information

Resampling Methods for the Area Under the ROC Curve

Resampling Methods for the Area Under the ROC Curve Resamplng ethods for the Area Under the ROC Curve Andry I. Bandos AB6@PITT.EDU Howard E. Rockette HERBST@PITT.EDU Department of Bostatstcs, Graduate School of Publc Health, Unversty of Pttsburgh, Pttsburgh,

More information

WHO S ASSESSMENT OF HEALTH CARE INDUSTRY PERFORMANCE: RATING THE RANKINGS

WHO S ASSESSMENT OF HEALTH CARE INDUSTRY PERFORMANCE: RATING THE RANKINGS WHO S ASSESSMENT OF HEALTH CARE INDUSTRY PERFORMANCE: RATING THE RANKINGS ELLIOTT PARKER and JEANNE WENDEL * Department of Economcs, Unversty of Nevada, Reno, NV, USA SUMMARY Ths paper examnes the econometrc

More information

An Approach to Discover Dependencies between Service Operations*

An Approach to Discover Dependencies between Service Operations* 36 JOURNAL OF SOFTWARE VOL. 3 NO. 9 DECEMBER 2008 An Approach to Dscover Dependences between Servce Operatons* Shuyng Yan Research Center for Grd and Servce Computng Insttute of Computng Technology Chnese

More information

Statistical Analysis on Infectious Diseases in Dubai, UAE

Statistical Analysis on Infectious Diseases in Dubai, UAE Internatonal Journal of Preventve Medcne Research Vol. 1, No. 4, 015, pp. 60-66 http://www.ascence.org/journal/jpmr Statstcal Analyss on Infectous Dseases 1995-013 n Duba, UAE Khams F. G. 1, Hussan H.

More information

Computing and Using Reputations for Internet Ratings

Computing and Using Reputations for Internet Ratings Computng and Usng Reputatons for Internet Ratngs Mao Chen Department of Computer Scence Prnceton Unversty Prnceton, J 8 (69)-8-797 maoch@cs.prnceton.edu Jaswnder Pal Sngh Department of Computer Scence

More information

National Polyp Study data: evidence for regression of adenomas

National Polyp Study data: evidence for regression of adenomas 5 Natonal Polyp Study data: evdence for regresson of adenomas 78 Chapter 5 Abstract Objectves The data of the Natonal Polyp Study, a large longtudnal study on survellance of adenoma patents, s used for

More information

We analyze the effect of tumor repopulation on optimal dose delivery in radiation therapy. We are primarily

We analyze the effect of tumor repopulation on optimal dose delivery in radiation therapy. We are primarily INFORMS Journal on Computng Vol. 27, No. 4, Fall 215, pp. 788 83 ISSN 191-9856 (prnt) ó ISSN 1526-5528 (onlne) http://dx.do.org/1.1287/joc.215.659 215 INFORMS Optmzaton of Radaton Therapy Fractonaton Schedules

More information

Statistical models for predicting number of involved nodes in breast cancer patients

Statistical models for predicting number of involved nodes in breast cancer patients Vol.2, No.7, 641-651 (2010) do:10.4236/health.2010.27098 Health Statstcal models for predctng number of nvolved nodes n breast cancer patents Alok Kumar Dwved 1 *, Sada Nand Dwved 2, Suryanarayana Deo

More information

Encoding processes, in memory scanning tasks

Encoding processes, in memory scanning tasks vlemory & Cognton 1976,4 (5), 501 506 Encodng processes, n memory scannng tasks JEFFREY O. MILLER and ROBERT G. PACHELLA Unversty of Mchgan, Ann Arbor, Mchgan 48101, Three experments are presented that

More information

*VALLIAPPAN Raman 1, PUTRA Sumari 2 and MANDAVA Rajeswari 3. George town, Penang 11800, Malaysia. George town, Penang 11800, Malaysia

*VALLIAPPAN Raman 1, PUTRA Sumari 2 and MANDAVA Rajeswari 3. George town, Penang 11800, Malaysia. George town, Penang 11800, Malaysia 38 A Theoretcal Methodology and Prototype Implementaton for Detecton Segmentaton Classfcaton of Dgtal Mammogram Tumor by Machne Learnng and Problem Solvng *VALLIAPPA Raman, PUTRA Sumar 2 and MADAVA Rajeswar

More information

Effects of Estrogen Contamination on Human Cells: Modeling and Prediction Based on Michaelis-Menten Kinetics 1

Effects of Estrogen Contamination on Human Cells: Modeling and Prediction Based on Michaelis-Menten Kinetics 1 J. Water Resource and Protecton, 009,, 6- do:0.6/warp.009.500 Publshed Onlne ovember 009 (http://www.scrp.org/ournal/warp) Effects of Estrogen Contamnaton on Human Cells: Modelng and Predcton Based on

More information

Submitted for Presentation 94th Annual Meeting of the Transportation Research Board January 11-15, 2015, Washington, D.C.

Submitted for Presentation 94th Annual Meeting of the Transportation Research Board January 11-15, 2015, Washington, D.C. Wegh-In-Moton Staton Montorng and Calbraton usng Inductve Loop Sgnature Technology Shn-Tng (Cndy) Jeng (Correspondng Author) CLR Analytcs Inc 8885 Research Drve Sute 15 Irvne, CA 92618 Tel: 949-864-6696,

More information

Balanced Query Methods for Improving OCR-Based Retrieval

Balanced Query Methods for Improving OCR-Based Retrieval Balanced Query Methods for Improvng OCR-Based Retreval Kareem Darwsh Electrcal and Computer Engneerng Dept. Unversty of Maryland, College Park College Park, MD 20742 kareem@glue.umd.edu Douglas W. Oard

More information

A Geometric Approach To Fully Automatic Chromosome Segmentation

A Geometric Approach To Fully Automatic Chromosome Segmentation A Geometrc Approach To Fully Automatc Chromosome Segmentaton Shervn Mnaee ECE Department New York Unversty Brooklyn, New York, USA shervn.mnaee@nyu.edu Mehran Fotouh Computer Engneerng Department Sharf

More information

Strategies for the Early Diagnosis of Acute Myocardial Infarction Using Biochemical Markers

Strategies for the Early Diagnosis of Acute Myocardial Infarction Using Biochemical Markers Clncal Chemstry / EARLY DIAGNOSIS OF ACUTE MYOCARDIAL INFARCTION USING IOCHEMICAL MARKERS Strateges for the Early Dagnoss of Acute Myocardal Infarcton Usng ochemcal Markers Martna Zannotto, Leopoldo Celegon,

More information

HIV/AIDS-related Expectations and Risky Sexual Behavior in Malawi

HIV/AIDS-related Expectations and Risky Sexual Behavior in Malawi HIV/AIDS-related Expectatons and Rsky Sexual Behavor n Malaw Adelne Delavande Unversty of Essex and RAND Corporaton Hans-Peter Kohler Unversty of Pennsylvanna January 202 Abstract We use probablstc expectatons

More information

NeuroImage. Multimodal classification of Alzheimer's disease and mild cognitive impairment

NeuroImage. Multimodal classification of Alzheimer's disease and mild cognitive impairment NeuroImage 55 (2011) 856 867 Contents lsts avalable at ScenceDrect NeuroImage journal homepage: www.elsever.com/locate/ynmg Multmodal classfcaton of Alzhemer's dsease and mld cogntve mparment Daoqang Zhang

More information

Nonstandard Machine Learning Algorithms for Microarray Data Mining. Byoung-Tak Zhang

Nonstandard Machine Learning Algorithms for Microarray Data Mining. Byoung-Tak Zhang Nonstandard Machne Learnng Algorthms for Mcroarray Data Mnng Byoung-Tak Zhang Center for Bonformaton Technology (CBIT) & Bontellgence Laboratory School of Computer Scence and Engneerng Seoul Natonal Unversty

More information

Using a Wavelet Representation for Classification of Movement in Bed

Using a Wavelet Representation for Classification of Movement in Bed Usng a Wavelet Representaton for Classfcaton of Movement n Bed Adrana Morell Adam Depto. de Matemátca e Estatístca Unversdade de Caxas do Sul Caxas do Sul RS E-mal: amorell@ucs.br André Gustavo Adam Depto.

More information

Research Article Computational Analysis of Specific MicroRNA Biomarkers for Noninvasive Early Cancer Detection

Research Article Computational Analysis of Specific MicroRNA Biomarkers for Noninvasive Early Cancer Detection Hndaw BoMed Research Internatonal Volume 0, Artcle ID 00, pages https://do.org/0./0/00 Research Artcle Computatonal Analyss of Specfc McroRNA Bomarkers for Nonnvasve Early Detecton Tanc Song, Yanchun Lang,,

More information

Investigation of zinc oxide thin film by spectroscopic ellipsometry

Investigation of zinc oxide thin film by spectroscopic ellipsometry VNU Journal of Scence, Mathematcs - Physcs 24 (2008) 16-23 Investgaton of znc oxde thn flm by spectroscopc ellpsometry Nguyen Nang Dnh 1, Tran Quang Trung 2, Le Khac Bnh 2, Nguyen Dang Khoa 2, Vo Th Ma

More information

A New Diagnosis Loseless Compression Method for Digital Mammography Based on Multiple Arbitrary Shape ROIs Coding Framework

A New Diagnosis Loseless Compression Method for Digital Mammography Based on Multiple Arbitrary Shape ROIs Coding Framework I.J.Modern Educaton and Computer Scence, 2011, 5, 33-39 Publshed Onlne August 2011 n MECS (http://www.mecs-press.org/) A New Dagnoss Loseless Compresson Method for Dgtal Mammography Based on Multple Arbtrary

More information

TOPICS IN HEALTH ECONOMETRICS

TOPICS IN HEALTH ECONOMETRICS TOPICS IN HEALTH ECONOMETRICS By VIDHURA SENANI BANDARA WIJAYAWARDHANA TENNEKOON A dssertaton submtted n partal fulfllment of the requrements for the degree of DOCTOR OF PHILOSOPHY WASHINGTON STATE UNIVERSITY

More information

Lateral Transfer Data Report. Principal Investigator: Andrea Baptiste, MA, OT, CIE Co-Investigator: Kay Steadman, MA, OTR, CHSP. Executive Summary:

Lateral Transfer Data Report. Principal Investigator: Andrea Baptiste, MA, OT, CIE Co-Investigator: Kay Steadman, MA, OTR, CHSP. Executive Summary: Samar tmed c ali ndus t r esi nc 55Fl em ngdr ve, Un t#9 Cambr dge, ON. N1T2A9 T el. 18886582206 Ema l. nf o@s amar t r ol l boar d. c om www. s amar t r ol l boar d. c om Lateral Transfer Data Report

More information

Importance of Atrial Compliance in Cardiac Performance

Importance of Atrial Compliance in Cardiac Performance Importance of Atral Complance n Cardac Performance By Hroyuk Suga ABSTRACT Effects of changes n atral complance on cardac performance were analyzed usng a crculatory analog model. The atrum was assumed

More information

The effect of salvage therapy on survival in a longitudinal study with treatment by indication

The effect of salvage therapy on survival in a longitudinal study with treatment by indication Research Artcle Receved 28 October 2009, Accepted 8 June 2010 Publshed onlne 30 August 2010 n Wley Onlne Lbrary (wleyonlnelbrary.com) DOI: 10.1002/sm.4017 The effect of salvage therapy on survval n a longtudnal

More information