Hierarchical kernel mixture models for the prediction of AIDS disease progression using HIV structural gp120 profiles

Size: px
Start display at page:

Download "Hierarchical kernel mixture models for the prediction of AIDS disease progression using HIV structural gp120 profiles"

Transcription

1 PROCEEDINGS Open Access Herarchcal kernel mxture models for the predcton of AIDS dsease progresson usng HIV structural gp20 profles Paul D Yoo,2*, Yung Shwen Ho 3, Jason Ng 4, Mchael Charleston 5, Ntn K Saksena 3, Pengy Yang 6, Albert Y Zomaya,5 From Asa Pacfc Bonformatcs Network (APBoNet) Nnth Internatonal Conference on Bonformatcs (InCoB200) Tokyo, Japan September 200 Abstract Changes to the glycosylaton profle on HIV gp20 can nfluence vral pathogeness and alter AIDS dsease progresson. The characterzaton of glycosylaton dfferences at the sequence level s nadequate as the placement of carbohydrates s structurally complex. However, no structural framework s avalable to date for the study of HIV dsease progresson. In ths study, we propose a novel machne-learnng based framework for the predcton of AIDS dsease progresson n three stages (RP, SP, and LTNP) usng the HIV structural gp20 profle. Ths new ntellgent framework proves to be accurate and provdes an mportant benchmark for predctng AIDS dsease progresson computatonally. The model s traned usng a novel HIV gp20 glycosylaton structural profle to detect possble stages of AIDS dsease progresson for the target sequences of HIV + ndvduals. The performance of the proposed model was compared to seven exstng dfferent machne-learnng models on newly proposed gp20- Benchmark_ dataset n terms of error-rate (MSE), accuracy (CCI), stablty (STD), and complexty (TBM). The novel framework showed better predctve performance wth 67.82% CCI, 30.2 MSE, 0.8 STD, and 2.62 TBM on the three stages of AIDS dsease progresson of 50 HIV+ ndvduals. Ths framework s an nvaluable bonformatcs tool that wll be useful to the clncal assessment of vral pathogeness. Background The human mmunodefcency vrus (HIV) s responsble for the acqured mmunodefcency syndrome (AIDS) dsease and 33 mllon people are nfected globally. Infected ndvduals can lve a normal lfe wth drug treatment, but most wll eventually progress to AIDS. The duraton of dsease vares between ndvduals. Some HIV + patents can progress towards AIDS wthn two years of prmary nfecton (rapd progressors RP). RP show rapd rse n plasma vrus and rapd declne n CD + T cell counts. On the other hand, another group of HIV + patents show steady but gradual ncrease n vrema and decrease n T cell counts over 0-5 years (slow progressors SP). Only about % of HIV + therapy naïve ndvduals can mantan vrus level below detecton level, robust T cell counts and experence sustaned mmune response for more than 20 years (long term non-progressors LTNP). Wth such a great dfference n AIDS dsease progresson among HIV + patents, much can be learned at the level of dfferences n vral archtecture that exsts n HIV varants evolvng at dfferent stages of HIV dsease and under dfferent mmunologc constrants n a gven host. Glycans on the HIV glycoproten 20 (gp20) surface mask mportant vral eptopes that host antbodes recognze [,2], preventng the eradcaton of the vrus. The rapd mutaton n gp20 durng vral evoluton further creates an ever changng landscape of glycosylaton patterns of HIV surface glycoproten gp20 (also known as the carbohydrate landscape ) thatfavours host mmune evason. Ths observaton has been termed the glycan sheld of HIV [3] and s drectly responsble for the persstence of vral nfecton even after therapy. Thus, any modfcaton to the glycosylaton profle of 200 Yoo et al; lcensee BoMed Central Ltd. Ths s an open access artcle dstrbuted under the terms of the Creatve Commons Attrbuton Lcense ( whch permts unrestrcted use, dstrbuton, and reproducton n any medum, provded the orgnal work s properly cted.

2 Page 2 of 0 gp20 s lkely to affect vral susceptblty to host mmune response [4], transmsson effcency [5], nfectvty [6] and AIDS dsease progresson [7]. Whle the glycosylaton of HIV s the man barrer to vral control and eradcaton, t s possble to harness the protectve glycosylaton profles on gp20 aganst the vrus [8] and develop a glycan based approach to vaccne desgn. We have prevously reported on our fndngs on glycosylaton ste nteracton wthn the envelope gp20 [9], whch are consstent wth the fndngs by Poon et al [0]. The assocaton of multple glycans wthn env gp20 could be due to the structural placement of the glycosylaton stes after proten foldng. Glycosylaton stes that are far away at the sequence level mght actually be close together n three-dmensonal (3D) structure of a proten. Thus, the understandng of gp20 glycosylaton structural (3D) profle modfcaton can explan the determnants of HIV dsease progresson. Studes to date have manly focused on the changes to sngle glycosylaton stes at the sequence level, whle the analyss of complete gp20 structural glycan modfcaton s new. Ths could be due to the lack of an analyss framework for multple glycan comparson across the entre gp20 sequence. In ths paper, we ntroduce a novel statstcal kernel model, whch s desgned to learn the complex glycan nteractons and predct the dfferences n AIDS dsease progresson usng the structural 3D glycan profle. It nvolves the desgn of sem-parameterzed, and supportvector asssted herarchcal mxture model, whch s able to effectvely capture the nformaton of non-local nteractons wth strong resstance to vanshng gradent and hgh-dmensonalty problems. The proposed framework successfully classfed the changes to glycosylaton profles and segregated HIV dsease groups. These results show the utlty of new bonformatcs and machnelearnng tools n provdng useful bologcal understandng of glycosylaton patterns durng AIDS dsease progresson. Methods Our approach to the predcton of AIDS dsease progresson conssts of three consecutve steps: () comprehensve HIV dataset constructon for the purpose of benchmarkng 3D structure-based HIV progress classfcaton methods. (2) novel gp20 structural profle desgn and (3) the constructon of sem-parameterzed, and support vector asssted herarchcal mxture model for the explotaton of non-local nteracton nformaton from the profle. gp20 benchmark dataset gp20-benchmark_ s a newly developed comprehensve dataset for benchmarkng 3D structure-based AIDS dsease progress classfcaton methods. Based on AIDS progresson rates, HIV + ndvduals have been dvded nto three categores such as rapd progressors (RP), slow progressors (SP) and long-term non-progressors (LTNP) []. RP patents progress towards AIDS wthn two years of prmary nfecton. RP show rapd rse n plasma vrus and rapd declne n CD + Tcellcounts.SP group of HIV + patents show steady but gradual ncrease n vrema and decrease n T cell counts over 0-5 years. Only about % of HIV + therapy naïve ndvduals (LTNP) can mantan vrus level below detecton level, robust T cell counts and experence sustaned mmune response for more than 20 years []. A total of 50 env gp20 samples were manually extracted from 50 HIV + ndvduals, of whch 0 samples were LTNPs, were RPs and 29 were SPs. The lst of samples wth the Gen- Bank accesson number, year of sample collecton and dsease type are shown n Table. gp20-benchmark_ s avalable at edu.au/~yangpy/software/aids-progresson.html Structural gp20 proflng The glycosylaton of HIV occurs on the asparagne resdue of a NX[ST] motf where X can be any amno acd resdue except prolne [35,36]. To create a whole envelope glycan profle, the 3D coordnates of the asparagne resdue from every glycosylaton ste were extracted from the gp20 structure models. The extracted glycans from the query model were matched to an approprate glycosylaton ste from the template model (Fgure, left-hand). Mutatons, nsertons and deletons (ndels) are common wthn the gp20 gene and can thus complcate the matchng of target and template glycosylaton stes. Further, t s also dffcult to dfferentate whether a glycosylaton ste has moved f the neghbourng glycans are clustered close together. To overcome these problems, we developed a protocol to loosely encapsulate the movement, nserton and deleton of glycosylaton stes n the HIV envelope gp20 models, to facltate the glycan matchng process. Frst, the glycosylaton stes were separated nto ther respectve regons (V,V2,C2,V3,C3,V4,C4,V5,C5).Theglycanfrom the query model can only be matched wth the template glycan of the same regon. Second, wthn each regon, a dstance matrx was created by calculatng the 3D dstance between every glycan of the query and template models. Fnally, every glycan n the query model was matched to the closest glycan from the target model and ther dstance noted. The addton of glycosylaton stes as compared to the template model s expressed as an addtonal dstance value n the glycosylaton profle. The movement of glycosylaton stes s reflected as a sgnfcant change n dstance value n the profle, whle retanng ts assocaton

3 Page 3 of 0 Table Patent dataset used for structural glycan proflng Cohort/ Country Sample GenBank Accesson No Year of sample collecton Dsease type USA A AY RP USA A2 AY RP USA A3 AY RP USA B4 AY RP USA B6 AY RP USA C7 AY RP USA C8 AY RP USA D9 AY RP USA D0 AY RP USA D AY RP USA E2 AY RP USA F AY SP USA F2 AY SP USA F3 AY SP USA G4 AY SP USA G5 AY SP USA G6 AY SP USA H7 AY SP USA H8 AY SP USA J AY SP USA K4 AY SP USA K5 AY SP USA K6 AY SP USA L7 AY SP USA M AY LTNP USA M2 AY LTNP USA M3 AY LTNP USA M4 AY LTNP USA M5 AY LTNP USA N8 AY LTNP Canada CAN_A_ AY LTNP Canada CAN_A_3 AY LTNP Canada CAN_A_5 AY LTNP Canada CAN_A_6 AY LTNP Canada CAN_B_3 AY SP Canada CAN_B_4 AY SP Canada CAN_B_5 AY SP Canada CAN_B_6 AY SP Canada CAN_C_2 AY SP Canada CAN_C_3 AY SP Canada CAN_C_5 AY SP Canada CAN_C_6 AY SP Canada CAN_C_8 AY SP Canada CAN_C_0 AY SP Australa 8 GQ SP Australa 82 GQ SP Australa BB_76 GQ SP Australa BB_24 GQ SP Australa BB_42 GQ SP Australa BB_92 GQ SP wth the same template glycan. Fnally, the deleton of glycans was coded as an absent dstance value n the profle as compared to other models. The glycan profle reflects the varaton n the global glycosylaton pattern n relaton to the gven template model. The glycan profles for all the sequences n our dataset are compled n an N? M matrx, where N s the maxmum number of glycans n our dataset and M s the number of sequences n the dataset. Sequences wth fewer glycosylaton stes were padded wth a dstance value of zero on ether sde of the matrx. Nonlocal nteractons n sequence and vanshng gradent problem A machne learnng framework wth kernel method [2] was used to learn and characterze the changes to glycosylaton profles for the predcton of HIV dsease progresson. Exstng large machne-learnng algorthms lke neural networks have performed well n the analyss of sequence-based problems, where the nformaton s processed n the order that they are gven. However, these were not desgned to analyse the long-range dependences between glycosylaton stes at the sequence level. Ths s due to the lack of an effcent algorthm for numercal optmzaton, commonly known as the vanshng-gradent problem [3]. For example, when the glycosylaton stes are analysed n the order they appear n the sequence, the dependency between a glycan X and a prevously processed glycan Y mght not be learnt. If glycan X and glycan Y are processed mmedately after one another (short range), ther dependency can be learnt. However, f there are multple glycans between X and Y (long range), the neural network can only learn the dependences between glycan X and the sum of all glycan nformaton before X (ncludng glycan Y). Whle the dependences between X and Y are somewhat establshed because of the partal ncluson of glycan Y n the combned nformaton, the long range dependency learned between glycans X and Y are not suffcent to classfy HIV dsease progresson. Furthermore, error mnmzaton s known to fal n the presence of dependences that are far apart n the sequence space [4,5]. Support vector asssted herarchcal mxture model (SV-HMM) One possble soluton to ths problem s to amelorate the vanshng gradent problem by provdng the kernel functon wth nformaton of dstantly located glycan. Ths gves the machne-learnng framework a far chance of learnng any long-range nteracton between glycosylaton stes. We developed a herarchcal kernel mxture model that combnes a modular approach wth local lnear support vector classfers, whch can effectvely analyse dstantly related nformaton.

4 Page 4 of 0 Fgure Annotaton of the glycans on the template env gp20 crystal structure PB4C (left-hand), and varatons n structural locatons of glycans (rght-hand). At the top level of our framework, we have adopted a well-known Herarchcal Mxture of Experts (HME) model for regresson and classfcaton [6]. The HME model s a tree-lke structure (Fgure 2) that uses the dvde and conquer prncple to learn the nteractons between HIV glycosylaton stes. At the bottom of the tree (leaf nodes) are multple localty effectve support vector, that wll analyse the smlartes between the gven glycosylaton stes. HME descrbes a condtonal probablty dstrbuton over a vector t of target varables, condtoned on a vector x of nputs. Consder the case of functonal mappng learnng of the type y = f()based x on tranng data set T=(x (t),y (t) ),t=0,, nwth x = { x, x2,, x n } and a correspondng desred response y = { y, y2,, y n }.All of the networks, both experts and gatng, receve the same nput vector at the t th tme nstant, x (t). However, whle the gatng networks use ths nput to compute confdence level values for the outputs of the connected expert networks, the expert networks themselves use the nput to generate an estmate of the desred output value. The outputs of the gatng networks are scalar values and are a partton of unty at each pont n the nput space,.e. a probablty set. Thus, consder a twolayered bnary branchng HME: Each of the expert local models (, j) produces outputs y j from the nput vector x (t) t accordng to the relatonshp: yj = f x ( ) (, wj),, where f s a neural-network mappng usng nput x (t) and ts correspondng weght matrx w j. The outputs of the gatng network g at the top level are computed accordng to: g e = wth = v x k e k T () t where V s the weght vector assocated wth gatng network g. Due to the specal form of the softmax beng non-lnear, the g s are postve and sum up to one for each nput vector x (t). The lower level gatng networks compute ther output actvatons smlar to the top level gatng network accordng to the followng expresson: g j j e = wth = v x k e k j T () t The output actvatons of the expert networks are weghted by the gatng networks output actvatons as they proceed up the tree to form the overall output vector. Specfcally, the output of the th nternal node n the second layer of the tree s: y = g y j j j whle the output at the top level node s: y ( t ) = gy Snce both the g s and the y s depend on the nput x (t), the overall output of the archtecture s a non-lnear functon of the nput.

5 Page 5 of 0 Fgure 2 Modular herarchcal kernel experts as a global model. HME has a tree-lke structure that uses the dvde and conquer prncple to learn the nteractons between HIV glycosylaton stes. At the bottom of the tree (leaf nodes) are multple localty effectve support vector, that wll analyse the smlartes between the gven glycosylaton stes. Local kernel machne as a collaboratng flter to SV-HMM As a key local collaborator, weuseasupport-vector classfer [7] to assst the global HME model. SVMs are known as maxmum margn classfers snce they classfy ther objects by mnmzng the emprcal generalzaton error and maxmzng the geometrc margn smultaneously. Where the two classes are not separable, they map the nput space nto a hgh-dmensonal feature space (where the classes are lnearly separable) by usng a non-lnear kernel functon. The kernel functon calculates the scalar product of the mages of two examples n the feature space. Gven a n-dmensonal nput vector, x =(x,x 2,,x n ) wth class labels, y Î {+, }, (=,2,...,N), the hyperplane decson functon of bnary SVM wth kernel method s wrtten as: f( x) = sgn ya ( x), ( x) + b sgn yak( x, x) b Φ Φ = + = = and the followng quadratc program: maxmze Wa () = a aayykx j j (, xj) 2 = j, = subject to a 0, =,...,, and ay = 0. = where s the number of tranng patters; a are the parameters of the SVM; k (, ) s a sutable kernel functon, and b s the bas term. Sem-parametrc modellng to the local kernels The use of x examples, especally n hgh-dmensonal space causes several key problems. Frst, the good data fttng capacty of the flexble model-free approach often tends to ft the tranng data very well and thus, have a low bas. However, the potental rsk s overfttng that causes hgh varance n generalsaton. In general, the varance s shown to be a more mportant factor than the learnng bas n poor predcton performance [8]. Second, wth the hgh-dmensonal data such as protens, as the number of hdden nodes of the network s severely ncreased, t eventually leads to an exponental rse n computatonal complexty. A hgh complexty model generally shows a low bas but a hgh varance [9]. On the other hand, a model wth low complexty shows a hgh bas but a low varance. Hence, a good model should balance well between model bas and model varance. Ths problem s generally regarded as the term bas-varance tradeoff. One of the solutons to the above problem s the socalled sem-parametrc modelng. Sem-parametrc models take assumptons that are stronger than those of nonparametrc models, but are less restrctve than those of parametrc model. In partcular, they avod many serous practcal dsadvantages of non-parametrc methods at the prce of an ncreased rsk of specfcaton errors. To sem-parameterze SV-HMM, we substtute the centrod vectors from vorono regon [20] for each

6 Page 6 of 0 tranng sample x used n the SVM decson functon (Fgure 3). Consder a tranng sequence consstng of M source vectors, T={x,x 2,,x m }. M s assumed to be suffcently large and so that all the statstcal propertes of the source are captured by the tranng sequence. We assume that the source vectors are k-dmensonal, X m = (x m,,x m,2,, x m,k ), m=,2,,m. These vectors are compressed by choosng the nearest matchng vectors and form a codebook consstng the set of all the code-vectors. N s the number of code-vectors, C={c,c 2,,c n } and each code-vector s k-dmensonal, c n =(c n,,c n,2,,c n, k),n=,2,,n. The representatve codevector s determned to be the closest n Eucldean dstance from the source vector. The Eucldean dstance s defned by: k 2 j j j= dxc (, ) = ( x c ) where x j s the j th component of the source vector, and c j s the j th s components of the code-vector c. S n s the nearest-neghbor regon assocated wth code-vector c n, and the parttons of the whole regon are denoted by P= {S,S 2,,S N }. If the source vector X m s n the regon S n, ts approxmaton can be denoted by Q(X m )=c n, f X m Î S n. The Vorono regon s defned by: V = { x R : x c x c, for all j } k To fnd the most optmal C and P, vector quantzaton uses a square-error dstorton measure specfyng exactly how close the approxmaton s. The dstorton measure can be gven as: M 2 Dave Xm Q Xm Mk m= If C and P are soluton to the above mnmzaton problem, then t must satsfy two condtons namely nearest neghbor and centrod condtons. The nearest neghbor condton ndcates the sub-regon S n should consst of all vectors that are closer to c n than any of the other code-vectors. It s wrtten as: = ( ) { } 2 2 n n n S = x : x c x c, n = 2,,..., N The centrod condton requres the code-vector c n should be average of all those tranng vectors that are n ts Vorono Regon S n. c n X m Xm Sn =, n = 2,,..., N Xm Sn j The centrod vector of each vorono regon whch can be expressed as: xm Xm S Q( Xm) = c =, = 2,,..., N N The new local SVM s approxmaton can be wrtten as: f() x = sgn ( w () x + b) = sgn yak(, x c) + b = and the followng quadratc program: maxmze Wa () = a aayykq j j ( (), x Qj()) x 2 = j, = subject to a 0, =,...,, and ay = 0. = SVM s consdered as a purely non-parametrc model, whereassv-hmmsconsderedassem-parametrc model as t adopts the method of groupng the assocated nput vectors n each class. Hence, the performance of the proposed model has some advantages n comparson to both pure parametrc models and pure non-parametrc models n terms of learnng bas and generalzaton varance especally on hgh-dmensonal proten datasets. Predcton of AIDS dsease progresson Our herarchcal mxture modelng contans a number of consecutve steps. Frst, structural glycosylaton profles from our HIV + RP, SP and LTNP sequences were used as the nput data n our computatonal kernel analyss. By learnng the dfferences n glycosylaton changes between patent samples, the SV-HMM was used to classfy a prevously unseen glycosylaton profle nto RPs, SPs or LTNPs. Second, the predcted classfcatons were compared wth the true dsease group for the glycosylaton profle. These experments were performed n a seres of steps. The SV-HMM and seven other wellregarded machne learnng models, transductve support vector machne (SVM LIB ) [2], SVM based decorate model (Decorate SVM ) [22], mult-layered perceptron (MLP) [23], radal bass functon network (RBFN) [24], logstcs [25], and decson trees (J48) [26] were used to analyse the glycosylaton profles. For the analyss of each kernel model, we performed a tenfold cross-valdaton evaluaton on the dataset most wdely adopted method for far model evaluaton of general computatonal sequence-based classfcaton [37-42]. Durng the cross-valdaton phase, one fold data were randomly chosen and excluded from the tranng

7 Page 7 of 0 Fgure 3 The basc archtecture of sem-parameterzed SV-based local models. The centrod vectors from vorono regon for each tranng sample x used n the SVM decson functon. SVM s consdered as a purely non-parametrc model, whereas SV-HMM s consdered as semparametrc model as t adopts the method of groupng the assocated nput vectors n each class. RBF kernel has been used for the SV-based local models. set before model learnng began, and were later used to test the performance of the learnt model. Ths process was repeated ten tmes. The cross valdaton protocol was chosen to solve the potental problems caused by resdual evaluatons. Ths s because, f the entre dataset were used for tranng, the model could not provde adequate ndcaton of ts effectveness n predctng any unseen data. Thus, t does not matter how the data s dvded as every data pont gets to be n the test set exactly once and, n the tranng sets, nne out of ten tmes. Snce every data pont s tested exactly once, t has been a wdely used evaluaton method for real expermental sequence data, (just as the case n gp20 benchmark experment) [37-42]. Fnally, each model has been measured by the predctve accuracy (CCI: correctly classfed nstances) of the kernel model. The fnal CCI value was calculated based on the average of all the predcton accuraces observed durng the tenfold valdaton process. The mean squared error (MSE: a quantty used to measure how close forecasts or predctons are to the eventual outcomes), tme (TBM: tme to buld model), and model stablty (STD: ST Devaton obtaned from ten sub-samples) were also measured n the experments. The stepwse procedure has been provded n Fgure 4. Results In our comparson between the eght machne-learnng models, SVM-asssted herarchcal mxture models (SV- HMM: sem-parametrc, HME SVM : non-parametrc) were shown to be most sutable (CCI: 67.82%, 69.38%) for the classfcaton of HIV dsease types usng the glycosylaton profles (Table 2). In general, SVM-based models such as SV-HMM, HME SVM,SVM LIB and Decorate SVM performed better than the other models n terms of CCI. Ths suggests that the lnear support vector approaches wth RBF kernel used n the standard SVMmodels accurately learns the characterstcs from structural gp20 profles for the predcton of AIDS dsease progresson. Mean testng data ±STD obtaned by ANOVA test usng optmal settngs for each model. The ANOVA CCI and STD of each model are nsgnfcant, whch suggest that all the models are performng consstently n the experment. The performance of our sem-parameterzed SV-MHH model was the most stable (STD: 0.8) n the classfcaton of HIV dsease stages than the other machne-learnng models, wth the excepton of the Logstc model, whch also gave good predcton stablty due to ts statstcal nature. These results showed that the herarchcal mxture model approach used n the proposed SV-HMM model can consstently capture crtcal changes to HIV glycosylaton profles and can be used to predct AIDS dsease progresson. Assocatve dfferences n the glycosylaton profle that determne dsease progresson mght be located far apart n the sequence, but close together n the tertary structure. The results also suggest that the modular approaches used n SV-HMM and HME SVM s able to capture these non-local nteractons. Thus, our proposed models are resstant to model over-fttng and weak sgnal to nose rato durng the learnng phase and can avod the vanshng gradent problem n ths scenaro. Our structural glycosylaton profles of our gp20 models were created usng the template crystal structure. The dstance between the matchng glycosylaton stes from our model to the template structure (Fgure, left-hand) was calculated to create the glycan profle. Most glycosylaton stes had an equal contrbuton of structural

8 Page 8 of 0 Fgure 4 The flowchart of SV-HMM showng the stepwse procedure. The above fgure shows the stepwse procedure we have performed. () data collecton, buldng gp20 benchmark dataset and pre-processng datasets; (2) structural gp20 profle constructon ncludng matchng up and calculatng the 3D dstance between every glycan of the query and template models. (3) the nformaton obtaned n (2) and (3) were combned and normalsed to fall n the nterval [, ] to be fed nto networks; (4) target levels were assgned to each profle (postve, +, for RPs, 0 for SP. for LTNP); (5) a hold-out method, to dvde the combned dataset nto ten subsets (tranng and testng sets); (6) model tranng on each set, to create a model; (7) smulaton of each model on the test set, to obtan predcted outputs; and (8) post-processng to fnd predcted HIV progressor groups. The procedure from (6) to (8) was performed teratvely untl we obtaned the most sutable kernel and the optmal hyperparameters for SV-HMM for gp20 benchmark dataset. varatons by the three dsease groups, wth the excepton of glycan N23 n the C3 regon, where there were more structural glycan dfferences n the LTNP models (Fgure, rght-hand). When we compared the average structural varaton observed, glycan stes 5, 29, 243, 254 were more structurally varable (>50%). The gp20 proten exsts as a trmer and the nner doman of gp20 (coloured grey) faces the trmer axs (left-hand). The outer doman (coloured blue) faces away from the trmer axs and s exposed on the envelope surface. Coloured red spheres represent the glycosylaton stes on our template structure. The V3, V4 and V5 loop regons are annotated. The CD4 molecule (yellow) that bnds to gp20 s ncluded to gve a better perspectve of the proten. Each pe chart (rght-hand) represents the percentage of glycan structural varatons by the LTNP, RP or SP samples at each glycosylaton ste. The area of the pe charts s proportonal to average structural varaton observed. Connectng lnes ndcate Table 2 Model comparson on gp20_benchmark_ dataset Models MSE RP MSE SP MSE LTNP MSE Overall CCI STD TBM SV-HMM HME SVM SVM LIB Decorate SVM MLP RBFN Logstc J The fnal CCI value was calculated based on the average of all the predcton accuraces ob-served durng the tenfold valdaton process. MSE, TBM and STD (ST Devaton obtaned from ten sub-samples) ndcate the accuracy, complexty and stablty of the model respectvely. the prmary (sequence) orderng of the glycosylaton stes. Dscusson A predcton accuracy of 67 to 69% was acheved n the classfcaton of HIV envelope sequences from RP, SP and LTNP patents. The structural glycan proflng technque was desgned to encapsulate the frequent and vared structural changes to glycosylaton stes observed n HIV envelope gp20. Structural glycan modfcatons from the entre envelope gp20 were characterzed at the same tme to provde a macroscopc vew of the carbohydrate landscape. Interactons between glycosylaton stes were analysed by the dvson of the glycan nput space nto smaller and more manageable problems, usng the modular kernel desgn adopted n herarchcal kernel mxture archtecture. These partal solutons were then ntegrated to yeld an overall soluton to the whole problem. The modular approach enforces constant error flow through the nternal states of neural-network unts. By dong so, we were able to study nteractons between glycosylaton stes that are located far apart n the sequence, whch were not possble usng tradtonal machne-learnng methods. Whle the framework was successful n dentfyng crtcal changes to glycan profles for dfferent dsease progresson rates, we were lmted to the number of avalable patent samples that were well characterzed clncally. Havng more SP sequences n our dataset also meant that we had to address the potental bas n our learnng model. Another reason for the reducton of predctve power for HIV dsease progresson could be due to the absence of structural nformaton from the V-V2 loops of gp20. Durng the crystallzaton process, the V-V2 loops were normally deleted n order to promote better

9 Page 9 of 0 crystallzaton. Thus, no V-V2 structural nformaton s avalable to date. Unfortunately, the V and V2 glycans are mportant to dsease progresson. Structurally, the V and V2 glycans protect the brdgng sheet between the nner and outer doman of HIV envelope gp20 [27], and nfluence AIDS progresson through hostreceptor modulaton durng an nfecton. Any changes to the V2 glycan can potentally restrct the capacty of HIV- to replcate [28]. Wthout the correct glycosylaton profle from the V V2 loops, we can only, at best, develop a partal mappng of the envelope carbohydrate landscape to determne AIDS progresson. Durng the glycan proflng, the V3 regon of our sequence was not ncluded n the analyses, as the template crystal structure (PDB: 2B4C) was not glycosylated n the V3 regon. Ths mght also have partally dsadvantaged the model s predcton power, as the V3 regon s known to ndrectly nfluence dsease progresson rates. When HIV nfects a host cell, the gp20 proten frst attaches to a CD4 molecule, followed by the bndng to ether the CCR5 or CXCR4 co receptor [29,30]. Vral strans that preferentally bnd to CCR5 are less pathogenc, whle vruses that prefer CXCR4 are more vrulent and are assocated wth faster dsease progresson [3]. The ablty to use the CXCR4 co-receptor s nfluenced by the absence of the glycan n the V3 regon only [32], whereas the use of CCR5 co-receptor s collectvely nfluenced by the amno acd makeup and glycan profles n the V, V2 and V3 regons all together [32,33]. Thus, the occluson of V3 glycans from our study mght have partally weakened our predcton results. On the contrary, the stable predcton rate wthout the nfluence of V, V2 and V3 glycans showed that the other glycans outsde these regons are mportant to dsease progresson. Ths observaton s consstent wth several prevously reported fndngs. For example, glycan modfcaton wthn the C3 regon can affect vral fusogencty and entry knetcs [6], whle both V4 [34] and C4 [7] glycans can affect the CD4 bndng. Concluson Ths paper addressed two mportant ssues n the predcton of AIDS dsease progresson. Frst, t provdes a formal framework for researchers to understand the effect of whole envelope structural glycan modfcaton aganst phenotypc changes to vral pathogeness, lke receptor bndng ablty, replcaton effcency and nfectvty predctons. It s the frst study n the lterature that uses the structural glycan nformaton n the analyss of HIV whch was made possble by the avalablty of the HIV envelope crystal structures and sophstcated homology modellng protocols. Second, our novel framework uses sem-parameterzed, support-vector asssted herarchcal mxture model, whch s able to effectvely explot the nformaton of non-local nteractons wth strong resstance to vanshng-gradent and hgh-dmensonalty problems. Ths also provded a way of fne-tunng the model by the adjustment of hyper-parameters as well as provdng effcent sem-parametrc approxmaton. Wth the newly bult gp20-benchmark_ dataset, our novel framework whch uses SV-HMM and HIV structural gp20 profle has set an mportant benchmark for the computatonal predcton of AIDS dsease progresson. Lst of abbrevaton used AIDS: Acqured Immune Defcency Syndrome; ANOVA: The Analyss Of Varance; CCI: Correctly Classfed Instances); HIV: Human Immunodefcency Vrus; HKMM: Herarchcal Kernel Mxture Model; HME: Herarchcal Mxture of Experts; J48: Decson Trees; MLP: Mult-Layered Perceptron; PDB: Proten Data Bank; RBFN: Radal Bass Functon Network; RP: Rapd Progressors; SP: Slow Progressors; STD: ST Devaton; STNP: Long Term Non-Progressors; SVM: Support Vector Machne; TBM: Tme to Buld Model. Competng nterests The authors declare that there s no competng nterests. Authors contrbutons PDY developed and mplemented the new kernel model (HKMM). YSH and PDY developed glycol-profle and gp20 dataset, and drafted the manuscrpt. YSH, JN, PY nterpreted the results wth PDY. NKS, MC and AYZ edted the manuscrpt and offered much advces and nsght durng the project. Acknowledgement Ths work s supported by KUSTAR, and Centres for Dstrbuted and Hgh Performance Computng, Mathematcal Bology & Sydney Bonformatcs, Unversty of Sydney, Australa. Ths artcle has been publshed as part of BMC Genomcs Volume Supplement 4, 200: Nnth Internatonal Conference on Bonformatcs (InCoB200): Computatonal Bology. The full contents of the supplement are avalable onlne at Author detals Centre for Dstrbuted and Hgh Performance Computng, Unversty of Sydney, NSW 2006, Australa. 2 Department of Computer Engneerng, Khalfa Unversty of Scence, Technology and Research (KUSTAR), Abu Dhab, U.A.E. 3 Retrovral Genetc Laboratory, Centre for Vrus Research, Westmead Mllennum Insttute, Westmead, NSW 245, Australa. 4 Etsalat Brtsh Telecom Innovaton Centre (EBTIC), P.O. Box 27788, Abu Dhab, U.A.E. 5 Centre for Mathematcal Bology & Sydney Bonformatcs, Unversty of Sydney, NSW 2006, Australa. 6 Natonal ICT Australa, Australan Technology Park, Evelegh, NSW 205, Australa. Publshed: 2 December 200 References. Wyatt R, Kwong PD, Desjardns E, Sweet RW, Robnson J, Hendrckson WA, Sodrosk JG: The antgenc structure of the HIV gp20 envelope glycoproten. Nature 998, 393(6686): Kwong PD, Wyatt R, Sattentau QJ, Sodrosk J, Hendrckson WA: Olgomerc modelng and electrostatc analyss of the gp20 envelope glycoproten of human mmunodefcency vrus. J Vrol 2000, 74(4): We X, Decker JM, Wang S, Hu H, Kappes JC, Wu X, Salazar-Gonzalez JF, Salazar MG, Klby JM, Saag MS, Komarova NL, Nowak MA, Hahn BH, Kwong PD, Shaw GM: Antbody neutralzaton and escape by HIV-. Nature 2003, 422(6929): Sagar M, Wu X, Lee S, Overbaugh J: Human mmunodefcency vrus type V-V2 envelope loop sequences expand and add glycosylaton stes over the course of nfecton and these modfcatons affect antbody neutralzaton senstvty. J Vrol 2006, 80(9):

10 Page 0 of 0 5. Chohan B, Lang D, Sagar M, Korber B, Lavreys L, Rchardson B, Overbaugh J: Selecton for human mmunodefcency vrus type envelope glycosylaton varants wth shorter V-V2 loop sequences occurs durng transmsson of certan genetc subtypes and may mpact vral RNA levels. J Vrol 2005, 79(0): Sterjovsk J, Churchll MJ, Ellett A, Gray LR, Roche MJ, Dunfee RL, Purcell DF, Saksena NK, Wang B, Sonza S, Wesselngh SL, Karlsson I, Fenyo EM, Gabuzda D, Cunnngham AL, Gorry PR: Asn 362 n gp20 contrbutes to enhanced fusogencty by CCR5-restrcted HIV- envelope glycoproten varants from patents wth AIDS. Retrovrology 2007, 4: L H, Xu CF, Blas S, Wan Q, Zhang HT, Landry SJ, Hoe CE: Proxmal glycans outsde of the eptopes regulate the presentaton of HIV- envelope gp20 helper eptopes. J Immunol 2009, 82(0): Scanlan CN, Offer J, Ztzmann N, Dwek RA: Explotng the defensve sugars of HIV- for drug and vaccne desgn. Nature 2007, 446(739): Ho YS, Abecass AB, Theys K, Deforche K, Dwyer DE, Charleston M, Vandamme AM, Saksena NK: HIV- gp20 N-lnked glycosylaton dffers between plasma and leukocyte compartments. Vrol J 2008, 5:4. 0. Poon AF, Lews FI, Pond SL, Frost SD: Evolutonary nteractons between N-lnked glycosylaton stes n the HIV- envelope. PLoS Comput Bo 2007, 3():e.. Saksena NK, Rodes B, Wang B, Sorano V: Elte HIV controllers: myth or realty? AIDS Rev 2007, 9(4): Ben-Hur A, Ong CS, Sonnenburg S, Scholkopf B, Ratsch G: Support vector machnes and kernels for computatonal bology. PLoS Comput Bol 2008, 4(0):e Hochreter S: The vanshng gradent problem durng learnng recurrent neural nets and problem solutons. Internatonal Journal of Uncertanty Fuzzness and Knowledge-Based Systems 998, 6(2): Hochreter S, Informatk F, Bengo Y, Frascon P, Schmdhuber J, Kolen J, Kremer S: Gradent Flow n Recurrent Nets: the Dffculty of Learnng Long-Term Dependences. In Feld Gude to Dynamcal Recurrent Networks IEEE Bengo Y, Smard P, Frascon P: Learnng long-term dependences wth gradent descent s dffcult. IEEE Trans Neural Netw 994, 5(2): Jordan MI, Jacobs RA: Herarchcal mxtures of experts and the EM algorthm. Neural Comp 994, 6: Vapnk V: The Nature of Statstcal Learnng Theory. Sprnger-Verlag; Detterch TG, Bakr G: Machne Learnng bas statstcal bas and statstcal varance of decson tree algorthms. Dept. Comput. Sc., Oregon State Unv., Corvalles, Tech. Rep Larose DT: Dscoverng Knowledge n Data. Wley Okabe A, Boots B, Sughara K, Chu SN: Spatal Tessellatons - Concepts and Applcatons of Vorono Dagrams. John Wley, , Joachms T: Transductve Inference for Text Classfcaton usng Support Vector Machnes. In Internatonal Conference on Machne Learnng (ICML), Bled, Slovena Melvlle P, Mooney RJ: In Constructng Dverse Classfer Ensembles usng Artfcal Tranng Examples. IJACI, Acapulco, Mexco 2003, Haykn S: Neural Networks: A Comprehensve Foundaton. Prentce Hall, Yee P, Haykn S: Regularzed Radal Bass Functon Networks: Theory and Applcatons. John Wley & Sons: Canada; Bshop CM: Pattern Recognton and Machne Learnng (Informaton Scence and Statstcs). Sprnger: Cambrrdge UK; Qunlan JR: C4.5: Programs for Machne Learnng. Morgan Kaufmann Kwong PD, Wyatt R, Robnson J, Sweet RW, Sodrosk J, Hendrckson WA: Structure of an HIV gp20 envelope glycoproten n complex wth the CD4 receptor and a neutralzng human antbody. Nature 998, 393(6686): Shoda T, Oka S, Xn X, Lu H, Harukun R, Kurotan A, Fukushma M, Hasan MK, Shno T, Takebe Y, Iwamoto A, Naga Y: In vvo sequence varablty of human mmunodefcency vrus type envelope gp20: assocaton of V2 extenson wth slow dsease progresson. J Vrol 997, 7(7): Berger EA: HIV entry and tropsm: the chemokne receptor connecton. AIDS 997, (SupplA):S Zhang YJ, Moore JP: Wll multple coreceptors need to be targeted by nhbtors of human mmunodefcency vrus type entry? J Vrol 999, 73(4): Mascotra S, Owen SM, Rudolph D, Yang C, Wang B, Saksena N, Spra T, Dhawan S, Lal RB: Temporal relatonshp between VV2 varaton macrophage replcaton and coreceptor adaptaton durng HIV- dsease progresson. AIDS 2002, 6(4): Nabatov AA, Pollaks G, Lnnemann T, Klphus A, Chalaby MI, Paxton WA: Intrapatent alteratons n the human mmunodefcency vrus type gp20 VV2 and V3 regons dfferentally modulate coreceptor usage vrus nhbton by CC/CXC chemoknes soluble CD4 and the b2 and 2G2 monoclonal antbodes. J Vrol 2004, 78(): Ly A, Stamatatos L: V2 loop glycosylaton of the human mmunodefcency vrus type SF62 envelope facltates nteracton of ths proten wth CD4 and CCR5 receptors and protects the vrus from neutralzaton by ant-v3 loop and ant-cd4 bndng ste antbodes. J Vrol 2000, 74(5): Pantophlet R, Ollmann Saphre E, Pognard P, Parren PW, Wlson IA, Burton DR: Fne mappng of the nteracton of neutralzng and nonneutralzng monoclonal antbodes wth the CD4 bndng ste of human mmunodefcency vrus type gp20. J Vrol 2003, 77(): Ben-dor S, Esterman N, et al: Bases and complex patterns n the resdues flankng proten N-glycosylaton stes. Glycobology 2004, 4: Shakn-Eshleman SH: Sptalnk SL: et al: The amno acd at the X poston of an Asn-X-Ser sequon s an mportant determnant of N-lnked coreglycosylaton effcency. J Bol Chem 996, 27: Yoo PD, Ho S, Zhou BB, Zomaya AY: SteSeek: Proten Post-Translatonal Modfcaton Analyss Usng Adaptve Localty-Effectve Kernel Methods and New Profles. BMC Bonformatcs 2008, 9: Bald P, Brunak S: Bonformatcs: The machne learnng approach 2nd ed MIT Press 200 Cambrdge Mass., Jones DT: Proten secondary structure predcton based on postonspecfc scorng matrces. J Molecular Bology 999, 292(2): Gnalsk K, Elofsson A, Fscher D, Rychlewsk L: 3D-Jury: a smple approach to mprove proten structure predctons. Bonformatcs 2003, 9(8): Sm J, Km SY, Lee J: PRODO: Predcton of Proten Doman Boundares usng Neural Networks. Protens 2005, 59: Skder AR, Zomaya AY: Improvng the performance of DomanDscovery of proten doman boundary assgnment usng nter-doman lnker ndex. BMC Bonformatcs 2006, 7(5):S6. do:0.86/ s4-s22 Cte ths artcle as: Yoo et al.: Herarchcal kernel mxture models for the predcton of AIDS dsease progresson usng HIV structural gp20 profles. BMC Genomcs 200 (Suppl 4):S22. Submt your next manuscrpt to BoMed Central and take full advantage of: Convenent onlne submsson Thorough peer revew No space constrants or color fgure charges Immedate publcaton on acceptance Incluson n PubMed, CAS, Scopus and Google Scholar Research whch s freely avalable for redstrbuton Submt your manuscrpt at

Copy Number Variation Methods and Data

Copy Number Variation Methods and Data Copy Number Varaton Methods and Data Copy number varaton (CNV) Reference Sequence ACCTGCAATGAT TAAGCCCGGG TTGCAACGTTAGGCA Populaton ACCTGCAATGAT TAAGCCCGGG TTGCAACGTTAGGCA ACCTGCAATGAT TTGCAACGTTAGGCA

More information

Modeling Multi Layer Feed-forward Neural. Network Model on the Influence of Hypertension. and Diabetes Mellitus on Family History of

Modeling Multi Layer Feed-forward Neural. Network Model on the Influence of Hypertension. and Diabetes Mellitus on Family History of Appled Mathematcal Scences, Vol. 7, 2013, no. 41, 2047-2053 HIKARI Ltd, www.m-hkar.com Modelng Mult Layer Feed-forward Neural Network Model on the Influence of Hypertenson and Dabetes Melltus on Famly

More information

Study and Comparison of Various Techniques of Image Edge Detection

Study and Comparison of Various Techniques of Image Edge Detection Gureet Sngh et al Int. Journal of Engneerng Research Applcatons RESEARCH ARTICLE OPEN ACCESS Study Comparson of Varous Technques of Image Edge Detecton Gureet Sngh*, Er. Harnder sngh** *(Department of

More information

Joint Modelling Approaches in diabetes research. Francisco Gude Clinical Epidemiology Unit, Hospital Clínico Universitario de Santiago

Joint Modelling Approaches in diabetes research. Francisco Gude Clinical Epidemiology Unit, Hospital Clínico Universitario de Santiago Jont Modellng Approaches n dabetes research Clncal Epdemology Unt, Hosptal Clínco Unverstaro de Santago Outlne 1 Dabetes 2 Our research 3 Some applcatons Dabetes melltus Is a serous lfe-long health condton

More information

Physical Model for the Evolution of the Genetic Code

Physical Model for the Evolution of the Genetic Code Physcal Model for the Evoluton of the Genetc Code Tatsuro Yamashta Osamu Narkyo Department of Physcs, Kyushu Unversty, Fukuoka 8-856, Japan Abstract We propose a physcal model to descrbe the mechansms

More information

Optimal Planning of Charging Station for Phased Electric Vehicle *

Optimal Planning of Charging Station for Phased Electric Vehicle * Energy and Power Engneerng, 2013, 5, 1393-1397 do:10.4236/epe.2013.54b264 Publshed Onlne July 2013 (http://www.scrp.org/ournal/epe) Optmal Plannng of Chargng Staton for Phased Electrc Vehcle * Yang Gao,

More information

AN ENHANCED GAGS BASED MTSVSL LEARNING TECHNIQUE FOR CANCER MOLECULAR PATTERN PREDICTION OF CANCER CLASSIFICATION

AN ENHANCED GAGS BASED MTSVSL LEARNING TECHNIQUE FOR CANCER MOLECULAR PATTERN PREDICTION OF CANCER CLASSIFICATION www.arpapress.com/volumes/vol8issue2/ijrras_8_2_02.pdf AN ENHANCED GAGS BASED MTSVSL LEARNING TECHNIQUE FOR CANCER MOLECULAR PATTERN PREDICTION OF CANCER CLASSIFICATION I. Jule 1 & E. Krubakaran 2 1 Department

More information

CONSTRUCTION OF STOCHASTIC MODEL FOR TIME TO DENGUE VIRUS TRANSMISSION WITH EXPONENTIAL DISTRIBUTION

CONSTRUCTION OF STOCHASTIC MODEL FOR TIME TO DENGUE VIRUS TRANSMISSION WITH EXPONENTIAL DISTRIBUTION Internatonal Journal of Pure and Appled Mathematcal Scences. ISSN 97-988 Volume, Number (7), pp. 3- Research Inda Publcatons http://www.rpublcaton.com ONSTRUTION OF STOHASTI MODEL FOR TIME TO DENGUE VIRUS

More information

International Journal of Emerging Technologies in Computational and Applied Sciences (IJETCAS)

International Journal of Emerging Technologies in Computational and Applied Sciences (IJETCAS) Internatonal Assocaton of Scentfc Innovaton and Research (IASIR (An Assocaton Unfyng the Scences, Engneerng, and Appled Research Internatonal Journal of Emergng Technologes n Computatonal and Appled Scences

More information

Gene Selection Based on Mutual Information for the Classification of Multi-class Cancer

Gene Selection Based on Mutual Information for the Classification of Multi-class Cancer Gene Selecton Based on Mutual Informaton for the Classfcaton of Mult-class Cancer Sheng-Bo Guo,, Mchael R. Lyu 3, and Tat-Mng Lok 4 Department of Automaton, Unversty of Scence and Technology of Chna, Hefe,

More information

Parameter Estimates of a Random Regression Test Day Model for First Three Lactation Somatic Cell Scores

Parameter Estimates of a Random Regression Test Day Model for First Three Lactation Somatic Cell Scores Parameter Estmates of a Random Regresson Test Day Model for Frst Three actaton Somatc Cell Scores Z. u, F. Renhardt and R. Reents Unted Datasystems for Anmal Producton (VIT), Hedeweg 1, D-27280 Verden,

More information

A New Machine Learning Algorithm for Breast and Pectoral Muscle Segmentation

A New Machine Learning Algorithm for Breast and Pectoral Muscle Segmentation Avalable onlne www.ejaet.com European Journal of Advances n Engneerng and Technology, 2015, 2(1): 21-29 Research Artcle ISSN: 2394-658X A New Machne Learnng Algorthm for Breast and Pectoral Muscle Segmentaton

More information

Project title: Mathematical Models of Fish Populations in Marine Reserves

Project title: Mathematical Models of Fish Populations in Marine Reserves Applcaton for Fundng (Malaspna Research Fund) Date: November 0, 2005 Project ttle: Mathematcal Models of Fsh Populatons n Marne Reserves Dr. Lev V. Idels Unversty College Professor Mathematcs Department

More information

Using the Perpendicular Distance to the Nearest Fracture as a Proxy for Conventional Fracture Spacing Measures

Using the Perpendicular Distance to the Nearest Fracture as a Proxy for Conventional Fracture Spacing Measures Usng the Perpendcular Dstance to the Nearest Fracture as a Proxy for Conventonal Fracture Spacng Measures Erc B. Nven and Clayton V. Deutsch Dscrete fracture network smulaton ams to reproduce dstrbutons

More information

Comparison among Feature Encoding Techniques for HIV-1 Protease Cleavage Specificity

Comparison among Feature Encoding Techniques for HIV-1 Protease Cleavage Specificity Internatonal Journal of Intellgent Systems and Applcatons n Engneerng Advanced Technology and Scence ISSN:2147-67992147-6799 http://jsae.atscence.org/ Orgnal Research Paper Comparson among Feature Encodng

More information

Lymphoma Cancer Classification Using Genetic Programming with SNR Features

Lymphoma Cancer Classification Using Genetic Programming with SNR Features Lymphoma Cancer Classfcaton Usng Genetc Programmng wth SNR Features Jn-Hyuk Hong and Sung-Bae Cho Dept. of Computer Scence, Yonse Unversty, 134 Shnchon-dong, Sudaemoon-ku, Seoul 120-749, Korea hjnh@candy.yonse.ac.kr,

More information

AUTOMATED DETECTION OF HARD EXUDATES IN FUNDUS IMAGES USING IMPROVED OTSU THRESHOLDING AND SVM

AUTOMATED DETECTION OF HARD EXUDATES IN FUNDUS IMAGES USING IMPROVED OTSU THRESHOLDING AND SVM AUTOMATED DETECTION OF HARD EXUDATES IN FUNDUS IMAGES USING IMPROVED OTSU THRESHOLDING AND SVM Wewe Gao 1 and Jng Zuo 2 1 College of Mechancal Engneerng, Shangha Unversty of Engneerng Scence, Shangha,

More information

Prediction of Total Pressure Drop in Stenotic Coronary Arteries with Their Geometric Parameters

Prediction of Total Pressure Drop in Stenotic Coronary Arteries with Their Geometric Parameters Tenth Internatonal Conference on Computatonal Flud Dynamcs (ICCFD10), Barcelona, Span, July 9-13, 2018 ICCFD10-227 Predcton of Total Pressure Drop n Stenotc Coronary Arteres wth Ther Geometrc Parameters

More information

Biomarker Selection from Gene Expression Data for Tumour Categorization Using Bat Algorithm

Biomarker Selection from Gene Expression Data for Tumour Categorization Using Bat Algorithm Receved: March 20, 2017 401 Bomarker Selecton from Gene Expresson Data for Tumour Categorzaton Usng Bat Algorthm Gunavath Chellamuthu 1 *, Premalatha Kandasamy 2, Svasubramanan Kanagaraj 3 1 School of

More information

Modeling the Survival of Retrospective Clinical Data from Prostate Cancer Patients in Komfo Anokye Teaching Hospital, Ghana

Modeling the Survival of Retrospective Clinical Data from Prostate Cancer Patients in Komfo Anokye Teaching Hospital, Ghana Internatonal Journal of Appled Scence and Technology Vol. 5, No. 6; December 2015 Modelng the Survval of Retrospectve Clncal Data from Prostate Cancer Patents n Komfo Anokye Teachng Hosptal, Ghana Asedu-Addo,

More information

310 Int'l Conf. Par. and Dist. Proc. Tech. and Appl. PDPTA'16

310 Int'l Conf. Par. and Dist. Proc. Tech. and Appl. PDPTA'16 310 Int'l Conf. Par. and Dst. Proc. Tech. and Appl. PDPTA'16 Akra Sasatan and Hrosh Ish Graduate School of Informaton and Telecommuncaton Engneerng, Toka Unversty, Mnato, Tokyo, Japan Abstract The end-to-end

More information

Introduction ORIGINAL RESEARCH

Introduction ORIGINAL RESEARCH ORIGINAL RESEARCH Assessng the Statstcal Sgnfcance of the Acheved Classfcaton Error of Classfers Constructed usng Serum Peptde Profles, and a Prescrpton for Random Samplng Repeated Studes for Massve Hgh-Throughput

More information

A MIXTURE OF EXPERTS FOR CATARACT DIAGNOSIS IN HOSPITAL SCREENING DATA

A MIXTURE OF EXPERTS FOR CATARACT DIAGNOSIS IN HOSPITAL SCREENING DATA Journal of Theoretcal and Appled Informaton Technology 2005 ongong JATIT & LLS ISSN: 1992-8645 www.jatt.org E-ISSN: 1817-3195 A MIXTURE OF EXPERTS FOR CATARACT DIAGNOSIS IN HOSPITAL SCREENING DATA 1 SUNGMIN

More information

Using Past Queries for Resource Selection in Distributed Information Retrieval

Using Past Queries for Resource Selection in Distributed Information Retrieval Purdue Unversty Purdue e-pubs Department of Computer Scence Techncal Reports Department of Computer Scence 2011 Usng Past Queres for Resource Selecton n Dstrbuted Informaton Retreval Sulleyman Cetntas

More information

IMPROVING THE EFFICIENCY OF BIOMARKER IDENTIFICATION USING BIOLOGICAL KNOWLEDGE

IMPROVING THE EFFICIENCY OF BIOMARKER IDENTIFICATION USING BIOLOGICAL KNOWLEDGE IMPROVING THE EFFICIENCY OF BIOMARKER IDENTIFICATION USING BIOLOGICAL KNOWLEDGE JOHN H. PHAN The Wallace H. Coulter Department of Bomedcal Engneerng, Georga Insttute of Technology, 313 Ferst Drve Atlanta,

More information

The Influence of the Isomerization Reactions on the Soybean Oil Hydrogenation Process

The Influence of the Isomerization Reactions on the Soybean Oil Hydrogenation Process Unversty of Belgrade From the SelectedWorks of Zeljko D Cupc 2000 The Influence of the Isomerzaton Reactons on the Soybean Ol Hydrogenaton Process Zeljko D Cupc, Insttute of Chemstry, Technology and Metallurgy

More information

Investigation of zinc oxide thin film by spectroscopic ellipsometry

Investigation of zinc oxide thin film by spectroscopic ellipsometry VNU Journal of Scence, Mathematcs - Physcs 24 (2008) 16-23 Investgaton of znc oxde thn flm by spectroscopc ellpsometry Nguyen Nang Dnh 1, Tran Quang Trung 2, Le Khac Bnh 2, Nguyen Dang Khoa 2, Vo Th Ma

More information

A Linear Regression Model to Detect User Emotion for Touch Input Interactive Systems

A Linear Regression Model to Detect User Emotion for Touch Input Interactive Systems 2015 Internatonal Conference on Affectve Computng and Intellgent Interacton (ACII) A Lnear Regresson Model to Detect User Emoton for Touch Input Interactve Systems Samt Bhattacharya Dept of Computer Scence

More information

CLUSTERING is always popular in modern technology

CLUSTERING is always popular in modern technology Max-Entropy Feed-Forward Clusterng Neural Network Han Xao, Xaoyan Zhu arxv:1506.03623v1 [cs.lg] 11 Jun 2015 Abstract The outputs of non-lnear feed-forward neural network are postve, whch could be treated

More information

This article appeared in a journal published by Elsevier. The attached copy is furnished to the author for internal non-commercial research and

This article appeared in a journal published by Elsevier. The attached copy is furnished to the author for internal non-commercial research and Ths artcle appeared n a journal publshed by Elsever. The attached copy s furnshed to the author for nternal non-commercal research and educaton use, ncludng for nstructon at the authors nsttuton and sharng

More information

Price linkages in value chains: methodology

Price linkages in value chains: methodology Prce lnkages n value chans: methodology Prof. Trond Bjorndal, CEMARE. Unversty of Portsmouth, UK. and Prof. José Fernández-Polanco Unversty of Cantabra, Span. FAO INFOSAMAK Tangers, Morocco 14 March 2012

More information

The Effect of Fish Farmers Association on Technical Efficiency: An Application of Propensity Score Matching Analysis

The Effect of Fish Farmers Association on Technical Efficiency: An Application of Propensity Score Matching Analysis The Effect of Fsh Farmers Assocaton on Techncal Effcency: An Applcaton of Propensty Score Matchng Analyss Onumah E. E, Esslfe F. L, and Asumng-Brempong, S 15 th July, 2016 Background and Motvaton Outlne

More information

THIS IS AN OFFICIAL NH DHHS HEALTH ALERT

THIS IS AN OFFICIAL NH DHHS HEALTH ALERT THIS IS AN OFFICIAL NH DHHS HEALTH ALERT Dstrbuted by the NH Health Alert Network Health.Alert@dhhs.nh.gov August 26, 2016 1430 EDT (2:30 PM EDT) NH-HAN 20160826 Recommendatons for Accurate Dagnoss of

More information

Survival Rate of Patients of Ovarian Cancer: Rough Set Approach

Survival Rate of Patients of Ovarian Cancer: Rough Set Approach Internatonal OEN ACCESS Journal Of Modern Engneerng esearch (IJME) Survval ate of atents of Ovaran Cancer: ough Set Approach Kamn Agrawal 1, ragat Jan 1 Department of Appled Mathematcs, IET, Indore, Inda

More information

LETTERS. Effects of thymic selection of the T-cell repertoire on HLA class I-associated control of HIV infection

LETTERS. Effects of thymic selection of the T-cell repertoire on HLA class I-associated control of HIV infection do:1.138/nature8997 LETTERS Effects of thymc selecton of the T-cell repertore on HLA class I-assocated control of HIV nfecton Andrej Košmrlj 1,2 *, Elzabeth L. Read 1,3,4 *, Yng Q 5, Todd M. Allen 1, Marcus

More information

Balanced Query Methods for Improving OCR-Based Retrieval

Balanced Query Methods for Improving OCR-Based Retrieval Balanced Query Methods for Improvng OCR-Based Retreval Kareem Darwsh Electrcal and Computer Engneerng Dept. Unversty of Maryland, College Park College Park, MD 20742 kareem@glue.umd.edu Douglas W. Oard

More information

Statistically Weighted Voting Analysis of Microarrays for Molecular Pattern Selection and Discovery Cancer Genotypes

Statistically Weighted Voting Analysis of Microarrays for Molecular Pattern Selection and Discovery Cancer Genotypes IJCSNS Internatonal Journal of Computer Scence and Network Securty, VOL.6 No.2, December 26 73 Statstcally Weghted Votng Analyss of Mcroarrays for Molecular Pattern Selecton and Dscovery Cancer Genotypes

More information

BINNING SOMATIC MUTATIONS BASED ON BIOLOGICAL KNOWLEDGE FOR PREDICTING SURVIVAL: AN APPLICATION IN RENAL CELL CARCINOMA

BINNING SOMATIC MUTATIONS BASED ON BIOLOGICAL KNOWLEDGE FOR PREDICTING SURVIVAL: AN APPLICATION IN RENAL CELL CARCINOMA BINNING SOMATIC MUTATIONS BASED ON BIOLOGICAL KNOWLEDGE FOR PREDICTING SURVIVAL: AN APPLICATION IN RENAL CELL CARCINOMA DOKYOON KIM, RUOWANG LI, SCOTT M. DUDEK, JOHN R. WALLACE, MARYLYN D. RITCHIE Center

More information

Incorrect Beliefs. Overconfidence. Types of Overconfidence. Outline. Overprecision 4/22/2015. Econ 1820: Behavioral Economics Mark Dean Spring 2015

Incorrect Beliefs. Overconfidence. Types of Overconfidence. Outline. Overprecision 4/22/2015. Econ 1820: Behavioral Economics Mark Dean Spring 2015 Incorrect Belefs Overconfdence Econ 1820: Behavoral Economcs Mark Dean Sprng 2015 In objectve EU we assumed that everyone agreed on what the probabltes of dfferent events were In subjectve expected utlty

More information

NHS Outcomes Framework

NHS Outcomes Framework NHS Outcomes Framework Doman 1 Preventng people from dyng prematurely Indcator Specfcatons Verson: 1.21 Date: May 2018 Author: Clncal Indcators Team NHS Outcomes Framework: Doman 1 Preventng people from

More information

econstor Make Your Publications Visible.

econstor Make Your Publications Visible. econstor Make Your Publcatons Vsble. A Servce of Wrtschaft Centre zbwlebnz-informatonszentrum Economcs Chang, Huan-Cheng; Chang, Pn-Hsang; Tseng, Sung-Chn; Chang, Ch- Chang; Lu, Yen-Chao Artcle A comparatve

More information

Sparse Representation of HCP Grayordinate Data Reveals. Novel Functional Architecture of Cerebral Cortex

Sparse Representation of HCP Grayordinate Data Reveals. Novel Functional Architecture of Cerebral Cortex 1 Sparse Representaton of HCP Grayordnate Data Reveals Novel Functonal Archtecture of Cerebral Cortex X Jang 1, Xang L 1, Jngle Lv 2,1, Tuo Zhang 2,1, Shu Zhang 1, Le Guo 2, Tanmng Lu 1* 1 Cortcal Archtecture

More information

Subject-Adaptive Real-Time Sleep Stage Classification Based on Conditional Random Field

Subject-Adaptive Real-Time Sleep Stage Classification Based on Conditional Random Field Subject-Adaptve Real-Tme Sleep Stage Classfcaton Based on Condtonal Random Feld Gang Luo, PhD, Wanl Mn, PhD IBM TJ Watson Research Center, Hawthorne, NY {luog, wanlmn}@usbmcom Abstract Sleep stagng s the

More information

Richard Williams Notre Dame Sociology Meetings of the European Survey Research Association Ljubljana,

Richard Williams Notre Dame Sociology   Meetings of the European Survey Research Association Ljubljana, Rchard Wllams Notre Dame Socology rwllam@nd.edu http://www.nd.edu/~rwllam Meetngs of the European Survey Research Assocaton Ljubljana, Slovena July 19, 2013 Comparng Logt and Probt Coeffcents across groups

More information

Estimation for Pavement Performance Curve based on Kyoto Model : A Case Study for Highway in the State of Sao Paulo

Estimation for Pavement Performance Curve based on Kyoto Model : A Case Study for Highway in the State of Sao Paulo Estmaton for Pavement Performance Curve based on Kyoto Model : A Case Study for Kazuya AOKI, PASCO CORPORATION, Yokohama, JAPAN, Emal : kakzo603@pasco.co.jp Octávo de Souza Campos, Publc Servces Regulatory

More information

INTEGRATIVE NETWORK ANALYSIS TO IDENTIFY ABERRANT PATHWAY NETWORKS IN OVARIAN CANCER

INTEGRATIVE NETWORK ANALYSIS TO IDENTIFY ABERRANT PATHWAY NETWORKS IN OVARIAN CANCER INTEGRATIVE NETWORK ANALYSIS TO IDENTIFY ABERRANT PATHWAY NETWORKS IN OVARIAN CANCER LI CHEN 1,2, JIANHUA XUAN 1,*, JINGHUA GU 1, YUE WANG 1, ZHEN ZHANG 2, TIAN LI WANG 2, IE MING SHIH 2 1The Bradley Department

More information

INTRAUTERINE GROWTH RESTRICTION (IUGR) RISK DECISION BASED ON SUPPORT VECTOR MACHINES

INTRAUTERINE GROWTH RESTRICTION (IUGR) RISK DECISION BASED ON SUPPORT VECTOR MACHINES Mathematcal and Computatonal Applcatons, Vol. 15, No. 3, pp. 472-480, 2010. Assocaton for Scentfc Research INTRAUTERINE GROWTH RESTRICTION (IUGR) RISK DECISION BASED ON SUPPORT VECTOR MACHINES Zeynep Zengn

More information

Reconstruction of gene regulatory network of colon cancer using information theoretic approach

Reconstruction of gene regulatory network of colon cancer using information theoretic approach Reconstructon of gene regulatory network of colon cancer usng nformaton theoretc approach Khald Raza #1, Rafat Parveen * # Department of Computer Scence Jama Mlla Islama (Central Unverst, New Delh-11005,

More information

A comparison of statistical methods in interrupted time series analysis to estimate an intervention effect

A comparison of statistical methods in interrupted time series analysis to estimate an intervention effect Peer revew stream A comparson of statstcal methods n nterrupted tme seres analyss to estmate an nterventon effect a,b, J.J.J., Walter c, S., Grzebeta a, R. & Olver b, J. a Transport and Road Safety, Unversty

More information

Nonlinear Modeling Method Based on RBF Neural Network Trained by AFSA with Adaptive Adjustment

Nonlinear Modeling Method Based on RBF Neural Network Trained by AFSA with Adaptive Adjustment Advances n Engneerng Research (AER), volue 48 3rd Workshop on Advanced Research and Technology n Industry Applcatons (WARTIA 27) Nonlnear Modelng Method Based on RBF Neural Network Traned by AFSA wth Adaptve

More information

Journal of Engineering Science and Technology Review 11 (2) (2018) Research Article

Journal of Engineering Science and Technology Review 11 (2) (2018) Research Article Jestr Journal of Engneerng Scence and Technology Revew () (08) 5 - Research Artcle Prognoss Evaluaton of Ovaran Granulosa Cell Tumor Based on Co-forest ntellgence Model Xn Lao Xn Zheng Juan Zou Mn Feng

More information

Fast Algorithm for Vectorcardiogram and Interbeat Intervals Analysis: Application for Premature Ventricular Contractions Classification

Fast Algorithm for Vectorcardiogram and Interbeat Intervals Analysis: Application for Premature Ventricular Contractions Classification Fast Algorthm for Vectorcardogram and Interbeat Intervals Analyss: Applcaton for Premature Ventrcular Contractons Classfcaton Irena Jekova, Vessela Krasteva Centre of Bomedcal Engneerng Prof. Ivan Daskalov

More information

NUMERICAL COMPARISONS OF BIOASSAY METHODS IN ESTIMATING LC50 TIANHONG ZHOU

NUMERICAL COMPARISONS OF BIOASSAY METHODS IN ESTIMATING LC50 TIANHONG ZHOU NUMERICAL COMPARISONS OF BIOASSAY METHODS IN ESTIMATING LC50 by TIANHONG ZHOU B.S., Chna Agrcultural Unversty, 2003 M.S., Chna Agrcultural Unversty, 2006 A THESIS submtted n partal fulfllment of the requrements

More information

ARTICLE IN PRESS Neuropsychologia xxx (2010) xxx xxx

ARTICLE IN PRESS Neuropsychologia xxx (2010) xxx xxx Neuropsychologa xxx (200) xxx xxx Contents lsts avalable at ScenceDrect Neuropsychologa journal homepage: www.elsever.com/locate/neuropsychologa Storage and bndng of object features n vsual workng memory

More information

EVALUATION OF BULK MODULUS AND RING DIAMETER OF SOME TELLURITE GLASS SYSTEMS

EVALUATION OF BULK MODULUS AND RING DIAMETER OF SOME TELLURITE GLASS SYSTEMS Chalcogende Letters Vol. 12, No. 2, February 2015, p. 67-74 EVALUATION OF BULK MODULUS AND RING DIAMETER OF SOME TELLURITE GLASS SYSTEMS R. EL-MALLAWANY a*, M.S. GAAFAR b, N. VEERAIAH c a Physcs Dept.,

More information

Incorporating prior biological knowledge for network-based differential gene expression analysis using differentially weighted graphical LASSO

Incorporating prior biological knowledge for network-based differential gene expression analysis using differentially weighted graphical LASSO Zuo et al. BMC Bonformatcs (2017) 18:99 DOI 10.1186/s12859-017-1515-1 METHODOLOGY ARTICLE Open Access Incorporatng pror bologcal knowledge for network-based dfferental gene expresson analyss usng dfferentally

More information

Journal of Engineering Science and Technology Review 11 (2) (2018) Research Article

Journal of Engineering Science and Technology Review 11 (2) (2018) Research Article Jestr Journal of Engneerng Scence and Technology Revew 11 (2) (2018) 8-12 Research Artcle Detecton Lung Cancer Usng Gray Level Co-Occurrence Matrx (GLCM) and Back Propagaton Neural Network Classfcaton

More information

Classification of Breast Tumor in Mammogram Images Using Unsupervised Feature Learning

Classification of Breast Tumor in Mammogram Images Using Unsupervised Feature Learning Amercan Journal of Appled Scences Orgnal Research Paper Classfcaton of Breast Tumor n Mammogram Images Usng Unsupervsed Feature Learnng 1 Adarus M. Ibrahm, 1 Baharum Baharudn, 1 Abas Md Sad and 2 P.N.

More information

Evaluation of the generalized gamma as a tool for treatment planning optimization

Evaluation of the generalized gamma as a tool for treatment planning optimization Internatonal Journal of Cancer Therapy and Oncology www.jcto.org Evaluaton of the generalzed gamma as a tool for treatment plannng optmzaton Emmanoul I Petrou 1,, Ganesh Narayanasamy 3, Eleftheros Lavdas

More information

A Geometric Approach To Fully Automatic Chromosome Segmentation

A Geometric Approach To Fully Automatic Chromosome Segmentation A Geometrc Approach To Fully Automatc Chromosome Segmentaton Shervn Mnaee ECE Department New York Unversty Brooklyn, New York, USA shervn.mnaee@nyu.edu Mehran Fotouh Computer Engneerng Department Sharf

More information

Arithmetic Average: Sum of all precipitation values divided by the number of stations 1 n

Arithmetic Average: Sum of all precipitation values divided by the number of stations 1 n Char of ssgnment Suggested soluton PRCIPITTION Task (Charactersaton of the study area) rea: 43.3 km 2 Rver length: 0.296 km Hghest pont: 346 m a.s.l. Lowest pont (staton elevaton): 668 m a.s.l. Domnant

More information

Feature Selection for Predicting Tumor Metastases in Microarray Experiments using Paired Design

Feature Selection for Predicting Tumor Metastases in Microarray Experiments using Paired Design Feature Selecton for Predctng Tumor Metastases n Mcroarray Experments usng Pared Desgn Qhua Tan 1,2, Mads Thomassen 1 and Torben A. Kruse 1 ORIGINAL RESEARCH 1 Department of Bochemstry, Pharmacology and

More information

*VALLIAPPAN Raman 1, PUTRA Sumari 2 and MANDAVA Rajeswari 3. George town, Penang 11800, Malaysia. George town, Penang 11800, Malaysia

*VALLIAPPAN Raman 1, PUTRA Sumari 2 and MANDAVA Rajeswari 3. George town, Penang 11800, Malaysia. George town, Penang 11800, Malaysia 38 A Theoretcal Methodology and Prototype Implementaton for Detecton Segmentaton Classfcaton of Dgtal Mammogram Tumor by Machne Learnng and Problem Solvng *VALLIAPPA Raman, PUTRA Sumar 2 and MADAVA Rajeswar

More information

Machine Understanding - a new area of research aimed at building thinking/understanding machines

Machine Understanding - a new area of research aimed at building thinking/understanding machines achne Understandng - a new area of research amed at buldng thnkng/understandng machnes Zbgnew Les and agdalena Les St. Queen Jadwga Research Insttute of Understandng, elbourne, Australa sqru@outlook.com

More information

Comparison of support vector machine based on genetic algorithm with logistic regression to diagnose obstructive sleep apnea

Comparison of support vector machine based on genetic algorithm with logistic regression to diagnose obstructive sleep apnea Orgnal Artcle Comparson of support vector machne based on genetc algorthm wth logstc regresson to dagnose obstructve sleep apnea Zohreh Manoochehr, Nader Salar 1, Mansour Rezae 1, Habbolah Khazae 2, Sara

More information

Estimation of Relative Survival Based on Cancer Registry Data

Estimation of Relative Survival Based on Cancer Registry Data Revew of Bonformatcs and Bometrcs (RBB) Volume 2 Issue 4, December 203 www.sepub.org/rbb Estmaton of Relatve Based on Cancer Regstry Data Olaf Schoffer *, Ante Nedostate 2, Stefane J. Klug,2 Cancer Epdemology,

More information

Insights in Genetics and Genomics

Insights in Genetics and Genomics Insghts n Genetcs and Genomcs Research Artcle Open Access New Score Tests for Equalty of Varances n the Applcaton of DNA Methylaton Data Analyss [Verson ] Welang Qu Xuan L Jarrett Morrow Dawn L DeMeo Scott

More information

4.2 Scheduling to Minimize Maximum Lateness

4.2 Scheduling to Minimize Maximum Lateness 4. Schedulng to Mnmze Maxmum Lateness Schedulng to Mnmzng Maxmum Lateness Mnmzng lateness problem. Sngle resource processes one ob at a tme. Job requres t unts of processng tme and s due at tme d. If starts

More information

ARTICLE IN PRESS. computer methods and programs in biomedicine xxx (2007) xxx xxx. journal homepage:

ARTICLE IN PRESS. computer methods and programs in biomedicine xxx (2007) xxx xxx. journal homepage: computer methods and programs n bomedcne xxx (2007) xxx xxx journal homepage: www.ntl.elseverhealth.com/journals/cmpb Improvng bran tumor characterzaton on MRI by probablstc neural networks and non-lnear

More information

Resampling Methods for the Area Under the ROC Curve

Resampling Methods for the Area Under the ROC Curve Resamplng ethods for the Area Under the ROC Curve Andry I. Bandos AB6@PITT.EDU Howard E. Rockette HERBST@PITT.EDU Department of Bostatstcs, Graduate School of Publc Health, Unversty of Pttsburgh, Pttsburgh,

More information

The Limits of Individual Identification from Sample Allele Frequencies: Theory and Statistical Analysis

The Limits of Individual Identification from Sample Allele Frequencies: Theory and Statistical Analysis The Lmts of Indvdual Identfcaton from Sample Allele Frequences: Theory and Statstcal Analyss Peter M. Vsscher 1 *, Wllam G. Hll 2 1 Queensland Insttute of Medcal Research, Brsbane, Australa, 2 Insttute

More information

Boosting for tumor classification with gene expression data. Seminar für Statistik, ETH Zürich, CH-8092, Switzerland

Boosting for tumor classification with gene expression data. Seminar für Statistik, ETH Zürich, CH-8092, Switzerland BIOINFORMATICS Vol. 19 no. 9 2003, pages 1061 1069 DOI: 10.1093/bonformatcs/btf867 Boostng for tumor classfcaton wth gene expresson data Marcel Dettlng and Peter Bühlmann Semnar für Statstk, ETH Zürch,

More information

Optimal probability weights for estimating causal effects of time-varying treatments with marginal structural Cox models

Optimal probability weights for estimating causal effects of time-varying treatments with marginal structural Cox models Optmal probablty weghts for estmatng causal effects of tme-varyng treatments wth margnal structural Cox models Mchele Santacatterna, Cela García-Pareja Rno Bellocco, Anders Sönnerborg, Anna Ma Ekström

More information

A Support Vector Machine Classifier based on Recursive Feature Elimination for Microarray Data in Breast Cancer Characterization. Abstract.

A Support Vector Machine Classifier based on Recursive Feature Elimination for Microarray Data in Breast Cancer Characterization. Abstract. A Support Vector Machne Classfer based on Recursve Feature Elmnaton for Mcroarray Data n Breast Cancer Characterzaton. R.Campann, D. Dongovann, E. Iamper, N. Lanconell, G. Palermo, M. Roffll, A. Rccard

More information

A-UNIFAC Modeling of Binary and Multicomponent Phase Equilibria of Fatty Esters+Water+Methanol+Glycerol

A-UNIFAC Modeling of Binary and Multicomponent Phase Equilibria of Fatty Esters+Water+Methanol+Glycerol -UNIFC Modelng of Bnary and Multcomponent Phase Equlbra of Fatty Esters+Water+Methanol+Glycerol N. Garrdo a, O. Ferrera b, R. Lugo c, J.-C. de Hemptnne c, M. E. Macedo a, S.B. Bottn d,* a Department of

More information

Efficiency Considerations for the Purely Tapered Interference Fit (TIF) Abutments Used in Dental Implants

Efficiency Considerations for the Purely Tapered Interference Fit (TIF) Abutments Used in Dental Implants Dnçer Bozkaya Graduate Student Snan Müftü* Ph.D., Assocate Professor Northeastern Unversty, Department of Mechancal Engneerng, Boston, MA 0115 Effcency Consderatons for the Purely Tapered Interference

More information

N-back Training Task Performance: Analysis and Model

N-back Training Task Performance: Analysis and Model N-back Tranng Task Performance: Analyss and Model J. Isaah Harbson (jharb@umd.edu) Center for Advanced Study of Language and Department of Psychology, Unversty of Maryland 7005 52 nd Avenue, College Park,

More information

(From the Gastroenterology Division, Cornell University Medical College, New York 10021)

(From the Gastroenterology Division, Cornell University Medical College, New York 10021) ROLE OF HEPATIC ANION-BINDING PROTEIN IN BROMSULPHTHALEIN CONJUGATION* BY N. KAPLOWITZ, I. W. PERC -ROBB,~ ANn N. B. JAVITT (From the Gastroenterology Dvson, Cornell Unversty Medcal College, New York 10021)

More information

EXAMINATION OF THE DENSITY OF SEMEN AND ANALYSIS OF SPERM CELL MOVEMENT. 1. INTRODUCTION

EXAMINATION OF THE DENSITY OF SEMEN AND ANALYSIS OF SPERM CELL MOVEMENT. 1. INTRODUCTION JOURNAL OF MEDICAL INFORMATICS & TECHNOLOGIES Vol.3/00, ISSN 64-6037 Łukasz WITKOWSKI * mage enhancement, mage analyss, semen, sperm cell, cell moblty EXAMINATION OF THE DENSITY OF SEMEN AND ANALYSIS OF

More information

An Approach to Discover Dependencies between Service Operations*

An Approach to Discover Dependencies between Service Operations* 36 JOURNAL OF SOFTWARE VOL. 3 NO. 9 DECEMBER 2008 An Approach to Dscover Dependences between Servce Operatons* Shuyng Yan Research Center for Grd and Servce Computng Insttute of Computng Technology Chnese

More information

Computing and Using Reputations for Internet Ratings

Computing and Using Reputations for Internet Ratings Computng and Usng Reputatons for Internet Ratngs Mao Chen Department of Computer Scence Prnceton Unversty Prnceton, J 8 (69)-8-797 maoch@cs.prnceton.edu Jaswnder Pal Sngh Department of Computer Scence

More information

Prototypes in the Mist: The Early Epochs of Category Learning

Prototypes in the Mist: The Early Epochs of Category Learning Journal of Expermental Psychology: Learnng, Memory, and Cognton 1998, Vol. 24, No. 6, 1411-1436 Copyrght 1998 by the Amercan Psychologcal Assocaton, Inc. 0278-7393/98/S3.00 Prototypes n the Mst: The Early

More information

THE NORMAL DISTRIBUTION AND Z-SCORES COMMON CORE ALGEBRA II

THE NORMAL DISTRIBUTION AND Z-SCORES COMMON CORE ALGEBRA II Name: Date: THE NORMAL DISTRIBUTION AND Z-SCORES COMMON CORE ALGEBRA II The normal dstrbuton can be used n ncrements other than half-standard devatons. In fact, we can use ether our calculators or tables

More information

Linking Dynamical and Population Genetic Models of Persistent Viral Infection

Linking Dynamical and Population Genetic Models of Persistent Viral Infection vol. 162, no. 1 the amercan naturalst july 2003 Lnkng Dynamcal and Populaton Genetc Models of Persstent Vral Infecton John K. Kelly, 1,* Scott Wllamson, 1 Mara E. Orve, 1 Marlyn S. Smth, 2 and Robert D.

More information

Clinging to Beliefs: A Constraint-satisfaction Model

Clinging to Beliefs: A Constraint-satisfaction Model Clngng to Belefs: A Constrant-satsfacton Model Thomas R. Shultz (shultz@psych.mcgll.ca) Department of Psychology; McGll Unversty Montreal, QC H3C 1B1 Canada Jacques A. Katz (jakatz@cnbc.cmu.edu) Department

More information

Validation of the Gravity Model in Predicting the Global Spread of Influenza

Validation of the Gravity Model in Predicting the Global Spread of Influenza Int. J. Envron. Res. Publc Health 2011, 8, 3134-3143; do:10.3390/jerph8083134 OPEN ACCESS Internatonal Journal of Envronmental Research and Publc Health ISSN 1660-4601 www.mdp.com/journal/jerph Artcle

More information

Statistical Analysis on Infectious Diseases in Dubai, UAE

Statistical Analysis on Infectious Diseases in Dubai, UAE Internatonal Journal of Preventve Medcne Research Vol. 1, No. 4, 015, pp. 60-66 http://www.ascence.org/journal/jpmr Statstcal Analyss on Infectous Dseases 1995-013 n Duba, UAE Khams F. G. 1, Hussan H.

More information

Nonstandard Machine Learning Algorithms for Microarray Data Mining. Byoung-Tak Zhang

Nonstandard Machine Learning Algorithms for Microarray Data Mining. Byoung-Tak Zhang Nonstandard Machne Learnng Algorthms for Mcroarray Data Mnng Byoung-Tak Zhang Center for Bonformaton Technology (CBIT) & Bontellgence Laboratory School of Computer Scence and Engneerng Seoul Natonal Unversty

More information

Appendix for. Institutions and Behavior: Experimental Evidence on the Effects of Democracy

Appendix for. Institutions and Behavior: Experimental Evidence on the Effects of Democracy Appendx for Insttutons and Behavor: Expermental Evdence on the Effects of Democrac 1. Instructons 1.1 Orgnal sessons Welcome You are about to partcpate n a stud on decson-makng, and ou wll be pad for our

More information

INITIAL ANALYSIS OF AWS-OBSERVED TEMPERATURE

INITIAL ANALYSIS OF AWS-OBSERVED TEMPERATURE INITIAL ANALYSIS OF AWS-OBSERVED TEMPERATURE Wang Yng, Lu Xaonng, Ren Zhhua, Natonal Meteorologcal Informaton Center, Bejng, Chna Tel.:+86 684755, E-mal:cdcsjk@cma.gov.cn Abstract From, n Chna meteorologcal

More information

An Introduction to Modern Measurement Theory

An Introduction to Modern Measurement Theory An Introducton to Modern Measurement Theory Ths tutoral was wrtten as an ntroducton to the bascs of tem response theory (IRT) modelng and ts applcatons to health outcomes measurement for the Natonal Cancer

More information

Dr.S.Sumathi 1, Mrs.V.Agalya 2 Mahendra Engineering College, Mahendhirapuri, Mallasamudram

Dr.S.Sumathi 1, Mrs.V.Agalya 2 Mahendra Engineering College, Mahendhirapuri, Mallasamudram Detecton Of Myocardal Ischema In ECG Sgnals Usng Support Vector Machne Dr.S.Sumath 1, Mrs.V.Agalya Mahendra Engneerng College, Mahendhrapur, Mallasamudram Abstract--Ths paper presents an ntellectual dagnoss

More information

Cancer Classification Based on Support Vector Machine Optimized by Particle Swarm Optimization and Artificial Bee Colony

Cancer Classification Based on Support Vector Machine Optimized by Particle Swarm Optimization and Artificial Bee Colony molecules Artcle Cancer Classfcaton Based on Support Vector Machne Optmzed by Partcle Swarm Optmzaton and Artfcal Bee Colony Lngyun Gao 1 ID, Mngquan Ye 1, * and Changrong Wu 2 1 School of Medcal Informaton,

More information

DS May 31,2012 Commissioner, Development. Services Department SPA June 7,2012

DS May 31,2012 Commissioner, Development. Services Department SPA June 7,2012 . h,oshawa o Report To: From: Subject: Development Servces Commttee Item: Date of Report: DS-12-189 May 31,2012 Commssoner, Development Fle: Date of Meetng: Servces Department SPA-2010-09 June 7,2012 Applcaton

More information

A Support Vector Machine Classifier based on Recursive Feature Elimination for Microarray Data in Breast Cancer Characterization. Abstract.

A Support Vector Machine Classifier based on Recursive Feature Elimination for Microarray Data in Breast Cancer Characterization. Abstract. A Support Vector Machne Classfer based on Recursve Feature Elmnaton for Mcroarray Data n Breast Cancer Characterzaton. R.Campann, D. Dongovann, N. Lanconell, G. Palermo, A. Rccard, M. Roffll Dpartmento

More information

ALMALAUREA WORKING PAPERS no. 9

ALMALAUREA WORKING PAPERS no. 9 Snce 1994 Inter-Unversty Consortum Connectng Unverstes, the Labour Market and Professonals AlmaLaurea Workng Papers ISSN 2239-9453 ALMALAUREA WORKING PAPERS no. 9 September 211 Propensty Score Methods

More information

ENRICHING PROCESS OF ICE-CREAM RECOMMENDATION USING COMBINATORIAL RANKING OF AHP AND MONTE CARLO AHP

ENRICHING PROCESS OF ICE-CREAM RECOMMENDATION USING COMBINATORIAL RANKING OF AHP AND MONTE CARLO AHP ENRICHING PROCESS OF ICE-CREAM RECOMMENDATION USING COMBINATORIAL RANKING OF AHP AND MONTE CARLO AHP 1 AKASH RAMESHWAR LADDHA, 2 RAHUL RAGHVENDRA JOSHI, 3 Dr.PEETI MULAY 1 M.Tech, Department of Computer

More information

A Classification Model for Imbalanced Medical Data based on PCA and Farther Distance based Synthetic Minority Oversampling Technique

A Classification Model for Imbalanced Medical Data based on PCA and Farther Distance based Synthetic Minority Oversampling Technique A Classfcaton Model for Imbalanced Medcal Data based on PCA and Farther Dstance based Synthetc Mnorty Oversamplng Technque NADIR MUSTAFA School of Computer Scence and Engneerng Unversty of Electronc Scence

More information

A GEOGRAPHICAL AND STATISTICAL ANALYSIS OF LEUKEMIA DEATHS RELATING TO NUCLEAR POWER PLANTS. Whitney Thompson, Sarah McGinnis, Darius McDaniel,

A GEOGRAPHICAL AND STATISTICAL ANALYSIS OF LEUKEMIA DEATHS RELATING TO NUCLEAR POWER PLANTS. Whitney Thompson, Sarah McGinnis, Darius McDaniel, A GEOGRAPHICAL AD STATISTICAL AALYSIS OF LEUKEMIA DEATHS RELATIG TO UCLEAR POWER PLATS Whtney Thompson, Sarah McGnns, Darus McDanel, Jean Sexton, Rebecca Pettt, Sarah Anderson, Monca Jackson ABSTRACT:

More information

THE NATURAL HISTORY AND THE EFFECT OF PIVMECILLINAM IN LOWER URINARY TRACT INFECTION.

THE NATURAL HISTORY AND THE EFFECT OF PIVMECILLINAM IN LOWER URINARY TRACT INFECTION. MET9401 SE 10May 2000 Page 13 of 154 2 SYNOPSS MET9401 SE THE NATURAL HSTORY AND THE EFFECT OF PVMECLLNAM N LOWER URNARY TRACT NFECTON. L A study of the natural hstory and the treatment effect wth pvmecllnam

More information