The Limits of Individual Identification from Sample Allele Frequencies: Theory and Statistical Analysis

Size: px
Start display at page:

Download "The Limits of Individual Identification from Sample Allele Frequencies: Theory and Statistical Analysis"

Transcription

1 The Lmts of Indvdual Identfcaton from Sample Allele Frequences: Theory and Statstcal Analyss Peter M. Vsscher 1 *, Wllam G. Hll 2 1 Queensland Insttute of Medcal Research, Brsbane, Australa, 2 Insttute of Evolutonary Bology, School of Bologcal Scences, Unversty of Ednburgh, Unted Kngdom Abstract It was shown recently usng expermental data that t s possble under certan condtons to determne whether a person wth known genotypes at a number of markers was part of a sample from whch only allele frequences are known. Usng populaton genetc and statstcal theory, we show that the power of such dentfcaton s, approxmately, proportonal to the number of ndependent SNPs dvded by the sze of the sample from whch the allele frequences are avalable. We quantfy the lmts of dentfcaton and propose lkelhood and regresson analyss methods for the analyss of data. We show that these methods have smlar statstcal propertes and have more desrable propertes, n terms of type-i error rate and statstcal power, than test statstcs suggested n the lterature. Ctaton: Vsscher PM, Hll WG (2009) The Lmts of Indvdual Identfcaton from Sample Allele Frequences: Theory and Statstcal Analyss. PLoS Genet 5(10): e do: /journal.pgen Edtor: Greg Gbson, The Unversty of Queensland, Australa Receved May 29, 2009; Accepted August 3, 2009; Publshed October 2, 2009 Copyrght: ß 2009 Vsscher, Hll. Ths s an open-access artcle dstrbuted under the terms of the Creatve Commons Attrbuton Lcense, whch permts unrestrcted use, dstrbuton, and reproducton n any medum, provded the orgnal author and source are credted. Fundng: Ths study was partally supported by the Australan Natonal Health and Medcal Research Councl (grants and ) and the Australan Research Councl (grant DP ). None of the funders had any role n the analyses and nterpretaton of the data or n the preparaton, revew or approval of the manuscrpt. Competng Interests: The authors have declared that no competng nterests exst. * E-mal: peter.vsscher@qmr.edu.au Introducton Homer et al. [1] showed that t was possble n some crcumstances to dentfy whether a person wth observed genotypes at multple loc was part of a sample from whch only estmated allele frequences were known. Such dentfcaton would be partcularly useful n forensc scence f the presence or absence of a person s DNA n a mxture of DNA could be establshed. The authors also dscussed the relevance of ther fndngs when summary statstcs such as allele frequences were avalable n the publc doman as part of genotype-phenotype studes, because t possbly could be establshed that ndvduals, or ther close relatves, were part of a partcular study. As a result of the publcaton of Homer et al., NIH and the Wellcome Trust added more restrctons to the access of such data to avod potental dentfablty ( data_sharng_polcy_modfcatons_ pdf). The approach taken by Homer et al. was to have two samples wth estmated allele frequences, here called the test and reference sample, and to ask whether an ndvdual was close to ether of these samples, usng a statstc that measured a dstance to the sample. The propertes of the test statstc were not nvestgated theoretcally (although smulaton studes were performed), and the dfference between sample and populaton was not always clear. In ths note we take a best-case dealsed settng n whch there s a sngle populaton from whch there s a test sample wth allele frequences at a number of loc and from whch there s a sngle ndvdual, called the proband, wth full genotypes. The queston s whether the person was part of ths test sample from whch allele frequences are avalable. We use both lkelhood and lnear regresson theory, whch llustrate dfferent approaches to the problem, to draw nference about the hypothess that a proband was part of the test sample. We show that the power of dentfcaton of a proband as part of a test sample s, approxmately, proportonal to the number of ndependent SNPs dvded by the sze of the sample from whch the allele frequences are avalable. The power s reduced by a predctable magntude f the frequences n the populaton are themselves estmated mprecsely. Propertes of lkelhood-ratos and regresson test statstcs and a comparson wth the statstc used by Homer et al. were verfed by smulaton. Methods Notaton and assumptons There are m ndependent SNP markers wth a populaton frequency of p for allele B at the th SNP. We assume Hardy- Wenberg equlbrum n the populaton, so that the genotype proportons for the th SNP are (12p ) 2, 2p (12p ) and p 2 for genotypes AA, AB and BB, respectvely. We have estmated allele frequences ^p based upon a test sample of N unrelated ndvduals. In the test sample of 2N alleles, n s the number of B alleles at locus. In ths study we assume that N s known and ndvduals are equally represented n computng ^p. Note that these condtons are unlkely to be fully met n forensc applcatons when the test sample may be a DNA pool and we consder the mplcatons later. The genotype for proband X at the th SNP s g, whch can take values of 0, 1 and 2 for genotypes AA, AB and BB, and the expectaton of y = Kg s the populaton frequency p,.e. E[Kg ]=p. To smplfy dervatons, we shall frst assume the populaton frequences p, are known. More generally, we assume we have pror unbased estmates of the allele frequences from the same PLoS Genetcs 1 October 2009 Volume 5 Issue 10 e

2 Author Summary It was shown recently by Homer and colleagues that t may be possble to determne whether a person wth known genotypes at a number of markers was part of a pool of DNA from whch only frequences of alleles at the markers are known. In ths study, we quantfy how well such dentfcaton can work n practce. The larger the sze of the sample from whch the allele frequences are avalable, the more ndependent genetc markers are requred to allow ndvdual dentfcaton. populaton from a dfferent fnte sample (the reference sample ) of sze N*, n whch there are n* B alleles at locus. As both the test and reference samples are drawn ndependently from the populaton, the best estmate of the frequency n the populaton s gven by the pooled value, ^p ~ n zn = 2Nz2N ð Þ It s explaned subsequently why ths estmate, rather than say n* /2N*, the estmate of the allele frequency from the reference sample, s used n the statstcal analyss. Lkelhood Populaton frequences known. If, under the assumptons descrbed above, the numbers of ndvduals n the test sample and populaton frequences are known, then we can compute the relatve lkelhood of samplng the observed genotypes under the two alternatve hypotheses: the proband X s or s not n the test sample. If X s not a member of ths sample, then n, Bnomal(2N, p ) and g s ndependently dstrbuted Bnomal(2, p ). Hence the jont probablty of sample and proband s P(out)~ 2N n n p 2 (1{p) 2N{n g p g (1{p ) 2{g If X s a member of the sample, n has the same dstrbuton, but g s sampled from the 2N wthout replacement and has the hypergeometrc dstrbuton: P(n)~ 2N n p n (1{p) 2N{n 2N{n n g 2{g = 2N 2 Alternatvely P(n) can be vewed as n 2g, Bnomal(2N22, p ) and g, Bnomal(2, p ) ndependently, gvng the same formula. Hence the lkelhood rato for X n vs not n (out) the test sample reduces to a smple equaton, but n vew of the varyng length of the factoral expressons, t s clearer to wrte three separate ones: LR(n=out,AA)~(2N{n )(2N{n {1)=½2N(2N{1)(1{p ) 2 Š LR(n=out,AB)~n (2N{n )=½2N(2N{1)p (1{p )Š LR(n=out,BB)~n (n {1)=½2N(2N{1)p 2 Š For example, f allele B s at low frequency n the populaton (p small) and the proband s BB, then f the number n the sample, n,2, LR(BB) = 0, as t should; but as n ncreases LR(BB) becomes hgh. If the test sample s qute large, the correcton for nonreplacement samplng becomes less mportant, and the formulae smplfy to, for example, LR(n/out, BB) = (n /2N) 2 /p 2,.e. a smple comparson of whether the genotype frequences correspond more closely to those n the sample than n the populaton. For m ndependent loc, the log lkelhood rato (loglr) s log LR(n=out)~{m½log (2N)z log (2N{1)Š z X 0 z X 1 z X 2 { X 0 { X 2 ½log (2N{n )z log (2N{n {1)Š ½log (n )z log (2N{n )Š ½log (n )z log (n {1)Š ½2 log (1{p )Š{ X 1 ½2 log (p )Š ½log (p )z log (1{p )Š where 0, 1, 2 represent AA, AB and BB ndvduals at the respectve loc. If the non-replacement samplng s gnored, ths smplfes to a lkelhood comparson of allele frequences n an ndvdual to one of two dfferent populatons loglr(n=out)~ X (2g 0 zg 1 )½log(1{n =2N){log(1{p )Š z X (g 1 z2g 2 )½log(n =2N){log(p )Š where g 0 etc. refer to counts over the correspondng genotypes. Populaton frequences estmated. If the marker frequences are estmated from a reference sample of the populaton of sze N*, then the allele frequences p n the above equatons have to be replaced n the analyss by an estmate of populaton frequency. Although t would be possble just to use the frequences n* /2N* n the reference sample, ths should not be done as t leads to ncreased expectatons of loglr and, f unadjusted, to bas n assgnment of the proband to the test sample. More approprately, provdng the reference and test samples are ndependent, the pooled estmate of the populaton frequency ^p ~ n zn = 2Nz2N ð Þ should be used nstead of p n the above formulae. Propertes. The lkelhood rato (or ts logarthm) contans all of the nformaton and reflects the relatve probabltes of the two hypotheses (n/out) gven the data. We consder expectatons of loglr under the dfferent hypotheses. Standard statstcal dfferentaton was employed, takng a Taylor seres expanson of terms such as log(n ) about log(2np ), gnorng hgher order terms, and takng expectatons over the samplng dstrbutons of the observed frequences under each hypothess (see Text S1 for more detals). The followng formulae have also been verfed by smulaton. 1. If the populaton frequences are known, then for a proband n the test sample, E(logLR n)<km/n, and for a proband not n the test sample E(logLR out)<2km/n. Therefore the ablty to fnd whether the proband s n or not n the sample s proportonal to the number of ndependent markers and nversely proportonal to the sze of the test sample. 2. The varance of loglr s approxmately the same whether the proband s or s not present, and s close to m/n = 2E(logLR n). One measure of dscrmnatng power s the dfference n expected log-lkelhoods for the two hypotheses, scaled by the varance of that dfference, analogous to the non-centralty parameter of a test statstc: [E(logLR n)2e(loglr out)] 2 / [var(loglr n)+var(loglr out)]<km/n. Hypothess tests are PLoS Genetcs 2 October 2009 Volume 5 Issue 10 e

3 dscussed further n the subsequent secton on the regresson analyss, but note that the two hypothess (n/out) are not nested. The varance under the n hypothess s twce ts expectaton as for a ch-square wth 1 degree of freedom so the proporton of LR exceedng some threshold can be predcted. 3. The allele frequences have lttle nfluence on the dstrbuton of the lkelhoods. Unless the frequences are very extreme, or the test sample very small, the expected lkelhood ratos are lttle affected by whether the non-replacement samplng s accounted for, provdng they are computable. Wth very small numbers of a homozygous class expected under the out hypothess, then exclusons can occur wth some probablty. In such a case, f genotype results are correct, then presence of the proband n the test sample has to be excluded. Ths can occur even wth relatvely large test sample szes. The jont probablty of the proband havng genotype AB and the test sample beng homozygous AA and thereby excluded s 2p(12p) 2N+1 <2pe 22Np for small p, and for example s for p = 0.01 and N = If the populaton frequences are estmated as ^p, the expectatons of the lkelhoods and ther varances and hence dscrmnatng ablty are all reduced by a proporton of approxmately N*/ (N+N*), e.g. E(logLR n)=[n*/(n+n*)](km/n). For example, the reducton s by one-half f the frequency s estmated usng a reference sample of the same sze as the test sample, and essentally to zero f there are no such other data. 5. If there s lnkage dsequlbrum amongst the loc, but the data are analysed as f they are ndependent, the expectaton of loglr s the same as f all were unlnked. The samplng varances are, however, ncreased. If the populaton frequences are known wthout error, t can be shown that for any par of loc, regardless of ther frequency, var(loglr n)<var(logl- R out)<2(1+r 2 )/N, approxmately, where r 2 s the squared correlaton of gene frequences between these loc [2]. Hence, for mnloc, h the dscrmnatng o ablty s approxmately 1 =2 m= N 1z(m{1)r 2 asymptotes to 1 h = 2 Nr 2 of loc. If ths quantty can not be calculated drectly t can be predcted from populaton parameters. and, as the number of loc ncreases,, where r 2 s the mean of r 2 over all pars Lnear regresson We show that the man results for the regresson approach are based upon the expectaton that the regresson of the proband frequency, y = Kg, on ^p, each expressed as devatons from populaton frequences, s dstrbuted about unty for all loc f the proband was part of the test sample, and about zero otherwse. Populaton frequences known. Consderng ths case frst for smplcty, the regresson coeffcent s estmated as b~ P (y {p,^p {p )= P (^p {p ) 2. If the proband s n the test sample, y and ^p are correlated, so cov(y {p,^p {p )jn~ 1 = 2 p (1{p )Š=N, and f t s not n the test sample, cov(y {p,^p {p )jout~0. In both cases, var(^p {p )jn~var(^p {p )jout~ 1 = 2 p (1{p )=N: Hence, assumng many loc such that the rato of expectatons approxmates the expectaton of the ratos, h E(bjn)~E X h fcov(y {p,^p {p )jng = X fvar(^p {p )joutg~1 and E(bjout)~0 Therefore the regresson of the proband s allele frequency on the estmated allele frequency n the test sample, both expressed as a devaton from the populaton frequency, s expected to be zero f the proband was not n the test sample and one f the probands was n the test sample. The correspondng samplng varances are, respectvely, assumng large m, var(bjn)~(n{1)=m and var(b=jout)~n=m;.e., the varance s slghtly smaller f the proband s n the sample. These results correspond closely to the expectatons of the condtonal log-lkelhood analyss, and show how they are related. Populaton frequences estmated. There are two approaches to estmatng the populaton frequency and testng: comparson of the proband wth ether the reference sample of N* alone, or comparson of the proband wth the estmate ^p from the combned sample of sze N+N*. Whlst t mght seem counterntutve to use the latter whch ncludes the test data n the estmate, t provdes smpler results, notably expected regresson coeffcents of 0 (out) and 1 (n); hence we use t here. The estmate of the regresson coeffcent s b~ P (y { ^p,^p { ^p ) = P (^p { ^p )2. Now var (^p { p ) ~1 = 2 p (1 { p )1=N ½ {1=(N zn ) Š. Ths s also cov(y {^p,^p {^p jn), whereas cov(y {^p,^p {^p jout)~0. Hence, f the proband s n the test sample, E(bjn)~1, and var(bjn)~ ½(N{1)=mŠ ½(NzN )=N Š: If the proband s not n the test sample, E(bjout)~0, and var(bjout)~ ½N=mŠ ½(NzN )=N Š; where terms of 1 relatve to N+N* are gnored. Hence the test statstcs are smply N*/(N+N*) of those where the populaton frequences are known (.e., N*R ). Hypothess testng. The null hypothess s out, E(b) = 0: the proband was not part of the test sample. The alternatve hypothess (n, E(b).0) s that the proband (or a close relatve) was part of the test sample. If hypothess out s true, a test statstc for the null hypothess that the proband s part of the sample s t =[b21] 2 /var (b out). Agan, t,x 2 (1) f ths hypothess s true. If t s false,.e. the proband s not part of the sample, then t has a non-central ch-square dstrbuton t,x 29 (1),l wth non-centralty l<(m/n)[n*/(n+n*)]. For large N, nferences from testng whether the proband s n or whether the proband s out of the test sample are dentcal, as n the lkelhood approach: the probablty of rejectng the null hypothess that the proband s not part of the sample when that s false s the same as the probablty of rejectng the null hypothess that the proband s n the sample when that s false. For a type-i error rate of a and power of 12b, wth correspondng normal devates of z a and z 12b, the requred rato of m/n = l =(z a +z 12b ) 2, assumng a very large reference sample (N*&N). For a type-i error rate of 0.05 and a power of 80%, the requred m/n rato s therefore approxmately 6, and for a =10 26 and 12b = 99%, the rato s approxmately 50. If, for example, the reference sample were the same sze as the test sample, the number of loc would have to be doubled to gve the same power. Results Smulatons Populaton allele frequences on m markers were drawn from a unform dstrbuton wth lower bound 0.05 and upper bound 0.95 PLoS Genetcs 3 October 2009 Volume 5 Issue 10 e

4 (.e., mnor allele frequency (MAF).0.05). For the th SNP, a genotype score (y ) of a proband was smulated from a bnomal dstrbuton wth probablty p and sample sze 2. Allele frequences n the reference and test samples were smulated from a bnomal dstrbuton wth probablty p and sample sze 2N * and 2N, respectvely. If the proband was part of the test sample then the test sample was smulated on N21 ndvduals and the allele count from the proband was added to that from ths sample to create a sample from N ndvduals. Lnear regresson was performed as descrbed prevously, for a type-i error rate of 0.05, and the Homer et al. [1] test statstc (see Text S2) was also mplemented smulatons were performed for combnatons of N = 100, 1000, 10000, N* = 100, 1000, and and m = 50,000, when the proband was ether part or not part of the test sample. The results are shown n Table 1. The regresson type-i error rates are well controlled when the hypotheses tested are true. As predcted (Text S2), the type-i error rates for the Homer et al. test statstc are not well controlled. In many cases the probablty of rejectng the null hypothess when t s true s close to zero. Power to determne whether the proband s part of the test sample s good for test samples of 1000 f the reference sample sze s large. Inference from the regresson and lkelhood-rato approach s smlar, as expected (Table S1). Dscusson Smple methods were proposed to test the hypothess of whether a proband was part of a test sample. The expected lkelhood rato or the power to reject the null hypothess when t s false were derved and shown to be a smple functon of m/n, the rato of the number of markers and test sample sze. If allele frequences n the populaton are well-estmated then there s good power to determne f a proband s part of a sample of,1000 ndvduals when usng a whole genome scan of,50,000 ndependent markers. There s a strong relatonshp between the loglr statstc and regresson test statstcs. The dfference n the two regresson test statstcs, n or out of the test sample, s approxmately equal to twce the loglr statstc. Hence, twce the loglr statstc s very smlar to a test statstc from regresson that also tests for the n vs out hypothess (Table S1). Could any nference be drawn n the case where there are no pror estmates of allele frequences? The analyses ndcate that, Table 1. Smulaton results (m = 50,000 SNPs; type-i error rate = 0.05; 1000 smulatons). Lnear regresson Homer et al. Proband n test? N* N b P(b.0) P(b,1)} P(D.0) P(D,0) Type-I error Power Type-I error Power NO NO NO NO NO NO NO NO NO NO NO NO Power Type-I error Power Type-I error YES YES YES YES YES YES YES YES YES YES YES YES D refers to the Homer et al. test statstc. do: /journal.pgen t001 PLoS Genetcs 4 October 2009 Volume 5 Issue 10 e

5 even wth many marker loc, there s lttle power as N* approaches 0 unless the sample sze N s also very small, and no larger than N*. The parameter m was defned as the number of ndependent SNPs. When many SNPs are used, e.g. all common SNPs on a chp, then there s correlaton (lnkage dsequlbrum) among the SNPs. Consequently, the y varables (allele numbers n the proband) are correlated and not takng ths nto account wll nflate the test statstc because the true varance of the estmated regresson coeffcent s larger than appears from the total number of SNPs. Smlarly, the varance of the lkelhood statstc s ncreased f allele frequences across SNPs are correlated. There are a number of ways to deal wth ths correlaton structure. () Restrct the analyses to SNPs that are n lnkage equlbrum. Ths seems wasteful because nformaton s dscarded. () Take the correlated nature of y nto account by fttng the covarance structure of y nto the regresson or lkelhood analyss. The effect of LD on the varance of the log lkelhoods s shown earler, and approprate correctons usng the mean r 2 gven. In vew of the correspondence of the lkelhood and regresson approaches, the same correcton can be appled to the latter. The relevant quantty may be obtaned from a separate data set (e.g. HapMap). () Perform a theoretcal adjustment on the test statstc, by calbratng the varance of the test statstc on the equvalent number of ndependent markers. Accordng to populaton genetcs theory, the number of ndependent loc ( segments ) n a random populaton wth effectve sze N e and genome length L (Morgan) s approxmately 2N e L/log(4N e L) [3]. For human populatons, wth N e = 10,000 and L = 35, ths mples a total of,50,000 SNPs. Ths number can also be estmated usng a smulaton approach, condtonng on the observed LD structure n a sample where ndvdual-level genotype data are avalable. Such an applcaton resulted n,55,000 ndependent SNPs for one genome-wde assocaton study [4]. Populaton dfferences In our dervatons we have assumed that all samples (proband, reference and test) are from the same populaton and that wthn the populaton there s random matng. What f these assumptons are volated? If all samples are from the same populaton but there s devaton from HWE then the tests are somewhat based because HWE s assumed n computng the lkelhood and the varance of sample allele frequences. Populaton dfferences are more serous and can lead to the wrong nference. There are a large number of possbltes because, n prncple, the proband, reference and test samples can all come from dfferent populatons. However, populaton dfferences between the reference and test sample can be tested explctly usng standard tests for dfferences n gene frequency. There seems lttle pont n testng whether a proband was part of a specfc test sample when there s no reference sample from the same populaton. Nevertheless, what can we predct f the reference populaton s not actually from the same populaton, but s used as f t s? Then both the lkelhood statstcs for the hypothess n and out are nflated, by essentally the same amount, so the problem s not the dvergence between the two populatons, but bas n the test statstc. If populaton frequences are napproprately or approxmately estmated, the sample s more lkely to be assgned as n when t should not be. The reference sample s of lttle value f the dvergence between the populatons, expressed as Wrght s F ST, approaches 1/(2N). Can we quantfy the lmts of dentfcaton n practcal stuatons? Ths s hard, because there are (at least) three dffcultes n addton to the theoretcal sample m/n crteron: 1) The sze of reference sample used to estmate the populaton frequency - n effect a sort of outgroup as N gets very large. So f the test sample s much larger than the reference sample (N&N*) the latter provdes the lmt. 2) The degree to whch the test N and the reference N* ndvduals are samples from the same populaton. 3) Lnkage dsequlbrum, whch generates a lmt regardless of numbers of loc. For these reasons we cannot set a smple lmt to dentfcaton wthout reference to other parameters (or speculaton). Relatves In the analyss we have not consdered the possblty that the proband s not n the test sample, but s related to one or more persons who s. For example f a relatve wth relatonshp R (e.g. R = K for full sbs) s n the test sample, then the expectaton of the regresson coeffcent s E(b)=R rather than 0 or 1. Smlar calculatons can be done f, for example, there are several relatves n the test or reference samples. If many markers are used, a value of b of approxmately one-half would rase suspcons that n fact a full sb, parent or chld s n the test sample. Lower, but non-zero values could be consequences of samplng or relatonshp. The smulaton results n Table 1 llustrate how senstve the methods can be, and hence there seems a real possblty of dentfyng not just the proband but also hs/her relatves. Forensc applcatons A problem frequently met n forensc applcatons s whether a partcular ndvdual s DNA appears n a mxture obtaned at a crme scene, for example. In ths case, t s usually unknown how many ndvduals DNA s present n the sample (.e., N s unknown), equal representaton cannot be assumed, and there may be allelc drop out n the sample, although Homer et al. [1] showed emprcally that probands could be detected even f ther contrbuton to the DNA pool was small. We do not therefore consder the present results to be relevant for probablstc nference n a forensc settng. However, excluson of a proband from a pooled DNA sample s possble f many markers are used, the actual N s small and frequences of alleles from the pool are estmated accurately. The lkelhood framework s senstve to genotypng errors n that false exclusons could occur, but the analyss could be adapted to model genotype counts wth specfed probablty of errors or by assumng replacement samplng n computng P(n). The lnear regresson approach s lkely to be robust to genotypng error. Genome-wde assocaton studes In contrast to forensc applcatons, n the stuaton consdered by Homer et al. n whch the test sample s a database constructed usng a specfed number of ndvduals each wth ndvdual genotypes, and wth the gene frequences estmated as ther average, our results support ther conclusons. Probands that were part of a test sample could be dentfed even for samples szes of If, for example, there are both dseased case and healthy control samples n the assocaton test, each assumed to be sampled from the same populaton, then t s possble to test whether an ndvdual s present n ether the case or control group usng the analyss we have descrbed, but usng each sample n turn as the test sample. Current genome-wde assocaton studes (and meta-analyses based upon multple studes) are conducted on large samples, often of the order of 10,000 or so, and n ths case our results show that PLoS Genetcs 5 October 2009 Volume 5 Issue 10 e

6 the power to dentfy a proband who was part of such a large sample when the reference sample s of smlar sze s only about one-half (Table 1) assumng 50,000 ndependent loc, even under the deal crcumstances consdered n ths study. Supportng Informaton Table S1 Smulaton results comparng the LR and Regresson statstcs. Found at: do: /journal.pgen s001 (0.06 MB DOC) Text S1 Computaton of expected lkelhoods. Found at: do: /journal.pgen s002 (0.03 MB DOC) References 1. Homer N, Szelnger S, Redman M, Duggan D, Tembe W, et al. (2008) Resolvng ndvduals contrbutng trace amounts of DNA to hghly complex mxtures usng hgh-densty SNP genotypng mcroarrays. PLoS Genet 4: e do: /journal.pgen Hll WG, Robertson A (1968) Lnkage dsequlbrum n fnte populatons. Theor Appl Genet 38: Text S2 Homer et al. test statstc. Found at: do: /journal.pgen s003 (0.03 MB DOC) Acknowledgments We thank Naom Wray and two referees for helpful comments on the manuscrpt. Author Contrbutons Conceved and desgned the experments: PMV. Performed the experments: PMV WGH. Analyzed the data: PMV WGH. Contrbuted reagents/materals/analyss tools: PMV WGH. Wrote the paper: PMV WGH. 3. Hayes BJ, Vsscher PM, Goddard ME (2009) Increased accuracy of artfcal selecton by usng the realzed relatonshp matrx. Genetcs Research 91: Internatonal Schzophrena Consortum (2009) Common polygenc varaton contrbutes to rsk of schzophrena and bpolar dsorder. Nature (Epub July 1 st 2009). PLoS Genetcs 6 October 2009 Volume 5 Issue 10 e

Copy Number Variation Methods and Data

Copy Number Variation Methods and Data Copy Number Varaton Methods and Data Copy number varaton (CNV) Reference Sequence ACCTGCAATGAT TAAGCCCGGG TTGCAACGTTAGGCA Populaton ACCTGCAATGAT TAAGCCCGGG TTGCAACGTTAGGCA ACCTGCAATGAT TTGCAACGTTAGGCA

More information

International Journal of Emerging Technologies in Computational and Applied Sciences (IJETCAS)

International Journal of Emerging Technologies in Computational and Applied Sciences (IJETCAS) Internatonal Assocaton of Scentfc Innovaton and Research (IASIR (An Assocaton Unfyng the Scences, Engneerng, and Appled Research Internatonal Journal of Emergng Technologes n Computatonal and Appled Scences

More information

Using the Perpendicular Distance to the Nearest Fracture as a Proxy for Conventional Fracture Spacing Measures

Using the Perpendicular Distance to the Nearest Fracture as a Proxy for Conventional Fracture Spacing Measures Usng the Perpendcular Dstance to the Nearest Fracture as a Proxy for Conventonal Fracture Spacng Measures Erc B. Nven and Clayton V. Deutsch Dscrete fracture network smulaton ams to reproduce dstrbutons

More information

Price linkages in value chains: methodology

Price linkages in value chains: methodology Prce lnkages n value chans: methodology Prof. Trond Bjorndal, CEMARE. Unversty of Portsmouth, UK. and Prof. José Fernández-Polanco Unversty of Cantabra, Span. FAO INFOSAMAK Tangers, Morocco 14 March 2012

More information

Parameter Estimates of a Random Regression Test Day Model for First Three Lactation Somatic Cell Scores

Parameter Estimates of a Random Regression Test Day Model for First Three Lactation Somatic Cell Scores Parameter Estmates of a Random Regresson Test Day Model for Frst Three actaton Somatc Cell Scores Z. u, F. Renhardt and R. Reents Unted Datasystems for Anmal Producton (VIT), Hedeweg 1, D-27280 Verden,

More information

Modeling Multi Layer Feed-forward Neural. Network Model on the Influence of Hypertension. and Diabetes Mellitus on Family History of

Modeling Multi Layer Feed-forward Neural. Network Model on the Influence of Hypertension. and Diabetes Mellitus on Family History of Appled Mathematcal Scences, Vol. 7, 2013, no. 41, 2047-2053 HIKARI Ltd, www.m-hkar.com Modelng Mult Layer Feed-forward Neural Network Model on the Influence of Hypertenson and Dabetes Melltus on Famly

More information

Using Past Queries for Resource Selection in Distributed Information Retrieval

Using Past Queries for Resource Selection in Distributed Information Retrieval Purdue Unversty Purdue e-pubs Department of Computer Scence Techncal Reports Department of Computer Scence 2011 Usng Past Queres for Resource Selecton n Dstrbuted Informaton Retreval Sulleyman Cetntas

More information

Optimal Planning of Charging Station for Phased Electric Vehicle *

Optimal Planning of Charging Station for Phased Electric Vehicle * Energy and Power Engneerng, 2013, 5, 1393-1397 do:10.4236/epe.2013.54b264 Publshed Onlne July 2013 (http://www.scrp.org/ournal/epe) Optmal Plannng of Chargng Staton for Phased Electrc Vehcle * Yang Gao,

More information

ALMALAUREA WORKING PAPERS no. 9

ALMALAUREA WORKING PAPERS no. 9 Snce 1994 Inter-Unversty Consortum Connectng Unverstes, the Labour Market and Professonals AlmaLaurea Workng Papers ISSN 2239-9453 ALMALAUREA WORKING PAPERS no. 9 September 211 Propensty Score Methods

More information

Project title: Mathematical Models of Fish Populations in Marine Reserves

Project title: Mathematical Models of Fish Populations in Marine Reserves Applcaton for Fundng (Malaspna Research Fund) Date: November 0, 2005 Project ttle: Mathematcal Models of Fsh Populatons n Marne Reserves Dr. Lev V. Idels Unversty College Professor Mathematcs Department

More information

Study and Comparison of Various Techniques of Image Edge Detection

Study and Comparison of Various Techniques of Image Edge Detection Gureet Sngh et al Int. Journal of Engneerng Research Applcatons RESEARCH ARTICLE OPEN ACCESS Study Comparson of Varous Technques of Image Edge Detecton Gureet Sngh*, Er. Harnder sngh** *(Department of

More information

Modeling the Survival of Retrospective Clinical Data from Prostate Cancer Patients in Komfo Anokye Teaching Hospital, Ghana

Modeling the Survival of Retrospective Clinical Data from Prostate Cancer Patients in Komfo Anokye Teaching Hospital, Ghana Internatonal Journal of Appled Scence and Technology Vol. 5, No. 6; December 2015 Modelng the Survval of Retrospectve Clncal Data from Prostate Cancer Patents n Komfo Anokye Teachng Hosptal, Ghana Asedu-Addo,

More information

Physical Model for the Evolution of the Genetic Code

Physical Model for the Evolution of the Genetic Code Physcal Model for the Evoluton of the Genetc Code Tatsuro Yamashta Osamu Narkyo Department of Physcs, Kyushu Unversty, Fukuoka 8-856, Japan Abstract We propose a physcal model to descrbe the mechansms

More information

Joint Modelling Approaches in diabetes research. Francisco Gude Clinical Epidemiology Unit, Hospital Clínico Universitario de Santiago

Joint Modelling Approaches in diabetes research. Francisco Gude Clinical Epidemiology Unit, Hospital Clínico Universitario de Santiago Jont Modellng Approaches n dabetes research Clncal Epdemology Unt, Hosptal Clínco Unverstaro de Santago Outlne 1 Dabetes 2 Our research 3 Some applcatons Dabetes melltus Is a serous lfe-long health condton

More information

310 Int'l Conf. Par. and Dist. Proc. Tech. and Appl. PDPTA'16

310 Int'l Conf. Par. and Dist. Proc. Tech. and Appl. PDPTA'16 310 Int'l Conf. Par. and Dst. Proc. Tech. and Appl. PDPTA'16 Akra Sasatan and Hrosh Ish Graduate School of Informaton and Telecommuncaton Engneerng, Toka Unversty, Mnato, Tokyo, Japan Abstract The end-to-end

More information

(From the Gastroenterology Division, Cornell University Medical College, New York 10021)

(From the Gastroenterology Division, Cornell University Medical College, New York 10021) ROLE OF HEPATIC ANION-BINDING PROTEIN IN BROMSULPHTHALEIN CONJUGATION* BY N. KAPLOWITZ, I. W. PERC -ROBB,~ ANn N. B. JAVITT (From the Gastroenterology Dvson, Cornell Unversty Medcal College, New York 10021)

More information

ARTICLE IN PRESS Neuropsychologia xxx (2010) xxx xxx

ARTICLE IN PRESS Neuropsychologia xxx (2010) xxx xxx Neuropsychologa xxx (200) xxx xxx Contents lsts avalable at ScenceDrect Neuropsychologa journal homepage: www.elsever.com/locate/neuropsychologa Storage and bndng of object features n vsual workng memory

More information

Richard Williams Notre Dame Sociology Meetings of the European Survey Research Association Ljubljana,

Richard Williams Notre Dame Sociology   Meetings of the European Survey Research Association Ljubljana, Rchard Wllams Notre Dame Socology rwllam@nd.edu http://www.nd.edu/~rwllam Meetngs of the European Survey Research Assocaton Ljubljana, Slovena July 19, 2013 Comparng Logt and Probt Coeffcents across groups

More information

An Introduction to Modern Measurement Theory

An Introduction to Modern Measurement Theory An Introducton to Modern Measurement Theory Ths tutoral was wrtten as an ntroducton to the bascs of tem response theory (IRT) modelng and ts applcatons to health outcomes measurement for the Natonal Cancer

More information

Appendix F: The Grant Impact for SBIR Mills

Appendix F: The Grant Impact for SBIR Mills Appendx F: The Grant Impact for SBIR Mlls Asmallsubsetofthefrmsnmydataapplymorethanonce.Ofthe7,436applcant frms, 71% appled only once, and a further 14% appled twce. Wthn my data, seven companes each submtted

More information

Economic crisis and follow-up of the conditions that define metabolic syndrome in a cohort of Catalonia,

Economic crisis and follow-up of the conditions that define metabolic syndrome in a cohort of Catalonia, Economc crss and follow-up of the condtons that defne metabolc syndrome n a cohort of Catalona, 2005-2012 Laa Maynou 1,2,3, Joan Gl 4, Gabrel Coll-de-Tuero 5,2, Ton Mora 6, Carme Saurna 1,2, Anton Scras

More information

A comparison of statistical methods in interrupted time series analysis to estimate an intervention effect

A comparison of statistical methods in interrupted time series analysis to estimate an intervention effect Peer revew stream A comparson of statstcal methods n nterrupted tme seres analyss to estmate an nterventon effect a,b, J.J.J., Walter c, S., Grzebeta a, R. & Olver b, J. a Transport and Road Safety, Unversty

More information

Reconstruction of gene regulatory network of colon cancer using information theoretic approach

Reconstruction of gene regulatory network of colon cancer using information theoretic approach Reconstructon of gene regulatory network of colon cancer usng nformaton theoretc approach Khald Raza #1, Rafat Parveen * # Department of Computer Scence Jama Mlla Islama (Central Unverst, New Delh-11005,

More information

Unobserved Heterogeneity and the Statistical Analysis of Highway Accident Data

Unobserved Heterogeneity and the Statistical Analysis of Highway Accident Data Unobserved Heterogenety and the Statstcal Analyss of Hghway Accdent Data Fred L. Mannerng Professor of Cvl and Envronmental Engneerng Courtesy Department of Economcs Unversty of South Florda 4202 E. Fowler

More information

THE NATURAL HISTORY AND THE EFFECT OF PIVMECILLINAM IN LOWER URINARY TRACT INFECTION.

THE NATURAL HISTORY AND THE EFFECT OF PIVMECILLINAM IN LOWER URINARY TRACT INFECTION. MET9401 SE 10May 2000 Page 13 of 154 2 SYNOPSS MET9401 SE THE NATURAL HSTORY AND THE EFFECT OF PVMECLLNAM N LOWER URNARY TRACT NFECTON. L A study of the natural hstory and the treatment effect wth pvmecllnam

More information

Appendix for. Institutions and Behavior: Experimental Evidence on the Effects of Democracy

Appendix for. Institutions and Behavior: Experimental Evidence on the Effects of Democracy Appendx for Insttutons and Behavor: Expermental Evdence on the Effects of Democrac 1. Instructons 1.1 Orgnal sessons Welcome You are about to partcpate n a stud on decson-makng, and ou wll be pad for our

More information

NUMERICAL COMPARISONS OF BIOASSAY METHODS IN ESTIMATING LC50 TIANHONG ZHOU

NUMERICAL COMPARISONS OF BIOASSAY METHODS IN ESTIMATING LC50 TIANHONG ZHOU NUMERICAL COMPARISONS OF BIOASSAY METHODS IN ESTIMATING LC50 by TIANHONG ZHOU B.S., Chna Agrcultural Unversty, 2003 M.S., Chna Agrcultural Unversty, 2006 A THESIS submtted n partal fulfllment of the requrements

More information

NHS Outcomes Framework

NHS Outcomes Framework NHS Outcomes Framework Doman 1 Preventng people from dyng prematurely Indcator Specfcatons Verson: 1.21 Date: May 2018 Author: Clncal Indcators Team NHS Outcomes Framework: Doman 1 Preventng people from

More information

Prediction of Total Pressure Drop in Stenotic Coronary Arteries with Their Geometric Parameters

Prediction of Total Pressure Drop in Stenotic Coronary Arteries with Their Geometric Parameters Tenth Internatonal Conference on Computatonal Flud Dynamcs (ICCFD10), Barcelona, Span, July 9-13, 2018 ICCFD10-227 Predcton of Total Pressure Drop n Stenotc Coronary Arteres wth Ther Geometrc Parameters

More information

Resampling Methods for the Area Under the ROC Curve

Resampling Methods for the Area Under the ROC Curve Resamplng ethods for the Area Under the ROC Curve Andry I. Bandos AB6@PITT.EDU Howard E. Rockette HERBST@PITT.EDU Department of Bostatstcs, Graduate School of Publc Health, Unversty of Pttsburgh, Pttsburgh,

More information

HERMAN AGUINIS University of Colorado at Denver. SCOTT A. PETERSEN U.S. Military Academy at West Point. CHARLES A. PIERCE Montana State University

HERMAN AGUINIS University of Colorado at Denver. SCOTT A. PETERSEN U.S. Military Academy at West Point. CHARLES A. PIERCE Montana State University ORGANIZATIONAL Aguns et al. / MODERATING RESEARCH EFFECTS METHODS Apprasal of the Homogenety of Error Varance Assumpton and Alternatves to Multple Regresson for Estmatng Moderatng Effects of Categorcal

More information

INITIAL ANALYSIS OF AWS-OBSERVED TEMPERATURE

INITIAL ANALYSIS OF AWS-OBSERVED TEMPERATURE INITIAL ANALYSIS OF AWS-OBSERVED TEMPERATURE Wang Yng, Lu Xaonng, Ren Zhhua, Natonal Meteorologcal Informaton Center, Bejng, Chna Tel.:+86 684755, E-mal:cdcsjk@cma.gov.cn Abstract From, n Chna meteorologcal

More information

Investigation of zinc oxide thin film by spectroscopic ellipsometry

Investigation of zinc oxide thin film by spectroscopic ellipsometry VNU Journal of Scence, Mathematcs - Physcs 24 (2008) 16-23 Investgaton of znc oxde thn flm by spectroscopc ellpsometry Nguyen Nang Dnh 1, Tran Quang Trung 2, Le Khac Bnh 2, Nguyen Dang Khoa 2, Vo Th Ma

More information

This article appeared in a journal published by Elsevier. The attached copy is furnished to the author for internal non-commercial research and

This article appeared in a journal published by Elsevier. The attached copy is furnished to the author for internal non-commercial research and Ths artcle appeared n a journal publshed by Elsever. The attached copy s furnshed to the author for nternal non-commercal research and educaton use, ncludng for nstructon at the authors nsttuton and sharng

More information

A GEOGRAPHICAL AND STATISTICAL ANALYSIS OF LEUKEMIA DEATHS RELATING TO NUCLEAR POWER PLANTS. Whitney Thompson, Sarah McGinnis, Darius McDaniel,

A GEOGRAPHICAL AND STATISTICAL ANALYSIS OF LEUKEMIA DEATHS RELATING TO NUCLEAR POWER PLANTS. Whitney Thompson, Sarah McGinnis, Darius McDaniel, A GEOGRAPHICAL AD STATISTICAL AALYSIS OF LEUKEMIA DEATHS RELATIG TO UCLEAR POWER PLATS Whtney Thompson, Sarah McGnns, Darus McDanel, Jean Sexton, Rebecca Pettt, Sarah Anderson, Monca Jackson ABSTRACT:

More information

Incorrect Beliefs. Overconfidence. Types of Overconfidence. Outline. Overprecision 4/22/2015. Econ 1820: Behavioral Economics Mark Dean Spring 2015

Incorrect Beliefs. Overconfidence. Types of Overconfidence. Outline. Overprecision 4/22/2015. Econ 1820: Behavioral Economics Mark Dean Spring 2015 Incorrect Belefs Overconfdence Econ 1820: Behavoral Economcs Mark Dean Sprng 2015 In objectve EU we assumed that everyone agreed on what the probabltes of dfferent events were In subjectve expected utlty

More information

TOPICS IN HEALTH ECONOMETRICS

TOPICS IN HEALTH ECONOMETRICS TOPICS IN HEALTH ECONOMETRICS By VIDHURA SENANI BANDARA WIJAYAWARDHANA TENNEKOON A dssertaton submtted n partal fulfllment of the requrements for the degree of DOCTOR OF PHILOSOPHY WASHINGTON STATE UNIVERSITY

More information

Alma Mater Studiorum Università di Bologna DOTTORATO DI RICERCA IN METODOLOGIA STATISTICA PER LA RICERCA SCIENTIFICA

Alma Mater Studiorum Università di Bologna DOTTORATO DI RICERCA IN METODOLOGIA STATISTICA PER LA RICERCA SCIENTIFICA Alma Mater Studorum Unverstà d Bologna DOTTORATO DI RICERCA IN METODOLOGIA STATISTICA PER LA RICERCA SCIENTIFICA Cclo XXVII Settore Concorsuale d afferenza: 13/D1 Settore Scentfco dscplnare: SECS-S/02

More information

Evaluation of two release operations at Bonneville Dam on the smolt-to-adult survival of Spring Creek National Fish Hatchery fall Chinook salmon

Evaluation of two release operations at Bonneville Dam on the smolt-to-adult survival of Spring Creek National Fish Hatchery fall Chinook salmon Evaluaton of two release operatons at Bonnevlle Dam on the smolt-to-adult survval of Sprng Creek Natonal Fsh Hatchery fall Chnook salmon By Steven L. Haeseker and Davd Wlls Columba Rver Fshery Program

More information

What Determines Attitude Improvements? Does Religiosity Help?

What Determines Attitude Improvements? Does Religiosity Help? Internatonal Journal of Busness and Socal Scence Vol. 4 No. 9; August 2013 What Determnes Atttude Improvements? Does Relgosty Help? Madhu S. Mohanty Calforna State Unversty-Los Angeles Los Angeles, 5151

More information

A MIXTURE OF EXPERTS FOR CATARACT DIAGNOSIS IN HOSPITAL SCREENING DATA

A MIXTURE OF EXPERTS FOR CATARACT DIAGNOSIS IN HOSPITAL SCREENING DATA Journal of Theoretcal and Appled Informaton Technology 2005 ongong JATIT & LLS ISSN: 1992-8645 www.jatt.org E-ISSN: 1817-3195 A MIXTURE OF EXPERTS FOR CATARACT DIAGNOSIS IN HOSPITAL SCREENING DATA 1 SUNGMIN

More information

EVALUATION OF BULK MODULUS AND RING DIAMETER OF SOME TELLURITE GLASS SYSTEMS

EVALUATION OF BULK MODULUS AND RING DIAMETER OF SOME TELLURITE GLASS SYSTEMS Chalcogende Letters Vol. 12, No. 2, February 2015, p. 67-74 EVALUATION OF BULK MODULUS AND RING DIAMETER OF SOME TELLURITE GLASS SYSTEMS R. EL-MALLAWANY a*, M.S. GAAFAR b, N. VEERAIAH c a Physcs Dept.,

More information

Insights in Genetics and Genomics

Insights in Genetics and Genomics Insghts n Genetcs and Genomcs Research Artcle Open Access New Score Tests for Equalty of Varances n the Applcaton of DNA Methylaton Data Analyss [Verson ] Welang Qu Xuan L Jarrett Morrow Dawn L DeMeo Scott

More information

Does reporting heterogeneity bias the measurement of health disparities?

Does reporting heterogeneity bias the measurement of health disparities? HEDG Workng Paper 06/03 Does reportng heterogenety bas the measurement of health dspartes? Teresa Bago d Uva Eddy Van Doorslaer Maarten Lndeboom Owen O Donnell Somnath Chatterj March 2006 ISSN 1751-1976

More information

CONSTRUCTION OF STOCHASTIC MODEL FOR TIME TO DENGUE VIRUS TRANSMISSION WITH EXPONENTIAL DISTRIBUTION

CONSTRUCTION OF STOCHASTIC MODEL FOR TIME TO DENGUE VIRUS TRANSMISSION WITH EXPONENTIAL DISTRIBUTION Internatonal Journal of Pure and Appled Mathematcal Scences. ISSN 97-988 Volume, Number (7), pp. 3- Research Inda Publcatons http://www.rpublcaton.com ONSTRUTION OF STOHASTI MODEL FOR TIME TO DENGUE VIRUS

More information

A-UNIFAC Modeling of Binary and Multicomponent Phase Equilibria of Fatty Esters+Water+Methanol+Glycerol

A-UNIFAC Modeling of Binary and Multicomponent Phase Equilibria of Fatty Esters+Water+Methanol+Glycerol -UNIFC Modelng of Bnary and Multcomponent Phase Equlbra of Fatty Esters+Water+Methanol+Glycerol N. Garrdo a, O. Ferrera b, R. Lugo c, J.-C. de Hemptnne c, M. E. Macedo a, S.B. Bottn d,* a Department of

More information

Evaluation of the generalized gamma as a tool for treatment planning optimization

Evaluation of the generalized gamma as a tool for treatment planning optimization Internatonal Journal of Cancer Therapy and Oncology www.jcto.org Evaluaton of the generalzed gamma as a tool for treatment plannng optmzaton Emmanoul I Petrou 1,, Ganesh Narayanasamy 3, Eleftheros Lavdas

More information

Statistical Analysis on Infectious Diseases in Dubai, UAE

Statistical Analysis on Infectious Diseases in Dubai, UAE Internatonal Journal of Preventve Medcne Research Vol. 1, No. 4, 015, pp. 60-66 http://www.ascence.org/journal/jpmr Statstcal Analyss on Infectous Dseases 1995-013 n Duba, UAE Khams F. G. 1, Hussan H.

More information

The effect of salvage therapy on survival in a longitudinal study with treatment by indication

The effect of salvage therapy on survival in a longitudinal study with treatment by indication Research Artcle Receved 28 October 2009, Accepted 8 June 2010 Publshed onlne 30 August 2010 n Wley Onlne Lbrary (wleyonlnelbrary.com) DOI: 10.1002/sm.4017 The effect of salvage therapy on survval n a longtudnal

More information

Journal of Economic Behavior & Organization

Journal of Economic Behavior & Organization Journal of Economc Behavor & Organzaton 133 (2017) 52 73 Contents lsts avalable at ScenceDrect Journal of Economc Behavor & Organzaton j ourna l ho me pa g e: www.elsever.com/locate/jebo Perceptons, ntentons,

More information

HIV/AIDS-related Expectations and Risky Sexual Behavior in Malawi

HIV/AIDS-related Expectations and Risky Sexual Behavior in Malawi HIV/AIDS-related Expectatons and Rsky Sexual Behavor n Malaw Adelne Delavande Unversty of Essex and RAND Corporaton Hans-Peter Kohler Unversty of Pennsylvanna January 202 Abstract We use probablstc expectatons

More information

Estimating the distribution of the window period for recent HIV infections: A comparison of statistical methods

Estimating the distribution of the window period for recent HIV infections: A comparison of statistical methods Research Artcle Receved 30 September 2009, Accepted 15 March 2010 Publshed onlne n Wley Onlne Lbrary (wleyonlnelbrary.com) DOI: 10.1002/sm.3941 Estmatng the dstrbuton of the wndow perod for recent HIV

More information

Integration of sensory information within touch and across modalities

Integration of sensory information within touch and across modalities Integraton of sensory nformaton wthn touch and across modaltes Marc O. Ernst, Jean-Perre Brescan, Knut Drewng & Henrch H. Bülthoff Max Planck Insttute for Bologcal Cybernetcs 72076 Tübngen, Germany marc.ernst@tuebngen.mpg.de

More information

Introduction ORIGINAL RESEARCH

Introduction ORIGINAL RESEARCH ORIGINAL RESEARCH Assessng the Statstcal Sgnfcance of the Acheved Classfcaton Error of Classfers Constructed usng Serum Peptde Profles, and a Prescrpton for Random Samplng Repeated Studes for Massve Hgh-Throughput

More information

HIV/AIDS-related Expectations and Risky Sexual Behavior in Malawi

HIV/AIDS-related Expectations and Risky Sexual Behavior in Malawi Unversty of Pennsylvana ScholarlyCommons PSC Workng Paper Seres 7-29-20 HIV/AIDS-related Expectatons and Rsky Sexual Behavor n Malaw Adelne Delavande RAND Corporaton, Nova School of Busness and Economcs

More information

Desperation or Desire? The Role of Risk Aversion in Marriage. Christy Spivey, Ph.D. * forthcoming, Economic Inquiry. Abstract

Desperation or Desire? The Role of Risk Aversion in Marriage. Christy Spivey, Ph.D. * forthcoming, Economic Inquiry. Abstract Desperaton or Desre? The Role of Rsk Averson n Marrage Chrsty Spvey, Ph.D. * forthcomng, Economc Inury Abstract Because of the uncertanty nherent n searchng for a spouse and the uncertanty of the future

More information

The Case for Selection at CCR5-D32

The Case for Selection at CCR5-D32 Open access, freely avalable onlne The Case for Selecton at CCR5-D32 PLoS BIOLOGY Pards C. Sabet 1,2*, Emly Walsh 1, Steve F. Schaffner 1, Patrck Varlly 1, Ben Fry 1, Holl B. Hutcheson 3, Mke Cullen 3,

More information

Normal variation in the length of the luteal phase of the menstrual cycle: identification of the short luteal phase

Normal variation in the length of the luteal phase of the menstrual cycle: identification of the short luteal phase Brtsh Journal of Obstetrcs and Gvnaecologjl July 1984, Vol. 9 1, pp. 685-689 Normal varaton n the length of the luteal phase of the menstrual cycle: dentfcaton of the short luteal phase ELIZABETH A. LENTON,

More information

WHO S ASSESSMENT OF HEALTH CARE INDUSTRY PERFORMANCE: RATING THE RANKINGS

WHO S ASSESSMENT OF HEALTH CARE INDUSTRY PERFORMANCE: RATING THE RANKINGS WHO S ASSESSMENT OF HEALTH CARE INDUSTRY PERFORMANCE: RATING THE RANKINGS ELLIOTT PARKER and JEANNE WENDEL * Department of Economcs, Unversty of Nevada, Reno, NV, USA SUMMARY Ths paper examnes the econometrc

More information

Statistical models for predicting number of involved nodes in breast cancer patients

Statistical models for predicting number of involved nodes in breast cancer patients Vol.2, No.7, 641-651 (2010) do:10.4236/health.2010.27098 Health Statstcal models for predctng number of nvolved nodes n breast cancer patents Alok Kumar Dwved 1 *, Sada Nand Dwved 2, Suryanarayana Deo

More information

Biased Perceptions of Income Distribution and Preferences for Redistribution: Evidence from a Survey Experiment

Biased Perceptions of Income Distribution and Preferences for Redistribution: Evidence from a Survey Experiment DISCUSSION PAPER SERIES IZA DP No. 5699 Based Perceptons of Income Dstrbuton and Preferences for Redstrbuton: Evdence from a Survey Experment Gullermo Cruces Rcardo Pérez Trugla Martn Tetaz May 2011 Forschungsnsttut

More information

Are National School Lunch Program Participants More Likely to be Obese? Dealing with Identification

Are National School Lunch Program Participants More Likely to be Obese? Dealing with Identification Are Natonal School Lunch Program Partcpants More Lkely to be Obese? Dealng wth Identfcaton Janet G. Peckham Graduate Student, Clemson Unversty (jgemml@clemson.edu) Jaclyn D. Kropp Assstant Professor, Clemson

More information

Estimation of Relative Survival Based on Cancer Registry Data

Estimation of Relative Survival Based on Cancer Registry Data Revew of Bonformatcs and Bometrcs (RBB) Volume 2 Issue 4, December 203 www.sepub.org/rbb Estmaton of Relatve Based on Cancer Regstry Data Olaf Schoffer *, Ante Nedostate 2, Stefane J. Klug,2 Cancer Epdemology,

More information

Gene Selection Based on Mutual Information for the Classification of Multi-class Cancer

Gene Selection Based on Mutual Information for the Classification of Multi-class Cancer Gene Selecton Based on Mutual Informaton for the Classfcaton of Mult-class Cancer Sheng-Bo Guo,, Mchael R. Lyu 3, and Tat-Mng Lok 4 Department of Automaton, Unversty of Scence and Technology of Chna, Hefe,

More information

Balanced Query Methods for Improving OCR-Based Retrieval

Balanced Query Methods for Improving OCR-Based Retrieval Balanced Query Methods for Improvng OCR-Based Retreval Kareem Darwsh Electrcal and Computer Engneerng Dept. Unversty of Maryland, College Park College Park, MD 20742 kareem@glue.umd.edu Douglas W. Oard

More information

IMPROVING THE EFFICIENCY OF BIOMARKER IDENTIFICATION USING BIOLOGICAL KNOWLEDGE

IMPROVING THE EFFICIENCY OF BIOMARKER IDENTIFICATION USING BIOLOGICAL KNOWLEDGE IMPROVING THE EFFICIENCY OF BIOMARKER IDENTIFICATION USING BIOLOGICAL KNOWLEDGE JOHN H. PHAN The Wallace H. Coulter Department of Bomedcal Engneerng, Georga Insttute of Technology, 313 Ferst Drve Atlanta,

More information

UNIVERISTY OF KWAZULU-NATAL, PIETERMARITZBURG SCHOOL OF MATHEMATICS, STATISTICS AND COMPUTER SCIENCE

UNIVERISTY OF KWAZULU-NATAL, PIETERMARITZBURG SCHOOL OF MATHEMATICS, STATISTICS AND COMPUTER SCIENCE UNIVERISTY OF KWAZULU-NATAL, PIETERMARITZBURG SCHOOL OF MATHEMATICS, STATISTICS AND COMPUTER SCIENCE A COMPLEX SURVEY DATA ANALYSIS OF TB AND HIV MORTALITY IN SOUTH AFRICA By JOIE LEA MURORUNKWERE STUDENT

More information

Computing and Using Reputations for Internet Ratings

Computing and Using Reputations for Internet Ratings Computng and Usng Reputatons for Internet Ratngs Mao Chen Department of Computer Scence Prnceton Unversty Prnceton, J 8 (69)-8-797 maoch@cs.prnceton.edu Jaswnder Pal Sngh Department of Computer Scence

More information

Impact of Imputation of Missing Data on Estimation of Survival Rates: An Example in Breast Cancer

Impact of Imputation of Missing Data on Estimation of Survival Rates: An Example in Breast Cancer Orgnal Artcle Impact of Imputaton of Mssng Data on Estmaton of Survval Rates: An Example n Breast Cancer Banesh MR 1, Tale AR 2 Abstract Background: Multfactoral regresson models are frequently used n

More information

The Influence of the Isomerization Reactions on the Soybean Oil Hydrogenation Process

The Influence of the Isomerization Reactions on the Soybean Oil Hydrogenation Process Unversty of Belgrade From the SelectedWorks of Zeljko D Cupc 2000 The Influence of the Isomerzaton Reactons on the Soybean Ol Hydrogenaton Process Zeljko D Cupc, Insttute of Chemstry, Technology and Metallurgy

More information

Non-linear Multiple-Cue Judgment Tasks

Non-linear Multiple-Cue Judgment Tasks Non-lnear Multple-Cue Tasks Anna-Carn Olsson (anna-carn.olsson@psy.umu.se) Department of Psychology, Umeå Unversty SE-09 87, Umeå, Sweden Tommy Enqvst (tommy.enqvst@psyk.uu.se) Department of Psychology,

More information

Were the babies switched? The Genetics of Blood Types i

Were the babies switched? The Genetics of Blood Types i Were the babes swtched? The Genetcs of Blood Types Two couples had babes on the same day n the same hosptal. Dense and Earnest had a grl, Tonja. Danelle and Mchael had twns, a boy, Mchael, Jr., and a grl,

More information

Evaluation of Literature-based Discovery Systems

Evaluation of Literature-based Discovery Systems Evaluaton of Lterature-based Dscovery Systems Melha Yetsgen-Yldz 1 and Wanda Pratt 1,2 1 The Informaton School, Unversty of Washngton, Seattle, USA. 2 Bomedcal and Health Informatcs, School of Medcne,

More information

Sparse Representation of HCP Grayordinate Data Reveals. Novel Functional Architecture of Cerebral Cortex

Sparse Representation of HCP Grayordinate Data Reveals. Novel Functional Architecture of Cerebral Cortex 1 Sparse Representaton of HCP Grayordnate Data Reveals Novel Functonal Archtecture of Cerebral Cortex X Jang 1, Xang L 1, Jngle Lv 2,1, Tuo Zhang 2,1, Shu Zhang 1, Le Guo 2, Tanmng Lu 1* 1 Cortcal Archtecture

More information

Association between cholesterol and cardiac parameters.

Association between cholesterol and cardiac parameters. Short communcaton http://www.alledacademes.org/cholesterol-and-heart-dsease/ Assocaton between cholesterol and cardac parameters. Rabndra Nath Das* Department of Statstcs, The Unversty of Burdwan, Burdwan,

More information

THE NORMAL DISTRIBUTION AND Z-SCORES COMMON CORE ALGEBRA II

THE NORMAL DISTRIBUTION AND Z-SCORES COMMON CORE ALGEBRA II Name: Date: THE NORMAL DISTRIBUTION AND Z-SCORES COMMON CORE ALGEBRA II The normal dstrbuton can be used n ncrements other than half-standard devatons. In fact, we can use ether our calculators or tables

More information

An Approach to Discover Dependencies between Service Operations*

An Approach to Discover Dependencies between Service Operations* 36 JOURNAL OF SOFTWARE VOL. 3 NO. 9 DECEMBER 2008 An Approach to Dscover Dependences between Servce Operatons* Shuyng Yan Research Center for Grd and Servce Computng Insttute of Computng Technology Chnese

More information

Encoding processes, in memory scanning tasks

Encoding processes, in memory scanning tasks vlemory & Cognton 1976,4 (5), 501 506 Encodng processes, n memory scannng tasks JEFFREY O. MILLER and ROBERT G. PACHELLA Unversty of Mchgan, Ann Arbor, Mchgan 48101, Three experments are presented that

More information

Rainbow trout survival and capture probabilities in the upper Rangitikei River, New Zealand

Rainbow trout survival and capture probabilities in the upper Rangitikei River, New Zealand Ranbow trout survval and capture probabltes n the upper Rangtke Rver, New Zealand Rchard J Barker Department of Mathematcs and Statstcs Unversty of Otago P.O. Box 56 Dunedn, New Zealand Peter H Taylor

More information

Maize Varieties Combination Model of Multi-factor. and Implement

Maize Varieties Combination Model of Multi-factor. and Implement Maze Varetes Combnaton Model of Mult-factor and Implement LIN YANG,XIAODONG ZHANG,SHAOMING LI Department of Geographc Informaton Scence Chna Agrcultural Unversty No. 17 Tsnghua East Road, Bejng 100083

More information

Saeed Ghanbari, Seyyed Mohammad Taghi Ayatollahi*, Najaf Zare

Saeed Ghanbari, Seyyed Mohammad Taghi Ayatollahi*, Najaf Zare DOI:http://dx.do.org/10.7314/APJCP.2015.16.14.5655 and Anthracyclne- Breast Cancer Treatment and Survval n the Eastern Medterranean and Asa: a Meta-analyss RESEARCH ARTICLE Comparng Role of Two Chemotherapy

More information

Can Subjective Questions on Economic Welfare Be Trusted?

Can Subjective Questions on Economic Welfare Be Trusted? Publc Dsclosure Authorzed Polcy Research Workng Paper 6726 WPS6726 Publc Dsclosure Authorzed Publc Dsclosure Authorzed Can Subjectve Questons on Economc Welfare Be Trusted? Evdence for Three Developng

More information

Are Drinkers Prone to Engage in Risky Sexual Behaviors?

Are Drinkers Prone to Engage in Risky Sexual Behaviors? Amercan Internatonal Journal of Socal Scence Vol. 2 No. 5; July 2013 Are Drnkers Prone to Engage n Rsky Sexual Behavors? Ana Isabel Gl Lacruz Zaragoza Unversty Department of Busness Organzaton and Management

More information

Addressing empirical challenges related to the incentive compatibility of stated preference methods

Addressing empirical challenges related to the incentive compatibility of stated preference methods Addressng emprcal challenges related to the ncentve compatblty of stated preference methods Mkołaj Czajkowsk 1, Chrstan A. Vossler 2,, Wktor Budzńsk 1, Aleksandra Wśnewska 1 and Ewa Zawojska 1 The fnal

More information

Disease Mapping for Stomach Cancer in Libya Based on Besag York Mollié (BYM) Model

Disease Mapping for Stomach Cancer in Libya Based on Besag York Mollié (BYM) Model DI:0.034/APJCP.07.8.6.479 Dsease Mappng for Stomach Cancer n Lbya: Bayesan Study RESEARC ARTICLE Dsease Mappng for Stomach Cancer n Lbya Based on Besag York Mollé (BYM) Model Maryam Ahmed Salem Alhdr *,

More information

Causal inference in nonexperimental studies typically

Causal inference in nonexperimental studies typically Orgnal Artcle Regresson Dscontnuty Desgns n Epdemology Causal Inference Wthout Randomzed Trals Jacob Bor, a,b,c Ellen Moscoe, c Porta Mutevedz, b Mare-Louse Newell, b,d and Tll Bärnghausen b,c Abstract:

More information

A Meta-Analysis of the Effect of Education on Social Capital

A Meta-Analysis of the Effect of Education on Social Capital A Meta-Analyss of the Effect of Educaton on Socal Captal Huang Jan ** "Scholar" Research Center for Educaton and Labor Market Department of Economcs, Unversty of Amsterdam and Tnbergen Insttute by Henrëtte

More information

Efficiency Considerations for the Purely Tapered Interference Fit (TIF) Abutments Used in Dental Implants

Efficiency Considerations for the Purely Tapered Interference Fit (TIF) Abutments Used in Dental Implants Dnçer Bozkaya Graduate Student Snan Müftü* Ph.D., Assocate Professor Northeastern Unversty, Department of Mechancal Engneerng, Boston, MA 0115 Effcency Consderatons for the Purely Tapered Interference

More information

HIV/AIDS AND POVERTY IN SOUTH AFRICA: A BAYESIAN ESTIMATION OF SELECTION MODELS WITH CORRELATED FIXED-EFFECTS

HIV/AIDS AND POVERTY IN SOUTH AFRICA: A BAYESIAN ESTIMATION OF SELECTION MODELS WITH CORRELATED FIXED-EFFECTS HIV/AIDS AND POVERTY IN SOUTH AFRICA: A BAYESIAN ESTIMATION OF SELECTION MODELS WITH CORRELATED FIXED-EFFECTS FABRICE MURTIN* AND FEDERICA MARZO Abstract In ths paper, we estmate the causal mpact of human

More information

Modeling seasonal variation in indoor radon concentrations

Modeling seasonal variation in indoor radon concentrations Journal of Exposure Analyss and Envronmental Epdemology (2005) 15, 234 243 r 2005 Nature Publshng Group All rghts reserved 1053-4245/05/$30.00 www.nature.com/ea Modelng seasonal varaton n ndoor radon concentratons

More information

J. H. Rohrer, S. H. Baron, E. L. Hoffman, D. V. Swander

J. H. Rohrer, S. H. Baron, E. L. Hoffman, D. V. Swander 2?Hr a! A Report of Research on o ^^ -^~" r" THE STABILITY OF AUTOKINETIC JUDGMENTS J. H. Rohrer, S. H. Baron, E. L. Hoffman, D. V. Swander A techncal report made under ONR Contract Nonr-475(01) between

More information

Stephanie von Hinke Kessler Scholder, George Davey Smith, Debbie A. Lawlor, Carol Propper, Frank Windmeijer

Stephanie von Hinke Kessler Scholder, George Davey Smith, Debbie A. Lawlor, Carol Propper, Frank Windmeijer Stephane von Hnke Kessler Scholder, George Davey Smth, Debbe A. Lawlor, Carol Propper, Frank Wndmejer Chld heght, health and human captal: evdence usng genetc markers Dscusson paper 2010/13 September 2010

More information

Comparison of methods for modelling a count outcome with excess zeros: an application to Activities of Daily Living (ADL-s)

Comparison of methods for modelling a count outcome with excess zeros: an application to Activities of Daily Living (ADL-s) Comparson of methods for modellng a count outcome wth excess zeros: an applcaton to Actvtes of Daly Lvng (ADL-s) Paola Zannotto, Emanuela Falaschett To cte ths verson: Paola Zannotto, Emanuela Falaschett.

More information

A Mathematical Model of the Cerebellar-Olivary System II: Motor Adaptation Through Systematic Disruption of Climbing Fiber Equilibrium

A Mathematical Model of the Cerebellar-Olivary System II: Motor Adaptation Through Systematic Disruption of Climbing Fiber Equilibrium Journal of Computatonal Neuroscence 5, 71 90 (1998) c 1998 Kluwer Academc Publshers. Manufactured n The Netherlands. A Mathematcal Model of the Cerebellar-Olvary System II: Motor Adaptaton Through Systematc

More information

AN ENHANCED GAGS BASED MTSVSL LEARNING TECHNIQUE FOR CANCER MOLECULAR PATTERN PREDICTION OF CANCER CLASSIFICATION

AN ENHANCED GAGS BASED MTSVSL LEARNING TECHNIQUE FOR CANCER MOLECULAR PATTERN PREDICTION OF CANCER CLASSIFICATION www.arpapress.com/volumes/vol8issue2/ijrras_8_2_02.pdf AN ENHANCED GAGS BASED MTSVSL LEARNING TECHNIQUE FOR CANCER MOLECULAR PATTERN PREDICTION OF CANCER CLASSIFICATION I. Jule 1 & E. Krubakaran 2 1 Department

More information

Survival Rate of Patients of Ovarian Cancer: Rough Set Approach

Survival Rate of Patients of Ovarian Cancer: Rough Set Approach Internatonal OEN ACCESS Journal Of Modern Engneerng esearch (IJME) Survval ate of atents of Ovaran Cancer: ough Set Approach Kamn Agrawal 1, ragat Jan 1 Department of Appled Mathematcs, IET, Indore, Inda

More information

Rich and Powerful? Subjective Power and Welfare in Russia

Rich and Powerful? Subjective Power and Welfare in Russia Ths paper was presented at the Workshop on Measurng Empowerment: Cross-Dscplnary Perspectves held at the World Bank n Washngton, DC on February 4 and 5, 23. Rch and Powerful? Subjectve Power and Welfare

More information

Mathematical model of fish schooling behaviour in a set-net

Mathematical model of fish schooling behaviour in a set-net ICES Journal of Marne Scence, 61: 114e13 (004) do:10.1016/j.cesjms.004.07.009 Mathematcal model of fsh schoolng behavour n a set-net Tsutomu Takag, Yutaka Mortom, Jyun Iwata, Hrosh Nakamne, and Nobuo Sannomya

More information

[ ] + [3] i 1 1. is the density of the vegetable oil, R is the universal gas constant, T r. is the reduced temperature, and F c

[ ] + [3] i 1 1. is the density of the vegetable oil, R is the universal gas constant, T r. is the reduced temperature, and F c Densty and Vscosty of Vegetable Ols C.M. Rodenbush a, F.H. Hseh b, and D.S. Vswanath a, * Departments of a Chemcal Engneerng and b Bologcal and Agrcultural Engneerng, Unversty of Mssour-Columba, Columba,

More information

Subject-Adaptive Real-Time Sleep Stage Classification Based on Conditional Random Field

Subject-Adaptive Real-Time Sleep Stage Classification Based on Conditional Random Field Subject-Adaptve Real-Tme Sleep Stage Classfcaton Based on Condtonal Random Feld Gang Luo, PhD, Wanl Mn, PhD IBM TJ Watson Research Center, Hawthorne, NY {luog, wanlmn}@usbmcom Abstract Sleep stagng s the

More information