Challenges and Implications of Missing Data on the Validity of Inferences and Options for Choosing the Right Strategy in Handling Them

Size: px
Start display at page:

Download "Challenges and Implications of Missing Data on the Validity of Inferences and Options for Choosing the Right Strategy in Handling Them"

Transcription

1 International Journal of Statistical Distributions and Applications 2017; 3(4): doi: /j.ijsd ISSN: (Print); ISSN: (Online) Challenges and Iplications of Missing Data on the Validity of Inferences and Options for Choosing the Right Strategy in Handling The Nicholas Pindar Dibal 1, *, Ray Okafor 2, Haadu Dallah 3 1 Departent of Matheatical Sciences, University of Maiduguri, Maiduguri, Nigeria 2 Departent of Matheatics, University of Lagos, Lagos, Nigeria 3 Departent of Actuarial Science, University of Lagos, Lagos, Nigeria Eail address: pndibal@gail.co (N. P. Dibal), okaforray@yahoo.co (R. Okafor), dallara2014@gail.co (H. Dallah) To cite this article: Nicholas Pindar Dibal, Ray Okafor, Haadu Dallah. Challenges and Iplications of Missing Data on the Validity of Inferences and Options for Choosing the Right Strategy in Handling The. International Journal of Statistical Distributions and Applications. Vol. 3, No. 4, 2017, pp doi: /j.ijsd Received: Septeber 19, 2017; Accepted: October 17, 2017; Published: Noveber 20, 2017 Abstract: Missing data in surveys and experiental research is a coon occurrence which has serious iplications on the validity of inferences. Advances in statistical procedures provides better and efficient ethods of handling issing data yet any researches still handle incoplete data in ways that affects the results negatively. We review in detail the echaniss that generates issingness, and the appropriate ethods to account for the issing values to enable the researcher have adequate knowledge to ake infored decision on the choice of ethod to account for issingness. Keywords: Missing Data, Inference, Missingness Mechaniss, Ignorable, Non-Ignorable Missingness, Multiple Iputation 1. Introduction Missing values creates serious probles for researchers on the field and other data users during statistical analyses and data interpretation, especially when the assuption of ignorability of the issing data does not hold because ost of statistical ethods are not designed to handle incoplete data ([33]; [28]; [18]). Iproper ethods used to account for issing data usually results in biased estiate ([29]; [36]; [42]). Missing data are unobserved values, they occur when the actual data we intend to easure could not be easured for soe reasons leaving us with incoplete data. Missing data is a coon occurrence in surveys and experients when soething goes wrong for unanticipated reasons. Data ay be issing for a wide range of reasons soe of which can be partially controlled by the researcher while others are not. The ethical iperative that participation in any study is voluntary eans that participants are free to skip any question on issues sensitive to the or even withdraw fro the study whenever they wish thereby increasing the possibilities of issing values [1]. Missing values are unavoidable in any instances, hence the occurrence of issing values distorts the representativeness of the saple thereby affecting inferences about the population of study. Incoplete data signifies that soe inforation about the population paraeter are issing which ay likely influence the validity of results ([44]; [38]; [39]). Deciding on how to account for issing values is a challenging task that requires a good understanding of the reasons why the data got issing and the pattern of issingness. Unethical and unprofessional practices by any has led to the filling in of the issing values with zero assuing that unobserved values are equivalent to zero, or replacing the with any arbitrary values. Others siply resort to the default option in soe statistical software which deletes units with issing values, however, this assuption is not universally applicable to all types of issing data. Incoplete data poses great challenges in analysis resulting in invalid inferences, hence issing values should be handled with great care. In any surveys and experients, researchers rarely have detail inforation about the reasons for which parts of the data are issing, hence, any researchers rely on the default position in soe statistical software to account for issing

2 88 Nicholas Pindar Dibal et al.: Challenges and Iplications of Missing Data on the Validity of Inferences and Options for Choosing the Right Strategy in Handling The data without giving regard to what cause the issigness ([32]). Incoplete data yields results that do not adequately describe the population of interest ([4]; [1]; [11]; [15]; [25]; [22]). Accounting for issing values soeties takes longer than necessary due to lack of proper understanding of the issingness echaniss ([32]). In the works of [31]), several journals were reviewed to assess how issing values were handled in researches. Of the 1087 studies in 918 articles with quantitative coponents, attention was given to; saple size, the df of the test statistics reported, and how issing data were treated. The survey showed that out of 1087 studies, 305 (28%) did not ake any report on issing data, 587 (54%) showed evidence of issing data, and the reaining 195 (18%) did not provide sufficient inforation to deterine whether or not the data used were incoplete. Out of the cases with issing values, 569 (97%) reported dealing with such probles where 509 (89.5%) used the listwise deletion (LD) ethod and 43 (7.6%) the pairwise deletion (PD) ethod, only 2.9% used other ethods to account for the issing data. The over usage of listwise deletion ethod in handling issing data as indicated in the survey, could possibly be because of ease of application. Being the default option in ost statistical software, it coes handy for researchers who don t know which ethod to use. Listwise Deletion reduces statistical power due to reduction in the saple size ([5]). Even when the statistical power of a test statistic is not of interest, the accuracy of predicted values ay still be biased. For instance, with 10% of observations eliinated randoly fro each of 5 variables in a dataset, 59% of the total observations will be lost thereby seriously affecting the statistical power of the test ([26]; [14]). Other new ethods used to handle issing data include the works of [12] which uses quantile regression to replace the issing values, their ethod cobine both quantile regression iputation and general estiating equation ethods, which have copetitive advantages over soe of the ost widely used paraetric and non-paraetric iputation estiators. [41], also reviewed ethods of handling issing data giving ephasis on the application of iputation ethod. 2. Methodology The necessary steps and procedures to account for issing values are outlined in [22] and suarized as follows; i) understand the analytic objective and, identify the data structure and study design, ii) ake appropriate assuptions for issing data echanis, iii) identification of variables and the construction of the iputation odel. In developing the odel to handle issing data, it is iportant to balance sophistication, feasibility of odels and achievable results. To adequately account for issing data, descriptive analysis should be perfored on each variable to distinguish the issing data pattern in the data atrix ([17]; [37]). The issing data patterns are useful in deterining whether the survey was adinistered and entered correctly, it also helps in variable choice for inclusion in the iputation odel and analyses. [37] developed SAS 9.2 acro which identifies issing data pattern faster in four ways by; evaluating the proportion of subjects with each pattern of issing data, the nuber and percentage of issing data for each individual variable, the concordance of issingness in any pair of variables, and possible unit nonresponse. [32], suggested that at the preliinary and diagnostic stages of data preprocessing, stateent giving the range of issing data, such as; issing data ranged fro a low of 2% for Cancer to a high of 12% for HIV/AIDS and any other relevant inforation should be included and presented in a tabular forat as presented in Table 1. Table 1. Suary of Missing Units on each Variable across Seven (7) Counities. Variable Reported Std. Missing % of Missing Mean Cases Error Cases Cases Cancer Malaria Tuberculosis HIV/AIDS Pneuonia The perforance of the different ethods of handling issing data should be evaluated so that infored decision about which ethod to use could be ade; in doing so, the variables and scale of easureent should be considered, however, it should be noted that there is no single ethod that works best in all situations. Graphical plot where applicable could also be used to help identify issing data patterns and to assess how uch values are issing (there is no strict guideline to how uch is too uch). If no pattern can be found by ere visualisation, randoness test is suggested Assuptions About Missing Data Data could be issing for any different reasons such as; accidentally skipping an ite, wrong data entry, lacking knowledge on the issue or becoing frustrated and losing interest in the whole exercise, refusing to respond to particular question for soe reason, relocating to another city thereby aking it difficult to continue with the on-going research or data could as well be issing by design ([2]). Whichever ethod is chosen to account for issing data, it is iportant to understand why and how the values got issing in the first place ([44]; [40]; [43]). Missing data can be thought of as being caused in one or soe cobinations of ways which [20] outlined as; rando processes, processes that are easured, and process that are not easurable. [22], pointed out that identifying the exact echanis that generates the issing values is helpful in choosing the appropriate ethod to use in handling issing data Setup and Notations The notations and setup for the issingness echanis is described as follows; the coplete dataset is denoted by =, where represent the observed part of the data and the issing part of the data. Let the data

3 International Journal of Statistical Distributions and Applications 2017; 3(4): atrix with cases for variables be denoted by =, it is also assued that R is atrix of duy variables that irrors the data atrix, then for the response atrix, the issingness indicator is given as; = 1, 0, "# Hence, a issing value is defined as; Let $% = &' be that the () observation is not available (NA). Then the probability of not observing the variable X lying between two specified nubers * and for the () observation, that is +$% $*,% - would be given by the following conditional probability: +$% $*,% - = +$%= &' $% $*,%. (2) In view of this probability, [28], and [7] described the echaniss which cause the data to be issing ore forally as falling into one of the following three categories according to their dependency structure: issing copletely at rando (MCAR), issing at rando (MAR), and issing not at rando (MNAR) Missing Copletely at Rando (MCAR) Data are said to be issing copletely at rando (MCAR) when the probability that issing value on one variable is unrelated to the unit s score on any other variable, whether the other variables are observed or not; that is +"#,0 = +"#. The MCAR assuption is stringent and unreasonable, and rarely holds in real life situations because issingness is usually triggered by other variables in the dataset ([36]; [1]). The assuption of issing copletely at rando ay be reasonable when values are issing by design ([20]; [19]), issingness is copletely rando only when the probability +$% $*,% - is unrelated to the value of the variable X (or to the value of any other variable in the dataset) in the () observation, that is; 2: +$% $*,% - = +$%= &' (3) where 2 =41,2,,7 and +$% = &' is the probability of the observation $% being issing regardless of its value. The idea of values issing copletely at rando virtually appears in every technical paper on issing values, this issingness echanis can be confired using Little's MCAR test ([27]) Missing at Rando (MAR) Data are said to be issing at rando (MAR) when the probability of the issing data does not depend on the unobserved data ( ) but depends on the available inforation ( ), or equally the issing values are not randoly distributed across all observations but are randoly distributed within one or ore subsaples. MAR assuption is less stringent copared to issing copletely at rando assuption ([34]), it depends on observed values and can be justified by including auxiliary variables that either explains why values are issing or predicts the score (1) for the issing values. In reality, there are few auxiliary variables that can do both the functions of predicting issing values and explaining why the values got issing. The +"#,0 = +"# is to say that, the probability that is not observed in the interval $*,%, that is + $*,% - does not depend on the value of the variable X after controlling for another variable Y in the dataset, therefore; 4 2: 8$%=97: +$% $*,% - = +$% = &' (4) where 4 2:8$%=97 is the subset of those observations 8$% on the variable 8, in which 8 is equal to soe constant value 9. The MAR condition is soeties referred to as ignorable issingness because unbiased paraeter estiates can be obtained using direct axiu likelihood (DML) or ultiple iputation (MI) without the need to incorporate an explicit odel that explains why the data are issing ([15]) Missing Not at Rando (MNAR) Missing not at rando (MNAR) assuption is the ost probleatic aong the issingness echaniss as the issing values are not randoly distributed across all the observations, and neither this distribution is rando within any subset(s) that can be drawn fro the given dataset ([7]). The probability of issingness cannot be easily predicted fro the variables in the odel, that is +"#,0 cannot be quantified or siplified since the issingness depends on the issing value itself. The probability +$% $*,% - depends on the unobserved value of the variable X, i.e, 2: +$% $*,% - = :;<$%= >? <$% $A,% :;<$% $A,% (5) where +$% $*,% is the probability of the variable being in the interval $*,% in the () observation, regardless of whether this observation will be issing or not and the joint probability Pr$%= &' $% $*,% is the probability of the observation $% being issing while in the interval $*,%. With the issing observation being dependent on events or ites which the researcher has not easured, it is difficult or ipossible to evaluate the probability of the issing values. The issing not at rando echanis is referred to as non-ignorable issingness Methods of Handling Missing Data Most of the standard statistical ethods of data analysis are usually not applicable with incoplete data, therefore to get valid inferences about the population of study, the researcher needs to understand the iplications of issing data and decide on the best approach to use in accounting for the ([29]; [34]; [28]; [36]; [10]; [40]; [16]; [42]). In choosing the ost appropriate ethod to use in handling the challenges posed by issing data, it is iportant to understand how and why observations are issing and how uch influence they have on the results of the study as

4 90 Nicholas Pindar Dibal et al.: Challenges and Iplications of Missing Data on the Validity of Inferences and Options for Choosing the Right Strategy in Handling The pointed out by [40] and [7]. The best way to handle issing values and ensure valid inference is to coe up with a good design to prevent or reduce their occurrence in the first place ([24]), or repeat the experient to generate the coplete data set again which is not feasible especially where readings are taken at set ties or the cost of retesting is prohibitive. It should be noted that however hard we ay try; values ay still be issing for unanticipated reasons in soe surveys. The procedures and ethods used to account for issing values are not eant to recreate the issing values exactly but to ake valid and efficient inferences about the population of interest with or without issing data ([36]). The choice of the ethod of handling issing data is often related to particular data characteristics and to the goal of iputation ([23]). [36]) pointed out that the perforance of the different ethods of analysing incoplete data depends upon the ultiate goals of the analysis. In deciding which iputation ethod to use, [22] suggests that diagnostic checks be carried out on the iputation odel to help identify odel defects and facilitate odel iproveent. In the following section, several ad hoc and statistically principled ethods of handling issing data together with their benefits and drawbacks are discussed in detail to enable researchers and data analyst ake good decision on the best ethod to use in handling issing values ([34]) Listwise Deletion The ost coonly used ethod of handling issing values is listwise deletion (LD), this ethod siply discards cases with issing values thereby yielding coplete data set with reduced saple size. It is easy to ipleent, very fast to conduct and does not require recreation of the issing data. Most researchers especially those with little or no understanding of the iplications of issing data on the validity of results usually adapt this technique. Being the default in ost statistical software, it allows for the application of any statistical ethod of analysis and it has seen great increase in application without verification of the issingness echanis ([3]; [32]; [1]; [31]). The loss of observations reduces the saple size thereby inflating the standard errors and eventually resulting in invalid inferences. Unbiased paraeter estiates are possible only when data are MCAR, a condition which rarely holds in practice Pairwise Deletion (PD) Pairwise deletion is another ethod of handling issing values where all observed values on a subject are retained and issing values are only deleted in pairs as the analysis is carried out. This technique eliinates pair of cases if one or both values are unavailable and only cases with nonissing values are used to copute eans and variances, under this ethod, different calculations will utilise different cases with different saple sizes thereby producing undesirable effect ([3]; [9]). Pair-wise deletion is applicable and useful when data are MCAR, the issing cases involved are sall on each variable relative to the total saple size and large nubers of variables are involved ([8]; [3]). Due to the varying saple sizes for each analysis, this ethod is not recoended for easures such as correlation and covariance which are highly influenced by saple size Mean Substitution (MS) The ean substitution (MS) ethod replaces issing values with the ean of the observed cases once. Analysis is based on coplete data as the saple size is not altered for any variable, no collected inforation is discarded ([35]). It is fast and easy to ipleent, however it has several drawbacks. The ajor shortcoing of the ean substitution is that it does not take into account the uncertainties associated with the issing data thereby overestiating the saple size and artificially decreasing the variability between individuals' responses ([28]; [1]). This usually results in narrower confidence intervals and produces correlations which are negatively biased between pairs of variables. A different and better approach is to replace the issing values with sub-group ean within the data set, for exaple when handling longitudinal data, issing score can be replaced with the ean of individual s responses on other waves Regression-Based Iputation (RI) The regression-based iputation is another ethod of handling issing data where the issing cases are replaced with predicted values derived fro regression equation based on observed values of the variables in the data set that are coplete. Also referred to as conditional ean iputation, it is probably one of the best aong the siple ad-hoc ethods because cases with issing values are preserved and saple size is aintained. It is ore inforative since all existing inforation are utilized. The shortcoings of this ethod are; values beyond the logical range of the data ay be iputed thereby distorting inferences, choosing the right regression odel to fit the given data is challenging, it is not suitable for application on ultivariate data having ore than one variable with issing values, and large saple size is required to produce valid estiates Hot-Deck and Cold-Deck Iputation Hot-deck iputation replaces issing values with nonissing values taken fro a randoly selected, closely atched observation in the sae data set as the observation containing the issing value. Cold-deck iputation replaces issing values fro observations atched in a different data set. The hot-deck ethod is slightly ore coplex than the other ad-hoc procedures discussed, it involves several steps; first, copletely observed variables are separated fro incoplete ones. Next, both the incoplete variables and copletely observed variables with siilar attributes are grouped together on the basis of soe characteristics (e.g., household size, job category, incoe, educational level, etc.). The issing values are replaced with values randoly drawn fro fully observed individuals having siilar factors that predict issingness. In case of ultiple individuals atching the ite with issing values, the ean score of the atching individuals or rando draw fro the distribution of the

5 International Journal of Statistical Distributions and Applications 2017; 3(4): donor cases can be used. The proble with the hot deck ethod is that it grossly underestiates the variability in the saple as copared to the coplete data, however it is better than the ean iputation Multiple Iputation (MI) Multiple iputation is a flexible predictive approach that replaces issing data by creating " D1 plausible replaceents developed by [34] to address the shortcoings associated with the single iputation ethods. Figure 1. Multiple Iputation Steps. The procedure takes into account the uncertainty associated with the issing values, this ethod is applicable when data are issing at rando. Unlike the ad hoc ethods, ultiple iputation cobines both classical and Bayesian statistical techniques using suitable odels to create " iputed datasets which takes into account the uncertainties associated with the issing values ([34]). The usefulness of ultiple iputation in accounting for issing values has been docuented in any research works ([35]; [3]; [28]; [1]; [7]). The iputation odel should be specified accurately so that the degree of uncertainty about the issing data will be adequately reflected and, data relationships and associations are preserved ([3]; [30]; [22]). When using ultiple iputation, we assue that observations are issing at rando (MAR) to ake it possible to ignore the process that causes the issing data ([30]). Under MAR, the ultiple iputation approach retains the advantages of axiu likelihood (ML) ethod and allows the uncertainty caused by the iputation to be incorporated into the coplete-data analysis ([22]). Multiple iputation consists of three basic steps outlined as; i) iputation step to create coplete datasets using the chosen iputation odel, ii) analysing each of the -iputed dataset separately, and iii) cobining the results into a single estiate using the Rubin s rule. These steps are presented in Figure 1. Multiple iputation (MI) allows for the use of coplete-data analysis ethods and incorporates rando errors to account for the uncertainties associated with the issing values ([34]). Multiple iputation perfors better by iniizing the standard errors and increasing the efficiency of estiates as copared to the single iputation ethods and, can be ipleented using any odel on any data without requiring specialized software. Rando seed should be set to ensure reproducibility of results. There are several nuber of ultiple iputation (MI)) approaches which have been proposed for dealing with issing data probles. [13], cobine the advantages of kernel estiators where kernelbased sapling weights were developed to create iputations, and the popular doubly robust ethods developed to handle the isspecification of the outcoe odel. They used these two strategies to develop a kernelbased doubly robust MI ethod which is ore robust than paraetric alternatives against the isspecification of the outcoe odel Pooling the Paraeter Estiates for Inference Each of the iputed dataset is analysed using coplete-data analysis ethod specified for the research under identical conditions. To draw inference about the population, all the " estiates of the paraeters being considered are cobined into single values ([34]; [38]) thereby yielding; ˆ Qi (6) i= 1 Q = where EF is the pooled paraeter estiate and EG 1 " is the paraeter estiate for the () iputed coplete dataset. The cobined standard error using Rubin s rules ([34]) is slightly ore coplex as there are two coponents that ake up the total error. Let EG be the estiate of a scalar quantity of interest obtained fro the iputed dataset 1 " and H is the standard error associated with EG.

6 92 Nicholas Pindar Dibal et al.: Challenges and Iplications of Missing Data on the Validity of Inferences and Options for Choosing the Right Strategy in Handling The The overall standard error for the iputation process is the su of within and between variances, where; the withiniputation variance is represented by 1 Ui i = 1 and the between-iputation variance by U = (7) K1+ M NOP (11) where is the nuber of iputed datasets and Q the proportion of issing values in the data set. Inference based on ultiple iputation are ore efficient than those based on the ad hoc ethods which either discard cases with issing values or ipute once without taking into account the uncertainties associated with the issing values ([28]). i= 1 ( Q ) 2 i Q 1 B = ˆ (8) 1 and the total variance is then calculated by adding the within and between variances ([21]) as follows: 1 T = U + 1+ B The overall standard error and confidence intervals associated with the paraeters of interest can be estiated in the noral way while the degrees of freedo is given as; df = ( 1) 1 + U ( + 1) B 2 (9) (10) The rate of issing data is used to deterine the nuber of data sets to ipute, [34] and [3] suggest that for practical purposes, sall nuber " 5 of repeated iputations is adequate to produce estiates which give valid inferences. Soe scholars however recoend that ranges fro 20 to 100 or ore as using large nuber of iputed datasets iproves the stability of results ore especially when estiating sall size effect ([21]; [2]). The relative efficiency of the estiates could however be evaluated using 3. Siulation and Results Table 2. Paraeter Estiates under Different Iputation Methods. Four siulated data on disk polishing was used to illustrate soe ethods for handling issing data. The perforance of the different ethods were copared using the following datasets; a coplete data set (to serve as control). b part of the coplete data were set issing at 5% and 35% and analyzed using listwise deletion. c use ean substitution to analyze the two datasets in (ii). d use ultiple iputation to analyze the two datasets in (ii). The usefulness of soe of the ethods of handling issing data is deonstrated using the siulated data on the tie it takes to polish a disk based on the thickness of the disk (), diaeter of the disk (c) and aount of hardener (gras) added to the cast. Fifty nine (59) sets of easureents with 5% and 35% of values set issing randoly were used. Regression analysis was used on the coplete data to odel the tie it takes to polish a disk to serve as a control. Three ethods of handling issing data; Listwise Deletion (LD), Mean Substitution (MS) and Multiple Iputation (MI) were studied and copared with the control. The paraeter estiates together with their standard errors are presented in Table 2. Missing Rate Intercept Diaeter Thickness Hardener Iputation Methods Control LD MS MI 0% 5% 35% 5% 35% 5% 35% (6.78) (7.48) (15.31) (7.74) (9.36) (7.2) (8.53) (0.51) (0.53) (0.99) (0.55) (0.64) (0.53) (0.53) (1.59) (1.59) (4.07) (1.67) (3.14) (1.64) (2.45) (3.05) (3.59) (4.42) (3.42) (3.59) (3.27) (3.14) 4. Discussion With LD, any case with at least one issing value is oitted, hence the residual standard error was calculated on 44 degrees of freedo as 11 observations were deleted with 5% of the observations issing giving estiates coparable with the coplete case. With 35% issingness rate, 49 of the observations were deleted thereby giving biased estiates. With ean substitution (MS), no observations were lost but substituting with the ean of the variables reduces the variation which eventually bias the estiate at both 5% and 35%. Estiates obtained using MI were coparable to the coplete case under both sall and large rates of issing values as all issing values were replaced predictively 15 ties and no case was deleted. Handling issing values therefore requires better understanding of the pattern of issingness and the reason why the observations got issing

7 International Journal of Statistical Distributions and Applications 2017; 3(4): to enable the choice of the best ethod to use. 5. Conclusion The ethods discussed are all useful in handling issing data, however their application depends on how uch of the data are issing and what causes the data to be issing. This paper provide researchers, ore especially those with little or no understanding of statistics with guidance on how to handle issing data efficiently. The detail description of the issing data echaniss, pattern of issing values and why the data got issing will adequately equip the researcher to ake infored decision in the design of the research and the choice of the ethod to use. In all, the choice of ethod for handling issing data should be guided by the need to preserve the essential characteristics of the data, aintain the representativeness of the analyzed data, provide valid statistical inference (control Type I error), axiize the statistical power of the test (iniize Type II error), and avoid bias. However it is recoended that researchers should put in place easures that iniise the occurrence of issing values in the first instance to enhance the quality and validity of inferences obtained fro incoplete data. Consultation with professionals and experts in survey design and experientation could be the first step in overcoing the challenges of issing data. References [1] Acock, A. C. (2005). Working with issing values. Journal of Marriage and Faily. 67; [2] Ader, H. J. {2008). Missing data. In Ader, H. J. & Mellenbergh, G. J. (Eds). Advising on research Methods: A consultant s copanion. (pp ). Huizen, The Netherlands: Johannes van Kessel Publishing. [3] Allison, P. D. (2000). Multiple iputation for issing data: A cautionary tale. Sociological Methods & Research, 28 (3); [4] Allison, P. D. (2003). Missing data techniques for structural equation odeling. Journal of Abnoral Psychology. 112 (4); [5] Beasley, T. M. (1988). Coents on the analysis of data with issing values. Multiple Linear Regression Viewpoints. 25, [6] Blackwell, M., Honaker, J. and King, G (2011). Multiple Over iputation: A Unified Approach to Measureent Error and Missing Data. [7] Bolotin, A. (2010). Anew ethod of ultiple iputation for copletely (or alost copletely) issing data. Proceeding MACMESE'10 Proceedings of the 2 th WSEASinternational conference on Matheatical and coputational ethods in science and engineering. [8] Burke, S. (1998). Missing values, outliers, robust statistics & non-paraetric ethods. LC GC Europe Online Suppleent. Scientific Data Manageent. 2 (2), [9] Carpenter, J. R. (2010). Statistical odeling with issing data using ultiple Iputation. www. issingdata. org.uk. [10] Carpenter, J. R. and Kenward, M. G. (2008). Missing data in randoized controlled trials-a practical guide. Biringha: National Health Service Coordinating Centre for Research Methodology, [11] Carter, R. L. (2006). Solutions for issing data in structural equation odeling. Research & Practice in Assessent. 1 (1); [12] Chen, S. (2014) "Iputation of issing values using quantile regression". Unpublished Graduate Theses and Dissertations. Iowa State University. [13] Chiu-Hsieh, H., He, Y., Li, Y., Long, Q. and Friese, R. (2016). Doubly robust ultiple iputation using kernel-based techniques. Bio J. 58(3): [14] Davey, A. and Savla, J. (2010). Statistical power analysis with issing data: A structural equation odeling approach. NY: Routledge; [15] Enders, C. K. (2006). A prier on the use of odern issingdata ethods in psychosoatic edicine research. Psychosoatic Medicine, 68; [16] Fisher, A. and Waclawski, A. (2009). A survey of techniques for identifying and handling outliers and issing values in tie series data. 29 th International Syposiu on Forecasting. Hong Kong. ww.forecasters.org/isf. [17] Foster, P. J., Mai, M. A. and Bala, A. M. (2009). On treatent of the ultivariate issing data. Research Report No. 13, Probability and Statistics Group, School of Matheatics. The University of Manchester. [18] Glas, C. A. W. and Pientel, J. L. (2008). Modeling Nonignorable Missing Data in Speeded Tests. Educational and Psychological Measureent 68 (6), [19] Graha, J. W. (2009). Missing data analysis: Making it work in the real world. Annual Review of Psychology, 60; [20] Graha, J. W., Cusille, P. E. and Elek-Fisk, E. (2003). Methods for handling issing data. In J. A. Schinka & W. F. Velicer (Eds.), Research Methods in Psychology (pp ). Handbook of Psychology, New York: John Wiley & Sons. [21] Graha, J. W., Olchowski, A. E., and Gilreath, T. D. (2007). How any iputations are really needed: Soe practical clarifications of ultiple iputation theory. Prevention Science, 8, [22] He, Y. (2010). Missing data analysis: Getting to the heart of the atter. Journal of the Aerican Heart Association. 3; [23] Horton, N. J. and Kleinan, K. P. (2007). Much ado about nothing: A coparison of issing data ethods and software to fit incoplete data regression odels. The Aerican Statistician, 61(1), [24] Kang, H. (2013). The prevention and handling of the issing data. Korean J Anesthesiol.; 64(5): [25] Kenward, M. (2007). Missing Data with MLwiN: An overview. A Paper Presented at Researcher Developent Initiative Workshop. London School of Hygiene and Tropical Medicine.

8 94 Nicholas Pindar Dibal et al.: Challenges and Iplications of Missing Data on the Validity of Inferences and Options for Choosing the Right Strategy in Handling The [26] Ki, J. O. and Curry, J. (1977). The treatent of issing data in ultivariate analysis. Sociological Methods & Research, 6 (2); [27] Litttle, R. J. A. (1988). A test of issing copletely at rando data with issing values. Journal of the Aerica Statistical Association. 83 (404); [28] Little, R. J. A. and Rubin, D. B. (2002). Statistical analysis with issing data, (2 nd ed.). New York: John Wiley & Sons. [29] Okafor, R. (1982). Bias due to logistic non-response in saple survey. (Unpublished Ph. D. Thesis Subitted to the Departent of Statistics, Harvard University. Cabridge, Massachusetts). [30] Patrcian, A. P. (2002). Focus on research ethods: Multiple iputation for issing data. Research in Nursing and Health, 25, [31] Peng, C. Y., Harwell, M. R., Liou, S. M., & Ehan, L. H. (2006). Advances in issing data ethods and iplications for educational research. In S. S. Sawilowsky (Ed.), Real Data Analysis. (pp ). New York. [32] Pigott, T. D. (2001). A review of ethods for issing data. Educational Research and Evaluation, 7 (4); [33] Rubin, D. B. (1976). Inference and issing data. Bioetrika, 63; [34] Rubin, D. B. (1987). Multiple iputation for non-response in Surveys. New York: Wiley. [35] Schafer, J. L. (1997). Analysis of incoplete ultivariate data, New York: Chapan and Hall. [36] Schafer, J. L. and Graha, J. W. (2002). Missing data: Our view of the state of the art. Psychological Methods, 7(2); [37] Schwartz, T., Chen, NY Q. and Duan, NY N. (2011). Studying issing data patterns using a SAS Macro. Statistics and Data Analysis, SAS Global Foru. Paper 339. [38] Song, Q., Shepperd, M., Cartwright M. and Twala, B. (2005), A New iputation ethod for sall software project data sets. [39] Stratton, I. M., and Aldington, S. J. (2007). Missing data eans lost opportunities. Journal of Clinical Research Best Practices. 3, (5). [40] Stuart, E. A., Azur, M., Frangakis, C. and Leaf, P. (2009). Multiple iputation with large data set: A case study of the children s ental health initiative. Aerican Journal of Epideiology, 169 (9); [41] Swetha, S. (2016). An Integral Study on Missing Value Data Iputation. International Journal of Engineering Sciences & Research Technology. 5(2) [42] Todorov, V., Tepl, M. and Filzoser, P. (2011). Detection of ultivariate outliers in business survey data with incoplete inforation. Advance Data Analyses and Classification. 5; [43] vonhippel, P. T. (2013). Should a noral iputation odel be odified to ipute skewed variables? Sociological Methods and Research, 42 (1); [44] Wayan, J. C. (2003). Multiple iputation for issing data: What is it and how can I use it? Paper presented at the Annual Meeting of the Aerican Educational Research Association, Chicago, IL.

Tucker, L. R, & Lewis, C. (1973). A reliability coefficient for maximum likelihood factor

Tucker, L. R, & Lewis, C. (1973). A reliability coefficient for maximum likelihood factor T&L article, version of 6/7/016, p. 1 Tucker, L. R, & Lewis, C. (1973). A reliability coefficient for axiu likelihood factor analysis. Psychoetrika, 38, 1-10 (4094 citations according to Google Scholar

More information

Results Univariable analyses showed that heterogeneity variances were, on average, increased among trials at

Results Univariable analyses showed that heterogeneity variances were, on average, increased among trials at Between-trial heterogeneity in eta-analyses ay be partially explained by reported design characteristics KM Rhodes 1, RM Turner 1,, J Savović 3,4, E Jones 3, D Mawdsley 5, JPT iggins 3 1 MRC Biostatistics

More information

Bivariate Quantitative Trait Linkage Analysis: Pleiotropy Versus Co-incident Linkages

Bivariate Quantitative Trait Linkage Analysis: Pleiotropy Versus Co-incident Linkages Genetic Epideiology 14:953!958 (1997) Bivariate Quantitative Trait Linkage Analysis: Pleiotropy Versus Co-incident Linkages Laura Alasy, Thoas D. Dyer, and John Blangero Departent of Genetics, Southwest

More information

Predicting Time Spent with Physician

Predicting Time Spent with Physician Ji Zheng jizheng@stanford.edu Stanford University, Coputer Science Dept., 353 Serra Mall, Stanford, CA 94305 USA Ioannis (Yannis) Petousis petousis@stanford.edu Stanford University, Electrical Engineering

More information

A Comparison of Poisson Model and Modified Poisson Model in Modelling Relative Risk of Childhood Diabetes in Kenya

A Comparison of Poisson Model and Modified Poisson Model in Modelling Relative Risk of Childhood Diabetes in Kenya Aerican Journal of Theoretical and Applied Statistics 2018; 7(5): 193-199 http://www.sciencepublishinggroup.co/j/ajtas doi: 10.11648/j.ajtas.20180705.15 ISSN: 2326-8999 (Print); ISSN: 2326-9006 (Online)

More information

Missing Data and Imputation

Missing Data and Imputation Missing Data and Imputation Barnali Das NAACCR Webinar May 2016 Outline Basic concepts Missing data mechanisms Methods used to handle missing data 1 What are missing data? General term: data we intended

More information

The sensitivity analysis of hypergame equilibrium

The sensitivity analysis of hypergame equilibrium 3rd International Conference on Manageent, Education, Inforation and Control (MEICI 015) The sensitivity analysis of hypergae equilibriu Zhongfu Qin 1,a Xianrong Wei 1,b Jingping Li 1,c 1 College of Civil

More information

Matching Methods for High-Dimensional Data with Applications to Text

Matching Methods for High-Dimensional Data with Applications to Text Matching Methods for High-Diensional Data with Applications to Text Margaret E. Roberts, Brandon M. Stewart, and Richard Nielsen This draft: October 6, 2015 We thank the following for helpful coents and

More information

Adaptive visual attention model

Adaptive visual attention model H. Hügli, A. Bur, Adaptive Visual Attention Model, Proceedings of Iage and Vision Coputing New Zealand 2007, pp. 233 237, Hailton, New Zealand, Deceber 2007. Adaptive visual attention odel H. Hügli and

More information

Fuzzy Analytical Hierarchy Process for Ecological Risk Assessment

Fuzzy Analytical Hierarchy Process for Ecological Risk Assessment Inforation Technology and Manageent Science Fuzzy Analytical Hierarchy Process for Ecological Risk Assessent Andres Radionovs 1 Oļegs Užga-Rebrovs 2 1 2 Rezekne Acadey of Technologies ISSN 2255-9094 (online)

More information

Evolution of Indirect Reciprocity by Social Information: The Role of

Evolution of Indirect Reciprocity by Social Information: The Role of 1 Title: Evolution of indirect reciprocity by social inforation: the role of Trust and reputation in evolution of altruis Author Affiliation: Mojdeh Mohtashei* and Lik Mui* *Laboratory for Coputer Science,

More information

How Should Blood Glucose Meter System Analytical Performance Be Assessed?

How Should Blood Glucose Meter System Analytical Performance Be Assessed? 598599DSTXXX1.1177/1932296815598599Journal of Diabetes Science and TechnologySions research-article215 Coentary How Should Blood Glucose Meter Syste Analytical Perforance Be Assessed? Journal of Diabetes

More information

OBESITY EPIDEMICS MODELLING BY USING INTELLIGENT AGENTS

OBESITY EPIDEMICS MODELLING BY USING INTELLIGENT AGENTS OBESITY EPIDEMICS MODELLING BY USING INTELLIGENT AGENTS Agostino G. Bruzzone, MISS DIPTEM University of Genoa Eail agostino@iti.unige.it - URL www.iti.unige.i t Vera Novak BIDMC, Harvard Medical School

More information

Optical coherence tomography (OCT) is a noninvasive

Optical coherence tomography (OCT) is a noninvasive Coparison of Optical Coherence Toography in Diabetic Macular Edea, with and without Reading Center Manual Grading fro a Clinical Trials Perspective Ada R. Glassan, 1 Roy W. Beck, 1 David J. Browning, 2

More information

Keywords: meta-epidemiology; randomised trials; heterogeneity; Bayesian methods; Cochrane

Keywords: meta-epidemiology; randomised trials; heterogeneity; Bayesian methods; Cochrane Label-invariant odels for the analysis of eta-epideiological data KM Rhodes 1, D Mawdsley, RM Turner 1,3, HE Jones 4, J Savović 4,5, JPT Higgins 4 1 MRC Biostatistics Unit, School of Clinical Medicine,

More information

Logistic Regression with Missing Data: A Comparison of Handling Methods, and Effects of Percent Missing Values

Logistic Regression with Missing Data: A Comparison of Handling Methods, and Effects of Percent Missing Values Logistic Regression with Missing Data: A Comparison of Handling Methods, and Effects of Percent Missing Values Sutthipong Meeyai School of Transportation Engineering, Suranaree University of Technology,

More information

Data Mining Techniques for Performance Evaluation of Diagnosis in

Data Mining Techniques for Performance Evaluation of Diagnosis in ISSN: 2347-3215 Volue 2 Nuber 1 (October-214) pp. 91-98 www.ijcrar.co Data Mining Techniques for Perforance Evaluation of Diagnosis in Gestational Diabetes Srideivanai Nagarajan 1*, R.M.Chandrasekaran

More information

Performance Measurement Parameter Selection of PHM System for Armored Vehicles Based on Entropy Weight Ideal Point. Yuanhong Liu

Performance Measurement Parameter Selection of PHM System for Armored Vehicles Based on Entropy Weight Ideal Point. Yuanhong Liu nd International Conference on Coputer Engineering, Inforation Science & Application Technology (ICCIA 17) Perforance Measureent Paraeter Selection of PHM Syste for Arored Vehicles Based on Entropy Weight

More information

Implications of ASHRAE s Guidance On Ventilation for Smoking-Permitted Areas

Implications of ASHRAE s Guidance On Ventilation for Smoking-Permitted Areas Copyright 24, Aerican Society of Heating, Refrigerating and Air-Conditioning Engineers, Inc. This posting is by perission fro ASHRAE Journal. This article ay not be copied nor distributed in either paper

More information

Survival and Probability of Cure Without and With Operation in Complete Atrioventricular Canal

Survival and Probability of Cure Without and With Operation in Complete Atrioventricular Canal ORIGINAL ARTICLES Survival and Probability of Cure Without and With Operation in Coplete Atrioventricular Canal Thoas J. Berger, M.D., Eugene H. Blackstone, M.D., John W. Kirklin, M.D., L. M. Bargeron,

More information

Multiple Imputation For Missing Data: What Is It And How Can I Use It?

Multiple Imputation For Missing Data: What Is It And How Can I Use It? Multiple Imputation For Missing Data: What Is It And How Can I Use It? Jeffrey C. Wayman, Ph.D. Center for Social Organization of Schools Johns Hopkins University jwayman@csos.jhu.edu www.csos.jhu.edu

More information

CHAPTER 7 THE HIV TRANSMISSION DYNAMICS MODEL FOR FIVE MAJOR RISK GROUPS

CHAPTER 7 THE HIV TRANSMISSION DYNAMICS MODEL FOR FIVE MAJOR RISK GROUPS CHAPTER 7 THE HIV TRANSMISSION DYNAMICS MODEL FOR FIVE MAJOR RISK GROUPS Chapters 2 and 3 have focused on odeling the transission dynaics of HIV and the progression to AIDS for hoosexual en. That odel

More information

FAST ACQUISITION OF OTOACOUSTIC EMISSIONS BY MEANS OF PRINCIPAL COMPONENT ANALYSIS

FAST ACQUISITION OF OTOACOUSTIC EMISSIONS BY MEANS OF PRINCIPAL COMPONENT ANALYSIS FAST ACQUISITION OF OTOACOUSTIC EMISSIONS BY MEANS OF PRINCIPAL COMPONENT ANALYSIS P. Ravazzani 1, G. Tognola 1, M. Parazzini 1,2, F. Grandori 1 1 Centro di Ingegneria Bioedica CNR, Milan, Italy 2 Dipartiento

More information

Advanced Handling of Missing Data

Advanced Handling of Missing Data Advanced Handling of Missing Data One-day Workshop Nicole Janz ssrmcta@hermes.cam.ac.uk 2 Goals Discuss types of missingness Know advantages & disadvantages of missing data methods Learn multiple imputation

More information

S Imputation of Categorical Missing Data: A comparison of Multivariate Normal and. Multinomial Methods. Holmes Finch.

S Imputation of Categorical Missing Data: A comparison of Multivariate Normal and. Multinomial Methods. Holmes Finch. S05-2008 Imputation of Categorical Missing Data: A comparison of Multivariate Normal and Abstract Multinomial Methods Holmes Finch Matt Margraf Ball State University Procedures for the imputation of missing

More information

Module 14: Missing Data Concepts

Module 14: Missing Data Concepts Module 14: Missing Data Concepts Jonathan Bartlett & James Carpenter London School of Hygiene & Tropical Medicine Supported by ESRC grant RES 189-25-0103 and MRC grant G0900724 Pre-requisites Module 3

More information

Selected Topics in Biostatistics Seminar Series. Missing Data. Sponsored by: Center For Clinical Investigation and Cleveland CTSC

Selected Topics in Biostatistics Seminar Series. Missing Data. Sponsored by: Center For Clinical Investigation and Cleveland CTSC Selected Topics in Biostatistics Seminar Series Missing Data Sponsored by: Center For Clinical Investigation and Cleveland CTSC Brian Schmotzer, MS Biostatistician, CCI Statistical Sciences Core brian.schmotzer@case.edu

More information

A new approach for epileptic seizure detection: sample entropy based feature extraction and extreme learning machine

A new approach for epileptic seizure detection: sample entropy based feature extraction and extreme learning machine J. Bioedical Science and Engineering, 2010, 3, 556-567 doi:10.4236/jbise.2010.36078 Published Online June 2010 (http://www.scirp.org/journal/jbise/). A new approach for epileptic seizure detection: saple

More information

The Roles of Beliefs, Information, and Convenience. in the American Diet

The Roles of Beliefs, Information, and Convenience. in the American Diet The Roles of Beliefs, Inforation, and Convenience in the Aerican Diet Selected Paper Presented at the AAEA Annual Meeting 2002 Long Beach, July 28 th -31 st Lisa Mancino PhD Candidate University of Minnesota

More information

Bayesian Networks Modeling for Crop Diseases

Bayesian Networks Modeling for Crop Diseases Bayesian Networs Modeling for Crop Diseases Chunguang Bi and Guifen Chen College of nforation & Technology, Jilin gricultural University, Changchun, China Bi_chunguan@126.co, guifchen@163.co bstract. Severe

More information

Help! Statistics! Missing data. An introduction

Help! Statistics! Missing data. An introduction Help! Statistics! Missing data. An introduction Sacha la Bastide-van Gemert Medical Statistics and Decision Making Department of Epidemiology UMCG Help! Statistics! Lunch time lectures What? Frequently

More information

Beating by hitting: Group Competition and Punishment

Beating by hitting: Group Competition and Punishment Beating by hitting: Group Copetition and Punishent Eva van den Broek *, Martijn Egas **, Laurens Goes **, Arno Riedl *** This version: February, 2008 Abstract Both group copetition and altruistic punishent

More information

Physical Activity Training for

Physical Activity Training for Physical Activity Training for Functional Mobility in Older Persons To Hickey Fredric M. Wolf Lynne S. Robins University of Michigan Marilyn B. Wagner Cleveland State University Wafa Harik Case Western

More information

Exploring the Impact of Missing Data in Multiple Regression

Exploring the Impact of Missing Data in Multiple Regression Exploring the Impact of Missing Data in Multiple Regression Michael G Kenward London School of Hygiene and Tropical Medicine 28th May 2015 1. Introduction In this note we are concerned with the conduct

More information

Direct in situ measurement of specific capacitance, monolayer tension, and bilayer tension in a droplet interface bilayer

Direct in situ measurement of specific capacitance, monolayer tension, and bilayer tension in a droplet interface bilayer Electronic Suppleentary Material (ESI) for Soft Matter. This journal is The Royal Society of Cheistry 2015 Taylor et al. Electronic Supporting Inforation Direct in situ easureent of specific capacitance,

More information

Application of Factor Analysis on Academic Performance of Pupils

Application of Factor Analysis on Academic Performance of Pupils Aerican Journal of Applied Matheatics and Statistics, 07, Vol. 5, No. 5, 64-68 Available online at http://pubs.sciepub.co/aas/5/5/ Science and Education Publishing DOI:0.69/aas-5-5- Application of Factor

More information

Toll Pricing. Computational Tests for Capturing Heterogeneity of User Preferences. Lan Jiang and Hani S. Mahmassani

Toll Pricing. Computational Tests for Capturing Heterogeneity of User Preferences. Lan Jiang and Hani S. Mahmassani Toll Pricing Coputational Tests for Capturing Heterogeneity of User Preferences Lan Jiang and Hani S. Mahassani Because of the increasing interest in ipleentation and exploration of a wider range of pricing

More information

Assessment of Human Random Number Generation for Biometric Verification ABSTRACT

Assessment of Human Random Number Generation for Biometric Verification ABSTRACT Original Article www.jss.ui.ac.ir Assessent of Huan Rando Nuber Generation for Bioetric Verification Elha Jokar, Mohaad Mikaili Departent of Engineering, Shahed University, Tehran, Iran Subission: 07-01-2012

More information

Learning the topology of the genome from protein-dna interactions

Learning the topology of the genome from protein-dna interactions Learning the topology of the genoe fro protein-dna interactions Suhas S.P. Rao, SUnet ID: suhasrao Stanford University I. Introduction A central proble in genetics is how the genoe (which easures 2 eters

More information

Statistical data preparation: management of missing values and outliers

Statistical data preparation: management of missing values and outliers KJA Korean Journal of Anesthesiology Statistical Round pissn 2005-6419 eissn 2005-7563 Statistical data preparation: management of missing values and outliers Sang Kyu Kwak 1 and Jong Hae Kim 2 Departments

More information

Dendritic Inhibition Enhances Neural Coding Properties

Dendritic Inhibition Enhances Neural Coding Properties Dendritic Inhibition Enhances Neural Coding Properties M.W. Spratling and M.H. Johnson Centre for Brain and Cognitive Developent, Birkbeck College, London, UK The presence of a large nuber of inhibitory

More information

Follicle Detection in Digital Ultrasound Images using Bidimensional Empirical Mode Decomposition and Fuzzy C-means Clustering Algorithm

Follicle Detection in Digital Ultrasound Images using Bidimensional Empirical Mode Decomposition and Fuzzy C-means Clustering Algorithm Follicle Detection in Digital Ultrasound Iages using Bidiensional Epirical Mode Decoposition and Fuzzy C-eans Clustering Algorith M.Jayanthi Rao @, Dr.R.Kiran Kuar # @ Research Scholar, Departent of CS,

More information

A COMPARISON OF IMPUTATION METHODS FOR MISSING DATA IN A MULTI-CENTER RANDOMIZED CLINICAL TRIAL: THE IMPACT STUDY

A COMPARISON OF IMPUTATION METHODS FOR MISSING DATA IN A MULTI-CENTER RANDOMIZED CLINICAL TRIAL: THE IMPACT STUDY A COMPARISON OF IMPUTATION METHODS FOR MISSING DATA IN A MULTI-CENTER RANDOMIZED CLINICAL TRIAL: THE IMPACT STUDY Lingqi Tang 1, Thomas R. Belin 2, and Juwon Song 2 1 Center for Health Services Research,

More information

The prevention and handling of the missing data

The prevention and handling of the missing data Review Article Korean J Anesthesiol 2013 May 64(5): 402-406 http://dx.doi.org/10.4097/kjae.2013.64.5.402 The prevention and handling of the missing data Department of Anesthesiology and Pain Medicine,

More information

Evolutionary and Ecological Perspectives on Systems Diseases using Agent-based Modeling

Evolutionary and Ecological Perspectives on Systems Diseases using Agent-based Modeling Evolutionary and Ecological Perspectives on Systes Diseases using Agent-based Modeling Swarfest 2014 Notre Dae University South Bend, IN, June 30, 2014 Gary An, MD Associate Professor of Surgery Departent

More information

DIET QUALITY AND CALORIES CONSUMED: THE IMPACT OF BEING HUNGRIER, BUSIER AND EATING OUT

DIET QUALITY AND CALORIES CONSUMED: THE IMPACT OF BEING HUNGRIER, BUSIER AND EATING OUT Working Paper 04-02 The Food Industry Center University of Minnesota Printed Copy $25.50 DIET QUALITY AND CALORIES CONSUMED: THE IMPACT OF BEING HUNGRIER, BUSIER AND EATING OUT Lisa Mancino and Jean Kinsey

More information

arxiv: v1 [cs.lg] 28 Nov 2017

arxiv: v1 [cs.lg] 28 Nov 2017 Snorkel: Rapid Training Data Creation with Weak Supervision Alexander Ratner Stephen H. Bach Henry Ehrenberg Jason Fries Sen Wu Christopher Ré Stanford University Stanford, CA, USA {ajratner, bach, henryre,

More information

AUC Optimization vs. Error Rate Minimization

AUC Optimization vs. Error Rate Minimization AUC Optiization vs. Error Rate Miniization Corinna Cortes and Mehryar Mohri AT&T Labs Research 180 Park Avenue, Florha Park, NJ 0793, USA {corinna, ohri}@research.att.co Abstract The area under an ROC

More information

LONG-TERM PROGNOSIS OF SEIZURES WITH ONSET IN CHILDHOOD LONG-TERM PROGNOSIS OF SEIZURES WITH ONSET IN CHILDHOOD. Patients

LONG-TERM PROGNOSIS OF SEIZURES WITH ONSET IN CHILDHOOD LONG-TERM PROGNOSIS OF SEIZURES WITH ONSET IN CHILDHOOD. Patients LONG-TERM PROGNOSIS OF SEIZURES WITH ONSET IN CHILDHOOD LONG-TERM PROGNOSIS OF SEIZURES WITH ONSET IN CHILDHOOD MATTI SILLANPÄÄ, M.D., PH.D., MERJA JALAVA, M.D., PH.D., OLLI KALEVA, B.SC., AND SHLOMO SHINNAR,

More information

Biomedical Research 2016; Special Issue: S178-S185 ISSN X

Biomedical Research 2016; Special Issue: S178-S185 ISSN X Bioedical Research 2016; Special Issue: S178-S185 ISSN 0970-938X www.bioedres.info A novel autoatic stepwise signal processing based coputer aided diagnosis syste for epilepsy-seizure detection and classification

More information

Validity and reliability of measurements

Validity and reliability of measurements Validity and reliability of measurements 2 3 Request: Intention to treat Intention to treat and per protocol dealing with cross-overs (ref Hulley 2013) For example: Patients who did not take/get the medication

More information

Selective Averaging of Rapidly Presented Individual Trials Using fmri

Selective Averaging of Rapidly Presented Individual Trials Using fmri Huan Brain Mapping 5:329 340(1997) Selective Averaging of Rapidly Presented Individual Trials Using fmri Anders M. Dale* and Randy L. Buckner Massachusetts General Hospital Nuclear Magnetic Resonance Center

More information

Investigation of Binaural Interference in Normal-Hearing and Hearing-Impaired Adults

Investigation of Binaural Interference in Normal-Hearing and Hearing-Impaired Adults J A Acad Audiol 11 : 494-500 (2000) Investigation of Binaural Interference in Noral-Hearing and Hearing-Ipaired Adults Rose L. Allen* Brady M. Schwab* Jerry L. Cranford* Michael D. Carpenter* Abstract

More information

Validity and reliability of measurements

Validity and reliability of measurements Validity and reliability of measurements 2 Validity and reliability of measurements 4 5 Components in a dataset Why bother (examples from research) What is reliability? What is validity? How should I treat

More information

Human Biology Open Access Pre-Prints

Human Biology Open Access Pre-Prints Wayne State University Huan Biology Open Access Pre-Prints WSU Press -- New Approaches to Juvenile Age Estiation in Forensics: Application of Transition Analysis via the Shackelford et al. Method to a

More information

Fig.1. Block Diagram of ECG classification. 2013, IJARCSSE All Rights Reserved Page 205

Fig.1. Block Diagram of ECG classification. 2013, IJARCSSE All Rights Reserved Page 205 Volue 3, Issue 9, Septeber 2013 ISSN: 2277 128X International Journal of Advanced Research in Coputer Science and Software Engineering Research Paper Available online at: www.ijarcsse.co Autoatic Classification

More information

AIDS Epidemiology. Min Shim Math 164: Scientific Computing. April 30, 2004

AIDS Epidemiology. Min Shim Math 164: Scientific Computing. April 30, 2004 AIDS Epideiology Min Shi Math 64: Scientiic Coputing April 30, 004 Abstract Thopson s AIDS epideic odel, which is orulated in his article AIDS: The Misanageent o an Epideic, published in Great Britain,

More information

Hierarchical Cellular Automata for Visual Saliency

Hierarchical Cellular Automata for Visual Saliency https://doi.org/.7/s263-7-62-2 Hierarchical Cellular Autoata for Visual Saliency Yao Qin Mengyang Feng 2 Huchuan Lu 2 Garrison W. Cottrell Received: 2 May 27 / Accepted: 26 Deceber 27 Springer Science+Business

More information

(From the Department of Biology, St. Louis University, and the Department of Pathology, St. Louis University School of Medicine, St.

(From the Department of Biology, St. Louis University, and the Department of Pathology, St. Louis University School of Medicine, St. EFFECT OF ENZYME INHIBITORS AND ACTIVATORS ON THE MULTIPLICATION OF TYPHUS RICKETTSIAE II. 'remperatiyre, POTASSIII~ CYANIDE, AND TOLITIDIN ]3LIIE BY DONALD GREIFF, Sc.D., AND HENRY PINKERTON, M.D. (Fro

More information

A CLOUD-BASED ARCHITECTURE PROPOSAL FOR REHABILITATION OF APHASIA PATIENTS

A CLOUD-BASED ARCHITECTURE PROPOSAL FOR REHABILITATION OF APHASIA PATIENTS Rev. Rou. Sci. Techn. Électrotechn. et Énerg. Vol. 62, 3, pp. 332 337, Bucarest, 2017 A CLOUD-BASED ARCHITECTURE PROPOSAL FOR REHABILITATION OF APHASIA PATIENTS DORIN CÂRSTOIU 1, VIRGINIA ECATERINA OLTEAN

More information

Research Article Association Patterns of Ontological Features Signify Electronic Health Records in Liver Cancer

Research Article Association Patterns of Ontological Features Signify Electronic Health Records in Liver Cancer Hindawi Journal of Healthcare Engineering Volue 2017, Article ID 6493016, 9 pages https://doi.org/10.1155/2017/6493016 Research Article Association Patterns of Ontological Features Signify Electronic Health

More information

Best Practice in Handling Cases of Missing or Incomplete Values in Data Analysis: A Guide against Eliminating Other Important Data

Best Practice in Handling Cases of Missing or Incomplete Values in Data Analysis: A Guide against Eliminating Other Important Data Best Practice in Handling Cases of Missing or Incomplete Values in Data Analysis: A Guide against Eliminating Other Important Data Sub-theme: Improving Test Development Procedures to Improve Validity Dibu

More information

Evaluation of an In-Situ Output Probe-Microphone Method for Hearing Aid Fitting Verification*

Evaluation of an In-Situ Output Probe-Microphone Method for Hearing Aid Fitting Verification* 0196/0202/90/1101-003 1$02.00/0 EAR AND HEARNG Copyright 0 1990 by The Willias & Wilkins Co. Vol., No. Printed in U. S. A. AMPLFCATON AND AURAL REHABLTATON Evaluation of an n-situ Output Probe-Microphone

More information

Dose Intensity Analysis of High-Dose Carboplatin in Refractory Ovarian Carcinoma Relative to Age

Dose Intensity Analysis of High-Dose Carboplatin in Refractory Ovarian Carcinoma Relative to Age 65 Dose Intensity Analysis of High-Dose Carboplatin in Refractory Ovarian Carcinoa Relative to Age Terri L. Cornelison, M.D., and Eddie Reed, M.D. Background. Cisplatin and carboplatin are essential to

More information

Spatiotemporal Dynamics of Tuberculosis Disease and Vaccination Impact in North Senatorial Zone Taraba State Nigeria

Spatiotemporal Dynamics of Tuberculosis Disease and Vaccination Impact in North Senatorial Zone Taraba State Nigeria IOSR Journal of Matheatics (IOSR-JM ISSN: 78-578. Volue, Issue (Sep-Oct. 1, PP 1- Spatioteporal Dynaics of Tuberculosis Disease and Vaccination Ipact in North Senatorial Zone Taraba State Nigeria 1 A.

More information

Longevity Clubs. Ulla Lehmijoki University of Helsinki and HECER. Discussion Paper No. 234 September 2008 ISSN

Longevity Clubs. Ulla Lehmijoki University of Helsinki and HECER. Discussion Paper No. 234 September 2008 ISSN ömföäflsäafaäsflassflassflas ffffffffffffffffffffffffffffffffffff Discussion Papers Longevity Clubs Ulla Lehijoki University of Helsinki and HECER Discussion Paper No. 234 Septeber 2008 ISSN 1795 0562

More information

interactions (mechanism of folding/contact free energies/range of interactions/monte Carlo)

interactions (mechanism of folding/contact free energies/range of interactions/monte Carlo) Proc. Nat. Acad. Sci. USA Vol. 72, No. 1, pp. 382386, October 1975 Cheistry Model of protein folding: Inclusion of short, ediu, and longrange interactions (echanis of folding/contact free energies/range

More information

Strategies for handling missing data in randomised trials

Strategies for handling missing data in randomised trials Strategies for handling missing data in randomised trials NIHR statistical meeting London, 13th February 2012 Ian White MRC Biostatistics Unit, Cambridge, UK Plan 1. Why do missing data matter? 2. Popular

More information

A novel technique for stress recognition using ECG signal pattern.

A novel technique for stress recognition using ECG signal pattern. Curr Pediatr Res 2017; 21 (4): 674-679 ISSN 0971-9032 www.currentpediatrics.co A novel technique for stress recognition using ECG signal pattern. Supriya Goel, Gurjit Kau, Pradeep Toa Gauta Buddha University,

More information

6. Unusual and Influential Data

6. Unusual and Influential Data Sociology 740 John ox Lecture Notes 6. Unusual and Influential Data Copyright 2014 by John ox Unusual and Influential Data 1 1. Introduction I Linear statistical models make strong assumptions about the

More information

Bayesian approaches to handling missing data: Practical Exercises

Bayesian approaches to handling missing data: Practical Exercises Bayesian approaches to handling missing data: Practical Exercises 1 Practical A Thanks to James Carpenter and Jonathan Bartlett who developed the exercise on which this practical is based (funded by ESRC).

More information

Analysis of TB prevalence surveys

Analysis of TB prevalence surveys Workshop and training course on TB prevalence surveys with a focus on field operations Analysis of TB prevalence surveys Day 8 Thursday, 4 August 2011 Phnom Penh Babis Sismanidis with acknowledgements

More information

Lung Volume Reduction Surgery Using the NETT Selection Criteria

Lung Volume Reduction Surgery Using the NETT Selection Criteria Lung Volue Reduction Surgery Using the NETT Selection Criteria Mark E. Ginsburg, MD, Byron M. Thoashow, MD, Chun K. Yip, MD, Angela M. DiMango, MD, Roger A. Maxfield, MD, Matthew N. Bartels, MD, Patricia

More information

An original approach to the diagnosis of scolineinduced

An original approach to the diagnosis of scolineinduced J. clin Path., 1972, 25, 422-426 An original approach to the diagnosis of scolineinduced apnoea A. FSHTAL, R. T. EVANS, AND C. N. CHAPMAN Fro the Departent ofpathology, Southead General Hospital, Bristol

More information

Missing by Design: Planned Missing-Data Designs in Social Science

Missing by Design: Planned Missing-Data Designs in Social Science Research & Methods ISSN 1234-9224 Vol. 20 (1, 2011): 81 105 Institute of Philosophy and Sociology Polish Academy of Sciences, Warsaw www.ifi span.waw.pl e-mail: publish@ifi span.waw.pl Missing by Design:

More information

Seizure recurrence after a 1st unprovoked seizure:

Seizure recurrence after a 1st unprovoked seizure: Seizure recurrence after a st unprovoked seizure: An extended follow-up W. Allen Hauser, MD; Stephen S. Rich, PhD; John F. Annegers, PhD; and V. Elving Anderson, PhD Article abstract-we followed 28 patients

More information

Sodium Chloride Content in Ketchup by Precipitation Titration

Sodium Chloride Content in Ketchup by Precipitation Titration Background Sodiu Chloride Content in Ketchup by Precipitation Titration Sodiu chloride is one of the ost coon substances found in nature. Knowing the salt content in food products is iportant not only

More information

Application guide. High speed migration SYSTIMAX InstaPATCH 360 and SYSTIMAX Ultra-Low-Loss configuration guideline

Application guide. High speed migration SYSTIMAX InstaPATCH 360 and SYSTIMAX Ultra-Low-Loss configuration guideline Application guide High speed igration SYSTIMAX InstaPATCH 360 and SYSTIMAX Ultra-Low-Loss configuration guideline Contents Contents... 2 SYSTIMAX preterinated fiber-optic cabling systes configuration guide...

More information

Outcome measures in palliative care for advanced cancer patients: a review

Outcome measures in palliative care for advanced cancer patients: a review Journal of Public Health Medicine Vol. 19, No. 2, pp. 193-199 Printed in Great Britain Outcoe easures in for advanced cancer s: a review Julie Hearn and Irene J. Higginson Suary Inforation generated using

More information

Policy Trap and Optimal Subsidization Policy under Limited Supply of Vaccines

Policy Trap and Optimal Subsidization Policy under Limited Supply of Vaccines olicy Trap and Optial Subsidization olicy under Liited Supply of Vaccines Ming Yi 1,2, Achla Marathe 1,3 * 1 Networ Dynaics and Siulation Science Laboratory, VBI, Virginia Tech, Blacsburg, Virginia, United

More information

Chapter 4: Nutrition. Teacher s Guide

Chapter 4: Nutrition. Teacher s Guide Chapter 4: Nutrition Teacher s Guide 55 Chapter 4: Nutrition Teacher s Guide Learning Objectives Students will explain two ways that nutrition affects health Students will describe the function of 5 iportant

More information

Some General Guidelines for Choosing Missing Data Handling Methods in Educational Research

Some General Guidelines for Choosing Missing Data Handling Methods in Educational Research Journal of Modern Applied Statistical Methods Volume 13 Issue 2 Article 3 11-2014 Some General Guidelines for Choosing Missing Data Handling Methods in Educational Research Jehanzeb R. Cheema University

More information

Clustering of Hypertension, Abnormal Glucose Tolerance, Hypercholesterolaemia and Obesity in Malaysian Adult Population

Clustering of Hypertension, Abnormal Glucose Tolerance, Hypercholesterolaemia and Obesity in Malaysian Adult Population ORIGINAL ARTICLE Clustering of Hypertension, Abnoral Glucose Tolerance, Hypercholesterolaeia and Obesity in Malaysian Adult Population TO Li*, L M Ding*, M Zaki**, I Merican***, S TKew***, A H Maiunah****,

More information

Community Health Environment Scan Survey (CHESS): a novel tool that captures the impact of the built environment on lifestyle factors

Community Health Environment Scan Survey (CHESS): a novel tool that captures the impact of the built environment on lifestyle factors æstudy DESIGN ARTICLE Counity Health Environent Scan Survey (CHESS): a novel tool that captures the ipact of the built environent on lifestyle factors Fiona Wong 1 *, Denise Stevens 1, Kathleen O Connor-Duffany

More information

A TWO-DIMENSIONAL THERMODYNAMIC MODEL TO PREDICT HEART THERMAL RESPONSE DURING OPEN CHEST PROCEDURES

A TWO-DIMENSIONAL THERMODYNAMIC MODEL TO PREDICT HEART THERMAL RESPONSE DURING OPEN CHEST PROCEDURES A TWO-DIMENSIONAL THERMODYNAMIC MODEL TO PREDICT HEART THERMAL RESPONSE DURING OPEN CHEST PROCEDURES F. G. Dias, J. V. C. Vargas, and M. L. Brioschi Universidade Federal do Paraná Departaento de Engenharia

More information

Sudden Noise Reduction Based on GMM with Noise Power Estimation

Sudden Noise Reduction Based on GMM with Noise Power Estimation J. Software Engineering & Applications, 010, 3: 341-346 doi:10.436/jsea.010.339 Pulished Online April 010 (http://www.scirp.org/journal/jsea) 341 Sudden Noise Reduction Based on GMM with Noise Power Estiation

More information

SPERM RECEPTIVITY IN SEA URCHIN OOCYTES AND EGGS

SPERM RECEPTIVITY IN SEA URCHIN OOCYTES AND EGGS J. exp. Biol. 118, 8-97 (198) 8 Printed in Great Britain The Copany of Biologists Liited 198 SPERM RECEPTIVITY IN SEA URCHIN OOCYTES AND EGGS BY BRIAN DALE Stazione Zoologica, Villa Counale, 80121 Naples,

More information

Computational modelling of amino acid transfer interactions in the placenta

Computational modelling of amino acid transfer interactions in the placenta Exp Physiol 95.7 pp 829 840 829 Experiental Physiology Research Paper Coputational odelling of aino acid transfer interactions in the placenta B. G. Sengers 1,C.P.Please 2 andr.m.lewis 3 1 Bioengineering

More information

ASBESTOSIS AND PRIMARY INTRATHORACIC NEOPLASMS

ASBESTOSIS AND PRIMARY INTRATHORACIC NEOPLASMS ASBESTOSIS AND PRIMARY INTRATHORACIC NEOPLASMS Willia D. Buchanan Ministry of Labour, London, Gwat Britain The purpose of this paper is to describe one aspect of a study, which has been continued over

More information

The Price OF POLLUTION. Cost Estimates of Environmentally-Related Disease in Oregon

The Price OF POLLUTION. Cost Estimates of Environmentally-Related Disease in Oregon The Price OF POLLUTION Cost Estiates of Environentally-Related Disease in Oregon FEBRUARY 2008 The Price Cost Estiates of Environentally-Related Disease in Oregon OF POLLUTION Project Sponsors: Author:

More information

Did Modeling Overestimate the Transmission Potential of Pandemic (H1N1-2009)? Sample Size Estimation for Post-Epidemic Seroepidemiological Studies

Did Modeling Overestimate the Transmission Potential of Pandemic (H1N1-2009)? Sample Size Estimation for Post-Epidemic Seroepidemiological Studies Georgia State University ScholarWorks @ Georgia State University Public Health Faculty Publications School of Public Health 2011 Did Modeling Overestiate the Transission Potential of Pandeic (H1N1-2009)?

More information

Master thesis Department of Statistics

Master thesis Department of Statistics Master thesis Department of Statistics Masteruppsats, Statistiska institutionen Missing Data in the Swedish National Patients Register: Multiple Imputation by Fully Conditional Specification Jesper Hörnblad

More information

Brain Computer Interface with Low Cost Commercial EEG Device

Brain Computer Interface with Low Cost Commercial EEG Device Brain Coputer Interface with Low Cost Coercial EEG Device 1 * Gürkan Küçükyıldız, Suat Karakaya, Hasan Ocak and 2 Öer Şayli 1 Faculty of Engineering, Departent of Mechatronics Engineering, Kocaeli University,

More information

Previously, when making inferences about the population mean,, we were assuming the following simple conditions:

Previously, when making inferences about the population mean,, we were assuming the following simple conditions: Chapter 17 Inference about a Population Mean Conditions for inference Previously, when making inferences about the population mean,, we were assuming the following simple conditions: (1) Our data (observations)

More information

Bias in regression coefficient estimates when assumptions for handling missing data are violated: a simulation study

Bias in regression coefficient estimates when assumptions for handling missing data are violated: a simulation study STATISTICAL METHODS Epidemiology Biostatistics and Public Health - 2016, Volume 13, Number 1 Bias in regression coefficient estimates when assumptions for handling missing data are violated: a simulation

More information

JPES. Original Article. The usefulness of small-sided games on soccer training

JPES. Original Article. The usefulness of small-sided games on soccer training Journal of Physical Education and Sport (), 12(1), Art 15, pp 93-102, 2012 online ISSN: 2247-806X; p-issn: 2247 8051; ISSN - L = 2247-8051 Original Article The usefulness of sall-sided gaes on soccer training

More information

Speech Enhancement Using Temporal Masking in the FFT Domain

Speech Enhancement Using Temporal Masking in the FFT Domain PAGE 8 Speech Enhanceent Using Teporal Masking in the FFT Doain Yao Wang, Jiong An, Teddy Surya Gunawan, and Eliathaby Abikairajah School of Electrical Engineering and Telecounications The University of

More information

Chapter 1. Relationship Bridge Building

Chapter 1. Relationship Bridge Building Chapter 1 An Introduction to Relationship Bridge Building a I know of no ore encouraging fact than the unquestionable ability of an to elevate his life by conscious endeavor. -Henry Thoreau The Road to

More information

Acetic Acid in Vinegar by Acid/Base Titration

Acetic Acid in Vinegar by Acid/Base Titration Acetic Acid in Vinegar by Acid/Base Titration Background Vinegar is typically ade fro the ferentation of alcoholic liquids such as wine. Its ain constituents are water and acetic acid, usually about 5

More information