The effect of variance and tails of test on sample size determination

Similar documents
Study Guide #2: MULTIPLE REGRESSION in education

Chapter 23. Inference About Means. Copyright 2010 Pearson Education, Inc.

STATISTICS AND RESEARCH DESIGN

Teachers Sense of Efficacy Scale: The Study of Validity and Reliability

CHAPTER III METHODOLOGY

Factors Influencing Sample Size for Internal Audit Evidence Collection in the Public Sector in Kenya

Chapter 5: Field experimental designs in agriculture

Understandable Statistics

Problem #1 Neurological signs and symptoms of ciguatera poisoning as the start of treatment and 2.5 hours after treatment with mannitol.

Ecological Statistics

Inferential Statistics

MTH 225: Introductory Statistics

Daniel Boduszek University of Huddersfield

Research Methods in Forest Sciences: Learning Diary. Yoko Lu December Research process

Samples, Sample Size And Sample Error. Research Methodology. How Big Is Big? Estimating Sample Size. Variables. Variables 2/25/2018

Skala Stress. Putaran 1 Reliability. Case Processing Summary. N % Excluded a 0.0 Total

2.75: 84% 2.5: 80% 2.25: 78% 2: 74% 1.75: 70% 1.5: 66% 1.25: 64% 1.0: 60% 0.5: 50% 0.25: 25% 0: 0%

CRITERIA FOR USE. A GRAPHICAL EXPLANATION OF BI-VARIATE (2 VARIABLE) REGRESSION ANALYSISSys

Describe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo

CHAPTER III RESEARCH METHODOLOGY

The Effect of Logical Choice Weight and Corrected Scoring Methods on Multiple Choice Agricultural Science Test Scores

Multiple Regression Analysis

IAASB Main Agenda (February 2007) Page Agenda Item PROPOSED INTERNATIONAL STANDARD ON AUDITING 530 (REDRAFTED)

How many speakers? How many tokens?:

CHAPTER 3 RESEARCH METHODOLOGY

QA 605 WINTER QUARTER ACADEMIC YEAR

CHAPTER TWO REGRESSION

Pitfalls in Linear Regression Analysis

WELCOME! Lecture 11 Thommy Perlinger

International Journal on Future Revolution in Computer Science & Communication Engineering ISSN: Volume: 4 Issue:

A COMPARISON BETWEEN MULTIVARIATE AND BIVARIATE ANALYSIS USED IN MARKETING RESEARCH

AUDIT SAMPLING CHAPTER LEARNING OUTCOMES CHAPTER OVERVIEW

REGRESSION MODELLING IN PREDICTING MILK PRODUCTION DEPENDING ON DAIRY BOVINE LIVESTOCK

DATA ANALYSIS AND INTERPRETATION

HOW STATISTICS IMPACT PHARMACY PRACTICE?

1.4 - Linear Regression and MS Excel

Business Research Methods. Introduction to Data Analysis

Simple Linear Regression

Business Statistics Probability

Determinants of Audit Expectation Gap: Evidence from Limited Companies in Kenya

Readings: Textbook readings: OpenStax - Chapters 1 13 (emphasis on Chapter 12) Online readings: Appendix D, E & F

Bayesian and Frequentist Approaches

PRINCIPLES OF STATISTICS

Interpretation of Data and Statistical Fallacies

Evaluating EDI* Participant Reactions via Different Response Scales: A Technical Review. Keshav Gaur William A. Eckert

A Brief Introduction to Bayesian Statistics

Certificate Courses in Biostatistics

Chapter 11 Multiple Regression

Evaluating the results of a Systematic Review/Meta- Analysis

RELATIONSHIP BETWEEN EMOTIONAL INTELLIGENCE AND ETHICAL COMPETENCE: AN EMPIRICAL STUDY

CRITICAL EVALUATION OF BIOMEDICAL LITERATURE

Lec 02: Estimation & Hypothesis Testing in Animal Ecology

Chapter 7: Correlation

MMI 409 Spring 2009 Final Examination Gordon Bleil. 1. Is there a difference in depression as a function of group and drug?

USE AND MISUSE OF MIXED MODEL ANALYSIS VARIANCE IN ECOLOGICAL STUDIES1

INTERNATIONAL STANDARD ON AUDITING (NEW ZEALAND) 530

The Relationship between Spiritual Leadership Features of the Principals and Job Empowerment

A SAS Macro to Investigate Statistical Power in Meta-analysis Jin Liu, Fan Pan University of South Carolina Columbia

Statistical techniques to evaluate the agreement degree of medicine measurements

Bayesian Confidence Intervals for Means and Variances of Lognormal and Bivariate Lognormal Distributions

The Role of Emotional Intelligence in Academic Performance of Male and Female Students in the University of the Punjab

Statistical reports Regression, 2010

CLINICAL RESEARCH METHODS VISP356. MODULE LEADER: PROF A TOMLINSON B.Sc./B.Sc.(HONS) OPTOMETRY

International Standard on Auditing (UK) 530

Correlational Research. Correlational Research. Stephen E. Brock, Ph.D., NCSP EDS 250. Descriptive Research 1. Correlational Research: Scatter Plots

Empirical Knowledge: based on observations. Answer questions why, whom, how, and when.

Still important ideas

Module 14: Missing Data Concepts

Characterization of Renal Function and Morphology in Hypertensive Patients Using Renal Ultrasonography

City, University of London Institutional Repository

Prepared by: Assoc. Prof. Dr Bahaman Abu Samah Department of Professional Development and Continuing Education Faculty of Educational Studies

10 Intraclass Correlations under the Mixed Factorial Design

Analyzing binary outcomes, going beyond logistic regression

A study of association between demographic factor income and emotional intelligence

Chapter 1: Exploring Data

BOOTSTRAPPING CONFIDENCE LEVELS FOR HYPOTHESES ABOUT QUADRATIC (U-SHAPED) REGRESSION MODELS

Overview. Survey Methods & Design in Psychology. Readings. Significance Testing. Significance Testing. The Logic of Significance Testing

How to interpret results of metaanalysis

Chapter 4 Research Methodology

Sheila Barron Statistics Outreach Center 2/8/2011

What you should know before you collect data. BAE 815 (Fall 2017) Dr. Zifei Liu

The effect of vestibular stimulation on dynamic balance in mentally retarded children

Index. Springer International Publishing Switzerland 2017 T.J. Cleophas, A.H. Zwinderman, Modern Meta-Analysis, DOI /

What Is the Fat Intake of a Typical Middle School Student?

Section on Survey Research Methods JSM 2009

A Biostatistics Applications Area in the Department of Mathematics for a PhD/MSPH Degree

Statistics as a Tool. A set of tools for collecting, organizing, presenting and analyzing numerical facts or observations.

NORTH SOUTH UNIVERSITY TUTORIAL 1

The Research Roadmap Checklist

MULTIPLE LINEAR REGRESSION 24.1 INTRODUCTION AND OBJECTIVES OBJECTIVES

SPRING GROVE AREA SCHOOL DISTRICT. Course Description. Instructional Strategies, Learning Practices, Activities, and Experiences.

Experimental Psychology

Examining the Psychometric Properties of The McQuaig Occupational Test

1. The Role of Sample Survey Design

SRI LANKA AUDITING STANDARD 530 AUDIT SAMPLING CONTENTS

Journal of Biostatistics and Epidemiology

1. The figure below shows the lengths in centimetres of fish found in the net of a small trawler.

Day 11: Measures of Association and ANOVA

The Impact of Different Malaria Drugs in Curing Malaria Within the Ghanaian Populace: A Case Study of Sekondi-Takoradi

Theoretical Exam. Monday 15 th, Instructor: Dr. Samir Safi. 1. Write your name, student ID and section number.

Transcription:

EUROPEAN ACADEMIC RESEARCH Vol. VI, Issue 10/ January 2019 ISSN 2286-4822 www.euacademic.org Impact Factor: 3.4546 (UIF) DRJI Value: 5.9 (B+) The effect of variance and tails of test on sample size determination MOHAMMED OMER ADAM 1 Department of Statistics Sudan University of Science and Technology ADIL MOUSA YOUNIS Qassim University AFRAA HASHIM Department of Statistics Sudan University of Science and Technology Abstract When different techniques of sampling are used, there are different formulas for determination of appropriate sample size. The aim of this study was to employ formula for determining representative sample size, when simple random sampling technique is used test from one tail and two tails. at the level of significant, depending on that, the value of test from two tails and test from one tail with variance ranging (0.1 to 2) and Standard Error, was used to obtain different sizes of samples. Excel office and SPSS package were used for sample size generation and analysis. To obtained results showed that, the sample sizes increases, as the variance increases. The result also showed significant Correlation between variance value ( ) and sample size ( ) in the two cases. Significant differences were observed between sample sizes estimator when Z test from one tail or two tails. The obtained samples sizes showed significant goodness of fitting with Poisson distribution, in the two cases. 1 The corresponding author email: mamoh99@gmail.com 5579

Key words: sample size, variance effect, precision, random sampling, confidence interval 1. INTRODUCTION Sample size estimation is usually conducted through a prestudy power analysis. To select a sample size for achieving a desired power of correct detection of a meaningful difference at a given level of significance (1) If we desire to estimate, with a confidence interval, the mean of a population (µ), one of the first question arise is how large the sample should be? This question must be given serious consideration, because it is a waste of time and resources to take a larger sample than is needed for the required results. On the other hand, if the number of samples is too small it may lead to results of no practical value. The key questions are how do you want our estimate to be close to the true value? Or, how wide would we like to make the confidence interval? The second question is, how much confidence do we want to place in our interval? That is, what confidence coefficient do we wish to employ? (2). To answer these questions may need to use the confidence interval Where is equal one half of the confidence interval, accordingly question one can be answered, by the set up of the following equation: Where indicates how close the true mean we want our estimate to be. From equations 1 and 2 sample size equation can be written as 5580

Sample size estimation is referred to the calculation of required number of samples for achieving some desired statistical confidence of accuracy and reliability (2). In many researches, sample size calculation may be performed based on precision analysis, variance of population, probability assessment or other statistical inferences) (1). This study was focusing on the estimation of sample size depending on the effect of variance of population. In surveying studies, once data are collected, the most important objective of a statistical analysis is to draw inferences about the population using sample information, e.g. how big a sample is required?. as one of the most frequently asked questions by the investigators. If the sample size is not taken properly, conclusions drawn from the investigation may not reflect the real situation for the whole population (5). The computer programming or computation may be effectively used to determine an appropriate sample size, as one of the most important steps in statistical studies. Sample size computation should be done appropriately because if the sample size is not appropriate for a particular study then the inference drawn from the sample may lead to some wrong conclusions (5). According to Gwowen Shieh (2013), the a priori determination of a proper sample size is necessary to achieve some specified power as an important problem frequently encountered in practical studies. To establish the needed sample size for a two-sample t test, researchers may conduct the power analysis by specifying scientifically important values as underlying population means while using a variance estimate obtained from related research or pilot study. In order to take account of the variability of sample variance, two approaches to sample size determinations should be considered. One provides the sample size required to given 5581

assurance probability, that, the actual power exceeds the planned power. The other provides the necessary sample size, such that the expected power attains the designated power level. The suggested paradigm of adjusted sample variance combines the existing procedures into one unified framework. Numerical results may be presented to illustrate the usefulness and advantages of the proposed approaches that accommodate the stochastic nature of the sample variance. More importantly, supplementary computer programs are developed to aid the implementation of the suggested techniques. The exposition may helps to clarify discrepancy and to extend the development of sample size methodology (2). Simple regression and correlation analysis of some data collected, to test, hypotheses and make conclusions on the factors affecting the sample size for audit purposes of the internal auditors in the public sector in Kenya, was found that, materiality of audit issue, type of information available, source of information, degree of risk of misstatement, auditor skills and independence are some of the factors influencing the sample size determination for the purposes of internal audit evidence collection in public sector in Kenya (3). 2. METHODOLOGY Simple random sampling technique was used to determine the representative sample size from one and two tails. Z value for various levels were employed (table1), where level of significance is, variance values ranging from (0.1 to 2), Standard Error, and total population size was assumed to be. This was used to find the main sample size. Microsoft excel office was used to generate sample size values and ratios (tables 2 and 3). SPSS program was used for a test of significances (tables 4, 5) and (figs 1, 2, 3, 4). 5582

Table 1: Z test values Two tails 2.58 One tail 5 2.33 3. RESULTS AND DISCUSSION Z-test value from two tails, with level of significance, variance values ranging from (0.1 to 2) and Standard Error were used to obtain different sizes of samples (table 2). Table 2: size of samples and ratio 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 1.1 1.2 1.3 1.4 1.5 1.6 1.7 1.8 1.9 2 3842 7683 11525 15366 19208 23050 26891 30733 34574 38416 42258 46099 49941 53782 57624 61466 65307 69149 72990 76832 3700 7135 10334 13319 16113 18732 21192 23508 25691 27754 29705 31553 33307 34972 36558 38067 39506 40880 42193 43449 Ratio 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 5583

Fig 1: scatter diagram for sample sizes Fig 2: box plot for sample sizes According to the obtained results in table 2 and figures (1, 2) the sample size increases, as the variance increases. Table 3: size of samples and ratio 0.1 2690 2619 Ratio 1 0.2 5379 5104 2 0.3 8069 74667 3 0.4 10758 9713 4 0.5 13448 11854 5 0.6 16138 13896 6 0.7 18827 15844 7 0.8 21517 17707 8 0.9 24206 19489 9 1 26896 21195 10 1.1 29587 22832 11 1.2 32275 24400 12 1.3 34965 25907 13 1.4 37654 27354 14 1.5 40344 28747 15 5584

1.6 1.7 1.8 1.9 2 43034 45723 48413 51102 53792 30087 31377 32620 33820 34977 By using z-test value from one tail, with level of significance variance values ranging from and standard error different sizes of samples were obtained (table 3). 16 17 18 19 20 Fig 3: Scatter diagram for sample sizes Fig 4: box plot for sample sizes From the table 3 and fig (3, 4) we can observe that, the sample size increases, as the variance increases without out layer fit and normal ratio. 5585

Table (4): Independent sample T test t-test for Equality of Means t Sig Equal variances assumed 915.2 7 Equal variances not assumed 916.5 3 Table (4) showed that the value of t-test is (915.2) with (pvalue= 7<0.05). According to tables (1 and 2), there is significant difference between sample sizes estimators of Z test from one tail and two tails, at the significant level 0.05. Table 5: One-Sample Kolmogorov-Smirnov Test Poisson Parameter Mean 4.0337E4 2.8241E4 Kolmogorov-Smirnov Z 2.236 2.236 Sig 0.000 0.000 SPSS analysis showed that, the value of chi square test is (2.236) with (p-value= 0.000< 0.05). From table (5), and depending on tables (2 and 3), we may concluded that, there is a significant goodness of fit, in cases of Z test from one tail and two tails, at the significant level 0.05. Table 6: Correlations Pearson Correlation 0.939 0.938 Sig 0.000 0.000 From table (6), the Pearson Correlation value was found to be (1.000) with (p-value= 0.000<0.05). Depending on that, there is significant Correlation between the variance values ( ) and sample size ( ) in the case of one tail and two tails, at the level of significant 0.05. 5586

4. CONCLUSION According to the obtained results we may conclude that, the sample size was affected by the increases or decreases of variance. To select appropriate sample size for any study, we need to know the hypothesis, where the test from one tail or two tails. The value of correlation between estimated sample sizes and variance when the test from two tails is equal 0.938, and equal 0.939, when, the test from one tail, may indicate the existence of the strong correlation between sample sizes and variance. In a sample surveys, it may be use test Z from two tails to giving larger sample size. REFERENCES 1. Shein-Chung Chow, Jun Shao, Hansheng Wang, (2017), Sample Size Calculations in Clinical Research- print edition Marcel Dekker, Inc. New York. 2. Gwowen Shieh, (2013), On using a pilot sample variance for sample size determination in the detection of differences between two means, Psicológica, 34: 125-143. 3. Charles Guandaru, Kariuki, Samuel Nduati, (2012), Factors Influencing Sample Size for Internal Audit Evidence Collection in the Public Sector in Kenya Kamau. International Journal of Advances in Management and Economics, 1: 42-49. 4. H.K. Sarmah, B. Bora Hazrkia, G. Choudhury, (2013) An investigation on effect of bias on determination of sample size on the basis of data related to the students of schools of Guwahati, International Journal of Applied Mathematics and Statistical Sciences 2 (1): 5. William G. Cochran John Willey & Sons, (1977), Sampling techniques- 5587

6. Sharon L. lohr, (2010), Sampling design and analysis. Second edition-brooks/cole- USA 7. Paula Lagares Barreiro - Justo Puerto Albandoz MaMaEuSch, (2001), Population and sample. Sampling techniques, International Journal of Applied Mathematics and Statistical Sciences. 2 (1). 8. Mohd Asraf and James K. Brewer, (2004), Conducting test of hypotheses the need for an adequate Sample size, The Australian educational researcher, 31(1). 5588