Sta 309 (Statistics And Probability for Engineers)

Similar documents
Chapter 1: The Nature of Probability and Statistics

aps/stone U0 d14 review d2 teacher notes 9/14/17 obj: review Opener: I have- who has

Chapter 1: Data Collection Pearson Prentice Hall. All rights reserved

Vocabulary. Bias. Blinding. Block. Cluster sample

AP Statistics Exam Review: Strand 2: Sampling and Experimentation Date:

Data = collections of observations, measurements, gender, survey responses etc. Sample = collection of some members (a subset) of the population

Chapter 3. Producing Data

Chapter 3. Producing Data

Section 1.1 What is Statistics?

Chapter 1 Data Collection

Sampling. (James Madison University) January 9, / 13

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

CHAPTER 5: PRODUCING DATA

Chapter 5: Producing Data

Ch. 1 Collecting and Displaying Data

Introduction to Statistics

Chapter 2. The Data Analysis Process and Collecting Data Sensibly. Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc.

Population. population. parameter. Census versus Sample. Statistic. sample. statistic. Parameter. Population. Example: Census.

Chapter 1 - The Nature of Probability and Statistics

Unit 3: Collecting Data. Observational Study Experimental Study Sampling Bias Types of Sampling

Variable Data univariate data set bivariate data set multivariate data set categorical qualitative numerical quantitative

Ch 1.1 & 1.2 Basic Definitions for Statistics

Chapter 1: Exploring Data

Outline. Chapter 3: Random Sampling, Probability, and the Binomial Distribution. Some Data: The Value of Statistical Consulting

Unit 1 Exploring and Understanding Data

Design, Sampling, and Probability

The Nature of Probability and Statistics

Chapter 1 - Sampling and Experimental Design

Class 1. b. Sampling a total of 100 Californians, where individuals are randomly selected from each major ethnic group.

1. If a variable has possible values 2, 6, and 17, then this variable is

Sampling Reminders about content and communications:

UNIT I SAMPLING AND EXPERIMENTATION: PLANNING AND CONDUCTING A STUDY (Chapter 4)

Chapter 4 Review. Name: Class: Date: Multiple Choice Identify the choice that best completes the statement or answers the question.

MATH 2300: Statistical Methods. What is Statistics?

Quiz 4.1C AP Statistics Name:

AP Psychology -- Chapter 02 Review Research Methods in Psychology

Introduction to Biostatics.

Experimental Design There is no recovery from poorly collected data!

Math 140 Introductory Statistics

MATH-134. Experimental Design

Homework Answers. 1.3 Data Collection and Experimental Design

Objectives. Data Collection 8/25/2017. Section 1-3. Identify the five basic sample techniques

Lecture Slides. Elementary Statistics Eleventh Edition. by Mario F. Triola. and the Triola Statistics Series 1.1-1

STATISTICS 8 CHAPTERS 1 TO 6, SAMPLE MULTIPLE CHOICE QUESTIONS

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Moore, IPS 6e Chapter 03

For each of the following cases, describe the population, sample, population parameters, and sample statistics.

Do Now Prob & Stats 8/26/14 What conclusions can you draw from this bar graph?

What Is Statistics. Chapter 01. Copyright 2013 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin

Observational study is a poor way to gauge the effect of an intervention. When looking for cause effect relationships you MUST have an experiment.

04/12/2014. Research Methods in Psychology. Chapter 6: Independent Groups Designs. What is your ideas? Testing

Introduction to Statistics and Research Design. Arlo Clark-Foos

Sampling for Success. Dr. Jim Mirabella President, Mirabella Research Services, Inc. Professor of Research & Statistics

Section 6.1 Sampling. Population each element (or person) from the set of observations that can be made (entire group)

Section 6.1 Sampling. Population each element (or person) from the set of observations that can be made (entire group)

9.63 Laboratory in Cognitive Science

Chapter 1 Introduction to I/O Psychology

Data collection, summarizing data (organization and analysis of data) The drawing of inferences about a population from a sample taken from

What Is Statistics. Chapter 1

AP Statistics Chapter 5 Multiple Choice

REVIEW FOR THE PREVIOUS LECTURE

Chapter 1 Overview. Created by Tom Wegleitner, Centreville, Virginia. Copyright 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley.

CHAPTER 3 METHOD AND PROCEDURE

What Is Statistics. Learning Objectives. Definition. Who Uses Statistics? 12/9/2015

STA 291 Lecture 4 Jan 26, 2010

Basic Statistical Concepts, Research Design, & Notation

Observation Studies, Sampling Designs and Bias

A Probability Puzzler. Statistics, Data and Statistical Thinking. A Probability Puzzler. A Probability Puzzler. Statistics.

1. Introduction a. Meaning and Role of Statistics b. Descriptive and inferential Statistics c. Variable and Measurement Scales

INTRODUCTION TO STATISTICS SORANA D. BOLBOACĂ

Probabilities and Research. Statistics

Handout 16: Opinion Polls, Sampling, and Margin of Error

Biostatistics. Donna Kritz-Silverstein, Ph.D. Professor Department of Family & Preventive Medicine University of California, San Diego

Comparing Different Studies

Chapter 4 SAMPLING METHODS AND RESEARCH DESIGNS

full file at

7) A tax auditor selects every 1000th income tax return that is received.

SPRING GROVE AREA SCHOOL DISTRICT. Course Description. Instructional Strategies, Learning Practices, Activities, and Experiences.

Example 1. October 1, / 14

3. For a $5 lunch with a 55 cent ($0.55) tip, what is the value of the residual?

Slide 1 - Introduction to Statistics Tutorial: An Overview Slide notes

TOPIC: Introduction to Statistics WELCOME TO MY CLASS!

MBA 605 Business Analytics Don Conant, PhD. GETTING TO THE STANDARD NORMAL DISTRIBUTION

in explaning the result of research studies In planning and decision making are supported by data

Chapter 1: Introduction to Statistics

Statistics Mathematics 243

Psych 1Chapter 2 Overview

Chapter 5: Producing Data Review Sheet

Prepared by: Assoc. Prof. Dr Bahaman Abu Samah Department of Professional Development and Continuing Education Faculty of Educational Studies

Test Bank for Privitera, Statistics for the Behavioral Sciences

Formulating Research Questions and Designing Studies. Research Series Session I January 4, 2017

Chapter 01 What Is Statistics?

c. Construct a boxplot for the data. Write a one sentence interpretation of your graph.

10.1 Estimating with Confidence. Chapter 10 Introduction to Inference

august 3, 2018 What do you think would have happened if we had time to do the same activity but with a sample size of 10?

Summer AP Statistic. Chapter 4 : Sampling and Surveys: Read What s the difference between a population and a sample?

CHAPTER 1 SAMPLING AND DATA

Statistics are commonly used in most fields of study and are regularly seen in newspapers, on television, and in professional work.

Chapter 1: Statistical Basics

Section Introduction

Transcription:

Instructor: Prof. Mike Nasab Sta 309 (Statistics And Probability for Engineers) Chapter (1) 1. Statistics: The science of collecting, organizing, summarizing, analyzing numerical information called data (Descriptive Statistics), and drawing conclusions (Inferential Statistics). 2. Type of data: a. Qualitative variables: Variables that can be separated into different categories distinguished by some nonnumeric characteristic. Gender (male or female), Race (White, Black, Hispanic, etc.), Favorite Color (Blue, Red, Silver, etc.) b. Quantitative variables: Variables consisting of numbers representing counts or measurements. Age, Height, Speed, Test scores, the number books in our bookstore and salary. 3. Type of Quantitative variables: a. Discrete variables: Data that can assume the values corresponding to isolated points along a line interval. In this type the data are to be counted. 1. Numbers of telephone calls is made at the switchboard of our school everyday. 2. Number of car accidents on 405 FWY. 3. Number of babies delivered at Long Beach Memorial hospital Numbers of telephone calls is made at the switchboard of our school everyday. {No calls, 1call, 2 calls, 3 calls, and so 2. Number of car accidents on 40umber of bab b. Continuous variables: Data that can assume any value along a line interval, including every possible value between any two values. In this type the data are to be measured. 1. Height of boys born at UCLA hospital on fourth of July. 2. Amount of rain fall in California in the year 2005. 3. Amount (Volume) of coffee consumed by Americans in one day. 4. Variable - Property of an object or event that can take on different values. For example, college major is a variable that takes on values like mathematics, computer science, English, psychology, etc. Variables whose values are determined by chance are called random variables. 5. Data: List of observations a variable assumes 6. Variable versus Data: Gender is a variable whereas being male or female is the data 7. Control group: is a group of subjects in an experiment who are not given a particular treatment. (Like Placebo) 8. Experimental group: is a group of subjects in an experiment who are given a particular treatment. (Like new drug) 9. Double blind: Neither the doctor nor the patient know whether he or she is part of the experimental or control group. 10. Population: The complete collection of all elements to be studied. (Scores, People, Measurements,) 11. Sample: A sub collection of elements drawn from a population. 1

Example1: A quality control manager randomly selects 50 bottles of Coca-Cola that were filled on October 15 in order to assess the calibration of the filling machine. Determine the individuals, the population and the sample. Ans: The population consists of all bottles of Coca-cola filled by that particular machine on October 15. the individuals are just the individual bottles. The sample consists of the 50 bottles selected by the quality control manager. Example 2: A researcher is claiming that the average age of women who are graduated from Engineering School at UCLA is about 28 years. To test his hypothesis, he randomly selected 300 female engineers who have graduated from UCLA school of Engineering. Determine the population, Identify the variable of interest, is the variable quantitative (qualitative)? Is the variable discrete or continuous? Describe the sample. Describe the inference. 12. Parameter: Numerical measurement describing some characteristic of a population µ = Population mean, σ = Population standard deviation and p = Population proportion 13. Statistic: Numerical measurement describing some characteristic of a sample x = sample mean, s = Sample standard deviation and ˆp = Sample proportion 14. Census versus a sample: Census is a collection of data from every element in a population whereas a sample is a subset of a population. 15. Survey: We study a part of a larger population in order to understand the whole 16. Survey sampling: a. Observational study (Association/ relationship between two variables): Study in which we observe and measure specific characteristics, but do not attempt to manipulate or modify the subjects being studied. The incidence of lung disease in a sample of workers in asbestos factories is compared to the incidence of lung disease in a sample of college professors. b. Designed experiment (Causation): Study in which a treatment is applied to the experimental units (individuals) and attempts to manipulate or modify the subjects being studied. The advantage of an experiment over an observational study is that an experiment is controlled An experiment is defined by the following types of variables: (Response and Predictor) 17. Response variable: The variable, which measures the response of units or subjects to the various treatment. A researcher is interested in determining if one can predict the scores on a statistics exam from the amount of time spent studying for the exam. Identify the response variable. (The scores on the exam) A large study used records from Canada s national health care system to compare the effectiveness of two ways to treat prostate disease, traditional surgery and a new method that does not require surgery. You have 300 prostrate patients who are willing to serve as subjects in an experiment to compare the two methods. What is the response variable in this experiment? [Existence of prostrate disease is the response variable] A study is done to compare the lung capacity (measured by certain breathing tests) of coal miners to the lung capacity of farm workers. The researcher is able to study 200 workers of each type. (Lung capacity) 18. Predictor Variable: The factor (s) that affect the response variables 19. Levels of predictor variable: The values that a factor can take are called the levels of the factor. For example, a drug dosage (the factor) may be administered at three different levels. 2

20. Lurking variable: The factors that are related to our study but they are not being identified. 21. Frame: The list of all individuals within the population. A school psychologist wants to test the effectiveness of a new method for teaching reading. She selects five hundred first grade students in Long Beach District and randomly divides them into two groups. Group 1 is taught by means of the new method, while Group 2 is taught via traditional methods. The same teacher is assigned to teach both groups. At the end of the year, an achievement test is administered and the results of the two groups compared. Determine: a. Population b. Sample c. Subject units d. Response variable e. Treatment f. Levels of the treatment g. Predictors h. Designed/Observational study? a. Population First graders in District 203 b. Sample: 500 hundred first grade students in that district c. Subjects units: 500 Students d. Response Variable: Test scores e. Treatment: Method of teaching f. Levels of the treatment: 2 (New versus the traditional method of teaching) g. Predictors: Grades, Teachers, School District h. Designed/Observational study: Designed experiment Sampling Techniques 1. Random sampling (Simple random sample) In this technique, each member of a population has an equal chance of being selected. Each member of the population is assigned a number. You can select a random sample of any population by using a calculator or computer to generate random numbers. A list of students in elementary statistics is obtained in which the individuals are numbered 1 to 65. A professor randomly selects 12 of the students. IT IS DIFFICULT TO OBTAIN A SIMPLE RANDOM SAMPLE (SRS) IN PRACTICE! 2. Stratified sampling (Separating the population into non-overlapping groups) In this technique, a population is divided into at least two different subsets, called strata that share a similar characteristic. A sample is then randomly selected from each. (Conducting a SRS separately within each strata) The defining characteristic can be gender, age, or even political preference. Using a stratified sample ensures that each segment of a population is represented. For this reason, stratified samples are usually preferred over simple random samples. A researcher segments the population of car owners into four groups: Ford, General Motors, Chrysler, and foreign. She obtains a random sample from each group and conducts a survey. 3. Cluster sampling To select a cluster sample, divide a population into groups, called clusters, then select all of the members in one or more, but not all, of the clusters. This technique is often used because of practical or economical restrictions, but data collected may be less reliable than when a random sample is used. A researcher randomly selects 5 of the 70 hospitals in Long Beach area and then surveys all of the surgical doctors in each hospital. 4. Systematic sampling In this technique, a population is ordered in some way and then members of the population are selected at regular intervals. The selection process can start at any randomly chosen point. An advantage of systematic sample is that it is easy to use. An interviewer in a mall is told to survey every fifth shopper, starting with the second. Systematic sampling (When the population size is known) 3

Procedure for systematic sampling when the population size is known: 1. N (Pop. Size) 2. n (sample size) 3. Form N/n and round it down to the nearest integer and call it K 4. Select a number between 1 and K and call this number p 5. the sample will consist of the following individuals: p, p + k, p + 2 k,, p +(n-1) k 5. Convenience sampling (Self-selected individuals) In this technique, simply use any members of population that are readily available. This method is likely to produce biased results. (Usually contains the most bias) I. A radio station asks its listeners to call in their opinion regarding the use of American forces in Peacekeeping missions. II. A professor would like to know how many hours per week college students spend watching television. She is teaching two large classes and uses all students in those classes as her sample. Sampling error: The difference between a characteristic of the entire population and a sample of that population. Sources of Errors in sampling: a. Sampling error: The error that results from using sampling to estimate information regarding a population. The size of the sample, the amount of variation that exists in the population (i.e. How different the members of the population are from one another with regard to the variable being studied) b. Non-sampling error: Respondent s lying, measurement errors, poorly worded questions, and the error due to people not responding. Sources of bias: Sampling bias: A systematic tendency to exclude one type of person from the sample. A large sample will not solve this problem. Non-Response bias: This is when people who do not answer questions are different from people who do. Undercoverage bias: This is when some groups in the population are left out of the process of choosing the sample. Response bias: This is when the individuals do not reply truthfully. Voluntary Response bias: This is when the survey relies on individuals who volunteer to respond. (e.g. Internet surveys) and they are unscientific and unreliable. The following can affect the validity of a study: a. The method in which a sample was obtained b. Wording of questions c. The order in which questions are presented A magazine is conducting a study on the effects of infidelity in a marriage. The editor randomly select 400 women whose husbands were unfaithful and ask Do you believe a marriage can survive when the husband destroys the trust that must exist between husband and wife? What is wrong with wording of the question that was asked? Ans: Do you believe that a marriage can be maintained after an extramarital relation? A key component of a well-designed experiment is RANDOMIZATION 4

Variables and Types of Data 1. The nominal level of measurement : Refers to data consists of names, or categories so that the data cannot be arranged in any specific ordering scheme. a. Sex ( Male, Female) b. Race (White, African American, Hispanic,...) c. Colors of car in the street. 2. The ordinal level of measurement The ordinal level of measurement classifies data into categories that can be ranked but differences between the ranks cannot be determined. I. Letter Grades such as A = superior, B = good, C = average, D = poor, F = Fail II. Size of cars in the street: Small, Medium, and Large. 3. The interval level of measurement: Like ordinal, with additional property that differences between units of data can be defined, but there is no meaningful zero. a. Temperature, as we know there is no natural 0. b. The years c. IQ scores 4. The ratio level of measurement: Like the interval measurement, and there exists a natural zero. In addition, true ratios and for the same variable. differences both exist a. Weight b. Height c. Age d. Length e. Distance 5