What is Regularization? Example by Sean Owen
|
|
- MargaretMargaret Reed
- 5 years ago
- Views:
Transcription
1 What is Regularization? Example by Sean Owen
2 What is Regularization? Name3 Species Size Threat Bo snake small friendly Miley dog small friendly Fifi cat small enemy Muffy cat small friendly Rufus dog large friendly Jebediah snail small friendly Allison dog large enemy Tomi cat large enemy Pets with four-letter names are enemies, as are large dogs with names that begin with A, snakes are not enemies.
3 What is Regularization?
4 What is Regularization? Pets with four-letter names are enemies Large dogs with names that begin with A Snakes are not enemies
5 What is Regularization? Large dogs are enemies Cats are enemies
6 What is Regularization? Regularization discourages complexity in the prediction logic that is learned from the training data.
7 Model Complexity Model Error = Bias2 + Variance + Irreducible Error sum = Estimation Error
8 Optimum Model Complexity Error Model Complexity Bias can reduce variance and lower MSE overall Total Error Variance Bias2 Model Complexity
9 ? Bias Variation Quiz Select the description that best suits each illustration. Put its letter in the textbox: C A B A. High bias, low variance B. Low bias, high variance C. Middle bias and variance, overall low estimation error
10 ? Bias Variation Quiz Select the description that best suits each illustration. Put its letter in the textbox: High bias low variance Underfitting training data
11 ? Bias Variation Quiz Select the description that best suits each illustration. Put its letter in the textbox: Low bias High variance Overfitting training data
12 ? Bias Variation Quiz Select the description that best suits each illustration. Put its letter in the textbox: Not overfitting training data nor underfitting training data. Small mistakes are made in predicting training data labels, but low error on predicting future data.
13 ? Target Quiz Determine the bias and variance case for each target. Use LB for low bias, HB for high bias, LV for low variance, and HV for high variance. For example: LBLV for low bias low variance LVLB HVLB LVHB HVHB
14 Goldilocks Principle Prediction Error High Bias Low Variance Low Bias High Variance Test Sample Underfitting Training Sample Low Overfitting Bias trade-off Model Complexity High
15 ? Overfitting Quiz Select the best method for addressing overfitting when there are a lot of features: Reduce the number of model parameters (often implying fewer features and feature transformations) Keep all features but reduce the magnitude of some features
16 MLE Maximum Likelihood Estimate (MLE) is: A technique for estimating model parameters. It answers the question: Which parameters will most likely to characterize the dataset?
17 MLE Traditional statistics: Number of parameters << Train set size d << n MLE performs well Asymptotic optimality kicks in MLE: Performs well Asymptotic optimality kicks in *Asymptotic optimality: as your training set approaches infinity, the model will converge to the ground truth or nature that generates the data, and the convergence happens at the fastest possible rate.
18 MLE Traditional statistics: Number of parameters << Train set size d << n MLE performs well Asymptotic optimality kicks in MLE: Performs well Asymptotic optimality kicks in Asymptotic variance of the MLE is the best possible
19 MLE Traditional statistics: Number of parameters << Train set size d << n MLE performs well Asymptotic optimality kicks in MLE: Performs well Asymptotic optimality kicks in Asymptotic variance of the MLE is the best possible The inverse Fisher information.
20 MLE Traditional statistics: Number of parameters << Train set size MLE: Performs well Asymptotic optimality kicks in d << n d << n MLE performs well MLE performs poorly Asymptotic optimality kicks in Overfits training data
21 Mean Squared Error 2
22 Mean Squared Error MSE = Bias2 + Variance MLE of Linear Regression is unbiased MLE may be biased in general 2
23 Mean Squared Error Bias can reduce variance and lower MSE overall!
24 Mean Squared Error Bias to simplicity can reduce variance and lower MSE overall: Reducing 2
25 Mean Squared Error Simplicity Bias lower estimation variance better overall estimation accuracy & less overfitting of training data
26 Regularization Methods James-Stein Shrinkage Lasso Estimator Breiman s Garrote Elastic Net Estimator Ridge Estimator
27 James-Stein Shrinkage Traditional statistical theory: no other estimation rule for means is uniformly better than the observed average. James-Stein Estimator: when estimating multiple means: shrink all individual averages towards a grand average.
28 James-Stein Shrinkage James-Stein estimator shrinks all the dimensions of uniformly.
29 James-Stein Shrinkage??? What is the proportion of: people that will vote for Hillary Clinton? babies in China that are girls? Americans that have light colored eyes? What kind of study is this?
30 James-Stein Shrinkage!!! The James-Stein estimate of the proportion of voters for Hillary Clinton depends on. Chinese baby data and eye color!!!!
31 James-Stein Shrinkage Estimation Establishment Jame s-stei n Estim ator
32 James-Stein Shrinkage Estimation Jame s-stei n Estim ator
33 James-Stein Shrinkage li ab ent t s E s h m
34 James-Stein Shrinkage James-Stein estimator shrinks all the dimensions of uniformly.
35 ? James-Stein Estimator Quiz
36 ? James-Stein Weakness Quiz What is a weakness of the James-Stein Estimator?
37 ? James-Stein Weakness Quiz
38 ? James-Stein Weakness Quiz
39 Breiman s Garrote where subject to When c<d some or all of the components of reduced in absolute value toward 0. When we constrain it is called the non-negative garotte. The parameter c is a tuning parameter and several different values are typically considered.
40 Breiman s Garrote Breiman s Garrote: Can shrink some dimensions more than others as needed Computation can be expensive in high dimensions
41 Ridge Regression subject to where
42 Ridge Regression Equivalent to maximizing
43 Ridge Regression Equivalence can be seen using Lagrange multipliers subject to =>
44 Ridge Regression subject to =>
45 Ridge Regression Solve for different values of lambda, evaluate each solution on hold-out set and select model that performs best
46 Ridge Trace for x4 Ridge Trace
47 ? Estimator Comparison Quiz Model: E(Y) = 0 + X1 + X2 + e ; e ~ N(0,1) The measured variables are: x1,x2,x3 x1 and x2 are U(0,1) x3 = 10 * X1 + unif(0,1) corr(x1,x3) = sqrt(100/101) = 0.995
48 ? Estimator Comparison Quiz # OLS fit of 3-variable model using correlated x3. olsc <- lm(y~ x1 + x2 + x3c) summary(olsc)
49 ? Estimator Comparison Quiz
50 ? Ridge Regression Quiz Perform ridge regression for both independent and correlated variables.
51 ? Ridge Regression Quiz ridgec <- lm.ridge (y ~ x1+x2+x3c, lambda = seq(0,.1,.001))
52 ? Ridge Regression Quiz
53 ? Ridge Regression Quiz
54 Lasso Estimator LASSO = least absolute shrinkage and selection operator Model Characteristic Ridge Must include all or none of the coefficients Lasso Does parameter estimation and variable selection
55 Lasso Estimator Ridge Lasso subject to
56 Lasso Estimator Penalty Model Result l1 lasso Encourages sparse solution. For a given some of the components of lasso will be zero l2 ridge Does not encourage sparsity. The coefficients of ridge may be close to zero but not precisely zero.
57 ? Lasso Ridge Quiz In each illustration the colored lines are paths of regression coefficients shrinking to zero. Label each as either lasso or ridge estimation: Lasso Ridge Regression
58 Linear Regression The methods in this lesson apply generally to maximum likelihood problems. The linear regression setting enables derivation of closed forms which provide insight into the differences between different regularizers. However, ridge and lasso can also be applied for other models like logistic regression (and they are very successfully applied).
59 Linear Regression Assume: X: a matrix of training data X(1),..., X(n) Y : vector whose entries are training labels Y(1),..., Y(n) Then the conditional log likelihood is: -(Y - Xθ)T(Y - Xθ) + c Setting its gradient to zero: XTXθ + XTY = 0 Implies: ^mle = (XTX)-1XTY θ
60 Linear Regression Special case of orthonormal training examples: XTX = I : ^θjmle = XTY (the orthonormal projection of the columns of X on Y).
61 Linear Regression In the case of ridge regularization: setting the penalized log likelihood gradient to zero
62 Linear Regression In the orthogonal case: Each component is shrunk by the same constant factor
63 Linear Regression In the case of lasso regression: where
64 Linear Regression In the case of lasso regression, orthonormal case: The penalized log likelihood is:
65 Linear Regression In the case of lasso regression, orthonormal case: The penalized log likelihood is:
66 Linear Regression In the case of lasso regression, orthonormal case: The penalized log likelihood is: Which decomposes into a sum of d maximization problems: That can be solved independently For each j, if the objective function will be negative unless then the objective function = 0
67 Linear Regression In the case of lasso regression: If setting Results in a 0 objective function
68 Linear Regression In the case of lasso regression: If Then: The objective function is differentiable Set the derivative to 0 To obtain:
69 Linear Regression In the case of lasso regression: Combining the two cases we get: where Therefore: The sparsity encouraging threshold nature of lasso as it zeros out small coefficients Recall: ridge will make them smaller, but non-zero.
70 Lasso Plots Install.packages( glmnet ) library(glmnet)
71 ? Uncorrelated Data Plot The predictors go into the model in the order of their magnitude of the true linear regression coefficient.
72 ? In the programming node, write the commands needed to use the lasso model on the mtcars dataset. Your plot should be similar to mine. Mtcars and lasso Quiz
73 ? Mtcars and lasso Quiz Install.packages( glmnet ) library(glmnet) mtcarslasso = glmnet(as.matrix(mtcars[-1]), mtcars [,1]) plot(mtcarslasso, col=1:12, lwd=2) legend('bottomleft',legend=names(mtcars)[-1],col=1: 12,lty=1,lwd=1)
74 MLE Comparisons
75 Elastic Net Estimator Linear regression model trained with L1 and L2 Very common in high dimensional modeling in industry
76 Elastic Net Estimator subject to
77 ? Elastic Net Graph The lasso plot The elastic net plot
Analysis of Rheumatoid Arthritis Data using Logistic Regression and Penalized Approach
University of South Florida Scholar Commons Graduate Theses and Dissertations Graduate School November 2015 Analysis of Rheumatoid Arthritis Data using Logistic Regression and Penalized Approach Wei Chen
More informationChapter 11: Advanced Remedial Measures. Weighted Least Squares (WLS)
Chapter : Advanced Remedial Measures Weighted Least Squares (WLS) When the error variance appears nonconstant, a transformation (of Y and/or X) is a quick remedy. But it may not solve the problem, or it
More informationCSE 258 Lecture 2. Web Mining and Recommender Systems. Supervised learning Regression
CSE 258 Lecture 2 Web Mining and Recommender Systems Supervised learning Regression Supervised versus unsupervised learning Learning approaches attempt to model data in order to solve a problem Unsupervised
More informationUNIVERSITY of PENNSYLVANIA CIS 520: Machine Learning Final, Fall 2014
UNIVERSITY of PENNSYLVANIA CIS 520: Machine Learning Final, Fall 2014 Exam policy: This exam allows two one-page, two-sided cheat sheets (i.e. 4 sides); No other materials. Time: 2 hours. Be sure to write
More informationIntroduction to Machine Learning. Katherine Heller Deep Learning Summer School 2018
Introduction to Machine Learning Katherine Heller Deep Learning Summer School 2018 Outline Kinds of machine learning Linear regression Regularization Bayesian methods Logistic Regression Why we do this
More informationUNIVERSITY of PENNSYLVANIA CIS 520: Machine Learning Midterm, 2016
UNIVERSITY of PENNSYLVANIA CIS 520: Machine Learning Midterm, 2016 Exam policy: This exam allows one one-page, two-sided cheat sheet; No other materials. Time: 80 minutes. Be sure to write your name and
More informationArticle from. Forecasting and Futurism. Month Year July 2015 Issue Number 11
Article from Forecasting and Futurism Month Year July 2015 Issue Number 11 Calibrating Risk Score Model with Partial Credibility By Shea Parkes and Brad Armstrong Risk adjustment models are commonly used
More informationVARIABLE SELECTION WHEN CONFRONTED WITH MISSING DATA
VARIABLE SELECTION WHEN CONFRONTED WITH MISSING DATA by Melissa L. Ziegler B.S. Mathematics, Elizabethtown College, 2000 M.A. Statistics, University of Pittsburgh, 2002 Submitted to the Graduate Faculty
More informationApplying Machine Learning Methods in Medical Research Studies
Applying Machine Learning Methods in Medical Research Studies Daniel Stahl Department of Biostatistics and Health Informatics Psychiatry, Psychology & Neuroscience (IoPPN), King s College London daniel.r.stahl@kcl.ac.uk
More informationLinear Regression Analysis
Linear Regression Analysis WILEY SERIES IN PROBABILITY AND STATISTICS Established by WALTER A. SHEWHART and SAMUEL S. WILKS Editors: David J. Balding, Peter Bloomfield, Noel A. C. Cressie, Nicholas I.
More informationPart [2.1]: Evaluation of Markers for Treatment Selection Linking Clinical and Statistical Goals
Part [2.1]: Evaluation of Markers for Treatment Selection Linking Clinical and Statistical Goals Patrick J. Heagerty Department of Biostatistics University of Washington 174 Biomarkers Session Outline
More informationMachine Learning to Inform Breast Cancer Post-Recovery Surveillance
Machine Learning to Inform Breast Cancer Post-Recovery Surveillance Final Project Report CS 229 Autumn 2017 Category: Life Sciences Maxwell Allman (mallman) Lin Fan (linfan) Jamie Kang (kangjh) 1 Introduction
More informationSelection and Combination of Markers for Prediction
Selection and Combination of Markers for Prediction NACC Data and Methods Meeting September, 2010 Baojiang Chen, PhD Sarah Monsell, MS Xiao-Hua Andrew Zhou, PhD Overview 1. Research motivation 2. Describe
More informationST440/550: Applied Bayesian Statistics. (10) Frequentist Properties of Bayesian Methods
(10) Frequentist Properties of Bayesian Methods Calibrated Bayes So far we have discussed Bayesian methods as being separate from the frequentist approach However, in many cases methods with frequentist
More informationRISK PREDICTION MODEL: PENALIZED REGRESSIONS
RISK PREDICTION MODEL: PENALIZED REGRESSIONS Inspired from: How to develop a more accurate risk prediction model when there are few events Menelaos Pavlou, Gareth Ambler, Shaun R Seaman, Oliver Guttmann,
More informationReview: Logistic regression, Gaussian naïve Bayes, linear regression, and their connections
Review: Logistic regression, Gaussian naïve Bayes, linear regression, and their connections New: Bias-variance decomposition, biasvariance tradeoff, overfitting, regularization, and feature selection Yi
More informationModeling Sentiment with Ridge Regression
Modeling Sentiment with Ridge Regression Luke Segars 2/20/2012 The goal of this project was to generate a linear sentiment model for classifying Amazon book reviews according to their star rank. More generally,
More informationSmall Sample Bayesian Factor Analysis. PhUSE 2014 Paper SP03 Dirk Heerwegh
Small Sample Bayesian Factor Analysis PhUSE 2014 Paper SP03 Dirk Heerwegh Overview Factor analysis Maximum likelihood Bayes Simulation Studies Design Results Conclusions Factor Analysis (FA) Explain correlation
More informationAnale. Seria Informatică. Vol. XVI fasc Annals. Computer Science Series. 16 th Tome 1 st Fasc. 2018
HANDLING MULTICOLLINEARITY; A COMPARATIVE STUDY OF THE PREDICTION PERFORMANCE OF SOME METHODS BASED ON SOME PROBABILITY DISTRIBUTIONS Zakari Y., Yau S. A., Usman U. Department of Mathematics, Usmanu Danfodiyo
More informationCase Studies of Signed Networks
Case Studies of Signed Networks Christopher Wang December 10, 2014 Abstract Many studies on signed social networks focus on predicting the different relationships between users. However this prediction
More information3. Model evaluation & selection
Foundations of Machine Learning CentraleSupélec Fall 2016 3. Model evaluation & selection Chloé-Agathe Azencot Centre for Computational Biology, Mines ParisTech chloe-agathe.azencott@mines-paristech.fr
More informationClassification. Methods Course: Gene Expression Data Analysis -Day Five. Rainer Spang
Classification Methods Course: Gene Expression Data Analysis -Day Five Rainer Spang Ms. Smith DNA Chip of Ms. Smith Expression profile of Ms. Smith Ms. Smith 30.000 properties of Ms. Smith The expression
More informationStudy of cigarette sales in the United States Ge Cheng1, a,
2nd International Conference on Economics, Management Engineering and Education Technology (ICEMEET 2016) 1Department Study of cigarette sales in the United States Ge Cheng1, a, of pure mathematics and
More informationMULTIPLE REGRESSION OF CPS DATA
MULTIPLE REGRESSION OF CPS DATA A further inspection of the relationship between hourly wages and education level can show whether other factors, such as gender and work experience, influence wages. Linear
More information11/18/2013. Correlational Research. Correlational Designs. Why Use a Correlational Design? CORRELATIONAL RESEARCH STUDIES
Correlational Research Correlational Designs Correlational research is used to describe the relationship between two or more naturally occurring variables. Is age related to political conservativism? Are
More informationNumerical Integration of Bivariate Gaussian Distribution
Numerical Integration of Bivariate Gaussian Distribution S. H. Derakhshan and C. V. Deutsch The bivariate normal distribution arises in many geostatistical applications as most geostatistical techniques
More informationMultivariate Regression with Small Samples: A Comparison of Estimation Methods W. Holmes Finch Maria E. Hernández Finch Ball State University
Multivariate Regression with Small Samples: A Comparison of Estimation Methods W. Holmes Finch Maria E. Hernández Finch Ball State University High dimensional multivariate data, where the number of variables
More informationStructured Association Advanced Topics in Computa8onal Genomics
Structured Association 02-715 Advanced Topics in Computa8onal Genomics Structured Association Lasso ACGTTTTACTGTACAATT Gflasso (Kim & Xing, 2009) ACGTTTTACTGTACAATT Greater power Fewer false posi2ves Phenome
More informationTesting Statistical Models to Improve Screening of Lung Cancer
Testing Statistical Models to Improve Screening of Lung Cancer 1 Elliot Burghardt: University of Iowa Daren Kuwaye: University of Hawai i at Mānoa Iowa Summer Institute in Biostatistics - University of
More information11/24/2017. Do not imply a cause-and-effect relationship
Correlational research is used to describe the relationship between two or more naturally occurring variables. Is age related to political conservativism? Are highly extraverted people less afraid of rejection
More informationComputer Age Statistical Inference. Algorithms, Evidence, and Data Science. BRADLEY EFRON Stanford University, California
Computer Age Statistical Inference Algorithms, Evidence, and Data Science BRADLEY EFRON Stanford University, California TREVOR HASTIE Stanford University, California ggf CAMBRIDGE UNIVERSITY PRESS Preface
More informationRegularization Methods for Fitting Linear Models with Small Sample Sizes: Fitting the Lasso Estimator using R
A peer-reviewed electronic journal. Copyright is retained by the first or sole author, who grants right of first publication to Practical Assessment, Research & Evaluation. Permission is granted to distribute
More informationCHAPTER 6 HUMAN BEHAVIOR UNDERSTANDING MODEL
127 CHAPTER 6 HUMAN BEHAVIOR UNDERSTANDING MODEL 6.1 INTRODUCTION Analyzing the human behavior in video sequences is an active field of research for the past few years. The vital applications of this field
More informationAPPENDIX D REFERENCE AND PREDICTIVE VALUES FOR PEAK EXPIRATORY FLOW RATE (PEFR)
APPENDIX D REFERENCE AND PREDICTIVE VALUES FOR PEAK EXPIRATORY FLOW RATE (PEFR) Lung function is related to physical characteristics such as age and height. In order to assess the Peak Expiratory Flow
More informationNature Neuroscience: doi: /nn Supplementary Figure 1. Behavioral training.
Supplementary Figure 1 Behavioral training. a, Mazes used for behavioral training. Asterisks indicate reward location. Only some example mazes are shown (for example, right choice and not left choice maze
More informationRidge regression for risk prediction
Ridge regression for risk prediction with applications to genetic data Erika Cule and Maria De Iorio Imperial College London Department of Epidemiology and Biostatistics School of Public Health May 2012
More informationSupersparse Linear Integer Models for Interpretable Prediction. Berk Ustun Stefano Tracà Cynthia Rudin INFORMS 2013
Supersparse Linear Integer Models for Interpretable Prediction Berk Ustun Stefano Tracà Cynthia Rudin INFORMS 2013 CHADS 2 Scoring System Condition Points Congestive heart failure 1 Hypertension 1 Age
More informationStewart Calculus: Early Transcendentals, 8th Edition
Math 1271-1272-2263 Textbook - Table of Contents http://www.slader.com/textbook/9781285741550-stewart-calculus-early-transcendentals-8thedition/ Math 1271 - Calculus I - Chapters 2-6 Math 1272 - Calculus
More informationThe impact of pre-selected variance inflation factor thresholds on the stability and predictive power of logistic regression models in credit scoring
Volume 31 (1), pp. 17 37 http://orion.journals.ac.za ORiON ISSN 0529-191-X 2015 The impact of pre-selected variance inflation factor thresholds on the stability and predictive power of logistic regression
More informationm 11 m.1 > m 12 m.2 risk for smokers risk for nonsmokers
SOCY5061 RELATIVE RISKS, RELATIVE ODDS, LOGISTIC REGRESSION RELATIVE RISKS: Suppose we are interested in the association between lung cancer and smoking. Consider the following table for the whole population:
More informationCitation for published version (APA): Ebbes, P. (2004). Latent instrumental variables: a new approach to solve for endogeneity s.n.
University of Groningen Latent instrumental variables Ebbes, P. IMPORTANT NOTE: You are advised to consult the publisher's version (publisher's PDF) if you wish to cite from it. Please check the document
More informationRidge Regression and Multicollinearity: An In-Depth Review
Ridge Regression and Multicollinearity: An In-Depth Review Deanna Schreiber-Gregory Henry M Jackson Foundation for the Advancement of Military Medicine SAS and all other SAS Institute Inc. product or service
More information4. Model evaluation & selection
Foundations of Machine Learning CentraleSupélec Fall 2017 4. Model evaluation & selection Chloé-Agathe Azencot Centre for Computational Biology, Mines ParisTech chloe-agathe.azencott@mines-paristech.fr
More informationBayes Linear Statistics. Theory and Methods
Bayes Linear Statistics Theory and Methods Michael Goldstein and David Wooff Durham University, UK BICENTENNI AL BICENTENNIAL Contents r Preface xvii 1 The Bayes linear approach 1 1.1 Combining beliefs
More informationFind the slope of the line that goes through the given points. 1) (-9, -68) and (8, 51) 1)
Math 125 Semester Review Problems Name Find the slope of the line that goes through the given points. 1) (-9, -68) and (8, 51) 1) Solve the inequality. Graph the solution set, and state the solution set
More informationSubLasso:a feature selection and classification R package with a. fixed feature subset
SubLasso:a feature selection and classification R package with a fixed feature subset Youxi Luo,3,*, Qinghan Meng,2,*, Ruiquan Ge,2, Guoqin Mai, Jikui Liu, Fengfeng Zhou,#. Shenzhen Institutes of Advanced
More informationJ2.6 Imputation of missing data with nonlinear relationships
Sixth Conference on Artificial Intelligence Applications to Environmental Science 88th AMS Annual Meeting, New Orleans, LA 20-24 January 2008 J2.6 Imputation of missing with nonlinear relationships Michael
More informationNotes for laboratory session 2
Notes for laboratory session 2 Preliminaries Consider the ordinary least-squares (OLS) regression of alcohol (alcohol) and plasma retinol (retplasm). We do this with STATA as follows:. reg retplasm alcohol
More information12/31/2016. PSY 512: Advanced Statistics for Psychological and Behavioral Research 2
PSY 512: Advanced Statistics for Psychological and Behavioral Research 2 Introduce moderated multiple regression Continuous predictor continuous predictor Continuous predictor categorical predictor Understand
More informationHPS301 Exam Notes- Contents
HPS301 Exam Notes- Contents Week 1 Research Design: What characterises different approaches 1 Experimental Design 1 Key Features 1 Criteria for establishing causality 2 Validity Internal Validity 2 Threats
More informationResponse to Mease and Wyner, Evidence Contrary to the Statistical View of Boosting, JMLR 9:1 26, 2008
Journal of Machine Learning Research 9 (2008) 59-64 Published 1/08 Response to Mease and Wyner, Evidence Contrary to the Statistical View of Boosting, JMLR 9:1 26, 2008 Jerome Friedman Trevor Hastie Robert
More informationPredicting Diabetes and Heart Disease Using Features Resulting from KMeans and GMM Clustering
Predicting Diabetes and Heart Disease Using Features Resulting from KMeans and GMM Clustering Kunal Sharma CS 4641 Machine Learning Abstract Clustering is a technique that is commonly used in unsupervised
More informationbivariate analysis: The statistical analysis of the relationship between two variables.
bivariate analysis: The statistical analysis of the relationship between two variables. cell frequency: The number of cases in a cell of a cross-tabulation (contingency table). chi-square (χ 2 ) test for
More informationSawtooth Software. MaxDiff Analysis: Simple Counting, Individual-Level Logit, and HB RESEARCH PAPER SERIES. Bryan Orme, Sawtooth Software, Inc.
Sawtooth Software RESEARCH PAPER SERIES MaxDiff Analysis: Simple Counting, Individual-Level Logit, and HB Bryan Orme, Sawtooth Software, Inc. Copyright 009, Sawtooth Software, Inc. 530 W. Fir St. Sequim,
More informationSample size and power calculations in Mendelian randomization with a single instrumental variable and a binary outcome
Sample size and power calculations in Mendelian randomization with a single instrumental variable and a binary outcome Stephen Burgess July 10, 2013 Abstract Background: Sample size calculations are an
More informationMS&E 226: Small Data
MS&E 226: Small Data Lecture 10: Introduction to inference (v2) Ramesh Johari ramesh.johari@stanford.edu 1 / 17 What is inference? 2 / 17 Where did our data come from? Recall our sample is: Y, the vector
More informationIntroduction to Econometrics
Global edition Introduction to Econometrics Updated Third edition James H. Stock Mark W. Watson MyEconLab of Practice Provides the Power Optimize your study time with MyEconLab, the online assessment and
More informationThe Relationship between Crime and CCTV Installation Status by Using Artificial Neural Networks
, pp.150-157 http://dx.doi.org/10.14257/astl.2016.139.34 The Relationship between Crime and CCTV Installation Status by Using Artificial Neural Networks Ahyoung Jung 1, Changjae Kim 2, Dept. S/W Engr.
More informationA Comparative Study of Some Estimation Methods for Multicollinear Data
International Journal of Engineering and Applied Sciences (IJEAS) A Comparative Study of Some Estimation Methods for Multicollinear Okeke Evelyn Nkiruka, Okeke Joseph Uchenna Abstract This article compares
More informationIEEE SIGNAL PROCESSING LETTERS, VOL. 13, NO. 3, MARCH A Self-Structured Adaptive Decision Feedback Equalizer
SIGNAL PROCESSING LETTERS, VOL 13, NO 3, MARCH 2006 1 A Self-Structured Adaptive Decision Feedback Equalizer Yu Gong and Colin F N Cowan, Senior Member, Abstract In a decision feedback equalizer (DFE),
More informationPrediction of Malignant and Benign Tumor using Machine Learning
Prediction of Malignant and Benign Tumor using Machine Learning Ashish Shah Department of Computer Science and Engineering Manipal Institute of Technology, Manipal University, Manipal, Karnataka, India
More informationNoise Cancellation using Adaptive Filters Algorithms
Noise Cancellation using Adaptive Filters Algorithms Suman, Poonam Beniwal Department of ECE, OITM, Hisar, bhariasuman13@gmail.com Abstract Active Noise Control (ANC) involves an electro acoustic or electromechanical
More informationMEA DISCUSSION PAPERS
Inference Problems under a Special Form of Heteroskedasticity Helmut Farbmacher, Heinrich Kögel 03-2015 MEA DISCUSSION PAPERS mea Amalienstr. 33_D-80799 Munich_Phone+49 89 38602-355_Fax +49 89 38602-390_www.mea.mpisoc.mpg.de
More informationisc ove ring i Statistics sing SPSS
isc ove ring i Statistics sing SPSS S E C O N D! E D I T I O N (and sex, drugs and rock V roll) A N D Y F I E L D Publications London o Thousand Oaks New Delhi CONTENTS Preface How To Use This Book Acknowledgements
More informationMultiple Regression. James H. Steiger. Department of Psychology and Human Development Vanderbilt University
Multiple Regression James H. Steiger Department of Psychology and Human Development Vanderbilt University James H. Steiger (Vanderbilt University) Multiple Regression 1 / 19 Multiple Regression 1 The Multiple
More informationBringing machine learning to the point of care to inform suicide prevention
Bringing machine learning to the point of care to inform suicide prevention Gregory Simon and Susan Shortreed Kaiser Permanente Washington Health Research Institute Don Mordecai The Permanente Medical
More informationChemometrics for Analysis of NIR Spectra on Pharmaceutical Oral Dosages
Chemometrics for Analysis of NIR Spectra on Pharmaceutical Oral Dosages William Welsh, Sastry Isukapalli, Rodolfo Romañach, Bozena Kohn-Michniak, Alberto Cuitino, Fernando Muzzio NATIONAL SCIENCE FOUNDATION
More informationConsumer Review Analysis with Linear Regression
Consumer Review Analysis with Linear Regression Cliff Engle Antonio Lupher February 27, 2012 1 Introduction Sentiment analysis aims to classify people s sentiments towards a particular subject based on
More information1 Introduction. st0020. The Stata Journal (2002) 2, Number 3, pp
The Stata Journal (22) 2, Number 3, pp. 28 289 Comparative assessment of three common algorithms for estimating the variance of the area under the nonparametric receiver operating characteristic curve
More informationSUPPLEMENTAL MATERIAL
1 SUPPLEMENTAL MATERIAL Response time and signal detection time distributions SM Fig. 1. Correct response time (thick solid green curve) and error response time densities (dashed red curve), averaged across
More informationHow much data is enough? Predicting how accuracy varies with training data size
How much data is enough? Predicting how accuracy varies with training data size Mark Johnson (with Dat Quoc Nguyen) Macquarie University Sydney, Australia September 4, 2017 1 / 33 Outline Introduction
More informationNetwork Estimation and Sparsity
Chapter Network Estimation and Sparsity Abstract Network models, in which psychopathological disorders are conceptualized as a complex interplay of psychological and biological components, have become
More informationSampling Weights, Model Misspecification and Informative Sampling: A Simulation Study
Sampling Weights, Model Misspecification and Informative Sampling: A Simulation Study Marianne (Marnie) Bertolet Department of Statistics Carnegie Mellon University Abstract Linear mixed-effects (LME)
More information10-1 MMSE Estimation S. Lall, Stanford
0 - MMSE Estimation S. Lall, Stanford 20.02.02.0 0 - MMSE Estimation Estimation given a pdf Minimizing the mean square error The minimum mean square error (MMSE) estimator The MMSE and the mean-variance
More informationMLE #8. Econ 674. Purdue University. Justin L. Tobias (Purdue) MLE #8 1 / 20
MLE #8 Econ 674 Purdue University Justin L. Tobias (Purdue) MLE #8 1 / 20 We begin our lecture today by illustrating how the Wald, Score and Likelihood ratio tests are implemented within the context of
More informationPrediction and Inference under Competing Risks in High Dimension - An EHR Demonstration Project for Prostate Cancer
Prediction and Inference under Competing Risks in High Dimension - An EHR Demonstration Project for Prostate Cancer Ronghui (Lily) Xu Division of Biostatistics and Bioinformatics Department of Family Medicine
More informationDepartment of Allergy-Pneumonology, Penteli Children s Hospital, Palaia Penteli, Greece
International Pediatrics Volume 2009, Article ID 952042, 6 pages doi:10.1155/2009/952042 Clinical Study Logistic Regression and Linear Discriminant Analyses in Evaluating Factors Associated with Asthma
More informationMethods for Predicting Type 2 Diabetes
Methods for Predicting Type 2 Diabetes CS229 Final Project December 2015 Duyun Chen 1, Yaxuan Yang 2, and Junrui Zhang 3 Abstract Diabetes Mellitus type 2 (T2DM) is the most common form of diabetes [WHO
More informationMinimizing Uncertainty in Property Casualty Loss Reserve Estimates Chris G. Gross, ACAS, MAAA
Minimizing Uncertainty in Property Casualty Loss Reserve Estimates Chris G. Gross, ACAS, MAAA The uncertain nature of property casualty loss reserves Property Casualty loss reserves are inherently uncertain.
More informationLecture II: Difference in Difference. Causality is difficult to Show from cross
Review Lecture II: Regression Discontinuity and Difference in Difference From Lecture I Causality is difficult to Show from cross sectional observational studies What caused what? X caused Y, Y caused
More informationCHAPTER VI RESEARCH METHODOLOGY
CHAPTER VI RESEARCH METHODOLOGY 6.1 Research Design Research is an organized, systematic, data based, critical, objective, scientific inquiry or investigation into a specific problem, undertaken with the
More informationLinear and Nonlinear Optimization
Linear and Nonlinear Optimization SECOND EDITION Igor Griva Stephen G. Nash Ariela Sofer George Mason University Fairfax, Virginia Society for Industrial and Applied Mathematics Philadelphia Contents Preface
More informationIdentification of Tissue Independent Cancer Driver Genes
Identification of Tissue Independent Cancer Driver Genes Alexandros Manolakos, Idoia Ochoa, Kartik Venkat Supervisor: Olivier Gevaert Abstract Identification of genomic patterns in tumors is an important
More informationParameter Estimation of Cognitive Attributes using the Crossed Random- Effects Linear Logistic Test Model with PROC GLIMMIX
Paper 1766-2014 Parameter Estimation of Cognitive Attributes using the Crossed Random- Effects Linear Logistic Test Model with PROC GLIMMIX ABSTRACT Chunhua Cao, Yan Wang, Yi-Hsin Chen, Isaac Y. Li University
More informationLec 02: Estimation & Hypothesis Testing in Animal Ecology
Lec 02: Estimation & Hypothesis Testing in Animal Ecology Parameter Estimation from Samples Samples We typically observe systems incompletely, i.e., we sample according to a designed protocol. We then
More informationPerformance of Median and Least Squares Regression for Slightly Skewed Data
World Academy of Science, Engineering and Technology 9 Performance of Median and Least Squares Regression for Slightly Skewed Data Carolina Bancayrin - Baguio Abstract This paper presents the concept of
More informationIntelligent Systems. Discriminative Learning. Parts marked by * are optional. WS2013/2014 Carsten Rother, Dmitrij Schlesinger
Intelligent Systems Discriminative Learning Parts marked by * are optional 30/12/2013 WS2013/2014 Carsten Rother, Dmitrij Schlesinger Discriminative models There exists a joint probability distribution
More informationMODEL SELECTION STRATEGIES. Tony Panzarella
MODEL SELECTION STRATEGIES Tony Panzarella Lab Course March 20, 2014 2 Preamble Although focus will be on time-to-event data the same principles apply to other outcome data Lab Course March 20, 2014 3
More informationPredicting Breast Cancer Survival Using Treatment and Patient Factors
Predicting Breast Cancer Survival Using Treatment and Patient Factors William Chen wchen808@stanford.edu Henry Wang hwang9@stanford.edu 1. Introduction Breast cancer is the leading type of cancer in women
More informationCHAPTER ONE CORRELATION
CHAPTER ONE CORRELATION 1.0 Introduction The first chapter focuses on the nature of statistical data of correlation. The aim of the series of exercises is to ensure the students are able to use SPSS to
More informationChapter 1: Exploring Data
Chapter 1: Exploring Data Key Vocabulary:! individual! variable! frequency table! relative frequency table! distribution! pie chart! bar graph! two-way table! marginal distributions! conditional distributions!
More informationHomo heuristicus and the bias/variance dilemma
Homo heuristicus and the bias/variance dilemma Henry Brighton Department of Cognitive Science and Artificial Intelligence Tilburg University, The Netherlands Max Planck Institute for Human Development,
More informationOn cows and test construction
On cows and test construction NIELS SMITS & KEES JAN KAN Research Institute of Child Development and Education University of Amsterdam, The Netherlands SEM WORKING GROUP AMSTERDAM 16/03/2018 Looking at
More informationSmall Group Presentations
Admin Assignment 1 due next Tuesday at 3pm in the Psychology course centre. Matrix Quiz during the first hour of next lecture. Assignment 2 due 13 May at 10am. I will upload and distribute these at the
More informationMMI 409 Spring 2009 Final Examination Gordon Bleil. 1. Is there a difference in depression as a function of group and drug?
MMI 409 Spring 2009 Final Examination Gordon Bleil Table of Contents Research Scenario and General Assumptions Questions for Dataset (Questions are hyperlinked to detailed answers) 1. Is there a difference
More informationPublisher: Pearson Education, Inc. publishing as Prentice Hall
Section I. Correlation with the Mathematics 2009 SOL and Curriculum Framework Rating 6.1 6.2 6.3 6.4 6.5 6.6 6.7 6.8 6.9 6.10 6.11 6.12 6.13 6.14 6.15 6.16 6.17 6.18 6.19 6.20 Section II. Additional Criteria:
More informationSocial Effects in Blau Space:
Social Effects in Blau Space: Miller McPherson and Jeffrey A. Smith Duke University Abstract We develop a method of imputing characteristics of the network alters of respondents in probability samples
More information