HARK: A New Approach for Regression with Functional Predictors

Outline: Motivating Simulation, HARK Model, Results for the Sleep Data, Conclusions

HARK: A New Approach for Regression with Functional Predictors
Dawn Woodard, Operations Research and Information Engineering, Cornell University
Ciprian Crainiceanu (Johns Hopkins), David Ruppert (Cornell)
JSM, August 2010

Regression with Functional Predictors
- Our application: relating sleep patterns, as measured using electroencephalograms (EEG), to health outcomes such as cardiovascular health indicators.
- Other applications: estimating chemical variables from spectroscopic data; relating diffusion tensor images to multiple sclerosis.
- Most existing methods make strong linearity and additivity assumptions.
- They fail to account for events that occur at variable times, such as sleep transitions in the EEG data.

Regression with Functional Predictors
Our method (HARK: Hierarchical Adaptive Regression Kernels):
- Represent the functional predictor with a nonparametric kernel mixture model. This representation is parsimonious and interpretable, and it captures features such as spikes, bumps, and dips whose frequency, location, and size vary across subjects.
- Regress the outcome on summaries of this representation, e.g. the frequency of bumps, or their average height or width.
- Joint inference on functional representations and regression parameters.

Regression with Functional Predictors
Advantages:
- Does not require alignment of functions or observation locations, or a common domain
- Naturally handles missing, co-located data

Existing Work
Most existing methods relate the outcome to a finite set of coefficients from a basis function representation of the predictor:
- Principal component scores: Cardot et al. (2003); Müller and Stadtmüller (2005)
- Spline coefficients: James (2002)
- Fourier coefficients: Ramsay and Silverman (2005)
- Partial least squares coefficients: Goutis and Fearn (1996); Reiss and Ogden (2007)

Existing Work
These methods assume that the expected response $Y_i$ is linear and additive in the functional predictor $f_i(x)$ at each location $x$:
$$E(Y_i) = \int f_i(x)\,\beta(x)\,dx$$
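To make the linear functional model concrete, the integral $\int f_i(x)\beta(x)\,dx$ can be approximated on a grid. The sketch below uses the trapezoid rule; the function name `expected_response`, the grid, and the example choices of $f$ and $\beta$ are illustrative assumptions, not part of the talk.

```python
def expected_response(x, f, beta):
    """Trapezoid-rule approximation of the integral of f(x) * beta(x) over the grid x."""
    total = 0.0
    for k in range(len(x) - 1):
        left = f[k] * beta[k]
        right = f[k + 1] * beta[k + 1]
        total += 0.5 * (left + right) * (x[k + 1] - x[k])
    return total


# Hypothetical example: f(x) = 1 and beta(x) = x on [0, 1], so the integral is 1/2.
grid = [k / 100 for k in range(101)]
f_vals = [1.0 for _ in grid]
beta_vals = [xk for xk in grid]
approx = expected_response(grid, f_vals, beta_vals)
```

Because the model weights $f_i$ by a fixed $\beta(x)$ at each location, a feature that moves along $x$ from subject to subject contributes differently each time, which is the limitation the slides go on to discuss.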

Motivating Simulation
For subjects $i$, generate noisy observations from a function $f_i(x)$ having a single blip at random time $\mu_i \in [0, 100]$, with random amplitude $\gamma_i \in [5, 20]$:
[Figure: two panels, Subj. i = 1 and Subj. i = 2, showing W and f_i(x) against Time x from 0 to 100.]
Take the outcome to be $\gamma_i$. Try to detect this relationship between predictor $f_i(x)$ and outcome $\gamma_i$ using (a) HARK; (b) principal component regression.
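A data set of this kind could be generated roughly as follows. The slide does not give the blip shape, the sampling distributions, the grid, or the noise level, so the Gaussian-shaped blip, the uniform draws for $\mu_i$ and $\gamma_i$, and the parameters `n_obs`, `blip_sd`, and `noise_sd` are all assumptions made for illustration.

```python
import math
import random


def simulate_subject(rng, n_obs=101, blip_sd=3.0, noise_sd=2.0):
    """Simulate one subject: a single blip at a random time with a random amplitude.

    Assumed (not from the talk): uniform draws, Gaussian-shaped blip,
    equally spaced grid on [0, 100], Gaussian observation noise.
    """
    mu = rng.uniform(0.0, 100.0)    # blip time in [0, 100]
    gamma = rng.uniform(5.0, 20.0)  # blip amplitude in [5, 20]
    xs = [100.0 * k / (n_obs - 1) for k in range(n_obs)]
    f = [gamma * math.exp(-0.5 * ((x - mu) / blip_sd) ** 2) for x in xs]
    w = [fx + rng.gauss(0.0, noise_sd) for fx in f]  # noisy observations W
    return {"mu": mu, "gamma": gamma, "x": xs, "f": f, "w": w}


rng = random.Random(0)
subjects = [simulate_subject(rng) for _ in range(20)]
outcomes = [s["gamma"] for s in subjects]  # the outcome is the amplitude gamma_i
```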

Motivating Simulation
HARK effectively captures the relationship between predictor & outcome, even for the smallest sample sizes:
- Represents the predictor using a Gaussian kernel mixture
- Finds that the average magnitude of the mixture components is positively correlated with the outcome

Motivating Simulation
Principal component regression: PC functions are difficult to interpret:
[Figure: PC 1 and PC 10 plotted against Time x from 0 to 100.]

Motivating Simulation
Principal component regression: the estimated regression coefficient function $\beta(x)$ is hard to interpret:
[Figure: beta(x) plotted against Time x from 0 to 100.]
(Recall $E(Y_i) = \int f_i(x)\,\beta(x)\,dx$.)

HARK Model: Function Representation
Nonparametric functional data model for subject $i$:
- Noisy observations $W_i(x_{ik})$ of a functional predictor $f_i(x)$ at locations $x_{ik} \in X_i$:
  $$W_i(x_{ik}) \overset{\text{ind.}}{\sim} N\big(f_i(x_{ik}), \tau_i^2\big)$$
- Kernel mixture model for $f_i(\cdot)$:
  $$f_i(x) = \beta_{0i} + \sum_{m=1}^{M_i} \gamma_{im} K(x, s_{im})$$
  where $K(x, s)$ is a specified kernel function on $X_i \times S$ and the parameters of the kernel are defined on $S$.

HARK Model: Function Representation
$$f_i(x) = \beta_{0i} + \sum_{m=1}^{M_i} \gamma_{im} K(x, s_{im})$$
- $M_i$: number of mixture components
- $\gamma_{im} \in \mathbb{R}$ and $s_{im} \in S$: magnitudes and parameter vectors of the mixture components
- E.g. for $K$ a Gaussian kernel, $s_{im} = (\mu_{im}, \sigma^2_{im})$.
- Scaling and other parameters can vary between components, adapting to the local features of $f_i(\cdot)$
- Sparsity is induced through the priors on $M_i$, $\gamma_{im}$, and $s_{im}$.
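Evaluating the kernel mixture representation is straightforward once the components are known. The sketch below assumes an unnormalized Gaussian kernel $K(x, (\mu, \sigma^2)) = \exp\{-(x-\mu)^2 / (2\sigma^2)\}$; the talk does not state the normalization, and the function names and example components are hypothetical.

```python
import math


def gaussian_kernel(x, s):
    """Unnormalized Gaussian kernel with s = (mu, sigma2); the normalization is an assumption."""
    mu, sigma2 = s
    return math.exp(-((x - mu) ** 2) / (2.0 * sigma2))


def kernel_mixture(x, beta0, components):
    """f(x) = beta0 + sum over m of gamma_m * K(x, s_m), with components given as (gamma_m, s_m) pairs."""
    return beta0 + sum(g * gaussian_kernel(x, s) for g, s in components)


# Hypothetical subject: a narrow bump of height 10 near x = 30 and a wider dip of depth 3 near x = 70.
components = [(10.0, (30.0, 4.0)), (-3.0, (70.0, 25.0))]
f_at_peak = kernel_mixture(30.0, 1.0, components)
f_far = kernel_mixture(0.0, 1.0, components)
```

Each component has its own width, so a narrow spike and a broad dip can sit in the same function without a shared smoothing parameter.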

HARK Model: Function Representation
Illustration for a test function:
[Figure: three panels plotted against x from 0.0 to 1.0: test data; estimated and true function; mixture representation (one posterior sample).]

HARK Model: Regression Model
- Function representation is $\omega_i = \big(\beta_{0i}, \tau_i^2, \{(\gamma_{im}, s_{im})\}_{m=1}^{M_i}\big)$
- Define a vector $\theta(\omega_i)$ of summaries of $\omega_i$. E.g. when $s_{im} = (\mu_{im}, \sigma^2_{im})$:
  $$\theta(\omega_i) = \big(1, \beta_{0i}, \tau_i^2, M_i, \bar{\gamma}_i, \bar{\sigma}^2_i\big)$$
  where $\bar{\gamma}_i = \sum_{m=1}^{M_i} \gamma_{im} / M_i$ and $\bar{\sigma}^2_i = \sum_{m=1}^{M_i} \sigma^2_{im} / M_i$.
- Linear regression model for the outcome $Y_i$ given $\theta_i = \theta(\omega_i)$:
  $$Y_i \overset{\text{ind.}}{\sim} N\big(\theta_i^\top \eta, \psi^2\big)$$
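The summary vector and the regression mean can be sketched directly from these definitions. The function names, the example representation `omega`, and the coefficient vector `eta` are hypothetical, and the code assumes at least one mixture component so that the averages are defined.

```python
def summaries(beta0, tau2, components):
    """theta(omega) = (1, beta0, tau2, M, mean of gamma_m, mean of sigma2_m).

    components is a list of (gamma_m, (mu_m, sigma2_m)) pairs; requires M >= 1.
    """
    m = len(components)
    gamma_bar = sum(g for g, _ in components) / m
    sigma2_bar = sum(s[1] for _, s in components) / m
    return [1.0, beta0, tau2, float(m), gamma_bar, sigma2_bar]


def regression_mean(theta, eta):
    """Mean of the outcome under the linear model: the inner product of theta and eta."""
    return sum(t * e for t, e in zip(theta, eta))


# Hypothetical function representation with two components.
omega = {
    "beta0": 0.5,
    "tau2": 0.25,
    "components": [(2.0, (10.0, 1.0)), (4.0, (50.0, 3.0))],
}
theta = summaries(omega["beta0"], omega["tau2"], omega["components"])
eta = [1.0, 0.0, 0.0, 0.5, 2.0, 0.0]  # made-up coefficients for illustration
mean_y = regression_mean(theta, eta)
```

The component locations $\mu_{im}$ do not enter this particular $\theta$, which is why the regression does not depend on where along $x$ a feature occurs.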

HARK Model: Regression Model
- Joint estimation of function representations $\omega_i$ and regression parameters $\eta$, $\psi^2$.
- Computation is via reversible jump Markov chain Monte Carlo (Green 1995), using an approximation of the posterior distribution obtained via modularization (Liu, Bayarri, & Berger 2009).
- Computation time increases linearly in the number of subjects and is parallelizable.

Results for the Sleep Data
Relate EEG time series obtained during sleep to respiratory distress index (RDI) and body mass index (BMI); 6,000+ subjects.
[Figure: EEG series for Subjects 1 to 6, delta power against Time (hours) from 0 to 4, with penalized spline estimates.]
- RDI is a measure of sleep apnea.
- The number and timing of fluctuations vary across subjects $i$.

Results for the Sleep Data
Posterior mean estimates of $f_i(\cdot)$ from HARK (solid curve) are similar to penalized spline estimates (dashed curve):
[Figure: logit(delta power) against Time (hours) from 0 to 4 for Subject A and Subject B.]

Results for the Sleep Data
Kernel mixture representation (solid curves) of $f_i(\cdot)$ from a single HARK posterior sample:
[Figure: mixture components from MCMC iteration #9607, logit(δ-power) against Time (hours) from 0 to 4 for Subject A and Subject B.]
The horizontal line is $\beta_{0i}$; mixture components deviate from this line. Dashed curve: $f_i$.

Results for the Sleep Data
Regression coefficient estimates from HARK:

Outcome          Predictor                Coef. Est.   95% Posterior Int.
log(RDI + 0.5)   $\beta_{0i}$             -0.210       (-0.304, -0.117)
                 $M_i$                    -0.058       (-0.096, -0.020)
                 $\bar{\gamma}_i^{1/2}$   -0.835       (-1.279, -0.401)
log BMI          $\beta_{0i}$             -0.026       (-0.039, -0.012)
                 $\log \tau_i^2$          -0.041       (-0.073, -0.009)

Results for the Sleep Data
- RDI & BMI are negatively associated with average δ-power
- Subjects with higher RDI tend to have fewer and less pronounced fluctuations in δ-power, a measure of slow neuronal firing (RDI negatively associated with $M_i$ and $\bar{\gamma}_i$)
- Subjects with higher BMI have less measurement error in δ-power (reasonable since EEG measurement error is affected by skin properties, perspiration)

Conclusions
- Introduced a method for regression with functional predictors, using a parsimonious, interpretable function representation
- More effective and efficient than existing methods for data that include features occurring at varying locations
- Applied HARK to find important relationships between sleep characteristics and health outcomes. Large and complex dataset!
A copy of this paper and seminar are available at: http://people.orie.cornell.edu/woodard

Supplementary Material
The following slides contain supplementary material.

Motivating Simulation
Principal component regression: smoothed $\beta(x)$ function:
[Figure: smoothed beta(x) plotted against Time from 0 to 100.]

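HARK Model
Typical prior for the functional data model:
$$M_i \sim \mathrm{Pois}(\lambda)$$
$$\gamma_{im} \mid M_i \overset{\text{ind.}}{\sim} \text{Symmetric Gamma}(\alpha, \rho)$$
$$\mu_{im} \mid M_i \overset{\text{ind.}}{\sim} \mathrm{Unif}(X_i)$$
$$\sigma^2_{im} \mid M_i \overset{\text{ind.}}{\sim} \mathrm{IG}(\alpha_\sigma, \rho_\sigma)$$
See e.g. Wolpert, Clyde, & Tu (2010)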
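A draw from a prior of this form could be sketched as below. Several details are assumptions, since the slide does not define them: "Symmetric Gamma" is read here as a Gamma-distributed magnitude with a random sign (one possible reading), $\rho$ and $\rho_\sigma$ are treated as rate and scale parameters respectively, $X_i$ is taken to be an interval, and all function names and hyperparameter values are hypothetical.

```python
import math
import random


def sample_poisson(rng, lam):
    """Poisson draw by Knuth's multiplication method; adequate for small lam."""
    threshold = math.exp(-lam)
    count, product = 0, rng.random()
    while product > threshold:
        count += 1
        product *= rng.random()
    return count


def sample_prior(rng, lam, alpha, rho, x_lo, x_hi, alpha_s, rho_s):
    """Draw a list of (gamma_m, (mu_m, sigma2_m)) components from the assumed prior."""
    m = sample_poisson(rng, lam)
    components = []
    for _ in range(m):
        # Assumed reading of "symmetric Gamma": Gamma(shape alpha, rate rho) magnitude, random sign.
        sign = rng.choice([-1.0, 1.0])
        gamma = sign * rng.gammavariate(alpha, 1.0 / rho)
        mu = rng.uniform(x_lo, x_hi)
        # Inverse gamma: reciprocal of a Gamma(shape alpha_s, rate rho_s) draw.
        sigma2 = 1.0 / rng.gammavariate(alpha_s, 1.0 / rho_s)
        components.append((gamma, (mu, sigma2)))
    return components


rng = random.Random(1)
draws = [sample_prior(rng, 3.0, 2.0, 0.5, 0.0, 100.0, 3.0, 10.0) for _ in range(200)]
```

Note that `random.gammavariate` takes a shape and a scale, so a rate parameter is passed as its reciprocal.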