Statistical analysis supporting the development of the guidance on dermal absorption

Similar documents
Changes to the guidance on dermal absorption

Update ofefsa Guidanceon dermal absorption. A fewcase studies by a Member State

EFSA GD on dermal absorption Industry feedback and considerations on bridging opportunities

BfR Proposal for a Harmonised Procedure for Estimating the Dermal Absorption

Unit 1 Exploring and Understanding Data

Statistics: A Brief Overview Part I. Katherine Shaver, M.S. Biostatistician Carilion Clinic

Guidance on Dermal Absorption

Dermal Technology Laboratory Ltd

WORKSHOP. Understanding Formulation Design and Dermal Absorption: Technical, Scientific and Regulatory perspectives

Stepwise method Modern Model Selection Methods Quantile-Quantile plot and tests for normality

WDHS Curriculum Map Probability and Statistics. What is Statistics and how does it relate to you?

Guidance on dermal absorption

OpenFoodTox and Other Open Source In silico EFSA. Jean Lou Dorne Senior Scientific Officer Scientific Committee and Emerging Risks Unit EFSA

Risk assessment of mycotoxins: the EFSA approach. Katleen Baert Scientific officer, EFSA

Motivation Empirical models Data and methodology Results Discussion. University of York. University of York

Using Statistical Intervals to Assess System Performance Best Practice

Toxicity of mineral oil hydrocarbons

FINAL. Recommendations for Update to Arsenic Soil CTL Computation. Methodology Focus Group. Contaminated Soils Forum. Prepared by:

STATISTICS & PROBABILITY

Where does "analysis" enter the experimental process?

Observational studies; descriptive statistics

Development of NJ Human Health-based Criteria and Standards

RNA-seq. Differential analysis

2.75: 84% 2.5: 80% 2.25: 78% 2: 74% 1.75: 70% 1.5: 66% 1.25: 64% 1.0: 60% 0.5: 50% 0.25: 25% 0: 0%

Scientific Opinion of the Panel on Plant Protection Products and their Residues (PPR-Panel) (Question No EFSA-Q ) Adopted on 12 December 2007

Assignment #6. Chapter 10: 14, 15 Chapter 11: 14, 18. Due tomorrow Nov. 6 th by 2pm in your TA s homework box

RAG Rating Indicator Values

Introduction to diagnostic accuracy meta-analysis. Yemisi Takwoingi October 2015

Seeing the Forest & Trees: novel analyses of complex interventions to guide health system decision making

Why Mixed Effects Models?

Clinical Trials A Practical Guide to Design, Analysis, and Reporting

Classical Psychophysical Methods (cont.)

EFSA s Reputation Barometer

AP Statistics. Semester One Review Part 1 Chapters 1-5

The QUOROM Statement: revised recommendations for improving the quality of reports of systematic reviews

Score Tests of Normality in Bivariate Probit Models

Lecture 21. RNA-seq: Advanced analysis

Emerging Risks Mapping of Activities in Member States. 67th Advisory Forum meeting, Utrecht, The Netherlands, 6 February 2018

investigate. educate. inform.

you-try-it-02.xlsx Step-by-Step Guide ver. 8/26/2009

Analysis of TB prevalence surveys

Graphical display of population data

Estimating Lung Cancer Deaths in Thailand based on the 2005 Verbal Autopsy Study

Normal Distribution. Many variables are nearly normal, but none are exactly normal Not perfect, but still useful for a variety of problems.

Models for potentially biased evidence in meta-analysis using empirically based priors

The decision on actual implementation depends on the results of a further consequence analysis.

War and Relatedness Enrico Spolaore and Romain Wacziarg September 2015

ANOVA in SPSS (Practical)

Statistical reports Regression, 2010

Propensity Score Methods for Causal Inference with the PSMATCH Procedure

Mantel-Haenszel Procedures for Detecting Differential Item Functioning

Business Statistics Probability

Russian Journal of Agricultural and Socio-Economic Sciences, 3(15)

EU Risk Assessment Agenda Research and collaboration

Dr. Kelly Bradley Final Exam Summer {2 points} Name

UN Handbook Ch. 7 'Managing sources of non-sampling error': recommendations on response rates

One-Way Independent ANOVA

Standard Scores. Richard S. Balkin, Ph.D., LPC-S, NCC

GE Standard V: Scientific Standard

EFSA Emerging Risks Activities. Ana Afonso Scientific Committee and Emerging Risks Unit ECVPH Annual Conference 2-4 October Liege, Belgium

Specific aspects for PNEC derivation for metals. P. Van Sprang, F. Verdonck, M. Vangheluwe

How to analyze correlated and longitudinal data?

What are the challenges in addressing adjustments for data uncertainty?

SUPPLEMENTAL MATERIAL

Daniel Boduszek University of Huddersfield

Summary & Conclusion. Lecture 10 Survey Research & Design in Psychology James Neill, 2016 Creative Commons Attribution 4.0

Clever Hans the horse could do simple math and spell out the answers to simple questions. He wasn t always correct, but he was most of the time.

CHAMP: CHecklist for the Appraisal of Moderators and Predictors

Biostatistics for Med Students. Lecture 1

Research Methods. Donald H. McBurney. State University of New York Upstate Medical University Le Moyne College

Empirical assessment of univariate and bivariate meta-analyses for comparing the accuracy of diagnostic tests

Analysis of Environmental Data Conceptual Foundations: En viro n m e n tal Data

Action Levels and Allergen Thresholds What they will mean for the Food Industry Dr. Rachel WARD r.ward consultancy limited

LAB ASSIGNMENT 4 INFERENCES FOR NUMERICAL DATA. Comparison of Cancer Survival*

Functional Properties of Foods. Database and Model Prediction

CARISMA-LMS Workshop on Statistics for Risk Analysis

Lesson 9 Presentation and Display of Quantitative Data

From single studies to an EBM based assessment some central issues

Psychology Research Excellence Framework 2014 Impact Analysis

Performance of Median and Least Squares Regression for Slightly Skewed Data

Missing data. Patrick Breheny. April 23. Introduction Missing response data Missing covariate data

Statistical analysis of comparative data on composition and agronomic characteristics: new software tool and recurring issues identified

Biostat/Stat 571: Coursework 5

How to interpret scientific & statistical graphs

Chapter 1: Exploring Data

Ecological Statistics

Quantitative Methods in Managment. An introduction to GLMs and measurement theory

Search e Fall /18/15

What you should know before you collect data. BAE 815 (Fall 2017) Dr. Zifei Liu

Statistical analysis DIANA SAPLACAN 2017 * SLIDES ADAPTED BASED ON LECTURE NOTES BY ALMA LEORA CULEN

Evaluating behavior change interventions in terms of their component techniques. Susan Michie

Critical Review Form Clinical Decision Analysis

Dan Byrd UC Office of the President

Size Matters: the Structural Effect of Social Context

Technical Monograph n 17, 2nd Edition. Guidelines for Specifying the Shelf Life of Plant Protection Products

Statistical techniques to evaluate the agreement degree of medicine measurements

Assessing Agreement Between Methods Of Clinical Measurement

RNA-seq. Design of experiments

Quizzes (and relevant lab exercises): 20% Midterm exams (2): 25% each Final exam: 30%

11/24/2017. Do not imply a cause-and-effect relationship

Transcription:

Statistical analysis supporting the development of the guidance on dermal absorption Peter Craig (Gilles Guillot, EFSA) Durham University United Kingdom 27 September, 2017

Variability of percentage absorbed Highly skewed distribution Lots of replicates near 0% absorption Potentially tricky to use as response variable in regression 2

Variability within blocks: raw data Each point in the figures summarises a block of replicates: an in vitro study 75% of blocks have positive skewness Transformation needed Strong relationship in @location-scale plot 3

Logit transform The logit transform stretches out values near 0 or near 1 Transformed data can be plotted with a logit scale 4

Variability within blocks: logit transform Each point in the figures summarises a block of replicates: an in vitro study No tendency to positive or negative skewness Little relationship in location-scale plot Transformation successful 5

Data transformations Summary of conclusions Logit transform removes issue of skewness and of lack of homogeneity at different levels of absorption Logit-transformed data more suitable than raw data for use of standard statistical procedures, including confidence intervals and regression Other transformations were investigated but found to be less satisfactory: probit, arc-sine, log Existence of measurements with 0% absorption means that logit-transform needs a small adjustment to include those data, for example in regression modelling. 6

GD5.3: Variability within results EFSA 2012 GD and OECD 2011 guidance notes: If variability low (standard deviation <= 25% of mean), use mean of replicates in risk assessment If variability high (standard deviation > 25% of mean): EFSA 2012: preferred option is to add one standard deviation to the mean. OECD 2011: consider using maximum measurement if n=4 and add one standard deviation to the mean when n>4 Making allowance for uncertainty, due to finite sample size, about mean absorption. 7

GD5.3: Variability within results In the database, 85 % of blocks have standard deviation > 25% of mean: variability is usually high. EFSA 2017 GD: Use approximate upper (97.5%) confidence limit for mean to allow for uncertainty due to finite sample size: where and are the mean and standard deviation of the data and depends on the number of replicates (standard confidence interval calculation). Rewards higher number of replicates by reducing multiple of standard deviation as sample size increases. Same as EFSA 2012 GD when n=6. Would theoretically be better to calculate confidence limit using logit-transformed data but was considered unnecessarily complicated and empirical performance was not very different. 8

GD5.3: Variability within results Comparison of outcomes using raw data and transformed data upper confidence limits Transformed data upper confidence limit typically a little higher for n=4. For n>4, very little difference in most cases (blue lines show +/- 10%) 9

GD5.3: Variability within results Comparison of outcome using raw data upper confidence limit to outcome using sample maximum Sample maximum does not allow sufficiently for uncertainty when n=4 10

Pro-rata extrapolation Each point represents a case where a substance was tested in the same formulation at different dilutions in the same study Red line: pro-rata extrapolation Blue lines: Aggarwal et al (2014,2015) Pink/blue points: section 4.7 of Aggarwal (2014) 11

Explaining absorption variability Many potential explanatory variables but a lot of confounding. Two statistical modelling approaches used to try to understand which explanatory variables are more important: Beta-regression (special regression methodology directly addressing the range limitation for absorption without transforming data). Explored all possible combinations of explanatory variables (without interactions). Random-effects regression for logit-transformed absorption percentage. 12

Explaining absorption variability Explanatory variables: Physico-chemical properties (active substances) and formulation (product) Skin sample Experimental conditions Active substance code, MW, log P ow, water solubility, formulation type, concentration tested, concentration status (concentrate/dilution) Skin type, sex, age, donor, skin source, skin region Exposure duration, chamber type, mass balance recovery, receptor medium MW: molecular weight 13

Explaining absorption: beta-regression Number of Change in explanatory Best sub-model DIC variables 1 FormTypeConcDil - 2 logmolconc + AScode 3122 3 logmolconc + ReceptMedium + 314 AScode 4 logmolconc + Duration + 174 ReceptMedium + AScode 5 FormTypeConcDil + logmolconc + 111 Duration + ReceptMedium + AScode 6 86 7 35 8 29 9 28 10 24 14

Explaining absorption: beta-regression Number of explanatory variables Best sub-model without AScode and Receptor Medium Change in DIC 1 FormTypeConcDil - 2 FormTypeConcDil + Duration 306 3 FormTypeConcDil + SkinType + 108 Duration 4 FormTypeConcDil + SkinType + Age 99 + Duration 5 FormTypeConcDil + logmolconc + 82 SkinType + Age + Duration 6 logmw + logpow + FormTypeConcDil 46 + logmolconc + SkinType + Duration 7 107 8 22 9 11 10-1.5 15

Conclusions from beta-regression Strong support for distinguishing between formulation groups and for distinguishing concentrates from dilutions Evidence that many other variables have some explanatory capability BUT that the dominant explanatory variable is active substance, i.e. the level of absorption varies greatly between substances. In particular physical/chemical properties of substances did not easily explain the same amount of variation. Need to find some way to directly address variability between substances Turned away from beta-regression due to issues with model fit, limitations of software used to fit models and difficulties with predictive calculations 16

Explaining absorption: logit-regression Difference between concentrates and dilutions but not much evidence of other dependence on concentration 17

Explaining absorption: logit-regression Consistent difference between concentrates and dilutions and differences between formulation groups 18

Explaining absorption: logit-regression Analyses of variance: Source of variation % of variation explained All data Data with donor ID Dilution/concentrate 33 32 Formulation group 3 3 Active substance 28 28 Active substance specific 7 7 dilution/concentrate Study (on same substance) 9 9 Different concentrations and formulations within study 5 5 Replicates: between donors 10 15 Replicates: within donors 5 19

Explaining absorption: logit-regression More about Analysis of Variance results A variety of analyses were performed, adding explanatory variables in different sequences Confirmed the conclusions of the beta-regression modelling In particular, dilution/concentrate, formulation group and active substance dominate. Other potential explanatory variables explain relatively little variation if added after active substance. This includes the data-set origin: ECPA versus BfR which is substantially confounded with active substance. 20

EFSA GD (2012): special case 10% DV? EFSA 2012 GD: if log Pow < -1 or > 4 and MW > 500 a default dermal absorption value of 10% may be applied. EFSA 2017 GD: no evidence found in the data to support this special case. Points coloured blue in figures are those for which log Pow < -1 or > 4 and MW > 500: 21

Towards DVs: variability & uncertainty Variability: Default values might be based on estimating a specified percentile of variability. Choosing which percentile to use is a risk management decision. Uncertainty: All statistical estimates are subject to uncertainty. Uncertainty should be taken into account when setting default values. How much allowance to make for uncertainty is a risk management decision. 22

Towards DVs: empirical quantiles Example of table of empirical 95 th percentiles and approximate upper confidence limits Formulation category Organic solvent + other Water based + solid Organic solvent + other Water based + solid Dilution/ concentrate Sample size Data 95 th percentile UCL for percentile Concentrate 1284 18 20 Concentrate 1544 8 9 Dilution 1658 51 55 Dilution 2277 45 48 23

Towards DVs: empirical quantiles Merits of empirical quantiles Easily understood Relatively easy to implement Weaknesses: Variability: All sources are included. Inter-, intra-human should be excluded, perhaps more. Each measurement in the dataset gets equal weight. No allowance for hierarchical structure: varying numbers of studies, experimental conditions and replicates 95 th %ile of what? Makes RM decision tricky Uncertainty: UCL method used is designed for random sample from homogeneous population not a hierarchical dataset. How is uncertainty being addressed? 24

Towards DVs: random effects modelling Random effects logit-regression: Active substance effects seen as sampled from population of substance effects. Model statistically as random variability modelled by normal distribution standard deviation of distribution controls amount of variation between substances unexplained variation between substances also modelled as random variability when included in model Basis for default values: Effect for substance without data for risk assessment is treated as random value sampled from the distribution. Risk management decision: what percentile of variation of substances to use Statistical meaning of effect 25

Towards DVs: random effects modelling Pink rectangle highlights range of uncertainty about the 95th percentile of variability (95% credible interval) Normal Normal distribution distribution modelling modelling all only (too much) variability variability directly attributable to substances Red line shows 95th percentile of intersubstance variability Estimates are subject to uncertainty 26

Towards DVs: random effects modelling A decision must be made about which sources of variation to include in the random effects part of the model: A or A+B or A+B+C Source of variation Dilution/concentrate Formulation group Active substance Active substance specific dilution/concentrate Study (on same substance) Different concentrations/formulations in study Replicates: between donors Replicates: within donors Effects group Fixed effects A B C Not included 27

Towards DVs: random effects modelling Example of table of estimated 95% percentiles and upper credible limits Sources of variation to include A Form n category Dilut n/ conc. Org. solv. + Conc. other H2O based Conc. + solid Org. solv. + Dilution other H2O based Dilution + solid Est. 95% A+B UCL Est. 95% A+B+C UCL Est. 95% UCL 5 7 8 10 11 13 3 4 4 6 6 8 36 44 47 54 56 62 23 29 32 38 40 46 28

Towards DVs: random effects modelling Merits: addresses all identified weaknesses of the empirical approach Weakness: dependent on choice of statistical model. In particular, the model used: assumes that random effects are homogeneous across the 8 (or 4) categories whereas the empirical approach treats each category separately; makes distributional assumptions which may not be valid: random effects and replicate variation are both assumed to be normally distributed. Overall: Some identified modelling approach weaknesses could be addressed by further statistical analysis, whereas the empirical approach weaknesses are difficult to overcome. 29

THANK YOU FOR YOUR ATTENTION Subscribe to www.efsa.europa.eu/en/news/newsletters www.efsa.europa.eu/en/rss Engage with careers www.efsa.europa.eu/en/engage/careers Follow us on Twitter @efsa_eu @plants_efsa @methods_efsa 30