Uncertainty Characterization: The Role of Hypothesis-Based Weight of Evidence"

Similar documents
THEORY DEVELOPMENT PROCESS

DEVELOPMENT OF THE CHROMIUM PUBLIC HEALTH GOAL

Application of human epidemiological studies to pesticide risk assessment

Durkheim. Durkheim s fundamental task in Rules of the Sociological Method is to lay out

European Food Safety Authority (EFSA)

Methodologies for development of human health criteria and values for the lake Erie drainage basin.

Approaches to Integrating Evidence From Animal and Human Studies in Chemical Assessments

9/5/ Research Hazards Awareness Training

Validity Arguments for Alternate Assessment Systems

DISCUSSION GROUP 3. Mechanism of carcinogenicity. EFSA Scientific Colloquium on Acrylamide carcinogenicity, 22/23 May

Applying GRADE-CERQual to qualitative evidence synthesis findings paper 4: how to assess coherence

Dose Response Approaches for Nuclear Receptor Mediated Modes of Action Workshop Preliminary Report

Threshold-Based Risk Assessment is the Same for Cancer and Non-cancer Endpoints for Non-DNA Reactive Carcinogens

School of Nursing, University of British Columbia Vancouver, British Columbia, Canada

Distinguishing between Mode and Mechanism of Action

Comments DRAFT Preamble to the IARC Monograph. Health and Safety Department International Union, UAW 8000 East Jefferson Avenue Detroit, MI 48214

What is good qualitative research?

Hexavalent Chromium Oral Reference Dose

Addendum to the 12th Report on Carcinogens

Introduction to the Scientific Method. Knowledge and Methods. Methods for gathering knowledge. method of obstinacy

EFSA Scientific Colloquium No. 17, June 2012, Parma, Italy. Dieter Schrenk Food Chemistry and Toxicology University of Kaiserslautern 2012

Tetrachloroethylene: Integrated Risk Information System (IRIS) Draft Toxicological Review Kate Z. Guyton, PhD DABT

Insight Assessment Measuring Thinking Worldwide

Discussion of Changes in the Draft Preamble

Critical Thinking Rubric. 1. The student will demonstrate the ability to interpret information. Apprentice Level 5 6. graphics, questions, etc.

This article, the last in a 4-part series on philosophical problems

The idea of an essentially contested concept is incoherent.

Cognitive domain: Comprehension Answer location: Elements of Empiricism Question type: MC

6. A theory that has been substantially verified is sometimes called a a. law. b. model.

Audio: In this lecture we are going to address psychology as a science. Slide #2

Work No. Rater Initials Average Score

PHTHALATES STAKEHOLDER WORKSHOP SUMMARY REPORT

Improving the U.S. Public s Understanding of Climate Change

Overview and Comparisons of Risk of Bias and Strength of Evidence Assessment Tools: Opportunities and Challenges of Application in Developing DRIs

Child Mental Health: A Review of the Scientific Discourse

Received: 20 March 2014, Revised: 11 June 2014, Accepted: 11 June 2014 Published online in Wiley Online Library: 25 July 2014

Are tumor incidence rates from chronic bioassays telling us what we need to know about carcinogens?

Part 2. Chemical and physical aspects

Guide to Rating Critical & Integrative Thinking for eportfolios Washington State University, Fall 2006

STUDIES TO EVALUATE THE SAFETY OF RESIDUES OF VETERINARY DRUGS IN HUMAN FOOD: GENERAL APPROACH TO ESTABLISH AN ACUTE REFERENCE DOSE

Consideration of Reproductive/Developmental Mode of Action Data from Laboratory Animals in the Risk Assessment of Environmental Chemicals

Evaluation of Ramazzini Institute Aspartame Studies and EFSA s Assessment

Key Aspects of U.S. EPA s External Review Draft Toxicological Review of Trichloroethylene (TCE)

GUIDELINE COMPARATORS & COMPARISONS:

UNCORRECTED PROOF. Daniel Steel / Across the Boundaries 01-DanielSteel-chap01 Page Proof page :48pm ACROSS THE BOUNDARIES

Robert G. Sussman, Ph.D., DABT Managing Principal, Eastern Operations. SafeBridge Consultants, Inc. Mountain View, CA New York, NY Liverpool, UK

Qualitative Data Analysis. Richard Boateng, PhD. Arguments with Qualitative Data. Office: UGBS RT18 (rooftop)

Prediction, Causation, and Interpretation in Social Science. Duncan Watts Microsoft Research

III.B. 4. A Delphi Evaluation of Agreement between Organizations

POLI 343 Introduction to Political Research

Chapter 02 Developing and Evaluating Theories of Behavior

How was your experience working in a group on the Literature Review?

Definition and Purpose

2 Critical thinking guidelines

EFSA working group on BPA assessment protocol. Ursula Gundert-Remy Chair of the EFSA Working Group BPA assessment Protocol

- draft scientific opinion -

International Conference on Harmonisation of Technical Requirements for Registration of Pharmaceuticals for Human Use

(an intro to AP Biology)

Quantitative Risk Assessment: An Overview and Discussion of Emerging Issues. Anne-Marie Nicol, PhD

FAQs on bisphenol A in consumer products

Lesson 1 Understanding Science

Texas Commission on Environmental Quality

Framework for Causation Analysis of Toxic Exposure Claims. CLCW SME Training August, 2017

Considerations in Toxicology Study Design and Interpretation: An Overview Gradient Corporation: Lewis, AS; Beyer, LA; Langlois, CJ; Yu, CJ; Wait, AD

Thought Starter Combined Exposures to Multiple Chemicals Second International Conference on Risk Assessment

The role of biokinetics in in vitro tests and the interpretation of results. Emanuela Testai

How Does Analysis of Competing Hypotheses (ACH) Improve Intelligence Analysis?

FÜR RISIKOBEWERTUNG BUNDESINSTITUT

Applying the Experimental Paradigm to Software Engineering

TRANSLATING RESEARCH INTO PRACTICE

A Weight of Evidence Approach to Cancer Assessment. Alan R Boobis Imperial College London

A Brief Introduction to Bayesian Statistics

Chapter Three: Hypothesis

Recent developments for combining evidence within evidence streams: bias-adjusted meta-analysis

Introduction to Applied Research in Economics

Critical Thinking and Reading Lecture 15

RISK21: Q-KEDRF. ILSI Health and Environmental Sciences Institute

Gather essential and accurate information about the patient (Patient Care)

Introduction to Research. Ways of Knowing. Tenacity 8/31/10. Tenacity Intuition Authority. Reasoning (Rationalism) Observation (Empiricism) Science

Safeguarding public health Subgroup analyses scene setting from the EU regulators perspective

Narrative overviews. Lesley Rushton Epidemiology and Public Health

Maisons-Alfort, 10 January 2007 OPINION

Epidemiology 101. Nutritional Epidemiology Methods and Interpretation Criteria

an invitation to philosophy of science

Funnelling Used to describe a process of narrowing down of focus within a literature review. So, the writer begins with a broad discussion providing b

ETHYLENE OXIDE: AN UPDATE

Understanding noninferiority trials

Optimal Design for in Vivo Mutation Studies to Inform Cancer Mode-of- Action Assessment

Checklist for Randomized Controlled Trials. The Joanna Briggs Institute Critical Appraisal tools for use in JBI Systematic Reviews

Contribution to the European Commission s Public Consultation on. 15 January 2015

Research Approach & Design. Awatif Alam MBBS, Msc (Toronto),ABCM Professor Community Medicine Vice Provost Girls Section

! Parallels with clinical studies.! Two (of a number of) concerns about data from trials.! Concluding comments

CHARACTERIZING THE IMPACTS OF UNCERTAINTY AND SCIENTIFIC JUDGMENT IN EXPOSURE LIMIT DEVELOPMENT

OPINION of the French Agency for Environmental and Occupational Health Safety

Overview of Study Designs in Clinical Research

Cambridge International Examinations Cambridge International Advanced Subsidiary and Advanced Level. Published

Disposition. Quantitative Research Methods. Science what it is. Basic assumptions of science. Inductive and deductive logic

Report of a Workshop on Dose-Response Approaches for Nuclear Receptor- Mediated Modes of Action

Transcription:

Uncertainty Characterization: The Role of Hypothesis-Based Weight of Evidence" Lorenz Rhomberg, PhD Gradient Corporation lrhomberg@gradientcorp.com 617-395-5552 LSRO / CHRA -- 23 May 2007 -- Washington DC

Outline Uncertainty in QRA WoE 1 2 Hypothesis-Based WoE 4 3 5 Epistemology

QRA Challenges Many choices of datasets, dose measures, models, low-dose extrapolations, Just doing most sensitive no longer acceptable calls for uncertainty characterization most sensitive may not be most supportable or representative why should one dataset represent the whole of the agent s potential carcinogenicity? ranges (as with TCE) are problematic with risk managers How to weigh alternatives? How much information to move from defaults?

Main Components of Uncertainty Is each potential endpoint a human-relevant hazard? choice of dataset to represent human risk D-R model fit Uncertainty in response measure Uncertainty in dose measure low-dose extrapolation of high-dose effects toxicologic equivalency of exposures across species

Expert Judgment Assessment of Chloroform Carcinogenic Potency Probability Tree Evans et al. (1994) Reg Toxicol Pharmacol 20:15-36

Individual Expert s Distributions for excess cancer risk from 100 ppb Chloroform in Drinking Water Expert Opinion Varies Among Experts We Seek an Objective, Transparent, Data- Driven Means to Assign Weights Evans et al. (1994) Reg Toxicol Pharmacol 20:15-36

Where QRA Uncertainty Gets Stuck Significant Qualitative Uncertainties that do not fit easily into statistical distribution characterization (questions of bearing, relevance, extrapolability) Qualitative questions complicate even straightforward uncertainty approaches (through questions about model misspecification) And they are the most consequential elements of the overall uncertainty!

FOR HUMAN RISK ESTIMATION: What specific endpoints are: Expected in Humans? Likely? Possible? What datasets best represent them? What models provide an appropriate basis for generalizing from studied responses to human risk projections?

Outline Uncertainty in QRA WoE 1 2 Hypothesis-Based WoE 4 3 5 Epistemology

Weight-of-Evidence for Human Carcinogenicity (Hazard ID) Douglas Weed Weight of Evidence has several definitions. Need to specify what is meant and specify criteria Christina Rudén Different organizations when evaluating the same set of (TCE) data come to different conclusions about hazard

Weight-of-Evidence for Human Carcinogenicity (Hazard ID) An instance in which expert judgment about qualitative questions is routinely used 1986 Guidelines Too Prescriptive (need to consider MoA, synthesis across studies) 2005 Guidelines Too Unstructured? Factors to consider, but how to synthesize? How to weigh + / -?

Weight-of-Evidence Factors Human Tumor Epidemiology Animal Bioassays Physicochemical Properties Pharmacokinetics Mode of Action / Precursor Effects Similar Chemicals Increase Weight Number of independent studies with consistent results Same site across species, structural analogues Multiple observations Species Sites Sexes Severity and progression of lesions Early in life tumors/malignancy Dose response relationships Lesion progression Uncommon tumor Route of administration like human exposure BIOASSAY FACTORS Decrease Weight Single study Inconsistent results Single site/species/sex Benign tumors only High background of incidence tumors Route of administration unlike human exposure

Weight-of-Evidence for Human Carcinogenicity (Hazard ID) An instance in which expert judgment about qualitative questions is routinely used 1986 Guidelines Too Prescriptive (need to consider MoA, synthesis across studies) 2005 Guidelines Too Unstructured? Factors to consider, but how to synthesize? How to weigh + / -? Role of negative studies Several weak endpoints Likely? How to argue conclusions against evidence?

Carcinogen WoE & QRA -- Current EPIDEMIOL. A B B C D E BIOASSAY B B F - MoA X Y Z Haz ID WoE Classification QRA Choice of Dataset Epi B Animal B (alternative dosimeters & models) Upper Bound Potency Factor

Outline Uncertainty in QRA WoE 1 2 Hypothesis-Based WoE 4 3 5 Epistemology

INTERNAL VALIDITY (Rigor) EXTERNAL VALIDITY (Bearing)

as a metaphor as a method WEIGHT OF EVIDENCE use all the data systematic evaluation aim at objective procedures that lay out the process of scientific professional judgment Question: In view of incomplete and contradictory evidence, how compelling is the case for the existence of human cancer risk?

WEIGHT OF EVIDENCE HOW MUCH DOES EVIDENCE WEIGH? i.e., what process for the evaluation of evidence provides a means for judging how compelling it is and how to trade off among apparent contradictions? Evidence has no meaning except in its relation to a specific hypothesis It is the hypothesis that is evaluated w.r.t. the evidence (not the other way around) So WoE should be organized around evaluating specific hypotheses against data

Endpoint-by Endpoint Evaluation of Data EPIDEMIOL. A B B C D - E BIOASSAY B B F - MoA X - Y - Z WoE for Endpoint B

Endpoint-by Endpoint Evaluation of Data EPIDEMIOL. BIOASSAY MoA A B B C D - E B B F - X - Y - Z Systematic, endpointby-endpoint evaluation Note all results, not just supportive ones Evaluate w.r.t. the hypothesized basis for human prediction of risk Result: a WoE evaluation for each endpoint of interest WoE for Endpoint C

Articulate an Hypothesis What is the proposed basis for inferring that a particular phenomenon seen in studies of a chemical's effects will also happen in environmentally (or occupationally) exposed humans? animal studies and human studies general scientific understanding and experience with other agents

Articulate an Hypothesis What is the proposed basis for inferring that a particular phenomenon seen in studies of a chemical's effects will also happen in environmentally (or occupationally) exposed humans? what commonality of: material bases? causal processes? progressions of events?

Articulate an Hypothesis What is the proposed basis for inferring that a particular phenomenon seen in studies of a chemical's effects will also happen in environmentally (or occupationally) exposed humans? a generalization, not just an extrapolation (should apply to all cases within its realm) issue is to define that realm and what manifestations of the hypothesis are expected and not expected which can then be checked against all the actual observed results (not just the source study to human extrapolation)

Evaluate how compelling is the case for the proposed basis for human risk in view of: "predictions" of hypothesis that are confirmed in the observations more weight to "risky" predictions more weight to specific predictions less weight when subsidiary assumptions or explanations needed

Evaluate how compelling is the case for the proposed basis for human risk in view of: apparent refutations (couterexamples) failure to repeat result across studies non-responding sexes or species unpredicted but clearly relevant phenomena An hypothesis can often be reconciled with apparent refutations by modifying it or by adding subsidiary assumptions but this entails a weight "penalty"

Hypotheses (and the human risk projections they entail) are compelling to the extent that: they are compatible with existing data they explain patterns among existing data they comport with and emerge from our knowledge of carcinogenesis and biology in general they are predictive and not just accommodative competing hypotheses aren't also compatible/explanatory apparent refutations can be reconciled using plausible, likely explanations

Endpoint-by Endpoint Evaluation of Data EPIDEMIOL. BIOASSAY MoA A B B C D - E B B F - X - Y - Z Systematic, endpointby-endpoint evaluation Note all results, not just supportive ones Evaluate w.r.t. the hypothesized basis for human prediction of risk Result: a WoE evaluation for each endpoint of interest WoE for Endpoint C

Evaluating Hypotheses Against the Data Point is not to prove or disprove which is impossible Rather, to gauge plausibility relative to other endpoints and other cases Weights inversely proportional to how much one must fill in gaps with defaults and/or ad hoc explanations of apparent refutations How much stretching of credibility is needed? How much faith in assumptions is required? How much does the hypothesis explain that would be unexplained without it?

Can (and should) also evaluate hypotheses about lack of human carcinogenic risk how to reconcile assertion of lack of human risk with positive studies and the general presumption that animals and humans share fundamental physiology and control of cell division and differentiation? how compelling is the basis for assertion of human risk versus the basis for assertion of lack of risk gauged by their abilities to account for the array of all the relevant data

For Each Hypothesis Kidney tumors secondary to high-dose cytotoxicity due to reactive thiol PREDICTED SUPPORTING SUPPORTING WITH SIGNIFICANT AD HOC ASSUMPTIONS NEITHER SUPPORTING NOR REFUTING REFUTING UNLESS ACCEPT SIGNIFICANT AD HOC EXPLANATIONS REFUTING Bioassay Rats w kidney tumors have kidney cytotoxicity Some Strains of Rat have fewer tumors and females have fewer (sensitivity differences? metabolic differences?) Bioassay Mice have kidney toxicity but no kidney tumors (less sensitive to cytotoxicity? why?) Human studies w kidney tumors only at very high exposures (high enough for cytotoxicity?) Humans seem to produce reactive thiol (enough?) Most human studies are w/o kidney tumors, but have lower exposures (low enough to avoid cytotocity?) note key assumptions (general and agent-specific) parallel evaluation of alternative MoA (e.g., mutagenic thiol) need an account of the whole array of observations

Mode of Action Assessing Relevance to Humans Mammary Tumors Yes No? Table from Cohen SM et al. 2004. Toxicol Sci 78:181

Does THIS ENDPOINT constitute a Human Hazard? Reasoned Ranking: Compelling Likely Plausible (w further assumptions) Liver, Kidney Not Impossible MCL, Esophageal

OVERALL WoE is Secondary to the Set of Endpoint-Specific WoE Does THIS ENDPOINT constitute a Human Hazard? Reasoned Ranking: Compelling Likely Plausible (w further assumptions) Not Impossible Liver, Kidney MCL, Esophageal Human Carcinogen Likely Possible Cannot Be Determined Unlikely This (along with the reasoning to develop it) becomes part of the narrative

Advantages of Hypothesis-Based WoE Shows which endpoints are most compelling Makes reasoning explicit shows both strengths and weaknesses Allows debate about particular data and interpretations Frames WoE classifications as scientific statements (falsifiable, points to tests) Does not blend all endpoints together in an uninterpretable generalized carcinogenicity statement Informs the QRA process

Outline Uncertainty in QRA WoE 1 2 Hypothesis-Based WoE 4 3 5 Epistemology

Does THIS ENDPOINT constitute a Human Hazard? Reasoned Ranking: Compelling QRA Based on animal liver Based on human liver metaanalysis Based on human high-dose kidney Likely Plausible (w further assumptions) Not Impossible Liver, Kidney MCL, Esophageal Based on animal kidney Based on rat MCL Based on human esophageal metaanalysis

QRA Uncertainty Characterization Endpoint WoE categories give relative weighting Hypothesis informs quantitative uncertainty analysis (e.g., variation in sensitivity of rats and mice may indicate uncertainty about human sensitivity) Still have statistical uncertainties for each dataset analysis (model fit, extrapolation, x-spp), but the qualitative questions have been removed to the WoE category question representativeness of alternative potencies tied to Hazard ID understanding

Advantages of Hypothesis-Based WoE Gives some measure of how compelling the evidence is for each endpoint Gives structure and substance to the overall WoE Gives context for model and dataset choices; ties QRA to WoE Not just one endpoint to represent all possible human risks Lays out reasoning; transparent

Is This Just What we do already, but reorganized? The MoA human-relevance framework? The Bradford Hill Criteria? Evidence-Based Toxicology? (Guzelian)

Outline Uncertainty in QRA WoE 1 2 Hypothesis-Based WoE 4 3 5 Epistemology

Help from Epistemology in Gauging WoE and Evaluating Hypotheses Against Data Epistemology, or theory of knowledge, is the branch of philosophy that studies the nature and scope of knowledge and belief. What is the nature of knowledge? By what processes of inference can we acquire knowledge? How are we justified in claiming knowledge?

Considerations in Evaluating a Theory falsifiability parsimony predictivity (and retrodiction ) risky predictions penalties for ad hoc addenda to reconcile underdetermined theories with discordant observations penalties for accommodation

Thank you for the opportunity to present these ideas.