A Data Mining Approach for Signal Detection and Analysis

Similar documents
Key Elements in Adverse Drug Interaction Safety Signals

Principles of Signal Detection. Pia Caduff-Janosa MD Annual Course May 2013

Statistical methods for knowledge discovery in adverse drug reaction surveillance

Active TB drug-safety management & monitoring

Rhabdomyolysis a result of azithromycin and statins: an unrecognized interaction

Automatic extraction of adverse drug reaction terms from medical free text

A method for data driven exploration to pinpoint key features in medical data and facilitate expert review

Reporting Patterns Indicative of Adverse Drug Interactions

TITLE: SAFE USE OF MEDICINES IN ZANZIBAR A

Principles of signal detection. Marie Lindquist

Time series disturbance detection for hypothesis-free signal detection in longitudinal observational databases

A Comparative Study of Data Mining Algorithms used for Signal Detection in FDA AERS Database

Practical Bayesian Optimization of Machine Learning Algorithms. Jasper Snoek, Ryan Adams, Hugo LaRochelle NIPS 2012

OHDSI Tutorial: Design and implementation of a comparative cohort study in observational healthcare data

PHARMACOVIGILANCE: WHEN ADVERSE DRUG EVENTS OCCUR. Bartlomiej Piechowski-Jozwiak, MD Rabih Dabliz, Pharm.D., FISMP

A Proof-of-Concept Evaluation of Adverse Drug Reaction Surveillance in Electronic Health Records

Humane League Labs. Sabine Doebel, Susan Gabriel, and The Humane League

Modeling Individual and Group Behavior in Complex Environments. Modeling Individual and Group Behavior in Complex Environments

Review: Logistic regression, Gaussian naïve Bayes, linear regression, and their connections

Scientific Council Fifty-first Session 21/11/2014

Response to Comment on Cognitive Science in the field: Does exercising core mathematical concepts improve school readiness?

2008 Public Status Report on the Implementation of the European Risk Management Strategy. Executive Summary

Pharmacovigilance. Dr. Anjan Adhikari Associate Professor Department of Pharmacology R G Kar Medical College & Hospital Kolkata

SUPPLEMENTARY INFORMATION

How Does Analysis of Competing Hypotheses (ACH) Improve Intelligence Analysis?

Representing Association Classification Rules Mined from Health Data

Bayesians methods in system identification: equivalences, differences, and misunderstandings

Outline. What s inside this paper? My expectation. Software Defect Prediction. Traditional Method. What s inside this paper?

Neurons and neural networks II. Hopfield network

COMMITMENT &SOLUTIONS UNPARALLELED. Assessing Human Visual Inspection for Acceptance Testing: An Attribute Agreement Analysis Case Study

Lecture 3: Bayesian Networks 1

Access from the University of Nottingham repository:

Collaborative Project of the 7th Framework Programme. WP6: Tools for bio-researchers and clinicians

Pattern Recognition of Functional Neuroimage Data of the Human Sensorimotor System after Stroke

Predicting Breast Cancer Survivability Rates

Regulatory Aspects of Pharmacovigilance

AP STATISTICS 2008 SCORING GUIDELINES (Form B)

Clinical risk management in community pharmacy - Henk Buurma SUMMARY

A Vision-based Affective Computing System. Jieyu Zhao Ningbo University, China

Adverse Drug Effect Detection

I. INTRODUCTION III. OVERALL DESIGN

Signal Detection of Adverse Drug Reaction using the Adverse Event Reporting System: Literature Review and Novel Methods

Novel developments in the risk assessment of food allergens

Adverse Events Monitoring (aka Pharmacovigilance)

Reach and grasp by people with tetraplegia using a neurally controlled robotic arm

Signal detection at Lareb

Improved Intelligent Classification Technique Based On Support Vector Machines

Index. Springer International Publishing Switzerland 2017 T.J. Cleophas, A.H. Zwinderman, Modern Meta-Analysis, DOI /

Numerical Integration of Bivariate Gaussian Distribution

Computational Cognitive Neuroscience

Chapter 1. Introduction

A New Approach for Detection and Classification of Diabetic Retinopathy Using PNN and SVM Classifiers

Decision Making in Confirmatory Multipopulation Tailoring Trials

EQUATOR Network: promises and results of reporting guidelines

MORPHOLOGICAL CHARACTERIZATION OF ECG SIGNAL ABNORMALITIES: A NEW APPROACH

Clinical Measurement of Drug Action

WHO-ART. Magnus Wallberg. November 24 th 2009 Dar Es Salaam, Tanzania.

Practicality of direct Reporting Of ADRs by Patients

Regression Benchmarking with Simple Middleware Benchmarks

Risk Assessment Terminology, Expression of Nature and level of Risk and Uncertainties

CHAPTER VI RESEARCH METHODOLOGY

Meta-analysis of two studies in the presence of heterogeneity with applications in rare diseases

Guideline on good pharmacovigilance practices (GVP)

Statistical Methods for Wearable Technology in CNS Trials

Systematic Reviews and meta-analyses of Diagnostic Test Accuracy. Mariska Leeflang

Global Vaccine Safety Initiative Portfolio activities

OATech+ Network: Using Technology to work towards improving Patient Outcomes. Leanne Sawle

Identification of Tissue Independent Cancer Driver Genes

arxiv: v1 [stat.ml] 23 Jan 2017

ANALYSIS AND DETECTION OF OUTLIERS AND SYSTEMATIC ERRORS FROM AN INDUSTRIAL DATA PLANT

Neuro-Inspired Statistical. Rensselaer Polytechnic Institute National Science Foundation

International Comparative Performance of India s Scientific Research INTRODUCTION... 3 EXECUTIVE SUMMARY... 3 INDIA S BIBLIOMETRIC FINGERPRINT...

A study of adverse reaction algorithms in a drug surveillance program

Building an evidence-based social sector in New Zealand. Prepared and presented by Dr Carolyn O Fallon September 2017

Methodological Framework and Tools for Cross-National comparisons of Drug Utilization Data

A practical handbook on the pharmacovigilance of antiretroviral medicines

Impact Evaluation Toolbox

Statistical analysis DIANA SAPLACAN 2017 * SLIDES ADAPTED BASED ON LECTURE NOTES BY ALMA LEORA CULEN

eappendix A. Simulation study parameters

A practical handbook on the pharmacovigilance of antiretroviral medicines

A Method to Combine Signals from Spontaneous Reporting Systems and Observational Healthcare Data to Detect Adverse Drug Reactions

Computer based delineation and follow-up multisite abdominal tumors in longitudinal CT studies

Running head: INDIVIDUAL DIFFERENCES 1. Why to treat subjects as fixed effects. James S. Adelman. University of Warwick.

Vaccine Safety: the risk of undermining our successes at the global level

Applying STPA to the Artificial Pancreas for People with Type 1 Diabetes

MRC Integrative Epidemiology Unit (IEU) at the University of Bristol. George Davey Smith

The impact of age and sex on the reporting of cough and angioedema with renin angiotensin system inhibitors: a case/ noncase study in VigiBase

Enumerative and Analytic Studies. Description versus prediction

Need for improved post-marketing surveillance in receiving countries and suggestions on how to monitor safety of newly introduced vaccines

Overview of the procedures currently used at EFSA for the assessment of dietary exposure to chemical substances

EpiDash A GI Case Study

M.Sc. in Cognitive Systems. Model Curriculum

Läkemedelsverkets Farmakovigilansdag

An Artificial Neural Network Architecture Based on Context Transformations in Cortical Minicolumns

Drugs Regulatory Unit Ministry of Health PHARMACOVIGILANCE GUIDELINES

ANN predicts locoregional control using molecular marker profiles of. Head and Neck squamous cell carcinoma

Enhanced Asthma Management with Mobile Communication

Causal Modeling of the Glucose-Insulin System in Type-I Diabetic Patients J. Fernandez, N. Aguilar, R. Fernandez de Canete, J. C.

This document was created on 07 January 2005 and has been printed from

ST. MICHAEL S HOSPITAL Guidelines for Reporting Serious Adverse Events / Unanticipated Problems to the SMH Research Ethics Board (REB) July 09, 2014

Transcription:

SHORT COMMUNICATION Drug Safety 2002; 25 (6): 393-397 0114-5916/02/0006-0393/$25.00/0 Adis International Limited. All rights reserved. A Data Mining Approach for Signal Detection and Analysis Andrew Bate, 1,2 Marie Lindquist, 1 I. Ralph. Edwards 1 and Roland Orre 3 1 The Uppsala Monitoring Centre, WHO Collaborating Centre for International Drug Monitoring, Uppsala, Sweden 2 Division of Clinical Pharmacology, Umea University, Umea, Sweden 3 Department of Mathematical Statistics, Stockholm University, Stockholm, Sweden Abstract The WHO database contains over 2.5 million case reports, analysis of this data set is performed with the intention of signal detection. This paper presents an overview of the quantitative method used to highlight dependencies in this data set. The method Bayesian confidence propagation neural network (BCPNN) is used to highlight dependencies in the data set. The method uses Bayesian statistics implemented in a neural network architecture to analyse all reported drug adverse reaction combinations. This method is now in routine use for drug adverse reaction signal detection. Also this approach has been extended to highlight drug group effects and look for higher order dependencies in the WHO data. Quantitatively unexpectedly strong relationships in the data are highlighted relative to general reporting of suspected adverse effects; these associations are then clinically assessed. According to the WHO, an adverse drug reaction signal is Reported information on a possible causal relationship between an adverse event and a drug, the relationship being unknown or incompletely documented previously. [1] Asignalis an evaluated association between a drug and an adverse drug reaction, which is considered important to investigate further, and which may refer to new information on an already known association. Usually more than a single report is required to generate a signal, depending upon the seriousness of the event and the quality of the information. The role of the Uppsala Monitoring Centre [2] is to detect such signals from a database of spontaneously reported case reports of suspected adverse drug reactions (ADRs). The case reports are sent from the 67 member countries of the WHO Programme for International Drug Monitoring. 1. Old System of Signal Detection Previously, every 3 months, lists of new drug- ADR combinations were generated from all new reports received during that time period. This data was sent to a panel of experts for review, and the summaries of the signals detected by the experts were circulated to National Centres. The large volume of generated data made all the listed information impossible to review effectively, and there

394 Bate et al. was a lack of automation and follow-up within the signalling system. The large size of the data set (more than 2.5 million cases) makes individual case by case assessment impossible when entering cases into the database, therefore clinical review of case series has to be performed. As the volume of data received on a quarterly basis continued to increase it was decided that a new approach to signal detection was needed. [3] In order to focus the clinical analysis of case series on those drug-adr combinations most likely to be signals, a quantitative approach could be used to filter out noise, and to highlight potential signals. 2. Objectives A new method was needed that would provide: Objective initial assessment of all drug-adr combinations Transparent selection of drug-adr combinations for review A quantitative aid to signal assessment. To meet these objectives a Bayesian neural network was developed for quantitative signal detection on the WHO database. The method is called a Bayesian confidence propagation neural network (BCPNN) and implements Bayesian statistics within a neural network architecture. 3. Use of a Neural Network The term neural network is used to describe a wide range of different computational architectures. These architectures are used for a diverse range of tasks, including prediction, classification of data sets, and data mining. Neural networks are made up of many simple processors ( units ), where each unit has a small amount of local memory. Communication channels ( connections ), which carry numerical data, link the units to each other, so that each unit only operates on local data and on inputs received via connections from other units. In this application, a neural network is used to search for dependencies in the data set. The network is routinely used to analyse the strength of connections between drugs and adverse reactions and is a very computationally efficient architecture for considering all combinations. The network is used to count all reports in the database: all occurrences of variable x (for example a drug), all occurrences of variable y (for example an ADR) and all occurrences of x and y together. The method is readily adapted to count occurrences of combinations of any pair of variables. While the routine use of the method does not strictly require a neural network, the neural network architecture properties are used fully when performing unsupervised pattern recognition to search for previously unknown higher order dependencies in the data set. [4] 4. Choice of Measure of Disproportionality The strength of dependency between a drug and adverse reaction is defined by a logarithmic measure of disproportionality called the information component (IC). This measure is defined as: IC = log 2 pxy (, ) px ( ) py ( ) where: p (x) = probability of a specific drug x being listed on a case report; p (y) = probability of a specific ADR y being listed on a case report; p (x, y) = probability that a specific drug-adr combination x and y, is listed on a case report. This equation can also be written as: IC = log 2 p( y x) p( y) where: p (y x) = conditional probability of y given x, i.e. the probability of a specific ADR y being listed on a case report given the information that a specific drug x is listed as suspected on that case report. Thus the IC value is based on: the number of case reports with drug x ; and the number of case reports with ADR y ; and

Data Mining Approach in Signal Detection and Analysis 395 the number of reports with the specific combinationofdrugandadr;and the total number of reports. The IC measure can be seen as the calculation of the logarithm of the ratio of observed rate of reporting of a specific drug-adverse drug reaction combination, to the expected rate, under the null hypothesis of no association between drug and ADR. Thus when a drug-adr combination is reported more often than expected relative to general reporting of the drug, and general reporting of the ADR, it results in positive values of the IC. This IC measure is calculated within a Bayesian framework; rather than the IC being a point estimate, the IC is a distribution that changes on additionofnewdata.itisassumedthatadrugandan ADR are independent in the absence of data of either the drug or ADR. A prior distribution is set upbasedonthisassumption.thereforeintheabsence of data, the expectation value of the IC distribution E [IC] equals zero. The distribution shrinks (the variance of the IC diminishes in magnitude), as data is received, and the expectation value of the IC distribution either increases or decreases on addition of this data. Bayesian statistics is used: To motivate the use of the information component as a measure of disproportionality useful in highlighting unexpected dependencies: as the IC is the logarithm of ratio of the posterior and prior probabilities, it represents the change in probability on addition of new data. To calculate the information component as a distribution, rather than just a point estimate, basedonpriorandposteriordistributions. When the network is used for classification or in pattern recognition, as the outputs in the neural network are posterior probabilities. 5. Use of Bayesian Method A Bayesian statistical approach has been used for a number of reasons: Ready applicability to low (and zero) counter values. Results can be calculated despite missing data, by providing increased uncertainty in the results. Analysis of other variables and multi-variable analyses can be performed. There is an intuitive relationship between the point estimate of the IC and its confidence interval. These are merely two properties of the IC distribution, where the point estimate of the IC is calculated from the expectation value, and the confidence interval is calculated from the variance. 6. Use and Implementation of a Bayesian Confidence Propagation Neural Network A Bayesian statistical approach is used to highlight unexpected dependencies in the data set; [5] implementation in a neural network architecture allows the routine calculation of the strength (or weakness) of all drug-adr dependencies in the data set. The calculations performed routinely will ultimately be extended to calculate dependencies between all variables in the data set, although currently such calculations are done on an ad hoc basis. These calculations result in an IC and confidence intervals for each IC (more accurately probability intervals ), for each relationship in the network. As the IC is also the weight in the neural network, the computational efficiency of the method is further optimised. This process is used to highlight drug-adr combinations most likely to be signals, and to look for more complex patterns in the data. This quantitative signal detection methodology is now in routine use for drug adverse reaction signal detection in the WHO database. The method has been implemented into a new systematic signalling process. [6] Every3months the IC and its confidence interval are calculated for all drug-adr combinations. Combinations, where the lower 95% confidence limit is newly greater than zero, are highlighted for clinical review. If considered signals after clinical assessment these

396 Bate et al. are then communicated externally. Follow-up processes are also used to provide further information on signals, or allow those potentially clinically interesting combinations to be signalled at a later stage when sufficient data has accumulated. The BCPNN approach is: transparent easy to see what has been calculated robust valid results can be produced despite missing data producing reproducible results making validation and checking easy time efficient network only needs one pass across data. 7. Evaluation of Method Initial testing of the approach was done by examining whether the IC was positive for drug-adr combinations we would hope to signal, such as captopril-coughing, while negative for drug-adr combinations we would hope not to highlight, such as the combinations digoxin-rash, or digoxin-acne. Captopril-coughing is a signal that would have been highlighted much earlier using the BCPNN method. [7] In contrast, neither digoxin-rash nor digoxin-acne was highlighted with this method. [7] Tests to determine whether we would have found signals earlier with the new method were made both against general reference sources (Physicians Desk Reference and Martindale) and extant literature reports in another sensitive international signalling system (database of the publication Reactions Weekly). In the investigation against general reference sources, a retrospective evaluation of 95 combinations highlighted by BCPNN, and 13 not highlighted by BCPNN from 1993 were made. In this study the positive predictive value reached 44.2% since 42 of the 95 combinations would have been found with the BCPNN. The negative predictive value was 84.6%, since only two of the 13 combinations not highlighted, were found to be signals. For the purposes of the study signals were defined as combinations unknown in 1993, but listed in the literature 7 years later. Both of these studies are described in detail in earlier publications. [6,8] 8. Detection of Group Effects Beyond detecting specific drug specific ADR signals, the BCPNN method can also be used to examine the relation between a group of similar drugs and the same ADR. [9] Grouping of drugs can be done for example using the WHO Anatomical Therapeutic Chemical (ATC) classification. [10] An investigation is performed to look at both: IC [drug-adr] IC [(ATC group-drug)-adr] and the two values compared. The comparison group must be clinically valid and selected objectively before the analysis is performed. The use of this method relies on the amount of relevant data available and the proportion of cases of multiple suspected drugs on case reports. The types of drugs investigated may also effect the usefulness of the approach; for example drugs with multiple drug indications (at potentially different doses) may be particularly problematic. Also differences in use of ADR terminology, such as inter-country variation, will influence the results. 9. Conclusions This quantitative method has been developed to improve the usefulness of the WHO database of spontaneously reported suspected drug-adverse reactions. The BCPNN method has been shown to be of use in routine signal detection [8] and has also been used in classification tasks, such as analysing organophosphate poisoning cases, and quality in pulp manufacturing process. The usefulness of the output is influenced by the quality of the data in the database, therefore this method should be used to detect, rather than evaluate signals. The need for clinical analyses of case series remains crucial. Since there will always be some drug-adr combinations that can be considered clinically interesting and important, but that are not being reported unexpectedly frequently, other approaches

Data Mining Approach in Signal Detection and Analysis 397 are also needed. The BCPNN is a tool developed to enhance rather than replace traditional signal detection procedures used on the WHO database by: (i) initial highlighting of associations for clinical review; (ii) providing a tool for further analysis of potential signals; and (iii) seamless extension of the method for detection of complex dependencies in the data set. References 1. Edwards IR, Biriell C. Harmonisation in pharmacovigilance. Drug Saf 1994; 10 (2): 93-102 2. Olsson S. The role of the WHO programme on international drug monitoring in coordinating worldwide drug safety efforts. Drug Saf 1998; 19 (1): 1-10 3. Bate A, Orre R, Lindquist M, et al. Explanation of data mining methods [online]. Available from URL: http://www.bmj. com/cgi/content/full/322/7296/1207/dc1 [Accessed 2001 Oct 15] 4. Bate A, Orre R, Lindquist M, et al. Pattern recognition using a recurrent neural network and its application to the WHO database [abstract]. Pharmacoepidemiol Drug Saf 2001; 10 (S1): S163 5. Orre R, Lansner A, Bate A, et al. Bayesian neural networks with confidence estimations applied to data mining. Computational Statistics Data Anal 2000; 34 (8): 473-93 6. Lindquist M, Edwards IR, Bate A, et al. From association to alert: a revised approach to international signal analysis. Pharmacoepidemiol Drug Saf 1999; 8: S15-25 7. Bate A, Lindquist M, Edwards IR, et al. A Bayesian neural network method for adverse drug reaction signal generation. Eur J Clin Pharmacol 1998; 54: 315-21 8. Lindquist M, Stahl M, Bate A, et al. A retrospective evaluation of a data mining approach to aid finding new adverse drug reaction signals in the WHO international database. Drug Saf 2000; 23 (6): 533-42 9. Bate A, Lindquist M, Orre R, et al. Automated classification of signals as group effects or drug specific on the WHO database [abstract]. 8th Annual Meeting European Society of Pharmacovigilance; 2000 Sep; Verona; Elsevier, 2000 10. Guidelines for ATC classification and DDD assignment. 3rd ed. Oslo: WHO Collaborating Centre for Drug Statistics Methodology, 2000 Correspondence and offprints: Andrew Bate, The Uppsala Monitoring Centre, Stora Torget, Uppsala, S-75320, Sweden. E-mail: andrew.bate@who-umc.org