From Delphi to Phi-T

From Delphi to Phi-T: Spin-Off from Physics Research to Business
Prof. Dr. Michael Feindt
KCETA - Centrum für Elementarteilchen- und Astroteilchenphysik
IEKP, Universität Karlsruhe, Karlsruhe Institute of Technology (KIT) & Phi-T GmbH, Karlsruhe
DELPHI Closing Meeting, CERN, May 29, 2008

HPC (1991-92): Initially energy and spatial resolutions were catastrophic. Solve problems by looking IN DETAIL (144 modules) into the real DATA. Correct by software.

HPC design performance almost reached, physics analysis possible

Introduction of Software Tasks e.g. ELEPHANT, MAMMOTH packages

(Inclusively reconstr.) B physics B* B**

Start to work with neural networks (in 1993). BSAURUS: b flavour tagging. MACRIB: Combined Particle ID.

Gained lots of experience with neural network applications in DELPHI (1993-2003):
- Electron id (ELEPHANT)
- Kaon, proton id (MACRIB)
- BSAURUS:
  - Optimisation of resolutions of inclusive B: E, φ, θ, Q-value
  - Inclusive B production flavour tagging
  - Inclusive B decay flavour tagging
  - Inclusive B+/B0/B_s/Lambda_b separation
  - B**, B_s** enrichment
  - B fragmentation function
  - Limit on B_s mixing
  - B0 mixing
  - B F/B asymmetry
  - B -> wrong sign charm

Neural reconstruction of a direction
Resolution of azimuthal angle of inclusively reconstructed B-hadrons in the DELPHI detector: first neural reconstruction of a direction
[Figure: NeuroBayes phi-direction compared with the best "classical" chi**2 fit (BSAURUS)]
No selection: Improved resolution
After selection cut on estimated error: Resolution massively improved, no tails ==> allows reliable selection of good events

Naïve neural networks and criticism
"We've tried that but it didn't give good results"
- Stuck in local minimum
- Learning not robust
"We've tried that but it was worse than our 100 person-years analytical high tech algorithm"
- Selected too naive input variables
- Use your fancy algorithm as INPUT!
"We've tried that but the predictions were wrong"
- Overtraining: the net learned statistical fluctuations
"Yeah but how can you estimate systematic errors?"
- How can you with cuts when variables are correlated?
- Tests on data, data/MC agreement possible and done.

Address all these topics and build a professional, robust and flexible neural network package for physics and general prediction tasks (2000-2002). NeuroBayes turned out to be a very strong tool.

After having looked outside the ivory tower, we realized: these methods are not only applicable in physics.
<phi-t>: Foundation of a private company out of the University of Karlsruhe, initially sponsored by the EXIST-Seed programme of the Federal Ministry of Education and Research (BMBF)

History
2000-2002: NeuroBayes specialisation for economy at the University of Karlsruhe
Oct. 2002: GmbH founded, first industrial projects
June 2003: Move into new office, 199 sqm, IT-Portal Karlsruhe
May 2008: Expansion to 700 sqm office
Foundation of sub-company Phi-T products&services
Exclusive rights for NeuroBayes
Staff: all physicists (almost all from HEP)
Continuous further development of NeuroBayes

Customers (among others): BGV and VKB (car insurances), AXA and Central (health insurances), Lupus Alpha Asset Management, dm drogerie markt (drugstore chain), Otto Versand (mail order business), Libri (book wholesale), Thyssen Krupp (steel industry)

NeuroBayes principle
Input
Preprocessing
NeuroBayes Teacher: Learning of complex relationships from existing data bases (e.g. Monte Carlo)
NeuroBayes Expert: Prognosis for unknown data
Significance control
Postprocessing
Output

How it works: training and application
Training: Historic or simulated data (data set a = ..., b = ..., c = ..., ..., t = !) -> NeuroBayes Teacher -> Expertise
Application: Actual (new real) data (data set a = ..., b = ..., c = ..., ..., t = ?) -> NeuroBayes Expert (expert system) -> Probability that hypothesis is correct (classification) or probability density f(t) for variable t

NeuroBayes task 1: Classifications
Classification: Binary targets: Each single outcome will be "yes" or "no".
NeuroBayes output is the Bayesian posterior probability that the answer is "yes" (given that inclusive rates are the same in training and test sample; otherwise a simple transformation is necessary).
Examples:
> This elementary particle is a K meson.
> This event is a Higgs candidate.
> Germany will become soccer world champion in 2010.
> Customer Meier will have liquidity problems next year.
> This equity price will rise.
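The slide only says that a "simple transformation" is needed when the inclusive rates differ between training and test sample. One standard form of such a correction follows directly from Bayes' theorem and is sketched below. The function name `transform_posterior` and its parameters are illustrative assumptions, not part of the NeuroBayes interface.

```python
def transform_posterior(p, train_rate, target_rate):
    """Re-express a posterior p, learned with signal fraction train_rate,
    for a sample whose signal fraction is target_rate (Bayes' theorem).

    Hypothetical helper for illustration; not a NeuroBayes function.
    """
    for name, value in (("train_rate", train_rate), ("target_rate", target_rate)):
        if not 0.0 < value < 1.0:
            raise ValueError(f"{name} must lie strictly between 0 and 1")
    if not 0.0 <= p <= 1.0:
        raise ValueError("p must be a probability")
    # Reweight the "yes" and "no" hypotheses by the ratio of new to old priors.
    yes = p * target_rate / train_rate
    no = (1.0 - p) * (1.0 - target_rate) / (1.0 - train_rate)
    return yes / (yes + no)


# Trained on a 50/50 sample, applied where only 10% of candidates are signal.
print(transform_posterior(0.5, 0.5, 0.1))
```

When both rates agree the output is returned unchanged, which matches the condition stated on the slide.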

NeuroBayes task 2: Conditional probability densities
Probability density for real valued targets: For each possible (real) value a probability (density) is given. From that all statistical quantities like mean value, median, mode, standard deviation, percentiles etc. can be deduced.
Examples:
> Energy of an elementary particle (e.g. a semileptonically decaying B meson with missing neutrino)
> Q value of a decay
> Lifetime of a decay
> Price change of an equity or option
> Company turnover or earnings
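To illustrate how the quantities listed above follow from a predicted density, here is a minimal sketch that assumes the density is available as samples on an equally spaced grid. The function `summarise_density` and the toy numbers are assumptions for illustration only. How NeuroBayes represents the density internally is not stated on the slides.

```python
import math


def summarise_density(t_grid, density):
    """Summary statistics of a density sampled on an equally spaced grid."""
    total = sum(density)
    if total <= 0:
        raise ValueError("density must have positive total weight")
    # Equal spacing: the normalised samples act as per-point probabilities.
    probs = [d / total for d in density]
    mean = sum(p * t for p, t in zip(probs, t_grid))
    variance = sum(p * (t - mean) ** 2 for p, t in zip(probs, t_grid))
    mode = t_grid[max(range(len(probs)), key=probs.__getitem__)]

    def quantile(q):
        # First grid point at which the cumulative probability reaches q.
        cumulative = 0.0
        for p, t in zip(probs, t_grid):
            cumulative += p
            if cumulative >= q:
                return t
        return t_grid[-1]

    return {"mean": mean, "std": math.sqrt(variance), "mode": mode,
            "median": quantile(0.5), "p10": quantile(0.1), "p90": quantile(0.9)}


# Toy example: a symmetric, triangle-shaped density centred at zero.
summary = summarise_density([-2, -1, 0, 1, 2], [1, 2, 3, 2, 1])
print(summary)
```

The quantiles are resolved only to the grid spacing. A finer grid or interpolation would be needed for precise percentiles.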

Prediction of the complete probability distribution: event-by-event unfolding
$f(t \mid \vec{x})$
[Figure: probability density over t, annotated with expectation value, standard deviation (volatility), mode, and deviations from normal distribution, e.g. crash probability]

Conditional probability densities in particle physics
What is the probability density of the true B momentum in this semileptonic B candidate event, taken with the CDF II detector, with these n tracks with those momenta and rapidities in the hemisphere, which are forming this secondary vertex with this decay length and probability, this invariant mass and transverse momentum, this lepton information, this missing transverse momentum, this difference in Phi and Theta between momentum sum and vertex topology, etc.?
$f(t \mid \vec{x})$

<phi-t> NeuroBayes
> is based on 2nd generation neural algorithms, Bayesian regularisation, optimised preprocessing with transformations and decorrelation of input variables and linear correlation to output
> learns extremely fast due to 2nd order methods and 0-iteration mode
> can train with weights and background subtraction
> is extremely robust against outliers
> is immune against learning statistical noise by heart
> tells you if there is nothing relevant to be learned
> delivers sensible prognoses already with small statistics
> has advanced boost and cross validation features
> is steadily further developed
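The decorrelation of input variables mentioned in the first point can be illustrated with a generic whitening step. The sketch below uses a Cholesky factor of the sample covariance matrix. This is one common textbook technique and an assumption here, since the slides do not say which decorrelation NeuroBayes applies. All function names are hypothetical.

```python
import math


def covariance_matrix(rows):
    """Return (means, centred rows, sample covariance) for a list of events."""
    n, d = len(rows), len(rows[0])
    means = [sum(r[j] for r in rows) / n for j in range(d)]
    centred = [[r[j] - means[j] for j in range(d)] for r in rows]
    cov = [[sum(c[i] * c[j] for c in centred) / (n - 1) for j in range(d)]
           for i in range(d)]
    return means, centred, cov


def cholesky(a):
    """Lower-triangular factor `low` with low * low^T = a (a positive definite)."""
    d = len(a)
    low = [[0.0] * d for _ in range(d)]
    for i in range(d):
        for j in range(i + 1):
            s = sum(low[i][k] * low[j][k] for k in range(j))
            if i == j:
                low[i][j] = math.sqrt(a[i][i] - s)
            else:
                low[i][j] = (a[i][j] - s) / low[j][j]
    return low


def decorrelate(rows):
    """Centre the inputs and map them to unit variance and zero correlation."""
    _, centred, cov = covariance_matrix(rows)
    low = cholesky(cov)
    out = []
    for c in centred:
        z = []
        for i in range(len(c)):
            # Forward substitution: solve low * z = c for this event.
            s = sum(low[i][k] * z[k] for k in range(i))
            z.append((c[i] - s) / low[i][i])
        out.append(z)
    return out


# Two strongly correlated toy input variables.
events = [[1.0, 2.0], [2.0, 4.1], [3.0, 5.9], [4.0, 8.2], [5.0, 9.8]]
_, _, cov_after = covariance_matrix(decorrelate(events))
print(cov_after)
```

After the transformation the covariance matrix of the inputs is the identity up to rounding, so later steps see inputs of equal scale without linear correlations.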

Ramler-plot (extended correlation matrix)

Ramler-II-plot (visualize correlation to target)

Visualisation of single input-variables

Visualisation of correlation matrix Variable 1: Training target

Visualisation of network performance: Purity vs. efficiency; Signal efficiency vs. total efficiency (Lift chart)
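The quantities on these plots can be computed from network outputs and true labels for any cut value. The sketch below uses the usual definitions: purity is the signal fraction among selected events, signal efficiency is the fraction of all signal that is selected, and total efficiency is the fraction of all events that is selected. The function name and the toy data are assumptions for illustration.

```python
def selection_metrics(scores, labels, threshold):
    """Purity, signal efficiency and total efficiency for a cut on the output.

    scores: network outputs, labels: 1 for signal and 0 for background.
    """
    selected = [label for score, label in zip(scores, labels) if score >= threshold]
    n_signal = sum(labels)
    n_selected_signal = sum(selected)
    purity = n_selected_signal / len(selected) if selected else 0.0
    signal_efficiency = n_selected_signal / n_signal if n_signal else 0.0
    total_efficiency = len(selected) / len(scores) if scores else 0.0
    return purity, signal_efficiency, total_efficiency


# Toy outputs for six events, three of which are true signal.
scores = [0.9, 0.8, 0.7, 0.4, 0.3, 0.1]
labels = [1, 1, 0, 1, 0, 0]
for cut in (0.2, 0.5, 0.85):
    print(cut, selection_metrics(scores, labels, cut))
```

Scanning the cut and plotting purity against signal efficiency gives the first curve. Plotting signal efficiency against total efficiency gives the lift chart.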

Visualisation of NeuroBayes network topology

Successful applications in High Energy Physics at CDF II:
- Electron ID, muon ID, kaon/proton ID
- Optimisation of resonance reconstruction in many channels (X, Y, D, D_s, D_s**, B, B_s, B**, B_s**)
- Spin parity analysis of X(3872)
- Inclusion of NB output in likelihood fits
- B-tagging for high pt physics (top, Higgs, etc.)
- Single top quark production search
- Higgs search
- B-Flavour Tagging for mixing analyses (new combined tagging)
- B0, B_s lifetime, ΔΓ, mixing, CP violation

Further NeuroBayes applications in HEP: AMS: Particle ID design studies CMS (ongoing) : B-tagging Belle (ongoing) : Continuum suppression H1 (ongoing) : Hadron calorimeter energy resolution optimisation

More than 35 diploma and Ph.D. theses from experiments DELPHI, CDF II, AMS and CMS used NeuroBayes or predecessors very successfully.
Many of these can be found at www.phi-t.de -> Wissenschaft (Science) -> NeuroBayes
Talks about NeuroBayes and applications: www-ekp.physik.uni-karlsruhe.de/~feindt -> Forschung (Research)

Just a few examples: NeuroBayes soft electron identification for CDF II (Ulrich Kerzel, Michael Milnik, M.F.)
Thesis U. Kerzel: on basis of Soft Electron Collection (much more efficient than cut selection or JetNet with same inputs) - after clever preprocessing by hand and careful learning parameter choice this could also be as good as NeuroBayes

Just a few examples NeuroBayes selection

Just a few examples: First observation of B_s1 and most precise measurement of B_s2*. Selection using NeuroBayes

Press (Die Welt, April 21, 2006)

Successful in competition with other data-mining methods
World's largest students' competition: Data-Mining-Cup
2005: Fraud detection in internet trading
2006: Price prediction in eBay auctions
2007: Coupon redemption prediction

Phi-T: Applications of NeuroBayes in Economy
> Medicine and Pharma research: e.g. effects and undesirable effects of drugs, early tumor recognition
> Banks: e.g. credit-scoring (Basel II), finance time series prediction, valuation of derivatives, risk minimised trading strategies, client valuation
> Insurances: e.g. risk and cost prediction for individual clients, probability of contract cancellation, fraud recognition, fairness of tariffs
> Trading chain stores: turnover prognosis for individual articles/stores
Necessary prerequisite: Historic or simulated data must be available!

Individual risk prognoses for car insurances:
- Accident probability
- Cost probability distribution
- Large damage prognosis
- Contract cancellation prob.
very successful at

The injustice of insurance premiums
[Figure: number of customers (Anzahl Kunden) vs. risk/premium (Risiko/Prämie)]
Ratio of the accident risk calculated using NeuroBayes to premium paid (normalised to same total premium sum): The majority of customers (with low risk) are paying too much. Less than half of the customers (with larger risk) do not pay enough, some by far not enough. These are currently subsidised by the more careful customers.
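The normalisation described above can be made concrete with a small sketch: the predicted risks are scaled so that they sum to the total premium actually collected, and each customer's scaled risk is then divided by the premium paid. The function name and the toy numbers are assumptions for illustration, not data from the project.

```python
def risk_to_premium_ratios(risks, premiums):
    """Risk/premium ratio per customer, normalised to the same total premium sum.

    A ratio below 1 means the customer pays more than the predicted risk
    would justify; a ratio above 1 means the customer pays too little.
    """
    if len(risks) != len(premiums):
        raise ValueError("need one risk and one premium per customer")
    # Scale factor that makes the risk-based premiums sum to the real total.
    scale = sum(premiums) / sum(risks)
    return [scale * risk / premium for risk, premium in zip(risks, premiums)]


# Toy portfolio: four customers paying the same premium with unequal risk.
ratios = risk_to_premium_ratios([1.0, 1.0, 2.0, 6.0], [100.0, 100.0, 100.0, 100.0])
print(ratios)
```

In this toy portfolio three of the four customers end up below 1 and one far above it, the same qualitative pattern the slide describes.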


Turnover prognosis for mail order business

Press: Physicists modernise sales management


Prognosis of individual health costs
Pilot project for a large private health insurance
Prognosis of costs in the following year for each person insured, with confidence intervals
4 years of training, test on following year
Results: Probability density for each customer/tariff combination
[Figure: example for customer no. 00000, male, 44, tariff XYZ123 for approx. 17 years]
Very good test results!
Has potential for a real and objective cost reduction in health management

Prognosis of financial markets
NeuroBayes based risk averse market neutral funds for institutional investors
Lupus Alpha NeuroBayes Short Term Trading Fonds (up to now 20 million, aim: 250 million end of 2008)
Press: VDI-Nachrichten, 9.3.2007; Börsenzeitung, 6.2.2008

The <phi-t> mouse game, or: even your "free will" is predictable
www.phi-t.de/mousegame

Bindings and Licenses
NeuroBayes is commercial software. Special rates for public research. Essentially free for high energy physics research.
License files needed, separately for Expert and Teacher. Please contact Phi-T.
NeuroBayes core code written in Fortran. Libraries for many platforms (Linux, Windows, ...) available.
Bindings exist for C++, Fortran, Java, Lisp, etc.
Code generator for easy usage exists for Fortran and C++.
New: Interface to ROOT TMVA available (classification only)
Integration into ROOT planned

Documentation
Basics:
M. Feindt, A Neural Bayesian Estimator for Conditional Probability Densities, E-preprint-archive physics/0402093
M. Feindt, U. Kerzel, The NeuroBayes Neural Network Package, NIM A 559 (2006) 190
Web Sites:
www.phi-t.de (Company web site, German & English)
www.neurobayes.de (English site on physics results with NeuroBayes & all diploma and PhD theses using NeuroBayes)
www-ekp.physik.uni-karlsruhe.de/~feindt (some NeuroBayes talks can be found here under -> Forschung)

At DELPHI I have learned how to extract information and learn from data. With a physicist's pragmatic analytical and quantitative thinking, data-analysis and programming expertise and NeuroBayes, we have achieved a lot in physics and with Phi-T in very different areas of industry and economics.

It is not physics, but the understanding, forecasting and steering of complex systems, with the tools of a physicist. It is (after many years of being a physicist at least) equally fascinating. What is absolutely great: our influence is larger! First years were hard and risky: you have to sell! There is no public money just coming in. But with many very successful projects we now have many contacts to high-ranked people convinced of our capabilities.

In general: HEP community very effective and well-trained. Standard and motivation much higher than in industry. Phi-T, NeuroBayes and its industrial applications are a real spin-off from High Energy Physics with its origin in DELPHI. By the way: there is much to do! Fascinating and challenging projects and permanent positions available. Phi-T will grow!