EECS 433 Statistical Pattern Recognition

Similar documents
A Vision-based Affective Computing System. Jieyu Zhao Ningbo University, China

Introduction to Machine Learning. Katherine Heller Deep Learning Summer School 2018

UNIVERSITY of PENNSYLVANIA CIS 520: Machine Learning Final, Fall 2014

Intelligent Systems. Discriminative Learning. Parts marked by * are optional. WS2013/2014 Carsten Rother, Dmitrij Schlesinger

Gender Based Emotion Recognition using Speech Signals: A Review

UNIVERSITY of PENNSYLVANIA CIS 520: Machine Learning Midterm, 2016

Statistics 202: Data Mining. c Jonathan Taylor. Final review Based in part on slides from textbook, slides of Susan Holmes.

AClass: A Simple, Online Probabilistic Classifier. Vikash K. Mansinghka Computational Cognitive Science Group MIT BCS/CSAIL

Mammogram Analysis: Tumor Classification

Utilizing Posterior Probability for Race-composite Age Estimation

1 Pattern Recognition 2 1

Review: Logistic regression, Gaussian naïve Bayes, linear regression, and their connections

10CS664: PATTERN RECOGNITION QUESTION BANK

Bayesian Networks Representation. Machine Learning 10701/15781 Carlos Guestrin Carnegie Mellon University

Learning from data when all models are wrong

MS&E 226: Small Data

Mammogram Analysis: Tumor Classification

Error Detection based on neural signals

Bayesian Models for Combining Data Across Subjects and Studies in Predictive fmri Data Analysis

Multiclass Classification of Cervical Cancer Tissues by Hidden Markov Model

A Semi-supervised Approach to Perceived Age Prediction from Face Images

COMPARISON BETWEEN GMM-SVM SEQUENCE KERNEL AND GMM: APPLICATION TO SPEECH EMOTION RECOGNITION

Bayesian (Belief) Network Models,

EEG signal classification using Bayes and Naïve Bayes Classifiers and extracted features of Continuous Wavelet Transform

Lecture 13: Finding optimal treatment policies

Introduction to Computational Neuroscience

MACHINE LEARNING ANALYSIS OF PERIPHERAL PHYSIOLOGY FOR EMOTION DETECTION

Video-Based Face Recognition Using Spatio-Temporal Representations

Bayesian Inference Bayes Laplace

Practical Bayesian Optimization of Machine Learning Algorithms. Jasper Snoek, Ryan Adams, Hugo LaRochelle NIPS 2012

Neurons and neural networks II. Hopfield network

Putting Context into. Vision. September 15, Derek Hoiem

Computational Cognitive Neuroscience

Predicting Sleep Using Consumer Wearable Sensing Devices

COMMITMENT. &SOLUTIONS Act like someone s life depends on what we do. UNPARALLELED

ECG Beat Recognition using Principal Components Analysis and Artificial Neural Network

An Improved Algorithm To Predict Recurrence Of Breast Cancer

Machine Learning for Predicting Delayed Onset Trauma Following Ischemic Stroke

Hybrid HMM and HCRF model for sequence classification

THE data used in this project is provided. SEIZURE forecasting systems hold promise. Seizure Prediction from Intracranial EEG Recordings

Video-based Face Recognition Using Exemplar-Driven Bayesian Network Classifier

COMBINATION DEEP BELIEF NETWORKS AND SHALLOW CLASSIFIER FOR SLEEP STAGE CLASSIFICATION.

MORPHOLOGICAL CHARACTERIZATION OF ECG SIGNAL ABNORMALITIES: A NEW APPROACH

Computer Age Statistical Inference. Algorithms, Evidence, and Data Science. BRADLEY EFRON Stanford University, California

OUTLIER DETECTION : A REVIEW

Bayesian Nonparametric Methods for Precision Medicine

NONLINEAR REGRESSION I

Econometrics II - Time Series Analysis

Using the Soundtrack to Classify Videos

TITLE: A Data-Driven Approach to Patient Risk Stratification for Acute Respiratory Distress Syndrome (ARDS)

Outlier Analysis. Lijun Zhang

Georgetown University ECON-616, Fall Macroeconometrics. URL: Office Hours: by appointment

International Journal of Computer Science Trends and Technology (IJCST) Volume 5 Issue 1, Jan Feb 2017

Cognitive Modeling. Lecture 12: Bayesian Inference. Sharon Goldwater. School of Informatics University of Edinburgh

Design of Multi-Class Classifier for Prediction of Diabetes using Linear Support Vector Machine

The Statistical Analysis of Failure Time Data

SPEECH TO TEXT CONVERTER USING GAUSSIAN MIXTURE MODEL(GMM)

Predicting Kidney Cancer Survival from Genomic Data

Deep Learning Analytics for Predicting Prognosis of Acute Myeloid Leukemia with Cytogenetics, Age, and Mutations

A HMM-based Pre-training Approach for Sequential Data

Emotion Recognition using a Cauchy Naive Bayes Classifier

Brain Tumour Detection of MR Image Using Naïve Beyer classifier and Support Vector Machine

Hybridized KNN and SVM for gene expression data classification

Analysis of Emotion Recognition using Facial Expressions, Speech and Multimodal Information

Intelligent Edge Detector Based on Multiple Edge Maps. M. Qasim, W.L. Woon, Z. Aung. Technical Report DNA # May 2012

Advanced Methods and Tools for ECG Data Analysis

Probabilistic Graphical Models: Applications in Biomedicine

Intelligent Control Systems

Noise-Robust Speech Recognition Technologies in Mobile Environments

ISIR: Independent Sliced Inverse Regression

An assistive application identifying emotional state and executing a methodical healing process for depressive individuals.

Empirical Cognitive Modeling

NMF-Density: NMF-Based Breast Density Classifier

Facial expression recognition with spatiotemporal local descriptors

Empirical Mode Decomposition based Feature Extraction Method for the Classification of EEG Signal

Methods for Predicting Type 2 Diabetes

Classification and Statistical Analysis of Auditory FMRI Data Using Linear Discriminative Analysis and Quadratic Discriminative Analysis

Large-Scale Statistical Modelling via Machine Learning Classifiers

Sequential Decision Making

CS343: Artificial Intelligence

Tandem modeling investigations

SUPPLEMENTARY INFORMATION. Table 1 Patient characteristics Preoperative. language testing

Artificial Neural Networks (Ref: Negnevitsky, M. Artificial Intelligence, Chapter 6)

Model-Based fmri Analysis. Will Alexander Dept. of Experimental Psychology Ghent University

Contents. Just Classifier? Rules. Rules: example. Classification Rule Generation for Bioinformatics. Rule Extraction from a trained network

The Role of Face Parts in Gender Recognition

Seizure Prediction Through Clustering and Temporal Analysis of Micro Electrocorticographic Data

ABSTRACT I. INTRODUCTION. Mohd Thousif Ahemad TSKC Faculty Nagarjuna Govt. College(A) Nalgonda, Telangana, India

CSE Introduction to High-Perfomance Deep Learning ImageNet & VGG. Jihyung Kil

SCUOLA DI SPECIALIZZAZIONE IN FISICA MEDICA. Sistemi di Elaborazione dell Informazione. Introduzione. Ruggero Donida Labati

Adaptation of Classification Model for Improving Speech Intelligibility in Noise

Data mining for Obstructive Sleep Apnea Detection. 18 October 2017 Konstantinos Nikolaidis

A FRAMEWORK FOR ACTIVITY-SPECIFIC HUMAN IDENTIFICATION

IJESRT. Scientific Journal Impact Factor: (ISRA), Impact Factor: 1.852

Applied Machine Learning, Lecture 11: Ethical and legal considerations; domain effects and domain adaptation

Semi-Supervised Disentangling of Causal Factors. Sargur N. Srihari

An empirical evaluation of text classification and feature selection methods

Unsupervised Pattern Discovery in Sparsely Sampled Clinical Time Series

Electrocardiogram beat classification using Discrete Wavelet Transform, higher order statistics and multivariate analysis

Transcription:

EECS 433 Statistical Pattern Recognition Ying Wu Electrical Engineering and Computer Science Northwestern University Evanston, IL 60208 http://www.eecs.northwestern.edu/~yingwu 1 / 19

Outline What is Pattern Recognition? Applications Supervised and Unsupervised Schemes Bayesian and Non-Bayesian Approaches Descriptive, Discriminative and Generative Models This Course 2 / 19

What are Patterns? Random vectors x R n they are static Random processes 1-D signals s(t), e.g., time series 2-D signals I (u, v), e.g., images 3-D signals V (u, v, t), e.g., video they are dynamic What is a pattern? a prototype with variations its polymorphism is reflected by its variations the variation has a probabilistic distribution for random vector data, it simply is its distribution for random process data, it is very difficult! E.g., speech, music, ECG image, microarray video 3 / 19

How Do We Represent Patterns? Using templates and rules is far from enough as a pattern is likely to exhibit large variations thus, a critical issue is to model its variations i.e., learning from the data this is clear for patterns of random vector data and this is the center problem in classical statistical pattern recognition parametric or non-parametric Bayesian or non-bayesian but this is still not quite clear for random process data representing these data is quite challenging features and descriptive models generative models pattern theory 4 / 19

Three Basic Problems in Statistical Pattern Recognition Let s denote the data by x. There are three basic problems in statistical pattern recognition: Classification f : x C, where C is a discrete set Regression f : x y, where y R a continuous space Density estimation model p(x) that is the probabilistic density of x 5 / 19

Feature features can be regarded as the descriptors of the pattern they are represented as vectors in this sense, a pattern can be represented by a high dimensional random vector curse of dimensionality features are extracted from raw data weak features and strong features domain knowledge features can also be learned linear combination optimal projection, e.g., PCA, LDA sequential selection, e.g., boosting nonlinear embedding dimension reduction, e.g., ISOMAP, LLE, MDS manifold learning, e.g., nonlinear PCA 6 / 19

Components in Pattern Recognition Systems Feature extraction Feature selection Recognizer Training raw data feature extraction feature selection pattern recognition decision training 7 / 19

Outline What is Pattern Recognition? Applications Supervised and Unsupervised Schemes Bayesian and Non-Bayesian Approaches Descriptive, Discriminative and Generative Models This Course 8 / 19

Applications Speech recognition Optical character recognition Image-based object recognition Image-based biometrics Face recognition/identification Gender/age recognition Gait recognition Fingerprint recognition Iris recognition Expression and emotion recognition Gesture and action recognition Video-based activity recognition Video-based event recognition and abnormality detection... 9 / 19

Outline What is Pattern Recognition? Applications Supervised and Unsupervised Schemes Bayesian and Non-Bayesian Approaches Descriptive, Discriminative and Generative Models This Course 10 / 19

Supervised and Unsupervised Schemes To train a classifier from data, there are two typical training schemes: Supervised learning the training data has supervision, i.e., input-output pair given a set of {xk, c k } N k=1, to train the mapping f : x C the labeling acts as the teacher e.g., LDA, SVM Unsupervised learning no teacher, only a set of data {xk } N k=1 needs to find the clustering or grouping of the data e.g., most clustering methods it is difficult to determine the number of clusters for training minimum description length (MDL) principle 11 / 19

Other Schemes Besides supervised and unsupervised learning schemes, there are some others: Semi-supervised learning a mixture of labeled and unlabeled data, {xk } N1 k=1 {xk, c k } N2 only critical and informative data need to be labeled what are those? e.g., active learning Reinforcement learning it has a teacher. More precisely, it is a critic but the critic only tells right or wrong, but not how such binary feedbacks are used to improve learning k=1 12 / 19

Outline What is Pattern Recognition? Applications Supervised and Unsupervised Schemes Bayesian and Non-Bayesian Approaches Descriptive, Discriminative and Generative Models This Course 13 / 19

Bayesian and Non-Bayesian Approaches Bayesian approach models likelihood p(x C) and prior p(c) distributions computes posterior p(c x) based on Bayesian decision theory and Bayes risk C R(α k x) = λ(α k c j )p(c j x) leads to minimum-error-rate classification Bayesian learning (ML estimation and Bayesian estimation) Non-Bayesian approach empirical risk and overfitting minimizing empirical risk maximizing margin minimizing training error maximizing the generalizability a great success of support vector machines j=1 14 / 19

Outline What is Pattern Recognition? Applications Supervised and Unsupervised Schemes Bayesian and Non-Bayesian Approaches Descriptive, Discriminative and Generative Models This Course 15 / 19

Descriptive, Discriminative and Generative Models Descriptive models features are derived from data features are descriptive it gives the likelihood of the observed data e.g., PCA Discriminative models features are derived from data but they are discriminative to differentiate from other classes it concentrates on the differences from others e.g., LDA, SVM Generative models explicitly models the formation or generation of the data based on which the likelihood model is formed no features derived from data directly it can be used to synthesize data. Neither descriptive nor discriminative model can e.g., Gaussian mixture models, HMM, MRF 16 / 19

Outline What is Pattern Recognition? Applications Supervised and Unsupervised Schemes Bayesian and Non-Bayesian Approaches Descriptive, Discriminative and Generative Models This Course 17 / 19

This Course In this course, we ll cover the following topics: W-1: Bayesian decision and Bayesian classification, PCA/LDA W-2: ICA, Nearest neighbor classifiers W-3: Nonparametric density estimation, and linear discriminative models W-4: SVM and Kernel machines W-5: Feature selection and boosting W-6: EM, spectral clustering, sparsity models W-7: Metric learning, Deep neural networks, Dimension reduction and embedding W-8/9: Generative models Bayesian networks (BN) and dynamic Bayesian nets (DBN) Markov random field (MRF) Conditional random field (CRF) 18 / 19

This Course Textbook Pattern Classification, by R. Duda, P. Hart and D. Stork, 2nd Ed., Wiley-Interscience, 2001 Selected recent research papers Lecture notes No midterm exam. Final projects small group projects ( group 2) suggestion: find a smaller topic and write a good comprehensive survey 30-min talk and 20-page report 19 / 19