Parsing Discourse Relations. Giuseppe Riccardi Signals and Interactive Systems Lab University of Trento, Italy

Similar documents
Generalizing Dependency Features for Opinion Mining

Exemplars in Syntax: Evidence from Priming

Outline. Teager Energy and Modulation Features for Speech Applications. Dept. of ECE Technical Univ. of Crete

Modeling the Use of Space for Pointing in American Sign Language Animation

Using a grammar implementation to teach writing skills

Foundations of Natural Language Processing Lecture 13 Heads, Dependency parsing

Referring Expressions & Alternate Views of Summarization. Ling 573 Systems and Applications May 24, 2016

Semantic Structure of the Indian Sign Language

The Effect of Sensor Errors in Situated Human-Computer Dialogue

Sentiment Analysis of Reviews: Should we analyze writer intentions or reader perceptions?

Learning the Fine-Grained Information Status of Discourse Entities

Text Mining of Patient Demographics and Diagnoses from Psychiatric Assessments

M.Sc. in Cognitive Systems. Model Curriculum

Committee-based Decision Making in Probabilistic Partial Parsing

Irit Meir, Carol Padden. Emergence of Language Structures Workshop UCSD, February 6, 2007

WikiWarsDE: A German Corpus of Narratives Annotated with Temporal Expressions

Extracting Opinion Targets in a Single- and Cross-Domain Setting with Conditional Random Fields

Running Head: AUTOMATED SCORING OF CONSTRUCTED RESPONSE ITEMS. Contract grant sponsor: National Science Foundation; Contract grant number:

Today we will... Foundations of Natural Language Processing Lecture 13 Heads, Dependency parsing. Evaluating parse accuracy. Evaluating parse accuracy

ASL 102 American Sign Language II (4) Second in a related series of courses that focus on the use and study of ASL. This course

Combining unsupervised and supervised methods for PP attachment disambiguation

Information Extraction

Discourse Level Opinion Relations: An Annotation Study

Lecture 10: POS Tagging Review. LING 1330/2330: Introduction to Computational Linguistics Na-Rae Han

Exploiting Ordinality in Predicting Star Reviews

Joint Inference for Heterogeneous Dependency Parsing

COMBINING CATEGORICAL AND PRIMITIVES-BASED EMOTION RECOGNITION. University of Southern California (USC), Los Angeles, CA, USA

Signals from Text: Sentiment, Intent, Emotion, Deception

Extraction of Adverse Drug Effects from Clinical Records

Chapter 12 Conclusions and Outlook

OVERVIEW TUTORIAL BEHAVIORAL METHODS CLAIM: EMLAR VII EYE TRACKING: READING. Lecture (50 min) Short break (10 min) Computer Assignments (30 min)

Connecting Distant Entities with Induction through Conditional Random Fields for Named Entity Recognition: Precursor-Induced CRF

Character-based Embedding Models and Reranking Strategies for Understanding Natural Language Meal Descriptions

May All Your Wishes Come True: A Study of Wishes and How to Recognize Them

Stylometric Text Analysis for Dutch-speaking Adolescents with Autism Spectrum Disorder

UNIVERSITY of PENNSYLVANIA CIS 520: Machine Learning Final, Fall 2014

Open Architectures and Community Building. Professor Joakim Gustafson. Head of Department Department of Speech, Music and Hearing KTH, Sweden

Understanding the Dynamics of Knowledge

An Avatar-Based Weather Forecast Sign Language System for the Hearing-Impaired

Clinical Coreference Annotation Guidelines (with excerpts from ODIE guidelines and modified for SHARP) Arrick Lanfranchi and Kevin Crooks

Semi-Automatic Construction of Thyroid Cancer Intervention Corpus from Biomedical Abstracts

Introduction to Sentiment Analysis

A Web Tool for Building Parallel Corpora of Spoken and Sign Languages

Thursday, July 14, Monotonicity

Captioning Your Video Using YouTube Online Accessibility Series

EMOTION DETECTION FROM TEXT DOCUMENTS

Syntax and Semantics of Korean Numeral Classifier Constructions

CPSC81 Final Paper: Facial Expression Recognition Using CNNs

Overt Prosody in English as a Function of Working Memory. Van Rynald Liceralde Saint Louis University SREBCS 2013 Mentor: Fernanda Ferreira, PhD

Speech Group, Media Laboratory

Sentiment Classification of Chinese Reviews in Different Domain: A Comparative Study

Kalpana Raja, PhD 1, Andrew J Sauer, MD 2,3, Ravi P Garg, MSc 1, Melanie R Klerer 1, Siddhartha R Jonnalagadda, PhD 1*

Processing MWEs: Neurocognitive Bases of Verbal MWEs and Lexical Cohesiveness within MWEs

Intelligent Machines That Act Rationally. Hang Li Bytedance AI Lab

Building Evaluation Scales for NLP using Item Response Theory

Centroid-Based Exemplar Selection of ASL Non-Manual Expressions using Multidimensional Dynamic Time Warping and MPEG4 Features

World Languages American Sign Language (ASL) Subject Matter Requirements

Schema-Driven Relationship Extraction from Unstructured Text

An Intelligent Writing Assistant Module for Narrative Clinical Records based on Named Entity Recognition and Similarity Computation

The Emergence of Grammar: Systematic Structure in a New Language. Article Summary

Evaluation & Systems. Ling573 Systems & Applications April 9, 2015

Identifying Adverse Drug Events from Patient Social Media: A Case Study for Diabetes

Guidelines for Effective Usage of Text Highlighting Techniques

Understanding CELF-5 Reliability & Validity to Improve Diagnostic Decisions

Sign Language Automation

Investigating the Reliability of Classroom Observation Protocols: The Case of PLATO. M. Ken Cor Stanford University School of Education.

FINAL REPORT Measuring Semantic Relatedness using a Medical Taxonomy. Siddharth Patwardhan. August 2003

Extraction of Regulatory Events using Kernel-based Classifiers and Distant Supervision

Distillation of Knowledge from the Research Literatures on Alzheimer s Dementia

S URVEY ON EMOTION CLASSIFICATION USING SEMANTIC WEB

Is Phonology Necessary for Language?

ATLAS Automatic Translation Into Sign Languages

1. INTRODUCTION. Vision based Multi-feature HGR Algorithms for HCI using ISL Page 1

Nature Neuroscience: doi: /nn Supplementary Figure 1. Overlap between default mode network (DMN) and movie/recall maps.

A Simple Pipeline Application for Identifying and Negating SNOMED CT in Free Text

Advanced Natural Language Processing

Semi-formal Evaluation of Conversational Characters

PDF hosted at the Radboud Repository of the Radboud University Nijmegen

Sign Language MT. Sara Morrissey

TITLE: Acquisition and generalization responses in aphasia treatment: Evidence from sentence-production treatment

1 Pattern Recognition 2 1

Large Scale Analysis of Health Communications on the Social Web. Michael J. Paul Johns Hopkins University

Extracting geographic locations from the literature for virus phylogeography using supervised and distant supervision methods

The Role of Representation in the Interpretation of Representational Noun Phrases

Analyzing the structure of parent-moderated narratives from children with ASD using an entity-based approach

Textual Emotion Processing From Event Analysis

DOMAIN BOUNDED ENGLISH TO INDIAN SIGN LANGUAGE TRANSLATION MODEL

Playing the Telephone Game: Determining the Hierarchical Structure of Perspective and Speech Expressions

Lecturer: T. J. Hazen. Handling variability in acoustic conditions. Computing and applying confidence scores

Initial coordination and the Law of Coordination of Likes *

1. Introduction and Background

Automated Conversion of Text Instructions to Human Motion Animation

Deep Learning based Information Extraction Framework on Chinese Electronic Health Records

A Smart Texting System For Android Mobile Users

Textual Entailment. Arindam Bhattacharya. M.Tech, Computer Science Indian Institute of Technology, Bombay. November 9, 2011

LIGN171: Child Language Acquisition Developmental Disorders affecting language

defying complexity (lessons learned)

Task-oriented Dependency Parsing Evaluation Methodology

UKParl: A Data Set for Topic Detection with Semantically Annotated Text

Emotion Recognition Modulating the Behavior of Intelligent Systems

Transcription:

Parsing Discourse Relations Giuseppe Riccardi Signals and Interactive Systems Lab University of Trento, Italy

Behavioral Analytics

Parser Run on Genia Corpus Among 25 cases, 2 homozygous deletions and 1 hemizygous deletion were found in HCC samples. No point mutation was identified in the remaining 22 tumor samples without p16 gene deletions. Hypermethylation was detected in 24% (6/25) of tumor samples. However, the corresponding non-tumor liver tissue specimens were always unmethylated at the p16 locus. Loss of p16 protein expression occurred in 16 of 35 (45.7%) tumor samples, and all the non-tumor liver tissue specimens showed positive p16 staining. For the 25 cases examined for p16 gene alterations, the loss of p16 protein expression was observed in all tumors with p16 gene alterations and also in 3 tumors without p16 gene alterations. (Source: Genia corpus)

Parser Run on Genia Corpus Among 25 cases, 2 homozygous deletions and 1 hemizygous deletion were found in HCC samples. No point mutation was identified in the remaining 22 tumor samples without p16 gene deletions. Hypermethylation was detected in 24% (6/25) of tumor samples. However, the corresponding non-tumor liver tissue specimens were always unmethylated at the p16 locus. Loss of p16 protein expression occurred in 16 of 35 (45.7%) tumor samples, and all the non-tumor liver tissue specimens showed positive p16 staining. For the 25 cases examined for p16 gene alterations, the loss of p16 protein expression was observed in all tumors with p16 gene alterations and also in 3 tumors without p16 gene alterations. (Source: Genia corpus) Parser Output : Hypermethylation was detected in 24 % 6\/25 ) of tumor samples However(Comparison) the corresponding non-tumor liver tissue specimens were always unmethylated at the p16 locus Loss of p16 protein expression occurred in 16 of 35 45.7 % ) tumor samples and(expansion ) all the non-tumor liver tissue specimens showed positive p16 staining

Social Media User Opinions: Negative The acting is below average, even from the likes of Curtis. You're more likely to get a kick out of her work in Halloween H20. Sutherland is wasted and Baldwin, well, he's acting like a Baldwin, of course. The real star here are Stan Winston's robot design, some schnazzy CGI, and the occasional good gore shot, like picking into someone's brain. So, if robots and body parts really turn you on, here's your movie. Otherwise, it's pretty much a sunken ship of a movie. 5/1/12 5

Social Media User Opinions: Positive From here on, the plot takes a back seat, and we are treated to some of the best camera work and action staged. Most all the action is plausible and will hold you at the edge of your seat. There are a few melodramatic parts here, but, they tend to work out well. There is no general antagonist in this film, but the action and suspense makes you forget all about that. Daylight is a great film, I saw a non-matinee showing of it, and I thought it was worth every penny. The characterizations are mostly flat, one dimesional, but they have enough in them to get you to care for some of the characters. Rob Cohen (Dragonheart) does a great job with this film. 5/1/12 6

Discourse Relation Parsing Joint work with Sucheta Ghosh, U. Trento Richard Johansson, U. Trento/U. Gothenburg Sara Tonelli, FBK-Irst Ghosh S., Tonelli S., Riccardi G. and Johansson R., End-to-End Discourse Parser Evaluation, IEEE International Conference on Semantic Computing, Menlo Park, USA, 2011 Ghosh S., Johansson R., Riccardi G. and Tonelli S., Shallow Discourse Parsing with Conditional Random Fields, International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, 2011 Ghosh S., Johansson R., Riccardi G. and Tonelli S., Improving Recall Through Global Constraint Selection, To appear on LREC, 2012 Giuseppe Riccardi

Discourse Parser From raw text extract: Discourse relations: Discourse Predicate (Connective) Connective sense Arg1 Arg2 Explicit Connective Giuseppe Riccardi

Parsing Architecture

Parser end2end Architecture Doc Parser Stanford (K&M) Parse_Tree Chunklink AddDiscourse RootExtract +Morpha By Sabaine Buchholz CoNLL 00 task Pitler & Nenkova 09 Conn. SenseDet. Morph & All Feat Johansson+ Minnen et al Windowing (-2,+2) Arg2 Arg1

Features: Example

Selected Features: Arg1 Features used for Arg1 and Arg2 segmentation and labeling. F1. Token (T) F2. Sense of Connective (CONN) F3. IOB chain (IOB) F4. PoS tag F5. Lemma (L) F6. Inflection (INFL) F7. Main verb of main clause (MV) F8. Boolean feature for MV (BMV) Additional feature used only for Arg1 F9. Previous Sentence (PREV) F10. Arg2 Labels

Inter vs Intra Sentence Arguments Illustration: PREV Feature This &ilm should be brilliant. Howeverr, it can t hold up. 13

Inter vs Intra Sentence Arguments Illustration: PREV Feature However However However However However 0 0 0 0 0 This &ilm should be brilliant. Howeverr, it can t hold up. 14

Inter vs Intra Sentence Arguments 0.9 0.8 0.7 0.6 0.5 0.4 0.3 0.2 0.1 0 0.77 0.68 0.61 0.52 0.36 0.27 P R F1 Intra+Prev Inter+Prev - PREV +PREV Illustration: PREV Feature However However However However However 0 0 0 0 0 This &ilm should be brilliant. Howeverr, it can t hold up. 15

Selected Features: Arg2 Features used for Arg1 and Arg2 segmentation and labeling. F1. Token (T) F2. Sense of Connective (CONN) F3. IOB chain (IOB) Ghosh S., Johansson R., Riccardi G. and Tonelli S., Shallow Discourse Parsing with Conditional Random Fields, International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, 2011

Parser Evaluation Giuseppe Riccardi 17

Parser Evaluation Giuseppe Riccardi 18

Lightweight Features -Reduce dimensionality of IOB chain features -Control robustness of parser (wrt to syntactic parse) -Binary features selected from IOB chain Giuseppe Riccardi, UNITN 19

Lightweight Features IOB Chain feature replaced by two pairs of Boolean features (1) The second top parent node whether starting (B) or not (2) The third top parent node whether starting (B) or not (3) The second top parent node whether ending (E) or not (4) The third top parent node whether ending (E) or not Example: Tree diagram showed IOB feature for token flashed is I-S/E-VP/E-SBAR/E-S/C-VP Replacing Boolean feature for flashed respectively: (1) 0 ( ß E-VP ) (2) 0 ( ß E-SBAR ) (3) 1 ( ß E-VP ) (4) 1 ( ß E-SBAR ) Giuseppe Riccardi, UNITN 20

Parser Evaluation: Arg2 Exact Match P R F1 Baseline 0.53 0.46 0.49 Gold - Standard 0.84 0.74 0.79 Gold-Lightweight 0.80 0.74 0.77 AutoConn+GoldSPT 0.82 0.70 0.76 GoldConn+AutoSPT 0.76 0.61 0.68 Lightweight(Auto) 0.72 0.56 0.63 Giuseppe Riccardi 21

N-Best Parse Re-ranking Ø Online Passive-Aggressive Perceptron Ø Structured Voted Perceptron Ø Linear Preference Learning Support Vector Machine Ø Linear Best vs. Rest Support Vector Machine End2End Disc Parse 22

N-Best ReRanking with Global Constraints Ø GF0. Log Posteriors Ø GF1. Overgeneration. Ø GF2. Undergeneration. Ø GF3. Intersentential Arg2. Ø GF4. Arg1 after the connective sentence Ø GF5. Argument overlapping with the connective. Ø GF6. Argument begins with I- tag Ø GF7. Argument begins with E- tag End2End Disc Parse 23

N-Best ReRanking with Global Constraints Exact Arg1 Arg2 P R F1 P R F1 Baseline 69.88 48.51 57.26 83.44 75.14 79.07 Online PA 66.10 53.92 59.39(16) 82.59 76.39 79.37(4) Struct Per 67.18 52.64 59.03(4) 82.96 76.28 79.48(8) Bestvs Rest 66.19 52.83 58.94(8) 81.69 77.14 79.35(4) Pref-Linear 66.54 53.31 59.20(4) 82.82 76.28 79.42(4) Exact Match Scores. Used n- best list numbers in parenthesis End2End Disc Parse 24

Research Challenges Speech, Dialog and Discourse Speech Signal vs Linguistic correlates Eat your porridge! You re not going to football practice Parser Trade-off btw coverage and agreement Robustness of features Semantic Annotation Domain/Genre Adaptation

Research Challenges Speech, Dialog and Discourse Acoustics vs lexical correlates Eat your porridge! You re not going to football practice Parser Trade-off amongst sense-depth, coverage, agreement Robustness of features Semantic Annotation Domain/Genre Adaptation

Publications Speech (LUNA Corpus) Tonelli S., Riccardi G., Prasad R. and Joshi A. "Annotation of Discourse Relations for Conversational Spoken Dialogs", LREC Valletta, 2010. Text (PDTB corpus) Ghosh S., Johansson R., Riccardi G. and Tonelli S., Shallow Discourse Parsing with Conditional Random Fields, International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, 2011 Ghosh S., Tonelli S., Riccardi G. and Johansson R., End-to-End Discourse Parser Evaluation, IEEE International Conference on Semantic Computing, Menlo Park, USA, 2011 Giuseppe Riccardi 27