Automatic Identification & Classification of Surgical Margin Status from Pathology Reports Following Prostate Cancer Surgery

Size: px
Start display at page:

Download "Automatic Identification & Classification of Surgical Margin Status from Pathology Reports Following Prostate Cancer Surgery"

Transcription

1 Automatic Identification & Classification of Surgical Margin Status from Pathology Reports Following Prostate Cancer Surgery Leonard W. D Avolio MS a,b, Mark S. Litwin MD c, Selwyn O. Rogers Jr. MD, MPH d, Alex A. T. Bui PhD a a Medical Imaging Informatics Group, University of California, Los Angeles CA b Department of Information Studies, University of California, Los Angeles CA c Departments of Urology and Health Services, University of California, Los Angeles CA d Center for Surgery and Public Health, Brigham and Women s Hospital, Boston MA Abstract Prostate cancer removal surgeries that result in tumor found at the surgical margin, otherwise known as a positive surgical margin, have a significantly higher chance of biochemical recurrence and clinical progression. To support clinical outcomes assessment a system was designed to automatically identify, extract, and classify key phrases from pathology reports describing this outcome. Heuristics and boundary detection were used to extract phrases. Phrases were then classified using support vector machines into one of three classes: positive (involved) margins, negative (uninvolved) margins, and not-applicable or definitive. A total of 851 key phrases were extracted from a sample of 782 reports produced between 1996 and 2006 from two major hospitals. Despite differences in reporting style, at least 1 containing a diagnosis was extracted from 780 of the 782 reports (99.74%). Of the 851 s extracted, 97.3% contained diagnoses. Overall accuracy of automated classification of extracted s into the three categories was 97.18%. Introduction According to the American Cancer Society, there were an estimated 234,460 new cases of prostate cancer in the United States in 2006 [1]. A prevalent treatment for prostate cancer is removal of the prostate and the surrounding lymph nodes in a procedure known as a radical retropubic prostatectomy (RRP). One unintended consequence of RRP is a positive surgical margin, which results from incising inadvertently into the prostate or incising into the extraprostatic tumor [2]. Positive margins are correlated with decreased cancer-specific and overall survival and a 2 to 4 times greater chance of biochemical cancer recurrence [3]. Identifying the number of patients with positive surgical margin is a requisite step toward improving RRP outcomes. Unfortunately, surgical margin status is largely reported in non-standardized, free text form within the associated pathology reports. As a result, margin status must be manually abstracted from these documents, a timely process. The Clinical Outcomes Assessment Tool (COAT) is being designed as both an interface and Java API to facilitate medical records-based clinical outcomes assessment. As part of an ongoing study in the urological domain, COAT is being used to automatically identify phrases in pathology reports that reference the margin status of a prostate specimen. Once identified, these phrases (herein referred to as margin s) are automatically classified into one of three classes: 1) positive (involved) margin; 2) negative (uninvolved) margin; and 3) not applicable or definitive enough to diagnose margin status. We report an early evaluation of its ability to automatically identify and to classify margin s from a large sample of 782 free-text pathology reports taken from two hospitals. Background Reporting of margin status appears in the pathology report, along with other key prognostic parameters as set forth by the College of American Pathologists (CAP) prostate cancer protocol, including pathologic TNM stage (tumor staging) and Gleason score (histological analysis of tumor). Unlike the other CAP measures, which typically appear in a standard format (e.g., T2bN0Mx for TNM stage and 3+3=6 for Gleason score), margin status has no standard format for representation and often appears in or phrase form. Examples of typical margin s extracted from pathology reports in the sample are provided in Table 1. Positive surgical margin surgical margins involved at right apex base margin positive focal left tumor is present focally at the margin of resection Negative surgical margin no tumor present at the soft tissue resection margin no carcinoma is present at the inked margin Table 1. Examples of margin status references The challenge of extracting information of interest from the clinical record has been approached with a variety of techniques. Robust natural language processing (NLP) technologies such as MedLEE [4] and MetaMap/MMTx [5] employ comprehensive medical dictionaries and part-of-speech taggers to structure clinical data. A few examples of successful clinical applications of such systems include formatting the AMIA 2007 Symposium Proceedings Page - 160

2 contents of radiology reports [6], identifying candidates for clinical trials [7], and detection public health outbreaks [8]. Full NLP systems are not always necessary for identifying a fewer number of predetermined lexical targets. In many cases, fast and lightweight information extraction systems have been used to capitalize on the relative consistency of the appearance of specific clinical values in medical reports. These approaches typically employ patternrecognition techniques such as regular expression matching [9]. Several systems have combined extracted text features with machine learning algorithms to classify clinical documents or clinical values of interest within the documents [10]. One particular machine learning technique that has proven successful in classification of text features is support vector machines (SVMs) [11]. SVMs are linear classifiers that attempt to find a hyperplane that maximizes the margin between two different classes of instances. SVMs have been used in the clinical domain for several NLP-related tasks including document classification [12] and complex concept identification in radiology reports [13]. The availability of downloadable application programming interfaces (APIs) for various components required for medical language processing have made it possible for researchers to assemble pipelines of functionalities to accomplish specific information extraction tasks. For example, the Cancer Biomedical Informatics Grid s (cabig) text extraction tool ca- TIES [14] and Zeng et al. s HITEx [15] feature language tools from the General Architecture for Text Engineering (GATE) [16]. While caties uses the NLM s MetaMap/MMTx for medical concept mapping, HITEx implements only portions of the Unified Medical Language System [17] relevant to its tasks. The Clinical Outcomes Assessment Tool (COAT) is similarly designed, assembling a suite of functionalities specific to the task of facilitating records-based clinical outcomes assessment. It features a Java API of generalizable text processing facilities (record import, tokenization, regular expression matching, etc.), integration with MetaMap/MMTx for concept mapping, and Weka s API for data and text mining [18]. These features are combined in an interface created for outcomes researchers to manipulate, manage, visualize, and analyze clinical data for outcomes assessment. COAT is being developed at UCLA in cooperation with the Center for Surgery and Public Health at Brigham and Women s Hospital (BWH). In this current study, COAT was extended to identify and classify margin s in cooperation with urologists at both UCLA and BWH. Methods The goal of this study was to facilitate RRP outcomes assessment by identifying key phrases which describe the margin status of patients from pathology reports. Once extracted, a classification algorithm was applied to automatically classify the extracted s. A consideration in this study was how best to balance the level of customization needed to achieve an acceptable level of performance in identifying and classifying margin s, relative to the extensibility of this technique to encompass different institutions. To explore this issue, a sample of pathology reports from two major teaching hospitals, UCLA Medical Center and Brigham and Women s Hospital, was collected for analysis and testing. Data: A corpus of 787 pathology reports for patients having undergone RRPs was randomly selected from BWH (n = 456) and the UCLA Medical Center (n = 331). The pathology reports selected from BWH were drawn from RRP surgeries conducted at BWH between Jan. 1, 1996 and June 1, The UCLA pathology reports were drawn from surgeries performed between Jan. 1, 1998 and June 1, The population of patients having RRP procedures at both institutions during these time periods was approximately A total of 5 reports (.6% of sample) were excluded from the sample for being inaccurately coded as prostate cancer or for not featuring some reference to the surgical margin status as required by the American College of Surgeons Commission on Cancer [19]. As a result, the final number of reports comprising the sample was 453 from BWH and 329 from UCLA for a total sample of 782 reports. To conduct this study, IRB approvals were obtained from both institutions. Heuristics Design: The first step in designing a method to automatically identify and extract margin s was an exploratory pilot analysis of the pathology reports. A random sample of 30 pathology reports was taken from each institution (60 total) to manually identify potential consistencies and differences in the description of margin status. Reports from both BWH and UCLA feature a summary section in which margin status is described in narrative form, along with other CAP prognostic parameters (tumor stage, Gleason score, etc.). Most UCLA reports also included a Microscopic Examination section that featured semi-structured text describing margin status and other analytic measures. For example: Surgical Margin: Less than 1 mm from margin These features were incorporated in the design of an identification and extraction algorithm. A graphical overview of the pipeline used is provided in Figure 1. Identification & Extraction of Margin Sentences: Sentence boundary detection was used to break up the reports and all text converted to lowercase. Based on the pilot study results a pair of two, two-word combinations appearing consistently in the corpus was used to flag potential s of interest. This set of trigger rules was supplemented with three additional combinations after iterating through the sample and logging misses, for a total of five trigger rules. Table 2 shows the rules used for extracting s describing margin status that were applied. AMIA 2007 Symposium Proceedings Page - 161

3 First Run Margin Sentence Rules resection and (margin or margins) surgical and (margin or margins) Margin Sentence Rules Added apical and (margin or margins) tumor and (margin or margins) carcinoma and (margin or margins) Table 2. Rules used for extracting margin s To capitalize on the consistency of the semistructured Microscopic Exam section in UCLA reports, the extraction algorithm was designed to make an initial pass on UCLA reports to identify margin status in this section using regular expression matching. The expression matched case-insensitive appearances of the word margin or margins followed by a colon and the word or words that followed. Regular expression: (?i)\\w+\\s*\\w*((margin)s*(:)\\s*\\w*\\w+) implementation of an SVM classifier that used a polynomial kernel function and sequential minimal optimization (SMO) for training [20]. The performance of the classifier was evaluated using a 10-fold cross-validation. BWH Text Docs File import Text cleaner Document Object UCLA XML Docs XML parsed Data structure for medical records In the case of a match, a modified boundary detection algorithm was used to accommodate the bulleted-style of text featured in the Microscopic Exam section. It captured all text between double line breaks when the semi-structured margin regular expression was matched. This initial pass to capture semi-structured margin status in UCLA reports was the only variation in handling data from one hospital versus the other. The IDs of reports for which no match was found were logged and these reports were manually reviewed for causes of failures. Classification of Margin Sentences: The s extracted from the sample of 782 reports were manually classified into one of three classes by the author (LWD) based a review of the literature and consultation with urologists to create a training set and gold standard for classification. All s were classified into one of three categories: 1) positive (involved) surgical margin, 2) negative (uninvolved) surgical margin, and 3) not applicable or no explicit diagnosis. This third category was used to classify s extracted with no relevance to margin status, as well as s in which the diagnosis could not be definitively determined (i.e., false positives). Examples of category three s from the sample are provided in Table 3. the apical and basal margins are amputated and fixed separately note benign prostate glands are focally present at an inked resection margin Table 3. Examples of category 3 (false positive) s Once classified, the s extracted from the reports were tokenized into vectors of lowercase words. A feature vector was created from all unique token appearances. Vectors of tokens appearing in classified s, as well as their assigned category were also created. COAT was integrated with Weka s Java API and the vectors were passed to an End Miss logged N RegEx matcher Break content into s Meets Criteria? Has semistructured section Has regular expression match? Narrow boundaries Text cleaner Custom pipeline assembled for margin extraction N (BWH match) N (UCLA w/o semi-structure match) Final margin (UCLA with semistructure match) Figure 1. Workflow and classes for identifying and extracting margin s Results Identification of Margin Sentences: At least one potential margin was extracted for 780 of the 782 documents. Extrapolated to the collection of approximately 3480 pathology reports, at least one describing margin status should be extracted in 99.74% of reports (C.I %, 99.92% with 95% confidence). The s describing margin status that were missed are listed below in Table 4. Tumor is within 0.1 cm of the ink on both sides. Margins, negative. Table 4. Missed s describing margin status For the 780 documents from which s were extracted, 851 potential s were identified. Of AMIA 2007 Symposium Proceedings Page - 162

4 the 851 s extracted, there were 23 false positives (category 3) for a extraction precision of 97.3%. Of the 23 false positive s, 22 were from BWH, and 1 from UCLA. Examples of s falsely considered to be describing surgical margin status are featured in the previous Table 3. classified as Category 1 (positive) Category 2 (negative) Category 3 (NA) Category (97%) 4 (3%) 0 (0%) Category 2 8 (1.1%) 688 (98.7%) 1 (.1%) Category 3 1 (4.3%) 10 (43.5%) 12 (52.2%) Table 9. Confusion matrix of classification results Each document yielding a false positive also produced at least 1 true positive. In other words, all of the 780 documents from which s were identified produced at least one true positive (category 1 or 2) from which a diagnosis of margin status could be made. The results of margin extraction are provided in Table 5. Reports RRP path reports with diagnosis of carcinoma of the prostate 782 Reports from which true positive (category 1 or 2) s were extracted 780 (99.74%) Sentences Extracted Number of s extracted 851 Positive margin s (category 1) 131 Negative margin s (category 2) 697 Not applicable or definitive (category 3) 23 Number of true positives (precision) 828 (97.3%) Distribution of False Positives BWH false positive s 22 UCLA false positive 1 Table 5. Margin extraction results Classification of Margin Sentences: The SVM classifier correctly classified 827 of 851 s for an overall accuracy of 97.18%. Sensitivity and specificity for the three categories was as follows; category 1 (positive margin) = 96.95%, 98.77%, category 2 (negative margin) = 98.71%, 91.67, and category 3 (not applicable to or definitive of margin status) = 52.17%, 99.88%. Summaries of the overall classification results, sensitivity and specificity, area under receiver operator curves (ROC), and a confusion matrix are provided in the tables below. Total Number of Sentences 851 Correctly Classified Sentences 827 (97.19%) Incorrectly Classified Sentences 24 (2.82%) Table 6. Overall classification accuracy Sensitivity Specificity Category % 98.77% (positive margin) Category 2 (negative margin) 98.71% 91.67% Category 3 (not applicable to margin status) 52.17% 99.88% Table 7. Sensitivity and specificity of classification Area under ROC Category 1 (positive margin).9789 Category 2 (negative margin).9482 Category 3 (not applicable).8457 Table 8. Area under ROC for classification results Discussion The overall performance of the system in identifying, extracting, and classifying s was promising. The only customization included in handling reports from the two different hospitals was a preliminary iteration in the algorithm to capitalize on the existence of any semi-structured reporting of margin status. The preliminary pilot analysis of a small subset of reports was a useful starting point for designing heuristics for capturing potential margin s. It led to the five simple rules based on keyword appearance to capture s from 780 of 782 reports (99.74%) with only 23 (2.7%) extracted s not containing a margin status diagnosis. It also informed the decision to modify the extraction algorithm to capitalize the appearance of semi-structured reporting. The larger collection of BWH documents explains some of the differential in false positives, but not the ratio of 22 to 1. In addition, only 3 of the 22 BWH false positive reports was created in 1996 or 1997, ruling out some significant temporal factor in producing false positives. Instead, the majority of BWH false positives might be attributed to the lack of consistency in the way margin status and related concepts are referenced in BWH RRP pathology reports. If this is the case, then the results offer support for the standardization of key analytic measures to facilitate automated outcomes assessment research. The strong performance of the SVM classifier (97.19% accuracy) was partially an indication of the success of the boundary detection techniques in extracting representative feature vectors (margin s). It was also additional evidence of the power of SVMs in classifying text using distance-based techniques. We were also encouraged by the classifier s performance in light of no modifications made to accommodate classification of margin s from one hospital versus the other. Performance was predictably poor (12/23 for 53.3%) for classifying category three (not applicable) s due to a training set of only 23 s in the collection of 851 total s extracted. However, only one of the 11 misses for category 3 was falsely classified as a positive surgical margin (category 1) while the rest were classified as negative margin s. This observation is important as it implies that, using this technique, incorrectly classified category 3 s are unlikely to inflate the number of cases resulting in positive surgical margin. While only 1.1% (8 s) of all AMIA 2007 Symposium Proceedings Page - 163

5 negative margin s was incorrectly classified as positive margin, the number of positive margin cases was falsely inflated by 6.3%. The effect of the large ratio of negative to positive margin cases must therefore be considered in efforts to extend this technique for discovering the total number of positive margin cases in a collection. The conclusions drawn from this study are limited by the sample used. First, both UCLA and BWH are American College of Surgeons Commission on Cancer-approved hospitals which indicates that both have achieved a baseline of quality in regards to their oncology services [19]. The format and inclusion of margin status in non-approved facilities may differ systematically. Second, this study did not account for the effects of poor data quality introduced by incorrectly assigned administrative codes or missing values. Conclusion We have demonstrated the ability to automatically identify and classify s describing a key outcome of prostate cancer surgery from pathology reports with high accuracy. This is a fundamental first step in supporting automated prostate cancer surgery outcomes assessment. Currently, COAT is being extended to capture other key quality measures including Gleason score and tumor stage. An evaluation of the effects of incorrect administrative code assignments and missing values on produced results is also in progress. Acknowledgements This work was supported in part by the NLM Medical Informatics Training Grant # LM This paper benefited from suggestions from Dr. Jim Sayre, Dr. David Miller, Dr. Jim Hu, and Vijay Bashyam. References [1] American Cancer Society. Overview: Prostate Cancer. [Web site] 2006 [cited 2006; Available: t=36 [2] Richie J. Management of patients with positive surgical margins following radical prostatectomy. The Urological Clinics of North America. 1994;21:717. [3] Hull G, Rabbani F, et al. Cancer control with radical prostatectomy alone in 1,000 consecutive patients. The Journal of Urology. 2002;167:528. [4] Friedman C, Alderson P, et al. A general natural language text processor for clinical radiology. Journal of American Medical Informatics Association. 1994;1(2): [5] Aronson A. Effective mapping of biomedical text to the UMLS metathesaurus: the MetaMap program. In: Belfus H, editor. AMIA Symposium; 2001; p [6] Hripsak G, Austin J, et al. Use of natural language processing to translate clinical information from a database of 889,921 chest radiographic reports. Radiology. 2002;224(1): [7] Xu H, Anderson K, et al. Facilitating Cancer Research using Natural Language Processing of Pathology Reports. MedInfo; 2004; [8] Chapman W, Fiszman M, et al. Identifying respiratory findings in emergency department reports for biosurveillance using MetaMap. Medinfo ///;11(Pt 1): [9] Turchin A, Kolatkar N, et al. Using regular expressions to abstract blood pressure and treatment intensification information from the text of physician notes. Journal of the American Medical Informatics Association. 2006;13(6): [10] Chapman WW, Fizman M, et al. A comparison of classification algorithms to automatically identify chest X-ray reports that support pneumonia. J Biomed Inform. 2001;34(1):14. [11] Joachims T. Learning to Classify Text Using Support Vector Machines: Methods, Theory and Algorithms: Kluwer Academic Publishers [12] etisgen-ildiz M, Pratt W. The effect of feature representation on MEDLINE document classification. Proceedings of the American Medical Informatics Association; 2005; Washington, DC; [13] Bashyam V, Taira R. Identifying Anatomical Phrases in Clinical Reports by Shallow Semantic Parsing Methods. Proceedings of the IEEE Symposium on Computational Intelligence and Data Mining; 2007; Honolulu, Hawaii; [14] Cancer Biomedical Informatics Grid. About caties [cited 2007 March 10, 2007]; Available: [15] Zeng Q, Goryachev S, et al. Extracting principle diagnosis, co-morbidity, and smoking status for asthma research: evaluation of a natural language processing system. BMC Medical Informatics and Decision Making. 2006;6(30). [16] Cunningham H. GATE, A General Architecure for Text Engineering. Computers and the Humanities. 2004;36(2): [17] National Library of Medicine. UMLS Fact Sheet [cited 2006 August 10]; Available from: [18] Witten I, Frank E. Data Mining: Practical machine learning tools and techniques. 2nd Edition ed. San Francisco: Morgan Kaufmann [19] American College of Surgeons Commission on Cancer. Categories of Approval. [cited Feb. 24, 2007];Available: [20] Platt J. Machines using sequential minimal optimization. In: B S, C B, A S, eds. Advances in Kernel Methods - Support Vector Learning AMIA 2007 Symposium Proceedings Page - 164

A Study of Abbreviations in Clinical Notes Hua Xu MS, MA 1, Peter D. Stetson, MD, MA 1, 2, Carol Friedman Ph.D. 1

A Study of Abbreviations in Clinical Notes Hua Xu MS, MA 1, Peter D. Stetson, MD, MA 1, 2, Carol Friedman Ph.D. 1 A Study of Abbreviations in Clinical Notes Hua Xu MS, MA 1, Peter D. Stetson, MD, MA 1, 2, Carol Friedman Ph.D. 1 1 Department of Biomedical Informatics, Columbia University, New York, NY, USA 2 Department

More information

A Simple Pipeline Application for Identifying and Negating SNOMED CT in Free Text

A Simple Pipeline Application for Identifying and Negating SNOMED CT in Free Text A Simple Pipeline Application for Identifying and Negating SNOMED CT in Free Text Anthony Nguyen 1, Michael Lawley 1, David Hansen 1, Shoni Colquist 2 1 The Australian e-health Research Centre, CSIRO ICT

More information

IBM Research Report. Automated Problem List Generation from Electronic Medical Records in IBM Watson

IBM Research Report. Automated Problem List Generation from Electronic Medical Records in IBM Watson RC25496 (WAT1409-068) September 24, 2014 Computer Science IBM Research Report Automated Problem List Generation from Electronic Medical Records in IBM Watson Murthy Devarakonda, Ching-Huei Tsou IBM Research

More information

Symbolic rule-based classification of lung cancer stages from free-text pathology reports

Symbolic rule-based classification of lung cancer stages from free-text pathology reports Symbolic rule-based classification of lung cancer stages from free-text pathology reports Anthony N Nguyen, 1 Michael J Lawley, 1 David P Hansen, 1 Rayleen V Bowman, 2 Belinda E Clarke, 3 Edwina E Duhig,

More information

Text mining for lung cancer cases over large patient admission data. David Martinez, Lawrence Cavedon, Zaf Alam, Christopher Bain, Karin Verspoor

Text mining for lung cancer cases over large patient admission data. David Martinez, Lawrence Cavedon, Zaf Alam, Christopher Bain, Karin Verspoor Text mining for lung cancer cases over large patient admission data David Martinez, Lawrence Cavedon, Zaf Alam, Christopher Bain, Karin Verspoor Opportunities for Biomedical Informatics Increasing roll-out

More information

BMC Medical Informatics and Decision Making 2006, 6:30

BMC Medical Informatics and Decision Making 2006, 6:30 BMC Medical Informatics and Decision Making This Provisional PDF corresponds to the article as it appeared upon acceptance. Copyedited and fully formatted PDF and full text (HTML) versions will be made

More information

Quality ID #250 (NQF 1853): Radical Prostatectomy Pathology Reporting National Quality Strategy Domain: Effective Clinical Care

Quality ID #250 (NQF 1853): Radical Prostatectomy Pathology Reporting National Quality Strategy Domain: Effective Clinical Care Quality ID #250 (NQF 1853): Radical Prostatectomy Pathology Reporting National Quality Strategy Domain: Effective Clinical Care 2018 OPTIONS FOR INDIVIDUAL MEASURES: REGISTRY ONLY MEASURE TYPE: Process

More information

Analyzing the Semantics of Patient Data to Rank Records of Literature Retrieval

Analyzing the Semantics of Patient Data to Rank Records of Literature Retrieval Proceedings of the Workshop on Natural Language Processing in the Biomedical Domain, Philadelphia, July 2002, pp. 69-76. Association for Computational Linguistics. Analyzing the Semantics of Patient Data

More information

Asthma Surveillance Using Social Media Data

Asthma Surveillance Using Social Media Data Asthma Surveillance Using Social Media Data Wenli Zhang 1, Sudha Ram 1, Mark Burkart 2, Max Williams 2, and Yolande Pengetnze 2 University of Arizona 1, PCCI-Parkland Center for Clinical Innovation 2 {wenlizhang,

More information

Case-based reasoning using electronic health records efficiently identifies eligible patients for clinical trials

Case-based reasoning using electronic health records efficiently identifies eligible patients for clinical trials Case-based reasoning using electronic health records efficiently identifies eligible patients for clinical trials Riccardo Miotto and Chunhua Weng Department of Biomedical Informatics Columbia University,

More information

George Cernile Artificial Intelligence in Medicine Toronto, ON. Carol L. Kosary National Cancer Institute Rockville, MD

George Cernile Artificial Intelligence in Medicine Toronto, ON. Carol L. Kosary National Cancer Institute Rockville, MD George Cernile Artificial Intelligence in Medicine Toronto, ON Carol L. Kosary National Cancer Institute Rockville, MD Using RCA A system to convert free text pathology reports into a database of discrete

More information

Keeping Abreast of Breast Imagers: Radiology Pathology Correlation for the Rest of Us

Keeping Abreast of Breast Imagers: Radiology Pathology Correlation for the Rest of Us SIIM 2016 Scientific Session Quality and Safety Part 1 Thursday, June 30 8:00 am 9:30 am Keeping Abreast of Breast Imagers: Radiology Pathology Correlation for the Rest of Us Linda C. Kelahan, MD, Medstar

More information

Semi-Automatic Construction of Thyroid Cancer Intervention Corpus from Biomedical Abstracts

Semi-Automatic Construction of Thyroid Cancer Intervention Corpus from Biomedical Abstracts jsci2016 Semi-Automatic Construction of Thyroid Cancer Intervention Corpus from Biomedical Wutthipong Kongburan, Praisan Padungweang, Worarat Krathu, Jonathan H. Chan School of Information Technology King

More information

Modeling Annotator Rationales with Application to Pneumonia Classification

Modeling Annotator Rationales with Application to Pneumonia Classification Modeling Annotator Rationales with Application to Pneumonia Classification Michael Tepper 1, Heather L. Evans 3, Fei Xia 1,2, Meliha Yetisgen-Yildiz 2,1 1 Department of Linguistics, 2 Biomedical and Health

More information

Procedures Needle Biopsy Transurethral Prostatic Resection Suprapubic or Retropubic Enucleation (Subtotal Prostatectomy) Radical Prostatectomy

Procedures Needle Biopsy Transurethral Prostatic Resection Suprapubic or Retropubic Enucleation (Subtotal Prostatectomy) Radical Prostatectomy Prostate Gland Protocol applies to invasive carcinomas of the prostate gland. Protocol web posting date: July 2006 Protocol effective date: April 2007 Based on AJCC/UICC TNM, 6 th edition Procedures Needle

More information

Prostate cancer ~ diagnosis and impact of pathology on prognosis ESMO 2017

Prostate cancer ~ diagnosis and impact of pathology on prognosis ESMO 2017 Prostate cancer ~ diagnosis and impact of pathology on prognosis ESMO 2017 Dr Puay Hoon Tan Division of Pathology Singapore General Hospital Prostate cancer (acinar adenocarcinoma) Invasive carcinoma composed

More information

Measure #250 (NQF 1853): Radical Prostatectomy Pathology Reporting National Quality Strategy Domain: Effective Clincial Care

Measure #250 (NQF 1853): Radical Prostatectomy Pathology Reporting National Quality Strategy Domain: Effective Clincial Care Measure #250 (NQF 1853): Radical Prostatectomy Pathology Reporting National Quality Strategy Domain: Effective Clincial Care 2016 PQRS OPTIONS FOR INDIVIDUAL MEASURES: CLAIMS, REGISTRY DESCRIPTION: Percentage

More information

Evaluating Classifiers for Disease Gene Discovery

Evaluating Classifiers for Disease Gene Discovery Evaluating Classifiers for Disease Gene Discovery Kino Coursey Lon Turnbull khc0021@unt.edu lt0013@unt.edu Abstract Identification of genes involved in human hereditary disease is an important bioinfomatics

More information

Automatic Extraction of ICD-O-3 Primary Sites from Cancer Pathology Reports

Automatic Extraction of ICD-O-3 Primary Sites from Cancer Pathology Reports Automatic Extraction of ICD-O-3 Primary Sites from Cancer Pathology Reports Ramakanth Kavuluru, Ph.D 1, Isaac Hands, B.S 2, Eric B. Durbin, DrPH 2, and Lisa Witt, A.S 2 1 Division of Biomedical Informatics,

More information

Probabilistic Reasoning for Medical Decision Support. Omolola Ogunyemi, PhD Director, Center for Biomedical Informatics Charles Drew University

Probabilistic Reasoning for Medical Decision Support. Omolola Ogunyemi, PhD Director, Center for Biomedical Informatics Charles Drew University Probabilistic Reasoning for Medical Decision Support Omolola Ogunyemi, PhD Director, Center for Biomedical Informatics Charles Drew University Overview Predictive & diagnostic models for medical decision

More information

Building a framework for handling clinical abbreviations a long journey of understanding shortened words "

Building a framework for handling clinical abbreviations a long journey of understanding shortened words Building a framework for handling clinical abbreviations a long journey of understanding shortened words " Yonghui Wu 1 PhD, Joshua C. Denny 2 MD MS, S. Trent Rosenbloom 2 MD MPH, Randolph A. Miller 2

More information

Identifying Deviations from Usual Medical Care using a Statistical Approach

Identifying Deviations from Usual Medical Care using a Statistical Approach Identifying Deviations from Usual Medical Care using a Statistical Approach Shyam Visweswaran, MD, PhD 1, James Mezger, MD, MS 2, Gilles Clermont, MD, MSc 3, Milos Hauskrecht, PhD 4, Gregory F. Cooper,

More information

Retrieving disorders and findings: Results using SNOMED CT and NegEx adapted for Swedish

Retrieving disorders and findings: Results using SNOMED CT and NegEx adapted for Swedish Retrieving disorders and findings: Results using SNOMED CT and NegEx adapted for Swedish Maria Skeppstedt 1,HerculesDalianis 1,andGunnarHNilsson 2 1 Department of Computer and Systems Sciences (DSV)/Stockholm

More information

Semantic Alignment between ICD-11 and SNOMED-CT. By Marcie Wright RHIA, CHDA, CCS

Semantic Alignment between ICD-11 and SNOMED-CT. By Marcie Wright RHIA, CHDA, CCS Semantic Alignment between ICD-11 and SNOMED-CT By Marcie Wright RHIA, CHDA, CCS World Health Organization (WHO) owns and publishes the International Classification of Diseases (ICD) WHO was entrusted

More information

A Method for Analyzing Commonalities in Clinical Trial Target Populations

A Method for Analyzing Commonalities in Clinical Trial Target Populations A Method for Analyzing Commonalities in Clinical Trial Target Populations Zhe (Henry) He 1, Simona Carini 2, Tianyong Hao 1, Ida Sim 2, and Chunhua Weng 1 1 Department of Biomedical Informatics, Columbia

More information

Mammogram Analysis: Tumor Classification

Mammogram Analysis: Tumor Classification Mammogram Analysis: Tumor Classification Literature Survey Report Geethapriya Raghavan geeragh@mail.utexas.edu EE 381K - Multidimensional Digital Signal Processing Spring 2005 Abstract Breast cancer is

More information

CLAMP-Cancer an NLP tool to facilitate cancer research using EHRs Hua Xu, PhD

CLAMP-Cancer an NLP tool to facilitate cancer research using EHRs Hua Xu, PhD CLAMP-Cancer an NLP tool to facilitate cancer research using EHRs Hua Xu, PhD School of Biomedical Informatics The University of Texas Health Science Center at Houston 1 Advancing Cancer Pharmacoepidemiology

More information

Text Mining of Patient Demographics and Diagnoses from Psychiatric Assessments

Text Mining of Patient Demographics and Diagnoses from Psychiatric Assessments University of Wisconsin Milwaukee UWM Digital Commons Theses and Dissertations December 2014 Text Mining of Patient Demographics and Diagnoses from Psychiatric Assessments Eric James Klosterman University

More information

6/5/2010. Renal vein invasion & Capsule Penetration (T3a) Adrenal Gland involvement (T4 vs. M1) Beyond Gerota s Fascia? (?T4).

6/5/2010. Renal vein invasion & Capsule Penetration (T3a) Adrenal Gland involvement (T4 vs. M1) Beyond Gerota s Fascia? (?T4). GU Cancer Staging: Updates and Challenging Areas 13 th Current Issues in Surgical Pathology San Francisco, CA June 5, 2010 Jeffry P. Simko, PhD, MD Associate Professor Departments of Urology and Anatomic

More information

Data Mining in Bioinformatics Day 4: Text Mining

Data Mining in Bioinformatics Day 4: Text Mining Data Mining in Bioinformatics Day 4: Text Mining Karsten Borgwardt February 25 to March 10 Bioinformatics Group MPIs Tübingen Karsten Borgwardt: Data Mining in Bioinformatics, Page 1 What is text mining?

More information

CANCER REPORTING IN CALIFORNIA: ABSTRACTING AND CODING PROCEDURES California Cancer Reporting System Standards, Volume I

CANCER REPORTING IN CALIFORNIA: ABSTRACTING AND CODING PROCEDURES California Cancer Reporting System Standards, Volume I CANCER REPORTING IN CALIFORNIA: ABSTRACTING AND CODING PROCEDURES California Cancer Reporting System Standards, Volume I Changes and Clarifications 16 th Edition April 15, 2016 Quick Look- Updates to Volume

More information

Diagnosis of Breast Cancer Using Ensemble of Data Mining Classification Methods

Diagnosis of Breast Cancer Using Ensemble of Data Mining Classification Methods International Journal of Bioinformatics and Biomedical Engineering Vol. 1, No. 3, 2015, pp. 318-322 http://www.aiscience.org/journal/ijbbe ISSN: 2381-7399 (Print); ISSN: 2381-7402 (Online) Diagnosis of

More information

READ-BIOMED-SS: ADVERSE DRUG REACTION CLASSIFICATION OF MICROBLOGS USING EMOTIONAL AND CONCEPTUAL ENRICHMENT

READ-BIOMED-SS: ADVERSE DRUG REACTION CLASSIFICATION OF MICROBLOGS USING EMOTIONAL AND CONCEPTUAL ENRICHMENT READ-BIOMED-SS: ADVERSE DRUG REACTION CLASSIFICATION OF MICROBLOGS USING EMOTIONAL AND CONCEPTUAL ENRICHMENT BAHADORREZA OFOGHI 1, SAMIN SIDDIQUI 1, and KARIN VERSPOOR 1,2 1 Department of Computing and

More information

This is the accepted version of this article. To be published as : This is the author version published as:

This is the accepted version of this article. To be published as : This is the author version published as: QUT Digital Repository: http://eprints.qut.edu.au/ This is the author version published as: This is the accepted version of this article. To be published as : This is the author version published as: Chew,

More information

KNOWLEDGE-BASED METHOD FOR DETERMINING THE MEANING OF AMBIGUOUS BIOMEDICAL TERMS USING INFORMATION CONTENT MEASURES OF SIMILARITY

KNOWLEDGE-BASED METHOD FOR DETERMINING THE MEANING OF AMBIGUOUS BIOMEDICAL TERMS USING INFORMATION CONTENT MEASURES OF SIMILARITY KNOWLEDGE-BASED METHOD FOR DETERMINING THE MEANING OF AMBIGUOUS BIOMEDICAL TERMS USING INFORMATION CONTENT MEASURES OF SIMILARITY 1 Bridget McInnes Ted Pedersen Ying Liu Genevieve B. Melton Serguei Pakhomov

More information

Identifying Adverse Drug Events from Patient Social Media: A Case Study for Diabetes

Identifying Adverse Drug Events from Patient Social Media: A Case Study for Diabetes Identifying Adverse Drug Events from Patient Social Media: A Case Study for Diabetes Authors: Xiao Liu, Department of Management Information Systems, University of Arizona Hsinchun Chen, Department of

More information

Classification of Smoking Status: The Case of Turkey

Classification of Smoking Status: The Case of Turkey Classification of Smoking Status: The Case of Turkey Zeynep D. U. Durmuşoğlu Department of Industrial Engineering Gaziantep University Gaziantep, Turkey unutmaz@gantep.edu.tr Pınar Kocabey Çiftçi Department

More information

Boundary identification of events in clinical named entity recognition

Boundary identification of events in clinical named entity recognition Boundary identification of events in clinical named entity recognition Azad Dehghan School of Computer Science The University of Manchester Manchester, UK a.dehghan@cs.man.ac.uk Abstract The problem of

More information

Mammogram Analysis: Tumor Classification

Mammogram Analysis: Tumor Classification Mammogram Analysis: Tumor Classification Term Project Report Geethapriya Raghavan geeragh@mail.utexas.edu EE 381K - Multidimensional Digital Signal Processing Spring 2005 Abstract Breast cancer is the

More information

Extraction of Adverse Drug Effects from Clinical Records

Extraction of Adverse Drug Effects from Clinical Records MEDINFO 2010 C. Safran et al. (Eds.) IOS Press, 2010 2010 IMIA and SAHIA. All rights reserved. doi:10.3233/978-1-60750-588-4-739 739 Extraction of Adverse Drug Effects from Clinical Records Eiji Aramaki

More information

General Symptom Extraction from VA Electronic Medical Notes

General Symptom Extraction from VA Electronic Medical Notes General Symptom Extraction from VA Electronic Medical Notes Guy Divita a,b, Gang Luo, PhD c, Le-Thuy T. Tran, PhD a,b, T. Elizabeth Workman, PhD a,b, Adi V. Gundlapalli, MD, PhD a,b, Matthew H. Samore,

More information

An Improved Algorithm To Predict Recurrence Of Breast Cancer

An Improved Algorithm To Predict Recurrence Of Breast Cancer An Improved Algorithm To Predict Recurrence Of Breast Cancer Umang Agrawal 1, Ass. Prof. Ishan K Rajani 2 1 M.E Computer Engineer, Silver Oak College of Engineering & Technology, Gujarat, India. 2 Assistant

More information

Curriculum Vitae. Degree and date to be conferred: Masters in Computer Science, 2013.

Curriculum Vitae. Degree and date to be conferred: Masters in Computer Science, 2013. i Curriculum Vitae Name: Deepal Dhariwal. Degree and date to be conferred: Masters in Computer Science, 2013. Secondary education: Dr. Kalmadi Shamarao High School, Pune, 2005 Fergusson College, Pune 2007

More information

Clinical Trial and Evaluation of a Prototype Case-Based System for Planning Medical Imaging Work-up Strategies

Clinical Trial and Evaluation of a Prototype Case-Based System for Planning Medical Imaging Work-up Strategies From: AAAI Technical Report WS-94-01. Compilation copyright 1994, AAAI (www.aaai.org). All rights reserved. Clinical Trial and Evaluation of a Prototype Case-Based System for Planning Medical Imaging Work-up

More information

arxiv: v1 [stat.ml] 23 Jan 2017

arxiv: v1 [stat.ml] 23 Jan 2017 Learning what to look in chest X-rays with a recurrent visual attention model arxiv:1701.06452v1 [stat.ml] 23 Jan 2017 Petros-Pavlos Ypsilantis Department of Biomedical Engineering King s College London

More information

Large blocks in prostate and bladder pathology

Large blocks in prostate and bladder pathology Large blocks in prostate and bladder pathology Farkas Sükösd Department of Pathology, University of Szeged The history of the large block technique in radical prostatectomy and cystectomy The first large

More information

Standard 4.6: The Importance of CAP Protocols and Understanding Synoptic Reporting

Standard 4.6: The Importance of CAP Protocols and Understanding Synoptic Reporting Standard 4.6: The Importance of CAP Protocols and Understanding Synoptic Reporting Jerry Hussong, MD, FCAP Cedars Sinai Medical Center, Los Angeles CA M. Asa Carter, CTR Manager, Accreditation and Standards

More information

Primary Level Classification of Brain Tumor using PCA and PNN

Primary Level Classification of Brain Tumor using PCA and PNN Primary Level Classification of Brain Tumor using PCA and PNN Dr. Mrs. K.V.Kulhalli Department of Information Technology, D.Y.Patil Coll. of Engg. And Tech. Kolhapur,Maharashtra,India kvkulhalli@gmail.com

More information

An Intelligent Writing Assistant Module for Narrative Clinical Records based on Named Entity Recognition and Similarity Computation

An Intelligent Writing Assistant Module for Narrative Clinical Records based on Named Entity Recognition and Similarity Computation An Intelligent Writing Assistant Module for Narrative Clinical Records based on Named Entity Recognition and Similarity Computation 1,2,3 EMR and Intelligent Expert System Engineering Research Center of

More information

COMPARISON OF BREAST CANCER STAGING IN NATURAL LANGUAGE TEXT AND SNOMED ANNOTATED TEXT

COMPARISON OF BREAST CANCER STAGING IN NATURAL LANGUAGE TEXT AND SNOMED ANNOTATED TEXT Volume 116 No. 21 2017, 243-249 ISSN: 1311-8080 (printed version); ISSN: 1314-3395 (on-line version) url: http://www.ijpam.eu ijpam.eu COMPARISON OF BREAST CANCER STAGING IN NATURAL LANGUAGE TEXT AND SNOMED

More information

I.2 CNExT This section was software specific and deleted in 2008.

I.2 CNExT This section was software specific and deleted in 2008. CANCER REPORTING IN CALIFORNIA: ABSTRACTING AND CODING PROCEDURES FOR HOSPITALS California Cancer Reporting System Standards, Volume I Changes and Clarifications 8th th Edition Revised May 2008 SECTION

More information

Predicting Breast Cancer Recurrence Using Machine Learning Techniques

Predicting Breast Cancer Recurrence Using Machine Learning Techniques Predicting Breast Cancer Recurrence Using Machine Learning Techniques Umesh D R Department of Computer Science & Engineering PESCE, Mandya, Karnataka, India Dr. B Ramachandra Department of Electrical and

More information

EDUCATION/TRAINING (Begin with baccalaureate or other initial professional education, such as nursing, and include postdoctoral training.

EDUCATION/TRAINING (Begin with baccalaureate or other initial professional education, such as nursing, and include postdoctoral training. NAME Henry Lowe, M.D. BIOGRAPHICAL SKETCH Provide the following information for the key personnel and other significant contributors in the order listed on Form Page 2. Follow this format for each person.

More information

Comparing Decision Support Methodologies for Identifying Asthma Exacerbations

Comparing Decision Support Methodologies for Identifying Asthma Exacerbations MEDINFO 2007 K. Kuhn et al. (Eds) IOS Press, 2007 2007 The authors. All rights reserved. Comparing Decision Support Methodologies for Identifying Asthma Exacerbations Judith W Dexheimer a, Laura E Brown

More information

Lessons Extracting Diseases from Discharge Summaries

Lessons Extracting Diseases from Discharge Summaries Lessons Extracting Diseases from Discharge Summaries William Long, PhD CSAIL, Massachusetts Institute of Technology, Cambridge, MA, USA Abstract We developed a program to extract diseases and procedures

More information

UNIVERSITY of PENNSYLVANIA CIS 520: Machine Learning Final, Fall 2014

UNIVERSITY of PENNSYLVANIA CIS 520: Machine Learning Final, Fall 2014 UNIVERSITY of PENNSYLVANIA CIS 520: Machine Learning Final, Fall 2014 Exam policy: This exam allows two one-page, two-sided cheat sheets (i.e. 4 sides); No other materials. Time: 2 hours. Be sure to write

More information

Comparing ICD9-Encoded Diagnoses and NLP-Processed Discharge Summaries for Clinical Trials Pre-Screening: A Case Study

Comparing ICD9-Encoded Diagnoses and NLP-Processed Discharge Summaries for Clinical Trials Pre-Screening: A Case Study Comparing ICD9-Encoded Diagnoses and NLP-Processed Discharge Summaries for Clinical Trials Pre-Screening: A Case Study Li Li, MS, Herbert S. Chase, MD, Chintan O. Patel, MS Carol Friedman, PhD, Chunhua

More information

How can Natural Language Processing help MedDRA coding? April Andrew Winter Ph.D., Senior Life Science Specialist, Linguamatics

How can Natural Language Processing help MedDRA coding? April Andrew Winter Ph.D., Senior Life Science Specialist, Linguamatics How can Natural Language Processing help MedDRA coding? April 16 2018 Andrew Winter Ph.D., Senior Life Science Specialist, Linguamatics Summary About NLP and NLP in life sciences Uses of NLP with MedDRA

More information

Automatically extracting, ranking and visually summarizing the treatments for a disease

Automatically extracting, ranking and visually summarizing the treatments for a disease Automatically extracting, ranking and visually summarizing the treatments for a disease Prakash Reddy Putta, B.Tech 1,2, John J. Dzak III, BS 1, Siddhartha R. Jonnalagadda, PhD 1 1 Division of Health and

More information

A Review on Sleep Apnea Detection from ECG Signal

A Review on Sleep Apnea Detection from ECG Signal A Review on Sleep Apnea Detection from ECG Signal Soumya Gopal 1, Aswathy Devi T. 2 1 M.Tech Signal Processing Student, Department of ECE, LBSITW, Kerala, India 2 Assistant Professor, Department of ECE,

More information

Using Regular Expressions to Abstract Blood Pressure and Treatment Intensification Information from the Text of Physician Notes

Using Regular Expressions to Abstract Blood Pressure and Treatment Intensification Information from the Text of Physician Notes Journal of the American Medical Informatics Association Volume 13 Number 6 Nov / Dec 2006 691 Case Report Using Regular Expressions to Abstract Blood Pressure and Treatment Intensification Information

More information

The Relationship Between Surgical Volume and Patient Outcomes in Urologic Malignancies

The Relationship Between Surgical Volume and Patient Outcomes in Urologic Malignancies The Relationship Between Surgical Volume and Patient Outcomes in Urologic Malignancies Geoffrey Gotto PGY-5 UBC Department of Urologic Sciences October 8 th, 2008 Objective To review the literature on

More information

Cancer. Description. Section: Surgery Effective Date: October 15, 2016 Subsection: Original Policy Date: September 9, 2011 Subject:

Cancer. Description. Section: Surgery Effective Date: October 15, 2016 Subsection: Original Policy Date: September 9, 2011 Subject: Subject: Saturation Biopsy for Diagnosis, Last Review Status/Date: September 2016 Page: 1 of 9 Saturation Biopsy for Diagnosis, Description Saturation biopsy of the prostate, in which more cores are obtained

More information

Erasmus MC at CLEF ehealth 2016: Concept Recognition and Coding in French Texts

Erasmus MC at CLEF ehealth 2016: Concept Recognition and Coding in French Texts Erasmus MC at CLEF ehealth 2016: Concept Recognition and Coding in French Texts Erik M. van Mulligen, Zubair Afzal, Saber A. Akhondi, Dang Vo, and Jan A. Kors Department of Medical Informatics, Erasmus

More information

FDA Workshop NLP to Extract Information from Clinical Text

FDA Workshop NLP to Extract Information from Clinical Text FDA Workshop NLP to Extract Information from Clinical Text Murthy Devarakonda, Ph.D. Distinguished Research Staff Member PI for Watson Patient Records Analytics Project IBM Research mdev@us.ibm.com *This

More information

Data Science Reduces Anatomic Pathology Reporting Errors

Data Science Reduces Anatomic Pathology Reporting Errors Data Science Reduces Anatomic Pathology Reporting Errors Session # 270, February 14, 2019 Jay J. Ye, MD, PhD, Pathologist Dahl-Chase Pathology Associates 1 Conflict of Interest Jay J. Ye, MD, PhD Has no

More information

Conditional Outlier Detection for Clinical Alerting

Conditional Outlier Detection for Clinical Alerting Conditional Outlier Detection for Clinical Alerting Milos Hauskrecht, PhD 1, Michal Valko, MSc 1, Iyad Batal, MSc 1, Gilles Clermont, MD, MS 2, Shyam Visweswaran MD, PhD 3, Gregory F. Cooper, MD, PhD 3

More information

Innovative Risk and Quality Solutions for Value-Based Care. Company Overview

Innovative Risk and Quality Solutions for Value-Based Care. Company Overview Innovative Risk and Quality Solutions for Value-Based Care Company Overview Meet Talix Talix provides risk and quality solutions to help providers, payers and accountable care organizations address the

More information

Predicting Breast Cancer Survivability Rates

Predicting Breast Cancer Survivability Rates Predicting Breast Cancer Survivability Rates For data collected from Saudi Arabia Registries Ghofran Othoum 1 and Wadee Al-Halabi 2 1 Computer Science, Effat University, Jeddah, Saudi Arabia 2 Computer

More information

Query Refinement: Negation Detection and Proximity Learning Georgetown at TREC 2014 Clinical Decision Support Track

Query Refinement: Negation Detection and Proximity Learning Georgetown at TREC 2014 Clinical Decision Support Track Query Refinement: Negation Detection and Proximity Learning Georgetown at TREC 2014 Clinical Decision Support Track Christopher Wing and Hui Yang Department of Computer Science, Georgetown University,

More information

Atigeo at TREC 2012 Medical Records Track: ICD-9 Code Description Injection to Enhance Electronic Medical Record Search Accuracy

Atigeo at TREC 2012 Medical Records Track: ICD-9 Code Description Injection to Enhance Electronic Medical Record Search Accuracy Atigeo at TREC 2012 Medical Records Track: ICD-9 Code Description Injection to Enhance Electronic Medical Record Search Accuracy Bryan Tinsley, Alex Thomas, Joseph F. McCarthy, Mike Lazarus Atigeo, LLC

More information

Pulmonary nodules are commonly encountered in clinical

Pulmonary nodules are commonly encountered in clinical ORIGINAL ARTICLE Automated Identification of Patients With Pulmonary Nodules in an Integrated Health System Using Administrative Health Plan Data, Radiology Reports, and Natural Language Processing Kim

More information

Survival Prediction Models for Estimating the Benefit of Post-Operative Radiation Therapy for Gallbladder Cancer and Lung Cancer

Survival Prediction Models for Estimating the Benefit of Post-Operative Radiation Therapy for Gallbladder Cancer and Lung Cancer Survival Prediction Models for Estimating the Benefit of Post-Operative Radiation Therapy for Gallbladder Cancer and Lung Cancer Jayashree Kalpathy-Cramer PhD 1, William Hersh, MD 1, Jong Song Kim, PhD

More information

Extracting geographic locations from the literature for virus phylogeography using supervised and distant supervision methods

Extracting geographic locations from the literature for virus phylogeography using supervised and distant supervision methods Extracting geographic locations from the literature for virus phylogeography using supervised and distant supervision methods D. Weissenbacher 1, A. Sarker 2, T. Tahsin 1, G. Gonzalez 2 and M. Scotch 1

More information

Extracting Diagnoses from Discharge Summaries

Extracting Diagnoses from Discharge Summaries Extracting Diagnoses from Discharge Summaries William Long, PhD CSAIL, Massachusetts Institute of Technology, Cambridge, MA, USA Abstract We have developed a program for extracting the diagnoses and procedures

More information

Pilot Study: Clinical Trial Task Ontology Development. A prototype ontology of common participant-oriented clinical research tasks and

Pilot Study: Clinical Trial Task Ontology Development. A prototype ontology of common participant-oriented clinical research tasks and Pilot Study: Clinical Trial Task Ontology Development Introduction A prototype ontology of common participant-oriented clinical research tasks and events was developed using a multi-step process as summarized

More information

Evaluation of Clinical Text Segmentation to Facilitate Cohort Retrieval

Evaluation of Clinical Text Segmentation to Facilitate Cohort Retrieval Evaluation of Clinical Text Segmentation to Facilitate Cohort Retrieval Enhanced Cohort Identification and Retrieval S105 Tracy Edinger, ND, MS Oregon Health & Science University Twitter: #AMIA2017 Co-Authors

More information

Early Detection of Lung Cancer

Early Detection of Lung Cancer Early Detection of Lung Cancer Aswathy N Iyer Dept Of Electronics And Communication Engineering Lymie Jose Dept Of Electronics And Communication Engineering Anumol Thomas Dept Of Electronics And Communication

More information

TWO HANDED SIGN LANGUAGE RECOGNITION SYSTEM USING IMAGE PROCESSING

TWO HANDED SIGN LANGUAGE RECOGNITION SYSTEM USING IMAGE PROCESSING 134 TWO HANDED SIGN LANGUAGE RECOGNITION SYSTEM USING IMAGE PROCESSING H.F.S.M.Fonseka 1, J.T.Jonathan 2, P.Sabeshan 3 and M.B.Dissanayaka 4 1 Department of Electrical And Electronic Engineering, Faculty

More information

WikiWarsDE: A German Corpus of Narratives Annotated with Temporal Expressions

WikiWarsDE: A German Corpus of Narratives Annotated with Temporal Expressions WikiWarsDE: A German Corpus of Narratives Annotated with Temporal Expressions Jannik Strötgen, Michael Gertz Institute of Computer Science, Heidelberg University Im Neuenheimer Feld 348, 69120 Heidelberg,

More information

College of American Pathologists. Pathology Performance Measures included in CMS 2012 PQRS

College of American Pathologists. Pathology Performance Measures included in CMS 2012 PQRS College of American Pathologists Pathology Performance Measures included in CMS 2012 PQRS Breast Cancer Resection Pathology Reporting Measure #99 pt category (primary tumor) and pn category (regional lymph

More information

Lung Cancer Screening

Lung Cancer Screening Scan for mobile link. Lung Cancer Screening What is lung cancer screening? Screening examinations are tests performed to find disease before symptoms begin. The goal of screening is to detect disease at

More information

Automatic Extraction of Synoptic Data. George Cernile Artificial Intelligence in Medicine AIM

Automatic Extraction of Synoptic Data. George Cernile Artificial Intelligence in Medicine AIM Automatic Extraction of Synoptic Data George Cernile Artificial Intelligence in Medicine AIM Agenda Background Technology used Demonstration Questions How often are checklist elements included in a report,

More information

Prediction of Key Patient Outcome from Sentence and Word of Medical Text Records

Prediction of Key Patient Outcome from Sentence and Word of Medical Text Records Prediction of Key Patient Outcome from Sentence and Word of Medical Text Records Takanori Yamashita 1, Yoshifumi Wakata 1, Hidehisa Soejima 2, Naoki Nakashima 1, Sachio Hirokawa 3 1 Medical Information

More information

NPQR Quality Payment Program (QPP) Measures 21_18247_LS.

NPQR Quality Payment Program (QPP) Measures 21_18247_LS. NPQR Quality Payment Program (QPP) Measures 21_18247_LS MEASURE ID: QPP 99 MEASURE TITLE: Breast Cancer Resection Pathology Reporting pt Category (Primary Tumor) and pn Category (Regional Lymph Nodes)

More information

. Semi-automatic WordNet Linking using Word Embeddings. Kevin Patel, Diptesh Kanojia and Pushpak Bhattacharyya Presented by: Ritesh Panjwani

. Semi-automatic WordNet Linking using Word Embeddings. Kevin Patel, Diptesh Kanojia and Pushpak Bhattacharyya Presented by: Ritesh Panjwani Semi-automatic WordNet Linking using Word Embeddings Kevin Patel, Diptesh Kanojia and Pushpak Bhattacharyya Presented by: Ritesh Panjwani January 11, 2018 Kevin Patel WordNet Linking via Embeddings 1/22

More information

Using Bayesian Networks to Direct Stochastic Search in Inductive Logic Programming

Using Bayesian Networks to Direct Stochastic Search in Inductive Logic Programming Appears in Proceedings of the 17th International Conference on Inductive Logic Programming (ILP). Corvallis, Oregon, USA. June, 2007. Using Bayesian Networks to Direct Stochastic Search in Inductive Logic

More information

A DATA MINING APPROACH FOR PRECISE DIAGNOSIS OF DENGUE FEVER

A DATA MINING APPROACH FOR PRECISE DIAGNOSIS OF DENGUE FEVER A DATA MINING APPROACH FOR PRECISE DIAGNOSIS OF DENGUE FEVER M.Bhavani 1 and S.Vinod kumar 2 International Journal of Latest Trends in Engineering and Technology Vol.(7)Issue(4), pp.352-359 DOI: http://dx.doi.org/10.21172/1.74.048

More information

A Biostatistics Applications Area in the Department of Mathematics for a PhD/MSPH Degree

A Biostatistics Applications Area in the Department of Mathematics for a PhD/MSPH Degree A Biostatistics Applications Area in the Department of Mathematics for a PhD/MSPH Degree Patricia B. Cerrito Department of Mathematics Jewish Hospital Center for Advanced Medicine pcerrito@louisville.edu

More information

Pneumonia identification using statistical feature selection

Pneumonia identification using statistical feature selection Pneumonia identification using statistical feature selection Research and applications Cosmin Adrian Bejan, 1 Fei Xia, 1,2 Lucy Vanderwende, 1,3 Mark M Wurfel, 4 Meliha Yetisgen-Yildiz 1,2 < An additional

More information

Not all NLP is Created Equal:

Not all NLP is Created Equal: Not all NLP is Created Equal: CAC Technology Underpinnings that Drive Accuracy, Experience and Overall Revenue Performance Page 1 Performance Perspectives Health care financial leaders and health information

More information

Lung Cancer Screening

Lung Cancer Screening Scan for mobile link. Lung Cancer Screening What is lung cancer screening? Screening examinations are tests performed to find disease before symptoms begin. The goal of screening is to detect disease at

More information

Annotating Temporal Relations to Determine the Onset of Psychosis Symptoms

Annotating Temporal Relations to Determine the Onset of Psychosis Symptoms Annotating Temporal Relations to Determine the Onset of Psychosis Symptoms Natalia Viani, PhD IoPPN, King s College London Introduction: clinical use-case For patients with schizophrenia, longer durations

More information

How much data is enough? Predicting how accuracy varies with training data size

How much data is enough? Predicting how accuracy varies with training data size How much data is enough? Predicting how accuracy varies with training data size Mark Johnson (with Dat Quoc Nguyen) Macquarie University Sydney, Australia September 4, 2017 1 / 33 Outline Introduction

More information

Application of Automated Pathology Reporting Concepts to Radiology Reports

Application of Automated Pathology Reporting Concepts to Radiology Reports Original Article Application of Automated Pathology Reporting Concepts to Radiology Reports Suzanne March, MBA, CMC a ; George Cernile, BSc, CKE, PMP b ; Kim West, BS a ; Diane Borhani, MBA, CMC a ; April

More information

Sub-Topic Classification of HIV related Opportunistic Infections. Miguel Anderson and Joseph Fonseca

Sub-Topic Classification of HIV related Opportunistic Infections. Miguel Anderson and Joseph Fonseca Sub-Topic Classification of HIV related Opportunistic Infections Miguel Anderson and Joseph Fonseca Introduction Image collected from the CDC https://www.cdc.gov/hiv/basics/statistics.html Background Info

More information

Copyright 2007 IEEE. Reprinted from 4th IEEE International Symposium on Biomedical Imaging: From Nano to Macro, April 2007.

Copyright 2007 IEEE. Reprinted from 4th IEEE International Symposium on Biomedical Imaging: From Nano to Macro, April 2007. Copyright 27 IEEE. Reprinted from 4th IEEE International Symposium on Biomedical Imaging: From Nano to Macro, April 27. This material is posted here with permission of the IEEE. Such permission of the

More information

Knowledge networks of biological and medical data An exhaustive and flexible solution to model life sciences domains

Knowledge networks of biological and medical data An exhaustive and flexible solution to model life sciences domains Knowledge networks of biological and medical data An exhaustive and flexible solution to model life sciences domains Dr. Sascha Losko, Dr. Karsten Wenger, Dr. Wenzel Kalus, Dr. Andrea Ramge, Dr. Jens Wiehler,

More information

Definition of Synoptic Reporting

Definition of Synoptic Reporting Definition of Synoptic Reporting The CAP has developed this list of specific features that define synoptic reporting formatting: 1. All required cancer data from an applicable cancer protocol that are

More information

Bayesian Bi-Cluster Change-Point Model for Exploring Functional Brain Dynamics

Bayesian Bi-Cluster Change-Point Model for Exploring Functional Brain Dynamics Int'l Conf. Bioinformatics and Computational Biology BIOCOMP'18 85 Bayesian Bi-Cluster Change-Point Model for Exploring Functional Brain Dynamics Bing Liu 1*, Xuan Guo 2, and Jing Zhang 1** 1 Department

More information