Data with a human touch

Language Resources

High-quality data for machine learning, enhanced by human interaction

appen.com

Quality speech and language data to power your applications

Are you equipped to source the large amount of language data needed to train and test your machine learning model? Do you have the resources to scale your data collection to meet customer needs in all your target markets, including in low-resource languages? Appen can help.

We understand the complex needs of today's organizations. For the past 20 years, Appen has delivered the highest quality linguistic data and services, in over 180 languages and dialects, to government agencies and the world's largest corporations.

Data Collection

You build the technology; Appen hand-crafts the data for you.

We help you train and evaluate these systems...
- Automatic speech recognition (ASR)
- Text-to-speech (TTS) engines
- Natural language processing
- Machine translation
- Computer vision & AI platforms

...so you can build these applications:
- Virtual assistants
- Home automation devices
- Gaming consoles & smart TVs
- Call-center systems
- In-car infotainment systems

Here's why we're different:
- Unrivaled breadth of data collection coverage
- Global community of over 400,000 pre-screened speakers, covering 180+ languages
- 20 years of experience running collections of all sizes in 90+ countries
- No limit to languages, geography, collection modes and data volumes
- Experienced project managers, language/domain specialists and audio engineers

Looking for speech data? Appen's got it covered.

Environments
- Home
- Office
- Studio
- Outdoor
- In-car

Recording technology
- Smartphone collection app
- Telephony server
- Single/multi-channel professional audio recording kit
- Personal computer with a built-in or USB mic
- Proprietary customer devices

Speech modality
- Scripted
- Scenario-based responses
- Role-plays
- Spontaneous conversations

Participants
- Single
- Pairs
- Groups

Need non-speech data? We've got that too.
- Scenario-based natural language text collections
- Handwriting and keyboard input
- Non-speech audio recordings (e.g. gunshots, shattering glass)
- Image capture
- Video recordings

Transcription and Annotation

Are you working on a cutting-edge acoustic model? Appen's transcribed data can take it to the next level. Our proprietary web-based transcription interface, Appen Global, accelerates ramp-up time so we can deliver large volumes of high-quality transcription and annotation data to you.

What makes us unique:
- 20+ years of experience in human transcription and annotation
- 40 million audio files transcribed in 2016 alone
- 180+ languages supported
- Proprietary web-based transcription interface (Appen Global)
- All native alphabets and orthographies
- Speech, text, video and handwriting supported
- ISO 27001 and ISO 9001 certified for highly secure transcription

Here's how we guarantee quality:
- Screening & training of in-country transcribers
- Automated spelling checks & rigorous post-processing by senior team members
- Extensive experience developing spelling standardization in low-resource languages

And don't forget to ask about our premium extras:
- Orthographic transcription
- Automated or manual segmentation
- Time alignment
- Acoustic tagging
- Speaker ID / tracking
- Silence detection
- Spelling standardization
- Annotation of transcribed data
- Foreign / accented speech
- Foreground / background speech
- Mispronunciations and speech disfluencies
- Background / overlapping speech
- Slot value identification

Translation and Localization

Got a great local product? Appen helps you take your technology into new global markets with:
- Localization of human-machine interfaces into 50+ languages
- Evaluation and tuning of machine-translation technology
- Post-editing and end-to-end support for all MT-related needs
- Traditional document translation, software and website localization services
- Global crowd coverage to meet the demand for low-resource languages

Our expertise covers a variety of domains, including:
- Conversational speech translation
- Speech-to-speech MT evaluation
- Capture of persona and intent
- Live-chat, military and disaster relief
- Navigation, multimedia and voice search
- BNF grammar development

Custom Linguistic Solutions

Appen's high-end linguistic annotation support helps you fine-tune your ASR, TTS and NLP systems and expand them into new markets and domains.

Our services include:
- Text data annotation in support of:
  - Sentiment analysis
  - Semantic analysis
  - Detection of entities, relations, and events
  - Natural language understanding
- Customized linguistic consultation
- Pronunciation lexicon development for TTS and ASR
- Phonetic transcription and alignment for TTS
- Grammatical markup (POS, morphology)
- Proofing tools support
- TTS voice evaluation and benchmarking / MOS testing
- Named entity tagging
- Domain and intent tagging
- Language analysis specification documents for ASR and TTS development
- Dialogue analysis
- User interaction analysis

Licensable Resources

Do you need immediate access to a complete speech and language database? Appen has an extensive catalog of off-the-shelf, licensable resources ready to ship. We even cover low-resource languages, including dialects from West and North Asia, the Middle East and Africa. Ask for a copy of our catalog today!

Supported Languages

Afrikaans, Albanian, Amharic, Arabic (15+ varieties), Armenian, Assamese, Azerbaijani, Bahasa Indonesia, Bahasa Malaysia, Bakhtiari (Iran), Basque, Belarusian, Bengali, Bodo, Bulgarian, Burmese, Cantonese (China PRC, Hong Kong), Catalan, Cebuano, Corsican, Cree, Croatian, Czech, Danish, Dari, Dholuo, Divehi, Dogri, Dutch (Holland, Belgium), Esperanto, English (20+ varieties), Estonian, Farsi, Finnish, French (5+ varieties), Frisian, Gaelic, Gallego (Galician), Georgian, German (Austrian, German, Luxembourg, Swiss), Greek, Greenlandic, Gujarati, Guarani, Haitian Creole, Hausa, Hebrew, Hindi, Hungarian, Icelandic, Igbo, Inuktitut, Italian, Japanese, Javanese, Kannada, Kashmiri, Kazakh, Kermanji (Iran), Khmer, Kinyarwanda, Konkani, Korean (North, South), Ku Waru, Kurdish (Sorani, Kurmanji), Kyrgyz, Laki (Iran), Lao, Latvian, Lithuanian, Luri (Iran), Macedonian, Malayalam, Malagasy, Mandarin (China, Taiwan), Manx, Marathi, Mazanderani (Iran), Min, Mongolian, Nepali, Norwegian (Nynorsk, Bokmål), Oriya, Oromo, Pashto, Polish, Portuguese (Brazilian, European, African), Punjabi, Romanian, Russian, Serbian, Shona, Sinhala, Slovak, Slovenian, Somali, Spanish (20+ varieties), Sundanese, Swahili, Swedish, Sylheti, Tagalog, Tajik, Tamil, Telugu, Thai, Tok Pisin, Turkish, Turkmen, Ukrainian, Urdu, Uzbek, Vietnamese, Welsh, Wolof, Wu, Xiang, Xhosa, Yoruba, Zulu

Improving machine learning for technology leaders worldwide

Appen is a global leader in the development of high-quality, human-annotated datasets for machine learning and artificial intelligence. With over 20 years of experience, expertise in more than 180 languages, and access to a crowd of over 1,000,000 worldwide, Appen partners with global companies to enhance their machine learning-based products.

Contact Us

Appen Pty Ltd
Level 6, 9 Help Street
Chatswood, Sydney NSW 2067
Australia

Sydney: +61 2 9468 6335
US: +1 866 673 6996
Europe: +31 622 799 535
Japan & Korea: +82 10 2087 1989
China: +86 181 4650 3673

hello@appen.com
appen.com