Whole genome mapping as a novel high-resolution typing tool for Legionella pneumophila

Similar documents
Sensitivity of three serum antibody tests in a large outbreak of Legionnaires disease in the Netherlands

Evaluation of the Oxoid Xpect Legionella test kit for Detection of Legionella

Citation for published version (APA): IJzerman, E. P. F. (2009). Progress in diagnostics and prevention of Legionnaires' disease [S.n.

Citation for published version (APA): IJzerman, E. P. F. (2009). Progress in diagnostics and prevention of Legionnaires' disease [S.n.

Received 3 August 2005/Returned for modification 13 September 2005/Accepted 13 January 2006

Multi-clonal origin of macrolide-resistant Mycoplasma pneumoniae isolates. determined by multiple-locus variable-number tandem-repeat analysis

Citation for published version (APA): IJzerman, E. P. F. (2009). Progress in diagnostics and prevention of Legionnaires' disease [S.n.

Outbreak of travel-associated Legionnaires' disease Palmanova, Mallorca (Spain), September October Main conclusions and options for response

Exploring the evolution of MRSA with Whole Genome Sequencing

Abstract. Introduction

Volume 13, Issue September 2008

Sequence-based typing of Legionella pneumophila serogroup 1 clinical isolates from Belgium between 2000 and 2010

Molecular epidemiology of Legionnaires disease in Israel

Distribution of sequence-based types of Legionella pneumophila serogroup 1

Dual infections with different Legionella strains

Current and Emerging Legionella Diagnostics

Analysis of phenotypic variants of the serogroup C ET-15 clone of Neisseria meningitidis by pulsed-field gel electrophoresis.

A Newborn withdomestically Acquired Legionnaires Disease Confirmed by Molecular Typing

Commercial potting soils as an alternative infection source of Legionella pneumophila and other Legionella species in Switzerland

Liver cirrhosis as a predisposing condition for Legionnaires disease: a report of four laboratoryconfirmed

Legionnaires disease (LD) has been an emergent disease

New genomic typing method MLST

LEGIONELLOSIS: A WALK-THROUGH TO IDENTIFICATION OF THE SOURCE OF INFECTION

Presence and persistence of viable, clinically relevant Legionella pneumophila bacteria in

Whole genome sequencing

Legionella pollution in cooling tower water of air-conditioning systems in Shanghai, China

Measles Epidemic in The Netherlands,

Legionnaires disease in Europe: all quiet on the eastern front?

Whole genome sequencing & new strain typing methods in IPC. Lyn Gilbert ACIPC conference Hobart, November 2015

Sporadic and epidemic community legionellosis: two faces of the same illness

S u r v e y o n legislation r e g a r d i n g w e t c o o l i n g s y s t e m s

A Large Outbreak of Legionnaires' Disease at a Flower Show, the Netherlands, 1999

Fact Sheet: Legionellosis March 2017

Outbreak investigation and epidemiology - from practice to science -. Hoebe, C.J.P.A.

Genome-wide association studies for understanding pathogen evolution. Samuel K. Sheppard

Epidemiology of Verotoxigenic E. coli in Ireland 2004 Patricia Garvey, Anne Carroll, Eleanor McNamara and Paul McKeown

Complex clinical and microbiological effects on Legionnaires disease outcone; A retrospective cohort study

Trends in Legionnaires Disease, : Declining Mortality and New Patterns of Diagnosis

Molecular typing for surveillance of multidrug-resistant tuberculosis in the EU/EEA

Molecular typing for surveillance of multidrug-resistant tuberculosis in the EU/EEA

Viable Legionella pneumophila bacteria in natural soil and rainwater puddles

Investigating Legionella: PCR Screening and Whole-Genome Sequencing

Barrow-in-Furness: a large community legionellosis outbreak in the UK

Show me the evidence! Water can cause infection and illness if hospital water systems are not properly engineered and managed 6/29/2017

Deficient mannose-binding lectin-mediated complement activation despite MBL-sufficient genotypes in an outbreak of Legionella pneumophila pneumonia

pneumophila Bactericidal Effect of Calcium Oxide and Calcined Shell Calcium on Legionella

Chronic shedders as reservoir for nosocomial. transmission of norovirus

Cases in employees. Cases. Day of onset (July)

A Guideline for the Molecular Typing of Canadian Epidemic Methicillin-Resistant Staphylococcus aureus Using spa Typing.

Legionnella. You can t t see it but you can catch it. Dr Charmaine Gauci MD, MSc, PhD Head of Disease Surveillance Unit

Title. CitationGenome Announcements, 4(4): e Issue Date Doc URL. Rights. Type. File Information.

Legionnaire s disease

23/08/2015. What are we going to discuss here today? Legionella. Can dust and water harm you? Legionella Aspergillus

Chapter 2.1. Meticillin-resistant Staphylococcus aureus epidemiology and transmission in a Dutch hospital

Whole genome sequencing identifies zoonotic transmission of MRSA isolates with the novel meca homologue mecc

Preventing Legionella Transmission An Environmental Health View

Evaluation of methicillin-resistant Staphylococcus aureus (MRSA) colonization in pigs and people that work with pigs in Ontario Veterinary College

Epidemiological knowledge by genotyping Chlamydia trachomatis: an overview of recent achievements. Björn Herrmann

Legionellosis Water Management and Investigation Case Study

Legionnaires disease in Europe

Seroepidemiological Study after a Long-Distance Industrial Outbreak of Legionnaires Disease

JOURNAL OF CLINICAL MICROBIOLOGY, Sept. 1998, p Vol. 36, No. 9. Copyright 1998, American Society for Microbiology. All Rights Reserved.

Portugal s Legionella outbreak - What can we learn with it?

DIABETIC FOOT ULCERS AND Staphylococcus aureus : NEW INSIGHTS

GUIDE TO INFECTION CONTROL IN THE HOSPITAL. Carbapenem-resistant Enterobacteriaceae

Volume 13, Issue September 2008

Implementation of MenACWY vaccination because of ongoing increase in serogroup W invasive meningococcal disease, the Netherlands, 2018

ANSI/ASHRAE Standard Legionellosis: Risk Management for Building Water Systems

EUROTRAVNET SCIENCE WATCH NOVEMBER-DECEMBER 2009

Seroepidemiological study after a long-distance industrial outbreak of. Legionnaires disease

JCP Online First, published on June 17, 2013 as /jclinpath Original article

A first molecular characterization of Listeria monocytogenes isolates circulating in humans from 2009 to 2014 in the Italian Veneto region

Surveillance and outbreak investigation of Shiga toxin-producing Escherichia coli using whole genome sequencing- time for a change!

Overview of the Influenza Virus

Community acquired pneumonia due to Legionella pneumophila in a tertiary care hospital

Key words : Verotoxin-producing Escherichia coli, VTEC, EHEC, imported meat

Thesis for doctoral degree (Ph.D.) 2007 Molecular epidemiology of pneumococcal carriage and invasive disease Karin Sjöström.

THIS IS AN OFFICIAL NH DHHS HEALTH ALERT

Department of Infectious Disease Control, Municipal Public Health Service Rotterdam-Rijnmond, Rotterdam, The Netherlands 2

Characterization of community and hospital Staphylococcus aureus isolates in Southampton, UK

Burns outbreaks - the UHB experience

S. Wesley Long, MD, PhD

Burns outbreaks - the UHB experience

Environmental survival of Neisseria meningitidis


PRESENTER: DENNIS NYACHAE MOSE KENYATTA UNIVERSITY

Pseudomonas aeruginosa

Pathogens in the twilight zone: Update on emerging disease issues with implications for the pork industry

Viral Agents of Paediatric Gastroenteritis

Analysis of Massively Parallel Sequencing Data Application of Illumina Sequencing to the Genetics of Human Cancers

Request for Report for Projects Awarded in 2013 and 2014 by. Mississippi Center for Food Safety and Post-Harvest Technology

Annual Epidemiological Report

Breaking the chain of transmission: Immunisation and outbreak investigation Whelan, Jane

IS THE UK WELL PREPARED FOR A REPEAT OF THE 1918 INFLUENZA PANDEMIC?

Development and Application of an Enteric Pathogens Microarray

PART 1 FOODBORNE PATHOGEN SURVEILLANCE AND OUTBREAK INVESTIGATION

ONE STEP (Lemon, Mint, Pine) Detergent for Cleaning, Disinfecting and Deodorizing

Shedding of norovirus in patients and in persons with asymptomatic infection

WGS Works! Shared Mission Different Roles APPLICATIONS SEQUENCING (WGS) Non-regulatory. Regulatory CDC. FDA and USDA. Peter Gerner-Smidt, MD ScD

Transcription:

JCM Accepted Manuscript Posted Online 22 July 2015 J. Clin. Microbiol. doi:10.1128/jcm.01369-15 Copyright 2015, American Society for Microbiology. All Rights Reserved. 1 2 3 Whole genome mapping as a novel high-resolution typing tool for Legionella pneumophila Thijs Bosch, a # Sjoerd. M. Euser, b Fabian Landman, a Jacob P. Bruin, b Ed P. IJzerman, b Jeroen W. den Boer b and Leo M. Schouls a 4 5 6 National Institute for Public Health and the Environment, Bilthoven, the Netherlands a ; Regional Public Health Laboratory Kennemerland, Haarlem, the Netherlands b 7 8 Running head: Whole genome mapping of Legionella pneumophila 9 10 11 12 13 14 15 16 17 18 19 20 # Address correspondence to Thijs Bosch, Thijs.bosch@rivm.nl 21 22 23 24 25 1

26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 Abstract Legionella is the causative agent for Legionnaires disease (LD) and is held responsible for several large outbreaks in the world. More than 90% of LD cases are caused by Legionella pneumophila and studies on the origin and transmission routes of this pathogen rely on adequate molecular characterization of isolates. Current typing of L. pneumophila mainly depends on sequence based typing (SBT). However, studies have shown that in some outbreak situations, SBT has not the sufficient discriminatory power to distinguish between related and non-related L. pneumophila isolates. In this study, we used a novel high-resolution typing technique, named whole genome mapping (WGM), to differentiate between epidemiologically related and non-related L. pneumophila isolates. Assessment of the method by various validation experiments showed highly reproducible results and WGM was able to confirm two well-documented Dutch L. pneumophila outbreaks. Comparison of whole genome maps of both outbreaks together with WGMs of epidemiologically non-related L. pneumophila isolates showed major differences between the maps and WGM yielded a higher discriminatory power than SBT. In conclusion, WGM can be a valuable alternative to perform outbreak investigations of L. pneumophila in real time since the turnaround time from culture to comparison of the L. pneumophila maps is less than 24 hours. 43 44 45 46 47 48 49 50 Introduction Legionella is a rod-shaped, gram-negative bacterial pathogen, which is ubiquitous in aquatic reservoirs. It is the causative agent for Legionnaires disease (LD): an acute pneumonia, characterized by clinical symptoms such as cough, fever and radiological signs of infiltration that does not differ from pneumonia caused by other pathogens. LD is thought to account for 2-15% of all community-acquired pneumonias (1-3), and proves fatal in about 6% of cases (4). 2

51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 Several large outbreaks of LD have been reported worldwide. Examples are Murcia, Spain: 449 confirmed cases; Barrow-in-Furness, UK: 179 confirmed cases; Quebec City, Canada: 182 confirmed cases. These outbreaks often involved contaminated cooling towers that can infect hundreds of people within a short time-span, until the source of infection is detected and appropriate control measures are taken (5-7). In the source investigation of such outbreaks, epidemiological analyses together with genotypic comparisons between clinical and environmental isolates are essential (8, 9). More than 90% of LD cases are caused by Legionella pneumophila and as this species is commonly found in the environment, adequate typing methods are needed to differentiate between isolates in order to confirm or reject a potential source of infection (10). Sequencebased typing (SBT), a variant of the classic multi-locus sequence typing schemes (11), is an internationally recognized procedure for genotyping L. pneumophila isolates. It is a rapid, discriminatory and reproducible seven-gene molecular typing method, that is recommended as the method of choice by the ESCMID Study Group for Legionella Infections (ESGLI) (12). However, recent studies have described situations in which the SBT did not provide the discriminatory level that was needed to distinguish between outbreak-related L. pneumophila isolates and non-outbreak isolates (13). Exploring novel techniques such as next generation sequencing (NGS) has shown that NGS can provide comparable, if not better, discriminatory power within L. pneumophila isolates, although it is still too laborious and time-consuming to implement in routine surveillance (10, 13-15). Another novel molecular analysis method named whole genome mapping (WGM) has recently been used as a high-resolution typing technique (16). WGM uses high-resolution, ordered, whole genome restriction maps for comparative analysis. In a recent study, WGM successfully discriminated between isolates belonging to livestock-associated MRSA (LA- MRSA) and identified transmission and persistence of this genetically homogeneous MRSA 3

76 77 78 79 80 81 variant, where current typing techniques failed (17). Although WGM has been very successful for LA-MRSA, the number of reports in which WGM was applied for molecular typing of other bacterial pathogens is very limited (18, 19). In the study presented here, the capability of whole genome mapping to differentiate L. pneumophila isolates was investigated using epidemiologically related and non-related L. pneumophila isolates. 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 Material & Methods Strain selection and study design We selected 53 L. pneumophila isolates to generate 57 different whole genome maps (WGMs). For validation, two environmental L. pneumophila strains B9006 and D9010 were used for reproducibility and stability experiments. L. pneumophila strains A9001(derived from a patient) and D9010 were selected for comparison of whole genome maps created in our laboratory with in silico maps, based on their whole genome sequence obtained using a hybrid assembly approach of Illumina and PacBio sequencing (BaseClear, Leiden, The Netherlands). The discriminatory power of whole genome mapping for L. pneumophila and the capability to identify transmission events was studied using 35 isolates obtained during two well-documented outbreaks in the Netherlands of L. pneumophila (20, 21) and 15 epidemiologically unrelated isolates originating from different patients in the Netherlands (Table 1). These 15 isolates, as well as the three strains used for reproducibility and stability experiments (A9001, B9006, D9010) were collected within the Dutch National Legionella Outbreak Detection Program between 2002-2012. 98 99 100 All isolates used in this study came from pre-existing collections and the isolates used to create WGMs were also characterized by SBT (12, 22). 4

101 102 103 104 105 106 107 108 109 110 111 112 Whole genome mapping of L. pneumophila The input of high molecular weight DNA is required for whole genome mapping. The isolation of HMW DNA was performed using the same protocol as for S. aureus, with two modifications. First, no lysostaphine was added during the spheroplasting step and second, the incubation time of the spheroplasting step was reduced to one hour. The creation of whole genome maps and the analysis of the WGMs was performed as described previously (16). Based on the previous validation of WGM for S.aureus, a combination of a filtering setting that excluded fragments <3,000 bp from the comparison and a relative tolerance of 15% combined with an absolute tolerance of 1,000 bp were used to compensate for both the variation in sizing and the presence or absence of smaller fragments during clustering and alignment of Legionella WGMs. 113 114 115 116 117 118 119 120 121 122 123 124 Results Assessing the optimal settings for whole genome mapping of L. pneumophila To assess the optimal settings for WGM of L. pneumophila a series of validation experiments were conducted. First, the DNA sample obtained from L. pneumophila strain B9006 was used to create WGMs on three consecutive days. The obtained WGMs showed >99.5% similarity between WGMs created from the same DNA sample, while distinct maps were obtained from unrelated isolates under these conditions. Second, the temporally stability of L. pneumophila genomes under laboratory conditions was determined by sub-culturing L. pneumophila strain D9010 for 30 days and WGMs were created from DNA isolated from the day 1 and day 30 cultures. The resulting WGMs were indistinguishable, with a similarity of 99.3% between the maps. 5

125 126 127 128 129 Besides reproducibility, a comparative analysis between WGMs from L. pneumophila isolates A9001 and D9010, created in our laboratory, and their in-silico counterparts generated in the BioNumerics software was performed. The in-silico maps, based on the obtained whole genome sequences, and the WGMs obtained in the laboratory showed high similarities of 99.2% for A9001 and 98.4% for D9010. 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 Whole genome mapping confirms two well-known L. pneumophila outbreaks To assess whether WGM is capable of identifying outbreaks of L. pneumophila, isolates of two well-known L. pneumophila outbreaks from the Netherlands were subjected to WGM. The first set comprised seven isolates from an outbreak in the Amsterdam region (21). Six isolates were cultured from patients, while a single isolate was obtained from the identified source of the outbreak, a cooling tower. All isolates belonged to serogroup 1 and characterization with SBT revealed they were of ST42. The WGMs of the isolates were indistinguishable, showing a similarity of >99.9% between the most distinct maps (Figure 1). The second outbreak comprised 28 isolates obtained from the large L. pneumophila outbreak at the Westfriese flowershow in Bovenkarspel (20), of which 23 isolates originated from patients and five isolates came from different sampling points from the likely source, a whirlpool. Previous characterization showed that all isolates yielded serogroup 1 and ST62. The WGMs of the 23 patients isolates were indistinguishable with a similarity between the maps of 99.4%. All WGMs of isolates sampled from the likely source, a whirlpool, were indistinguishable from the WGMs of the isolates from patients. A comparison of both outbreaks revealed major differences between the whole genome maps and the similarity between the WGMs of the isolates from the two outbreaks was only 56.5%. Based on the reproducibility experiments, that yielded >98% similar profiles, and the result of the above described outbreaks we chose to set the cut-off value at 98% for indistinguishable 6

150 151 profiles, while isolates with similarities between 95% and 98% were considered as highly related. 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 Discriminatory power of WGM for L. pneumophila To determine the discriminatory power of WGM, maps were created of L. pneumophila isolates originating from 15 epidemiologically unrelated patients. Previous characterization with SBT revealed eight different types and eight isolates had the same ST, namely 47. Comparative analysis of the whole genome maps of the unrelated isolates together with the WGMs from both outbreaks yielded very distinctive maps with a similarity between the maps ranging from 43% to 90% (Figure 2). Four clusters were found based on a similarity cut-off of 98% for indistinguishable WGMs. The first cluster (cl-01) contained all WGMs of the Amsterdam outbreak, while cl-02 consisted of 28 of the 29 Bovenkarspel isolates and a single epidemiologically unrelated isolate. Although this unrelated isolate was separated from the Bovenkarspel isolates in the spanning tree, it had a similarity with the other maps of 98.8% and yielded the same SBT as the Bovenkarspel isolates. The other two clusters (cl-03 and cl- 04) were comprised of seven unrelated isolates (all typed as ST47). One cluster (cl-03) comprised four maps that yielded a similarity between the maps of 99.7%, while the other cluster (cl-04), consisting of three maps, showed a similarity between the most distinct maps of 99.2%. The similarity between clusters cl-03 and cl-04 was only 43% despite the fact that all isolates in these groups yielded ST47. The isolates that were not part of a cluster belonged to eight epidemiologically unrelated isolates and to a single isolate of the Bovenkarspel outbreak. 172 173 Discussion 7

174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191 192 193 194 195 196 197 198 In this study, we used whole genome mapping (WGM) as a high resolution typing method for L. pneumophila. We were able to reveal a considerable degree of variation among different L. pneumophila strains and WGM seems to be a useful typing tool to identify outbreaks. WGM was able to confirm both L. pneumophila outbreaks and link the likely source of each outbreak with the isolates obtained from patients (20, 21). The discriminatory power of WGM for L. pneumophila was best illustrated by the high degree of variation between the WGMs of two well-documented Dutch L. pneumophila outbreaks and the epidemiologically unrelated isolates, where the most distinct maps only showed a similarity of 43%. In comparison, WGM of LA-MRSA in a recent study still revealed an 84% similarity between the most distinct LA-MRSA isolates (17). To assess whether WGM is capable of identifying L. pneumophila transmission events, maps of epidemiologically unrelated isolates were compared to maps obtained from outbreaks. Analysis showed that both outbreaks form different clusters, while most unrelated isolates cluster as singletons. However, in the cluster containing the Bovenkarspel isolates, an additional non-related isolate yielding the same ST62 was present. Although the map of this isolate showed some differences with the Bovenkarspel isolates, it still groups within this cluster based on our criterion for indistinguishable isolates. Whether this strain represents the same strain as obtained during the outbreak remains unclear. However, ST62 is (after ST47) the second most frequently found ST among clinical L. pneumophila serogroup 1 isolates in the Netherlands, which makes it more likely to find an epidemiologically non-related ST62 strain with a corresponding map, compared to strains with uncommon STs. Another group of isolates for which we were not able to make a clear distinction with WGM was the group of isolates yielding ST47, resulting in two additional clusters. It could be that these two clusters contain isolates, representing the same strain, with a previously unknown epidemiological link, but it is more likely that WGM has difficulties separating strains within this ST. 8

199 200 201 202 203 204 205 206 207 208 209 210 211 212 213 214 215 216 217 However, WGM was able to split these strains in two distinct clusters with very limited similarity between the clusters, indicating that WGM has a higher discriminatory power and is better capable to assess whether isolates belong to a transmission event or not than SBT. The criterion for indistinguishable WGMs of 98% and higher was based on replicates of L. pneumophila strains created in this study. In addition, the possible effect of mobile genetic elements made us determine that maps with similarities between 95-98% may represent the same strain, while maps with a similarities <95% are considered as different strains. The same cut-off values were previously established for LA-MRSA and were also used in two other studies using WGM for MRSA with USA300 genotype and Pseudomonas (16, 18, 23). It therefore seems that these cut-off values can be utilized for WGM in general regardless the microorganism used. Current molecular characterization of L. pneumophila isolates is mainly based on SBT (12). Based on this study, WGM can be a suitable high-resolution alternative to type L. pneumophila isolates and perform transmission investigations due to its higher discriminatory power. However, with the current developments regarding next generation sequencing (NGS) we believe it is only a matter of time before NGS will become the ultimate typing tool for L. pneumophila and virtually all microorganisms. Yet, for the time being, WGM can be a valuable alternative to perform outbreak investigations in real time since the turnaround time from culture to comparison of the L. pneumophila maps is less than 24 hours. 218 219 220 221 222 223 224 225 226 227 References 1. Braun JJ, de Graaff CS, de Goey J, Zwinderman AH, Petit PL. 2004. [Community-acquired pneumonia: pathogens and course in patients admitted to a general hospital]. Ned Tijdschr Geneeskd 148:836-840. 2. Fields BS, Benson RF, Besser RE. 2002. Legionella and Legionnaires' disease: 25 years of investigation. Clin Microbiol Rev 15:506-526. 3. Sopena N, Sabria M, Pedro-Botet ML, Manterola JM, Matas L, Dominguez J, Modol JM, Tudela P, Ausina V, Foz M. 1999. Prospective study of community-acquired pneumonia of bacterial etiology in adults. Eur J Clin Microbiol Infect Dis 18:852-858. 9

228 229 230 231 232 233 234 235 236 237 238 239 240 241 242 243 244 245 246 247 248 249 250 251 252 253 254 255 256 257 258 259 260 261 262 263 264 265 266 267 268 269 270 271 272 273 274 275 276 277 278 4. Joseph CA, Ricketts KD, European Working Group for Legionella I. 2010. Legionnaires disease in Europe 2007-2008. Euro Surveill 15:19493. 5. Bennett E, Ashton M, Calvert N, Chaloner J, Cheesbrough J, Egan J, Farrell I, Hall I, Harrison TG, Naik FC, Partridge S, Syed Q, Gent RN. 2014. Barrow-in-Furness: a large community legionellosis outbreak in the UK. Epidemiol Infect 142:1763-1777. 6. Garcia-Fulgueiras A, Navarro C, Fenoll D, Garcia J, Gonzalez-Diego P, Jimenez-Bunuales T, Rodriguez M, Lopez R, Pacheco F, Ruiz J, Segovia M, Balandron B, Pelaz C. 2003. Legionnaires' disease outbreak in Murcia, Spain. Emerg Infect Dis 9:915-921. 7. Levesque S, Plante PL, Mendis N, Cantin P, Marchand G, Charest H, Raymond F, Huot C, Goupil-Sormany I, Desbiens F, Faucher SP, Corbeil J, Tremblay C. 2014. Genomic characterization of a large outbreak of Legionella pneumophila serogroup 1 strains in Quebec City, 2012. PLoS One 9:e103852. 8. Chiarini A, Bonura C, Ferraro D, Barbaro R, Cala C, Distefano S, Casuccio N, Belfiore S, Giammanco A. 2008. Genotyping of Legionella pneumophila serogroup 1 strains isolated in Northern Sicily, Italy. New Microbiol 31:217-228. 9. Fry NK, Bangsborg JM, Bernander S, Etienne J, Forsblom B, Gaia V, Hasenberger P, Lindsay D, Papoutsi A, Pelaz C, Struelens M, Uldum SA, Visca P, Harrison TG. 2000. Assessment of intercentre reproducibility and epidemiological concordance of Legionella pneumophila serogroup 1 genotyping by amplified fragment length polymorphism analysis. Eur J Clin Microbiol Infect Dis 19:773-780. 10. Underwood AP, Jones G, Mentasti M, Fry NK, Harrison TG. 2013. Comparison of the Legionella pneumophila population structure as determined by sequence-based typing and whole genome sequencing. BMC Microbiol 13:302. 11. Brehony C, Jolley KA, Maiden MC. 2007. Multilocus sequence typing for global surveillance of meningococcal disease. FEMS Microbiol Rev 31:15-26. 12. Gaia V, Fry NK, Afshar B, Luck PC, Meugnier H, Etienne J, Peduzzi R, Harrison TG. 2005. Consensus sequence-based scheme for epidemiological typing of clinical and environmental isolates of Legionella pneumophila. J Clin Microbiol 43:2047-2052. 13. Graham RM, Doyle CJ, Jennison AV. 2014. Real-time investigation of a Legionella pneumophila outbreak using whole genome sequencing. Epidemiol Infect 142:2347-2351. 14. Reuter S, Harrison TG, Koser CU, Ellington MJ, Smith GP, Parkhill J, Peacock SJ, Bentley SD, Torok ME. 2013. A pilot study of rapid whole-genome sequencing for the investigation of a Legionella outbreak. BMJ Open 3. 15. Sabat AJ, Budimir A, Nashev D, Sa-Leao R, van Dijl J, Laurent F, Grundmann H, Friedrich AW, Markers ESGoE. 2013. Overview of molecular typing methods for outbreak detection and epidemiological surveillance. Euro Surveill 18:20380. 16. Bosch T, Verkade E, van Luit M, Pot B, Vauterin P, Burggrave R, Savelkoul P, Kluytmans J, Schouls L. 2013. High Resolution Typing by Whole Genome Mapping Enables Discrimination of LA-MRSA (CC398) Strains and Identification of Transmission Events. PLoS One 8:e66493. 17. Bosch T, Verkade E, van Luit M, Landman F, Kluytmans J, Schouls LM. 2015. Transmission and persistence of livestock-associated methicillin-resistant Staphylococcus aureus among veterinarians and their household members. Appl Environ Microbiol 81:124-129. 18. Boers SA, Burggrave R, van Westreenen M, Goessens WH, Hays JP. 2014. Whole-genome mapping for high-resolution genotyping of Pseudomonas aeruginosa. J Microbiol Methods 106:19-22. 19. Kotewicz ML, Jackson SA, LeClerc JE, Cebula TA. 2007. Optical maps distinguish individual strains of Escherichia coli O157 : H7. Microbiology 153:1720-1733. 20. Den Boer JW, Yzerman EP, Schellekens J, Lettinga KD, Boshuizen HC, Van Steenbergen JE, Bosman A, Van den Hof S, Van Vliet HA, Peeters MF, Van Ketel RJ, Speelman P, Kool JL, Conyn-Van Spaendonck MA. 2002. A large outbreak of Legionnaires' disease at a flower show, the Netherlands, 1999. Emerg Infect Dis 8:37-43. 10

279 280 281 282 283 284 285 286 287 288 289 290 21. Sonder GJ, van den Hoek JA, Bovee LP, Aanhane FE, Worp J, Du Ry van Beest Holle M, van Steenbergen JE, den Boer JW, Ijzerman EP, Coutinho RA. 2008. Changes in prevention and outbreak management of Legionnaires disease in the Netherlands between two large outbreaks in 1999 and 2006. Euro Surveill 13. 22. Ratzow S, Gaia V, Helbig JH, Fry NK, Luck PC. 2007. Addition of neua, the gene encoding N- acylneuraminate cytidylyl transferase, increases the discriminatory ability of the consensus sequence-based scheme for typing Legionella pneumophila serogroup 1 strains. J Clin Microbiol 45:1965-1968. 23. Shukla SK, Pantrangi M, Stahl B, Briska AM, Stemper ME, Wagner TK, Zentz EB, Callister SM, Lovrich SD, Henkhaus JK, Dykes CW. 2012. Comparative whole-genome mapping to determine Staphylococcus aureus genome size, virulence motifs, and clonality. J Clin Microbiol 50:3526-3533. 291 292 293 Table legend Table 1. Bacterial strains used in this study. 294 295 296 297 298 299 Figure legends Figure 1. Whole genome mapping of L. pneumophila of two well-documented outbreaks. The complete WGMs are displayed. The dendrogram on the left denotes the clusters A and B, and their similarities. On the right hand side, outbreak, SBT and origin of the isolates are indicated. 300 301 302 303 Figure 2. Minimum spanning tree depicting the genotypic diversity of L. pneumophila isolates (n = 50). Each node represents the WGM of a single L. pneumophila isolate. The halos represent clusters based on a similarity cut-off of 98%. 304 11

Experiment Isolates (n*) Sequence type (n*) WGMs created Reproducibility of WGM B9006 477 3 Stability of WGM D9010 47 2 Comparison with in-silico maps A9001 D9010 Legionella transmission events Bovenkarspel (28) Amsterdam (7) Discriminatory power of WGM Unrelated Legionella isolates (15) 23, 37, 47 (8), 62, 444, 485, 493, 524 Total 53 isolates 57 WGMs * n = number of isolates 45 47 62 42 1 1 28 7 15

1000k 1500k 2000k 2500k 3000k 3500k 4000k Outbreak SBT Origin Cooling tower A. 99.9% Amsterdam 42 Patients Patients Whirlpool sample 1 56.5% Bovenkarspel 62 Patients B. 99.4% Whirlpool samples 2,3,4 Patients Whirlpool sample 5 Patient Figure 1. Whole genome mapping of L. pneumophila of two well-documented outbreaks. The complete WGMs are displayed. The dendrogram on the left denotes the clusters A and B, and their similarities. On the right hand side, outbreak, SBT and origin of the isolates are indicated.

ST524 ST47 ST23 cl-04, ST47 ST444 54% ST62 41% cl-02, ST62 39% cl-03, ST47 39% cl-01, ST42 ST493 ST37 ST485 Bovenkarspel patients Bovenkarspel source Amsterdam patients Amsterdam source Unrelated patients Figure 2. Minimum spanning tree depicting the genotypic diversity of L. pneumophila isolates (n = 50). Each node represents the WGM of a single L. pneumophila isolates. The halos represent clusters based on a similarity cut-off of 98%.