SRA STUDY
SRA Study Id ERP014074  (Link to NCBI )
Study Title The evaluation of whole genome sequencing for epidemiological typing of Legionella pneumophila
SRA Experiments
SRA Experiment Id ERX1304419  (Link to NCBI )
Experiment Title Illumina HiSeq 2500 paired end sequencing
SRA Submission ID 2202313
Exp Library Strategy WGS
Library Source GENOMIC
Library Selection RANDOM
Library Layout PAIRED
Library Instrument Illumina HiSeq 2500
   
SRA Experiment Id ERX1304418  (Link to NCBI )
Experiment Title Illumina HiSeq 2500 paired end sequencing
SRA Submission ID 2202312
Exp Library Strategy WGS
Library Source GENOMIC
Library Selection RANDOM
Library Layout PAIRED
Library Instrument Illumina HiSeq 2500
   
SRA Experiment Id ERX1304417  (Link to NCBI )
Experiment Title Illumina HiSeq 2500 paired end sequencing
SRA Submission ID 2202311
Exp Library Strategy WGS
Library Source GENOMIC
Library Selection RANDOM
Library Layout PAIRED
Library Instrument Illumina HiSeq 2500
   
SRA Experiment Id ERX1304416  (Link to NCBI )
Experiment Title Illumina HiSeq 2500 paired end sequencing
SRA Submission ID 2202310
Exp Library Strategy WGS
Library Source GENOMIC
Library Selection RANDOM
Library Layout PAIRED
Library Instrument Illumina HiSeq 2500
Study Abstract Sequence-based typing (SBT) is the current gold-standard typing method for investigation of legionellosis outbreaks caused by Legionella pneumophila. However, as common sequence types (STs) cause a large number of infections, some investigations remain unresolved by SBT. Here, a number of whole genome sequencing (WGS)-based methods were evaluated as potential typing tools including: i) a SNP-based approach; ii) extended multi-locus sequence typing (MLST) using various numbers of genes; iii) a gene presence/absence-based method; iv) a kmer-based method. They were tested using 106 L. pneumophila serogroup 1 isolates from the standard “typing panel”, as previously used by the European Society for Clinical Microbiology Study Group on Legionella Infections (ESGLI), and an additional 229 isolates.Over 98% isolates were considered typable using the mapping- and kmer-based methods. The percentage of isolates with complete extended MLST profiles ranged from 99.1% (50-gene scheme) to 86.8% (1455-gene scheme) while only 41.5% produced a full profile with the gene presence/absence scheme. Sequencing replicates demonstrated that all WGS-based methods offer 100% reproducibility. When one or more differences yield different types, the methods produce indices of discrimination ranging from 0.972 (ribosomal MLST) to 0.999 (SNP-based method), and all values are higher than that achieved with SBT (0.940). The epidemiological concordance is generally inversely related to the index of discrimination, with the most discriminatory methods failing to provide sufficient epidemiological concordance. Based on these analyses, we propose that an extended MLST scheme with ~50-100 genes maintains good epidemiological concordance while substantially improving on the discrimination offered by SBT, and has the potential to become a new “gold standard” typing method for L. pneumophila.
Alias legionella_wgs_typing
External Id BioProject=PRJEB12585