Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Genomic analysis of shiga toxin-containing Escherichia coli O157:H7 isolated from Argentinean cattle

  • Ariel Amadio ,

    Contributed equally to this work with: Ariel Amadio, James L. Bono

    Roles Data curation, Funding acquisition, Investigation, Methodology, Project administration, Resources, Software, Validation, Visualization, Writing – original draft, Writing – review & editing

    Affiliation Instituto de Investigación de la Cadena Láctea IDICaL (INTA-CONICET), Rafaela, Argentina

  • James L. Bono ,

    Contributed equally to this work with: Ariel Amadio, James L. Bono

    Roles Conceptualization, Data curation, Funding acquisition, Investigation, Methodology, Resources, Software, Visualization, Writing – review & editing

    Affiliation U.S Meat Animal Research Center, Agricultural Research Service, U.S. Department of Agriculture, Clay Center, Nebraska, United States of America

  • Matías Irazoqui,

    Roles Formal analysis, Investigation

    Affiliation Instituto de Investigación de la Cadena Láctea IDICaL (INTA-CONICET), Rafaela, Argentina

  • Mariano Larzábal,

    Roles Data curation, Investigation, Resources, Visualization, Writing – review & editing

    Affiliation Instituto de Agrobiotecnología y Biología Molecular (IABIMO)-CICVyA, Instituto Nacional de Tecnología Agropecuaria (INTA), Consejo Nacional de investigaciones Científicas y Tecnológicas (CONICET), Hurlingham, Argentina

  • Wanderson Marques da Silva,

    Roles Data curation, Investigation, Visualization, Writing – review & editing

    Affiliation Instituto de Agrobiotecnología y Biología Molecular (IABIMO)-CICVyA, Instituto Nacional de Tecnología Agropecuaria (INTA), Consejo Nacional de investigaciones Científicas y Tecnológicas (CONICET), Hurlingham, Argentina

  • María Florencia Eberhardt,

    Roles Data curation, Formal analysis, Investigation

    Affiliation Instituto de Investigación de la Cadena Láctea IDICaL (INTA-CONICET), Rafaela, Argentina

  • Nahuel A. Riviere,

    Roles Data curation, Investigation, Visualization, Writing – review & editing

    Affiliation Instituto de Agrobiotecnología y Biología Molecular (IABIMO)-CICVyA, Instituto Nacional de Tecnología Agropecuaria (INTA), Consejo Nacional de investigaciones Científicas y Tecnológicas (CONICET), Hurlingham, Argentina

  • David Gally,

    Roles Conceptualization, Funding acquisition, Writing – review & editing

    Affiliation Division of Immunity and Infection, The Roslin Institute and R(D)SVS, The University of Edinburgh, Easter Bush, Midlothian, United Kingdom

  • Shannon D. Manning,

    Roles Formal analysis, Investigation, Writing – review & editing

    Affiliation Department of Microbiology and Molecular Genetics, Michigan State University, East Lansing, Michigan, United States of America

  • Angel Cataldi

    Roles Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Methodology, Project administration, Resources, Supervision, Writing – original draft, Writing – review & editing

    cataldi.angeladrian@inta.gob.ar

    Affiliation Instituto de Agrobiotecnología y Biología Molecular (IABIMO)-CICVyA, Instituto Nacional de Tecnología Agropecuaria (INTA), Consejo Nacional de investigaciones Científicas y Tecnológicas (CONICET), Hurlingham, Argentina

Abstract

Cattle are the main reservoir of Enterohemorrhagic Escherichia coli (EHEC), with O157:H7 the distinctive serotype. EHEC is the main causative agent of a severe systemic disease, Hemolytic Uremic Syndrome (HUS). Argentina has the highest pediatric HUS incidence worldwide with 12–14 cases per 100,000 children. Herein, we assessed the genomes of EHEC O157:H7 isolates recovered from cattle in the humid Pampas of Argentina. According to phylogenetic studies, EHEC O157 can be divided into clades. Clade 8 strains that were classified as hypervirulent. Most of the strains of this clade have a Shiga toxin stx2a-stx2c genotype. To better understand the molecular bases related to virulence, pathogenicity and evolution of EHEC O157:H7, we performed a comparative genomic analysis of these isolates through whole genome sequencing. The isolates classified as clade 8 (four strains) and clade 6 (four strains) contained 13 to 16 lambdoid prophages per genome, and the observed variability of prophages was analysed. An inter strain comparison show that while some prophages are highly related and can be grouped into families, other are unique. Prophages encoding for stx2a were highly diverse, while those encoding for stx2c were conserved. A cluster of genes exclusively found in clade 8 contained 13 genes that mostly encoded for DNA binding proteins. In the studied strains, polymorphisms in Q antiterminator, the Q-stx2A intergenic region and the O and P γ alleles of prophage replication proteins are associated with different levels of Stx2a production. As expected, all strains had the pO157 plasmid that was highly conserved, although one strain displayed a transposon interruption in the protease EspP gene. This genomic analysis may contribute to the understanding of the genetic basis of the hypervirulence of EHEC O157:H7 strains circulating in Argentine cattle. This work aligns with other studies of O157 strain variation in other populations that shows key differences in Stx2a-encoding prophages.

Introduction

Enterohemorrhagic Escherichia coli (EHEC), with cattle as the primary animal reservoir, is a globally important foodborne pathogen capable of causing hemorrhagic colitis and hemolytic uremic syndrome (HUS) in humans. EHEC infection in humans can occur via ingestion of food products of bovine origin such as meat, dairy, or by contact with fecal contaminated fruits, vegetables and water sources [1]. Due to its low infectious dose (<100 CFU) and severe clinical outcomes, EHEC infections are considered a serious public health concern. While numerous EHEC serotypes are linked to human infections, strains of serotype O157:H7 cause more severe clinical symptoms, and even HUS, than other EHEC serotypes [2, 3]. HUS caused by EHEC O157:H7 has been reported worldwide [4], with the highest incidence in Argentina [5]. The assessment of molecular characteristics governing pathogenicity and virulence of Argentine EHEC isolates is therefore essential.

The key virulence determinant in EHEC O157:H7 is Shiga toxin (Stx) production, which directly contributes to HUS. Stx genes are encoded on lambdoid phages and EHEC strains can contain one or two Stx subtypes including type 1 (Stx1) or type 2 (Stx2) [6]. Multiple stx variants, stx1 (stx1a, stx1c, and stx1d) and stx2 (stx2a, stx2b, stx2c, stx2d, stx2e, stx2f, stx2g [7] stx2h [8] stx2i [9]) and stx2k [10]) have been reported. According to epidemiological studies, Stx2-producing strains and especially Stx2a, are associated with more severe cases of infection than Stx1-producing strains [1113].

Another central virulence attribute of EHEC O157:H7 is the pathogenicity island encoding the locus of enterocyte effacement (LEE) [14, 15], which contains genes for a type 3 secretion system (T3SS) and intimin (eae), a gene critical for the histological attaching and effacing (A/E) lesions characteristic of EHEC O157:H7. In addition, other important genes involved in EHEC O157 virulence are within the pO157 plasmid [16]. Other factors not yet characterized, however, may also be essential for the full virulence of EHEC O157:H7.

According to phylogenetic studies, EHEC O157 can be divided into three main lineages and nine clades [17, 18]. Various clinical cases on multiple continents and countries have been associated with the clade 8 EHEC strains (I/II lineage). For instance, in Argentina a high prevalence of clade 8 strains was evident in cattle and humans [19, 20]. In the USA, clade 8 strains have been associated with more severe human disease, which led researchers to define it as hypervirulent [18]. Although clade 8 strains had higher Stx2a expression levels relative to strains from other clades, along with unique genetic features [11, 21, 22], to date the factors associated with the increased virulence and Stx2a production are not completely understood. Interestingly, the clade 8 EHEC O157:H7 isolates obtained from bovines in Argentina had a certain degree of genetic variability and displayed variable virulence in in vitro and in vivo assays [20].

To better understand the molecular bases related to virulence, pathogenicity and evolution of Argentinian EHEC O157:H7, we performed a comparative genomic analysis of these isolates through whole genome sequencing. Notably, we identified conserved regions, insertions/deletions (in/dels) and inversions, as well as variations within core and accessory genes. We also report on the distribution of key virulence genes and the presence of lambdoid phages and plasmids, further highlighting the diversity of the analyzed EHEC O157:H7 isolates.

Material and methods

Strains

Eight EHEC O157:H7 isolates previously recovered from cattle of the central Humid Pampas of Argentina between 2002 to 2011, were examined in this study (Table 1). Isolates were previously classified into clades by evaluating a subset of 23 single nucleotide polymorphisms (SNPs) [20]from the original 96 SNPs shown to differentiate each clade [18] following annotation (Table 1). In addition, six reference EHEC O157:H7 genomes were included in the analysis: EDL933 accession number CP008957 [23], Sakai (NC_002695, [24]), TW14359 (NC_013008, [25]), TW14588 (NZ_CM000662, [26]), 644-PT8 (NZ_CP015831, [27]), SS52 (NZ_CP010304, [28]). The bacteria were grown at 37°C on Luria-Bertani (LB, Difco Laboratories, USA) agar plates or aerobically in LB broth.

DNA preparation and PacBio whole-genome sequencing

An aliquot of 250 μl of an overnight culture of the different EHEC O157:H7 strains was added to 20 mL of LB and incubated at 37°C with shaking for 3 ½ h. The bacteria were then harvested for DNA extraction using Genomic-Tip 100/g (Qiagen Inc Valencia, CA. DNA (10 ug/ml) were sheared to a targeted size of 30 kb using a g-TUBE (Corvaris, Woburn, MA) and subsequently concentrated using 0.45X volume of AMPure PB magnetic beads (Pacific Biosciences, Menlo Park, CA) according to the manufacturer’s protocol.

Sequencing libraries were created using 5 μg of sheared, concentrated DNA and the PacBio SMRTbell Template Prep Kit 1.0, according to the manufacturer’s protocol. Each library was sequenced using the RS II sequencing platform (Pacific Biosciences, PacBio) with the P6/C4 sequencing chemistry and the 360 min data collection protocol.

PacBio sequence assembly into closed circularized genomes and annotation

PacBio reads were assembled using HGAP3 (SMRTanalysis Version 3.0, Pacific Biosciences) and the resulting contigs were imported into Geneious software (Biomatters, Ltd., Auckland, New Zealand). If present, overlapping sequences on the ends of the contigs were removed from the 5’ and 3’ ends to generate circularized chromosomes and plasmids using the software Geneious (Biomatters, Ltd.). The closed chromosomes were reoriented to start with the putative origin of replication using Ori-Finder 2 [29]. The closed chromosomes and plasmids were polished twice for accuracy using the RS_Resequencing 1.0 protocol in SMRTanalysis and by mapping error corrected PacBio reads to the chromosomes and plasmids using Geneious software (Biomatters, Ltd.). Genome sequences are deposited in GeneBank (BioProject PRJNA280853). Automatic annotations were performed using Prokka [30] against a reference set of six O157:H7 genomes as primary annotations described in (see the Strain section in M&M). Protein sequences were extracted from those genomes and clustered at 95% identity using cd-hit [31] to create a trusted source for annotation for Prokka.

Whole genome comparison

Genomes were compared using BLAST+ [32] and visualized in Artemis Comparison Tool software [33]. Comparison files were generated online at WebACT [34] or locally.

Prophage prediction and extraction of virulence genes

Phaster webserver was used to predict putative prophage sequences in the bacterial genomes [35]. Subsequently, prophage sequences, including Stx prophages, were extracted from the bacterial genome sequences using Artemis [36]. Only prophages identified as complete and questionable by Phaster software were considered for the analysis. Prophages were named R1 to Rn as they appear from 5´to 3´ in the linear representation of the chromosome (with origin of replication at the 5’ end). For example, RafaelaII-R2 means it is the second phage identified in strain RafaelaII.

Prophage comparisons were carried out using Blast+ [32] as follows: each prophage sequence was extracted and used as a query for comparison with all prophages from the same strain (Intra-strain) or from the other strains (Inter-strain). The output was processed with custom Perl scripts (https://github.com/arielamadio/Ecoli_parsing_data) to obtain an average of identity for the High Scoring Pairs (HSPs) and the coverage for each prophage as the ‘qcovus’ parameter (query coverage per unique subject). To have an overall estimation of phage similarity across strains, prophage comparisons were performed with settings Query Coverage Per Subject 80% and not restricting the other parameters (query Coverage Per Unique Subject, Average Identity and HSPs number). These criteria were applied in inter-strain prophage diversity section. To classify the prophages into families by similarity, comparisons were performed with hits Query Coverage Per Subject 80% and query Coverage Per Unique Subject 90% and not restricting the other parameters (Average Identity and HSPs number). Also, a maximum tolerance of 8kb between query and subject sizes was selected.

An even stricter relatedness criterion (query coverage 80% and Query coverage Per Unique Subject 90% and gaps and inversions < 4kb) was established to identify distributed identical prophages. For the nomenclature of these identical prophages, the name of the strains with the identical prophage is followed by the letter and number of the family at the end of the name. For example, TW14359-R3 denotes the third phage in TW14359 and belongs to family R and would be considered identical to Vac07.1-R3, RafaelaII-R3 and Balcarce14.2-R3 (S1 Table).

Virulence gene sequences were identified using Virulence Finder 2.0 [37], with a nucleotide identity threshold of 90% and a minimum length of 60%. The virulence genes considered in this analysis were those encoded on the LEE (tir, eae, espA, espB and espF) or those that coded for toxins (stx2 and stx1), effectors (tccP, espJ, nleA, nleB1, nleB2 and nleC), stress resistance proteins (gadA and gadB) and the adhesin iha [37].

Prediction of clusters of orthologous genes and pan-genomic analysis

Roary [38] was used to identify the pan-genome from the genomes of strains sequenced in this work and the set of references used for annotation. All genomes were re-annotated with Prokka (with a reference set of proteins) previous to running the pangenome pipeline to standardize the annotations.

Comparisons of gene content between clades were performed using “difference” method in Roary and by specifying the groups to compare. A tree was constructed with the alignment of the core genome using RAxML [39] with GTRGAMMA as model and 1000 bootstraps. Clade assignment by SNP analysis were determined with NucDiff [40] by comparing the complete genomes with the strain Sakai as a reference. Then, the 23 SNPs corresponding to the genotyping panel [18] were extracted from the comparison output to subsequently infer the strain genotypes and clades.

Shiga-toxin detection and cytotoxicity in Vero cells

Stx expression was determined for filtered culture supernatants with or without mitomycin C induction, by using RIDASCREEN Kit Verotoxin enzyme immunoassay (R-Biopharm Latin America) and then semi-quantified, as previously described [20]. To measure Vero cell cytotoxicity, the strains were cultured overnight at 37°C in 5 ml of LB broth and the culture was centrifuged at 200 rpm. The supernatant was filtered (0.22 μm filters) and subsequently assayed for cytotoxicity on Vero cells, as previously described [20].

Results and discussion

General genomic features of the EHEC O157:H7 isolates

Complete chromosomes and plasmids were obtained for all but one of the eight analysed strains (Table 2). The chromosome of Strain 438–99 assembled as two contigs that could not be resolved further. Chromosome sizes and gene content of the sequenced strains ranged from 5415530 bp to 5750490 bp and 5193 ORFs to 5599 ORFs, for strains 7.1Anguil and 9.1Anguil, respectively. All the strains characterized here and the reference strain TW14359 belong to lineage I/II, except for the reference strain EDL933, which is from lineage I [41, 42]. Whole genome multiple alignments showed a high level of conservation around the origin of replication and variation around the replication terminus. Most of the In/Dels and inversions were associated with the presence of lambdoid prophages (Fig 1), and located mostly around the terminus of replication of the genome. EHEC O157:H7 7.1Anguil had a large, inverted region comprising 5 Mb, except at the origin of genome replication. This inversion can also be interpreted as involving the origin of replication, but that is against the convention of linear representation of genomes using ori as the first nucleotide. Two approaches followed to assess and confirm the presence of this inversion. The first approach consisted of a manual inspection of long reads, whereas the second involved performing long PCR with two pairs of primers located outside of the inversion (and the repeated region) (S1 Fig) to amplify a 6kb region on both sides of the inversion point. Importantly, the use of PacBio long read sequencing technology allowed us to reconstruct complete finished circular chromosomes and plasmids for most of the analysed isolates, with the subsequent possibility of including genome features, such as phage content and position, in the analysis.

thumbnail
Fig 1. Schematic lineal representation of aligned EHEC O157 genomes.

The origin of replication is placed at the 5´and 3’end. Lambdoid prophages are represented as boxes of blue color, except for those carrying stx genes. In those cases, stx1, stx2a and stx2c prophages were indicated in yellow, red and light blue boxes, respectively. Artemis comparison tool was used to indicate regions of similarity (red), inversions (blue) and In/dels (clear).

https://doi.org/10.1371/journal.pone.0258753.g001

thumbnail
Table 2. Chromosome and plasmid characteristics from the sequenced strains and two previously published strains.

https://doi.org/10.1371/journal.pone.0258753.t002

The strains 146N4 and 438–99 from clade 6 presented several reorganizations in the terminus region of the chromosome relative to other strains. The reorganized regions were contained between prophages R11 and R12 in 146N4 and R9 to R11 in 438–99. The chromosomal sequence is conserved in the inverted regions among all strains with the difference in the chromosomal architecture. On the other hand, 7.1 Anguil, has an inversion that isn’t not associated with a phage region but with a ribosomal RNA region. This large inversion the sub-region between R12-R13 from 438–99 is moved to the 5’ of the regions for all strains. This sub-region, however, is inverted between 146N4 and 438–99 and is different from that of other strains. In addition, 438–99, EDL933 and 7.1Anguil shared the orientation of this sub-region, whereas 146N4 shared the direction of the remaining strains (Fig 1).

The number of inserted prophages detected by Phaster, varied from 13 to 16 per genome. Lambdoid prophages were located between 1.2 Mb to 4.60 Mb of the genome coordinates. 149 prophages were identified in the strains, including the reference strains (S1 Table). The number and sequence variability of lambdoid prophages among our EHEC O157:H7 strains supports previous reports [11, 12, 4346]. The order of prophages in the genome has some conservation, as they occupy main preferential integration sites. Most of the sequenced strains had two Stx-converting lambdoid prophages, except EHEC O157:H7 438–99 that had one Stx2c-converting lambdoid prophage, and 9.1Anguil (clade 8) that had probably lost the Stx2a prophage. Stx2a encoding prophages were integrated at argW tRNA in clade 8 strains and 7.1Anguil, whereas it was integrated at yehV/mirA and wrbA in Balcarce24.2 and EDL933, respectively. The gene content of prophages encoding for stx2a are hypervariable. No stx2a containing prophage exhibited more than 70% conservation among the analysed genomes.

There was no genomic inversion larger than 50 kb among the clade 8 strains, in contrast to those of the strains of other clades. E. coli inversions are located around the terminus of replication, a region that seems prone to rearrangements [47] and Fitzgerald, personal communication. The 7.1Anguil strain showed the largest inversion in rRNA genes, which is a well described event in E. coli [48]. By contrast, the locations of the inversions for the remaining strains were restricted to lambdoid phages, which have an ad-hoc machinery of recombination that promotes inversions [49].

Prophage diversity

Inter-strain prophage diversity: some strains contained ubiquitous prophages with high level of similarity in relation to prophages from all strains. Balcarce24.2 and 7.1Anguil had five prophages with similarity hits in all strains whereas some clade 8 strains (Vac07.1, Tw14359, 9.1Anguil) had four conserved prophages (S2 Fig).

Intra-strain prophage diversity similarities only occurred with less than 85% coverage and 85% identity, as determined by blastn with all the prophages from a given strain. This is expected due to the immunity system of lambdoid phages [50]. However, a set of prophages had similarities covering between 70% to 75% of their sizes in other prophages in the same strain. For example, in the strains Vac07.1, Tw14359 and Balcarce14.2, prophages R13 and R6 were similar, whereas in 9.1Anguil, prophage R12 was similar to R5. Similarly, intra-strain prophage similarity was observed with R14 and R5 in 438–99; R13 and R4 in EDL933, R13 and R10 in RafaelaII and R12 and R5 in Balcarce24.2. Also, 438–99 showed two adjacent prophages (R6 and R7) with high level of similarity, which did not occur in any other strain. 438–99 was the strain with the highest level of intra strain phage similarity. Strains with less intra strain phage similarity were Balcarce14.2 and RafaelaII strains (S2 Table).

In order to identify highly related prophages across the strains, stricter relatedness parameters were used. Including visual inspection for large gaps and inversions in dot plots of paired genomes. The analysis identified 21 families with two to ten prophages in each family that accounted for 120 prophages (S1 Table). The remaining 29 prophages were unique to a particular strain. The families with 6 to 10 prophages are described in (S3 Table). Beside these families there are another 10 smaller families composed of 2 to 5 prophages. Interestingly, in some families, prophages have relevant metabolic or regulatory genes that are located 5’ or 3’ of prophage genes but are located between the predicted attL and attR integration sites (S3 Table).

Unique prophages.

Some prophages are unique to a single strain (with no other hits above the established cut off) (S1 Table, S2 Fig). Strains with more unique prophages are 146N4 (six) and Balc24.2 (four). Clade 8 strains tend to have a lower number of unique prophages, but this may be due to the higher frequency of clade 8 strains in the study.

Stx2 encoding prophages.

The diversity of Stx2a encoding prophages is reflected by the fact that prophages R2 from TW14359, Vac07.1 and Balcarce14.2 belong to family B, R3 from Balcarce24.2 belongs to family C and R2 from RafaelaII, R12 from EDL933 and R5 from 7.1Anguil are unique. This result strongly differed from that of prophages encoding stx2c that belongs to family E. In contrast to the stx2a containing prophage, the prophages containing the stx2c gene were highly conserved among the studied strains. All nine stx2c prophages showed an average similarity of 97.7% ±2.13%, whereas the seven stx2a prophages displayed an average similarity of 60.5% ± 6.3%.

Identical prophages.

We determined if the same prophage is integrated in the genome of different strains using a stricter relatedness criterion than that for grouping the prophages in families (S1 Table), We identified 16 shared types of prophages (identical prophages in sequence and size present in more than one strain).

The most distributed shared types in this set of strains were TW14359F6 (in all ten strains), TW14359E5 (eight strains) as well as TW14359J10 TW14359K11 and TW14359M13 (all present in seven strains). With these prophage sequences, we performed megablast against nr NCBI database with highly restrictive search parameters coverage 98%, identity 99% (S4 Table). The shared types most frequently found were Tw14359N14, Tw14359J10; Tw14359A1; Tw14359F6 with 179, 175, 170 and 160 hits, respectively followed by other less represented shared type prophages in the nr database. This method does not allow to determining the size of the targeted prophages in the nr database but indicates that the sequence of the prophages studied here are found as complete prophages in other genomes.

As reported by other authors [11, 12, 4346], extreme phage diversity was observed probably because lambdoid phages are equipped with specialized recombination machinery as Exo, Bet, Gam and other proteins [50]. However, in spite of this general diversity they could be grouped by sequence alignments in families of high similarity and even in individual identical sequences shared by different genomes. A tendency was observed that shorter prophages are likely to be associated with higher clustering (S3 Fig), as previously described by Shaaban et al. [46]. Conversely, Larger prophages (>80 kb) were overrepresented among the unique prophages.

Pan genomic analysis reveals regions exclusive to clade 8 strains

The genomes of all sequenced strains and the six strains used as references had 4,415 core genes and a pangenome of 7,881 genes. Roary was used to identify a set of genes only present in the clade 8 genomes. Although no long contiguous exclusive fragments common to all clades 8 strains were detected in the analysis, a short contiguous gene cluster of ten genes was exclusive to this clade. In RafaelaII strain (used as a prototype clade 8), this cluster corresponds to genes from RafaelaII_01805 to 17 and showed 100% identity and conserved synteny in the other clade 8 strains studied here. This segment is located in R3 prophage (family C) of RafaelaII strain (Table 3).

The pangenome analysis, even using a small number of genomes, was consistent with previous studies. For instance, the 4415 core genes are very similar in number to the 4369 core-genome identified in 185 strains from the UK [51]. A thorough analysis of the hypervirulent clade 8 strains to search for exclusive regions revealed only one short region that encode for DNA binding proteins, putative RNA binding domains and a ren protein involved in phage exclusion and protection from RecAB recombinase [52]. The product of many of these genes may have a regulatory effect of prophage gene expression as three of the genes are annotated as transcriptional regulators.

SNPs/Phylogeny.

A tree constructed using all genes corresponding to the core genome of the analysed and reference strains retrieved four major clusters (Fig 2). One clustered contained reference strains Sakai, EDL933 and TW14588, from clades 1, 3 and 2, respectively. The second cluster contained clade 6 strains 7.1Anguil, Balcarce24.2 and the non-stx2a 438–99 and 146N4. A third cluster consisted of clade 8 isolates Vac07.1 and RafaelaII. A fourth cluster comprised clade 8 strains 9.1Anguil, Balcarce14.2 and reference strains TW14359 and SS52.

thumbnail
Fig 2. Phylogenetic tree constructed with the core genome from the Argentinian strains and previously published strains using RAxML.

Bootstrap values are located at each branch point.

https://doi.org/10.1371/journal.pone.0258753.g002

Plasmid pO157

The isolates contained the megaplasmid pO157 that showed a high degree of conservation (Fig 3). EDL933 pO157 plasmid was smaller (92 kb), whereas the pO157 from other strains ranged from 94.5 to 96.5 kb. This longer length compared to pO157-EDL933 is due to a transposon and IS elements insertions after espP and before the lipid A modification enzyme genes cluster (Fig 3). Other genetic differences also related to the insertion of transposons in pO157 include the pO157 from strain 146N4. It had two transposons inserted at position 80800 at the 5´of the espP gene that resulted in two EspP proteins: EspP1 comprising the first 287 N-terminal residues and EspP2 composed of the last 1080 C-terminal fragment. These two fragments are probably non-functional as the segmentation interrupted the Peptidase S6 superfamily domain. EspP has been implicated in inhibiting complement activation, which allows the progress of EHEC infection [53, 54]. In EPEC, EspC, the orthologous of EspP, is related to epithelial cell severe alterations [55]. The loss of a functional EspP protein suggests that this strain my not cause disease in humans or may cause a milder form.

thumbnail
Fig 3. Alignment of pO157 plasmids from EHEC O157 isolates.

Plasmid are represented in lineal forms with the ORF colored blue. The tagA gene was oriented at the 5´end to allow for comparison of plasmid architecture. Artemis comparison tool was used to indicate regions of similarity (red), inversions (blue) and In/dels (clear).

https://doi.org/10.1371/journal.pone.0258753.g003

Identification of additional plasmids

Two strains, 7.1Anguil and 438–99, also contained a 55 kb conjugative plasmid, p55 (Table 2). Both plasmids shared 99% similarity with 96% coverage according to blastn. These plasmids were similar to pMCR-1 from EHEC O157:H7 strain 2017C-4109 that carries the mrc-1 gene that mediates colistin resistance [56]. The mcr1 gene was absent from plasmid p55 from strains 7.1Anguil and 438–99 strains. However, the sequence 5’ and 3’ to the gene were conserved in p55 and pMCR.1 (Fig 4). The annotation of the ORFs of p55 was related mostly to hypothetical proteins and the conjugation machinery. Other plasmids highly similar to p55 from E. coli are TP114 from E. coli K12 (MF521836); pChi7122-3 from avian enteropathogenic E. coli (APEC) 7122 O78:K80:H9 (FR851304); AR_0011 plasmid tig00013784_pilon (CP024858; plasmid pF7386-2 from EHEC O157:H7 (CP038361); an unnamed plasmid from E. coli O177:H21 (CP016547) and plasmid pMRSN346355_65.5 (CP018124.).

thumbnail
Fig 4. Alignment of the p55 plasmid of strains 7.1Anguil and 146N4 with a related pMCR1plasmid.

Plasmid are represented in lineal forms with blue and red arrows indicating genes. Artemis comparison tool was used to indicate regions of similarity (red), inversions (blue) and In/dels (clear).

https://doi.org/10.1371/journal.pone.0258753.g004

In addition, strain 146N4 carried a 7 kb plasmid, p7, which contained genes for colicin D synthesis, lysis protein for colicin release and mobilization genes. This plasmid was highly similar to the plasmid pF1273-2 (accession number CP038377). A small 3.3 kb plasmid that was also detected in Balcarce24.2 and 146N4. This plasmid was identical to pOSAK [5759] and carried the toxin endoribonuclease lsoA, an antitoxin lsoB genes and the mobilisation gene mobA2.

Inference of distinctive virulence functions based on gene content and polymorphisms

In a previous study, we had described the behaviour of the same set of EHEC O157:H7 strains from cattle in virulence correlated assays, such as Vero cell cytotoxicity, Red Blood Cell (RBS) lysis produced by the T3SS, adherence to different cell lines and lethality, weight loss and tissue damage in inoculated mice [20]. The following section refers to an analysis of some of the genes involved in these pathogenic processes.

Shiga toxin encoding prophages and determinants of stx2 expression.

We comprehensively evaluated the Stx prophages incorporated into each EHEC genome. The strains of clade 8 and 6 had prophages encoding stx2a and are frequently the second lambdoid prophage in the genome (R2, counting from the replication of origin) Fig 5). Stx toxin production was analyzed in strains carrying the stx2a operon (Fig 5) in order to associate gene content of the encoding prophages with Stx production.

thumbnail
Fig 5. Alignment of prophages carrying stx2a, the red arrows represent genes of the stx2a operon; the grey arrows indicate hypothetical proteins, and the blue arrows refer to genes encoding for protein with assigned function.

Some relevant genes of lambda phage are marked. Artemis comparison tool was used to indicate regions of similarity (red), inversions (blue) and In/dels (clear) or no homology (no lines). Columns at the right show cytotoxicity in Vero cells, by ELISA with or without mitomicin and O and P protein subtype.

https://doi.org/10.1371/journal.pone.0258753.g005

Interestingly, the Stx2a prophage from RafaelaII (Fig 5), is a putative defective phage because it lacked the genes for endolysin, terminase, portal protein (a major capsid protein) and the tail fiber protein. All of these proteins are related to the lytic machinery and the bacteriophage body. While one of the isolates, 9.1Anguil (clade 8) had no Stx2a phage. The phage may have been lost before isolation, as suggested by its low Stx2 activity to Vero cells and by the fact that there was a complete argW tRNA, the integration site of Stx2a prophage.

Two strains, 9.1Anguil and RafaelaII, are of particular interest because they either lack the stx2a containing prophages or the phage is defective. Sequence analyses indicated that 9.1Anguil does not have a stx2a containing prophage. This is different from most the clade 8 strains as they carry both stx2a and stx2c containing prophages [18]. We presume that there was a precise excision of the stx2a containing prophage in 9.1Anguil that didn’t leave any phage sequence in the chromosome when it excised. However, we can’t rule out that 9.1Anguil had never had a stx2a containing prophage which would make it an interesting strain for further study. With only the stx2c gene, Stx2 production of this strain was extremely low. This finding suggests that almost all the production of Stx2 in stx2a-stx2c strains is due to the stx2a containing phage, as previously demonstrated [60], and that the low expression of stx2c gene is not due to a repression exerted by the stx2a prophage. In turn, RafaelaII (stx2a-stx2c) contains a defective stx2a containing prophage that lacks the genes encoding for essential components of the phage body and lytic machinery. However, this strain produces high levels of Stx in culture supernatants, as shown by ELISA and verocytotoxicity assay. This observation suggests a complete prophage is not needed for Stx2 production as long as the regulatory genes are present.

Previously published research demonstrated that different alleles of the Q antiterminator gene affect the expression level of the stx2a operon [61]. Stx2a prophages that carry the Q933 allele are related to high expression, while those with Q21 [62] and Q111H- [60]. are linked to low expression. All the strains studied here had the Q933 allele in the Stx2a prophage as determined by in silico PCR with primers described by Olavesen et al. [63]. However, the analysed strains presented an important variation in Stx2 production level. The analysis of the Q gene sequences showed previously undescribed polymorphisms in the Q933 allele of Q gene. These Q933 polymorphisms are evident in two strains, Balcarce24.2 and 7.1Anguil (Fig 6A). Interestingly, 7.1Anguil had the highest cytotoxicity to Vero cell line. Also, strains 7.1Anguil and Balcarce24.2 shared other mutations in the pR’-tRNA region Q-Stx2A intergenic region, in comparison to the other strains, including the reference strains (Fig 6B). The detected non-synonymous mutations in Q genes that may change the function of the protein as well as the SNPs in the intergenic region between the Q and stx2 genes deserves more research to assess the effect of these substitutions on stx expression.

thumbnail
Fig 6.

(A). amino acid substitutions (shaded in grey) in the Q antiterminator gene, (B) SNPs in Q-stx2A intergenic region. EDL933 sequence is used as reference. Dots represent nucleotide that are identical to the reference. SNPs are represented by the substituting nucleotide. A tRNA present in the intergenic region is shaded in grey.

https://doi.org/10.1371/journal.pone.0258753.g006

As replication proteins O and P from lambdoid prophages encoding stx2 have been implicated in Shiga toxin expression [11], we subsequently assessed these genes, which are located upstream of stx2a. Most of the strains had the γ allele of O and P genes, except Balcarce24.2 that carry the δ allele and 7.1Anguil that possesses a novel δ allele variant that we called δ–like allele (Fig 5). This variant of O and P allele has not been described previously.

Interestingly, most of the strains of our study that have stx2a genes have the subtype γ, except strains Balcarce24.2 (clade 6) and 7.1Anguil (clade 6). The O-protein δ-like subtype from the highest Stx producer 7.1Anguil has seven amino acid substitutions in relation to the canonical δ subtype, whereas the P-protein from δ-like subtype has eight substitutions concentrated in the N-terminal end. The role of these substitutions in O and P function remain to be elucidated. The O- and P-proteins, which have been proposed to have primase and replicase activity, respectively, are the only proteins from a lambdoid phage implicated in replication as the DNA polymerase and other constituents of the replication machinery are provided by the host.

Identification and variation in virulence genes.

The LEE genes eae, espA, espB, espF and iha adhesin were 100% conserved among the strains. Notably, iha was absent from 438–99 due to a deletion event in that strain.

Our analysis detected two tir alleles of a previously described T/A SNP at base 255 [64], seven of the Argentinian strains had the T allele while two strains had the A allele (Table 4). The effector tccp1 had three alleles with six strains forming one allele while the other alleles had two and one strain, respectively. The allele (AB253545) with two strains encoded for a protein that is 47-amino acid longer in relation to that encoded for the other two alleles which are synonymous (Table 4). Effector espJ had three alleles: The most common allele was found in eight of the ten strains and contained a non-sense mutation that created a protein 39 aa shorter at the N-terminal end. The other alleles contained a synonymous mutation (Table 4). The effector nleA had two synonymous mutations. The nleB1, nleB2, nleC1 gadA and gadB gene sequences were 100% conserved among strains (Table 4). In all strains, tccp2 was a pseudogene encoding for a non-functional protein.

thumbnail
Table 4. According to Virulence Finder alleles are named for its accession numbera.

https://doi.org/10.1371/journal.pone.0258753.t004

LEE genes polymorphisms.

LEE encodes for a T3SS, which is essential in EHEC virulence. The search for polymorphisms of LEE is relevant because we have previously observed important variations in T3SS activity in the same set of strains [20]. For this analysis, we aligned all nucleotide sequences of LEE from all the strains studied here in search of SNPs and In/Dels. Polymorphisms of LEE genes described in the Virulence Finder section will not be described to avoid redundancy.

In all strains, LEE was located in the same genomic position, between rorf1_2 and an IS66 family transposase gene located after the last gene of LEE, espF. The LEE sequence was highly conserved among all isolates. Five strains (RafaelaII, Vac07.1, Balcarce24.2, 7.1Anguil and 146N4) presented a substitution at nucleotide (nt) 8797 of LEE, which corresponded to the intergenic region comprised between genes etgA-grlR and where the etgA and grlR promoters should be located. The strain 7.1Anguil strain showed a unique polymorphism in espA: a non-synonymous mutation (AACxAAA) producing a NxK change at aa 111.

Among different EHEC O157 strains, LEE locus is much more conserved than prophages and even more than other genome regions [17]. The SNP analysis of the LEE region detected a substitution, between etgA-grlR genes, in five strains. The intergenic region is 195 bases in length and is the promoter region for both genes as the grlR gene is transcribed in the downstream direction while the etgA genes is transcribed in the upstream direction. GrlR is an important regulatory protein that forms a complex with GrlA regulatory protein [65]. This regulatory system forms a bicistronic operon that coordinates the LEE regulation, where GrlA is the positive regulator and GrlR acts negatively. In addition, other studies showed that the GrlR-GrlA system also regulates the expression of flagella and enterohemolysin in EHEC [6668]. The etgA gene encodes for a peptidoglycan lytic enzyme with muramidase activity, an enzyme that contributes with T3SS needle to go through the peptidoglycan layer of the bacteria so it can be ready to penetrate the host epithelial cell [69].

Conclusions

In conclusion, clade 8 strains showed conserved genomic structure to each other while other clade strains show inversions and in/dels (>50 kb) with respect to clade 8 strains and, in accordance with previous studies, lambdoid prophage varied among strains, i.e. those encoding for stx2a were extremely polymorphic, in opposition to those encoding for stx2c A cluster of genes presents exclusively in clade 8 contains genes encoding mostly for DNA binding proteins. Stx expression is generally increased in clade 8 strains. However, there are important variations of Stx expression among clade 8 strains that may be associated to the gene content of stx2a-prophages and nucleotide substitutions in the promoter region on of stx2 operon.

Supporting information

S1 Fig. Long PCR analysis of 7.1Anguil strain inversion.

A) Evidence and graphic representation primers for combinatorial PCR to evaluate inversion. B) In silico prediction of amplicons using assembled genomes. C) Agarose gel of amplified fragments. All primers pairs were tested on 2 isolates from 7.1Anguil and one from RafaelaIIA.

https://doi.org/10.1371/journal.pone.0258753.s001

(PDF)

S2 Fig. Circular genome representation of prophage similarity.

Prophage similarity among genomes of the analyzed strains. Each prophage is represented by a different color on each chromosome. On each circle, genome used as query is showed larger than the subject genomes. Lines starts on query prophages and end on every subject prophage hit showing at least 80% coverage. Size of prophages are represented in scale. For this representation Circos was used.

https://doi.org/10.1371/journal.pone.0258753.s002

(PDF)

S3 Fig. Prophage size vs frequency.

Sizes of prophage families are represented in a scatter graph against the frequency of finding a given prophage family in the 10 genome studied. R2 parameter and the tendency formula were calculated using excel. To determine the relationship between two variables Rho Spearman calculator was used (https://www.socscistatistics.com/tests/spearman/default2.aspx). rho was -0.4341. Shorter prophages were associated with higher clustering (p<0.01).

https://doi.org/10.1371/journal.pone.0258753.s003

(JPG)

S1 Table. Identified prophages.

Excel sheets represent all identified prophages. Sheet All: all prophages identified, family or unique, position in the genome, and size data are presented. Sheet family: prophages belonging to families. Sheet of unique prophages: Shared sheet: unique shared types prophages data are presented.

https://doi.org/10.1371/journal.pone.0258753.s004

(XLSX)

S2 Table. Intra strain prophage similarity.

Each sheet represents a different strain and query, subject, Query Coverage Per Subject, Query Coverage Per Unique Subject, Average Identity and HSPs Number BLAST+ parameters are presented.

https://doi.org/10.1371/journal.pone.0258753.s005

(XLSX)

S3 Table. Main prophage families having > 6 prophages.

https://doi.org/10.1371/journal.pone.0258753.s006

(DOCX)

S4 Table. Blastn of shared types and unique prophages.

Shared types and Unique prophages described here were searched against nr.

https://doi.org/10.1371/journal.pone.0258753.s007

(XLSX)

Acknowledgments

The authors would like to thank Sandy Fryda-Bradley and the USMARC core sequencing facility for excellent technical assistance. The mention of a trade name, proprietary product, or specific equipment does not constitute a guarantee or warranty by the USDA and does not imply approval to the exclusion of other products that might be suitable. USDA is an equal opportunity provider and employer. AA, ML WMdS MFE and AC are CONICET fellows. MI and NAR holds a CONICET fellowship. We thank Marina Palermo for helping in Stx ELISA determination and Valeria Rocha for her valuable technical help.

References

  1. 1. Nguyen Y, Sperandio V. Enterohemorrhagic E. coli (EHEC) pathogenesis. Frontiers in cellular and infection microbiology. Frontiers Media SA; 2012. p. 90.
  2. 2. Hedican EB, Medus C, Besser JM, Juni BA, Koziol B, Taylor C, et al. Characteristics of O157 versus Non-O157 shiga toxin-producing escherichia coli infections in minnesota, 2000–2006. Clin Infect Dis. 2009;49: 358–364. pmid:19548834
  3. 3. Tseng M, Sha Q, Rudrik JT, Collins J, Henderson T, Funk JA, et al. Increasing incidence of non-O157 Shiga toxin-producing Escherichia coli (STEC) in Michigan and association with clinical illness. Epidemiol Infect. 2016;144: 1394–1405. pmid:26584572
  4. 4. Yang SC, Lin CH, Aljuffali IA, Fang JY. Current pathogenic Escherichia coli foodborne outbreak cases and therapy development. Archives of Microbiology. Springer Verlag; 2017. pp. 811–825. https://doi.org/10.1007/s00203-017-1393-y pmid:28597303
  5. 5. Boletín Integrado de Vigilancia N_ 344–SE. Available online: http://www.msal.gob.ar/images/stories/boletines/Boletin-Integrado-De-Vigilancia-N344-SE3.pdf (accessed on 28 March 2018).
  6. 6. Strockbine NA, Marques LRM, Newland JW, Smith HW, Holmes RK, O’brien AD. Two toxin-converting phages from Escherichia coli O157:H7 strain 933 encode antigenically distinct toxins with similar biologic activities. Infect Immun. 1986;53: 135–140. pmid:3522426
  7. 7. Scheutz F, Teel LD, Beutin L, Piérard D, Buvens G, Karch H, et al. Multicenter evaluation of a sequence-based protocol for subtyping Shiga toxins and standardizing Stx nomenclature. J Clin Microbiol. 2012;50: 2951–2963. pmid:22760050
  8. 8. Bai X, Fu S, Zhang J, Fan R, Xu Y, Sun H, et al. Identification and pathogenomic analysis of an Escherichia coli strain producing a novel Shiga toxin 2 subtype. Sci Rep. 2018;8. pmid:29712985
  9. 9. Martin CC, Svanevik CS, Lunestad BT, Sekse C, Johannessen GS. Isolation and characterisation of Shiga toxin-producing Escherichia coli from Norwegian bivalves. Food Microbiol. 2019;84. pmid:31421781
  10. 10. Yang X, Bai X, Zhang J, Sun H, Fu S, Fan R, et al. Escherichia coli strains producing a novel Shiga toxin 2 subtype circulate in China. Int J Med Microbiol. 2020;310: 151377. pmid:31757694
  11. 11. Ogura Y, Mondal SI, Islam MR, Mako T, Arisawa K, Katsura K, et al. The Shiga toxin 2 production level in enterohemorrhagic Escherichia coli O157:H7 is correlated with the subtypes of toxin-encoding phage. Sci Rep. 2015;5: 16663. pmid:26567959
  12. 12. Dallman TJ, Ashton PM, Byrne L, Perry NT, Petrovska L, Ellis R, et al. Applying phylogenomics to understand the emergence of Shiga-toxin-producing Escherichia coli O157:H7 strains causing severe human disease in the UK. Microb Genomics. 2015;1. pmid:28348814
  13. 13. Boerlin P, McEwen SA, Boerlin-Petzold F, Wilson JB, Johnson RP, Gyles CL. Associations between virulence factors of Shiga toxin-producing Escherichia coli and disease in humans. J Clin Microbiol. 1999;37: 497–503. pmid:9986802
  14. 14. McDaniel TK, Kaper JB. A cloned pathogenicity island from enteropathogenic Escherichia coli confers the attaching and effacing phenotype on E. coli K-12. Mol Microbiol. 1997;23: 399–407. pmid:9044273
  15. 15. Elliott SJ, Wainwright LA, McDaniel TK, Jarvis KG, Deng YK, Lai LC, et al. The complete sequence of the locus of enterocyte effacement (LEE) from enteropathogenic Escherichia coli E2348/69. Molecular Microbiology. Mol Microbiol; 1998. pp. 1–4. pmid:9593291
  16. 16. Lim JY, Yoon JW, Hovde CJ. A brief overview of Escherichia coli O157:H7 and its plasmid O157. Journal of Microbiology and Biotechnology. Korean Society for Microbiolog and Biotechnology; 2010. pp. 1–10.
  17. 17. Zhang Y, Laing C, Steele M, Ziebell K, Johnson R, Benson AK, et al. Genome evolution in major Escherichia coli O157:H7 lineages. BMC Genomics. 2007;8: 121. pmid:17506902
  18. 18. Manning SD, Motiwala AS, Springman AC, Qi W, Lacher DW, Ouellette LM, et al. Variation in virulence among clades of Escherichia coli O157:H7 associated with disease outbreaks. Proc Natl Acad Sci U S A. 2008;105: 4868–4873. pmid:18332430
  19. 19. Pianciola L, D’Astek BA, Mazzeo M, Chinen I, Masana M, Rivas M. Genetic features of human and bovine Escherichia coli O157:H7 strains isolated in Argentina. Int J Med Microbiol. 2016;306: 123–130. pmid:26935026
  20. 20. Amigo N, Mercado E, Bentancor A, Singh P, Vilte D, Gerhardt E, et al. Clade 8 and clade 6 strains of Escherichia coli O157:H7 from cattle in Argentina have hypervirulent-like phenotypes. PLoS One. 2015;10. pmid:26030198
  21. 21. Hirai S, Yokoyama E, Wakui T, Ishige T, Nakamura M. Enterohemorrhagic Escherichia coli O157 subclade 8b strains in Chiba Prefecture, Japan, produced larger amounts of Shiga toxin 2 than strains in subclade 8a and other clades. PLoS One. 2018;13.
  22. 22. Neupane M, Abu-Ali GS, Mitra A, Lacher DW, Manning SD, Riordan JT. Shiga toxin 2 overexpression in Escherichia coli O157:H7 strains associated with severe human disease. Microb Pathog. 2011;51: 466–470. pmid:21864671
  23. 23. Latif H, Li HJ, Charusanti P, Palsson B, Aziz RK. A gapless, unambiguous genome sequence of the enterohemorrhagic Escherichia coli O157:H7 strain EDL933. Genome Announc. 2014;2: 821–835. pmid:25125650
  24. 24. Hayashi T, Makino K, Ohnishi M, Kurokawa K, Ishii K, Yokoyama K, et al. Complete genome sequence of enterohemorrhagic Eschelichia coli O157:H7 and genomic comparison with a laboratory strain K-12. DNA Res. 2001;8: 11–22. pmid:11258796
  25. 25. Kulasekara BR, Jacobs M, Zhou Y, Wu Z, Sims E, Saenphimmachak C, et al. Analysis of the genome of the Escherichia coli O157:h7 2006 spinach-associated outbreak isolate indicates candidate genes that may enhance virulence. Infect Immun. 2009;77: 3713–3721. pmid:19564389
  26. 26. Escherichia coli O157:H7 str. TW14588 chromosome, whole genome shotgun—Nucleotide—NCBI. [cited 5 May 2021]. Available: https://www.ncbi.nlm.nih.gov/nuccore/CM000662
  27. 27. Cowley LA, Dallman TJ, Fitzgerald S, Irvine N, Rooney PJ, McAteer SP, et al. Short-term evolution of Shiga toxin-producing Escherichia coli O157:H7 between two food-borne outbreaks. Microb genomics. 2016;2: e000084. pmid:28348875
  28. 28. Katani R, Cote R, Garay JAR, Li L, Arthur TM, DebRoy C, et al. Complete genome sequence of SS52, a strain of Escherichia coli O157:H7 recovered from supershedder cattle. Genome Announc. 2016;3.
  29. 29. Luo H, Zhang CT, Gao F. Ori-Finder 2, an integrated tool to predict replication origins in the archaeal genomes. Front Microbiol. 2014;5: 482. pmid:25309521
  30. 30. Seemann T. Prokka: Rapid prokaryotic genome annotation. Bioinformatics. 2014;30: 2068–2069. pmid:24642063
  31. 31. Li W, Godzik A. Cd-hit: A fast program for clustering and comparing large sets of protein or nucleotide sequences. Bioinformatics. 2006;22: 1658–1659. pmid:16731699
  32. 32. Camacho C, Coulouris G, Avagyan V, Ma N, Papadopoulos J, Bealer K, et al. BLAST+: Architecture and applications. BMC Bioinformatics. 2009;10. pmid:20003500
  33. 33. Carver TJ, Rutherford KM, Berriman M, Rajandream MA, Barrell BG, Parkhill J. ACT: The Artemis comparison tool. Bioinformatics. 2005;21: 3422–3423. pmid:15976072
  34. 34. Abbott JC, Aanensen DM, Rutherford K, Butcher S, Spratt BG. WebACT—An online companion for the Artemis Comparison Tool. Bioinformatics. 2005;21: 3665–3666. pmid:16076890
  35. 35. Arndt D, Marcu A, Liang Y, Wishart DS. PHAST, PHASTER and PHASTEST: Tools for finding prophage in bacterial genomes. Brief Bioinform. 2018;20: 1560–1567.
  36. 36. Rutherford K, Parkhill J, Crook J, Horsnell T, Rice P, Rajandream MA, et al. Artemis: Sequence visualization and annotation. Bioinformatics. 2000;16: 944–945. pmid:11120685
  37. 37. Joensen KG, Scheutz F, Lund O, Hasman H, Kaas RS, Nielsen EM, et al. Real-time whole-genome sequencing for routine typing, surveillance, and outbreak detection of verotoxigenic Escherichia coli. J Clin Microbiol. 2014;52: 1501–1510. pmid:24574290
  38. 38. Page AJ, Cummins CA, Hunt M, Wong VK, Reuter S, Holden MTG, et al. Roary: Rapid large-scale prokaryote pan genome analysis. Bioinformatics. 2015;31: 3691–3693. pmid:26198102
  39. 39. Stamatakis A. RAxML version 8: A tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics. 2014;30: 1312–1313. pmid:24451623
  40. 40. Khelik K, Lagesen K, Sandve GK, Rognes T, Nederbragt AJ. NucDiff: In-depth characterization and annotation of differences between two sets of DNA sequences. BMC Bioinformatics. 2017;18: 1–14. pmid:28049414
  41. 41. Vidovic S, Korber DR. Prevalence of Escherichia coli O157 in Saskatchewan cattle: Characterization of isolates by using random amplified polymorphic DNA PCR, antibiotic resistance profiles, and pathogenicity determinants. Appl Environ Microbiol. 2006;72: 4347–4355. pmid:16751550
  42. 42. Steele M, Ziebell K, Zhang Y, Benson A, Konczy P, Johnson R, et al. Identification of Escherichia coli O157:H7 genomic regions conserved in strains with a genotype associated with human infection. Appl Environ Microbiol. 2007;73: 22–31. pmid:17056689
  43. 43. Eppinger M, Mammel MK, Leclerc JE, Ravel J, Cebula TA. Genomic anatomy of Escherichia coli O157:H7 outbreaks. Proc Natl Acad Sci U S A. 2011;108: 20142–20147. pmid:22135463
  44. 44. Kyle JL, Cummings CA, Parker CT, Quiñones B, Vatta P, Newton E, et al. Escherichia coli Serotype O55:H7 Diversity Supports Parallel Acquisition of Bacteriophage at Shiga Toxin Phage Insertion Sites during evolution of the O157:H7 lineage. J Bacteriol. 2012;194: 1885–1896. pmid:22328665
  45. 45. Ohnishi M, Kurokawa K, Hayashi T. Diversification of Escherichia coli genomes: Are bacteriophages the major contributors? Trends in Microbiology. Elsevier Current Trends; 2001. pp. 481–485. https://doi.org/10.1016/s0966-842x(01)02173-4 pmid:11597449
  46. 46. Shaaban S, Cowley LA, McAteer SP, Jenkins C, Dallman TJ, Bono JL, et al. Evolution of a zoonotic pathogen: investigating prophage diversity in enterohaemorrhagic Escherichia coli O157 by long-read sequencing. Microb genomics. 2016;2: e000096. pmid:28348836
  47. 47. El Kafsi H, Loux V, Mariadassou M, Blin C, Chiapello H, Abraham AL, et al. Unprecedented large inverted repeats at the replication terminus of circular bacterial chromosomes suggest a novel mode of chromosome rescue. Sci Rep. 2017;7: 1–11. pmid:28127051
  48. 48. Hill CW, Harnish BW. Inversions between ribosomal RNA genes of Escherichia coli. Proc Natl Acad Sci U S A. 1981;78: 7069–7072. pmid:6273909
  49. 49. Iguchi A, Iyoda S, Terajima J, Watanabe H, Osawa R. Spontaneous recombination between homologous prophage regions causes large-scale inversions within the Escherichia coli O157:H7 chromosome. Gene. 2006;372: 199–207. pmid:16516407
  50. 50. Casjens SR, Hendrix RW. Bacteriophage lambda: Early pioneer and still relevant. Virology. Academic Press Inc.; 2015. pp. 310–330.
  51. 51. Lupolova N, Dallman TJ, Matthews L, Bono JL, Gally DL. Support vector machine applied to predict the zoonotic potential of E. coli O157 cattle isolates. Proc Natl Acad Sci U S A. 2016;113: 11312–11317. pmid:27647883
  52. 52. Toothman P, Herskowitzt I. Rex-dependent exclusion of lambdoid phages I. Prophage requirements for exclusion. Virology. 1980;102: 133–146. pmid:6445121
  53. 53. Yen YT, Tsang C, Cameron TA, Ankrah DO, Rodou A, Stathopoulos C. Importance of conserved residues of the serine protease autotransporter β-domain in passenger domain processing and β-barrel assembly. Infect Immun. 2010;78: 3516–3528. pmid:20515934
  54. 54. Orth D, Ehrlenbach S, Brockmeyer J, Khan AB, Huber G, Karch H, et al. EspP, a serine protease of enterohemorrhagic Escherichia coli, impairs complement activation by cleaving complement factors C3/C3b and C5. Infect Immun. 2010;78: 4294–4301. pmid:20643852
  55. 55. Navarro-Garcia F, Serapio-Palacios A, Vidal JE, Isabel Salazar M, Tapia-Pastrana G. EspC promotes epithelial cell detachment by enteropathogenic Escherichia coli via sequential cleavages of a cytoskeletal protein and then focal adhesion proteins. Infect Immun. 2014;82: 2255–2265. pmid:24643541
  56. 56. Lindsey RL, Batra D, Smith P, Patel PN, Tagg KA, Garcia-Toledo L, et al. PacBio Genome Sequences of Escherichia coli Serotype O157:H7, Diffusely Adherent E. coli, and Salmonella enterica Strains, All Carrying Plasmids with an mcr-1 Resistance Gene. Microbiol Resour Announc. 2018;7.
  57. 57. Sharma VK, Stanton TB. Characterization of a 3.3-kb plasmid of Escherichia coli O157:H7 and evaluation of stability of genetically engineered derivatives of this plasmid expressing green fluorescence. Vet Microbiol. 2008;132: 421–427. pmid:18586417
  58. 58. Makino K, Ishii K, Yasunaga T, Hattori M, Yokoyama K, Yutsudo CH, et al. Complete nucleotide sequences of 93-kb and 3.3-kb plasmids of an enterohemorrhagic Escherichia coli O157:H7 derived from Sakai outbreak. DNA Res. 1998;5: 1–9. pmid:9628576
  59. 59. Rusconi B, Sanjar F, Koenig SSK, Mammel MK, Tarr PI, Eppinger M. Whole genome sequencing for genomics-guided investigations of Escherichia coli O157:H7 outbreaks. Front Microbiol. 2016;7. pmid:27446025
  60. 60. Fitzgerald SF, Beckett AE, Palarea-Albaladejo J, McAteer S, Shaaban S, Morgan J, et al. Shiga toxin sub-type 2a increases the efficiency of Escherichia coli O157 transmission between animals and restricts epithelial regeneration in bovine enteroids. PLoS Pathog. 2019;15: e1008003. pmid:31581229
  61. 61. LeJeune JT, Abedon ST, Takemura K, Christie NP, Sreevatsan S. Human Escherichia coli O157:H7 genetic marker in isolates of bovine origin. Emerg Infect Dis. 2004;10: 1482–1485. pmid:15496255
  62. 62. Matsumoto M, Suzuki M, Takahashi M, Hirose K, Minagawa H, Ohta M. Identification and epidemiological description of enterohemorrhagic Escherichia coli O157 strains producing low amounts of Shiga toxin 2 in Aichi Prefecture, Japan. Jpn J Infect Dis. 2008;61: 442–445. Available: https://pubmed.ncbi.nlm.nih.gov/19050350/ pmid:19050350
  63. 63. Olavesen KK, Lindstedt BA, Løbersli I, Brandal LT. Expression of Shiga toxin 2 (Stx2) in highly virulent Stx-producing Escherichia coli (STEC) carrying different anti-terminator (q) genes. Microb Pathog. 2016;97: 1–8. pmid:27208749
  64. 64. Bono JL, Keen JE, Clawson ML, Durso LM, Heaton MP, Laegreid WW. Association of Escherichia coli O157:H7 tir polymorphisms with human infection. BMC Infect Dis. 2007; 7: 98. pmid:17718910
  65. 65. Padavannil A, Jobichen C, Mills E, Velazquez-Campoy A, Li M, Leung KY, et al. Structure of GrlR-GrlA complex that prevents GrlA activation of virulence genes. Nat Commun. 2013;4: 1–10. pmid:24092262
  66. 66. Jobichen C, Fernandis AZ, Velazquez-Campoy A, Leung KY, Mok YK, Wenk MR, et al. Identification and characterization of the lipid-binding property of GrlR, a locus of enterocyte effacement regulator. Biochem J. 2009;420: 191–199. pmid:19228114
  67. 67. Iyoda S, Koizumi N, Satou H, Lu Y, Saitoh T, Ohnishi M, et al. The GrlR-GrlA regulatory system coordinately controls the expression of flagellar and LEE-encoded type III protein secretion systems in enterohemorrhagic Escherichia coli. J Bacteriol. 2006;188: 5682–5692. pmid:16885436
  68. 68. Iyoda S, Honda N, Saitoh T, Shimuta K, Terajima J, Watanabe H, et al. Coordinate control of the locus of enterocyte effacement and enterohemolysin genes by multiple common virulence regulators in enterohemorrhagic Escherichia coli. Infect Immun. 2011;79: 4628–4637. pmid:21844237
  69. 69. García-Gómez E, Espinosa N, de la Mora J, Dreyfus G, González-Pedrajo B. The muramidase EtgA from enteropathogenic Escherichia coli is required for efficient type III secretion. Microbiology. 2011;157: 1145–1160. pmid:21233160