The following is a description of all these files.
- Python script for the detection of Genomic Islands on the chromosomes.
- Running the above Python script to detect GI1 and GI2.
- Input file for the Python script. This file contains details of 499 isolates.
- Input file for the Python script. This file contains ARG and IS annotations for 499 genomes.
- The result of GI1 detection. This file contains information about Genomic Island 1—strB, strA, sul2, IS1H located on the same contig.
- Proofreading based on the result of GI1 detection above. Strains highlighted in yellow have been excluded.
- This file contains details of isolates that harbor GI1.
- The result of GI2 detection. This file contains information about Genomic Island 2—strB, strA, sul2, IS5075 located on the same contig.
- Proofreading based on the result of GI2 detection above. Strains highlighted in yellow have been excluded.
- This file contains details of isolates that harbor GI2.
-
- R script for employing UpSetR v1.4.0 to visualize phenotype data
- Input file for the R script. This file contains antibiotic resistance phenotypes of 485 isolates.
- UpSet plot illustrating the diversity of antibiotic resistance profiles for ExPEC isolates. This figure was based on the output result from the above R code, and was subsequently retouched using Adobe Illustrator.
-
- R script for employing ggplot2 v3.3.6 to visualize phenotype data in four STs and all 485 isolates.
- Input file for the R script. This file contains the prevalence of antimicrobial resistance in four common STs and all 485 isolates..
- Histogram illustrating the prevalence of antimicrobial resistance phenotypes. This figure was retouched using Adobe Illustrator to split it into Fig. 2b and Fig. 2c.
-
- R script for employing gggenes v0.4.1 to illustrate gene arrow maps of ARGs and their neighboring ISs in all complete genomes.
- Input file for the R script. This file contains position, strand, and class of genes.
- The output result from the above R code.
- Based on the figure 'gggenes.pdf', we subsequently used Adobe Illustrator to retouch and combine as Fig. 4a.
- Based on the figure 'gggenes.pdf', we subsequently used Adobe Illustrator to retouch and combine as Fig. 5a-d.
-
- R script for employing pheatmap v1.0.12 to visualize Jaccard similarity matrix of plasmid genomes.
- Input file for the R script 'pheatmap.R'.
- The output result from the R script 'pheatmap.R'.
- R script for employing ggplot2 v3.3.6 to illustrate the stacked histograms, which show the distribution ExPEC isolates for various phylogroups harboring genomic islands and the number of genomes for different phylogroups harboring each plasmid.
- Input file for the R script 'bar.R'.
- The output result from the R script 'bar.R'.
-
- R script for employing pheatmap v1.0.12 to visualize Jaccard similarity matrix of plasmid genomes.
- Input file for the R script 'pheatmap.R'.
- The output result from the R script 'pheatmap.R'.
- R script for employing ggplot2 v3.3.6 to illustrate the stacked histograms, which show the number of genomes for different phylogroups harboring each plasmid.
- Input file for the R script 'bar.R'.
- The output result from the R script 'bar.R'.
-
- R script for employing ggplot2 v3.3.6 to illustrate the bar plot, which show the annual count of isolated strains.
- Input file for the R script.
- The output result from the above R code.
-
- R script for employing ggplot2 v3.3.6 to illustrate the pie chart, which show the distribution of isolates from various tissues.
- Input file for the R script.
- The output result from the above R code.
-
- R script for employing ggplot2 v3.3.6 to illustrate two pie charts, which show the distribution of 499 ExPEC isolates across different MLSTs and serotypes.
- Input files for the R script.
- The output result from the above R code.
- The image is generated by combining the above pictures using Adobe Illustrator.