STs were deduced from the genome assemblies using mlst (v2.16.1;
https://github.com/tseemann/mlst) and assigned to pre-defined CCs on PubMLST [40 (
link)] (
https://pubmlst.org/campylobacter). The same database was used to calculate core-genome MLST (cgMLST) (Oxford scheme) on PubMLST [40 (
link)]. The relationship between STs, phenotypic resistance, seasonality and the presence of pTet plasmid was assessed using Fisher’s exact test (GraphPad Prism 9.1.2). Source attribution patterns were assigned based on ST–ecotype associations described in recent publications [41–43 (
link)]. Gene annotation was carried out from the draft assemblies using Prokka v.1.13 [44 (
link)]. The presence of plasmids or prophages was inferred from the assemblies using MOB-suite v.2.0.1 [45 (
link)]. The predicted plasmid sequences were used as queries in
blastn with threshold values set to >80 % coverage and >95 % identity. The sequences of accession numbers CP017866 and CP014746 were used to define the predicted sequences as pTet and pVir, respectively. Pan-genome analyses were carried out using Roary v.3.12.0 [46 (
link)] with an amino acid identity cut-off of 95 % and splitting homologous groups containing paralogues into groups of true orthologues. A summary of the pan-genome composition and visualization of gene diversity is provided in Fig. S1(a–c), available with the online version of this article.
In parallel, whole-genome SNP (wgSNP)-based alignments were built from trimmed reads using the Snippy v.4.3.6 pipeline (
https://github.com/tseemann/snippy). The closed genome of strain
C. jejuni subsp.
jejuni NCTC 11168 (GenBank assembly accession no. GCA_000009085.1) was used as a reference in read mapping. Areas of putative recombination were removed from the resulting alignment using Gubbins v.2.2.0 [47 (
link)] and default settings (five iterations and >3 base substitutions to identify a recombination event). Maximum-likelihood phylogenies were obtained from the recombination-removed alignments using the tree building option FastTree v2.1.4 [48 (
link)]. The core-genome phylogeny was visualized using iTOL [49 (
link)] and the pan-genome genes calculated in Roary were displayed alongside the recombination-removed phylogenetic tree using Phandango [50 (
link)] (
https://jameshadfield.github.io/phandango). Virulence gene detection was carried out using ABRicate (version 0.8.10;
https://github.com/tseemann/abricate) equipped with VFDB (Virulence Factor Database) [51 (
link)]. Hits with less than 80 % identity or coverage were filtered out of the analysis.
The PubMLST
C. jejuni database was screened for the major flagellin protein, FlaA (encoded by the
flaA gene). All 2058 deposited
C. jejuni sequences classified as CC-257 (as of November 4th 2022) were searched for the presence of the
flaA sequence by
blastn [NCBI, National Institutes of Health (NIH)] analysis using the DNA sequence from
C. jejuni strain NCTC 11168 as reference. Presence of the gene was determined by >90 % alignment and identity with the query sequence (
E value=0). Finally, the presence of type VI secretion system (T6SS) genes, encoding a total of 13 core components (TagH, TssA–TssG, TssI–TssM), was assessed using previously published reference sequences [52 (
link)] and the
blastn tool. The presence of genes was defined as DNA identity and coverage of ≥90 %.
Ghielmetti G., Seth-Smith H.M., Roloff T., Cernela N., Biggel M., Stephan R, & Egli A. (2023). Whole-genome-based characterization of Campylobacter jejuni from human patients with gastroenteritis collected over an 18 year period reveals increasing prevalence of antimicrobial resistance. Microbial Genomics, 9(2), mgen000941.