The dataset of sequences obtained for this study included 174 partial NS1 sequences from viruses detected between 2004 and 2014 in Newfoundland, other parts of North America (Nova Scotia, Ontario and Wisconsin), and Europe (Denmark). Sequences from Newfoundland originated from both wild mink and from ten different farms. A subset of 131 sequences was selected by excluding sequences that presented observable double peaks on chromatograms, recombinant sequences (identified using RDP software, see below), and all identical sequences derived from the same year and the same geographical area (same farm for Newfoundland) and used for phylogenetic inference. For each sample collected in 2014, only one representative clonal sequence for every strain was included. Reference sequences included only viruses for which the complete coding sequence of one or both ORFs were available and AMDV-like viruses (RFAV and GFADV) identified in foxes and raccoon dogs (Bloom et al. 1988 (
link); Gottschalck et al. 1991 (
link), 1994 (
link); Oie et al. 1996 (
link); Schuierer et al. 1997 (
link); Li et al. 2011 (
link), 2012 (
link); Huang et al. 2012 (
link); Shao et al. 2014 (
link)).
Other sequence datasets were built with sequences downloaded from GenBank (
www.ncbi.nlm.nih.gov/genbank) by mapping all available AMDV sequences (
N = 456) to a reference complete genome to identify the genomic regions most frequently used for molecular epidemiological studies (Olofsson et al. 1999 (
link); Dyer, Ching, and Bloom 2000 (
link); Mañas et al. 2001 (
link); Knuuttila et al. 2009 (
link), 2015 (
link); Jahns et al. 2010 (
link); Christensen et al. 2011 (
link); Jensen et al. 2012 (
link); Leimann et al. 2015 (
link)). Genomic areas represented by the majority of the sequences were selected, and sequences that mapped to these areas were extracted and used to perform phylogenetic analyses.
Sequences were aligned with ClustalX 2.1 (Larkin et al. 2007 (
link)), alignments were manually edited when needed with BioEdit 7.0.5.3 (Hall 1999 ) and then used for phylogenetic inference. A model selection was performed for each alignment to identify the best model for distance estimation, and phylogenetic trees were constructed with MEGA 6.06 (Tamura et al. 2013 (
link)) using the maximum-likelihood method (Felsenstein 1981 (
link)). Information about numbers of sequences, the genomic regions investigated, and models used are provided in the figure legends. Bootstrap analyses (Felsenstein 1985 ) with 1,000 replicates were performed to test the robustness of the analyses, and only clusters supported by ≥ 70 per cent were considered valid.
Accession numbers of all sequences used in this study are available in the
Supplementary Sequence Details File.
Canuti M., O’Leary K.E., Hunter B.D., Spearman G., Ojkic D., Whitney H.G, & Lang A.S. (2016). Driving forces behind the evolution of the Aleutian mink disease parvovirus in the context of intensive farming. Virus Evolution, 2(1), vew004.