We used standard, tree-based methods to address accuracy of identification in a thoroughly sampled phylogeny using both a species-level and ESU approach. One exemplar from each recognized species (the nominal subspecies if the species included multiple subspecies) or each identified ESU was used as the reference “barcode” exemplar in topological comparisons. We randomly selected 1,000 sequences from the cowrie COI dataset, excluding barcode exemplars, and limiting representation of each species or ESU to 15 or ten sequences, respectively, to minimize bias toward well-sampled taxa. Hybrid individuals (see above) were excluded. These 1,000 sequences were tested one at a time, and their placement relative to the barcoding exemplars evaluated in both neighbor-joining (K2P) and parsimony phylogenies. Identification was considered correct if the sister taxon of the test sequence was the exemplar sequence of its corresponding species or ESU. Identification was considered incorrect if the sister taxon was wrong. If the random sequence fell below a node linking two recognized sister taxa including the corresponding species, the identification was considered ambiguous, as assignment to one or the other is equivocal, as the unknown could also represent a novel taxon. Similar analyses were performed with the turbinid (n = 200 from 278) and limpet (n = 100 from 125) datasets.
Pairwise K2P distances, theta, and coalescent depth were used to characterize intraspecific variation. Genetic distance between terminal taxa and their closest sister was used to characterize interspecific divergence. While the phylogenies used are based upon sequence data from two mtDNA markers (16S and COI: [26 –28 (link)]), only COI was used for these analyses. The two most genetically distant individuals within each ESU (based on pairwise comparisons) were chosen to bookend genetic diversity and recover coalescent depth (maximum intra-ESU variability). These two individuals replaced the exemplar taxon used to construct the overall phylogeny (