Taxonomic Identification of Macroorganisms from Environmental DNA

We used custom PERL scripts to retrieve the index information and identify and trim the amplification primer sequences. We discarded any DNA sequence shorter than 50 bp, with the exception of sequences amplified with plant and bryophyte primers for which short amplification products were expected (see Supplemental Table 1). We also discarded any trimmed read pair for which the difference in sequence length was greater than 5 bp between the two paired-end reads to eliminate any reads where primers were not found in both reads. We then merged the paired reads into a single consensus DNA sequence using PANDAseq with default parameters26 (link). We used Mothur27 (link) to cluster unique DNA sequences and counted how many reads carried each unique DNA sequence.
While all raw sequences are freely available online (accession number SRP058316), we only describe here the analyses of macroorganism DNA sequences for sake of simplicity (microorganism sequences can be easily analyzed using standard packages such as those implemented in QIIME28 ). For macro-organisms such as mammals, many species have been sequenced for the locus of interest and if not, a closely related species is likely present in the NCBI database (but see also below). Therefore, to analyze DNA sequences from macroorganisms–mammals, amphibians, birds, bryophytes, arthropods, copepods and plants–we used BLAST29 (link) to directly identify the closest DNA sequences in the NCBI database and the likely species of origin. Briefly, we removed from our analyses any DNA sequence observed in less than 10 reads total (summing across all samples), as these likely represent sequencing errors. We then compared each remaining DNA sequence to all sequences deposited in the NCBI nt database using Blastn (excluding uncultured samples) and only considered matches with greater than 90% identity over the entire sequence length. We then retrieved taxonomic data of all best match(es) for each sequence from NCBI. If multiple species matched a single sequence, all species names were assigned to the sequence. We conducted further analyses at the species level for all taxa, using a minimum read count per sample of 10 to determine absence/presence.

Free full text: Click here

Cannon M.V., Hester J., Shalkhauser A., Chan E.R., Logue K., Small S.T, & Serre D. (2016). In silico assessment of primers for eDNA studies using PrimerTree and application to characterize the biodiversity surrounding the Cuyahoga River. Scientific Reports, 6, 22908.

Publication 2016

Amphibians Arthropods Birds Bryophytes Consensus dna sequence Copepods Dna sequences Dna sequences analyses Mammals Plants Primers Species

Corresponding Organization :

Other organizations : Cleveland Clinic, Case Western Reserve University

Top 5 similar protocols

Protocol cited in 5 other protocols

Variable analysis

independent variables

Custom PERL scripts to retrieve the index information
Custom PERL scripts to identify and trim the amplification primer sequences

dependent variables

Presence/absence of macroorganism DNA sequences based on a minimum read count per sample of 10

control variables

Discarded any DNA sequence shorter than 50 bp, with the exception of sequences amplified with plant and bryophyte primers for which short amplification products were expected
Discarded any trimmed read pair for which the difference in sequence length was greater than 5 bp between the two paired-end reads to eliminate any reads where primers were not found in both reads
Removed from analyses any DNA sequence observed in less than 10 reads total (summing across all samples), as these likely represent sequencing errors
Compared each remaining DNA sequence to all sequences deposited in the NCBI nt database using Blastn (excluding uncultured samples) and only considered matches with greater than 90% identity over the entire sequence length

Annotations

Based on most similar protocols

Etiam vel ipsum. Morbi facilisis vestibulum nisl. Praesent cursus laoreet felis. Integer adipiscing pretium orci. Nulla facilisi. Quisque posuere bibendum purus. Nulla quam mauris, cursus eget, convallis ac, molestie non, enim. Aliquam congue. Quisque sagittis nonummy sapien. Proin molestie sem vitae urna. Maecenas lorem.

As authors may omit details in methods from publication, our AI will look for missing critical information across the 5 most similar protocols.

About PubCompare

Our mission is to provide scientists with the largest repository of trustworthy protocols and intelligent analytical tools, thereby offering them extensive information to design robust protocols aimed at minimizing the risk of failures.

We believe that the most crucial aspect is to grant scientists access to a wide range of reliable sources and new useful tools that surpass human capabilities.

However, we trust in allowing scientists to determine how to construct their own protocols based on this information, as they are the experts in their field.

Ready to get started?

Revolutionizing how scientists
search and build protocols!