Classification performance was also slightly modified from a standard machine-learning scenario as the classifiers in this study are able to refuse classification if they are not confident above a taxonomic level for a given sample. This also accommodates the taxonomy truncation that we performed for this test. The methodology was consistent with that used below for novel taxon evaluations, so we defer its description to the next section.
Benchmarking Taxonomic Classification of Microbiome Sequences
Classification performance was also slightly modified from a standard machine-learning scenario as the classifiers in this study are able to refuse classification if they are not confident above a taxonomic level for a given sample. This also accommodates the taxonomy truncation that we performed for this test. The methodology was consistent with that used below for novel taxon evaluations, so we defer its description to the next section.
Corresponding Organization : Northern Arizona University
Other organizations : Australian National University, University of California, San Diego
Protocol cited in 545 other protocols
Variable analysis
- Primer used for simulated read generation (27F/1492R, 515F/806R, 27F/534R, BITSf/B58S3r)
- Classification performance of taxonomic classifiers
- Reference databases used (Greengenes or UNITE 99% OTUs)
- Taxonomic label cleaning process to remove ambiguous or null labels
- Simulated read generation by trimming reference sequences to mimic amplification using standard PCR primers
- Absence of simulated sequencing errors in the generated reads
- 10-fold randomized cross-validation data sets generated using scikit-learn's library functions
- Taxonomy truncation for test sets where a taxonomy was not present in the corresponding training set
Annotations
Based on most similar protocols
As authors may omit details in methods from publication, our AI will look for missing critical information across the 5 most similar protocols.
About PubCompare
Our mission is to provide scientists with the largest repository of trustworthy protocols and intelligent analytical tools, thereby offering them extensive information to design robust protocols aimed at minimizing the risk of failures.
We believe that the most crucial aspect is to grant scientists access to a wide range of reliable sources and new useful tools that surpass human capabilities.
However, we trust in allowing scientists to determine how to construct their own protocols based on this information, as they are the experts in their field.
Ready to get started?
Sign up for free.
Registration takes 20 seconds.
Available from any computer
No download required
Revolutionizing how scientists
search and build protocols!