We classified all bacterial and archaeal long sequences directly with the Ribosomal Database Project Classifier (RDP) [28] (link). We used only RDP classifications with a bootstrap value of > = 80%. If the bootstrap value was <80%, the taxonomic assignment was moved to a higher classification level until an 80% or better bootstrap value was achieved. For example, if the genus assignment had a bootstrap value of 70%, but the family had a value of 85%, that sequence would be assigned only as far as family and not to genus. RDP Classifier does not classify sequences below the genus level but the GAST process is not inherently limited to genus; its resolution is constrained by the taxonomy of the reference sequence database. The accuracy of GAST will improve in response to refinements of the reference database including increased number of taxonomically-resolved sequences, removal of cryptic chimeric and short sequences, improvement of taxonomic identities for long sequences, and elimination of low quality entries.
Extracting Microbial Diversity Markers
We classified all bacterial and archaeal long sequences directly with the Ribosomal Database Project Classifier (RDP) [28] (link). We used only RDP classifications with a bootstrap value of > = 80%. If the bootstrap value was <80%, the taxonomic assignment was moved to a higher classification level until an 80% or better bootstrap value was achieved. For example, if the genus assignment had a bootstrap value of 70%, but the family had a value of 85%, that sequence would be assigned only as far as family and not to genus. RDP Classifier does not classify sequences below the genus level but the GAST process is not inherently limited to genus; its resolution is constrained by the taxonomy of the reference sequence database. The accuracy of GAST will improve in response to refinements of the reference database including increased number of taxonomically-resolved sequences, removal of cryptic chimeric and short sequences, improvement of taxonomic identities for long sequences, and elimination of low quality entries.
Corresponding Organization :
Other organizations : Marine Biological Laboratory, Stanford Medicine, VA Palo Alto Health Care System
Protocol cited in 88 other protocols
Variable analysis
- Protocol used to download, filter, and curate the aligned small subunit rRNA sequences from the SILVA database
- Number of unique sequences in the resultant dataset (RefSSU)
- Length distribution of the sequences in the RefSSU dataset
- Number of unique V3 and V6 reference tags in the RefV3 and RefV6 databases
- Length distribution of the V3 and V6 reference tags
- SILVA quality assessments used to eliminate low-quality sequences
- Removal of redundant sequences with identical SSU rRNA gene sequences
- Removal of sequences shorter than 50 nt or containing ambiguous bases
- Removal of gap characters to create the V3 and V6 reference tag databases
- RDP Classifier classifications with a bootstrap value of >=80%
- None specified
- None specified
Annotations
Based on most similar protocols
As authors may omit details in methods from publication, our AI will look for missing critical information across the 5 most similar protocols.
About PubCompare
Our mission is to provide scientists with the largest repository of trustworthy protocols and intelligent analytical tools, thereby offering them extensive information to design robust protocols aimed at minimizing the risk of failures.
We believe that the most crucial aspect is to grant scientists access to a wide range of reliable sources and new useful tools that surpass human capabilities.
However, we trust in allowing scientists to determine how to construct their own protocols based on this information, as they are the experts in their field.
Ready to get started?
Sign up for free.
Registration takes 20 seconds.
Available from any computer
No download required
Revolutionizing how scientists
search and build protocols!