We collected the metavirome samples from those studies and aimed to add more viral diversity, especially adding viruses not- or under-represented in RefSeq, to the training data. We were careful in quality control of the samples because it is likely that the sample can be contaminated by prokaryotic DNA, since the physical filters may not exclude small sized prokaryotic cells. The details of preparation of metavirome data and quality control can be found in
Expanding Deep Learning Viral Datasets
Partial Protocol Preview
This section provides a glimpse into the protocol.
The remaining content is hidden due to licensing restrictions, but the full text is available at the following link:
Access Free Full Text.
Corresponding Organization :
Other organizations : University of Southern California, Qingdao University, Clark University, University of California, Irvine, Google (United States)
Protocol cited in 27 other protocols
Variable analysis
- Including viral sequences from metavirome sequencing data to enlarge the training dataset
- Prediction accuracy of the deep learning algorithm
- Quality control of the metavirome samples to exclude prokaryotic DNA contamination
- Using the same number of prokaryotic sequences paired with the viral sequences in the enlarged dataset for training
- Positive control: Training the model using sequences derived from viral RefSeq before May 2015
- Negative control: Not using the sequences from RefSeq after May 2015 for training
Annotations
Based on most similar protocols
As authors may omit details in methods from publication, our AI will look for missing critical information across the 5 most similar protocols.
About PubCompare
Our mission is to provide scientists with the largest repository of trustworthy protocols and intelligent analytical tools, thereby offering them extensive information to design robust protocols aimed at minimizing the risk of failures.
We believe that the most crucial aspect is to grant scientists access to a wide range of reliable sources and new useful tools that surpass human capabilities.
However, we trust in allowing scientists to determine how to construct their own protocols based on this information, as they are the experts in their field.
Ready to get started?
Sign up for free.
Registration takes 20 seconds.
Available from any computer
No download required
Revolutionizing how scientists
search and build protocols!