Two sets of whole-genome simulation data were generated based on the following two quality models. One is a normal quality simulation that derives the sequencing error and quality based on a training data set of 250k empirical reads randomly selected from our T-ALL WGS data while the other is a high quality data set that use only reads with qualities in the range of 32–40 for training. We created the high-quality simulation data because the mapping rate of the normal quality WGS is 10% lower than that of the empirical WGS data for the 10 T-ALL genomes which ranges from 92–95%. On the other hand, high-quality simulation data has a mapping rate of 91% which is close to the empirical mapping rate.
Benchmarking structural variation detection
Partial Protocol Preview
This section provides a glimpse into the protocol.
The remaining content is hidden due to licensing restrictions, but the full text is available at the following link:
Access Free Full Text.
Corresponding Organization :
Other organizations : St. Jude Children's Research Hospital, Washington University in St. Louis
Protocol cited in 28 other protocols
Variable analysis
- Sequencing platforms (Illumina/Solexa, Roche/454 and Life Technologies/SOLiD)
- Sequencing quality models (normal quality and high quality)
- Sensitivity of CREST in identifying validated germline structural variations (deletions, duplications and insertions)
- NA12878 as the sample
- 100-bp paired-end reads with a mean size of 400 bp and a standard deviation of 20 bp
- 20-fold coverage of the human assembly NCBI build 36 for each haploid genome
- All reads mapped to the human assembly NCBI build 36 using the program BWA with default parameters
- Gold standard data set for NA12878 consisting of 642 deletions, 271 duplications and 30 insertions compiled by the 1000 Genomes Project
- Unable to include the 30 insertions for simulation because the inserted sequences were not accessible
Annotations
Based on most similar protocols
As authors may omit details in methods from publication, our AI will look for missing critical information across the 5 most similar protocols.
About PubCompare
Our mission is to provide scientists with the largest repository of trustworthy protocols and intelligent analytical tools, thereby offering them extensive information to design robust protocols aimed at minimizing the risk of failures.
We believe that the most crucial aspect is to grant scientists access to a wide range of reliable sources and new useful tools that surpass human capabilities.
However, we trust in allowing scientists to determine how to construct their own protocols based on this information, as they are the experts in their field.
Ready to get started?
Sign up for free.
Registration takes 20 seconds.
Available from any computer
No download required
Revolutionizing how scientists
search and build protocols!