Reconstructed transcript sequences (via de novo assembly, Scripture, or Cufflinks) were mapped to the reference coding sequences using BLAT35 (link). Full-length reference annotation mappings were defined as having at least 95% sequence identity covering the entire reference coding sequence and containing at most 5% insertions or deletions (cumulative gap content). In evaluating methods that leverage the strand-specific data (Trinity and Cufflinks), proper sense-strand mapping of sequences was required. Transcripts reconstructed by the alternative methods (Scripture, ABySS, and SOAPdenovo) were allowed to map to either strand. Fusion transcripts were identified as individual reconstructed transcripts that mapped as full-length to multiple reference coding sequences and lacked overlap among the matching regions within the reconstructed transcript. One-to-one mappings were required between reconstructed transcripts and reference transcripts, including alternatively spliced isoforms, with the exception of fusion transcripts.
Benchmarking Transcriptome Reconstruction Methods
Partial Protocol Preview
This section provides a glimpse into the protocol.
The remaining content is hidden due to licensing restrictions, but the full text is available at the following link:
Access Free Full Text.
Corresponding Organization :
Other organizations : Broad Institute, Massachusetts Institute of Technology, University of Massachusetts Chan Medical School, Science for Life Laboratory, Uppsala University, Hebrew University of Jerusalem
Protocol cited in 1 869 other protocols
Variable analysis
- Reconstructed transcript sequences (via de novo assembly, Scripture, or Cufflinks)
- Mappings of reconstructed transcript sequences to the reference coding sequences
- Full-length reference annotation mappings (at least 95% sequence identity covering the entire reference coding sequence and containing at most 5% insertions or deletions)
- Sense-strand mapping of sequences (for methods that leverage the strand-specific data, Trinity and Cufflinks)
- Fusion transcripts (individual reconstructed transcripts that mapped as full-length to multiple reference coding sequences and lacked overlap among the matching regions within the reconstructed transcript)
- Reference coding sequences extracted from the genome sequences based on the gene annotations
- Mouse reference coding sequences distilled to remove entirely identical sequences corresponding to isoforms encoding identical proteins and paralogous sequences
Annotations
Based on most similar protocols
As authors may omit details in methods from publication, our AI will look for missing critical information across the 5 most similar protocols.
About PubCompare
Our mission is to provide scientists with the largest repository of trustworthy protocols and intelligent analytical tools, thereby offering them extensive information to design robust protocols aimed at minimizing the risk of failures.
We believe that the most crucial aspect is to grant scientists access to a wide range of reliable sources and new useful tools that surpass human capabilities.
However, we trust in allowing scientists to determine how to construct their own protocols based on this information, as they are the experts in their field.
Ready to get started?
Sign up for free.
Registration takes 20 seconds.
Available from any computer
No download required
Revolutionizing how scientists
search and build protocols!