RNA-Seq Data Integration and Normalization

To allow for consistent comparison across datasets, all read mapping was carried out using TopHat 1.1.0 [29 (link)] with supplied annotations and the --no-novel-juncs option set, except for the SOLiD datasets, which were only available in a pre-aligned form with mapping by BioScope 1.2.1. All expression estimation and bias correction were done using Cufflinks 0.9.3 with the same annotation and reference sequence as TopHat. In the case of strand-specific libraries, the correct --library-type option was used as per the Cufflinks manual. For the mouse dataset in the NanoString experiment, the RefSeq refGene annotation for assembly NCBI37/mm9 was used, and was downloaded from the UCSC Genome Browser. For all human datasets, the RefSeq refGene annotation for assembly NCBI36/hg18 [30 (link)] was used, and was downloaded from the UCSC Genome Browser. The only filtering was to remove non-chromosomal and 'random' contigs. After quanti cation with Cufflinks, the subset of transcripts with 1-to-1 mappings to the TaqMan qRT-PCR probes were selected (as listed in the supplement to [16 (link)]) to be used in the correlation tests. All yeast datasets used the Ensembl Saccharomyces cerevisiae annotation, release 59, which was downloaded from the Ensembl website [31 (link)]. Mitochondrial, non-coding, and ribosomal RNA sites were masked in the annotation. All remaining transcripts were used in our correlation tests.

Free full text: Click here

Roberts A., Trapnell C., Donaghey J., Rinn J.L, & Pachter L. (2011). Improving RNA-Seq expression estimates by correcting for fragment bias. Genome Biology, 12(3), R22.

Publication 2011

Chromosomal Genome Human Library Mitochondrial Mouse Ribosomal rna Saccharomyces cerevisiae Supplement

Corresponding Organization :

Other organizations : Berkeley College, University of California, Berkeley, Broad Institute

Top 5 similar protocols

Protocol cited in 135 other protocols

Variable analysis

independent variables

TopHat 1.1.0 with --no-novel-juncs option
BioScope 1.2.1 for SOLiD datasets
Cufflinks 0.9.3 for expression estimation and bias correction

dependent variables

Expression levels of transcripts

control variables

Supplied annotations used for TopHat and Cufflinks
Reference sequence used for TopHat and Cufflinks
Correct --library-type option used for Cufflinks on strand-specific libraries
RefSeq refGene annotation for assembly NCBI37/mm9 used for mouse dataset
RefSeq refGene annotation for assembly NCBI36/hg18 used for human datasets
Ensembl Saccharomyces cerevisiae annotation, release 59, used for yeast datasets
Masking of mitochondrial, non-coding, and ribosomal RNA sites in yeast annotation

Annotations

Based on most similar protocols

Etiam vel ipsum. Morbi facilisis vestibulum nisl. Praesent cursus laoreet felis. Integer adipiscing pretium orci. Nulla facilisi. Quisque posuere bibendum purus. Nulla quam mauris, cursus eget, convallis ac, molestie non, enim. Aliquam congue. Quisque sagittis nonummy sapien. Proin molestie sem vitae urna. Maecenas lorem.

As authors may omit details in methods from publication, our AI will look for missing critical information across the 5 most similar protocols.

About PubCompare

Our mission is to provide scientists with the largest repository of trustworthy protocols and intelligent analytical tools, thereby offering them extensive information to design robust protocols aimed at minimizing the risk of failures.

We believe that the most crucial aspect is to grant scientists access to a wide range of reliable sources and new useful tools that surpass human capabilities.

However, we trust in allowing scientists to determine how to construct their own protocols based on this information, as they are the experts in their field.

Ready to get started?

Revolutionizing how scientists
search and build protocols!