Protocol detail

Find Similar Protocols

Modeling Ribosome Footprinting Data

In sequencing-based ribosome footprinting, the RF read count is naturally confounded by mRNA abundance (Fig. 1A). We seek a strategy to compare RF measurements taking mRNA abundance into account in order to accurately discern the translation effect in case–control experiments. We model the vector of RNA-Seq and RF read counts

y_{mRNA}^{i}

and

y_{RF}^{i}

, respectively, for gene i with Negative Binomial (NB) distributions, as described before (for instance, Love et al., 2014 (link); Drewe et al., 2013 (link); Robinson et al., 2010 (link)):

y^{i} \sim N B (μ^{i}, κ^{i}),

where μⁱ is the expected count and κⁱ is the estimated dispersion across biological replicates. Here yⁱ denotes the observed counts normalized by the library size factor (Supplementary Section A). Formulating the problem as a generalized linear model (GLM) with the logarithm as link function, we can express expectations on read counts as a function of latent quantities related to mRNA abundance β_C in the two conditions (

C = {0, 1}

), a quantity

β_{RNA}

that relates mRNA abundance to RNA-Seq read counts, a quantity

β_{RF}

that relates mRNA abundance to RF read counts and a quantity

β_{Δ, C}

that captures the effect of the treatment on translation. In particular, the expected RNA-Seq read count

μ_{mRNA, C}^{i}

is given by the equation

log (μ_{mRNA, C}^{i}) = β_{C}^{i} + β_{RNA}^{i}

.
We assume that transcription and translation are successive cellular processing steps and that abundances are linearly related. The expected RF read count,

μ_{RF, C}^{i}

, is given by

log (μ_{RF, C}^{i}) = β_{C}^{i} + β_{R F}^{i} + β_{Δ, C}^{i}

. A key point to note is that

β_{C}^{i}

is revealed to be a shared parameter between the expressions governing the expected RNA-Seq and RF counts. It can be considered to be a proxy for shared transcriptional/translation activity under condition C in this context. Then,

β_{Δ, C}^{i}

indicates the deviation from that activity under condition C, with

β_{Δ, C}^{i} = 0

for C = 0 and free otherwise (See Supplementary Section B for more details).
Fitting the GLM consists of learning the parameters βⁱ and dispersions κⁱ given mRNA and RF counts for the two conditions

C = {0, 1}

. We perform alternating optimization of the parameters βⁱ given dispersions κⁱ and the dispersion parameters κⁱ given βⁱ, similar to the EM algorithm (Supplementary Sections B and C):

β^{i} = \underset{β^{i}}{arg max} ℓ_{g l m} (β^{i} | y^{i}, κ^{i}) and κ^{i} = \underset{κ^{i}}{arg max} ℓ_{N B} (κ^{i} | y^{i}, μ^{i}) .

As experimental procedures for measuring mRNA counts and RF counts differ, we enable the estimating of separate dispersion parameters for the data sources of RNA-Seq and RF profiling to account for different characteristics (Supplementary Section E).
As in Anders et al. (2012) (link), with raw dispersions estimated from previous steps, we regress all κⁱ given the mean counts to obtain a mean-dispersion relationship

f (μ) = λ_{1} / μ + λ_{0}

. We perform empirical Bayes shrinkage (Love et al., 2014 (link)) to shrink κⁱ towards

f (μ)

to stabilize estimates (see Supplementary Section D). The proposed model in RiboDiff with a joint dispersion estimate is conceptually identical to using the following GLM design matrix

protocol + condition + condition : protocol

(for instance, in conjunction with edgeR or DESeq1/2).
In a treatment/control setting, we can then evaluate whether a treatment (C = 1) has a significant differential effect on translation efficiency compared to the control (C = 0). This is equivalent to determining whether the parameter

β_{Δ, 1}

differs significantly from 0 and whether the relationship denoted by the dashed arrow in Figure 1A is needed or not. We can compute significance levels based on the

χ^{2}

distribution by analyzing

log

-likelihood ratios of the Null model (

β_{Δ, 1}^{i} = 0

) and the alternative model (

β_{Δ, 1}^{i} = 0

Free full text: Click here

Zhong Y., Karaletsos T., Drewe P., Sreedharan V.T., Kuo D., Singh K., Wendel H.G, & Rätsch G. (2016). RiboDiff: detecting changes of mRNA translation efficiency from ribosome footprints. Bioinformatics, 33(1), 139-141.

Publication 2016

Biological Gene Joint Library Love Mrna Rna seq Transcriptional Vector

Corresponding Organization : ETH Zurich

Other organizations : Max Delbrück Center

Top 5 similar protocols

Protocol cited in 38 other protocols

Variable analysis

independent variables

Condition (C = {0, 1})

dependent variables

MRNA abundance (βC)
Relationship between mRNA abundance and RNA-Seq read counts (βRNA)
Relationship between mRNA abundance and RF read counts (βRF)
Effect of the treatment on translation (βΔ,C)

control variables

Not explicitly mentioned

Annotations

Based on most similar protocols

Etiam vel ipsum. Morbi facilisis vestibulum nisl. Praesent cursus laoreet felis. Integer adipiscing pretium orci. Nulla facilisi. Quisque posuere bibendum purus. Nulla quam mauris, cursus eget, convallis ac, molestie non, enim. Aliquam congue. Quisque sagittis nonummy sapien. Proin molestie sem vitae urna. Maecenas lorem.

As authors may omit details in methods from publication, our AI will look for missing critical information across the 5 most similar protocols.

About PubCompare

Our mission is to provide scientists with the largest repository of trustworthy protocols and intelligent analytical tools, thereby offering them extensive information to design robust protocols aimed at minimizing the risk of failures.

We believe that the most crucial aspect is to grant scientists access to a wide range of reliable sources and new useful tools that surpass human capabilities.

However, we trust in allowing scientists to determine how to construct their own protocols based on this information, as they are the experts in their field.

Ready to get started?

Revolutionizing how scientists
search and build protocols!