Identifying Structural RNA Variants

RNAs with known secondary structures were doped into the initial RNA pool as positive controls to estimate the baseline changes in RNA structure in PARS. We calculated the PARS scores for all the bases in the transcripts and performed data normalization in order to directly compare secondary structures between different individuals. To normalize the data, we calculated the standard deviation (SD) for each transcript and divided the PARS score per base by the SD of that transcript. This resulted in a normal distribution of PARS scores for each transcript in each individual and enabled us to calculate the change in PARS scores due to SNVs by subtraction of PARS scores between the individuals. Since a true structure change is likely to extend beyond a single base, we define a structure difference of the i-th base of transcript j between conditions m and n in this formula, where PARS represents the normalized PARS score:

{StrucDiff}_{i, j, m, n} = \sum_{k = i - 2}^{k = i + 2} \frac{abs (P A R S_{k, j, m} - P A R S_{k, j, n})}{5}

We calculated the StrucDiff for all the bases in all the transcripts between each pair of individuals: GM12891 and GM12892, GM12891 and GM12878, GM12892 and GM12878. To identify RiboSNitches, we downloaded SNV annotations from HapMap project²², and then converted SNV annotations from hg18 assembly to hg19 assembly using UCSC executable LiftOver. We then overlaid the hg19 SNV coordinates with our transcriptome annotation, a non-redundant combination of RefSeq and Gencode v12 transcriptome assembly, to identify the positions in the transcriptome that have SNVs. For highly confident detection of structural changes, we require that the sequencing coverage around SNV is dense, such that (1) the SNV is located on a transcript whose average coverage is greater than 1 (on average one read per base); and (2) the average coverage in a 5-base window centered around the SNV is greater than 10 (average S1+V1≥5). We exclude bases that fall within 100 nucleotides from the 3’end of all the transcripts due to the blind tail of 100 nucleotides.
To identify SNVs with statistically significant changes in structure, we estimated a global baseline of structural change by calculating the fold differences between the doping control and SNV cumulative frequencies. We calculated a z-score for each detected SNV: z= (StrucDiffs-mean)/(SD of doped in controls). We used the Tetrahymena ribozyme as the doped in control. We noticed that a StrucDiff ≥1 is equivalent to a z-score≥4.5 and a 100 fold difference between the SNV and doping control cumulative frequencies. To calculate the p-value for the structural change at each detected SNV, we performed 1000 permutations on the absolute values of the non-zero delta PARS scores within each transcript that contains SNV. This p-value is an estimate of the likelihood that a 5-base average of the permutated PARS structural change is greater than the 5-base average of the SNV base’s structural change. The false discovery rate (FDR) of the significance of the structural change at the SNV site is estimated by a multi-hypothesis testing performed using the p.adjust function in R. A SNV is defined as a RiboSNitch if (1) its StrucDiff is greater than 1 (equivalent to z-score ≥ 4.5 and 100 fold cumulative frequency difference); (2) its p-value less than 0.05 and FDR less than 0.1; and (3) local read coverage greater than 10 and at least 3 out of 11 bases contain S1 or V1 signals in a 11-base sliding window centered by the SNV site. We also permutated the structural changes between the Trio by shuffling the StrucDiffs within every transcript. After structural PARS scores were permutated, we identified only 16 RiboSNitches based on the exact same aforementioned methods and thresholds. This number is less than 1% of the original number of RiboSNitches found, indicating that most of the discovered RiboSNitches are not random noise.

Partial Protocol Preview
This section provides a glimpse into the protocol.
The remaining content is hidden due to licensing restrictions, but the full text is available at the following link: Access Free Full Text.

Wan Y., Qu K., Zhang Q.C., Flynn R.A., Manor O., Ouyang Z., Zhang J., Spitale R.C., Snyder M.P., Segal E, & Chang H.Y. (2014). Landscape and variation of RNA secondary structure across the human transcriptome. Nature, 505(7485), 706-709.

Publication 2014

Blind Confident Hapmap J base Nucleotides Pars Ribozyme Tail Tetrahymena Transcriptome Trio

Corresponding Organization :

Other organizations : Stanford University, Weizmann Institute of Science

Top 5 similar protocols

Protocol cited in 11 other protocols

Variable analysis

independent variables

Presence of SNVs (Single Nucleotide Variants) in the transcripts

dependent variables

Changes in RNA secondary structure, as measured by PARS (Parallel Analysis of RNA Structure) scores

control variables

Positive controls: RNAs with known secondary structures doped into the initial RNA pool to estimate baseline changes in RNA structure.
Tetrahymena ribozyme used as the doped-in control to estimate a global baseline of structural change.

Annotations

Based on most similar protocols

Etiam vel ipsum. Morbi facilisis vestibulum nisl. Praesent cursus laoreet felis. Integer adipiscing pretium orci. Nulla facilisi. Quisque posuere bibendum purus. Nulla quam mauris, cursus eget, convallis ac, molestie non, enim. Aliquam congue. Quisque sagittis nonummy sapien. Proin molestie sem vitae urna. Maecenas lorem.

As authors may omit details in methods from publication, our AI will look for missing critical information across the 5 most similar protocols.

About PubCompare

Our mission is to provide scientists with the largest repository of trustworthy protocols and intelligent analytical tools, thereby offering them extensive information to design robust protocols aimed at minimizing the risk of failures.

We believe that the most crucial aspect is to grant scientists access to a wide range of reliable sources and new useful tools that surpass human capabilities.

However, we trust in allowing scientists to determine how to construct their own protocols based on this information, as they are the experts in their field.

Ready to get started?

Revolutionizing how scientists
search and build protocols!