Repetitive sequence regions were identified by RepeatMasker [44 ] and the specific locations were downloaded from the UCSC genome browser [43 (link)]. The following repeat types were collected for this analysis: low complexity repeat family (low complexity), long interspersed nuclear elements (LINE), short interspersed nuclear elements (SINE), DNA transposons (DNA), RNA repeat families (RNA), satellite repeat family (Satellite), rolling circle (RC), unknown repeat family (Unknown), long terminal repeats (LTR) and other repeats (Other).
Mapping and Categorizing Short RNA Sequences
Repetitive sequence regions were identified by RepeatMasker [44 ] and the specific locations were downloaded from the UCSC genome browser [43 (link)]. The following repeat types were collected for this analysis: low complexity repeat family (low complexity), long interspersed nuclear elements (LINE), short interspersed nuclear elements (SINE), DNA transposons (DNA), RNA repeat families (RNA), satellite repeat family (Satellite), rolling circle (RC), unknown repeat family (Unknown), long terminal repeats (LTR) and other repeats (Other).
Corresponding Organization : Duke University
Other organizations : Northwestern University, Duke Medical Center
Protocol cited in 60 other protocols
Variable analysis
- Short read libraries were downloaded from the Short Read Archive
- Reads from the deep sequencing libraries were first stripped of the 3' adapter sequence
- Reads that were less than 13 nucleotides in length or contained an ambiguous nucleotide were discarded
- The remaining reads were aligned to the human genome (hg19) by the Bowtie algorithm, with up to two mismatches allowed
- All T = > C mismatches between a read and the genomic sequence were subtracted from the mismatch count at each mapped location
- Only reads that mapped to a single genomic location with no mismatches after conversion subtraction were used for further analysis
- Original clusters and CCRs were obtained from Hafner et al. [7] and converted to hg19 coordinates using the liftover tool from the UCSC genome browser
- Repetitive sequence regions were identified by RepeatMasker and the specific locations were downloaded from the UCSC genome browser
- Mapped locations were only reported for the optimal mismatch-stratum for each read up to a maximum of ten different locations
- The location that a read mapped to, relative to a known transcript, was determined based on the ENSEMBL database (release 57)
- Reads that overlapped by at least a single nucleotide were grouped together to form read groups
- The location of a read group relative to known transcripts was determined in the same way as for individual reads
- Specific control variables are not explicitly mentioned in the provided information.
- No positive controls are explicitly mentioned in the provided information.
- No negative controls are explicitly mentioned in the provided information.
Annotations
Based on most similar protocols
As authors may omit details in methods from publication, our AI will look for missing critical information across the 5 most similar protocols.
About PubCompare
Our mission is to provide scientists with the largest repository of trustworthy protocols and intelligent analytical tools, thereby offering them extensive information to design robust protocols aimed at minimizing the risk of failures.
We believe that the most crucial aspect is to grant scientists access to a wide range of reliable sources and new useful tools that surpass human capabilities.
However, we trust in allowing scientists to determine how to construct their own protocols based on this information, as they are the experts in their field.
Ready to get started?
Sign up for free.
Registration takes 20 seconds.
Available from any computer
No download required
Revolutionizing how scientists
search and build protocols!