Identifying Transcription Factor Regulons

RcisTarget is a new R/Bioconductor implementation of the motif enrichment framework of i-cisTarget and iRegulon. RcisTarget identifies enriched transcription factor binding motifs and candidate transcription factors for a gene list. In brief, RcisTarget is based on two steps. First, it selects DNA motifs that are significantly over-represented in the surroundings of the transcription start site (TSS) of the genes in the gene-set. This is achieved by applying a recovery-based method on a database that contains genome-wide cross-species rankings for each motif. The motifs that are annotated to the corresponding TF and obtain a Normalized Enrichment Score (NES) > 3.0 are retained. Next, for each motif and gene-set, RcisTarget predicts candidate target genes (i.e. genes in the gene-set that are ranked above the leading edge). This method is based on the approach described by Aerts et al. 32 (link) which is also implemented in i-cisTarget (web interface) 33 (link) and iRegulon (Cytoscape plug-in) 34 (link). Therefore, when using the same parameters and databases, RcisTarget provides the same results as i-cisTarget or iRegulon, benchmarked against other TFBS-enrichment tools in Janky et al. 34 (link). More details about the method and its implementation in R are given in the package documentation.
To build the final regulons, we merge the predicted target genes of each TF-module that show enrichment of any motif of the given TF. To detect repression, it is theoretically possible to follow the same approach with the negative-correlated TF modules. However, in the datasets we analyzed, these modules were less numerous and showed very low motif enrichment, suggesting that these are lower quality modules. For this reason, we finally decided to exclude the detection of direct repression from the workflow, and continue only with the positive-correlated targets. The databases used for the analyses presented in this paper are the "18k motif collection" from iRegulon (gene-based motif rankings) for human and mouse. For each species, we used two gene-motif rankings (10kb around the TSS or 500bp upstream the TSS), which determine the search space around the transcription start site.

Partial Protocol Preview
This section provides a glimpse into the protocol.
The remaining content is hidden due to licensing restrictions, but the full text is available at the following link: Access Free Full Text.

Aibar S., González-Blas C.B., Moerman T., Huynh-Thu V.A., Imrichova H., Hulselmans G., Rambow F., Marine J.C., Geurts P., Aerts J., van den Oord J., Atak Z.K., Wouters J, & Aerts S. (2017). SCENIC: Single-cell regulatory network inference and clustering. Nature methods, 14(11), 1083-1086.

Publication 2017

Dna motifs Gene Gene transcription factors Genes module Genes transcription Genome Human Mouse Regulons Repression Transcription start site

Corresponding Organization :

Other organizations : VIB-KU Leuven Center for Brain & Disease Research, KU Leuven, University of Liège, VIB-KU Leuven Center for Cancer Biology

Top 5 similar protocols

Protocol cited in 86 other protocols

Variable analysis

independent variables

Not explicitly mentioned

dependent variables

Not explicitly mentioned

control variables

Not explicitly mentioned

controls

None specified
None specified

Annotations

Based on most similar protocols

To optimize experimental performance, the protocols suggest using the 'gene-based motif rankings' database from iRegulon with a search space of 10kb around the transcription start site (TSS) or 500bp upstream the TSS (Protocols 1, 5).

The protocols validate the method's accuracy by comparing the results of RcisTarget to other TFBS-enrichment tools, such as i-cisTarget and iRegulon, which have been benchmarked in previous studies (Protocol 1).

The protocols suggest that RcisTarget provides the same results as i-cisTarget or iRegulon when using the same parameters and databases (Protocol 1).

To enhance the accuracy and reproducibility of the protocols, the lab notes should include the specific parameters used, such as the database, search space around the TSS, and the Normalized Enrichment Score (NES) threshold (Protocols 1, 2, 4).

The protocols use a Normalized Enrichment Score (NES) threshold of 3.0 to select enriched transcription factor binding motifs (Protocols 1, 4).

Etiam vel ipsum. Morbi facilisis vestibulum nisl. Praesent cursus laoreet felis. Integer adipiscing pretium orci. Nulla facilisi. Quisque posuere bibendum purus. Nulla quam mauris, cursus eget, convallis ac, molestie non, enim. Aliquam congue. Quisque sagittis nonummy sapien. Proin molestie sem vitae urna. Maecenas lorem.

As authors may omit details in methods from publication, our AI will look for missing critical information across the 5 most similar protocols.

About PubCompare

Our mission is to provide scientists with the largest repository of trustworthy protocols and intelligent analytical tools, thereby offering them extensive information to design robust protocols aimed at minimizing the risk of failures.

We believe that the most crucial aspect is to grant scientists access to a wide range of reliable sources and new useful tools that surpass human capabilities.

However, we trust in allowing scientists to determine how to construct their own protocols based on this information, as they are the experts in their field.

Ready to get started?

Revolutionizing how scientists
search and build protocols!