Expansion and Curation of Transcription Factor-Ligand Database

The dataset published by Koch et al. [4 (link)] was used as a starting point for the Sensbio database. It contains a 2018 collection of TF-ligand interactions from different databases and literary resources. To expand and update this dataset, data dumps detailing aTFs and their triggering compounds were collected, cleaned and formatted accordingly from the following databases: BioNemo [5 (link)], RegulonDB [6 (link)], RegPrecise [7 (link)], RegTransBase [8 (link)], Sigmol [9 (link)] and GroovDB [10 (link)].
Custom Python 3 scripts (using standard libraries like Pandas and Numpy) were used to populate, clean, format and analyze the database and to build a web application through the Streamlit framework (https://streamlit.io/). Molecular fingerprints were extracted, analyzed and compared using the RDKit python library [11 ]. Networkx python module was used to describe and produce the molecular network. A local BLAST+ installation allowed the scoring and ranking of the protein sequences. Ete3 python toolkit [12 (link)] produced the phylogenetic trees of the TF sequences. Deep learning techniques were applied to build the predictive model through the Tensorflow and Keras Python libraries.
Classyfire [13 (link)] and iFragment [14 (link)] external web applications were used to classify the different molecules by chemical and metabolic categories respectively. Classyfire produces a hierarchical list of ontologies. In this case, the parent ontology was kept as the representative category for each molecule. iFragment on the other hand, produces a list of KEGG [15 (link)] metabolic pathways ordered by the probability of the input compound to belong to that particular pathway. The three pathways with the lowest p-value were selected. Using the KEGG restful API (https://www.kegg.jp/kegg/rest/keggapi.html), the parent ontology was extracted for each pathway and assigned as the final metabolic category.

Free full text: Click here

Tellechea-Luzardo J., Martín Lázaro H., Moreno López R, & Carbonell P. (2023). Sensbio: an online server for biosensor design. BMC Bioinformatics, 24, 71.

Publication 2023

Atfs Dumps Library Ligand Parent Protein sequences Python

Corresponding Organization : Universitat de València

Other organizations : Universitat Politècnica de València

Top 5 similar protocols

Variable analysis

independent variables

Data dumps detailing aTFs and their triggering compounds collected from the following databases: BioNemo, RegulonDB, RegPrecise, RegTransBase, Sigmol, and GroovDB

dependent variables

TF-ligand interactions
Molecular fingerprints
Molecular network
Protein sequence scoring and ranking
Phylogenetic trees of TF sequences
Predictive model built using deep learning techniques

control variables

Standard libraries like Pandas and Numpy used for data processing
RDKit python library used for molecular fingerprint extraction and analysis
Networkx python module used for molecular network description and production
Local BLAST+ installation used for protein sequence scoring and ranking
Ete3 python toolkit used for phylogenetic tree production
Tensorflow and Keras Python libraries used for deep learning model building
Classyfire and iFragment web applications used for molecular classification

Annotations

Based on most similar protocols

Etiam vel ipsum. Morbi facilisis vestibulum nisl. Praesent cursus laoreet felis. Integer adipiscing pretium orci. Nulla facilisi. Quisque posuere bibendum purus. Nulla quam mauris, cursus eget, convallis ac, molestie non, enim. Aliquam congue. Quisque sagittis nonummy sapien. Proin molestie sem vitae urna. Maecenas lorem.

As authors may omit details in methods from publication, our AI will look for missing critical information across the 5 most similar protocols.

About PubCompare

Our mission is to provide scientists with the largest repository of trustworthy protocols and intelligent analytical tools, thereby offering them extensive information to design robust protocols aimed at minimizing the risk of failures.

We believe that the most crucial aspect is to grant scientists access to a wide range of reliable sources and new useful tools that surpass human capabilities.

However, we trust in allowing scientists to determine how to construct their own protocols based on this information, as they are the experts in their field.

Ready to get started?

Revolutionizing how scientists
search and build protocols!