GO (Gene Ontology; [19 (link)]) functional categories were taken from the annotation files provided on the Ensembl [16 ] and Gene Ontology [20 ] websites. Further GO term annotations were derived by mapping functional GO annotations for the PDB (downloaded from [20 ]) onto Ensembl protein annotations, using 50% sequence identity and 0.8 fractional sequence coverage (for the protein domain) as thresholds, using alignment made by the program BLASTP (e-value ≤ 0.0001) [13 (link)]. These thresholds were benchmarked on the complete SCOP protein domain sequence database [18 (link)], to give a 2% false positive rate for GO term transfer. Significant associations between GO terms and lists of protein sequences we calculated using binomial statistics, and a P'-value threshold of 0.05, where P' has been adjusted to account for multiple hypothesis testing, using the Bonferroni correction. In addition we used two functional supercategories, wherein all transcription-associated and non-transcription-associated GO terms were pooled together. The transcription-associated GO terms are: GO:0006355; GO:0006357; GO:0006366; GO:0006367;GO:0016563;GO:0003676;GO:0003677;GO:0003700;GO:0003702;GO:0003704;GO:0003713;GO:0030374;GO:0030528.
Protein Disorder and Domain Annotation
GO (Gene Ontology; [19 (link)]) functional categories were taken from the annotation files provided on the Ensembl [16 ] and Gene Ontology [20 ] websites. Further GO term annotations were derived by mapping functional GO annotations for the PDB (downloaded from [20 ]) onto Ensembl protein annotations, using 50% sequence identity and 0.8 fractional sequence coverage (for the protein domain) as thresholds, using alignment made by the program BLASTP (e-value ≤ 0.0001) [13 (link)]. These thresholds were benchmarked on the complete SCOP protein domain sequence database [18 (link)], to give a 2% false positive rate for GO term transfer. Significant associations between GO terms and lists of protein sequences we calculated using binomial statistics, and a P'-value threshold of 0.05, where P' has been adjusted to account for multiple hypothesis testing, using the Bonferroni correction. In addition we used two functional supercategories, wherein all transcription-associated and non-transcription-associated GO terms were pooled together. The transcription-associated GO terms are: GO:0006355; GO:0006357; GO:0006366; GO:0006367;GO:0016563;GO:0003676;GO:0003677;GO:0003700;GO:0003702;GO:0003704;GO:0003713;GO:0030374;GO:0030528.
Corresponding Organization : McGill University
Protocol cited in 6 other protocols
Variable analysis
- DISOPRED with default parameters trained to give a 5% false positive rate
- MULTICOIL with default parameters
- ASTRAL 40% identity protein domain sequence set
- BLAST using e-value ≤ 0.01
- 50% sequence identity and 0.8 fractional sequence coverage (for the protein domain) as thresholds for mapping functional GO annotations from PDB
- Binomial statistics with a P'-value threshold of 0.05 for calculating significant associations between GO terms and lists of protein sequences
- Fraction of predicted protein disorder in a CB region (given by the D value)
- Identification of coiled coils
- Assignment of known protein domains
- Identification of types of biased region that map to repetitive Zinc-finger-containing proteins (> 0.5 of the length of the protein)
- GO (Gene Ontology) functional categories
- Bonferroni correction for multiple hypothesis testing
Annotations
Based on most similar protocols
As authors may omit details in methods from publication, our AI will look for missing critical information across the 5 most similar protocols.
About PubCompare
Our mission is to provide scientists with the largest repository of trustworthy protocols and intelligent analytical tools, thereby offering them extensive information to design robust protocols aimed at minimizing the risk of failures.
We believe that the most crucial aspect is to grant scientists access to a wide range of reliable sources and new useful tools that surpass human capabilities.
However, we trust in allowing scientists to determine how to construct their own protocols based on this information, as they are the experts in their field.
Ready to get started?
Sign up for free.
Registration takes 20 seconds.
Available from any computer
No download required
Revolutionizing how scientists
search and build protocols!