Benchmarking Protein Mutation Effects

We used several datasets to develop, validate, and independently test the SAAFEC-SEQ method. These datasets contain experimental thermodynamic data for wild-type and mutant proteins, including the change in Gibbs free energy (ΔΔG). All of the following datasets contain only single-chain proteins and single point missense mutations.
S2648. This is our training and test dataset. It was collected from the ProTherm database [28 (link)] and includes 2648 unique single point missense entries in 131 different proteins, together with the corresponding ΔΔG values.
S350. This is our validation dataset. To compare with other methods, we used the same validation dataset used by other developers [16 (link),17 (link),18 (link),19 (link)], which contains 350 mutations (taken from 67 different proteins) randomly selected from S2648.
S276. This is the first blind dataset. It was collected from the work of Cao et al. [29 (link)] and includes 276 unique single point missense entries in 37 different proteins. None of these entries is in the training or validation set.
p53. This is the second blind dataset. It consists of 42 single point missense mutations within the DNA binding domain of the tumor suppressor protein p53, whose thermodynamic effects have been experimentally determined [46 (link),47 (link),48 (link)]. As in the previous case, none of them appeared in our training set.
PTEN and TPMT. For the third blind dataset, we collected two independent datasets for the phosphatase and tensin homologue (PTEN) and thiopurine S-methyl transferase (TPMT) proteins from the Critical Assessment of Genome Interpretation (CAGI) challenge [30 (link)]. They can be downloaded from https://genomeinterpretation.org/content/predict-effect-missense-mutations-pten-and-tpmt-protein-stability. We removed mutations involving an unknown amino acid “X” (whether in the wild type or in the mutant), which left a total of 7363 missense mutations for the PTEN (3736) and TPMT (3627) proteins.
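
The two bookkeeping steps described above, discarding entries with an unknown amino acid "X" and confirming that a blind set shares no entries with the training set, can be illustrated with a minimal Python sketch. This is not the authors' code. The `Mutation` record, the helper names, and the "A123G"-style mutation codes are assumptions made for illustration only; the actual file formats of the datasets are not described in the text.

```python
from typing import Iterable, List, NamedTuple, Set


class Mutation(NamedTuple):
    """Hypothetical record for one single point missense mutation."""
    protein: str    # protein identifier (assumed label, e.g. "PTEN")
    wild_type: str  # one-letter code of the wild-type residue
    position: int   # 1-based residue position
    mutant: str     # one-letter code of the mutant residue


def parse_mutation(protein: str, code: str) -> Mutation:
    """Parse an assumed code such as 'A123G' into a Mutation record."""
    return Mutation(protein, code[0].upper(), int(code[1:-1]), code[-1].upper())


def drop_unknown(mutations: Iterable[Mutation]) -> List[Mutation]:
    """Keep only entries where neither residue is the unknown amino acid 'X'."""
    return [m for m in mutations if m.wild_type != "X" and m.mutant != "X"]


def overlap(blind: Iterable[Mutation], training: Iterable[Mutation]) -> Set[Mutation]:
    """Return entries present in both sets; an empty set means the blind set is clean."""
    return set(blind) & set(training)


if __name__ == "__main__":
    # Toy data only; these are not entries from the real datasets.
    raw = [parse_mutation("PTEN", c) for c in ["A10G", "X11L", "K12X"]]
    kept = drop_unknown(raw)
    print(f"kept {len(kept)} of {len(raw)} entries")

    training = [parse_mutation("P1", "L5V")]
    print("shared with training set:", overlap(kept, training))
```

Treating each entry as a (protein, wild type, position, mutant) tuple makes the overlap check a plain set intersection. Whether two entries that describe the same mutation under different experimental conditions count as duplicates is a choice this sketch does not address.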


Li G., Panday S.K., & Alexov E. (2021). SAAFEC-SEQ: A Sequence-Based Method for Predicting the Effect of Single Point Mutations on Protein Thermodynamic Stability. International Journal of Molecular Sciences, 22(2), 606.

Publication 2021

Amino acid Blind Genome Missense mutations Mutant proteins Mutations Phosphatase Point mutations Proteins Pten Tensin Tpmt Transferase Tumor suppressor protein p53

Corresponding Organization : Clemson University


Variable analysis

Independent variables: Single point missense mutations in proteins
Dependent variables: Change in Gibbs free energy (ΔΔG) of the mutant proteins
Control variables: Wild type protein sequences
Positive controls: Not specified
Negative controls: Not specified

