Protein Structure and Sequence Analysis

Structural homology models of ancestral sequences were generated with MODELLER v10.2 (Webb and Sali, 2016), using PDB 1M34 as the template for all nitrogenase protein subunits, and visualized with ChimeraX v1.3 (Pettersen et al., 2021).
Extant and ancestral protein sequence space was visualized using machine-learning embeddings, in which each protein is represented by a vector of features in a fixed-size, multidimensional space. The analysis was conducted on the concatenated (HDK) nitrogenase protein sequences in our phylogenetic dataset. Embeddings were obtained with the pre-trained language model ESM2 (Lin et al., 2022; Rives et al., 2021), a transformer architecture trained to reproduce sequence-level correlations in a dataset of hundreds of millions of protein sequences. Layer 33 of this transformer was used, as recommended by the authors. The resulting 1024 dimensions were reduced by UMAP (McInnes et al., 2020) for visualization in two-dimensional space.
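The two preprocessing ideas in this paragraph, concatenating the H, D, and K subunits and representing each protein as one fixed-size vector, can be sketched as follows. This is a minimal illustration, not the authors' pipeline: the source does not state how per-residue representations were combined into a single vector, so mean pooling is an assumption here, and the sequences, the three-dimensional toy vectors, and the function names `concatenate_subunits` and `mean_pool` are all hypothetical.

```python
from typing import Dict, List, Sequence


def concatenate_subunits(
    subunits: Dict[str, str], order: Sequence[str] = ("H", "D", "K")
) -> str:
    """Join subunit sequences in a fixed order (H, then D, then K)."""
    return "".join(subunits[name] for name in order)


def mean_pool(per_residue: List[List[float]]) -> List[float]:
    """Average per-residue vectors into one fixed-size vector.

    Assumption: mean pooling is one common way to obtain a fixed-size
    protein embedding; the source does not say which pooling was used.
    """
    if not per_residue:
        raise ValueError("no per-residue vectors supplied")
    dim = len(per_residue[0])
    if any(len(vec) != dim for vec in per_residue):
        raise ValueError("all per-residue vectors must share one dimension")
    n = len(per_residue)
    return [sum(vec[i] for vec in per_residue) / n for i in range(dim)]


# Hypothetical toy sequences standing in for real subunit sequences.
toy_subunits = {"H": "MAMR", "D": "MTG", "K": "MSQ"}
concatenated = concatenate_subunits(toy_subunits)

# Hypothetical per-residue vectors; a language model would supply these.
toy_embeddings = [[float(i), float(i) * 2.0, 1.0] for i in range(len(concatenated))]
pooled = mean_pool(toy_embeddings)
```

Whatever the sequence length, the pooled vector has the same dimension, which is what allows proteins of different lengths to be compared and then projected into two dimensions.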
Protein site-wise conservation analysis was performed using the ConSurf server (Ashkenazy et al., 2016). An input alignment containing only extant Group I Mo-nitrogenases was submitted for analysis under default parameters. Conserved sites were defined as those with a ConSurf conservation score >7.
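The conservation cutoff can be illustrated with a short sketch. It assumes the per-site scores have already been read into a mapping from alignment position to integer score; the scores below are invented for illustration, and the name `conserved_sites` is hypothetical rather than part of any ConSurf tooling.

```python
from typing import Dict, List

# Threshold from the text: a site counts as conserved if its score is > 7.
CONSERVATION_THRESHOLD = 7


def conserved_sites(
    scores: Dict[int, int], threshold: int = CONSERVATION_THRESHOLD
) -> List[int]:
    """Return the positions whose score is strictly above the threshold."""
    return sorted(pos for pos, score in scores.items() if score > threshold)


# Hypothetical scores keyed by alignment position (not real data).
toy_scores = {1: 9, 2: 4, 3: 8, 4: 7, 5: 1}
sites = conserved_sites(toy_scores)
```

Because the comparison is strict, a site scoring exactly 7 (position 4 in the toy data) is not counted as conserved.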

Garcia A.K., Harris D.F., Rivier A.J., Carruthers B.M., Pinochet-Barros A., Seefeldt L.C, & Kaçar B. (2023). Nitrogenase resurrection and the evolution of a singular enzymatic mechanism. eLife, 12, e85003.

Publication 2023

Homology sequences Nitrogenases Protein Protein sequences Protein subunits Space visualization Vector

Other organizations : University of Wisconsin–Madison, Utah State University

Variable analysis

independent variables

Ancestral nitrogenase protein sequences
Extant nitrogenase protein sequences

dependent variables

Structural homology models of ancestral nitrogenase protein sequences
Protein sequence embeddings
Protein site-wise conservation scores

control variables

PDB 1M34 as a template for structural homology modeling of all nitrogenase protein subunits
Default parameters for ConSurf server analysis of extant Group I Mo-nitrogenases
