Metaproteomic Database Construction Workflow

For protein identification of the mock community samples, a database was created from all protein sequences in the reference genomes of the organisms used in the mock communities (Supplementary Table 8). The cRAP protein sequence database (http://www.thegpm.org/crap/), which contains protein sequences of common laboratory contaminants, was appended to this database. The final database contained 123,100 protein sequences and is available from the PRIDE repository (PXD006118). For protein identification of the soda lake mats we used the database described above.

For protein identification of the human saliva metaproteomes we used the same public databases as Grassl et al.⁹ as a starting point, namely the protein sequences from the Human Oral Microbiome Database⁵³ and the human reference protein sequences from UniProt (UP000005640). CD-HIT⁴⁹ was used to remove redundant sequences from the database at an identity threshold of 95%. The saliva metaproteome database contained 914,388 protein sequences and is available from the PRIDE repository (PXD006366).

For peptide identification and protein inference, the MS/MS spectra were searched against the databases using the Sequest HT node in Proteome Discoverer version 2.0.0.802 (Thermo Fisher Scientific) or MaxQuant version 1.5.5.1¹⁵.
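The database assembly described above (reference proteomes plus the appended cRAP contaminants) can be sketched in Python. This is a minimal illustration and not the authors' script: the function names, the source labels, and the choice to prefix each header with its source are assumptions made here so that contaminant entries stay recognizable after merging.

```python
from typing import Dict, Iterable, Iterator, List, Tuple

Record = Tuple[str, str]  # (header without ">", sequence)


def parse_fasta(lines: Iterable[str]) -> Iterator[Record]:
    """Yield (header, sequence) pairs from FASTA-formatted lines."""
    header = None
    chunks: List[str] = []
    for raw in lines:
        line = raw.strip()
        if not line:
            continue
        if line.startswith(">"):
            if header is not None:
                yield header, "".join(chunks)
            header = line[1:]
            chunks = []
        elif header is not None:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def build_database(sources: Dict[str, Iterable[str]]) -> List[Record]:
    """Concatenate FASTA sources in the given order.

    Each header is prefixed with its source label (a hypothetical
    convention, not taken from the protocol).
    """
    merged: List[Record] = []
    for label, lines in sources.items():
        for header, seq in parse_fasta(lines):
            merged.append((f"{label}|{header}", seq))
    return merged


def to_fasta(records: Iterable[Record], width: int = 60) -> str:
    """Serialize records back to FASTA text with wrapped sequence lines."""
    out: List[str] = []
    for header, seq in records:
        out.append(f">{header}")
        out.extend(seq[i:i + width] for i in range(0, len(seq), width))
    return "\n".join(out) + "\n"
```

In practice each source would be an open file handle, for example `build_database({"ref": open("reference.fasta"), "cRAP": open("crap.fasta")})` with hypothetical file names, and `len()` of the result gives the sequence count to compare against the reported database size.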


Kleiner M., Thorson E., Sharp C.E., Dong X., Liu D., Li C., & Strous M. (2017). Assessing species biomass contributions in microbial communities via metaproteomics. Nature Communications, 8, 1558.

Publication 2017


Corresponding Organization : University of Calgary


Protocol cited in 17 other protocols

Variable analysis

independent variables

Database used for protein identification

dependent variables

Protein identification

control variables

Organisms used in the mock communities (Supplementary Table 8)
Common laboratory contaminants from the cRAP protein sequence database
Protein sequences from the Human Oral Microbiome Database and the human reference protein sequences from UniProt (UP000005640)
CD-HIT used to remove redundant sequences from the database using an identity threshold of 95%
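
The redundancy-removal step listed above could be invoked along the following lines. This is a sketch under stated assumptions: the protocol reports only the 95% identity threshold, so the word size (`-n`) is chosen here from the ranges suggested in the CD-HIT user guide, the file names are hypothetical, and any other options the authors may have set are unknown.

```python
from typing import List


def cd_hit_command(fasta_in: str, fasta_out: str, identity: float = 0.95) -> List[str]:
    """Build an argument list for the cd-hit protein clustering program.

    -i / -o are the input and output FASTA files, -c is the sequence
    identity threshold, and -n is the word size matched to that threshold.
    """
    if not 0.4 <= identity <= 1.0:
        raise ValueError("cd-hit expects an identity threshold between 0.4 and 1.0")
    # Word sizes suggested by the CD-HIT user guide for each threshold range.
    if identity >= 0.7:
        word_size = 5
    elif identity >= 0.6:
        word_size = 4
    elif identity >= 0.5:
        word_size = 3
    else:
        word_size = 2
    return [
        "cd-hit",
        "-i", fasta_in,
        "-o", fasta_out,
        "-c", f"{identity:.2f}",
        "-n", str(word_size),
    ]
```

The returned list can be passed to `subprocess.run(..., check=True)` on a machine where `cd-hit` is installed; building the command separately keeps the threshold visible and easy to record alongside the database.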

controls

Positive control: Not explicitly mentioned
Negative control: Not explicitly mentioned

