The database was constructed using the amino acid sequences of all 3248 curated, non-redundant hydrogenase catalytic subunits represented in the NCBI RefSeq database in August 2014 [2] (Dataset S1). To test the classification tool, additional sequences from newly sequenced archaeal and bacterial phyla were retrieved from the Joint Genome Institute’s Integrated Microbial Genomes database [43].