description of the chosen configurations and references are given in the
following. We used the default parameters unless mentioned otherwise.
with different parameter settings to tune for the desired sensitivity. In this
comparison we included the strictest parameter set (also default settings), i.e.
Blast E-value cutoff 10^-20 and divergence cutoff 0.2.
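
As a rough illustration of how the two cutoffs quoted above act together, the following sketch keeps a candidate pair only if both conditions hold. The record type, its field names, and the choice to accept values at or below each cutoff are assumptions made for this example and are not taken from the tool itself. How the divergence value is computed is left open here.

```python
from dataclasses import dataclass

# Thresholds quoted in the text above.
EVALUE_CUTOFF = 1e-20
DIVERGENCE_CUTOFF = 0.2


@dataclass(frozen=True)
class CandidatePair:
    """Hypothetical record for one candidate ortholog pair."""
    query: str
    subject: str
    evalue: float      # Blast E-value reported for the hit
    divergence: float  # value compared against the divergence cutoff


def passes_strict_settings(pair: CandidatePair) -> bool:
    """Return True only if the pair satisfies both cutoffs.

    Assumption: smaller values are stricter and the bounds are inclusive.
    """
    return (pair.evalue <= EVALUE_CUTOFF
            and pair.divergence <= DIVERGENCE_CUTOFF)
```

A looser parameter set would simply raise one or both constants, which is why the strictest setting yields the smallest set of predictions.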
2007 including 35 species.
obtained from the Compara database version 47, which is available from
are available from
2003 and Jul 2003 respectively.
build 58 from Nov 2007.
used the data from Oct 2007 including 373 species.
including 550 species. OMA infers orthology at the level of pairs of sequences
(“OMA Pairwise”), from which it also computes groups of
orthologs (“OMA Group”). Both types of predictions are
included in the comparisons.
for aligning the protein sequences. We used the more accurate algorithm from
Smith and Waterman [35] for the alignment with the same scoring
threshold as used by the OMA algorithm for the all-against-all step.
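
To make the alignment step concrete, here is a minimal sketch of Smith-Waterman local alignment scoring followed by a threshold test. It is a simplification under stated assumptions: it uses a linear gap penalty and a toy match/mismatch scoring function (`toy_score`, hypothetical) instead of a real substitution matrix, and the threshold is passed in as a parameter. It is not the implementation used in the study.

```python
from typing import Callable


def smith_waterman_score(a: str, b: str,
                         score: Callable[[str, str], float],
                         gap: float) -> float:
    """Best local alignment score of a and b with a linear gap penalty."""
    prev = [0.0] * (len(b) + 1)
    best = 0.0
    for ca in a:
        curr = [0.0]
        for j, cb in enumerate(b, start=1):
            cell = max(
                0.0,                          # start a new local alignment
                prev[j - 1] + score(ca, cb),  # align ca with cb
                prev[j] - gap,                # ca aligned to a gap
                curr[j - 1] - gap,            # cb aligned to a gap
            )
            curr.append(cell)
            best = max(best, cell)
        prev = curr
    return best


def toy_score(x: str, y: str) -> float:
    """Hypothetical stand-in for a real substitution matrix."""
    return 2.0 if x == y else -1.0


def is_significant(a: str, b: str, threshold: float) -> bool:
    """Keep a pair only if its local alignment score exceeds the threshold."""
    return smith_waterman_score(a, b, toy_score, gap=1.0) > threshold
```

Because the recurrence never lets a cell drop below zero, the score reflects the best-matching local region rather than the full sequences, which is what makes the method suitable for proteins that share only some domains.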
using ML distance estimates from pairwise alignments having significant
alignment scores (Dayhoff score > 217, the cut-off also used by
OMA).
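
The selection of significant pairs can be sketched as a simple filter. The record type and field names below are hypothetical; only the cut-off value and the strict comparison come from the text.

```python
from typing import Dict, FrozenSet, Iterable, NamedTuple

DAYHOFF_SCORE_CUTOFF = 217  # cut-off quoted in the text


class PairwiseAlignment(NamedTuple):
    """Hypothetical record for one aligned sequence pair."""
    seq_a: str
    seq_b: str
    score: float        # alignment score in Dayhoff units
    ml_distance: float  # maximum-likelihood distance estimate


def significant_distances(
    alignments: Iterable[PairwiseAlignment],
) -> Dict[FrozenSet[str], float]:
    """Map each unordered pair scoring above the cut-off to its ML distance."""
    return {
        frozenset((aln.seq_a, aln.seq_b)): aln.ml_distance
        for aln in alignments
        if aln.score > DAYHOFF_SCORE_CUTOFF
    }
```

Keying the result by an unordered pair means a lookup does not depend on which sequence was listed first.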