Genome sequences and genome assembly data were downloaded for the following eukaryotes: Anopheles gambiae, Apis melifera, A. thaliana, Bos taurus, Canis familiaris, Cavia porcellus, C. brenneri, C. briggsae, C. elegans, C. remanei, Chlamydomonas reinhartdii, Ciona intestinalis, D. melanogaster, Felis catus, Gallus gallus, Giardia lamblia, H. sapiens, Loxodonta africana, Macaca mulatta, Magnoporthe grisea, Neurospora crassa, Ornithorynchus anatinus, Pan troglodytes, Plasmodium falciparum, Populus trichocarpa, S. cerevisiae, S. pombe, T. rubripes, T. gondii, T. spiralis and Xenopus tropicalis (full details of source data and download sites are listed in Supplementary Table S6).