Seventeen new species have been added since TreeFam v1 (4). TreeFam v4 contains predicted protein sequences from the fully sequenced genomes of 25 animal species: human, chimpanzee, macaque, mouse, rat, cow, dog, opossum, chicken, frog, two pufferfish (Takifugu and Tetraodon), zebrafish, medaka, stickleback, sea squirts (Ciona intestinalis and C. savignyi), two fruit-flies (Drosophila melanogaster and D. pseudoobscura), two mosquitoes (Aedes aegypti and Anopheles gambiae), the flatworm Schistosoma mansoni, and the nematodes Caenorhabditis elegans, C. briggsae and C. remanei. In addition, four outgroup genomes are included: baker's yeast, fission yeast, rice and thale cress (Arabidopsis).
The C. briggsae and C. remanei proteins were downloaded from WormBase (16), D. pseudoobscura proteins from FlyBase (17), fission yeast and flatworm proteins from GeneDB (18), thale cress proteins from TIGR (19), rice proteins from the Beijing Genomics Institute (20) and the remaining sequences from Ensembl (15). In addition to these species, TreeFam includes UniProt (21) proteins from animal species whose genomes have not been fully sequenced. For TreeFam v4, all sequences were downloaded in October 2006.
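The species-to-source assignment described above can be summarised as a small lookup table. The following Python sketch is illustrative only: the names `ANIMALS`, `OUTGROUPS`, `NON_ENSEMBL_SOURCES` and `source_for` are hypothetical and are not part of TreeFam, and the species labels are simply those used in the text. It encodes the rule that any fully sequenced genome not assigned to another database is taken from Ensembl.

```python
# Illustrative lookup of the sequence source for each fully sequenced genome.
# All identifiers here are hypothetical; only the species/source pairs come
# from the text.

ANIMALS = [
    "human", "chimpanzee", "macaque", "mouse", "rat", "cow", "dog",
    "opossum", "chicken", "frog", "Takifugu", "Tetraodon", "zebrafish",
    "medaka", "stickleback", "Ciona intestinalis", "Ciona savignyi",
    "Drosophila melanogaster", "Drosophila pseudoobscura",
    "Aedes aegypti", "Anopheles gambiae", "Schistosoma mansoni",
    "Caenorhabditis elegans", "Caenorhabditis briggsae",
    "Caenorhabditis remanei",
]

OUTGROUPS = ["baker's yeast", "fission yeast", "rice", "thale cress"]

# Databases named explicitly in the text for specific species.
NON_ENSEMBL_SOURCES = {
    "WormBase": ["Caenorhabditis briggsae", "Caenorhabditis remanei"],
    "FlyBase": ["Drosophila pseudoobscura"],
    "GeneDB": ["fission yeast", "Schistosoma mansoni"],
    "TIGR": ["thale cress"],
    "Beijing Genomics Institute": ["rice"],
}


def source_for(species: str) -> str:
    """Return the database a species' proteins were downloaded from."""
    for database, members in NON_ENSEMBL_SOURCES.items():
        if species in members:
            return database
    if species in ANIMALS or species in OUTGROUPS:
        # "The remaining sequences" were taken from Ensembl.
        return "Ensembl"
    raise KeyError(f"{species!r} is not a fully sequenced TreeFam v4 genome")


if __name__ == "__main__":
    for name in ANIMALS + OUTGROUPS:
        print(f"{name}: {source_for(name)}")
```

UniProt proteins from animals without a fully sequenced genome are deliberately left out of this table, since the text does not enumerate those species.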