  • A phenogram (see Additional data files) was then built from this matrix, using the neighbor-joining algorithm [52].

  • The sequences are listed in the same order as they appear in the phenogram (see below), and are numbered in the same order as they appear in the EPK data table (see below).

  • The Newick format tree file was then imported into TreeExplorer [60], which was used to build the summary dendrogram (Figure 1) by manually collapsing branches that represented sequence clusters evident in the phenogram (see Additional data files).

  • To resolve the clusters evident in the phenogram more finely, some of the larger subfamilies were split into sets of smaller, individually named branches.

  • The fragmentary sequences appear in the phenogram (see Additional data files) in parentheses, next to the more complete sequence with which they share the greatest degree of protein sequence identity in a 100-residue window.
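
The placement rule for fragmentary sequences can be illustrated with a minimal sketch. The text does not say how the 100-residue window identity was computed, so everything below is an assumption: the sequences are taken to be rows of one alignment (equal length, with "-" for gaps), identity is the number of identical non-gap columns divided by the window width, and the function names `window_identity`, `best_window_identity`, and `closest_complete` are hypothetical.

```python
def window_identity(a, b, start, width):
    """Fraction of columns in a[start:start+width] where both rows carry
    the same residue. Gap columns ("-") never count as matches
    (an assumption, not a rule stated in the text)."""
    col_a = a[start:start + width]
    col_b = b[start:start + width]
    if not col_a:
        return 0.0
    matches = sum(1 for x, y in zip(col_a, col_b) if x == y and x != "-")
    return matches / len(col_a)


def best_window_identity(a, b, width=100):
    """Highest identity found in any window of `width` aligned columns.
    If the alignment is shorter than `width`, the single window is the
    whole alignment."""
    if len(a) != len(b):
        raise ValueError("sequences must be rows of the same alignment")
    last_start = max(len(a) - width, 0)
    return max(window_identity(a, b, s, width) for s in range(last_start + 1))


def closest_complete(fragment, complete, width=100):
    """Name of the complete sequence that shares the highest windowed
    identity with `fragment`. `complete` maps names to aligned sequences."""
    return max(
        complete,
        key=lambda name: best_window_identity(fragment, complete[name], width),
    )


if __name__ == "__main__":
    # Toy alignment with a 4-column window standing in for 100 residues.
    complete = {"seqX": "ACDEFGHIKL", "seqY": "ACDQFGHIKL"}
    print(closest_complete("--DEFG----", complete, width=4))
```

In this toy case the fragment would be listed in parentheses next to `seqX`. Dividing by the window width rather than by the number of ungapped columns keeps a short, mostly gapped overlap from scoring as a perfect match; other conventions are equally plausible.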

