  • The following databases were used: Human UTR-DB (EBI) (version 13) [70]; Human Transcript Database (Baylor University) (version 1) [71]; GenBank CDS (NCBI) (only PRI mRNA sequences were used, version 119) [72]; HINT (Ohio State University) [73]; EST Assembly Project (University of Washington) [74]; TIGR Human Gene Index (version 4.5) [75]; dbEST (NCBI) (version 119) [76]; MINT and RINT (Ohio State University) [73]; EMBL Rodent (EMBL) (version 63) [77]; SWISS-PROT (EMBL) (version 39) [78]; TrEMBL (EMBL) (version 14) [78]; PIR (MIPS-JIPID) (version 65) [79]; and Pfam (Sanger Centre) (version 5.4) [80].

  • Islamic terms: khaksar, khankah, pir, purdah

  • PIR [7] classifies proteins into either a homeomorphic superfamily (proteins containing similar domains in the same order) or a homology domain superfamily (proteins from different homeomorphic superfamilies sharing a common ancestral domain).

  • A number of public efforts are currently focusing on the annotation and curation of gene-specific functional data, including LocusLink, Protein Information Resource (PIR), GeneCards, Proteome, Kyoto Encyclopedia of Genes and Genomes (KEGG), Ensembl, and Swiss-Prot, to name but a few [2, 3, 4, 5, 6, 7, 8]. These resources provide exceptional depth and coverage of the functional data available for a given gene, but are not designed to effectively explore the biological knowledge associated with hundreds or thousands of genes in parallel.

  • 03202001 "as putative cinnamyl-alcohol dehydrogenase", based on sequence similarity (its top 10 BLAST matches are all cinnamyl-alcohol dehydrogenases with E-values in the range of 10^-94 if analyzed against all non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF on Jan 2, 2002).

