  • We first identified EST libraries that were directionally cloned and sequenced (that is, ESTs were cloned and sequenced in a defined orientation with respect to the mRNA transcript); then, focusing exclusively on ESTs from such libraries, we searched for UniGene clusters containing a statistically significant number of misoriented ESTs.

  • Another caveat is that 3' sequencing reads of directionally cloned ESTs are generally not reoriented before the sequences are deposited in GenBank.

  • 7% for differential gene expression was observed, i.e., the fold change for a given gene seen by microarray was directionally consistent with that seen by RT-PCR, regardless of whether the results were significant by either the 5% LFC model (for microarray data) or a Student's t-test (for RT-PCR data).

  • Concordance: a state of agreement between two complementary measurement techniques that is directionally consistent, e.g., both techniques determine that values are statistically significant and that both are either positive or negative.

  • We applied the BLASTX tool [29, 30] to blast each of the 45,588 relevant mRNA and EST sequences that belonged to both high-quality directionally cloned libraries and candidate UniGene clusters against the NCBI nr database (non-redundant database of protein sequences deposited in GenBank), with a threshold expectation value of 1e-10.
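
The notion of directional consistency used in the microarray and RT-PCR excerpts above can be illustrated with a minimal sketch. It assumes each gene has one log fold change per technique and counts a gene as concordant when both values share a sign, ignoring significance. The function name `directional_concordance` and the sample values are hypothetical and do not come from any of the quoted studies.

```python
import math


def directional_concordance(fold_a, fold_b):
    """Return the fraction of genes whose log fold changes share a sign.

    fold_a, fold_b: equal-length sequences of log fold changes for the
    same genes, measured by two techniques (for example microarray and
    RT-PCR). Genes with a zero fold change in either technique have no
    direction and are skipped. Hypothetical helper for illustration.
    """
    if len(fold_a) != len(fold_b):
        raise ValueError("both techniques must report the same genes")

    # Keep only genes that have a direction in both measurements.
    pairs = [(a, b) for a, b in zip(fold_a, fold_b) if a != 0 and b != 0]
    if not pairs:
        return math.nan

    # A pair agrees when both values are positive or both are negative.
    agree = sum(1 for a, b in pairs if (a > 0) == (b > 0))
    return agree / len(pairs)


# Made-up log2 fold changes for five genes (illustrative only).
microarray_log2fc = [1.2, -0.8, 0.3, -2.1, 0.5]
rtpcr_log2fc = [0.9, -1.1, -0.2, -1.7, 0.4]

concordance = directional_concordance(microarray_log2fc, rtpcr_log2fc)
print(f"Directional concordance: {concordance:.0%}")
```

In this made-up example, four of the five genes change in the same direction under both techniques. A stricter definition, such as the one in the concordance excerpt that also requires statistical significance, would need per-gene p-values in addition to the sign check.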

