  • Otherwise the original state is retained for the next iteration of the Markov chain.

  • Then in the (k + 1)th iteration, the estimates are updated by

  • This search (expect value (e) threshold for inclusion in profile = 0.01) recovered, in addition to the orthologs of the PRC-H proteins from other purple proteobacteria, several uncharacterized proteins from the cyanobacterium Anabaena (for example, all5315 and alr5332, iteration 2, e = 10⁻⁶ to 10⁻⁴), non-photosynthetic α-proteobacteria such as Mesorhizobium, Sinorhizobium, Brucella and Caulobacter (for example, SMc00885, iteration 4, e = 10⁻⁴, or CAC1676, iteration 5, e = 10⁻⁶), several other assorted bacteria such as Deinococcus, Bacillus and Streptomyces (for example, YlmC, iteration 4, e = 10⁻⁵) and several archaea with completely sequenced genomes.

  • A PSI-BLAST search started with the N-terminal SBHM from the Aquifex aeolicus DDRP β' subunit detected several SBHMs from proteins other than the β' subunit, such as acetylornithine deacetylase (E = 2 × 10⁻⁷, iteration 2), the C-terminal region of bacterial cytochrome F (E = 10⁻⁴, iteration 2), the biotin carboxyl carrier domain of biotin transcarboxylases (E = 6 × 10⁻⁴, iteration 2) and the phosphotransferase system enzyme II (E = 10⁻³, iteration 2).

  • As an example, a profile seeded with the MBP1p APSES domain and including all APSES domains in the NR database detects the N. meningitidis protein NMA1544 (gi:11290039) with an E-value of 0.1 in iteration 3. Given the availability of the three-dimensional structure of the APSES domain of the MBP1p protein [16, 17], we investigated this potential relationship using the KilA-N domain sequences for sequence-structure threading of the PDB database with the 3DPSSM, PSIPRED and combined-fold prediction algorithms.

