  • Using PSI-BLAST [27] with different inteins as seeds, divergent inteins with less than ten percent sequence identity to the query sequence are recovered (data not shown); however, this approach did not reveal any additional intein- or homing endonuclease-encoding genes in T. acidophilum.

  • The T. acidophilum intein shows significant sequence similarity to the inteins found in the A-ATPase catalytic subunits of Pyrococcus.

  • Moreover, these inteins are inserted into the same highly conserved sequence in the ATP binding site.

  • Multiple sequence alignments of diverse intein sequences identified eight motifs composed of moderately conserved residues [18, 28]. The multiple sequence alignment of the inteins from the Thermoplasma and Pyrococcus A-ATPase A subunits (with manual modification) is shown in figure 1. The T. acidophilum intein (173 amino acids long) is among the shortest inteins known, and the alignment with other inteins reveals the absence of sequences homologous to the typical endonuclease motifs.

  • The significance of the match between the T. acidophilum and the three pyrococcal ATPase inteins was assessed using PRSS at http://fasta.bioch.virginia.edu/fasta/prss.htm [29]. The P-value for this match, i.e., the probability of obtaining a match of this quality by chance alone, was calculated to be below 10^-10.
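
The idea behind a shuffle-based significance estimate such as the one reported for the PRSS comparison can be illustrated with a minimal Python sketch. This is not the PRSS implementation: it assumes a simple match/mismatch score with a linear gap penalty instead of a substitution matrix, and it counts shuffled scores directly, whereas PRSS, as we understand it, fits a statistical distribution to the shuffled scores. The function names, scoring parameters, and the toy sequence are hypothetical and are not taken from the intein data.

```python
import random


def local_score(a, b, match=2, mismatch=-1, gap=-2):
    """Smith-Waterman local alignment score with a linear gap penalty.

    The scoring values are illustrative assumptions, not PRSS defaults.
    """
    prev = [0] * (len(b) + 1)
    best = 0
    for i in range(1, len(a) + 1):
        curr = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            # A local alignment may restart anywhere, hence the floor at 0.
            curr[j] = max(0, diag, prev[j] + gap, curr[j - 1] + gap)
            best = max(best, curr[j])
        prev = curr
    return best


def shuffle_p_value(query, subject, n_shuffles=200, seed=0):
    """Empirical P-value: fraction of shuffled subjects scoring >= observed.

    Shuffling preserves the length and composition of the subject while
    destroying residue order, so the shuffled scores approximate chance.
    """
    rng = random.Random(seed)
    observed = local_score(query, subject)
    residues = list(subject)
    hits = 0
    for _ in range(n_shuffles):
        rng.shuffle(residues)
        if local_score(query, "".join(residues)) >= observed:
            hits += 1
    # Add-one correction keeps the estimate strictly above zero.
    return observed, (hits + 1) / (n_shuffles + 1)


if __name__ == "__main__":
    toy = "MKVLAGDTLVWTPEGR"  # made-up sequence, for illustration only
    score, p = shuffle_p_value(toy, toy)
    print(score, p)
```

An empirical count of this kind cannot resolve P-values smaller than 1/(n_shuffles + 1), so reaching values on the order of 10^-10 requires extrapolating from a distribution fitted to the shuffled scores rather than counting exceedances directly.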

