PSI-BLAST searches using hidden markov models of structural repeats: prediction of an unusual sliding DNA clamp and of β-propellers in UV-damaged DNA-binding …

AF Neuwald, A Poleksic - Nucleic acids research, 2000 - academic.oup.com
AF Neuwald, A Poleksic
Nucleic acids research, 2000academic.oup.com
We have designed hidden Markov models (HMMs) of structurally conserved repeats that,
based on pairwise comparisons, are unconserved at the sequence level. To model
secondary structure features these HMMs assign higher probabilities of transition to insert or
delete states within sequence regions predicted to form loops. HMMs were optimized using
a sampling procedure based on the degree of statistical uncertainty associated with
parameter estimates. A PSI-BLAST search initialized using a checkpoint-recovered profile …
Abstract
We have designed hidden Markov models (HMMs) of structurally conserved repeats that, based on pairwise comparisons, are unconserved at the sequence level. To model secondary structure features these HMMs assign higher probabilities of transition to insert or delete states within sequence regions predicted to form loops. HMMs were optimized using a sampling procedure based on the degree of statistical uncertainty associated with parameter estimates. A PSI-BLAST search initialized using a checkpoint-recovered profile derived from simulated sequences emitted by such a HMM can reveal distant structural relationships with, in certain instances, substantially greater sensitivity than a normal PSI-BLAST search. This is illustrated using two examples involving DNA- and RNA-associated proteins with structurally conserved repeats. In the first example a putative sliding DNA clamp protein was detected in the thermophilic bacterium Thermotoga maritima. This protein appears to have arisen by way of a duplicated β-clamp gene that then acquired features of a PCNA-like clamp, perhaps to perform a PCNA-related function in association with one or more of the many archaeal-like proteins present in this organism. In the second example, β-propeller domains were predicted in the large subunit of UV-damaged DNA-binding protein and in related proteins, including the large subunit of cleavage-polyadenylation specificity factor, the yeast Rse1p and human SAP130 pre-mRNA splicing factors and the fission yeast Rik1p gene silencing protein.
Oxford University Press