CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix …

JD Thompson, DG Higgins, TJ Gibson - Nucleic acids research, 1994 - academic.oup.com
Abstract
The sensitivity of the commonly used progressive multiple sequence alignment method has been greatly improved for the alignment of divergent protein sequences. Firstly, individual weights are assigned to each sequence in a partial alignment in order to down-weight near-duplicate sequences and up-weight the most divergent ones. Secondly, amino acid substitution matrices are varied at different alignment stages according to the divergence of the sequences to be aligned. Thirdly, residue-specific gap penalties and locally reduced gap penalties in hydrophilic regions encourage new gaps in potential loop regions rather than regular secondary structure. Fourthly, positions in early alignments where gaps have been opened receive locally reduced gap penalties to encourage the opening up of new gaps at these positions. These modifications are incorporated into a new program, CLUSTAL W, which is freely available.
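Two of the ideas in the abstract, sequence weighting and locally reduced gap penalties, can be illustrated with a small Python sketch. This is a simplified illustration under stated assumptions, not the formulas used by CLUSTAL W: the identity-based weighting scheme, the `HYDROPHILIC` residue set, the function names, and every numeric factor (`base`, `gap_factor`, `hydrophilic_factor`, `run`) are hypothetical choices made only to show the general mechanism.

```python
from typing import List

# Assumed set of hydrophilic residues, for illustration only.
HYDROPHILIC = set("DEGKNQPRS")


def fraction_identity(a: str, b: str) -> float:
    """Fraction of identical columns between two aligned sequences,
    skipping columns where both sequences have a gap."""
    pairs = [(x, y) for x, y in zip(a, b) if not (x == "-" and y == "-")]
    if not pairs:
        return 0.0
    return sum(x == y for x, y in pairs) / len(pairs)


def sequence_weights(aligned: List[str]) -> List[float]:
    """Down-weight near-duplicates and up-weight divergent sequences.

    Hypothetical scheme: a sequence's raw weight is the inverse of its
    summed identity to all sequences (itself counted as 1.0); the
    weights are then normalised to sum to 1.
    """
    raw = []
    for i, a in enumerate(aligned):
        total = 1.0 + sum(
            fraction_identity(a, b) for j, b in enumerate(aligned) if j != i
        )
        raw.append(1.0 / total)
    norm = sum(raw)
    return [w / norm for w in raw]


def gap_open_penalties(
    aligned: List[str],
    base: float = 10.0,               # hypothetical default gap-opening penalty
    gap_factor: float = 0.3,          # hypothetical reduction where gaps already exist
    hydrophilic_factor: float = 0.5,  # hypothetical reduction in hydrophilic runs
    run: int = 5,                     # hypothetical minimum length of a hydrophilic run
) -> List[float]:
    """Per-column gap-opening penalties for an existing partial alignment."""
    length = len(aligned[0])
    columns = ["".join(seq[i] for seq in aligned) for i in range(length)]
    hydrophilic_col = [all(c in HYDROPHILIC for c in col) for col in columns]

    # Mark columns that belong to a stretch of at least `run`
    # consecutive fully hydrophilic columns.
    in_run = [False] * length
    start = 0
    while start < length:
        if not hydrophilic_col[start]:
            start += 1
            continue
        end = start
        while end < length and hydrophilic_col[end]:
            end += 1
        if end - start >= run:
            for i in range(start, end):
                in_run[i] = True
        start = end

    penalties = []
    for i, col in enumerate(columns):
        if "-" in col:
            # A gap was already opened here, so opening another is cheaper.
            penalties.append(base * gap_factor)
        elif in_run[i]:
            # Hydrophilic stretch, treated as a potential loop region.
            penalties.append(base * hydrophilic_factor)
        else:
            penalties.append(base)
    return penalties


if __name__ == "__main__":
    # Two identical sequences share weight; the divergent one gets more.
    print(sequence_weights(["ACDEF", "ACDEF", "WYWYW"]))
    # Penalties drop in the hydrophilic run and at the existing gap column.
    print(gap_open_penalties(["MKDEKS-LV", "MKDEKSALV"]))
```

In this sketch the two identical sequences split the weight that a single copy would have received, so a cluster of near-duplicates cannot dominate the alignment score. The penalty vector makes new gaps cheapest where a gap already exists and cheaper in hydrophilic stretches than in the remaining columns.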
Oxford University Press