Gapped sequence alignment using artificial neural networks: application to the MHC class I system

M Andreatta, M Nielsen - Bioinformatics, 2016 - academic.oup.com
Bioinformatics, 2016academic.oup.com
Motivation: Many biological processes are guided by receptor interactions with linear
ligands of variable length. One such receptor is the MHC class I molecule. The length
preferences vary depending on the MHC allele, but are generally limited to peptides of
length 8–11 amino acids. On this relatively simple system, we developed a sequence
alignment method based on artificial neural networks that allows insertions and deletions in
the alignment. Results: We show that prediction methods based on alignments that include …
Abstract
Motivation: Many biological processes are guided by receptor interactions with linear ligands of variable length. One such receptor is the MHC class I molecule. The length preferences vary depending on the MHC allele, but are generally limited to peptides of length 8–11 amino acids. On this relatively simple system, we developed a sequence alignment method based on artificial neural networks that allows insertions and deletions in the alignment.
Results: We show that prediction methods based on alignments that include insertions and deletions have significantly higher performance than methods trained on peptides of single lengths. Also, we illustrate how the location of deletions can aid the interpretation of the modes of binding of the peptide-MHC, as in the case of long peptides bulging out of the MHC groove or protruding at either terminus. Finally, we demonstrate that the method can learn the length profile of different MHC molecules, and quantified the reduction of the experimental effort required to identify potential epitopes using our prediction algorithm.
Availability and implementation: The NetMHC-4.0 method for the prediction of peptide-MHC class I binding affinity using gapped sequence alignment is publicly available at: http://www.cbs.dtu.dk/services/NetMHC-4.0.
Contact:  mniel@cbs.dtu.dk
Supplementary information:  Supplementary data are available at Bioinformatics online.
Oxford University Press