Go to JCI Insight
  • About
  • Editors
  • Consulting Editors
  • For authors
  • Publication ethics
  • Publication alerts by email
  • Advertising
  • Job board
  • Contact
  • Clinical Research and Public Health
  • Current issue
  • Past issues
  • By specialty
    • COVID-19
    • Cardiology
    • Gastroenterology
    • Immunology
    • Metabolism
    • Nephrology
    • Neuroscience
    • Oncology
    • Pulmonology
    • Vascular biology
    • All ...
  • Videos
    • Conversations with Giants in Medicine
    • Video Abstracts
  • Reviews
    • View all reviews ...
    • Complement Biology and Therapeutics (May 2025)
    • Evolving insights into MASLD and MASH pathogenesis and treatment (Apr 2025)
    • Microbiome in Health and Disease (Feb 2025)
    • Substance Use Disorders (Oct 2024)
    • Clonal Hematopoiesis (Oct 2024)
    • Sex Differences in Medicine (Sep 2024)
    • Vascular Malformations (Apr 2024)
    • View all review series ...
  • Viewpoint
  • Collections
    • In-Press Preview
    • Clinical Research and Public Health
    • Research Letters
    • Letters to the Editor
    • Editorials
    • Commentaries
    • Editor's notes
    • Reviews
    • Viewpoints
    • 100th anniversary
    • Top read articles

  • Current issue
  • Past issues
  • Specialties
  • Reviews
  • Review series
  • Conversations with Giants in Medicine
  • Video Abstracts
  • In-Press Preview
  • Clinical Research and Public Health
  • Research Letters
  • Letters to the Editor
  • Editorials
  • Commentaries
  • Editor's notes
  • Reviews
  • Viewpoints
  • 100th anniversary
  • Top read articles
  • About
  • Editors
  • Consulting Editors
  • For authors
  • Publication ethics
  • Publication alerts by email
  • Advertising
  • Job board
  • Contact
Top
  • View PDF
  • Download citation information
  • Send a comment
  • Terms of use
  • Standard abbreviations
  • Need help? Email the journal
  • Top
  • Solving a fundamental challenge
  • DeepMind, AI, and protein folding
  • Success involved multiple factors
  • AlphaFold’s impact
  • Concluding remarks
  • Footnotes
  • References
  • Version history
  • Article usage
  • Citations to this article

Advertisement

Viewpoint Open Access | 10.1172/JCI174915

AlphaFold developers Demis Hassabis and John Jumper share the 2023 Albert Lasker Basic Medical Research Award

Tobin R. Sosnick

Department of Biochemistry and Molecular Biology, Pritzker School of Molecular Engineering, Institute for Biophysical Dynamics, University of Chicago, Chicago, Illinois, USA.

Address correspondence to: Tobin R. Sosnick, Department of Biochemistry and Molecular Biology, Pritzker School of Molecular Engineering, Institute for Biophysical Dynamics, University of Chicago, 929 E. 57th St., GCIS Room W101C, Chicago, Illinois, 60637 USA. Phone: 773.218.5950; Email: trsosnic@uchicago.edu.

Find articles by Sosnick, T. in: PubMed | Google Scholar

Published September 21, 2023 - More info

Published in Volume 133, Issue 19 on October 2, 2023
J Clin Invest. 2023;133(19):e174915. https://doi.org/10.1172/JCI174915.
© 2023 Sosnick This work is licensed under the Creative Commons Attribution 4.0 International License. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
Published September 21, 2023 - Version history
View PDF

The 2023 Albert Lasker Basic Medical Research Award has been given to Demis Hassabis and John Jumper of DeepMind for the invention of AlphaFold, the artificial intelligence (AI) system that solved the long-standing challenge of predicting the three-dimensional (3D) structure of proteins from the one-dimensional (1D) sequence of their amino acids.

The expression “I alphafolded it” is now something I hear almost every day, whether in the lab, during a thesis defense, or at a scientific conference. This transformation of a noun into a verb, akin to the phrase, “I googled it,” mirrors the revolution that has occurred in biological sciences over the past two and a half years. This change began with the announcement on November 30, 2020 (1), of the success of DeepMind’s AI program AlphaFold, which was developed by a team run by Drs. Hassabis and Jumper (2). They essentially solved Nobel Prize winner Christian Anfinsen’s six-decade-old grand challenge in biophysics — the prediction of the structure of a protein based solely on its primary amino acid sequence (3). The full impact of this development began when AlphaFold and similar programs (4) were made widely accessible in July 2021, fundamentally altering the landscape of biological research.

Knowledge of a protein’s atomic level structure is essential for a meaningful understanding of how it functions. Since virtually every biological process involves proteins, this knowledge is critical. These biomolecules can be enzymes, antibodies, or motors found in our muscles, participating in signaling, sensing, motility, immune response, translation, compartmentalization, and regulation. However, the determination of even a single protein structure can be highly laborious. The AI-based breakthrough of providing millions of structures with an accuracy likely comparable to that of experimental methods (5) greatly advances scientific and medical studies on numerous fronts, including areas ranging from drug discovery to protein design and engineering. The paradigm-changing shift is the expectation that an AI-generated structure is available and likely to be as accurate as an experimental structure or at least accurate enough that the prediction can be applied to interpret data or influence further research.

Solving a fundamental challenge

Not surprisingly, considerable effort has been put into developing prediction methods for protein structure. To this end, John Moult and Krzysztof Fidelis founded the Critical Assessment of Structure Prediction (CASP) in 1994 (6). CASP’s approach, which balanced cooperativity with competition, provided a rigorous, blind format where the field could test different methods. More than 100 research groups would participate in each biannual round of CASP, resulting in considerable advances. However, obtaining a reliable atomic level structure for all but the smallest proteins typically required the knowledge of a structure of a structurally similar (homologous) protein acting as a template.

Results (once corrected for a larger database) largely plateaued by the 11th round of CASP in 2014 (referred to as “CASP11”) (7). Earlier, it had been recognized that by mining the multiple sequence alignments (MSAs) for correlations between residues that are far apart in the 1D sequence, one could identify residues that are likely to be near one another spatially (8). In CASP11, David Baker’s lab employed this strategy to accurately predict a structure without using a template (7). In CASP12, this strategy (along with other improvements) resulted in a noticeable increase in accuracy (9). While impressive, the methods still did not have the accuracy to replace experimental structure determination. However, this progress highlighted that tremendous information content exists in the sequence correlations, and millions of sequences were becoming available through large-scale sequencing projects that allowed for the creation of accurate MSAs (10). As a result, high-accuracy structure prediction appeared achievable if augmented with sequencing data.

DeepMind, AI, and protein folding

Meanwhile, Dr. Hassabis, a renowned polymath with expertise in computer science and neuroscience, as well as game design (being an expert game player himself), cofounded DeepMind in 2011. DeepMind developed AlphaGo, an AI program that beat the world Go champion in 2016, an achievement that is considered one of the major successes of AI (11). Part of AlphaGo’s success can be attributed to the fact that it had access to a large data set to train on including thousands of human expert games and then millions of self-play games.

Protein structure prediction similarly incorporated Dr. Hassabis’s three major elements that made it an attractive problem for AI (12). First, protein folding is characterized as a massive search problem. Second, the Protein Data Bank (PDB) provides over 100,000 structures for training (12). Third, success in prediction is easily quantifiable by the similarity between the prediction and the known structure, using specific measures such as backbone root-mean-square deviation (RMSD).

Dr. Jumper joined DeepMind in 2017 to work on structure prediction, utilizing his expertise in protein folding. Previously, Dr. Jumper had been a graduate student at the University of Chicago from 2011 to 2017, where he was coadvised by Karl Freed and me. During his studies, Dr. Jumper developed both an extremely fast and clever molecular dynamics (MD) method and a new machine learning approach for simultaneously training all the energy function parameters (13). In the three years prior to graduate school, he worked for D. E. Shaw Research, a research company recognized for its work in advancing MD hardware for longtime protein simulations.

Dr. Jumper was part of the DeepMind team that participated in CASP13 in 2018. The team, along with other groups, applied AI methods that were becoming increasingly powerful (14). DeepMind’s original version of AlphaFold outperformed the other methods in the template-free category (14), but it did not yet achieve the 1 Å accuracy needed to transform people’s view of protein structure prediction. The DeepMind structure prediction group, now being led by Dr. Jumper, accomplished that feat in CASP14 in 2020 (1).

Success involved multiple factors

To achieve this success, they developed a whole new AI architecture, recognizing that they were predicting protein structures going from a 1D sequence to a 3D physical structure (2, 15). Their approach, along with the inclusion of physics and evolutionary information, was designed into AlphaFold at many stages. Rather than operating a one-directional pipeline, the various algorithmic steps communicate back and forth with each other; for example, an MSA is used to help generate an initial model, which then is used to improve the MSA, which then improves the model, in an iterative manner.

Other factors contributed to AlphaFold’s success. A one-by-one deletion of the various elements in the prediction pipeline produced only a slight decrease in the performance (2). Surprisingly, even the lack of a template had a minimal effect on performance. However, accuracy decreased notably when the MSA was composed of less than approximately 30 sequences. Overall, it has been suggested that AlphaFold was approximately 5–6 years ahead of existing methods (16).

While the physical principles of protein structure are included throughout AlphaFold, there are instances where biophysics was sacrificed for expediency (2, 15). Fundamentally, proteins are polymers, which governs many of their biophysical properties. Nevertheless, this property was not required in the early search stages of the pipeline. Rather, the protein starts as a “residue gas” where each amino acid monomer is free to move as an independent unit, with the backbone connectivity being realized only after the units have found their native positions. This procedure enables the residues to quickly obtain their correct spatial location without needing to deal with the chain’s connectivity, a property that can create substantial barriers when the chain tries to pass through itself. This connectivity-free strategy is nonphysical but is highly efficient from the perspective of global optimization.

Very commendably, DeepMind made the AlphaFold code publicly available at the time of the original 2021 publication (2) and created a database that provides open access to more than 200 million structures that essentially covers the space of all folds (5). This has been tremendously beneficial, as every research biologist has instant access to a prediction, structural biologists can run the program themselves, and experts can study the algorithm as well as improve it or add capabilities. This generosity has further increased AlphaFold’s impact and rapid adoption by the research community.

AlphaFold’s impact

AlphaFold’s influence goes well beyond the prediction of the structure of individual proteins. A potentially more important area is the prediction of protein-protein interactions. Although not designed for this task, AlphaFold was adapted to docking (17) by simply creating a hypothetical single-chain protein with a chain break between the two partners, a technique employed by another highly successful AI method that was developed shortly afterward (4). Not surprisingly, AlphaFold’s performance can be improved when explicitly trained for docking, as seen in “AlphaFold-Multimer” (18). Success in docking prediction is likely due partially to the simultaneous prediction of folding and docking, which helps resolve the induced fit problem (19).

AlphaFold also works on membrane proteins (20), protein design (21), and protein-ligand interactions, which can facilitate drug development (22). AlphaFold provides a very useful residue-level confidence score to assist the user. Fortuitously, regions that are predicted with low confidence can be used to predict intrinsically disordered regions or regions that fold upon binding, both of which often have biological significance (23). AlphaFold can improve experimental structure determination by fitting its atomic level models into low-resolution data, such as those obtained from the nuclear pore complex (24).

While the structure prediction challenge is effectively solved, many aspects of protein folding, such as dynamics and thermodynamics, remain active areas of research (25). It will be fascinating to see how AI methods impact these fields. Researchers may increasingly be faced with the intellectual dilemma of deciding whether they would rather use AI methods and obtain a more accurate solution for a particular problem, or use methods based on physical principles but that may provide (so far) a less accurate answer.

Concluding remarks

Following the July 2021 online publication of AlphaFold, I sent my colleagues an email with the subject line “Revolution in structural biology” and text “…I believe the impact of this work may rival that of genomics and sequencing as the tools start to be applied to protein-protein interactions, membrane protein structure prediction, protein design, and drug discovery… We shall see.” Two years later, I can confidently say that we have seen, and it is clear: AlphaFold has transformed biological and biomedical research for the better.

Footnotes

Conflict of interest: The author has declared that no conflict of interest exists.

Copyright: © 2023, Sosnick et al. This is an open access article published under the terms of the Creative Commons Attribution 4.0 International License.

Reference information: J Clin Invest. 2023;133(19):e174915. https://doi.org/10.1172/JCI174915.

References
  1. Center C. Artificial intelligence solution to a 50-year-old science challenge could ‘revolutionise’ medical research. https://predictioncenter.org/casp14/doc/CASP14_press_release.html Accessed August 18, 2023.
  2. Jumper J, et al. Highly accurate protein structure prediction with AlphaFold. Nature. 2021;596(7873):583–589.
    View this article via: CrossRef PubMed Google Scholar
  3. Anfinsen CB, et al. The kinetics of formation of native ribonuclease during oxidation of the reduced polypeptide chain. Proc Natl Acad Sci U S A. 1961;47(9):1309–1314.
    View this article via: CrossRef PubMed Google Scholar
  4. Baek M, et al. Accurate prediction of protein structures and interactions using a three-track neural network. Science. 2021;373(6557):871–876.
    View this article via: CrossRef PubMed Google Scholar
  5. Varadi M, et al. AlphaFold protein structure database: massively expanding the structural coverage of protein-sequence space with high-accuracy models. Nucleic Acids Res. 2022;50(d1):D439–D444.
    View this article via: CrossRef PubMed Google Scholar
  6. Moult J, et al. A large-scale experiment to assess protein structure prediction methods. Proteins. 1995;23(3):ii–v.
    View this article via: CrossRef PubMed Google Scholar
  7. Moult J, et al. Critical assessment of methods of protein structure prediction: progress and new directions in round XI. Proteins. 2016;84(suppl 1):4–14.
    View this article via: CrossRef PubMed Google Scholar
  8. Morcos F, et al. Direct-coupling analysis of residue coevolution captures native contacts across many protein families. Proc Natl Acad Sci U S A. 2011;108(49):E1293–E1301.
    View this article via: CrossRef PubMed Google Scholar
  9. Abriata LA, et al. Assessment of hard target modeling in CASP12 reveals an emerging role of alignment-based contact prediction methods. Proteins. 2018;86(suppl 1):97–112.
    View this article via: CrossRef PubMed Google Scholar
  10. Ovchinnikov S, et al. Protein structure determination using metagenome sequence data. Science. 2017;355(6322):294–298.
    View this article via: CrossRef PubMed Google Scholar
  11. Silver D, et al. Mastering the game of Go with deep neural networks and tree search. Nature. 2016;529(7587):484–489.
    View this article via: CrossRef PubMed Google Scholar
  12. Hassabis D. Using AI to Accelerate Scientific Discovery. https://cbmm.mit.edu/video/using-ai-accelerate-scientific-discovery Accessed August 18, 2023.
  13. Jumper JM, et al. Trajectory-based training enables protein simulations with accurate folding and Boltzmann ensembles in cpu-hours. PLoS Comput Biol. 2018;14(12):e1006578.
    View this article via: CrossRef PubMed Google Scholar
  14. Xu J, Wang S. Analysis of distance-based protein structure prediction by deep learning in CASP13. Proteins. 2019;87(12):1069–1081.
    View this article via: CrossRef PubMed Google Scholar
  15. Jumper J, Hassabis D. Protein structure predictions to atomic accuracy with AlphaFold. Nat Methods. 2022;19(1):11–12.
    View this article via: CrossRef PubMed Google Scholar
  16. Al Quraishi M. Some Thoughts on a Mysterious Universe. https://moalquraishi.wordpress.com/ Accessed August 18, 2023.
  17. Bryant P, et al. Improved prediction of protein-protein interactions using AlphaFold2. Nat Commun. 2022;13(1):1265.
    View this article via: CrossRef PubMed Google Scholar
  18. Evans R, et al. Protein complex prediction with AlphaFold-Multimer [preprint]. https://doi.org/10.1101/2021.10.04.463034 Posted on bioRxiv March 10, 2022.
  19. Faruk NF, et al. Challenges and advantages of accounting for backbone flexibility in prediction of protein-protein complexes. J Chem Theory Comput. 2022;18(3):2016–2032.
    View this article via: CrossRef PubMed Google Scholar
  20. Hegedus T, et al. Ins and outs of AlphaFold2 transmembrane protein structure predictions. Cell Mol Life Sci. 2022;79(1):73.
    View this article via: CrossRef PubMed Google Scholar
  21. Goverde CA, et al. De novo protein design by inversion of the AlphaFold structure prediction network. Protein Sci. 2023;32(6):e4653.
    View this article via: CrossRef PubMed Google Scholar
  22. Wong F, et al. Benchmarking AlphaFold-enabled molecular docking predictions for antibiotic discovery. Mol Syst Biol. 2022;18(9):e11081.
    View this article via: CrossRef PubMed Google Scholar
  23. Piersimoni L, et al. Lighting up Nobel Prize-winning studies with protein intrinsic disorder. Cell Mol Life Sci. 2022;79(8):449.
    View this article via: CrossRef PubMed Google Scholar
  24. Fontana P, et al. Structure of cytoplasmic ring of nuclear pore complex by integrative cryo-EM and AlphaFold. Science. 2022;376(6598):eabm9326.
    View this article via: CrossRef PubMed Google Scholar
  25. Chen SJ, et al. Opinion: protein folds vs. protein folding: differing questions, different challenges. Proc Natl Acad Sci U S A. 2023;120(1):e2214423119.
    View this article via: CrossRef PubMed Google Scholar
Version history
  • Version 1 (September 21, 2023): Electronic publication
  • Version 2 (September 28, 2023): Electronic publication

Article tools

  • View PDF
  • Download citation information
  • Send a comment
  • Terms of use
  • Standard abbreviations
  • Need help? Email the journal

Related Collection:

2023 Lasker Awards
  • Optical coherence tomography: when a picture is worth a million words
    Simone Tzaridis et al.

Related posts

  • The 2023 Lasker Awards

Metrics

  • Article usage
  • Citations to this article

Go to

  • Top
  • Solving a fundamental challenge
  • DeepMind, AI, and protein folding
  • Success involved multiple factors
  • AlphaFold’s impact
  • Concluding remarks
  • Footnotes
  • References
  • Version history
Advertisement
Advertisement

Copyright © 2025 American Society for Clinical Investigation
ISSN: 0021-9738 (print), 1558-8238 (online)

Sign up for email alerts