SHARE
Facebook X Pinterest WhatsApp

DeepMind Massively Expands Protein Structure Database

thumbnail
DeepMind Massively Expands Protein Structure Database

A colorful 3D protein structure.

Researchers have said that the publication of this database, which is free to use, will help in combating malaria, antibiotic resistance, and plastic waste, as well as potentially speeding up the discovery of new drugs.

Written By
thumbnail
David Curry
David Curry
Aug 22, 2022

DeepMind has announced a huge expansion of its AlphaFold protein structure database, increasing the amount of 3D structures available from one million to over 200 million, which comprises almost all cataloged proteins known to science. 

The project was started 18 months ago, although it was first conceptualized by DeepMind in 2016. It was considered to be one of the best ways for scientists to solve the 50 year old problem of protein folding, which is the identification of each protein’s unique structure. 

SEE ALSO: Intel Launches Second-Generation Deep Learning Processors

“As someone who’s been in genomics and computational biology since the 1990s, I’ve seen many of these moments come where you can sense the landscape shifting under you and the provision of new resources, and this has been one of the fastest,” said joint director of EMBL’s European Bioinformatics Institute, Ewan Birney, to New Scientist. “I mean, two years ago, we just simply did not realize that this was feasible.”

Researchers have said that the publication of this database, which is free to use, will help in combating malaria, antibiotic resistance and plastic waste, as well as potentially speeding up the discovery of new drugs. DeepMind says it is proof that AI can have a positive impact on science. 

DeepMind’s AlphaFold team worked with CASP (Critical Assessment of protein Structure Prediction), a community forum established in 1994 with the goal of accurately predicting protein structures. AlphaFold used their and other teams research to influence the model for the database, alongside participating in the annual protein structure prediction challenge in 2018, in which the AI came first. 

What’s remarkable, for those involved, has been the speed at which AlphaFold has become superior to those with decades of experience in the field.

“AlphaFold is the singular and momentous advance in life science that demonstrates the power of AI,” said Eric Topol, Founder and Director of the Scripps Research Translational Institute. “Determining the 3D structure of a protein used to take many months or years, it now takes seconds. AlphaFold has already accelerated and enabled massive discoveries, including cracking the structure of the nuclear pore complex. And with this new addition of structures illuminating nearly the entire protein universe, we can expect more biological mysteries to be solved each day.”

This is not the first or the last time AI will be involved in the pursuit of cataloging and predicting scientific nodes and phenomenon. When the AI DeepMind and other organizations deal in, deep learning, was first being presented as a potential solution for scientific problems, it was as a tool to accelerate the massive computation needs for large projects such as protein folding. 

“It’s the most complex thing we’ve ever done. This is the reason I started DeepMind,” said DeepMind co-founder and CEO, Demis Hassabis, to MIT Technology Review. “In fact, it’s why I’ve worked my whole career in AI.”

thumbnail
David Curry

David is a technology writer with several years experience covering all aspects of IoT, from technology to networks to security.

Recommended for you...

The Rise of Autonomous BI: How AI Agents Are Transforming Data Discovery and Analysis
Why the Next Evolution in the C-Suite Is a Chief Data, Analytics, and AI Officer
Digital Twins in 2026: From Digital Replicas to Intelligent, AI-Driven Systems
Real-time Analytics News for the Week Ending December 27

Featured Resources from Cloud Data Insights

The Difficult Reality of Implementing Zero Trust Networking
Misbah Rehman
Jan 6, 2026
Cloud Evolution 2026: Strategic Imperatives for Chief Data Officers
Why Network Services Need Automation
The Shared Responsibility Model and Its Impact on Your Security Posture
RT Insights Logo

Analysis and market insights on real-time analytics including Big Data, the IoT, and cognitive computing. Business use cases and technologies are discussed.

Property of TechnologyAdvice. © 2026 TechnologyAdvice. All Rights Reserved

Advertiser Disclosure: Some of the products that appear on this site are from companies from which TechnologyAdvice receives compensation. This compensation may impact how and where products appear on this site including, for example, the order in which they appear. TechnologyAdvice does not include all companies or all types of products available in the marketplace.