New Dataset for AI-Enabled Sign Language Translation

thumbnail
New Dataset for AI-Enabled Sign Language Translation

Arrangement of outlines of human head, technological and fractal elements on the subject of artificial intelligence, computer science and future technologies

The dataset will allow more automatic sign language understanding and translation. These technologies could be applied to applications such as virtual assistants and robotics.

Apr 8, 2021

Artificial intelligence (AI) is helping humans save, parse, and learn language. With a new dataset, researchers and developers could get a massive boost developing technologies for the deaf community.

The How2Sign Dataset

The dataset includes over 80 hours of videos showing sign language interpreters translating a variety of tutorials. Amanda Duarte, a researcher in the Emerging Technologies for Artificial Intelligence group at the Barcelona Supercomputing Center (BSC), spent two years recording these videos and preparing the data.

Duarte also made use of Carnegie’s Carnegie Mellon University’s Panoptic Studio, a state-of-the-art dome-shaped studio that allowed researchers to video translators and reconstruct their movements in 3D.

(Source: Barcelona Supercomputing Center)

Thanks to Duarte, How2Sign provides a public resource for researchers in natural language processing and computer vision, helping usher in a new era of deaf and hard of hearing enabled products and services. Making the internet more accessible is a huge goal, and one of the first applications is software that transfers signs from one user to another.

The dataset provides a valuable resource for researchers and developers to design quality technology that considers the needs of the deaf community. Artificial intelligence requires computation and algorithms capability, but it also requires data.

Advertisement

Future accessibility projects

Duarte, INPhiNIT doctoral student of the “la Caixa” Foundation, has received funding from several sources — Facebook AI, the “la Caixa” Foundation, as well as the collaboration of the Image Processing Group of the Universitat Politècnica de Catalunya (UPC), Carnegie Mellon University and Gallaudet University — to make this dataset happen.

The dataset will allow more automatic sign language understanding and translation. These technologies could expand to application areas such as virtual assistants, robotics, and other emerging technologies.

Duarte will present the new resource at the CVPR 2021 conference later this summer. Her work and the dataset are currently ongoing, expanding and improving the data repository. The more the dataset expands, the more accessible technology will become.

thumbnail
Elizabeth Wallace

Elizabeth Wallace is a Nashville-based freelance writer with a soft spot for data science and AI and a background in linguistics. She spent 13 years teaching language in higher ed and now helps startups and other organizations explain - clearly - what it is they do.

Recommended for you...

Domain-Specific LLMs: How to Make AI Useful for Your Business
Hardik Parikh
Mar 11, 2026
The State of the Neoclouds Market
Why Agentic AI Projects Are Getting Canceled (And How You Can Save Yours)
Akhil Verghese
Mar 2, 2026
Will Your Organization Take the Quantum Leap in 2026? Read This First.
David McNeely
Feb 26, 2026

Featured Resources from Cloud Data Insights

Domain-Specific LLMs: How to Make AI Useful for Your Business
Hardik Parikh
Mar 11, 2026
Engineering the Agentic Enterprise: Building Smarter, Adaptive, Autonomous Systems
Varun Goswami
Mar 10, 2026
The AI That Actually Scales Is Boring. That’s the Point.
Jared Coyle
Mar 9, 2026
Real-time Analytics News for the Week Ending March 7
RT Insights Logo

Analysis and market insights on real-time analytics including Big Data, the IoT, and cognitive computing. Business use cases and technologies are discussed.

Property of TechnologyAdvice. © 2026 TechnologyAdvice. All Rights Reserved

Advertiser Disclosure: Some of the products that appear on this site are from companies from which TechnologyAdvice receives compensation. This compensation may impact how and where products appear on this site including, for example, the order in which they appear. TechnologyAdvice does not include all companies or all types of products available in the marketplace.