New Dataset for AI-Enabled Sign Language Translation

The dataset supports progress in automatic sign language understanding and translation. These technologies could be applied in areas such as virtual assistants and robotics.

Apr 8, 2021

Artificial intelligence (AI) is helping humans preserve, parse, and learn languages. A new dataset could give researchers and developers a significant boost in building technologies for the deaf community.

The How2Sign Dataset

The dataset includes over 80 hours of videos showing sign language interpreters translating a variety of tutorials. Amanda Duarte, a researcher in the Emerging Technologies for Artificial Intelligence group at the Barcelona Supercomputing Center (BSC), spent two years recording these videos and preparing the data.

Duarte also made use of Carnegie Mellon University’s Panoptic Studio, a state-of-the-art dome-shaped studio that allowed researchers to film the interpreters and reconstruct their movements in 3D.

(Source: Barcelona Supercomputing Center)

Thanks to Duarte, How2Sign provides a public resource for researchers in natural language processing and computer vision, helping usher in a new era of products and services built for deaf and hard-of-hearing users. Making the internet more accessible is a major goal, and one of the first applications is software that transfers signs from one user to another.

The dataset provides a valuable resource for researchers and developers to design quality technology that considers the needs of the deaf community. Artificial intelligence requires computational and algorithmic capability, but it also requires data.

Future accessibility projects

Duarte, an INPhiNIT doctoral student of the “la Caixa” Foundation, received funding from Facebook AI and the “la Caixa” Foundation, along with the collaboration of the Image Processing Group of the Universitat Politècnica de Catalunya (UPC), Carnegie Mellon University, and Gallaudet University, to make this dataset happen.

The dataset will support further work on automatic sign language understanding and translation. These technologies could expand to application areas such as virtual assistants, robotics, and other emerging technologies.

Duarte will present the new resource at the CVPR 2021 conference later this summer. Work on the dataset is ongoing, as she continues to expand and improve the data repository. The more the dataset expands, the more accessible technology will become.

Elizabeth Wallace

Elizabeth Wallace is a Nashville-based freelance writer with a soft spot for data science and AI and a background in linguistics. She spent 13 years teaching language in higher ed and now helps startups and other organizations explain, clearly, what it is they do.
