SHARE
Facebook X Pinterest WhatsApp

Local AI: Lower-Power Analytics for the Smartphone Era

thumbnail
Local AI: Lower-Power Analytics for the Smartphone Era

Analytics technology illustration concept with blue text

Untethering compute from the cloud allows the broadening of AI’s reach. And it speeds up response time by reducing the lag caused by communicating with distant servers.

Written By
thumbnail
Joe McKendrick
Joe McKendrick
Nov 7, 2021

Artificial intelligence has a bottleneck problem. It’s based on deep neural networks that can require hundreds of millions to billions of calculations — a processing and energy-intensive undertaking. Then there’s the cost and latency of shuffling data to and from memory to perform these and other analytics computations. Enter Vivienne Sze, associate professor at MIT, known for her role in developing video compression standards still in use today. Now, she is focusing on designing more-efficient deep neural networks to process video, and more-efficient hardware to run AI applications on smartphones, embedded devices, tiny robots, smart homes, and medical devices.

In a recent interview at MIT, she explains why we need low-power AI now. “AI applications are moving to smartphones, tiny robots, and internet-connected appliances and other devices with limited power and processing capabilities. The challenge is that AI has high computing requirements. Applying analytics to sensor and camera data from a self-driving car can consume about 2,500 watts, but the computing budget of a smartphone is just about a single watt.”

Localizing AI on small devices such as smartphones “means that the data processing no longer has to take place in the cloud, on racks of warehouse servers,” Sze says. “Untethering compute from the cloud allows us to broaden AI’s reach. It speeds up response time by reducing the lag caused by communicating with distant servers. This is crucial for interactive applications like autonomous navigation and augmented reality, which need to respond instantaneously to changing conditions. Processing data on the device can also protect medical and other sensitive records. Data can be processed right where they’re collected.”

From a hardware perspective, Sze seeks to “reuse data locally rather than send them off-chip. Storing reused data on-chip makes the process extremely energy-efficient.” On the software side, Sze is designing “pruning” into algorithmic code to remove energy-intensive “weights” from a deep network, along with other adaptations. A potential application she is looking at is eye-movement tracking to help diagnose neurodegenerative disorders that can be done with an ordinary smartphone from patients’ homes, versus the expensive in-office equipment typically required up until now.

thumbnail
Joe McKendrick

Joe McKendrick is RTInsights Industry Editor and industry analyst focusing on artificial intelligence, digital, cloud and Big Data topics. His work also appears in Forbes an Harvard Business Review. Over the last three years, he served as co-chair for the AI Summit in New York, as well as on the organizing committee for IEEE's International Conferences on Edge Computing. (full bio). Follow him on Twitter @joemckendrick.

Recommended for you...

Real-time Analytics News for the Week Ending February 21
Why Intelligence Without Authority Cannot Deliver Enterprise Value
Harsha Kumar
Feb 17, 2026
Real-time Analytics News for the Week Ending February 14
The Foundation Before the Speed: Preparing for Real-Time Analytics

Featured Resources from Cloud Data Insights

Is AI Coming for Small Business? How Small Businesses are Approaching AI Integration
Brian Aagaard
Feb 23, 2026
Real-time Analytics News for the Week Ending February 21
Submarine Fiber-Optic Cables: The Hidden Infrastructure Powering Global Digital Economies
Sharat Sinha
Feb 21, 2026
The Secret to Managing Cloud Provider Risk While Remaining Innovative
John Bruce
Feb 20, 2026
RT Insights Logo

Analysis and market insights on real-time analytics including Big Data, the IoT, and cognitive computing. Business use cases and technologies are discussed.

Property of TechnologyAdvice. © 2026 TechnologyAdvice. All Rights Reserved

Advertiser Disclosure: Some of the products that appear on this site are from companies from which TechnologyAdvice receives compensation. This compensation may impact how and where products appear on this site including, for example, the order in which they appear. TechnologyAdvice does not include all companies or all types of products available in the marketplace.