- Here’s why you shouldn’t buy a Nintendo Switch until mid-August Monday 5:11 PM
- Man blasted for making his coworkers babysit his child Monday 5:07 PM
- Pete Buttigieg’s country radio interview was blocked from the air Monday 4:35 PM
- 15-year-old Smash Bros. prodigy caught using racist slur in private Discord server Monday 3:47 PM
- Instagram users who post pet pictures more likely to get hacked Monday 3:45 PM
- Post-Prime Day recap: Shipping delays, more sales, and a scam Monday 3:08 PM
- Jacob Wohl returns to Twitter … for now Monday 1:56 PM
- How to stream WWE Raw Reunion Monday 1:35 PM
- ‘I hope Trump deports you’: Woman goes on racist rant to Spanish speakers at a store Monday 1:24 PM
- Emoji Mashup Bot gives life to unidentifiable emotions Monday 1:15 PM
- Notorious grifter Anna Sorokin reportedly blocked from profiting off Netflix series Monday 12:45 PM
- Charlottesville attacker’s Twitter account included praise for Hitler Monday 12:10 PM
- ‘Short Treks’ trailer: Spock, Pike, and Number One return Monday 11:57 AM
- Everything we know about ‘Star Trek: Lower Decks,’ the new animated show Monday 11:55 AM
- Cole Carrigan says he left Team 10 after being called homophobic slur Monday 11:32 AM
For the visually impaired, an app could see the world and describe it aloud
We spoke to co-creator Alberto Rizzoli.
Restoring independence to the visually impaired? There’s an app for that.
Aipoly is a smartphone app that pairs machine vision with artificial intelligence to audibly describe whatever the smartphone’s camera “sees.”
The app was developed by Marita Cheng and Alberto Rizzoli, technologists who collaborated at Singularity University to create something that would be useful to the 285 million vision-impaired people around the world. According to Cheng and Rizzoli, two-thirds of this population will become smartphone users in the next five years.
Inspiration struck when they attended a presentation by IBM Watson Group CTO Robert High. High demonstrated some of the celebrity supercomputer’s capabilities—show it a picture and it can provide a semantic, conversational description of what’s happening in it. “We started looking into technologies to recognize images,” Rizzoli told the Daily Dot. “We learned about neural networks and integrated this into an application. It’s the simplest possible process for a user to identify an image: press a button, receive an audio description.”
The Aipoly software works by dividing an image into sections and running reverse image searches on them. It identifies the nouns in a picture—”car,” “battery,” “dog”—as well as the adjectives, like “silver” or “shiny.” Then artificial intelligence steps up to the plate to turn the computer’s understanding of the image into something for a human to digest. Audio playback might tell a visually impaired user that he is looking at “a shiny, silver car.”
The demo video shows it in action:
This is still an experimental technology. Once perfected, a visually impaired individual might be able to use this app to recognize what’s on a plate of food or to take pictures of their children to identify how they’re dressed. Rizzoli told us about one user who was passionate about cars, so they walked around a parking lot together until they successfully identified a Tesla using the app.
For now, there is some human help taking place behind the scenes to help Aipoly accurately identify images, but Rizzoli tells us it will soon be 100 percent software-based.
He has big ambitions for the future as well, and imagines using Aipoly to create something of a Google Street View for the blind. “We can build a virtual model of the world so that users don’t have to keep scanning their environment,” he said. “The info is already there, and Aipoly would one day provide them with realtime feedback.”
Rizzoli is proud of the autonomy that the app might afford to those with vision impairments. “It makes the visually impaired more independent, and it enables them to explore the world.”
Photo via Aipoly
Dylan Love is an editorial consultant and journalist whose reporting interests include emergent technology, digital media, and Russian language and culture. He is a former staff writer for the Daily Dot, and his work has been published by Business Insider, International Business Times, Men's Journal, and the Next Web.