Dimitri Kanevsky, a research scientist at Google with an extensive background in mathematics, knows the impact technology can have when built with accessibility in mind. Having lost his hearing in early childhood, he imagines a world where technology can make it easier for people who are deaf or hard of hearing to be a part of everyday, in-person conversations with hearing people, whether that's ordering coffee at a cafe, conversing with coworkers or checking out at the grocery store.
Dimitri has been turning that idea into a reality. He co-created Live Transcribe, our speech-to-text technology, which launched in 2019 and is now used daily by over a million people to communicate — including Dimitri. He works closely with the team to develop new and helpful features — like an offline mode that will be launching in the coming weeks to give people access to real-time captions even when Wi-Fi and data are unavailable.
For World Hearing Day, we talked with Dimitri about his work, why building for everyone matters and the future of accessible technology.
Tell us more about your background and job at Google.
When I moved to the U.S. in 1984, there were no transcription services. I wanted to change that, so I focused my work on optimizing speech and language recognition to help people who are deaf or hard of hearing.
I eventually moved from academia to Google’s speech recognition team in 2014. The work my team and I accomplished allowed us to create practical applications — like Live Transcribe and Live Caption.
How has your personal experience shaped your career?
I completely lost my hearing when I was one. I learned to lipread well so I could communicate with other students and teachers. My family was also very helpful to me. When I switched to a school where my father taught, he made sure I was in a class with children I knew so it was a smoother transition.
But in eighth grade, I moved to a math school with new teachers and students and was unable to lipread what they taught in class or communicate with my new classmates. I sat, day after day, not understanding the material they were teaching and had to teach myself from textbooks. If I had a tool like Live Transcribe when I was growing up, my experience would have been very different.
In what ways has assistive technology — like Live Transcribe — changed your experience today?
Technology provides tremendous opportunities to help people with disabilities — I know this firsthand.
I use Live Transcribe every day to communicate with others. I use it to play games and share stories with my twin granddaughters, which is life-changing. And just last week, I gave a lecture at a mathematical seminar at Johns Hopkins University. During it, I could interact with the audience and answer questions. Without Live Transcribe, that would have been very difficult for me to do.
I used to rely heavily on lipreading for day-to-day tasks, but when people wear masks I can't do that — I don't even know when someone who's wearing a mask is talking to me. Because of this, Live Transcribe is even more important to me — especially when at stores, riding public transit or visiting a doctor.
What are you excited about when you think about speech recognition technology ten years from now?
My dream is to use speech recognition technology to help people communicate. As technology advances, it will unlock new possibilities — such as transcribing speech even as people switch languages, understanding people with all accents and speech motor skills, indicating more sound events with visual symbols and automatically integrating sign recognition or additional haptic feedback technologies.
Further in the future, I hope to see an experience where people are no longer dependent on a mobile phone to see transcriptions. Perhaps transcriptions will be available in convenient wearable eye technologies or appear on a wall when someone looks at it. One prediction is that there will be no mobile phones at all, since everything around us, including our walls, will act as a mobile device when people need it to.
What do you want others to learn from World Hearing Day?
According to the WHO, one in ten people will experience hearing loss by 2050. Still, many people with hearing loss don’t know about novel speech recognition technologies that could help them communicate, and many hearing people aren’t aware of these tools either.
World Hearing Day is an opportunity to make everybody aware of the needs of people with hearing loss and the technology that everyone can use to have a tremendous impact on their lives.
by Sagar Savla, Google Research, via The Keyword