Interviews with voice technology professionals, boosting your knowledge and technical skills. Voice interfaces, voice assistants, data science, machine learning and much more. Hosted by Carl Robinson. Subscribe now!TAGS:
Bryan Colligan is the co-founder of AlphaVoice, the easiest way to get your podcast and audio content onto Amazon Alexa and Google Assistant. In an especially fun conversation, Bryan shares his vision of how the podcasting and voice technology worlds are set to collide, and how the implications will be felt by platforms, content producers and consumers alike.
He reveals the 4 main business models of content producers, explains why voice is the tech wave that will dominate, and foretells the im...TAGS:
This is the second part of my conversation with Dogac Basaran, a post-doctoral researcher at CNRS, the French national scientific research centre. If you missed the first part, you might want to go back and listen to the previous episode on Signal Processing Basics for Audio.
Today, in part 2 of 2, we explore Dogac's research into audio fingerprinting, alignment, and melody extraction. By analysing the magnitude of frequency peaks and their relative spacing, Dogac shows us how it's possible to...TAGS:
Dogac Basaran is a post-doctoral researcher at CNRS, the French national scientific research centre. Today, in part 1 of 2, Dogac gives us a crash course in signal processing, where we learn what signal processing is and discover some of its many applications.
Leveraging his teaching experience, Dogac uses simple language and real-world examples to explain the fundamental signal processing concepts that are used in voice technology today. He defines frequency, period, and stability, and descr...TAGS:
Pablo Arias is a final-year PhD student in perception and cognitive science at the audio research lab, IRCAM, in Paris. We discuss Pablo's work on how people perceive smiling and non-smiling voices.
First Pablo explains what cognitive science, neuroscience and perception are, and why research into these areas is so important. He then takes us through the aims, methods, and results of his latest research paper into smiling in the voice, and we discuss the academic and technological implication...TAGS:
Benjamin Etienne is a data scientist at Rogervoice, a mobile app that allows deaf and hard-of-hearing people to use the telephone. Ben shares his inspirational story about how he taught himself data science and machine learning in the evenings, so he could work in a more technical role. He tells us why he's not keen on Kaggle competitions, and why getting a job in data science is the best way to master it.
Ben introduces us to the challenges faced by the deaf and hard-of-hearing community, and...TAGS:
Greg Beller is the Head of the Interfaces Research and Creation team the leading audio research laboratory IRCAM in France. He is also the founder of SYNEKINE, a live entertainment company which mixes art and science in the spirit of research.
We explore the relationship between sound and physical space, and the link between our voices and our gestures. Greg explains what prosody is and its importance in speech and communication. He then demonstrates a number of technological art installations...TAGS:
In this episode I talk with Charles Cadbury, owner of the London-based technology consultancy, Champers Advisory, about his experience building voice applications, and the fascinating future of voice technology. He was great fun to talk to, and had plenty of surprising facts and interesting stories to share. You're going to really enjoy listening to this episode!
Charles has built or consulted on over 350 digital products for his clients over the years, covering web, social, mobile, and and no...TAGS:
This episode covers 8 of the most interesting voice startups that I found at the Vivatech technology conference in Paris, France.
Included in this episode is a voice transcription and synthesis mobile app for the hard of hearing, a voice enabled smart alarm clock that can monitor your sleep quality, a robot behaviour system that delivers CMS content in person, and a comprehensive voice assistant platform that can handle multiple requests in a single query.
John Fitzpatrick is the VP of Product & Engineering at Voysis, a leading voice technology company that builds custom Voice AI solutions for businesses. They are currently focused on the ecommerce vertical, helping to voice-enable mobile apps and websites to augment the shopping experience.
In our conversation we cover a range of topics including the major components of the Voysis system, the technologies and tools John's team used to build it, and the challenges they faced. We also discuss...TAGS:
Eric Bolo is the CTO of Batvoice Technologies, a speech analytics startup based in Paris, France. Eric talks about building a custom speech-to-text system for their flagship product, Call Watch.
He introduces us to speech analytics and audio-mining, and describes some typical applications. We go into detail about speech-to-text (STT) technologies, and discuss the pros and cons of using cloud STT services such as Google speech versus building a custom STT system yourself.
Eric tells us about the...TAGS: