Interviews with voice technology professionals, boosting your knowledge and technical skills. Voice interfaces, voice assistants, data science, machine learning and much more. Hosted by Carl Robinson. Subscribe now!TAGS:
Florian Eyben is the CTO of audEERING, an audio analysis company that specialises in emotional artificial intelligence. Florian is a leading expert in voice emotion analytics, machine learning and signal processing, and our conversation is a great introduction to harnessing emotion in the voice. We discuss many of the incredible applications of voice emotion analytics, and dive into some projects carried out by audEERING for major brands in market research, call centers, social robotics, heal...TAGS:
Joshua Montgomery is the CEO of Mycroft, the open source voice assistant. Mycroft is a much needed addition to the range of big tech voice assistants on the market, offering a new level of data privacy and customisation.
Their MkII device will be launched soon, and is available for pre-order now on IndieGogo. Josh describes how the MkII improves over previous iterations, takes us through the technical stack, and explains how you can start developing voice applications for the Mycroft ecosyste...TAGS:
Syed Ahmed is a Developer Advocate at PubNub, a global Data Stream Network and real-time infrastructure-as-a-service company based in San Francisco, California. PubNub's products allow developers to build realtime web, mobile, voice and IoT applications. Low network latency is especially important for voice enabled applications that integrate with IoT devices, as any delay during a conversational interaction can affect usability and frustrate users.
In this episode, Syed explains how PubNub s...TAGS:
Sebastian Hanfland is the CEO of the audio branding consultancy, Hanfland and Friends. His team help companies define their their brand in audio form, in order to promote brand recognition, increase the perceived quality of products, and influence customers.
Sebastian explains what audio branding is, and how he selects the right voices and sounds for a brand. We explore many interesting examples of audio branding for products, customer service, workplaces and more, and hear a demo of their la...TAGS:
Bryan Colligan is the co-founder of AlphaVoice, the easiest way to get your podcast and audio content onto Amazon Alexa and Google Assistant. In an especially fun conversation, Bryan shares his vision of how the podcasting and voice technology worlds are set to collide, and how the implications will be felt by platforms, content producers and consumers alike.
He reveals the 4 main business models of content producers, explains why voice is the tech wave that will dominate, and foretells the im...TAGS:
This is the second part of my conversation with Dogac Basaran, a post-doctoral researcher at CNRS, the French national scientific research centre. If you missed the first part, you might want to go back and listen to the previous episode on Signal Processing Basics for Audio.
Today, in part 2 of 2, we explore Dogac's research into audio fingerprinting, alignment, and melody extraction. By analysing the magnitude of frequency peaks and their relative spacing, Dogac shows us how it's possible to...TAGS:
Dogac Basaran is a post-doctoral researcher at CNRS, the French national scientific research centre. Today, in part 1 of 2, Dogac gives us a crash course in signal processing, where we learn what signal processing is and discover some of its many applications.
Leveraging his teaching experience, Dogac uses simple language and real-world examples to explain the fundamental signal processing concepts that are used in voice technology today. He defines frequency, period, and stability, and descr...TAGS:
Pablo Arias is a final-year PhD student in perception and cognitive science at the audio research lab, IRCAM, in Paris. We discuss Pablo's work on how people perceive smiling and non-smiling voices.
First Pablo explains what cognitive science, neuroscience and perception are, and why research into these areas is so important. He then takes us through the aims, methods, and results of his latest research paper into smiling in the voice, and we discuss the academic and technological implication...TAGS:
Benjamin Etienne is a data scientist at Rogervoice, a mobile app that allows deaf and hard-of-hearing people to use the telephone. Ben shares his inspirational story about how he taught himself data science and machine learning in the evenings, so he could work in a more technical role. He tells us why he's not keen on Kaggle competitions, and why getting a job in data science is the best way to master it.
Ben introduces us to the challenges faced by the deaf and hard-of-hearing community, and...TAGS:
Greg Beller is the Head of the Interfaces Research and Creation team the leading audio research laboratory IRCAM in France. He is also the founder of SYNEKINE, a live entertainment company which mixes art and science in the spirit of research.
We explore the relationship between sound and physical space, and the link between our voices and our gestures. Greg explains what prosody is and its importance in speech and communication. He then demonstrates a number of technological art installation...TAGS:
In this episode I talk with Charles Cadbury, owner of the London-based technology consultancy, Champers Advisory, about his experience building voice applications, and the fascinating future of voice technology. He was great fun to talk to, and had plenty of surprising facts and interesting stories to share. You're going to really enjoy listening to this episode!
Charles has built or consulted on over 350 digital products for his clients over the years, covering web, social, mobile, and and n...TAGS:
This episode covers 8 of the most interesting voice startups that I found at the Vivatech technology conference in Paris, France.
Included in this episode is a voice transcription and synthesis mobile app for the hard of hearing, a voice enabled smart alarm clock that can monitor your sleep quality, a robot behaviour system that delivers CMS content in person, and a comprehensive voice assistant platform that can handle multiple requests in a single query.
Links from the show:
John Fitzpatrick is the VP of Product & Engineering at Voysis, a leading voice technology company that builds custom Voice AI solutions for businesses. They are currently focused on the ecommerce vertical, helping to voice-enable mobile apps and websites to augment the shopping experience.
In our conversation we cover a range of topics including the major components of the Voysis system, the technologies and tools John's team used to build it, and the challenges they faced. We also discus...TAGS:
Eric Bolo is the CTO of Batvoice Technologies, a speech analytics startup based in Paris, France. Eric talks about building a custom speech-to-text system for their flagship product, Call Watch.
He introduces us to speech analytics and audio-mining, and describes some typical applications. We go into detail about speech-to-text (STT) technologies, and discuss the pros and cons of using cloud STT services such as Google speech versus building a custom STT system yourself.
Eric tells us about t...TAGS: