Overview
Explore the complexities of open source voice technology in this 48-minute conference talk from linux.conf.au. Delve into the challenges faced by developers, including real-time speech-to-text limitations, machine learning hurdles, and multilingual support issues. Discover strategies for overcoming hardware constraints, improving on-device and cloud-based speech recognition, and debugging poor-fit models. Learn about the difficulties in sourcing extensive datasets, training new models, and handling language idiosyncrasies across multiple skills. Examine the intricacies of creating voice interactions that are both useful and human-like. Gain insights into the evolving landscape of open source voice technology and its impact on the Linux ecosystem.
Syllabus
Beach Wreck Ignition: Challenges in open source voice
Taught by
linux.conf.au