Exploring infrastructure for Dutch speech recognition

Due to developments in AI, the world of automatic speech recognition (ASR) is rapidly changing. New ASR systems seem to provide overwhelmingly accurate transcription of speech. But how do these systems perform under atypical conditions and in large scale applications?

ASR systems that have become available on the market recently such as Whisper, seem to provide overwhelmingly accurate transcription of speech. But how do these systems perform under atypical conditions?  For example, in the case of dialects, children or elderly speech or speech from non-native Dutch speakers? What happens if there are multiple speakers, cross talk and background noises? And, what to do if you want to transcribe very large amounts of speech data? What's the best way to handle this in a more (infra)structural way? 

In this seminar, we will show examples from different application areas and discuss practical, operational, and strategic aspects

For whom
researchers, teachers, support staff from various disciplines interested in the application of automatic speech recognition

Speech recogntion

Auteur

Reacties

Dit artikel heeft 0 reacties