ML4Audio - pyctcdecode: A simple and fast speech-to-text prediction decoding algorithm
This week the Kensho team will join us to talk about pyctcdecode
pyctcdecode is a fast and feature-rich CTC beam search decoder for speech recognition. Ask your questions in
Speakers
- Raymond Grossman: Raymond works as a machine learning engineer at Kensho Technologies, specializing in speech and natural language domains. Prior to coming to Kensho, he studied mathematics at Princeton and was an avid Kaggler under the moniker @ToTrainThemIsMyCause. LinkedIn:
- Jeremy Lopez: Jeremy is a machine learning engineer at Kensho Technologies and has worked on a variety of different topics including search and speech recognition. Before working at Kensho, he earned a PhD in experimental particle physics at MIT and continued doing physics research as a postdoc at the University of Colorado Boulder. LinkedIn:
1 view
23
8
3 years ago 00:57:26 1
ML4Audio - pyctcdecode: A simple and fast speech-to-text prediction decoding algorithm
3 years ago 00:59:08 7
ML for Audio Study Group - Text to Speech Deep Dive