tinyML Talks: The Multilingual Spoken Words Corpus, a Massive Keyword Spotting Dataset
tinyML Talks
The Multilingual Spoken Words Corpus, a Massive Keyword Spotting Dataset
Mark Mazumder , PhD Student
Harvard University
This talk will present the Multilingual Spoken Words Corpus (MSWC), a speech dataset of over 340,000 spoken words in 50 languages, with over 23 million audio examples. MSWC has many use cases, ranging from voice-enabled consumer devices to call center automation. The dataset is CC-BY licensed and free for academic research and commercial use. We will introduce applications of MSWC for few-shot keyword spotting and spoken term search tasks in low-resource languages, and share a brief tutorial on getting started with the dataset. We will also discuss how we automated the construction of our dataset and our self-supervised approach for detecting outlier samples.
1 view
31
10
2 years ago 00:10:23 18
How TinyML Gives us Spider-Man Powers | Emelie Eldracher | TEDxMIT
2 years ago 00:51:33 2
#100 Embedded Machine Learning on Edge Devices(with Daniel Situnayake)
3 years ago 00:59:44 9
tinyML Talks: Advanced Anomaly Detection Made Easy
3 years ago 01:01:09 3
tinyML Talks: Energy-Efficiency and Security for TinyML and EdgeAI: A Cross-Layer Approach
3 years ago 01:04:11 4
tinyML Talks Pakistan: FFConv: An FPGA-based Accelerator for Fast Convolution Layers in...
3 years ago 00:53:51 2
tinyML Talks: Oculi is putting the human eye in A.I.
3 years ago 00:28:21 2
tinyML Asia 2021 Justin Kao: A lightweight face detection method working with Himax Ultra-Low...
3 years ago 00:23:01 1
tinyML Asia 2021 Haochen Xie: An approach to dynamically integrate heterogenous AI components...
3 years ago 00:29:39 1
tinyML Asia 2021 Joshua Chang: Sensor Fusion using Machine Learning: Smart Forehead Temperature...
3 years ago 01:01:24 1
tinyML Talks: The Multilingual Spoken Words Corpus, a Massive Keyword Spotting Dataset
3 years ago 00:34:03 9
tinyML Talks Toronto Part 1: Evolutionary Needs of TinyML
3 years ago 00:17:35 1
tinyML Talks Toronto Part 2: tinyMLedu: widening access to tinyML education and resources
3 years ago 00:27:20 54
tinyML Talks Toronto Part 3: tinyML4STEM: using tinyML for Neuroscience in K12
3 years ago 01:06:51 1
tinyML Talks India: Single Lead ECG Classification On Wearable and Implantable Devices
3 years ago 00:26:53 4
tinyML Asia 2021 Yihong Wu: Lightweight visual localization with deep learning
3 years ago 00:56:12 6
tinyML Talks: CFU Playground: Customize Your ML Processor for Your Specific TinyML Model
3 years ago 00:49:23 5
tinyML Asia 2021 Chanwoo Kim: A review of on-device fully neural end-to-end speech recognition...
3 years ago 01:10:29 3
tinyML Talks: The Value of Edge AI for Industrial Applications: onsemi and SensiML IIoT Solutions
3 years ago 00:53:29 1
Pete Warden — Practical Applications of TinyML
3 years ago 01:00:43 2
tinyML Talks: AutoML + TinyML with Edge Impulse’s EON Tuner
3 years ago 00:56:06 10
tinyML Talks Morocco: Enabling Ultra-low Power Always-On Computer Vision at Qualcomm
3 years ago 01:01:21 3
tinyML Talks: Verification of ML-based AI systems and its applicability in Edge ML
3 years ago 01:01:20 14
tinyML Talks: A Practical Guide to Neural Network Quantization
3 years ago 00:16:49 4
EMEA 2021 tiny Talks: Building Heterogeneous TinyML Pipelines