Datasets for Speech Recognition Analysis

July 1, 2020

In this post we share the top datasets for speech recognition. Speech emotion analysis is an important task that enables several application use cases. With smartphones now in widespread use, it has become viable to analyze speech commands captured by their microphones for emotion understanding using on-device machine learning models.

Google Audio Dataset
Urbansound Dataset
Spoken Digit Dataset
Bird Audio Detection
TensorFlow Speech Recognition
Emotion Based Speech Recognition in the Wild

1. Google Audio Dataset

Google Audio Dataset is a large-scale dataset of manually annotated audio events. It contains 2.1 million annotated videos, covering 5.8 thousand hours of audio and 527 classes of annotated sounds.
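To get a feel for these numbers, here is a small back-of-the-envelope sketch. It uses only the two figures quoted above and assumes the audio is spread evenly across the annotated videos, which may not hold exactly in practice.

```python
# Figures quoted in the paragraph above.
total_hours = 5.8e3   # 5.8 thousand hours of audio
num_clips = 2.1e6     # 2.1 million annotated videos

# Average audio duration per annotated video, in seconds
# (assumes the audio is spread evenly across all videos).
avg_seconds = total_hours * 3600 / num_clips
print(f"{avg_seconds:.1f} seconds per clip on average")
```

So each annotated clip works out to roughly ten seconds of audio on average.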

[Image: examples of the annotated audio classes]

Get Dataset

2. Urbansound Dataset

The Urbansound Dataset contains 1302 labeled sound recordings. Each recording is labeled with the start and end times of sound events from 10 classes: air_conditioner, car_horn, children_playing, dog_bark, drilling, engine_idling, gun_shot, jackhammer, siren, and street_music. Each recording may contain multiple sound events, but for each file only events from a single class are labeled.

The dataset is available in both CSV and JSON files. You can download it and grow your machine learning skills.
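Because every recording comes with start and end times for its events, a simple first step is to add up how many seconds of labeled sound a file contains. The sketch below assumes a hypothetical CSV layout with start, end and class columns; the real annotation files may use different columns, so check them before reusing this.

```python
import csv
import io
from collections import defaultdict

# Hypothetical annotation rows: start time (s), end time (s), class label.
# The real annotation files may use a different column layout.
SAMPLE_CSV = """start,end,class
0.50,3.25,dog_bark
4.00,4.75,dog_bark
10.00,12.50,dog_bark
"""

def labeled_seconds_per_class(csv_text):
    """Sum the duration of the labeled events for each class."""
    totals = defaultdict(float)
    for row in csv.DictReader(io.StringIO(csv_text)):
        totals[row["class"]] += float(row["end"]) - float(row["start"])
    return dict(totals)

totals = labeled_seconds_per_class(SAMPLE_CSV)
print(totals)
```

The sample uses a single class on purpose, since only one class is labeled per file.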

Get Dataset

3. Spoken Digit Dataset

This is a simple audio/speech dataset consisting of recordings of spoken digits in WAV files at 8 kHz. The recordings are trimmed so that they have near-minimal silence at the beginning and end. FSDD is an open dataset available to everyone. It contains 2000 recordings of English pronunciations from 4 speakers.

No. of recordings: 2000
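Before training on these recordings, it can help to confirm that a file really is sampled at 8 kHz and to read its label. The sketch below uses only Python's standard wave module. The '<digit>_<speaker>_<index>.wav' file naming and the helper names parse_name and wav_info are assumptions made for illustration, so adjust them to the files you actually download.

```python
import io
import wave

EXPECTED_RATE = 8000  # 8 kHz, as stated for this dataset

def parse_name(filename):
    """Split an assumed '<digit>_<speaker>_<index>.wav' name into its parts."""
    stem = filename.rsplit(".", 1)[0]
    digit, speaker, index = stem.split("_")
    return int(digit), speaker, int(index)

def wav_info(fileobj):
    """Return (sample_rate, duration_seconds) for a WAV path or file object."""
    with wave.open(fileobj, "rb") as wav:
        rate = wav.getframerate()
        return rate, wav.getnframes() / rate

# Build a half-second silent 8 kHz mono clip in memory so the sketch runs
# without downloading anything; a real recording would be opened by path.
buffer = io.BytesIO()
with wave.open(buffer, "wb") as out:
    out.setnchannels(1)
    out.setsampwidth(2)  # 16-bit samples
    out.setframerate(EXPECTED_RATE)
    out.writeframes(b"\x00\x00" * 4000)
buffer.seek(0)

rate, seconds = wav_info(buffer)
print(parse_name("7_speaker_12.wav"), rate, seconds)
```

A file whose rate differs from EXPECTED_RATE would need resampling before it is mixed with the rest of the data.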

Get Dataset

4. Bird Audio Detection

Detecting bird sounds in audio is an important task for automatic wildlife monitoring, as well as in citizen science and audio library management. You can download the dataset from the following sources:

freefield1010: data labels, audio files (5.8 GB zip, or via BitTorrent)
Warblr: data labels, audio files (4.3 GB zip, or via BitTorrent)

Get Dataset

5. TensorFlow Speech Recognition

Can you build an algorithm that understands simple speech commands? If yes, then this dataset is for you. Note: there are only 12 possible labels for the test set: yes, no, up, down, left, right, on, off, stop, go, silence, unknown. The unknown label should be used for a command that is not one of the first 10 labels and is not silence.
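The labeling rule in the note can be sketched in a few lines. The function name to_test_label and the example words are made up for illustration; only the 12 labels come from the note above.

```python
# The first 10 labels are the commands; the other two are special cases.
COMMANDS = ["yes", "no", "up", "down", "left", "right", "on", "off", "stop", "go"]
SILENCE = "silence"
UNKNOWN = "unknown"

def to_test_label(word):
    """Map a spoken word to one of the 12 possible test-set labels."""
    word = word.strip().lower()
    if word in COMMANDS or word == SILENCE:
        return word
    # Anything that is neither a command nor silence is "unknown".
    return UNKNOWN

labels = [to_test_label(w) for w in ["yes", "Stop", "silence", "bird"]]
print(labels)
```

Any word outside the 10 commands, such as "bird" here, falls into the unknown label.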

Get Dataset

6. Emotion Based Speech Recognition in the Wild

The EmoSpeech Dataset is India's first keyword-emotion dataset. Aimed at surveillance, it combines 3 datasets in one: 8K environment samples, 8K keyword samples, and 8K emotion samples.


Get Dataset

Thanks for reading, and please share this blog.