AudioSet consists of an expanding ontology of 632 audio event classes and a collection of 2,084,320 human-labeled 10-second sound clips drawn from YouTube videos. The ontology is specified as a hierarchical graph of event categories, covering a wide range of human and animal sounds, musical instruments and genres, and common everyday environmental sounds.AudioSet consists of an expanding ontology of 632 audio event classes and a collection of 2,084,320 human-labeled 10-second sound clips drawn from YouTube videos. The ontology is specified as a hierarchical graph of event categories, covering a wide range of human and animal sounds, musical instruments and genres, and common everyday environmental sounds.
Data Type
Multivariate
Default Task
Relational-Learning
Attribute Type
Real
Published Year
2017
Area of Dataset
Human Sound
Missing Values
No
No. of Instances
2084320
No. of Attributes
527
| Data Type | Multivariate | Default Task | Relational-Learning |
|---|---|---|---|
| Attribute Type | Real | Published Year | 2017 |
| Area of Dataset | Human Sound | Missing Values | No |
| No. of Instances | 2084320 | No. of Attributes | 527 |
Sign in to join the discussion and post comments.
Sign in