그잔

설명 :

데이터 세트는 각 30초 길이의 1000개 오디오 트랙으로 구성됩니다. 여기에는 각각 100개의 트랙으로 표시되는 10개의 장르가 포함되어 있습니다. 트랙은 모두 .wav 형식의 22050Hz 모노 16비트 오디오 파일입니다.

장르는 다음과 같습니다.

블루스
고전
국가
디스코
힙합
재즈
금속
팝
레게
바위
추가 문서 : 코드가 있는 논문에서 탐색
홈페이지 : http://marsyas.info/index.html
소스 코드 : tfds.audio.gtzan.GTZAN
버전 :
- 1.0.0 (기본값): 릴리스 정보가 없습니다.
다운로드 크기 : 1.14 GiB
데이터세트 크기 : 3.71 GiB
자동 캐시 ( 문서 ): 아니요
분할 :

나뉘다	예
`'train'`	1,000

기능 구조 :

FeaturesDict({
    'audio': Audio(shape=(None,), dtype=int64),
    'audio/filename': Text(shape=(), dtype=string),
    'label': ClassLabel(shape=(), dtype=int64, num_classes=10),
})

기능 문서 :

특징	수업	모양	D타입
	풍모Dict
오디오	오디오	(없음,)	int64
오디오/파일 이름	텍스트		끈
상표	클래스 레이블		int64

감독 키 ( as_supervised 문서 참조): ('audio', 'label')
그림 ( tfds.show_examples ): 지원되지 않습니다.
예 ( tfds.as_dataframe ):

인용 :

@misc{tzanetakis_essl_cook_2001,
author    = "Tzanetakis, George and Essl, Georg and Cook, Perry",
title     = "Automatic Musical Genre Classification Of Audio Signals",
url       = "http://ismir2001.ismir.net/pdf/tzanetakis.pdf",
publisher = "The International Society for Music Information Retrieval",
year      = "2001"
}