xtreme_xnli

설명 :

이 데이터 세트에는 각 XNLI 언어로의 MNLI 기계 번역이 포함되어 있습니다. 번역 데이터는 XTREME에서 제공합니다. 이것은 원본 XNLI 논문에서 제공하는 기계 번역 데이터와 다릅니다.

홈페이지 : https://www.nyu.edu/projects/bowman/xnli/
소스 코드 : tfds.text.xtreme_xnli.XtremeXnli
버전 :
- 1.1.0 (기본값): 릴리스 정보가 없습니다.
다운로드 크기 : 2.31 GiB
데이터세트 크기 : 1.59 GiB
자동 캐시 ( 문서 ): 아니요
분할 :

나뉘다	예
`'train'`	392,570

기능 구조 :

FeaturesDict({
    'hypothesis': TranslationVariableLanguages({
        'language': Text(shape=(), dtype=string),
        'translation': Text(shape=(), dtype=string),
    }),
    'label': ClassLabel(shape=(), dtype=int64, num_classes=3),
    'premise': Translation({
        'ar': Text(shape=(), dtype=string),
        'bg': Text(shape=(), dtype=string),
        'de': Text(shape=(), dtype=string),
        'el': Text(shape=(), dtype=string),
        'en': Text(shape=(), dtype=string),
        'es': Text(shape=(), dtype=string),
        'fr': Text(shape=(), dtype=string),
        'hi': Text(shape=(), dtype=string),
        'ru': Text(shape=(), dtype=string),
        'sw': Text(shape=(), dtype=string),
        'th': Text(shape=(), dtype=string),
        'tr': Text(shape=(), dtype=string),
        'ur': Text(shape=(), dtype=string),
        'vi': Text(shape=(), dtype=string),
        'zh': Text(shape=(), dtype=string),
    }),
})

기능 문서 :

특징	수업	D타입
	풍모Dict
가설	TranslationVariableLanguages
가설/언어	텍스트	끈
가설/번역	텍스트	끈
상표	클래스 레이블	int64
전제	번역
전제/ar	텍스트	끈
전제/배경	텍스트	끈
전제/드	텍스트	끈
전제/엘	텍스트	끈
전제/en	텍스트	끈
전제	텍스트	끈
전제/fr	텍스트	끈
전제/안녕	텍스트	끈
전제/루	텍스트	끈
전제/sw	텍스트	끈
전제	텍스트	끈
전제/tr	텍스트	끈
전제	텍스트	끈
전제/vi	텍스트	끈
전제/zh	텍스트	끈

감독된 키 ( as_supervised 문서 참조): None
그림 ( tfds.show_examples ): 지원되지 않습니다.
예 ( tfds.as_dataframe ):

인용 :

@article{hu2020xtreme,
      author    = {Junjie Hu and Sebastian Ruder and Aditya Siddhant and Graham Neubig and Orhan Firat and Melvin Johnson},
      title     = {XTREME: A Massively Multilingual Multi-task Benchmark for Evaluating Cross-lingual Generalization},
      journal   = {CoRR},
      volume    = {abs/2003.11080},
      year      = {2020},
      archivePrefix = {arXiv},
      eprint    = {2003.11080}
}