카스크

설명 :

QASC는 문장 구성에 중점을 둔 질의 응답 데이터 세트입니다. 그것은 초등학교 과학에 관한 9,980개의 8방향 객관식 질문(8,134 훈련, 926 개발, 920 시험)으로 구성되어 있으며 17M 문장의 말뭉치와 함께 제공됩니다.

추가 문서 : 코드가 있는 논문에서 탐색
홈페이지 : https://allenai.org/data/qasc
소스 코드 : tfds.datasets.qasc.Builder
버전 :
- 0.1.0 (기본값): 릴리스 정보가 없습니다.
다운로드 크기 : 1.54 MiB
데이터 세트 크기 : 6.61 MiB
자동 캐시 ( 문서 ): 예
분할 :

나뉘다	예
`'test'`	920
`'train'`	8,134
`'validation'`	926

기능 구조 :

FeaturesDict({
    'answerKey': Text(shape=(), dtype=string),
    'choices': Sequence({
        'label': Text(shape=(), dtype=string),
        'text': Text(shape=(), dtype=string),
    }),
    'combinedfact': Text(shape=(), dtype=string),
    'fact1': Text(shape=(), dtype=string),
    'fact2': Text(shape=(), dtype=string),
    'formatted_question': Text(shape=(), dtype=string),
    'id': Text(shape=(), dtype=string),
    'question': Text(shape=(), dtype=string),
})

기능 문서 :

특징	수업	D타입
	풍모Dict
답변키	텍스트	끈
선택	순서
선택/라벨	텍스트	끈
선택/텍스트	텍스트	끈
결합 사실	텍스트	끈
사실1	텍스트	끈
사실2	텍스트	끈
formatted_question	텍스트	끈
ID	텍스트	끈
문제	텍스트	끈

감독된 키 ( as_supervised 문서 참조): None
그림 ( tfds.show_examples ): 지원되지 않습니다.
예 ( tfds.as_dataframe ):

인용 :

@article{allenai:qasc,
      author    = {Tushar Khot and Peter Clark and Michal Guerquin and Peter Jansen and Ashish Sabharwal},
      title     = {QASC: A Dataset for Question Answering via Sentence Composition},
      journal   = {arXiv:1910.11473v2},
      year      = {2020},
}