quac

説明:

Question Answering in Context は、モデリング、理解、情報探索ダイアログへの参加のためのデータセットです。データインスタンスは、2 人のクラウドワーカー間のインタラクティブな対話で構成されます。(1) 隠されたウィキペディアのテキストについて可能な限り学ぶために一連の自由形式の質問を投げかける学生と、(2) 短い抜粋を提供して質問に答える教師。（スパン）テキストから。 QuAC は、既存の機械理解データセットには見られない課題をもたらします。QuAC の質問は、多くの場合、より自由で、答えられないか、対話コンテキスト内でのみ意味があります。

追加のドキュメント:コードを使用したペーパーの探索
ホームページ： https ://quac.ai/
ソースコード: tfds.datasets.quac.Builder
バージョン:
- 1.0.0 (デフォルト): 初期リリース。
ダウンロードサイズ: 73.47 MiB
データセットサイズ: 298.04 MiB
自動キャッシュ(ドキュメント): いいえ
スプリット:

スプリット	例
`'train'`	83,568
`'validation'`	7,354

機能構造:

FeaturesDict({
    'answers': Sequence({
        'answer_start': int32,
        'text': Text(shape=(), dtype=string),
    }),
    'background': Text(shape=(), dtype=string),
    'context': Text(shape=(), dtype=string),
    'followup': Text(shape=(), dtype=string),
    'orig_answer': FeaturesDict({
        'answer_start': int32,
        'text': Text(shape=(), dtype=string),
    }),
    'question': Text(shape=(), dtype=string),
    'section_title': Text(shape=(), dtype=string),
    'title': Text(shape=(), dtype=string),
    'yesno': Text(shape=(), dtype=string),
})

機能のドキュメント:

特徴	クラス	Dtype
	特徴辞書
答え	順序
回答/answer_start	テンソル	int32
回答/テキスト	文章	ストリング
バックグラウンド	文章	ストリング
コンテクスト	文章	ストリング
ファローアップ	文章	ストリング
orig_answer	特徴辞書
orig_answer/answer_start	テンソル	int32
orig_answer/text	文章	ストリング
質問	文章	ストリング
section_title	文章	ストリング
題名	文章	ストリング
はい・いいえ	文章	ストリング

監視されたキー( as_supervised docを参照): ('context', 'answers')
図( tfds.show_examples ): サポートされていません。
例( tfds.as_dataframe ):

引用：

@article{choi2018quac,
  title={Quac: Question answering in context},
  author={Choi, Eunsol and He, He and Iyyer, Mohit and Yatskar, Mark and Yih, Wen-tau and Choi, Yejin and Liang, Percy and Zettlemoyer, Luke},
  journal={arXiv preprint arXiv:1808.07036},
  year={2018}
}