q_re_cc

説明：

81,000 の質問と回答のペアを含む 14,000 の会話を含むデータセット。 QReCC は、TREC CAsT、QuAC、Google Natural question からの質問に基づいて構築されています。

ホームページ: https://github.com/apple/ml-qrecc
ソースコード: tfds.text.qrecc.QReCC
バージョン:
- 1.0.0 (デフォルト): 初期リリース。
ダウンロードサイズ: 7.60 MiB
データセットのサイズ: 69.29 MiB
自動キャッシュ(ドキュメント): はい
分割:

スプリット	例
`'test'`	16,451
`'train'`	63,501

機能の構造:

FeaturesDict({
    'answer': Text(shape=(), dtype=string),
    'answer_url': Text(shape=(), dtype=string),
    'context': Sequence(Text(shape=(), dtype=string)),
    'conversation_id': Scalar(shape=(), dtype=int32, description=The id of the conversation.),
    'question': Text(shape=(), dtype=string),
    'question_rewrite': Text(shape=(), dtype=string),
    'source': Text(shape=(), dtype=string),
    'turn_id': Scalar(shape=(), dtype=int32, description=The id of the conversation turn, within a conversation.),
})

機能ドキュメント:

特徴	クラス	形	Dタイプ	説明
	特徴辞書
答え	文章		弦
答え_url	文章		弦
コンテクスト	シーケンス(テキスト)	（なし、）	弦
会話ID	スカラー		int32	会話の ID。
質問	文章		弦
質問_リライト	文章		弦
ソース	文章		弦	データの元のソース -- QuAC、CAsT、または Natural question のいずれか
ターンID	スカラー		int32	会話内の会話ターンの ID。

監視キー( as_supervised docを参照): None
図( tfds.show_examples ): サポートされていません。
例( tfds.as_dataframe ):

引用：

@article{qrecc,
  title={Open-Domain Question Answering Goes Conversational via Question Rewriting},
  author={Anantha, Raviteja and Vakulenko, Svitlana and Tu, Zhucheng and Longpre, Shayne and Pulman, Stephen and Chappidi, Srinivas},
  journal={Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies},
  year={2021}
}

q_re_cc コレクションでコンテンツを整理 必要に応じて、コンテンツの保存と分類を行います。

q_re_cc