References:
subtask-1
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:humicroedit/subtask-1')
- Description:
This new dataset is designed to assess the funniness of edited news headlines.
- License: No known license
- Version: 0.0.0
- Splits:
Split | Examples |
---|---|
'funlines' | 8248 |
'test' | 3024 |
'train' | 9652 |
'validation' | 2419 |
- Features:
{
"id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"original": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"edit": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"grades": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"meanGrade": {
"dtype": "float32",
"id": null,
"_type": "Value"
}
}
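The `grades` feature above is typed as a string, while `meanGrade` is a float. The following is a minimal sketch of how one might relate the two, assuming (this page does not state it) that `grades` holds one digit per annotator rating and that `meanGrade` is the average of those digits. The helper names `parse_grades` and `mean_grade` and the sample value `"32100"` are hypothetical and used only for illustration.

```python
from typing import List, Union


def parse_grades(grades: Union[str, bytes]) -> List[int]:
    """Split a grades string into one integer per character.

    Assumption: each character is a single-digit rating from one annotator.
    Bytes input is decoded first, since string features may arrive as bytes.
    """
    if isinstance(grades, bytes):
        grades = grades.decode("utf-8")
    return [int(ch) for ch in grades.strip()]


def mean_grade(grades: Union[str, bytes]) -> float:
    """Average the individual ratings parsed from a grades string."""
    values = parse_grades(grades)
    if not values:
        raise ValueError("grades string is empty")
    return sum(values) / len(values)


# Hypothetical sample value, not taken from the dataset.
sample = "32100"
print(parse_grades(sample))
print(mean_grade(sample))
```

If the assumption holds, comparing `mean_grade(record["grades"])` against `record["meanGrade"]` for a few records is a quick way to confirm it before relying on the parsed values.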
subtask-2
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:humicroedit/subtask-2')
- Description:
This new dataset is designed to assess the funniness of edited news headlines.
- License: No known license
- Version: 0.0.0
- Splits:
Split | Examples |
---|---|
'funlines' | 1958 |
'test' | 2960 |
'train' | 9381 |
'validation' | 2355 |
- Features:
{
"id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"original1": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"edit1": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"grades1": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"meanGrade1": {
"dtype": "float32",
"id": null,
"_type": "Value"
},
"original2": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"edit2": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"grades2": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"meanGrade2": {
"dtype": "float32",
"id": null,
"_type": "Value"
},
"label": {
"num_classes": 3,
"names": [
"equal",
"sentence1",
"sentence2"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
}
}
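The `label` feature above is a `ClassLabel` with three names, so records carry an integer index into that list rather than the name itself. The sketch below maps an index back to its name using the order given in the feature spec. The second helper, `label_from_means`, rests on an assumption this page does not confirm: that the label marks which of the two edited headlines has the higher mean grade, with `equal` for a tie. Both function names are hypothetical.

```python
from typing import List

# Order copied from the "names" list in the feature spec above.
LABEL_NAMES: List[str] = ["equal", "sentence1", "sentence2"]


def label_name(index: int) -> str:
    """Return the class name for an integer label index."""
    if not 0 <= index < len(LABEL_NAMES):
        raise ValueError(f"label index out of range: {index}")
    return LABEL_NAMES[index]


def label_from_means(mean_grade1: float, mean_grade2: float) -> int:
    """Derive a label index from the two mean grades.

    Assumption: the label points at the headline with the higher mean grade.
    """
    if mean_grade1 == mean_grade2:
        return LABEL_NAMES.index("equal")
    if mean_grade1 > mean_grade2:
        return LABEL_NAMES.index("sentence1")
    return LABEL_NAMES.index("sentence2")


# Hypothetical mean grades, not taken from the dataset.
print(label_name(label_from_means(1.2, 0.4)))
```

Checking `label_from_means(record["meanGrade1"], record["meanGrade2"])` against `record["label"]` on a sample of records would show whether the assumed rule matches the data.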