مراجع:
استخدم الأمر التالي لتحميل مجموعة البيانات هذه في TFDS:
ds = tfds.load('huggingface:wrbsc')
- وصف :
WUT Relations Between Sentences Corpus contains 2827 pairs of related sentences.
Relationships are derived from Cross-document Structure Theory (CST), which enables multi-document summarization through identification of cross-document rhetorical relationships within a cluster of related documents.
Every relation was marked by at least 3 annotators.
- الترخيص : Attribution-ShareAlike 3.0 غير مستورد (CC BY-SA 3.0)
- الإصدار : 1.1.0
- الإنشقاقات :
ينقسم | أمثلة |
---|---|
'train' | 2827 |
- سمات :
{
"sentence1": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"sentence2": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"relationship": {
"num_classes": 16,
"names": [
"Krzy\u017cowanie_si\u0119",
"T\u0142o_historyczne",
"\u0179r\u00f3d\u0142o",
"Dalsze_informacje",
"Zawieranie",
"Opis",
"Uszczeg\u00f3\u0142owienie",
"Parafraza",
"Spe\u0142nienie",
"Mowa_zale\u017cna",
"Zmiana_pogl\u0105du",
"Streszczenie",
"To\u017csamo\u015b\u0107",
"Sprzeczno\u015b\u0107",
"Modalno\u015b\u0107",
"Cytowanie"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
}
}