참고자료:
TFDS에 이 데이터세트를 로드하려면 다음 명령어를 사용하세요.
ds = tfds.load('huggingface:wrbsc')
- 설명 :
WUT Relations Between Sentences Corpus contains 2827 pairs of related sentences.
Relationships are derived from Cross-document Structure Theory (CST), which enables multi-document summarization through identification of cross-document rhetorical relationships within a cluster of related documents.
Every relation was marked by at least 3 annotators.
- 라이센스 : Attribution-ShareAlike 3.0 Unported (CC BY-SA 3.0)
- 버전 : 1.1.0
- 분할 :
나뉘다 | 예 |
---|---|
'train' | 2827 |
- 특징 :
{
"sentence1": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"sentence2": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"relationship": {
"num_classes": 16,
"names": [
"Krzy\u017cowanie_si\u0119",
"T\u0142o_historyczne",
"\u0179r\u00f3d\u0142o",
"Dalsze_informacje",
"Zawieranie",
"Opis",
"Uszczeg\u00f3\u0142owienie",
"Parafraza",
"Spe\u0142nienie",
"Mowa_zale\u017cna",
"Zmiana_pogl\u0105du",
"Streszczenie",
"To\u017csamo\u015b\u0107",
"Sprzeczno\u015b\u0107",
"Modalno\u015b\u0107",
"Cytowanie"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
}
}