References:
Use the following command to load this dataset in TFDS:
import tensorflow_datasets as tfds
ds = tfds.load('huggingface:cuad')
- Description:
Contract Understanding Atticus Dataset (CUAD) v1 is a corpus of more than 13,000 labels in 510
commercial legal contracts that have been manually labeled to identify 41 categories of important
clauses that lawyers look for when reviewing contracts in connection with corporate transactions.
- License: CUAD is licensed under the Creative Commons Attribution 4.0 (CC BY 4.0) license.
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'test' | 4182 |
'train' | 22450 |
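
As a rough illustration of how the split sizes above relate to each other, the sketch below computes the total number of examples and the share held by each split. The `SPLIT_SIZES` dictionary and the `load_split` helper are hypothetical names introduced here; `load_split` simply forwards the split name to `tfds.load` through its `split` argument and is not called in the sketch, since calling it needs `tensorflow_datasets` installed and network access.

```python
# Split sizes copied from the table above.
SPLIT_SIZES = {"test": 4182, "train": 22450}

# Total number of examples across both splits.
total_examples = sum(SPLIT_SIZES.values())

# Fraction of all examples that falls into each split.
split_fractions = {name: n / total_examples for name, n in SPLIT_SIZES.items()}


def load_split(name):
    """Hypothetical helper: load one named split of the dataset via TFDS."""
    if name not in SPLIT_SIZES:
        raise ValueError(f"unknown split {name!r}; expected one of {sorted(SPLIT_SIZES)}")
    # Imported lazily so the arithmetic above runs without TFDS installed.
    import tensorflow_datasets as tfds
    return tfds.load("huggingface:cuad", split=name)


for name, n in SPLIT_SIZES.items():
    print(f"{name}: {n} examples ({split_fractions[name]:.1%} of {total_examples})")
```

Roughly five out of six examples sit in 'train', which is worth keeping in mind when estimating evaluation time on 'test'.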
- Features:
{
    "id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "title": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "context": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "question": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "answers": {
        "feature": {
            "text": {
                "dtype": "string",
                "id": null,
                "_type": "Value"
            },
            "answer_start": {
                "dtype": "int32",
                "id": null,
                "_type": "Value"
            }
        },
        "length": -1,
        "id": null,
        "_type": "Sequence"
    }
}
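
The feature schema above pairs each answer `text` with an `answer_start` offset, and the sequence length of -1 indicates a variable number of answers per example. The following minimal sketch shows one way such a record might be checked. It assumes that `answer_start` is a character offset into `context`, that `answers` arrives as a dictionary of parallel lists, and that string fields have already been decoded to Python `str` (TFDS may return them as bytes). The record, its contents, and the helper names `answer_spans` and `spans_match_context` are hypothetical and are not taken from the dataset.

```python
from typing import List, Tuple


def answer_spans(example: dict) -> List[Tuple[int, int, str]]:
    """Return (start, end, text) for each answer in a schema-shaped record."""
    answers = example["answers"]
    return [
        (start, start + len(text), text)
        for text, start in zip(answers["text"], answers["answer_start"])
    ]


def spans_match_context(example: dict) -> bool:
    """Check that every answer text appears in the context at its offset."""
    context = example["context"]
    return all(context[start:end] == text for start, end, text in answer_spans(example))


# Hypothetical record that follows the feature layout above.
context = "This Agreement is governed by the laws of Delaware."
answer = "the laws of Delaware"
example = {
    "id": "hypothetical-0",
    "title": "HYPOTHETICAL AGREEMENT",
    "context": context,
    "question": "Which law governs the contract?",
    "answers": {
        "text": [answer],
        # Offset computed from the context rather than hard-coded.
        "answer_start": [context.index(answer)],
    },
}

print(answer_spans(example))
print(spans_match_context(example))
```

A record with empty `text` and `answer_start` lists passes the check trivially, which suits a variable-length sequence that may contain no answers.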