References:
Catalan
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:catalonia_independence/catalan')
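As a slightly fuller sketch of the command above, the snippet below wraps the call in two small helpers. It assumes `tensorflow_datasets` is installed and that the `huggingface:` namespace can be resolved in your environment (this typically also needs the Hugging Face `datasets` package and network access). The helper names `dataset_name` and `load_split` are illustrative and not part of TFDS.

```python
def dataset_name(config: str) -> str:
    """Build the TFDS name for one configuration of this dataset."""
    return f"huggingface:catalonia_independence/{config}"


def load_split(config: str = "catalan", split: str = "train"):
    """Load a single split of the chosen configuration.

    TFDS is imported lazily so that dataset_name() stays dependency-free.
    """
    import tensorflow_datasets as tfds  # assumed to be installed

    return tfds.load(dataset_name(config), split=split)


# Usage (needs the packages above and network access):
# ds = load_split("catalan", "train")
# for example in ds.take(1):
#     print(example["TWEET"], example["LABEL"])
```

Passing `split=` returns a single `tf.data.Dataset` rather than a dictionary keyed by split name, which is usually more convenient for a quick inspection.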
- Description:
This dataset contains two corpora in Spanish and Catalan that consist of annotated Twitter messages for automatic stance detection. The data was collected over 12 days during February and March of 2019 from tweets posted in Barcelona, and during September of 2018 from tweets posted in the town of Terrassa, Catalonia.
Each corpus is annotated with three classes: AGAINST, FAVOR and NEUTRAL, which express the stance towards the target - independence of Catalonia.
- License: CC BY-NC-SA 4.0
- Version: 1.1.0
- Splits:
Split | Examples |
---|---|
'test' | 2010 |
'train' | 6028 |
'validation' | 2010 |
- Features:
{
  "id_str": {
    "dtype": "string",
    "id": null,
    "_type": "Value"
  },
  "TWEET": {
    "dtype": "string",
    "id": null,
    "_type": "Value"
  },
  "LABEL": {
    "num_classes": 3,
    "names": [
      "AGAINST",
      "FAVOR",
      "NEUTRAL"
    ],
    "names_file": null,
    "id": null,
    "_type": "ClassLabel"
  }
}
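The `LABEL` feature is a `ClassLabel`, so examples can be expected to carry an integer class id rather than the stance name. The sketch below converts between the two, assuming ids follow the order of the `names` list shown above. `LABEL_NAMES`, `label_to_name` and `name_to_label` are hypothetical helpers written for illustration.

```python
# Class names copied from the "LABEL" feature, in index order.
LABEL_NAMES = ["AGAINST", "FAVOR", "NEUTRAL"]


def label_to_name(index: int) -> str:
    """Map an integer class id to its stance name."""
    if not 0 <= index < len(LABEL_NAMES):
        raise ValueError(f"label index out of range: {index}")
    return LABEL_NAMES[index]


def name_to_label(name: str) -> int:
    """Map a stance name back to its integer class id."""
    return LABEL_NAMES.index(name)
```

When iterating over a loaded dataset, the label value may arrive as a tensor or NumPy scalar, so wrapping it in `int(...)` before calling `label_to_name` is a reasonable precaution.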
Spanish
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:catalonia_independence/spanish')
- Description:
This dataset contains two corpora in Spanish and Catalan that consist of annotated Twitter messages for automatic stance detection. The data was collected over 12 days during February and March of 2019 from tweets posted in Barcelona, and during September of 2018 from tweets posted in the town of Terrassa, Catalonia.
Each corpus is annotated with three classes: AGAINST, FAVOR and NEUTRAL, which express the stance towards the target - independence of Catalonia.
- License: CC BY-NC-SA 4.0
- Version: 1.1.0
- Splits:
Split | Examples |
---|---|
'test' | 2016 |
'train' | 6046 |
'validation' | 2015 |
- Features:
{
  "id_str": {
    "dtype": "string",
    "id": null,
    "_type": "Value"
  },
  "TWEET": {
    "dtype": "string",
    "id": null,
    "_type": "Value"
  },
  "LABEL": {
    "num_classes": 3,
    "names": [
      "AGAINST",
      "FAVOR",
      "NEUTRAL"
    ],
    "names_file": null,
    "id": null,
    "_type": "ClassLabel"
  }
}
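To put the split tables for the two configurations side by side, the following sketch computes the total number of examples and the share of each split. The counts are copied from the tables above. The `SPLITS` dictionary and the `split_fractions` helper are illustrative names, not part of the dataset or of TFDS.

```python
# Split sizes copied from the tables for each configuration.
SPLITS = {
    "catalan": {"train": 6028, "validation": 2010, "test": 2010},
    "spanish": {"train": 6046, "validation": 2015, "test": 2016},
}


def split_fractions(sizes: dict) -> dict:
    """Return each split's share of the total number of examples."""
    total = sum(sizes.values())
    return {name: count / total for name, count in sizes.items()}


for config, sizes in SPLITS.items():
    total = sum(sizes.values())
    summary = ", ".join(
        f"{name} {fraction:.1%}"
        for name, fraction in split_fractions(sizes).items()
    )
    print(f"{config}: {total} examples ({summary})")
```

Both configurations come out at roughly a 60/20/20 train/validation/test division.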