তথ্যসূত্র:
cross_topic_1
TFDS এ এই ডেটাসেট লোড করতে নিম্নলিখিত কমান্ডটি ব্যবহার করুন:
ds = tfds.load('huggingface:guardian_authorship/cross_topic_1')
- বর্ণনা :
A dataset cross-topic authorship attribution. The dataset is provided by Stamatatos 2013.
1- The cross-topic scenarios are based on Table-4 in Stamatatos 2017 (Ex. cross_topic_1 => row 1:P S U&W ).
2- The cross-genre scenarios are based on Table-5 in the same paper. (Ex. cross_genre_1 => row 1:B P S&U&W).
3- The same-topic/genre scenario is created by grouping all the datasts as follows.
For ex., to use same_topic and split the data 60-40 use:
train_ds = load_dataset('guardian_authorship', name="cross_topic_<<#>>",
split='train[:60%]+validation[:60%]+test[:60%]')
tests_ds = load_dataset('guardian_authorship', name="cross_topic_<<#>>",
split='train[-40%:]+validation[-40%:]+test[-40%:]')
Important: train+validation+test[:60%] will generate the wrong splits becasue the data is imbalanced
* See https://huggingface.co/docs/datasets/splits.html for detailed/more examples
- লাইসেন্স : কোনো পরিচিত লাইসেন্স নেই
- সংস্করণ : 1.0.0
- বিভাজন :
বিভক্ত | উদাহরণ |
---|---|
'test' | 207 |
'train' | 112 |
'validation' | 62 |
- বৈশিষ্ট্য :
{
"author": {
"num_classes": 13,
"names": [
"catherinebennett",
"georgemonbiot",
"hugoyoung",
"jonathanfreedland",
"martinkettle",
"maryriddell",
"nickcohen",
"peterpreston",
"pollytoynbee",
"royhattersley",
"simonhoggart",
"willhutton",
"zoewilliams"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
},
"topic": {
"num_classes": 5,
"names": [
"Politics",
"Society",
"UK",
"World",
"Books"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
},
"article": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
cross_genre_1
TFDS এ এই ডেটাসেট লোড করতে নিম্নলিখিত কমান্ডটি ব্যবহার করুন:
ds = tfds.load('huggingface:guardian_authorship/cross_genre_1')
- বর্ণনা :
A dataset cross-topic authorship attribution. The dataset is provided by Stamatatos 2013.
1- The cross-topic scenarios are based on Table-4 in Stamatatos 2017 (Ex. cross_topic_1 => row 1:P S U&W ).
2- The cross-genre scenarios are based on Table-5 in the same paper. (Ex. cross_genre_1 => row 1:B P S&U&W).
3- The same-topic/genre scenario is created by grouping all the datasts as follows.
For ex., to use same_topic and split the data 60-40 use:
train_ds = load_dataset('guardian_authorship', name="cross_topic_<<#>>",
split='train[:60%]+validation[:60%]+test[:60%]')
tests_ds = load_dataset('guardian_authorship', name="cross_topic_<<#>>",
split='train[-40%:]+validation[-40%:]+test[-40%:]')
Important: train+validation+test[:60%] will generate the wrong splits becasue the data is imbalanced
* See https://huggingface.co/docs/datasets/splits.html for detailed/more examples
- লাইসেন্স : কোনো পরিচিত লাইসেন্স নেই
- সংস্করণ : 13.0.0
- বিভাজন :
বিভক্ত | উদাহরণ |
---|---|
'test' | 269 |
'train' | 63 |
'validation' | 112 |
- বৈশিষ্ট্য :
{
"author": {
"num_classes": 13,
"names": [
"catherinebennett",
"georgemonbiot",
"hugoyoung",
"jonathanfreedland",
"martinkettle",
"maryriddell",
"nickcohen",
"peterpreston",
"pollytoynbee",
"royhattersley",
"simonhoggart",
"willhutton",
"zoewilliams"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
},
"topic": {
"num_classes": 5,
"names": [
"Politics",
"Society",
"UK",
"World",
"Books"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
},
"article": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
cross_topic_2
TFDS এ এই ডেটাসেট লোড করতে নিম্নলিখিত কমান্ডটি ব্যবহার করুন:
ds = tfds.load('huggingface:guardian_authorship/cross_topic_2')
- বর্ণনা :
A dataset cross-topic authorship attribution. The dataset is provided by Stamatatos 2013.
1- The cross-topic scenarios are based on Table-4 in Stamatatos 2017 (Ex. cross_topic_1 => row 1:P S U&W ).
2- The cross-genre scenarios are based on Table-5 in the same paper. (Ex. cross_genre_1 => row 1:B P S&U&W).
3- The same-topic/genre scenario is created by grouping all the datasts as follows.
For ex., to use same_topic and split the data 60-40 use:
train_ds = load_dataset('guardian_authorship', name="cross_topic_<<#>>",
split='train[:60%]+validation[:60%]+test[:60%]')
tests_ds = load_dataset('guardian_authorship', name="cross_topic_<<#>>",
split='train[-40%:]+validation[-40%:]+test[-40%:]')
Important: train+validation+test[:60%] will generate the wrong splits becasue the data is imbalanced
* See https://huggingface.co/docs/datasets/splits.html for detailed/more examples
- লাইসেন্স : কোনো পরিচিত লাইসেন্স নেই
- সংস্করণ : 2.0.0
- বিভাজন :
বিভক্ত | উদাহরণ |
---|---|
'test' | 179 |
'train' | 112 |
'validation' | 90 |
- বৈশিষ্ট্য :
{
"author": {
"num_classes": 13,
"names": [
"catherinebennett",
"georgemonbiot",
"hugoyoung",
"jonathanfreedland",
"martinkettle",
"maryriddell",
"nickcohen",
"peterpreston",
"pollytoynbee",
"royhattersley",
"simonhoggart",
"willhutton",
"zoewilliams"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
},
"topic": {
"num_classes": 5,
"names": [
"Politics",
"Society",
"UK",
"World",
"Books"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
},
"article": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
cross_topic_3
TFDS এ এই ডেটাসেট লোড করতে নিম্নলিখিত কমান্ডটি ব্যবহার করুন:
ds = tfds.load('huggingface:guardian_authorship/cross_topic_3')
- বর্ণনা :
A dataset cross-topic authorship attribution. The dataset is provided by Stamatatos 2013.
1- The cross-topic scenarios are based on Table-4 in Stamatatos 2017 (Ex. cross_topic_1 => row 1:P S U&W ).
2- The cross-genre scenarios are based on Table-5 in the same paper. (Ex. cross_genre_1 => row 1:B P S&U&W).
3- The same-topic/genre scenario is created by grouping all the datasts as follows.
For ex., to use same_topic and split the data 60-40 use:
train_ds = load_dataset('guardian_authorship', name="cross_topic_<<#>>",
split='train[:60%]+validation[:60%]+test[:60%]')
tests_ds = load_dataset('guardian_authorship', name="cross_topic_<<#>>",
split='train[-40%:]+validation[-40%:]+test[-40%:]')
Important: train+validation+test[:60%] will generate the wrong splits becasue the data is imbalanced
* See https://huggingface.co/docs/datasets/splits.html for detailed/more examples
- লাইসেন্স : কোনো পরিচিত লাইসেন্স নেই
- সংস্করণ : 3.0.0
- বিভাজন :
বিভক্ত | উদাহরণ |
---|---|
'test' | 152 |
'train' | 112 |
'validation' | 117 |
- বৈশিষ্ট্য :
{
"author": {
"num_classes": 13,
"names": [
"catherinebennett",
"georgemonbiot",
"hugoyoung",
"jonathanfreedland",
"martinkettle",
"maryriddell",
"nickcohen",
"peterpreston",
"pollytoynbee",
"royhattersley",
"simonhoggart",
"willhutton",
"zoewilliams"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
},
"topic": {
"num_classes": 5,
"names": [
"Politics",
"Society",
"UK",
"World",
"Books"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
},
"article": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
cross_topic_4
TFDS এ এই ডেটাসেট লোড করতে নিম্নলিখিত কমান্ডটি ব্যবহার করুন:
ds = tfds.load('huggingface:guardian_authorship/cross_topic_4')
- বর্ণনা :
A dataset cross-topic authorship attribution. The dataset is provided by Stamatatos 2013.
1- The cross-topic scenarios are based on Table-4 in Stamatatos 2017 (Ex. cross_topic_1 => row 1:P S U&W ).
2- The cross-genre scenarios are based on Table-5 in the same paper. (Ex. cross_genre_1 => row 1:B P S&U&W).
3- The same-topic/genre scenario is created by grouping all the datasts as follows.
For ex., to use same_topic and split the data 60-40 use:
train_ds = load_dataset('guardian_authorship', name="cross_topic_<<#>>",
split='train[:60%]+validation[:60%]+test[:60%]')
tests_ds = load_dataset('guardian_authorship', name="cross_topic_<<#>>",
split='train[-40%:]+validation[-40%:]+test[-40%:]')
Important: train+validation+test[:60%] will generate the wrong splits becasue the data is imbalanced
* See https://huggingface.co/docs/datasets/splits.html for detailed/more examples
- লাইসেন্স : কোনো পরিচিত লাইসেন্স নেই
- সংস্করণ : 4.0.0
- বিভাজন :
বিভক্ত | উদাহরণ |
---|---|
'test' | 207 |
'train' | 62 |
'validation' | 112 |
- বৈশিষ্ট্য :
{
"author": {
"num_classes": 13,
"names": [
"catherinebennett",
"georgemonbiot",
"hugoyoung",
"jonathanfreedland",
"martinkettle",
"maryriddell",
"nickcohen",
"peterpreston",
"pollytoynbee",
"royhattersley",
"simonhoggart",
"willhutton",
"zoewilliams"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
},
"topic": {
"num_classes": 5,
"names": [
"Politics",
"Society",
"UK",
"World",
"Books"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
},
"article": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
cross_topic_5
TFDS এ এই ডেটাসেট লোড করতে নিম্নলিখিত কমান্ডটি ব্যবহার করুন:
ds = tfds.load('huggingface:guardian_authorship/cross_topic_5')
- বর্ণনা :
A dataset cross-topic authorship attribution. The dataset is provided by Stamatatos 2013.
1- The cross-topic scenarios are based on Table-4 in Stamatatos 2017 (Ex. cross_topic_1 => row 1:P S U&W ).
2- The cross-genre scenarios are based on Table-5 in the same paper. (Ex. cross_genre_1 => row 1:B P S&U&W).
3- The same-topic/genre scenario is created by grouping all the datasts as follows.
For ex., to use same_topic and split the data 60-40 use:
train_ds = load_dataset('guardian_authorship', name="cross_topic_<<#>>",
split='train[:60%]+validation[:60%]+test[:60%]')
tests_ds = load_dataset('guardian_authorship', name="cross_topic_<<#>>",
split='train[-40%:]+validation[-40%:]+test[-40%:]')
Important: train+validation+test[:60%] will generate the wrong splits becasue the data is imbalanced
* See https://huggingface.co/docs/datasets/splits.html for detailed/more examples
- লাইসেন্স : কোনো পরিচিত লাইসেন্স নেই
- সংস্করণ : 5.0.0
- বিভাজন :
বিভক্ত | উদাহরণ |
---|---|
'test' | 229 |
'train' | 62 |
'validation' | 90 |
- বৈশিষ্ট্য :
{
"author": {
"num_classes": 13,
"names": [
"catherinebennett",
"georgemonbiot",
"hugoyoung",
"jonathanfreedland",
"martinkettle",
"maryriddell",
"nickcohen",
"peterpreston",
"pollytoynbee",
"royhattersley",
"simonhoggart",
"willhutton",
"zoewilliams"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
},
"topic": {
"num_classes": 5,
"names": [
"Politics",
"Society",
"UK",
"World",
"Books"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
},
"article": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
cross_topic_6
TFDS এ এই ডেটাসেট লোড করতে নিম্নলিখিত কমান্ডটি ব্যবহার করুন:
ds = tfds.load('huggingface:guardian_authorship/cross_topic_6')
- বর্ণনা :
A dataset cross-topic authorship attribution. The dataset is provided by Stamatatos 2013.
1- The cross-topic scenarios are based on Table-4 in Stamatatos 2017 (Ex. cross_topic_1 => row 1:P S U&W ).
2- The cross-genre scenarios are based on Table-5 in the same paper. (Ex. cross_genre_1 => row 1:B P S&U&W).
3- The same-topic/genre scenario is created by grouping all the datasts as follows.
For ex., to use same_topic and split the data 60-40 use:
train_ds = load_dataset('guardian_authorship', name="cross_topic_<<#>>",
split='train[:60%]+validation[:60%]+test[:60%]')
tests_ds = load_dataset('guardian_authorship', name="cross_topic_<<#>>",
split='train[-40%:]+validation[-40%:]+test[-40%:]')
Important: train+validation+test[:60%] will generate the wrong splits becasue the data is imbalanced
* See https://huggingface.co/docs/datasets/splits.html for detailed/more examples
- লাইসেন্স : কোনো পরিচিত লাইসেন্স নেই
- সংস্করণ : 6.0.0
- বিভাজন :
বিভক্ত | উদাহরণ |
---|---|
'test' | 202 |
'train' | 62 |
'validation' | 117 |
- বৈশিষ্ট্য :
{
"author": {
"num_classes": 13,
"names": [
"catherinebennett",
"georgemonbiot",
"hugoyoung",
"jonathanfreedland",
"martinkettle",
"maryriddell",
"nickcohen",
"peterpreston",
"pollytoynbee",
"royhattersley",
"simonhoggart",
"willhutton",
"zoewilliams"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
},
"topic": {
"num_classes": 5,
"names": [
"Politics",
"Society",
"UK",
"World",
"Books"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
},
"article": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
cross_topic_7
TFDS এ এই ডেটাসেট লোড করতে নিম্নলিখিত কমান্ডটি ব্যবহার করুন:
ds = tfds.load('huggingface:guardian_authorship/cross_topic_7')
- বর্ণনা :
A dataset cross-topic authorship attribution. The dataset is provided by Stamatatos 2013.
1- The cross-topic scenarios are based on Table-4 in Stamatatos 2017 (Ex. cross_topic_1 => row 1:P S U&W ).
2- The cross-genre scenarios are based on Table-5 in the same paper. (Ex. cross_genre_1 => row 1:B P S&U&W).
3- The same-topic/genre scenario is created by grouping all the datasts as follows.
For ex., to use same_topic and split the data 60-40 use:
train_ds = load_dataset('guardian_authorship', name="cross_topic_<<#>>",
split='train[:60%]+validation[:60%]+test[:60%]')
tests_ds = load_dataset('guardian_authorship', name="cross_topic_<<#>>",
split='train[-40%:]+validation[-40%:]+test[-40%:]')
Important: train+validation+test[:60%] will generate the wrong splits becasue the data is imbalanced
* See https://huggingface.co/docs/datasets/splits.html for detailed/more examples
- লাইসেন্স : কোনো পরিচিত লাইসেন্স নেই
- সংস্করণ : 7.0.0
- বিভাজন :
বিভক্ত | উদাহরণ |
---|---|
'test' | 179 |
'train' | 90 |
'validation' | 112 |
- বৈশিষ্ট্য :
{
"author": {
"num_classes": 13,
"names": [
"catherinebennett",
"georgemonbiot",
"hugoyoung",
"jonathanfreedland",
"martinkettle",
"maryriddell",
"nickcohen",
"peterpreston",
"pollytoynbee",
"royhattersley",
"simonhoggart",
"willhutton",
"zoewilliams"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
},
"topic": {
"num_classes": 5,
"names": [
"Politics",
"Society",
"UK",
"World",
"Books"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
},
"article": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
cross_topic_8
TFDS এ এই ডেটাসেট লোড করতে নিম্নলিখিত কমান্ডটি ব্যবহার করুন:
ds = tfds.load('huggingface:guardian_authorship/cross_topic_8')
- বর্ণনা :
A dataset cross-topic authorship attribution. The dataset is provided by Stamatatos 2013.
1- The cross-topic scenarios are based on Table-4 in Stamatatos 2017 (Ex. cross_topic_1 => row 1:P S U&W ).
2- The cross-genre scenarios are based on Table-5 in the same paper. (Ex. cross_genre_1 => row 1:B P S&U&W).
3- The same-topic/genre scenario is created by grouping all the datasts as follows.
For ex., to use same_topic and split the data 60-40 use:
train_ds = load_dataset('guardian_authorship', name="cross_topic_<<#>>",
split='train[:60%]+validation[:60%]+test[:60%]')
tests_ds = load_dataset('guardian_authorship', name="cross_topic_<<#>>",
split='train[-40%:]+validation[-40%:]+test[-40%:]')
Important: train+validation+test[:60%] will generate the wrong splits becasue the data is imbalanced
* See https://huggingface.co/docs/datasets/splits.html for detailed/more examples
- লাইসেন্স : কোনো পরিচিত লাইসেন্স নেই
- সংস্করণ : 8.0.0
- বিভাজন :
বিভক্ত | উদাহরণ |
---|---|
'test' | 229 |
'train' | 90 |
'validation' | 62 |
- বৈশিষ্ট্য :
{
"author": {
"num_classes": 13,
"names": [
"catherinebennett",
"georgemonbiot",
"hugoyoung",
"jonathanfreedland",
"martinkettle",
"maryriddell",
"nickcohen",
"peterpreston",
"pollytoynbee",
"royhattersley",
"simonhoggart",
"willhutton",
"zoewilliams"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
},
"topic": {
"num_classes": 5,
"names": [
"Politics",
"Society",
"UK",
"World",
"Books"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
},
"article": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
cross_topic_9
TFDS এ এই ডেটাসেট লোড করতে নিম্নলিখিত কমান্ডটি ব্যবহার করুন:
ds = tfds.load('huggingface:guardian_authorship/cross_topic_9')
- বর্ণনা :
A dataset cross-topic authorship attribution. The dataset is provided by Stamatatos 2013.
1- The cross-topic scenarios are based on Table-4 in Stamatatos 2017 (Ex. cross_topic_1 => row 1:P S U&W ).
2- The cross-genre scenarios are based on Table-5 in the same paper. (Ex. cross_genre_1 => row 1:B P S&U&W).
3- The same-topic/genre scenario is created by grouping all the datasts as follows.
For ex., to use same_topic and split the data 60-40 use:
train_ds = load_dataset('guardian_authorship', name="cross_topic_<<#>>",
split='train[:60%]+validation[:60%]+test[:60%]')
tests_ds = load_dataset('guardian_authorship', name="cross_topic_<<#>>",
split='train[-40%:]+validation[-40%:]+test[-40%:]')
Important: train+validation+test[:60%] will generate the wrong splits becasue the data is imbalanced
* See https://huggingface.co/docs/datasets/splits.html for detailed/more examples
- লাইসেন্স : কোনো পরিচিত লাইসেন্স নেই
- সংস্করণ : 9.0.0
- বিভাজন :
বিভক্ত | উদাহরণ |
---|---|
'test' | 174 |
'train' | 90 |
'validation' | 117 |
- বৈশিষ্ট্য :
{
"author": {
"num_classes": 13,
"names": [
"catherinebennett",
"georgemonbiot",
"hugoyoung",
"jonathanfreedland",
"martinkettle",
"maryriddell",
"nickcohen",
"peterpreston",
"pollytoynbee",
"royhattersley",
"simonhoggart",
"willhutton",
"zoewilliams"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
},
"topic": {
"num_classes": 5,
"names": [
"Politics",
"Society",
"UK",
"World",
"Books"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
},
"article": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
cross_topic_10
TFDS এ এই ডেটাসেট লোড করতে নিম্নলিখিত কমান্ডটি ব্যবহার করুন:
ds = tfds.load('huggingface:guardian_authorship/cross_topic_10')
- বর্ণনা :
A dataset cross-topic authorship attribution. The dataset is provided by Stamatatos 2013.
1- The cross-topic scenarios are based on Table-4 in Stamatatos 2017 (Ex. cross_topic_1 => row 1:P S U&W ).
2- The cross-genre scenarios are based on Table-5 in the same paper. (Ex. cross_genre_1 => row 1:B P S&U&W).
3- The same-topic/genre scenario is created by grouping all the datasts as follows.
For ex., to use same_topic and split the data 60-40 use:
train_ds = load_dataset('guardian_authorship', name="cross_topic_<<#>>",
split='train[:60%]+validation[:60%]+test[:60%]')
tests_ds = load_dataset('guardian_authorship', name="cross_topic_<<#>>",
split='train[-40%:]+validation[-40%:]+test[-40%:]')
Important: train+validation+test[:60%] will generate the wrong splits becasue the data is imbalanced
* See https://huggingface.co/docs/datasets/splits.html for detailed/more examples
- লাইসেন্স : কোনো পরিচিত লাইসেন্স নেই
- সংস্করণ : 10.0.0
- বিভাজন :
বিভক্ত | উদাহরণ |
---|---|
'test' | 152 |
'train' | 117 |
'validation' | 112 |
- বৈশিষ্ট্য :
{
"author": {
"num_classes": 13,
"names": [
"catherinebennett",
"georgemonbiot",
"hugoyoung",
"jonathanfreedland",
"martinkettle",
"maryriddell",
"nickcohen",
"peterpreston",
"pollytoynbee",
"royhattersley",
"simonhoggart",
"willhutton",
"zoewilliams"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
},
"topic": {
"num_classes": 5,
"names": [
"Politics",
"Society",
"UK",
"World",
"Books"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
},
"article": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
cross_topic_11
TFDS এ এই ডেটাসেট লোড করতে নিম্নলিখিত কমান্ডটি ব্যবহার করুন:
ds = tfds.load('huggingface:guardian_authorship/cross_topic_11')
- বর্ণনা :
A dataset cross-topic authorship attribution. The dataset is provided by Stamatatos 2013.
1- The cross-topic scenarios are based on Table-4 in Stamatatos 2017 (Ex. cross_topic_1 => row 1:P S U&W ).
2- The cross-genre scenarios are based on Table-5 in the same paper. (Ex. cross_genre_1 => row 1:B P S&U&W).
3- The same-topic/genre scenario is created by grouping all the datasts as follows.
For ex., to use same_topic and split the data 60-40 use:
train_ds = load_dataset('guardian_authorship', name="cross_topic_<<#>>",
split='train[:60%]+validation[:60%]+test[:60%]')
tests_ds = load_dataset('guardian_authorship', name="cross_topic_<<#>>",
split='train[-40%:]+validation[-40%:]+test[-40%:]')
Important: train+validation+test[:60%] will generate the wrong splits becasue the data is imbalanced
* See https://huggingface.co/docs/datasets/splits.html for detailed/more examples
- লাইসেন্স : কোনো পরিচিত লাইসেন্স নেই
- সংস্করণ : 11.0.0
- বিভাজন :
বিভক্ত | উদাহরণ |
---|---|
'test' | 202 |
'train' | 117 |
'validation' | 62 |
- বৈশিষ্ট্য :
{
"author": {
"num_classes": 13,
"names": [
"catherinebennett",
"georgemonbiot",
"hugoyoung",
"jonathanfreedland",
"martinkettle",
"maryriddell",
"nickcohen",
"peterpreston",
"pollytoynbee",
"royhattersley",
"simonhoggart",
"willhutton",
"zoewilliams"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
},
"topic": {
"num_classes": 5,
"names": [
"Politics",
"Society",
"UK",
"World",
"Books"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
},
"article": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
cross_topic_12
TFDS এ এই ডেটাসেট লোড করতে নিম্নলিখিত কমান্ডটি ব্যবহার করুন:
ds = tfds.load('huggingface:guardian_authorship/cross_topic_12')
- বর্ণনা :
A dataset cross-topic authorship attribution. The dataset is provided by Stamatatos 2013.
1- The cross-topic scenarios are based on Table-4 in Stamatatos 2017 (Ex. cross_topic_1 => row 1:P S U&W ).
2- The cross-genre scenarios are based on Table-5 in the same paper. (Ex. cross_genre_1 => row 1:B P S&U&W).
3- The same-topic/genre scenario is created by grouping all the datasts as follows.
For ex., to use same_topic and split the data 60-40 use:
train_ds = load_dataset('guardian_authorship', name="cross_topic_<<#>>",
split='train[:60%]+validation[:60%]+test[:60%]')
tests_ds = load_dataset('guardian_authorship', name="cross_topic_<<#>>",
split='train[-40%:]+validation[-40%:]+test[-40%:]')
Important: train+validation+test[:60%] will generate the wrong splits becasue the data is imbalanced
* See https://huggingface.co/docs/datasets/splits.html for detailed/more examples
- লাইসেন্স : কোনো পরিচিত লাইসেন্স নেই
- সংস্করণ : 12.0.0
- বিভাজন :
বিভক্ত | উদাহরণ |
---|---|
'test' | 174 |
'train' | 117 |
'validation' | 90 |
- বৈশিষ্ট্য :
{
"author": {
"num_classes": 13,
"names": [
"catherinebennett",
"georgemonbiot",
"hugoyoung",
"jonathanfreedland",
"martinkettle",
"maryriddell",
"nickcohen",
"peterpreston",
"pollytoynbee",
"royhattersley",
"simonhoggart",
"willhutton",
"zoewilliams"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
},
"topic": {
"num_classes": 5,
"names": [
"Politics",
"Society",
"UK",
"World",
"Books"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
},
"article": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
cross_genre_2
TFDS এ এই ডেটাসেট লোড করতে নিম্নলিখিত কমান্ডটি ব্যবহার করুন:
ds = tfds.load('huggingface:guardian_authorship/cross_genre_2')
- বর্ণনা :
A dataset cross-topic authorship attribution. The dataset is provided by Stamatatos 2013.
1- The cross-topic scenarios are based on Table-4 in Stamatatos 2017 (Ex. cross_topic_1 => row 1:P S U&W ).
2- The cross-genre scenarios are based on Table-5 in the same paper. (Ex. cross_genre_1 => row 1:B P S&U&W).
3- The same-topic/genre scenario is created by grouping all the datasts as follows.
For ex., to use same_topic and split the data 60-40 use:
train_ds = load_dataset('guardian_authorship', name="cross_topic_<<#>>",
split='train[:60%]+validation[:60%]+test[:60%]')
tests_ds = load_dataset('guardian_authorship', name="cross_topic_<<#>>",
split='train[-40%:]+validation[-40%:]+test[-40%:]')
Important: train+validation+test[:60%] will generate the wrong splits becasue the data is imbalanced
* See https://huggingface.co/docs/datasets/splits.html for detailed/more examples
- লাইসেন্স : কোনো পরিচিত লাইসেন্স নেই
- সংস্করণ : 14.0.0
- বিভাজন :
বিভক্ত | উদাহরণ |
---|---|
'test' | 319 |
'train' | 63 |
'validation' | 62 |
- বৈশিষ্ট্য :
{
"author": {
"num_classes": 13,
"names": [
"catherinebennett",
"georgemonbiot",
"hugoyoung",
"jonathanfreedland",
"martinkettle",
"maryriddell",
"nickcohen",
"peterpreston",
"pollytoynbee",
"royhattersley",
"simonhoggart",
"willhutton",
"zoewilliams"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
},
"topic": {
"num_classes": 5,
"names": [
"Politics",
"Society",
"UK",
"World",
"Books"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
},
"article": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
cross_genre_3
TFDS এ এই ডেটাসেট লোড করতে নিম্নলিখিত কমান্ডটি ব্যবহার করুন:
ds = tfds.load('huggingface:guardian_authorship/cross_genre_3')
- বর্ণনা :
A dataset cross-topic authorship attribution. The dataset is provided by Stamatatos 2013.
1- The cross-topic scenarios are based on Table-4 in Stamatatos 2017 (Ex. cross_topic_1 => row 1:P S U&W ).
2- The cross-genre scenarios are based on Table-5 in the same paper. (Ex. cross_genre_1 => row 1:B P S&U&W).
3- The same-topic/genre scenario is created by grouping all the datasts as follows.
For ex., to use same_topic and split the data 60-40 use:
train_ds = load_dataset('guardian_authorship', name="cross_topic_<<#>>",
split='train[:60%]+validation[:60%]+test[:60%]')
tests_ds = load_dataset('guardian_authorship', name="cross_topic_<<#>>",
split='train[-40%:]+validation[-40%:]+test[-40%:]')
Important: train+validation+test[:60%] will generate the wrong splits becasue the data is imbalanced
* See https://huggingface.co/docs/datasets/splits.html for detailed/more examples
- লাইসেন্স : কোনো পরিচিত লাইসেন্স নেই
- সংস্করণ : 15.0.0
- বিভাজন :
বিভক্ত | উদাহরণ |
---|---|
'test' | 291 |
'train' | 63 |
'validation' | 90 |
- বৈশিষ্ট্য :
{
"author": {
"num_classes": 13,
"names": [
"catherinebennett",
"georgemonbiot",
"hugoyoung",
"jonathanfreedland",
"martinkettle",
"maryriddell",
"nickcohen",
"peterpreston",
"pollytoynbee",
"royhattersley",
"simonhoggart",
"willhutton",
"zoewilliams"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
},
"topic": {
"num_classes": 5,
"names": [
"Politics",
"Society",
"UK",
"World",
"Books"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
},
"article": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
cross_genre_4
TFDS এ এই ডেটাসেট লোড করতে নিম্নলিখিত কমান্ডটি ব্যবহার করুন:
ds = tfds.load('huggingface:guardian_authorship/cross_genre_4')
- বর্ণনা :
A dataset cross-topic authorship attribution. The dataset is provided by Stamatatos 2013.
1- The cross-topic scenarios are based on Table-4 in Stamatatos 2017 (Ex. cross_topic_1 => row 1:P S U&W ).
2- The cross-genre scenarios are based on Table-5 in the same paper. (Ex. cross_genre_1 => row 1:B P S&U&W).
3- The same-topic/genre scenario is created by grouping all the datasts as follows.
For ex., to use same_topic and split the data 60-40 use:
train_ds = load_dataset('guardian_authorship', name="cross_topic_<<#>>",
split='train[:60%]+validation[:60%]+test[:60%]')
tests_ds = load_dataset('guardian_authorship', name="cross_topic_<<#>>",
split='train[-40%:]+validation[-40%:]+test[-40%:]')
Important: train+validation+test[:60%] will generate the wrong splits becasue the data is imbalanced
* See https://huggingface.co/docs/datasets/splits.html for detailed/more examples
- লাইসেন্স : কোনো পরিচিত লাইসেন্স নেই
- সংস্করণ : 16.0.0
- বিভাজন :
বিভক্ত | উদাহরণ |
---|---|
'test' | 264 |
'train' | 63 |
'validation' | 117 |
- বৈশিষ্ট্য :
{
"author": {
"num_classes": 13,
"names": [
"catherinebennett",
"georgemonbiot",
"hugoyoung",
"jonathanfreedland",
"martinkettle",
"maryriddell",
"nickcohen",
"peterpreston",
"pollytoynbee",
"royhattersley",
"simonhoggart",
"willhutton",
"zoewilliams"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
},
"topic": {
"num_classes": 5,
"names": [
"Politics",
"Society",
"UK",
"World",
"Books"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
},
"article": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}