References:
Arabic
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:wiki_lingua/arabic')
- Description:
WikiLingua is a large-scale multilingual dataset for the evaluation of
crosslingual abstractive summarization systems. The dataset includes ~770k
article and summary pairs in 18 languages from WikiHow. The gold-standard
article-summary alignments across languages were done by aligning the images
that are used to describe each how-to step in an article.
- License: CC BY-NC-SA 3.0
- Version: 1.1.1
- Splits:
Split | Examples |
---|---|
'train' | 9995 |
- Features:
{
"url": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"article": {
"feature": {
"section_name": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"document": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"summary": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"english_url": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"english_section_name": {
"dtype": "string",
"id": null,
"_type": "Value"
}
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
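The `article` feature above is a sequence of per-section records. A minimal sketch of reading one example follows, assuming the sequence is materialised as a dict of parallel lists (the layout the Hugging Face `datasets` library typically uses for a `Sequence` of named sub-features). The helper name `iter_sections` and the sample record are hypothetical and not part of the dataset.

```python
from typing import Dict, Iterator, List, Tuple


def iter_sections(example: Dict) -> Iterator[Tuple[str, str, str]]:
    """Yield (section_name, document, summary) for each section of one example.

    Assumes example["article"] is a dict of parallel lists with one entry per
    how-to section; this layout is an assumption, not stated on the page.
    """
    article: Dict[str, List[str]] = example["article"]
    yield from zip(article["section_name"], article["document"], article["summary"])


# Hypothetical record shaped like the feature schema above.
sample = {
    "url": "https://example.org/how-to",
    "article": {
        "section_name": ["Step one", "Step two"],
        "document": ["Long text of step one.", "Long text of step two."],
        "summary": ["Short one.", "Short two."],
        "english_url": ["https://example.org/en", "https://example.org/en"],
        "english_section_name": ["Step one", "Step two"],
    },
}

pairs = list(iter_sections(sample))
```

Each tuple in `pairs` is one article and summary pair, which is the unit a summarization model would train or be evaluated on.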
Chinese
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:wiki_lingua/chinese')
- Description:
WikiLingua is a large-scale multilingual dataset for the evaluation of
crosslingual abstractive summarization systems. The dataset includes ~770k
article and summary pairs in 18 languages from WikiHow. The gold-standard
article-summary alignments across languages were done by aligning the images
that are used to describe each how-to step in an article.
- License: CC BY-NC-SA 3.0
- Version: 1.1.1
- Splits:
Split | Examples |
---|---|
'train' | 6541 |
- Features:
{
"url": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"article": {
"feature": {
"section_name": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"document": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"summary": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"english_url": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"english_section_name": {
"dtype": "string",
"id": null,
"_type": "Value"
}
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
Czech
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:wiki_lingua/czech')
- Description:
WikiLingua is a large-scale multilingual dataset for the evaluation of
crosslingual abstractive summarization systems. The dataset includes ~770k
article and summary pairs in 18 languages from WikiHow. The gold-standard
article-summary alignments across languages were done by aligning the images
that are used to describe each how-to step in an article.
- License: CC BY-NC-SA 3.0
- Version: 1.1.1
- Splits:
Split | Examples |
---|---|
'train' | 2520 |
- Features:
{
"url": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"article": {
"feature": {
"section_name": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"document": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"summary": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"english_url": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"english_section_name": {
"dtype": "string",
"id": null,
"_type": "Value"
}
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
Dutch
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:wiki_lingua/dutch')
- Description:
WikiLingua is a large-scale multilingual dataset for the evaluation of
crosslingual abstractive summarization systems. The dataset includes ~770k
article and summary pairs in 18 languages from WikiHow. The gold-standard
article-summary alignments across languages were done by aligning the images
that are used to describe each how-to step in an article.
- License: CC BY-NC-SA 3.0
- Version: 1.1.1
- Splits:
Split | Examples |
---|---|
'train' | 10862 |
- Features:
{
"url": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"article": {
"feature": {
"section_name": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"document": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"summary": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"english_url": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"english_section_name": {
"dtype": "string",
"id": null,
"_type": "Value"
}
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
English
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:wiki_lingua/english')
- Description:
WikiLingua is a large-scale multilingual dataset for the evaluation of
crosslingual abstractive summarization systems. The dataset includes ~770k
article and summary pairs in 18 languages from WikiHow. The gold-standard
article-summary alignments across languages were done by aligning the images
that are used to describe each how-to step in an article.
- License: CC BY-NC-SA 3.0
- Version: 1.1.1
- Splits:
Split | Examples |
---|---|
'train' | 57945 |
- Features:
{
"url": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"article": {
"feature": {
"section_name": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"document": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"summary": {
"dtype": "string",
"id": null,
"_type": "Value"
}
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
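Unlike the other configurations, the English schema has no `english_url` or `english_section_name` fields; in the other languages those fields appear to point back at the matching English article and section. A rough sketch of how that link could be followed is given below. It assumes that each `article` is a dict of parallel lists and that `english_url` equals the `url` of an English example. Neither assumption is confirmed by this page, and the function names and sample records are hypothetical.

```python
from typing import Dict, Iterable, List, Optional, Tuple

SectionKey = Tuple[str, str]  # (English article URL, English section name)


def index_english(examples: Iterable[Dict]) -> Dict[SectionKey, str]:
    """Map (url, section_name) to the English summary of that section."""
    index: Dict[SectionKey, str] = {}
    for ex in examples:
        art = ex["article"]
        for name, summary in zip(art["section_name"], art["summary"]):
            index[(ex["url"], name)] = summary
    return index


def english_summaries(example: Dict, index: Dict[SectionKey, str]) -> List[Optional[str]]:
    """Return the aligned English summary (or None) for each section."""
    art = example["article"]
    keys = zip(art["english_url"], art["english_section_name"])
    return [index.get(key) for key in keys]


# Hypothetical records shaped like the English and non-English schemas.
english = [{
    "url": "https://example.org/en/tea",
    "article": {
        "section_name": ["Boil water"],
        "document": ["Fill the kettle and heat it until the water boils."],
        "summary": ["Boil the water."],
    },
}]
french = {
    "url": "https://example.org/fr/the",
    "article": {
        "section_name": ["Faire bouillir l'eau", "Servir"],
        "document": ["Remplissez la bouilloire.", "Versez le thé."],
        "summary": ["Faites bouillir l'eau.", "Servez."],
        "english_url": ["https://example.org/en/tea", "https://example.org/en/tea"],
        "english_section_name": ["Boil water", "Serve"],
    },
}

aligned = english_summaries(french, index_english(english))
```

Sections with no English counterpart in the index come back as `None`, so a crosslingual pair (non-English document, English summary) can be kept only where the lookup succeeds.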
French
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:wiki_lingua/french')
- Description:
WikiLingua is a large-scale multilingual dataset for the evaluation of
crosslingual abstractive summarization systems. The dataset includes ~770k
article and summary pairs in 18 languages from WikiHow. The gold-standard
article-summary alignments across languages were done by aligning the images
that are used to describe each how-to step in an article.
- License: CC BY-NC-SA 3.0
- Version: 1.1.1
- Splits:
Split | Examples |
---|---|
'train' | 21690 |
- Features:
{
"url": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"article": {
"feature": {
"section_name": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"document": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"summary": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"english_url": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"english_section_name": {
"dtype": "string",
"id": null,
"_type": "Value"
}
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
German
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:wiki_lingua/german')
- Description:
WikiLingua is a large-scale multilingual dataset for the evaluation of
crosslingual abstractive summarization systems. The dataset includes ~770k
article and summary pairs in 18 languages from WikiHow. The gold-standard
article-summary alignments across languages were done by aligning the images
that are used to describe each how-to step in an article.
- License: CC BY-NC-SA 3.0
- Version: 1.1.1
- Splits:
Split | Examples |
---|---|
'train' | 20103 |
- Features:
{
"url": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"article": {
"feature": {
"section_name": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"document": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"summary": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"english_url": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"english_section_name": {
"dtype": "string",
"id": null,
"_type": "Value"
}
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
Hindi
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:wiki_lingua/hindi')
- Description:
WikiLingua is a large-scale multilingual dataset for the evaluation of
crosslingual abstractive summarization systems. The dataset includes ~770k
article and summary pairs in 18 languages from WikiHow. The gold-standard
article-summary alignments across languages were done by aligning the images
that are used to describe each how-to step in an article.
- License: CC BY-NC-SA 3.0
- Version: 1.1.1
- Splits:
Split | Examples |
---|---|
'train' | 3402 |
- Features:
{
"url": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"article": {
"feature": {
"section_name": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"document": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"summary": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"english_url": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"english_section_name": {
"dtype": "string",
"id": null,
"_type": "Value"
}
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
Indonesian
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:wiki_lingua/indonesian')
- Description:
WikiLingua is a large-scale multilingual dataset for the evaluation of
crosslingual abstractive summarization systems. The dataset includes ~770k
article and summary pairs in 18 languages from WikiHow. The gold-standard
article-summary alignments across languages were done by aligning the images
that are used to describe each how-to step in an article.
- License: CC BY-NC-SA 3.0
- Version: 1.1.1
- Splits:
Split | Examples |
---|---|
'train' | 16308 |
- Features:
{
"url": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"article": {
"feature": {
"section_name": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"document": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"summary": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"english_url": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"english_section_name": {
"dtype": "string",
"id": null,
"_type": "Value"
}
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
Italian
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:wiki_lingua/italian')
- Description:
WikiLingua is a large-scale multilingual dataset for the evaluation of
crosslingual abstractive summarization systems. The dataset includes ~770k
article and summary pairs in 18 languages from WikiHow. The gold-standard
article-summary alignments across languages were done by aligning the images
that are used to describe each how-to step in an article.
- License: CC BY-NC-SA 3.0
- Version: 1.1.1
- Splits:
Split | Examples |
---|---|
'train' | 17673 |
- Features:
{
"url": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"article": {
"feature": {
"section_name": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"document": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"summary": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"english_url": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"english_section_name": {
"dtype": "string",
"id": null,
"_type": "Value"
}
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
Japanese
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:wiki_lingua/japanese')
- Description:
WikiLingua is a large-scale multilingual dataset for the evaluation of
crosslingual abstractive summarization systems. The dataset includes ~770k
article and summary pairs in 18 languages from WikiHow. The gold-standard
article-summary alignments across languages were done by aligning the images
that are used to describe each how-to step in an article.
- License: CC BY-NC-SA 3.0
- Version: 1.1.1
- Splits:
Split | Examples |
---|---|
'train' | 4372 |
- Features:
{
"url": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"article": {
"feature": {
"section_name": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"document": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"summary": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"english_url": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"english_section_name": {
"dtype": "string",
"id": null,
"_type": "Value"
}
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
Korean
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:wiki_lingua/korean')
- Description:
WikiLingua is a large-scale multilingual dataset for the evaluation of
crosslingual abstractive summarization systems. The dataset includes ~770k
article and summary pairs in 18 languages from WikiHow. The gold-standard
article-summary alignments across languages were done by aligning the images
that are used to describe each how-to step in an article.
- License: CC BY-NC-SA 3.0
- Version: 1.1.1
- Splits:
Split | Examples |
---|---|
'train' | 4111 |
- Features:
{
"url": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"article": {
"feature": {
"section_name": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"document": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"summary": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"english_url": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"english_section_name": {
"dtype": "string",
"id": null,
"_type": "Value"
}
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
Portuguese
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:wiki_lingua/portuguese')
- Description:
WikiLingua is a large-scale multilingual dataset for the evaluation of
crosslingual abstractive summarization systems. The dataset includes ~770k
article and summary pairs in 18 languages from WikiHow. The gold-standard
article-summary alignments across languages were done by aligning the images
that are used to describe each how-to step in an article.
- License: CC BY-NC-SA 3.0
- Version: 1.1.1
- Splits:
Split | Examples |
---|---|
'train' | 28143 |
- Features:
{
"url": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"article": {
"feature": {
"section_name": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"document": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"summary": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"english_url": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"english_section_name": {
"dtype": "string",
"id": null,
"_type": "Value"
}
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
Russian
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:wiki_lingua/russian')
- Description:
WikiLingua is a large-scale multilingual dataset for the evaluation of
crosslingual abstractive summarization systems. The dataset includes ~770k
article and summary pairs in 18 languages from WikiHow. The gold-standard
article-summary alignments across languages were done by aligning the images
that are used to describe each how-to step in an article.
- License: CC BY-NC-SA 3.0
- Version: 1.1.1
- Splits:
Split | Examples |
---|---|
'train' | 18143 |
- Features:
{
"url": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"article": {
"feature": {
"section_name": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"document": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"summary": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"english_url": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"english_section_name": {
"dtype": "string",
"id": null,
"_type": "Value"
}
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
Spanish
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:wiki_lingua/spanish')
- Description:
WikiLingua is a large-scale multilingual dataset for the evaluation of
crosslingual abstractive summarization systems. The dataset includes ~770k
article and summary pairs in 18 languages from WikiHow. The gold-standard
article-summary alignments across languages were done by aligning the images
that are used to describe each how-to step in an article.
- License: CC BY-NC-SA 3.0
- Version: 1.1.1
- Splits:
Split | Examples |
---|---|
'train' | 38795 |
- Features:
{
"url": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"article": {
"feature": {
"section_name": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"document": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"summary": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"english_url": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"english_section_name": {
"dtype": "string",
"id": null,
"_type": "Value"
}
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
Thai
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:wiki_lingua/thai')
- Description:
WikiLingua is a large-scale multilingual dataset for the evaluation of
crosslingual abstractive summarization systems. The dataset includes ~770k
article and summary pairs in 18 languages from WikiHow. The gold-standard
article-summary alignments across languages were done by aligning the images
that are used to describe each how-to step in an article.
- License: CC BY-NC-SA 3.0
- Version: 1.1.1
- Splits:
Split | Examples |
---|---|
'train' | 5093 |
- Features:
{
"url": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"article": {
"feature": {
"section_name": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"document": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"summary": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"english_url": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"english_section_name": {
"dtype": "string",
"id": null,
"_type": "Value"
}
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
Turkish
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:wiki_lingua/turkish')
- Description:
WikiLingua is a large-scale multilingual dataset for the evaluation of
crosslingual abstractive summarization systems. The dataset includes ~770k
article and summary pairs in 18 languages from WikiHow. The gold-standard
article-summary alignments across languages were done by aligning the images
that are used to describe each how-to step in an article.
- License: CC BY-NC-SA 3.0
- Version: 1.1.1
- Splits:
Split | Examples |
---|---|
'train' | 1512 |
- Features:
{
"url": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"article": {
"feature": {
"section_name": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"document": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"summary": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"english_url": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"english_section_name": {
"dtype": "string",
"id": null,
"_type": "Value"
}
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
Vietnamese
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:wiki_lingua/vietnamese')
- Description:
WikiLingua is a large-scale multilingual dataset for the evaluation of
crosslingual abstractive summarization systems. The dataset includes ~770k
article and summary pairs in 18 languages from WikiHow. The gold-standard
article-summary alignments across languages were done by aligning the images
that are used to describe each how-to step in an article.
- License: CC BY-NC-SA 3.0
- Version: 1.1.1
- Splits:
Split | Examples |
---|---|
'train' | 6616 |
- Features:
{
"url": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"article": {
"feature": {
"section_name": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"document": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"summary": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"english_url": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"english_section_name": {
"dtype": "string",
"id": null,
"_type": "Value"
}
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
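The per-language `train` sizes listed on this page can be gathered into one table, which also makes it easy to build the `tfds.load` names programmatically. A small sketch follows: the counts and the `huggingface:wiki_lingua/<config>` name pattern are copied from the sections above, while `TRAIN_EXAMPLES` and `tfds_name` are hypothetical names introduced only for illustration.

```python
# Train split sizes as listed in the split tables on this page.
TRAIN_EXAMPLES = {
    "arabic": 9995,
    "chinese": 6541,
    "czech": 2520,
    "dutch": 10862,
    "english": 57945,
    "french": 21690,
    "german": 20103,
    "hindi": 3402,
    "indonesian": 16308,
    "italian": 17673,
    "japanese": 4372,
    "korean": 4111,
    "portuguese": 28143,
    "russian": 18143,
    "spanish": 38795,
    "thai": 5093,
    "turkish": 1512,
    "vietnamese": 6616,
}


def tfds_name(config: str) -> str:
    """Build the dataset name passed to tfds.load for one language config."""
    if config not in TRAIN_EXAMPLES:
        raise KeyError(f"unknown wiki_lingua config: {config}")
    return f"huggingface:wiki_lingua/{config}"


total = sum(TRAIN_EXAMPLES.values())
largest = max(TRAIN_EXAMPLES, key=TRAIN_EXAMPLES.get)
smallest = min(TRAIN_EXAMPLES, key=TRAIN_EXAMPLES.get)
```

Because `article` is a sequence, a single example can hold several section-level article and summary pairs, so these example counts need not add up to the ~770k pairs quoted in the description.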