References:
az_to_en
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:ted_hrlr/az_to_en')
- Description:
Data sets derived from TED talk transcripts for comparing similar language pairs
where one is high resource and the other is low resource.
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'test' |
904 |
'train' |
5947 |
'validation' |
672 |
- Features:
{
"translation": {
"languages": [
"az",
"en"
],
"id": null,
"_type": "Translation"
}
}
aztr_to_en
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:ted_hrlr/aztr_to_en')
- Description:
Data sets derived from TED talk transcripts for comparing similar language pairs
where one is high resource and the other is low resource.
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'test' |
904 |
'train' |
188397 |
'validation' |
672 |
- Features:
{
"translation": {
"languages": [
"az_tr",
"en"
],
"id": null,
"_type": "Translation"
}
}
be_to_en
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:ted_hrlr/be_to_en')
- Description:
Data sets derived from TED talk transcripts for comparing similar language pairs
where one is high resource and the other is low resource.
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'test' |
665 |
'train' |
4510 |
'validation' |
249 |
- Features:
{
"translation": {
"languages": [
"be",
"en"
],
"id": null,
"_type": "Translation"
}
}
beru_to_en
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:ted_hrlr/beru_to_en')
- Description:
Data sets derived from TED talk transcripts for comparing similar language pairs
where one is high resource and the other is low resource.
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'test' |
665 |
'train' |
212615 |
'validation' |
249 |
- Features:
{
"translation": {
"languages": [
"be_ru",
"en"
],
"id": null,
"_type": "Translation"
}
}
es_to_pt
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:ted_hrlr/es_to_pt')
- Description:
Data sets derived from TED talk transcripts for comparing similar language pairs
where one is high resource and the other is low resource.
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'test' |
1764 |
'train' |
44939 |
'validation' |
1017 |
- Features:
{
"translation": {
"languages": [
"es",
"pt"
],
"id": null,
"_type": "Translation"
}
}
fr_to_pt
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:ted_hrlr/fr_to_pt')
- Description:
Data sets derived from TED talk transcripts for comparing similar language pairs
where one is high resource and the other is low resource.
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'test' |
1495 |
'train' |
43874 |
'validation' |
1132 |
- Features:
{
"translation": {
"languages": [
"fr",
"pt"
],
"id": null,
"_type": "Translation"
}
}
gl_to_en
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:ted_hrlr/gl_to_en')
- Description:
Data sets derived from TED talk transcripts for comparing similar language pairs
where one is high resource and the other is low resource.
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'test' |
1008 |
'train' |
10018 |
'validation' |
683 |
- Features:
{
"translation": {
"languages": [
"gl",
"en"
],
"id": null,
"_type": "Translation"
}
}
glpt_to_en
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:ted_hrlr/glpt_to_en')
- Description:
Data sets derived from TED talk transcripts for comparing similar language pairs
where one is high resource and the other is low resource.
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'test' |
1008 |
'train' |
61803 |
'validation' |
683 |
- Features:
{
"translation": {
"languages": [
"gl_pt",
"en"
],
"id": null,
"_type": "Translation"
}
}
he_to_pt
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:ted_hrlr/he_to_pt')
- Description:
Data sets derived from TED talk transcripts for comparing similar language pairs
where one is high resource and the other is low resource.
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'test' |
1624 |
'train' |
48512 |
'validation' |
1146 |
- Features:
{
"translation": {
"languages": [
"he",
"pt"
],
"id": null,
"_type": "Translation"
}
}
it_to_pt
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:ted_hrlr/it_to_pt')
- Description:
Data sets derived from TED talk transcripts for comparing similar language pairs
where one is high resource and the other is low resource.
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'test' |
1670 |
'train' |
46260 |
'validation' |
1163 |
- Features:
{
"translation": {
"languages": [
"it",
"pt"
],
"id": null,
"_type": "Translation"
}
}
pt_to_en
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:ted_hrlr/pt_to_en')
- Description:
Data sets derived from TED talk transcripts for comparing similar language pairs
where one is high resource and the other is low resource.
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'test' |
1804 |
'train' |
51786 |
'validation' |
1194 |
- Features:
{
"translation": {
"languages": [
"pt",
"en"
],
"id": null,
"_type": "Translation"
}
}
ru_to_en
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:ted_hrlr/ru_to_en')
- Description:
Data sets derived from TED talk transcripts for comparing similar language pairs
where one is high resource and the other is low resource.
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'test' |
5477 |
'train' |
208107 |
'validation' |
4806 |
- Features:
{
"translation": {
"languages": [
"ru",
"en"
],
"id": null,
"_type": "Translation"
}
}
ru_to_pt
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:ted_hrlr/ru_to_pt')
- Description:
Data sets derived from TED talk transcripts for comparing similar language pairs
where one is high resource and the other is low resource.
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'test' |
1589 |
'train' |
47279 |
'validation' |
1185 |
- Features:
{
"translation": {
"languages": [
"ru",
"pt"
],
"id": null,
"_type": "Translation"
}
}
tr_to_en
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:ted_hrlr/tr_to_en')
- Description:
Data sets derived from TED talk transcripts for comparing similar language pairs
where one is high resource and the other is low resource.
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'test' |
5030 |
'train' |
182451 |
'validation' |
4046 |
- Features:
{
"translation": {
"languages": [
"tr",
"en"
],
"id": null,
"_type": "Translation"
}
}