- Description:
MTNT: Machine Translation of Noisy Text
Source code:
tfds.translate.mtnt.Mtnt
Versions:
1.0.0
(default): Initial release.
Download size:
35.08 MiB
Auto-cached (documentation): Yes
Feature structure:
FeaturesDict({
'dst': Text(shape=(), dtype=string),
'src': Text(shape=(), dtype=string),
})
- Feature documentation:
Feature | Class | Shape | Dtype | Description |
---|---|---|---|---|
FeaturesDict | ||||
dst | Text | string | ||
src | Text | string |
Supervised keys (See
as_supervised
doc):('src', 'dst')
Figure (tfds.show_examples): Not supported.
Citation:
@InProceedings{michel2018mtnt,
author = {Michel, Paul and Neubig, Graham},
title = {MTNT: A Testbed for Machine Translation of Noisy Text},
booktitle = {Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing}
}
mtnt/en-fr (default config)
Dataset size:
11.33 MiB
Splits:
Split | Examples |
---|---|
'test' |
1,020 |
'train' |
35,692 |
'valid' |
811 |
- Examples (tfds.as_dataframe):
mtnt/en-ja
Dataset size:
4.41 MiB
Splits:
Split | Examples |
---|---|
'test' |
946 |
'train' |
5,746 |
'valid' |
892 |
- Examples (tfds.as_dataframe):
mtnt/fr-en
Dataset size:
8.28 MiB
Splits:
Split | Examples |
---|---|
'test' |
1,022 |
'train' |
18,942 |
'valid' |
876 |
- Examples (tfds.as_dataframe):