ru-pl
Use the following command to load this dataset in TFDS:
import tensorflow_datasets as tfds
ds = tfds.load('huggingface:poleval2019_mt/ru-pl')
- Description:
PolEval is a SemEval-inspired evaluation campaign for natural language processing tools for Polish. Submitted solutions compete against one another within certain tasks selected by organizers, using available data, and are evaluated according to pre-established procedures. One of the tasks in PolEval-2019 was Machine Translation (Task-4).
The task is to train the best possible machine translation system, using any technology, with limited textual resources. The competition covers two language pairs: the more popular English-Polish (into Polish only) and Russian-Polish (in both directions), which can be considered low-resource.
Here, Polish-English is also made available to allow for training in both directions. However, the test data is ONLY available for English-Polish.
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'test' | 2969 |
'train' | 20001 |
'validation' | 3001 |
- Features:
{
  "translation": {
    "languages": [
      "ru",
      "pl"
    ],
    "id": null,
    "_type": "Translation"
  }
}
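The feature spec above suggests that each example carries a single `translation` entry keyed by language code. The following is a minimal sketch of pulling a (source, target) pair out of one example. It assumes the example has already been converted to a plain Python dict (for instance via `tfds.as_numpy`) and that string values may arrive as UTF-8 bytes. The helper name `to_pair` and the sample sentence are hypothetical and not part of the dataset or of TFDS.

```python
from typing import Mapping, Tuple, Union

Text = Union[str, bytes]


def to_pair(example: Mapping, src: str = "ru", tgt: str = "pl") -> Tuple[str, str]:
    """Return (source, target) text from one example shaped like the feature spec."""
    # Assumption: the "translation" entry is a mapping from language code to text.
    translation = example["translation"]

    def as_str(value: Text) -> str:
        # String features often arrive as bytes; decode them, pass str through.
        return value.decode("utf-8") if isinstance(value, bytes) else value

    return as_str(translation[src]), as_str(translation[tgt])


# Hand-written illustrative example, not taken from the dataset.
sample = {"translation": {"ru": "Привет", "pl": "Cześć".encode("utf-8")}}
pair = to_pair(sample)
```

Swapping the `src` and `tgt` arguments gives the reverse direction from the same example.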
en-pl
Use the following command to load this dataset in TFDS:
import tensorflow_datasets as tfds
ds = tfds.load('huggingface:poleval2019_mt/en-pl')
- Description:
PolEval is a SemEval-inspired evaluation campaign for natural language processing tools for Polish. Submitted solutions compete against one another within certain tasks selected by organizers, using available data, and are evaluated according to pre-established procedures. One of the tasks in PolEval-2019 was Machine Translation (Task-4).
The task is to train the best possible machine translation system, using any technology, with limited textual resources. The competition covers two language pairs: the more popular English-Polish (into Polish only) and Russian-Polish (in both directions), which can be considered low-resource.
Here, Polish-English is also made available to allow for training in both directions. However, the test data is ONLY available for English-Polish.
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'test' | 9845 |
'train' | 129255 |
'validation' | 10001 |
- Features:
{
  "translation": {
    "languages": [
      "en",
      "pl"
    ],
    "id": null,
    "_type": "Translation"
  }
}
pl-ru
Use the following command to load this dataset in TFDS:
import tensorflow_datasets as tfds
ds = tfds.load('huggingface:poleval2019_mt/pl-ru')
- Description:
PolEval is a SemEval-inspired evaluation campaign for natural language processing tools for Polish. Submitted solutions compete against one another within certain tasks selected by organizers, using available data, and are evaluated according to pre-established procedures. One of the tasks in PolEval-2019 was Machine Translation (Task-4).
The task is to train the best possible machine translation system, using any technology, with limited textual resources. The competition covers two language pairs: the more popular English-Polish (into Polish only) and Russian-Polish (in both directions), which can be considered low-resource.
Here, Polish-English is also made available to allow for training in both directions. However, the test data is ONLY available for English-Polish.
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'test' | 2967 |
'train' | 20001 |
'validation' | 3001 |
- Features:
{
  "translation": {
    "languages": [
      "pl",
      "ru"
    ],
    "id": null,
    "_type": "Translation"
  }
}
pl-en
Use the following command to load this dataset in TFDS:
import tensorflow_datasets as tfds
ds = tfds.load('huggingface:poleval2019_mt/pl-en')
- Description:
PolEval is a SemEval-inspired evaluation campaign for natural language processing tools for Polish. Submitted solutions compete against one another within certain tasks selected by organizers, using available data, and are evaluated according to pre-established procedures. One of the tasks in PolEval-2019 was Machine Translation (Task-4).
The task is to train the best possible machine translation system, using any technology, with limited textual resources. The competition covers two language pairs: the more popular English-Polish (into Polish only) and Russian-Polish (in both directions), which can be considered low-resource.
Here, Polish-English is also made available to allow for training in both directions. However, the test data is ONLY available for English-Polish.
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'test' | 1 |
'train' | 129255 |
'validation' | 10001 |
- Features:
{
  "translation": {
    "languages": [
      "pl",
      "en"
    ],
    "id": null,
    "_type": "Translation"
  }
}
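The pl-en `test` split lists a single example, which is consistent with the note that test data is only available for English-Polish. The sketch below shows one way to guard against evaluating on such a split. The example counts are copied from the split tables on this page. The helper names (`tfds_name`, `eval_split`, `load_eval`) and the `min_examples` threshold of 100 are hypothetical choices for illustration, not part of TFDS or of the dataset.

```python
# Example counts copied from the split tables on this page.
SPLIT_SIZES = {
    "ru-pl": {"train": 20001, "validation": 3001, "test": 2969},
    "en-pl": {"train": 129255, "validation": 10001, "test": 9845},
    "pl-ru": {"train": 20001, "validation": 3001, "test": 2967},
    "pl-en": {"train": 129255, "validation": 10001, "test": 1},
}


def tfds_name(config: str) -> str:
    """Build the TFDS name string for one configuration listed on this page."""
    if config not in SPLIT_SIZES:
        raise ValueError(f"unknown config {config!r}; expected one of {sorted(SPLIT_SIZES)}")
    return f"huggingface:poleval2019_mt/{config}"


def eval_split(config: str, min_examples: int = 100) -> str:
    """Pick 'test' when it has at least min_examples examples, else 'validation'."""
    # Assumption: min_examples is an arbitrary illustrative threshold.
    sizes = SPLIT_SIZES[config]
    return "test" if sizes["test"] >= min_examples else "validation"


def load_eval(config: str):
    """Load the chosen evaluation split for a configuration."""
    # Imported lazily so the pure helpers above work without the package installed.
    import tensorflow_datasets as tfds
    return tfds.load(tfds_name(config), split=eval_split(config))
```

With these counts, `eval_split("pl-en")` falls back to `"validation"`, while the other three configurations keep `"test"`.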