ポールヴァル2019_mt

参考文献:

るーぷる

次のコマンドを使用して、このデータセットを TFDS にロードします。

ds = tfds.load('huggingface:poleval2019_mt/ru-pl')
  • 説明
PolEval is a SemEval-inspired evaluation campaign for natural language processing tools for Polish.Submitted solutions compete against one another within certain tasks selected by organizers, using available data and are evaluated according topre-established procedures. One of the tasks in PolEval-2019 was Machine Translation (Task-4).
The task is to train as good as possible machine translation system, using any technology,with limited textual resources.The competition will be done for 2 language pairs, more popular English-Polish (into Polish direction) and pair that can be called low resourcedRussian-Polish (in both directions).

Here, Polish-English is also made available to allow for training in both directions. However, the test data is ONLY available for English-Polish.
  • ライセンス: 不明なライセンス
  • バージョン: 1.0.0
  • 分割:
スプリット
'test' 2969
'train' 20001
'validation' 3001
  • 特徴
{
    "translation": {
        "languages": [
            "ru",
            "pl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

英語

次のコマンドを使用して、このデータセットを TFDS にロードします。

ds = tfds.load('huggingface:poleval2019_mt/en-pl')
  • 説明
PolEval is a SemEval-inspired evaluation campaign for natural language processing tools for Polish.Submitted solutions compete against one another within certain tasks selected by organizers, using available data and are evaluated according topre-established procedures. One of the tasks in PolEval-2019 was Machine Translation (Task-4).
The task is to train as good as possible machine translation system, using any technology,with limited textual resources.The competition will be done for 2 language pairs, more popular English-Polish (into Polish direction) and pair that can be called low resourcedRussian-Polish (in both directions).

Here, Polish-English is also made available to allow for training in both directions. However, the test data is ONLY available for English-Polish.
  • ライセンス: 不明なライセンス
  • バージョン: 1.0.0
  • 分割:
スプリット
'test' 9845
'train' 129255
'validation' 10001
  • 特徴
{
    "translation": {
        "languages": [
            "en",
            "pl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

プルル

次のコマンドを使用して、このデータセットを TFDS にロードします。

ds = tfds.load('huggingface:poleval2019_mt/pl-ru')
  • 説明
PolEval is a SemEval-inspired evaluation campaign for natural language processing tools for Polish.Submitted solutions compete against one another within certain tasks selected by organizers, using available data and are evaluated according topre-established procedures. One of the tasks in PolEval-2019 was Machine Translation (Task-4).
The task is to train as good as possible machine translation system, using any technology,with limited textual resources.The competition will be done for 2 language pairs, more popular English-Polish (into Polish direction) and pair that can be called low resourcedRussian-Polish (in both directions).

Here, Polish-English is also made available to allow for training in both directions. However, the test data is ONLY available for English-Polish.
  • ライセンス: 不明なライセンス
  • バージョン: 1.0.0
  • 分割:
スプリット
'test' 2967
'train' 20001
'validation' 3001
  • 特徴
{
    "translation": {
        "languages": [
            "pl",
            "ru"
        ],
        "id": null,
        "_type": "Translation"
    }
}

プリエン

次のコマンドを使用して、このデータセットを TFDS にロードします。

ds = tfds.load('huggingface:poleval2019_mt/pl-en')
  • 説明
PolEval is a SemEval-inspired evaluation campaign for natural language processing tools for Polish.Submitted solutions compete against one another within certain tasks selected by organizers, using available data and are evaluated according topre-established procedures. One of the tasks in PolEval-2019 was Machine Translation (Task-4).
The task is to train as good as possible machine translation system, using any technology,with limited textual resources.The competition will be done for 2 language pairs, more popular English-Polish (into Polish direction) and pair that can be called low resourcedRussian-Polish (in both directions).

Here, Polish-English is also made available to allow for training in both directions. However, the test data is ONLY available for English-Polish.
  • ライセンス: 不明なライセンス
  • バージョン: 1.0.0
  • 分割:
スプリット
'test' 1
'train' 129255
'validation' 10001
  • 特徴
{
    "translation": {
        "languages": [
            "pl",
            "en"
        ],
        "id": null,
        "_type": "Translation"
    }
}