xtreme_pos

  • Description:

Universal Dependencies (UD) is a framework for consistent annotation of grammar (parts of speech, morphological features, and syntactic dependencies) across different human languages. UD is an open community effort with over 200 contributors producing more than 100 treebanks in over 70 languages. If you’re new to UD, you should start by reading the first part of the Short Introduction and then browsing the annotation guidelines.

  • Feature structure:

FeaturesDict({
    'tokens': Sequence(Text(shape=(), dtype=string)),
    'upos': Sequence(ClassLabel(shape=(), dtype=int64, num_classes=18)),
})
  • Feature documentation:
Feature  Class                 Shape    Dtype   Description
         FeaturesDict
tokens   Sequence(Text)        (None,)  string
upos     Sequence(ClassLabel)  (None,)  int64
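
As a rough illustration of the feature structure above, each example pairs a sequence of token strings with a sequence of integer `upos` class IDs drawn from 18 classes. The sketch below checks that layout on a plain Python dict. The helper name `check_example` and the sample values are hypothetical, and the one-ID-per-token alignment is the usual POS-tagging convention, assumed here rather than stated on this page. When loaded through TFDS, tokens may arrive as byte strings, in which case the string check would need adapting.

```python
NUM_UPOS_CLASSES = 18  # num_classes from the FeaturesDict above


def check_example(example):
    """Return True if `example` matches the assumed tokens/upos layout.

    Assumption (not stated on this page): exactly one upos ID per token.
    """
    tokens = example["tokens"]
    upos = example["upos"]
    if len(tokens) != len(upos):
        return False
    if not all(isinstance(t, str) for t in tokens):
        return False
    # Every class ID must fall inside the ClassLabel range [0, 18).
    return all(isinstance(u, int) and 0 <= u < NUM_UPOS_CLASSES for u in upos)


# Made-up sample: the IDs are placeholders, not real label assignments.
sample = {"tokens": ["Hello", "world", "."], "upos": [5, 7, 12]}
print(check_example(sample))
```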
  • Citation:

@article{nivre2018universal,
  title={Universal Dependencies 2.2},
  author={Nivre, Joakim and Abrams, Mitchell and Agi{\'c}, {\v{Z}}eljko
  and Ahrenberg, Lars and Antonsen, Lene and Aranzabe, Maria Jesus and
  Arutie, Gashaw and Asahara, Masayuki and Ateyah, Luma and Attia,
  Mohammed and others},
  year={2018}
}
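
The configs on this page are named `xtreme_pos/xtreme_pos_<lang>`, where `<lang>` is a two-letter language code such as `af` or `zh`. A minimal sketch of building that name and passing it to `tfds.load` follows. The helper names `config_name` and `load_split` are hypothetical. It assumes `tensorflow_datasets` is installed and the data can be downloaded, and that the split names (`'train'`, `'dev'`, `'test'`) are passed exactly as listed for each config.

```python
def config_name(lang):
    """Build the config name for a language code, e.g. 'af'."""
    return f"xtreme_pos/xtreme_pos_{lang}"


def load_split(lang, split="train"):
    """Load one split via TFDS (assumes the package and data are available)."""
    # Imported lazily so config_name() works without tensorflow_datasets.
    import tensorflow_datasets as tfds

    return tfds.load(config_name(lang), split=split)


print(config_name("af"))  # xtreme_pos/xtreme_pos_af
```

Not every config provides every split, so a call such as `load_split("ta", "train")` would be expected to fail. Check the split table for the config first.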

xtreme_pos/xtreme_pos_af (default config)

  • Dataset size: 445.94 KiB

  • Splits:

Split Examples
'dev' 194
'test' 425
'train' 1,315

xtreme_pos/xtreme_pos_ar

  • Dataset size: 3.35 MiB

  • Splits:

Split Examples
'dev' 909
'test' 1,680
'train' 6,075

xtreme_pos/xtreme_pos_bg

  • Dataset size: 2.14 MiB

  • Splits:

Split Examples
'dev' 1,115
'test' 1,116
'train' 8,907

xtreme_pos/xtreme_pos_de

  • Dataset size: 37.62 MiB

  • Splits:

Split Examples
'dev' 19,233
'test' 22,458
'train' 166,849

xtreme_pos/xtreme_pos_el

  • Dataset size: 7.17 MiB

  • Splits:

Split Examples
'dev' 2,559
'test' 2,809
'train' 28,152

xtreme_pos/xtreme_pos_en

  • Dataset size: 4.67 MiB

  • Splits:

Split Examples
'dev' 4,699
'test' 6,165
'train' 26,825

xtreme_pos/xtreme_pos_es

  • Dataset size: 8.26 MiB

  • Splits:

Split Examples
'dev' 3,054
'test' 3,147
'train' 28,492

xtreme_pos/xtreme_pos_et

  • Dataset size: 4.84 MiB

  • Splits:

Split Examples
'dev' 3,125
'test' 3,760
'train' 25,749

xtreme_pos/xtreme_pos_eu

  • Dataset size: 1.27 MiB

  • Splits:

Split Examples
'dev' 1,798
'test' 1,799
'train' 5,396

xtreme_pos/xtreme_pos_fa

  • Dataset size: 1.73 MiB

  • Splits:

Split Examples
'dev' 599
'test' 600
'train' 4,798

xtreme_pos/xtreme_pos_fi

  • Dataset size: 4.48 MiB

  • Splits:

Split Examples
'dev' 3,239
'test' 4,422
'train' 27,198

xtreme_pos/xtreme_pos_fr

  • Dataset size: 7.28 MiB

  • Splits:

Split Examples
'dev' 5,979
'test' 9,465
'train' 47,308

xtreme_pos/xtreme_pos_he

  • Dataset size: 1.57 MiB

  • Splits:

Split Examples
'dev' 484
'test' 491
'train' 5,241

xtreme_pos/xtreme_pos_hi

  • Dataset size: 5.78 MiB

  • Splits:

Split Examples
'dev' 1,884
'test' 2,909
'train' 14,752

xtreme_pos/xtreme_pos_hu

  • Dataset size: 438.07 KiB

  • Splits:

Split Examples
'dev' 441
'test' 449
'train' 910

xtreme_pos/xtreme_pos_id

  • Dataset size: 1.31 MiB

  • Splits:

Split Examples
'dev' 559
'test' 1,557
'train' 4,477

xtreme_pos/xtreme_pos_it

  • Dataset size: 6.85 MiB

  • Splits:

Split Examples
'dev' 2,278
'test' 3,518
'train' 29,685

xtreme_pos/xtreme_pos_ja

  • Dataset size: 3.57 MiB

  • Splits:

Split Examples
'dev' 8,938
'test' 10,253
'train' 47,926

xtreme_pos/xtreme_pos_kk

  • Dataset size: 167.15 KiB

  • Splits:

Split Examples
'test' 1,047
'train' 31

xtreme_pos/xtreme_pos_ko

  • Dataset size: 5.82 MiB

  • Splits:

Split Examples
'dev' 3,016
'test' 4,276
'train' 27,410

xtreme_pos/xtreme_pos_mr

  • Dataset size: 56.14 KiB

  • Splits:

Split Examples
'dev' 46
'test' 47
'train' 373

xtreme_pos/xtreme_pos_nl

  • Dataset size: 2.90 MiB

  • Splits:

Split Examples
'dev' 1,394
'test' 1,471
'train' 18,051

xtreme_pos/xtreme_pos_pt

  • Dataset size: 4.65 MiB

  • Splits:

Split Examples
'dev' 1,770
'test' 2,681
'train' 17,992

xtreme_pos/xtreme_pos_ru

  • Dataset size: 20.25 MiB

  • Splits:

Split Examples
'dev' 9,960
'test' 11,336
'train' 67,435

xtreme_pos/xtreme_pos_ta

  • Dataset size: 3.65 KiB

  • Splits:

Split Examples
'test' 55

xtreme_pos/xtreme_pos_te

  • Dataset size: 143.77 KiB

  • Splits:

Split Examples
'dev' 131
'test' 146
'train' 1,051

xtreme_pos/xtreme_pos_th

  • Dataset size: 377.24 KiB

  • Splits:

Split Examples
'test' 1,000

xtreme_pos/xtreme_pos_tl

  • Dataset size: 228.78 KiB

  • Splits:

Split Examples
'dev' 80
'test' 120
'train' 400

xtreme_pos/xtreme_pos_tr

  • Dataset size: 1.06 MiB

  • Splits:

Split Examples
'dev' 988
'test' 4,785
'train' 3,664

xtreme_pos/xtreme_pos_ur

  • Dataset size: 1.50 MiB

  • Splits:

Split Examples
'dev' 552
'test' 535
'train' 4,043

xtreme_pos/xtreme_pos_vi

  • Dataset size: 454.32 KiB

  • Splits:

Split Examples
'dev' 800
'test' 800
'train' 1,400

xtreme_pos/xtreme_pos_yo

  • Dataset size: 22.65 KiB

  • Splits:

Split Examples
'test' 100

xtreme_pos/xtreme_pos_zh

  • Dataset size: 3.29 MiB

  • Splits:

Split Examples
'dev' 3,038
'test' 5,528
'train' 18,998
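
Several configs above lack a train or dev split, which matters when choosing languages for training versus evaluation only. The sketch below copies a subset of the split sizes from the tables on this page and lists the configs that are test-only or that lack a given split. The names `SPLITS`, `configs_with_only_test`, and `configs_missing` are hypothetical, and the dict is deliberately partial.

```python
# Split sizes copied from the tables on this page (subset of configs only).
SPLITS = {
    "af": {"dev": 194, "test": 425, "train": 1315},
    "kk": {"test": 1047, "train": 31},
    "ta": {"test": 55},
    "th": {"test": 1000},
    "yo": {"test": 100},
    "zh": {"dev": 3038, "test": 5528, "train": 18998},
}


def configs_with_only_test(splits):
    """Return language codes whose only split is 'test'."""
    return sorted(lang for lang, s in splits.items() if set(s) == {"test"})


def configs_missing(splits, name):
    """Return language codes that do not provide the split `name`."""
    return sorted(lang for lang, s in splits.items() if name not in s)


print(configs_with_only_test(SPLITS))   # ['ta', 'th', 'yo']
print(configs_missing(SPLITS, "dev"))   # ['kk', 'ta', 'th', 'yo']
```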