- Description:
Universal Dependencies (UD) is a framework for consistent annotation of grammar (parts of speech, morphological features, and syntactic dependencies) across different human languages. UD is an open community effort with over 200 contributors producing more than 100 treebanks in over 70 languages. If you’re new to UD, you should start by reading the first part of the Short Introduction and then browsing the annotation guidelines.
Homepage: https://universaldependencies.org/
Source code:
tfds.datasets.xtreme_pos.Builder
Versions:
1.0.0
(default): Initial release.
Download size:
338.76 MiB
Auto-cached (documentation): Yes
Feature structure:
FeaturesDict({
'tokens': Sequence(Text(shape=(), dtype=string)),
'upos': Sequence(ClassLabel(shape=(), dtype=int64, num_classes=18)),
})
- Feature documentation:
Feature | Class | Shape | Dtype | Description |
---|---|---|---|---|
FeaturesDict | ||||
tokens | Sequence(Text) | (None,) | string | |
upos | Sequence(ClassLabel) | (None,) | int64 |
Supervised keys (See
as_supervised
doc):None
Figure (tfds.show_examples): Not supported.
Citation:
@article{nivre2018universal,
title={Universal Dependencies 2.2},
author={Nivre, Joakim and Abrams, Mitchell and Agi{'c}, {
{Z} }eljko
and Ahrenberg, Lars and Antonsen, Lene and Aranzabe, Maria Jesus and
Arutie, Gashaw and Asahara, Masayuki and Ateyah, Luma and Attia,
Mohammed and others},
year={2018}
}
xtreme_pos/xtreme_pos_af (default config)
Dataset size:
445.94 KiB
Splits:
Split | Examples |
---|---|
'dev' |
194 |
'test' |
425 |
'train' |
1,315 |
- Examples (tfds.as_dataframe):
xtreme_pos/xtreme_pos_ar
Dataset size:
3.35 MiB
Splits:
Split | Examples |
---|---|
'dev' |
909 |
'test' |
1,680 |
'train' |
6,075 |
- Examples (tfds.as_dataframe):
xtreme_pos/xtreme_pos_bg
Dataset size:
2.14 MiB
Splits:
Split | Examples |
---|---|
'dev' |
1,115 |
'test' |
1,116 |
'train' |
8,907 |
- Examples (tfds.as_dataframe):
xtreme_pos/xtreme_pos_de
Dataset size:
37.62 MiB
Splits:
Split | Examples |
---|---|
'dev' |
19,233 |
'test' |
22,458 |
'train' |
166,849 |
- Examples (tfds.as_dataframe):
xtreme_pos/xtreme_pos_el
Dataset size:
7.17 MiB
Splits:
Split | Examples |
---|---|
'dev' |
2,559 |
'test' |
2,809 |
'train' |
28,152 |
- Examples (tfds.as_dataframe):
xtreme_pos/xtreme_pos_en
Dataset size:
4.67 MiB
Splits:
Split | Examples |
---|---|
'dev' |
4,699 |
'test' |
6,165 |
'train' |
26,825 |
- Examples (tfds.as_dataframe):
xtreme_pos/xtreme_pos_es
Dataset size:
8.26 MiB
Splits:
Split | Examples |
---|---|
'dev' |
3,054 |
'test' |
3,147 |
'train' |
28,492 |
- Examples (tfds.as_dataframe):
xtreme_pos/xtreme_pos_et
Dataset size:
4.84 MiB
Splits:
Split | Examples |
---|---|
'dev' |
3,125 |
'test' |
3,760 |
'train' |
25,749 |
- Examples (tfds.as_dataframe):
xtreme_pos/xtreme_pos_eu
Dataset size:
1.27 MiB
Splits:
Split | Examples |
---|---|
'dev' |
1,798 |
'test' |
1,799 |
'train' |
5,396 |
- Examples (tfds.as_dataframe):
xtreme_pos/xtreme_pos_fa
Dataset size:
1.73 MiB
Splits:
Split | Examples |
---|---|
'dev' |
599 |
'test' |
600 |
'train' |
4,798 |
- Examples (tfds.as_dataframe):
xtreme_pos/xtreme_pos_fi
Dataset size:
4.48 MiB
Splits:
Split | Examples |
---|---|
'dev' |
3,239 |
'test' |
4,422 |
'train' |
27,198 |
- Examples (tfds.as_dataframe):
xtreme_pos/xtreme_pos_fr
Dataset size:
7.28 MiB
Splits:
Split | Examples |
---|---|
'dev' |
5,979 |
'test' |
9,465 |
'train' |
47,308 |
- Examples (tfds.as_dataframe):
xtreme_pos/xtreme_pos_he
Dataset size:
1.57 MiB
Splits:
Split | Examples |
---|---|
'dev' |
484 |
'test' |
491 |
'train' |
5,241 |
- Examples (tfds.as_dataframe):
xtreme_pos/xtreme_pos_hi
Dataset size:
5.78 MiB
Splits:
Split | Examples |
---|---|
'dev' |
1,884 |
'test' |
2,909 |
'train' |
14,752 |
- Examples (tfds.as_dataframe):
xtreme_pos/xtreme_pos_hu
Dataset size:
438.07 KiB
Splits:
Split | Examples |
---|---|
'dev' |
441 |
'test' |
449 |
'train' |
910 |
- Examples (tfds.as_dataframe):
xtreme_pos/xtreme_pos_id
Dataset size:
1.31 MiB
Splits:
Split | Examples |
---|---|
'dev' |
559 |
'test' |
1,557 |
'train' |
4,477 |
- Examples (tfds.as_dataframe):
xtreme_pos/xtreme_pos_it
Dataset size:
6.85 MiB
Splits:
Split | Examples |
---|---|
'dev' |
2,278 |
'test' |
3,518 |
'train' |
29,685 |
- Examples (tfds.as_dataframe):
xtreme_pos/xtreme_pos_ja
Dataset size:
3.57 MiB
Splits:
Split | Examples |
---|---|
'dev' |
8,938 |
'test' |
10,253 |
'train' |
47,926 |
- Examples (tfds.as_dataframe):
xtreme_pos/xtreme_pos_kk
Dataset size:
167.15 KiB
Splits:
Split | Examples |
---|---|
'test' |
1,047 |
'train' |
31 |
- Examples (tfds.as_dataframe):
xtreme_pos/xtreme_pos_ko
Dataset size:
5.82 MiB
Splits:
Split | Examples |
---|---|
'dev' |
3,016 |
'test' |
4,276 |
'train' |
27,410 |
- Examples (tfds.as_dataframe):
xtreme_pos/xtreme_pos_mr
Dataset size:
56.14 KiB
Splits:
Split | Examples |
---|---|
'dev' |
46 |
'test' |
47 |
'train' |
373 |
- Examples (tfds.as_dataframe):
xtreme_pos/xtreme_pos_nl
Dataset size:
2.90 MiB
Splits:
Split | Examples |
---|---|
'dev' |
1,394 |
'test' |
1,471 |
'train' |
18,051 |
- Examples (tfds.as_dataframe):
xtreme_pos/xtreme_pos_pt
Dataset size:
4.65 MiB
Splits:
Split | Examples |
---|---|
'dev' |
1,770 |
'test' |
2,681 |
'train' |
17,992 |
- Examples (tfds.as_dataframe):
xtreme_pos/xtreme_pos_ru
Dataset size:
20.25 MiB
Splits:
Split | Examples |
---|---|
'dev' |
9,960 |
'test' |
11,336 |
'train' |
67,435 |
- Examples (tfds.as_dataframe):
xtreme_pos/xtreme_pos_ta
Dataset size:
3.65 KiB
Splits:
Split | Examples |
---|---|
'test' |
55 |
- Examples (tfds.as_dataframe):
xtreme_pos/xtreme_pos_te
Dataset size:
143.77 KiB
Splits:
Split | Examples |
---|---|
'dev' |
131 |
'test' |
146 |
'train' |
1,051 |
- Examples (tfds.as_dataframe):
xtreme_pos/xtreme_pos_th
Dataset size:
377.24 KiB
Splits:
Split | Examples |
---|---|
'test' |
1,000 |
- Examples (tfds.as_dataframe):
xtreme_pos/xtreme_pos_tl
Dataset size:
228.78 KiB
Splits:
Split | Examples |
---|---|
'dev' |
80 |
'test' |
120 |
'train' |
400 |
- Examples (tfds.as_dataframe):
xtreme_pos/xtreme_pos_tr
Dataset size:
1.06 MiB
Splits:
Split | Examples |
---|---|
'dev' |
988 |
'test' |
4,785 |
'train' |
3,664 |
- Examples (tfds.as_dataframe):
xtreme_pos/xtreme_pos_ur
Dataset size:
1.50 MiB
Splits:
Split | Examples |
---|---|
'dev' |
552 |
'test' |
535 |
'train' |
4,043 |
- Examples (tfds.as_dataframe):
xtreme_pos/xtreme_pos_vi
Dataset size:
454.32 KiB
Splits:
Split | Examples |
---|---|
'dev' |
800 |
'test' |
800 |
'train' |
1,400 |
- Examples (tfds.as_dataframe):
xtreme_pos/xtreme_pos_yo
Dataset size:
22.65 KiB
Splits:
Split | Examples |
---|---|
'test' |
100 |
- Examples (tfds.as_dataframe):
xtreme_pos/xtreme_pos_zh
Dataset size:
3.29 MiB
Splits:
Split | Examples |
---|---|
'dev' |
3,038 |
'test' |
5,528 |
'train' |
18,998 |
- Examples (tfds.as_dataframe):