Ссылки:
адъюнкт_остров
Используйте следующую команду, чтобы загрузить этот набор данных в TFDS:
ds = tfds.load('huggingface:blimp/adjunct_island')
- Описание :
BLiMP is a challenge set for evaluating what language models (LMs) know about
major grammatical phenomena in English. BLiMP consists of 67 sub-datasets, each
containing 1000 minimal pairs isolating specific contrasts in syntax,
morphology, or semantics. The data is automatically generated according to
expert-crafted grammars.
- Лицензия : Нет известной лицензии.
- Версия : 0.1.0
- Расколы :
Расколоть | Примеры |
---|---|
'train' | 1000 |
- Функции :
{
"sentence_good": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"sentence_bad": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"field": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"linguistics_term": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"UID": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"simple_LM_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"one_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"two_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"lexically_identical": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"pair_id": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
anaphor_gender_agreement
Используйте следующую команду, чтобы загрузить этот набор данных в TFDS:
ds = tfds.load('huggingface:blimp/anaphor_gender_agreement')
- Описание :
BLiMP is a challenge set for evaluating what language models (LMs) know about
major grammatical phenomena in English. BLiMP consists of 67 sub-datasets, each
containing 1000 minimal pairs isolating specific contrasts in syntax,
morphology, or semantics. The data is automatically generated according to
expert-crafted grammars.
- Лицензия : Нет известной лицензии.
- Версия : 0.1.0
- Расколы :
Расколоть | Примеры |
---|---|
'train' | 1000 |
- Функции :
{
"sentence_good": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"sentence_bad": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"field": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"linguistics_term": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"UID": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"simple_LM_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"one_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"two_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"lexically_identical": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"pair_id": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
anaphor_number_agreement
Используйте следующую команду, чтобы загрузить этот набор данных в TFDS:
ds = tfds.load('huggingface:blimp/anaphor_number_agreement')
- Описание :
BLiMP is a challenge set for evaluating what language models (LMs) know about
major grammatical phenomena in English. BLiMP consists of 67 sub-datasets, each
containing 1000 minimal pairs isolating specific contrasts in syntax,
morphology, or semantics. The data is automatically generated according to
expert-crafted grammars.
- Лицензия : Нет известной лицензии.
- Версия : 0.1.0
- Расколы :
Расколоть | Примеры |
---|---|
'train' | 1000 |
- Функции :
{
"sentence_good": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"sentence_bad": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"field": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"linguistics_term": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"UID": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"simple_LM_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"one_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"two_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"lexically_identical": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"pair_id": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
animate_subject_passive
Используйте следующую команду, чтобы загрузить этот набор данных в TFDS:
ds = tfds.load('huggingface:blimp/animate_subject_passive')
- Описание :
BLiMP is a challenge set for evaluating what language models (LMs) know about
major grammatical phenomena in English. BLiMP consists of 67 sub-datasets, each
containing 1000 minimal pairs isolating specific contrasts in syntax,
morphology, or semantics. The data is automatically generated according to
expert-crafted grammars.
- Лицензия : Нет известной лицензии.
- Версия : 0.1.0
- Расколы :
Расколоть | Примеры |
---|---|
'train' | 1000 |
- Функции :
{
"sentence_good": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"sentence_bad": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"field": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"linguistics_term": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"UID": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"simple_LM_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"one_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"two_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"lexically_identical": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"pair_id": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
animate_subject_trans
Используйте следующую команду, чтобы загрузить этот набор данных в TFDS:
ds = tfds.load('huggingface:blimp/animate_subject_trans')
- Описание :
BLiMP is a challenge set for evaluating what language models (LMs) know about
major grammatical phenomena in English. BLiMP consists of 67 sub-datasets, each
containing 1000 minimal pairs isolating specific contrasts in syntax,
morphology, or semantics. The data is automatically generated according to
expert-crafted grammars.
- Лицензия : Нет известной лицензии.
- Версия : 0.1.0
- Расколы :
Расколоть | Примеры |
---|---|
'train' | 1000 |
- Функции :
{
"sentence_good": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"sentence_bad": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"field": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"linguistics_term": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"UID": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"simple_LM_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"one_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"two_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"lexically_identical": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"pair_id": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
причинный
Используйте следующую команду, чтобы загрузить этот набор данных в TFDS:
ds = tfds.load('huggingface:blimp/causative')
- Описание :
BLiMP is a challenge set for evaluating what language models (LMs) know about
major grammatical phenomena in English. BLiMP consists of 67 sub-datasets, each
containing 1000 minimal pairs isolating specific contrasts in syntax,
morphology, or semantics. The data is automatically generated according to
expert-crafted grammars.
- Лицензия : Нет известной лицензии.
- Версия : 0.1.0
- Расколы :
Расколоть | Примеры |
---|---|
'train' | 1000 |
- Функции :
{
"sentence_good": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"sentence_bad": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"field": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"linguistics_term": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"UID": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"simple_LM_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"one_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"two_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"lexically_identical": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"pair_id": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
complex_NP_island
Используйте следующую команду, чтобы загрузить этот набор данных в TFDS:
ds = tfds.load('huggingface:blimp/complex_NP_island')
- Описание :
BLiMP is a challenge set for evaluating what language models (LMs) know about
major grammatical phenomena in English. BLiMP consists of 67 sub-datasets, each
containing 1000 minimal pairs isolating specific contrasts in syntax,
morphology, or semantics. The data is automatically generated according to
expert-crafted grammars.
- Лицензия : Нет известной лицензии.
- Версия : 0.1.0
- Расколы :
Расколоть | Примеры |
---|---|
'train' | 1000 |
- Функции :
{
"sentence_good": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"sentence_bad": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"field": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"linguistics_term": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"UID": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"simple_LM_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"one_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"two_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"lexically_identical": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"pair_id": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
координата_структура_constraint_complex_left_branch
Используйте следующую команду, чтобы загрузить этот набор данных в TFDS:
ds = tfds.load('huggingface:blimp/coordinate_structure_constraint_complex_left_branch')
- Описание :
BLiMP is a challenge set for evaluating what language models (LMs) know about
major grammatical phenomena in English. BLiMP consists of 67 sub-datasets, each
containing 1000 minimal pairs isolating specific contrasts in syntax,
morphology, or semantics. The data is automatically generated according to
expert-crafted grammars.
- Лицензия : Нет известной лицензии.
- Версия : 0.1.0
- Расколы :
Расколоть | Примеры |
---|---|
'train' | 1000 |
- Функции :
{
"sentence_good": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"sentence_bad": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"field": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"linguistics_term": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"UID": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"simple_LM_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"one_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"two_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"lexically_identical": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"pair_id": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
координата_structure_constraint_object_extraction
Используйте следующую команду, чтобы загрузить этот набор данных в TFDS:
ds = tfds.load('huggingface:blimp/coordinate_structure_constraint_object_extraction')
- Описание :
BLiMP is a challenge set for evaluating what language models (LMs) know about
major grammatical phenomena in English. BLiMP consists of 67 sub-datasets, each
containing 1000 minimal pairs isolating specific contrasts in syntax,
morphology, or semantics. The data is automatically generated according to
expert-crafted grammars.
- Лицензия : Нет известной лицензии.
- Версия : 0.1.0
- Расколы :
Расколоть | Примеры |
---|---|
'train' | 1000 |
- Функции :
{
"sentence_good": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"sentence_bad": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"field": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"linguistics_term": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"UID": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"simple_LM_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"one_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"two_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"lexically_identical": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"pair_id": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
definer_noun_agreement_1
Используйте следующую команду, чтобы загрузить этот набор данных в TFDS:
ds = tfds.load('huggingface:blimp/determiner_noun_agreement_1')
- Описание :
BLiMP is a challenge set for evaluating what language models (LMs) know about
major grammatical phenomena in English. BLiMP consists of 67 sub-datasets, each
containing 1000 minimal pairs isolating specific contrasts in syntax,
morphology, or semantics. The data is automatically generated according to
expert-crafted grammars.
- Лицензия : Нет известной лицензии.
- Версия : 0.1.0
- Расколы :
Расколоть | Примеры |
---|---|
'train' | 1000 |
- Функции :
{
"sentence_good": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"sentence_bad": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"field": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"linguistics_term": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"UID": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"simple_LM_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"one_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"two_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"lexically_identical": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"pair_id": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
definer_noun_agreement_2
Используйте следующую команду, чтобы загрузить этот набор данных в TFDS:
ds = tfds.load('huggingface:blimp/determiner_noun_agreement_2')
- Описание :
BLiMP is a challenge set for evaluating what language models (LMs) know about
major grammatical phenomena in English. BLiMP consists of 67 sub-datasets, each
containing 1000 minimal pairs isolating specific contrasts in syntax,
morphology, or semantics. The data is automatically generated according to
expert-crafted grammars.
- Лицензия : Нет известной лицензии.
- Версия : 0.1.0
- Расколы :
Расколоть | Примеры |
---|---|
'train' | 1000 |
- Функции :
{
"sentence_good": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"sentence_bad": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"field": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"linguistics_term": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"UID": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"simple_LM_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"one_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"two_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"lexically_identical": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"pair_id": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
definer_noun_agreement_irregular_1
Используйте следующую команду, чтобы загрузить этот набор данных в TFDS:
ds = tfds.load('huggingface:blimp/determiner_noun_agreement_irregular_1')
- Описание :
BLiMP is a challenge set for evaluating what language models (LMs) know about
major grammatical phenomena in English. BLiMP consists of 67 sub-datasets, each
containing 1000 minimal pairs isolating specific contrasts in syntax,
morphology, or semantics. The data is automatically generated according to
expert-crafted grammars.
- Лицензия : Нет известной лицензии.
- Версия : 0.1.0
- Расколы :
Расколоть | Примеры |
---|---|
'train' | 1000 |
- Функции :
{
"sentence_good": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"sentence_bad": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"field": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"linguistics_term": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"UID": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"simple_LM_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"one_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"two_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"lexically_identical": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"pair_id": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
definer_noun_agreement_irregular_2
Используйте следующую команду, чтобы загрузить этот набор данных в TFDS:
ds = tfds.load('huggingface:blimp/determiner_noun_agreement_irregular_2')
- Описание :
BLiMP is a challenge set for evaluating what language models (LMs) know about
major grammatical phenomena in English. BLiMP consists of 67 sub-datasets, each
containing 1000 minimal pairs isolating specific contrasts in syntax,
morphology, or semantics. The data is automatically generated according to
expert-crafted grammars.
- Лицензия : Нет известной лицензии.
- Версия : 0.1.0
- Расколы :
Расколоть | Примеры |
---|---|
'train' | 1000 |
- Функции :
{
"sentence_good": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"sentence_bad": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"field": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"linguistics_term": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"UID": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"simple_LM_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"one_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"two_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"lexically_identical": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"pair_id": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
definer_noun_agreement_with_adj_2
Используйте следующую команду, чтобы загрузить этот набор данных в TFDS:
ds = tfds.load('huggingface:blimp/determiner_noun_agreement_with_adj_2')
- Описание :
BLiMP is a challenge set for evaluating what language models (LMs) know about
major grammatical phenomena in English. BLiMP consists of 67 sub-datasets, each
containing 1000 minimal pairs isolating specific contrasts in syntax,
morphology, or semantics. The data is automatically generated according to
expert-crafted grammars.
- Лицензия : Нет известной лицензии.
- Версия : 0.1.0
- Расколы :
Расколоть | Примеры |
---|---|
'train' | 1000 |
- Функции :
{
"sentence_good": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"sentence_bad": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"field": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"linguistics_term": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"UID": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"simple_LM_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"one_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"two_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"lexically_identical": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"pair_id": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
definer_noun_agreement_with_adj_irregular_1
Используйте следующую команду, чтобы загрузить этот набор данных в TFDS:
ds = tfds.load('huggingface:blimp/determiner_noun_agreement_with_adj_irregular_1')
- Описание :
BLiMP is a challenge set for evaluating what language models (LMs) know about
major grammatical phenomena in English. BLiMP consists of 67 sub-datasets, each
containing 1000 minimal pairs isolating specific contrasts in syntax,
morphology, or semantics. The data is automatically generated according to
expert-crafted grammars.
- Лицензия : Нет известной лицензии.
- Версия : 0.1.0
- Расколы :
Расколоть | Примеры |
---|---|
'train' | 1000 |
- Функции :
{
"sentence_good": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"sentence_bad": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"field": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"linguistics_term": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"UID": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"simple_LM_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"one_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"two_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"lexically_identical": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"pair_id": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
definer_noun_agreement_with_adj_irregular_2
Используйте следующую команду, чтобы загрузить этот набор данных в TFDS:
ds = tfds.load('huggingface:blimp/determiner_noun_agreement_with_adj_irregular_2')
- Описание :
BLiMP is a challenge set for evaluating what language models (LMs) know about
major grammatical phenomena in English. BLiMP consists of 67 sub-datasets, each
containing 1000 minimal pairs isolating specific contrasts in syntax,
morphology, or semantics. The data is automatically generated according to
expert-crafted grammars.
- Лицензия : Нет известной лицензии.
- Версия : 0.1.0
- Расколы :
Расколоть | Примеры |
---|---|
'train' | 1000 |
- Функции :
{
"sentence_good": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"sentence_bad": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"field": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"linguistics_term": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"UID": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"simple_LM_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"one_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"two_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"lexically_identical": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"pair_id": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
определитель_существительное_согласие_с_прилагательным_1
Используйте следующую команду, чтобы загрузить этот набор данных в TFDS:
ds = tfds.load('huggingface:blimp/determiner_noun_agreement_with_adjective_1')
- Описание :
BLiMP is a challenge set for evaluating what language models (LMs) know about
major grammatical phenomena in English. BLiMP consists of 67 sub-datasets, each
containing 1000 minimal pairs isolating specific contrasts in syntax,
morphology, or semantics. The data is automatically generated according to
expert-crafted grammars.
- Лицензия : Нет известной лицензии.
- Версия : 0.1.0
- Расколы :
Расколоть | Примеры |
---|---|
'train' | 1000 |
- Функции :
{
"sentence_good": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"sentence_bad": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"field": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"linguistics_term": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"UID": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"simple_LM_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"one_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"two_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"lexically_identical": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"pair_id": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
tractor_agreement_relational_noun
Используйте следующую команду, чтобы загрузить этот набор данных в TFDS:
ds = tfds.load('huggingface:blimp/distractor_agreement_relational_noun')
- Описание :
BLiMP is a challenge set for evaluating what language models (LMs) know about
major grammatical phenomena in English. BLiMP consists of 67 sub-datasets, each
containing 1000 minimal pairs isolating specific contrasts in syntax,
morphology, or semantics. The data is automatically generated according to
expert-crafted grammars.
- Лицензия : Нет известной лицензии.
- Версия : 0.1.0
- Расколы :
Расколоть | Примеры |
---|---|
'train' | 1000 |
- Функции :
{
"sentence_good": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"sentence_bad": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"field": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"linguistics_term": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"UID": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"simple_LM_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"one_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"two_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"lexically_identical": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"pair_id": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
tractor_agreement_relative_clause
Используйте следующую команду, чтобы загрузить этот набор данных в TFDS:
ds = tfds.load('huggingface:blimp/distractor_agreement_relative_clause')
- Описание :
BLiMP is a challenge set for evaluating what language models (LMs) know about
major grammatical phenomena in English. BLiMP consists of 67 sub-datasets, each
containing 1000 minimal pairs isolating specific contrasts in syntax,
morphology, or semantics. The data is automatically generated according to
expert-crafted grammars.
- Лицензия : Нет известной лицензии.
- Версия : 0.1.0
- Расколы :
Расколоть | Примеры |
---|---|
'train' | 1000 |
- Функции :
{
"sentence_good": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"sentence_bad": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"field": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"linguistics_term": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"UID": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"simple_LM_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"one_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"two_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"lexically_identical": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"pair_id": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
drop_argument
Используйте следующую команду, чтобы загрузить этот набор данных в TFDS:
ds = tfds.load('huggingface:blimp/drop_argument')
- Описание :
BLiMP is a challenge set for evaluating what language models (LMs) know about
major grammatical phenomena in English. BLiMP consists of 67 sub-datasets, each
containing 1000 minimal pairs isolating specific contrasts in syntax,
morphology, or semantics. The data is automatically generated according to
expert-crafted grammars.
- Лицензия : Нет известной лицензии.
- Версия : 0.1.0
- Расколы :
Расколоть | Примеры |
---|---|
'train' | 1000 |
- Функции :
{
"sentence_good": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"sentence_bad": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"field": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"linguistics_term": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"UID": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"simple_LM_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"one_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"two_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"lexically_identical": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"pair_id": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
ellipsis_n_bar_1
Используйте следующую команду, чтобы загрузить этот набор данных в TFDS:
ds = tfds.load('huggingface:blimp/ellipsis_n_bar_1')
- Описание :
BLiMP is a challenge set for evaluating what language models (LMs) know about
major grammatical phenomena in English. BLiMP consists of 67 sub-datasets, each
containing 1000 minimal pairs isolating specific contrasts in syntax,
morphology, or semantics. The data is automatically generated according to
expert-crafted grammars.
- Лицензия : Нет известной лицензии.
- Версия : 0.1.0
- Расколы :
Расколоть | Примеры |
---|---|
'train' | 1000 |
- Функции :
{
"sentence_good": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"sentence_bad": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"field": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"linguistics_term": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"UID": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"simple_LM_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"one_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"two_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"lexically_identical": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"pair_id": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
ellipsis_n_bar_2
Используйте следующую команду, чтобы загрузить этот набор данных в TFDS:
ds = tfds.load('huggingface:blimp/ellipsis_n_bar_2')
- Описание :
BLiMP is a challenge set for evaluating what language models (LMs) know about
major grammatical phenomena in English. BLiMP consists of 67 sub-datasets, each
containing 1000 minimal pairs isolating specific contrasts in syntax,
morphology, or semantics. The data is automatically generated according to
expert-crafted grammars.
- Лицензия : Нет известной лицензии.
- Версия : 0.1.0
- Расколы :
Расколоть | Примеры |
---|---|
'train' | 1000 |
- Функции :
{
"sentence_good": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"sentence_bad": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"field": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"linguistics_term": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"UID": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"simple_LM_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"one_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"two_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"lexically_identical": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"pair_id": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
экзистенциальный_there_object_raising
Используйте следующую команду, чтобы загрузить этот набор данных в TFDS:
ds = tfds.load('huggingface:blimp/existential_there_object_raising')
- Описание :
BLiMP is a challenge set for evaluating what language models (LMs) know about
major grammatical phenomena in English. BLiMP consists of 67 sub-datasets, each
containing 1000 minimal pairs isolating specific contrasts in syntax,
morphology, or semantics. The data is automatically generated according to
expert-crafted grammars.
- Лицензия : Нет известной лицензии.
- Версия : 0.1.0
- Расколы :
Расколоть | Примеры |
---|---|
'train' | 1000 |
- Функции :
{
"sentence_good": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"sentence_bad": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"field": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"linguistics_term": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"UID": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"simple_LM_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"one_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"two_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"lexically_identical": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"pair_id": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
экзистенциальные_там_квантификаторы_1
Используйте следующую команду, чтобы загрузить этот набор данных в TFDS:
ds = tfds.load('huggingface:blimp/existential_there_quantifiers_1')
- Описание :
BLiMP is a challenge set for evaluating what language models (LMs) know about
major grammatical phenomena in English. BLiMP consists of 67 sub-datasets, each
containing 1000 minimal pairs isolating specific contrasts in syntax,
morphology, or semantics. The data is automatically generated according to
expert-crafted grammars.
- Лицензия : Нет известной лицензии.
- Версия : 0.1.0
- Расколы :
Расколоть | Примеры |
---|---|
'train' | 1000 |
- Функции :
{
"sentence_good": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"sentence_bad": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"field": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"linguistics_term": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"UID": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"simple_LM_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"one_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"two_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"lexically_identical": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"pair_id": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
экзистенциальные_там_квантификаторы_2
Используйте следующую команду, чтобы загрузить этот набор данных в TFDS:
ds = tfds.load('huggingface:blimp/existential_there_quantifiers_2')
- Описание :
BLiMP is a challenge set for evaluating what language models (LMs) know about
major grammatical phenomena in English. BLiMP consists of 67 sub-datasets, each
containing 1000 minimal pairs isolating specific contrasts in syntax,
morphology, or semantics. The data is automatically generated according to
expert-crafted grammars.
- Лицензия : Нет известной лицензии.
- Версия : 0.1.0
- Расколы :
Расколоть | Примеры |
---|---|
'train' | 1000 |
- Функции :
{
"sentence_good": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"sentence_bad": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"field": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"linguistics_term": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"UID": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"simple_LM_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"one_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"two_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"lexically_identical": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"pair_id": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
экзистенциальный_there_subject_raising
Используйте следующую команду, чтобы загрузить этот набор данных в TFDS:
ds = tfds.load('huggingface:blimp/existential_there_subject_raising')
- Описание :
BLiMP is a challenge set for evaluating what language models (LMs) know about
major grammatical phenomena in English. BLiMP consists of 67 sub-datasets, each
containing 1000 minimal pairs isolating specific contrasts in syntax,
morphology, or semantics. The data is automatically generated according to
expert-crafted grammars.
- Лицензия : Нет известной лицензии.
- Версия : 0.1.0
- Расколы :
Расколоть | Примеры |
---|---|
'train' | 1000 |
- Функции :
{
"sentence_good": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"sentence_bad": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"field": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"linguistics_term": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"UID": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"simple_LM_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"one_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"two_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"lexically_identical": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"pair_id": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
expletive_it_object_raising
Используйте следующую команду, чтобы загрузить этот набор данных в TFDS:
ds = tfds.load('huggingface:blimp/expletive_it_object_raising')
- Описание :
BLiMP is a challenge set for evaluating what language models (LMs) know about
major grammatical phenomena in English. BLiMP consists of 67 sub-datasets, each
containing 1000 minimal pairs isolating specific contrasts in syntax,
morphology, or semantics. The data is automatically generated according to
expert-crafted grammars.
- Лицензия : Нет известной лицензии.
- Версия : 0.1.0
- Расколы :
Расколоть | Примеры |
---|---|
'train' | 1000 |
- Функции :
{
"sentence_good": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"sentence_bad": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"field": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"linguistics_term": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"UID": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"simple_LM_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"one_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"two_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"lexically_identical": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"pair_id": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
зачаточный
Используйте следующую команду, чтобы загрузить этот набор данных в TFDS:
ds = tfds.load('huggingface:blimp/inchoative')
- Описание :
BLiMP is a challenge set for evaluating what language models (LMs) know about
major grammatical phenomena in English. BLiMP consists of 67 sub-datasets, each
containing 1000 minimal pairs isolating specific contrasts in syntax,
morphology, or semantics. The data is automatically generated according to
expert-crafted grammars.
- Лицензия : Нет известной лицензии.
- Версия : 0.1.0
- Расколы :
Расколоть | Примеры |
---|---|
'train' | 1000 |
- Функции :
{
"sentence_good": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"sentence_bad": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"field": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"linguistics_term": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"UID": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"simple_LM_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"one_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"two_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"lexically_identical": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"pair_id": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
непереходный
Используйте следующую команду, чтобы загрузить этот набор данных в TFDS:
ds = tfds.load('huggingface:blimp/intransitive')
- Описание :
BLiMP is a challenge set for evaluating what language models (LMs) know about
major grammatical phenomena in English. BLiMP consists of 67 sub-datasets, each
containing 1000 minimal pairs isolating specific contrasts in syntax,
morphology, or semantics. The data is automatically generated according to
expert-crafted grammars.
- Лицензия : Нет известной лицензии.
- Версия : 0.1.0
- Расколы :
Расколоть | Примеры |
---|---|
'train' | 1000 |
- Функции :
{
"sentence_good": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"sentence_bad": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"field": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"linguistics_term": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"UID": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"simple_LM_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"one_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"two_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"lexically_identical": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"pair_id": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
нерегулярные_прошлые_партиципли_прилагательные
Используйте следующую команду, чтобы загрузить этот набор данных в TFDS:
ds = tfds.load('huggingface:blimp/irregular_past_participle_adjectives')
- Описание :
BLiMP is a challenge set for evaluating what language models (LMs) know about
major grammatical phenomena in English. BLiMP consists of 67 sub-datasets, each
containing 1000 minimal pairs isolating specific contrasts in syntax,
morphology, or semantics. The data is automatically generated according to
expert-crafted grammars.
- Лицензия : Нет известной лицензии.
- Версия : 0.1.0
- Расколы :
Расколоть | Примеры |
---|---|
'train' | 1000 |
- Функции :
{
"sentence_good": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"sentence_bad": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"field": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"linguistics_term": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"UID": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"simple_LM_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"one_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"two_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"lexically_identical": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"pair_id": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
неправильные_прошлые_партиципли_глаголы
Используйте следующую команду, чтобы загрузить этот набор данных в TFDS:
ds = tfds.load('huggingface:blimp/irregular_past_participle_verbs')
- Описание :
BLiMP is a challenge set for evaluating what language models (LMs) know about
major grammatical phenomena in English. BLiMP consists of 67 sub-datasets, each
containing 1000 minimal pairs isolating specific contrasts in syntax,
morphology, or semantics. The data is automatically generated according to
expert-crafted grammars.
- Лицензия : Нет известной лицензии.
- Версия : 0.1.0
- Расколы :
Расколоть | Примеры |
---|---|
'train' | 1000 |
- Функции :
{
"sentence_good": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"sentence_bad": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"field": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"linguistics_term": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"UID": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"simple_LM_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"one_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"two_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"lexically_identical": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"pair_id": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
неправильный_множественный_субъект_глагол_согласие_1
Используйте следующую команду, чтобы загрузить этот набор данных в TFDS:
ds = tfds.load('huggingface:blimp/irregular_plural_subject_verb_agreement_1')
- Описание :
BLiMP is a challenge set for evaluating what language models (LMs) know about
major grammatical phenomena in English. BLiMP consists of 67 sub-datasets, each
containing 1000 minimal pairs isolating specific contrasts in syntax,
morphology, or semantics. The data is automatically generated according to
expert-crafted grammars.
- Лицензия : Нет известной лицензии.
- Версия : 0.1.0
- Расколы :
Расколоть | Примеры |
---|---|
'train' | 1000 |
- Функции :
{
"sentence_good": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"sentence_bad": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"field": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"linguistics_term": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"UID": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"simple_LM_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"one_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"two_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"lexically_identical": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"pair_id": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
неправильный_множественный_субъект_глагол_согласие_2
Используйте следующую команду, чтобы загрузить этот набор данных в TFDS:
ds = tfds.load('huggingface:blimp/irregular_plural_subject_verb_agreement_2')
- Описание :
BLiMP is a challenge set for evaluating what language models (LMs) know about
major grammatical phenomena in English. BLiMP consists of 67 sub-datasets, each
containing 1000 minimal pairs isolating specific contrasts in syntax,
morphology, or semantics. The data is automatically generated according to
expert-crafted grammars.
- Лицензия : Нет известной лицензии.
- Версия : 0.1.0
- Расколы :
Расколоть | Примеры |
---|---|
'train' | 1000 |
- Функции :
{
"sentence_good": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"sentence_bad": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"field": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"linguistics_term": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"UID": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"simple_LM_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"one_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"two_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"lexically_identical": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"pair_id": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
left_branch_island_echo_question
Используйте следующую команду, чтобы загрузить этот набор данных в TFDS:
ds = tfds.load('huggingface:blimp/left_branch_island_echo_question')
- Описание :
BLiMP is a challenge set for evaluating what language models (LMs) know about
major grammatical phenomena in English. BLiMP consists of 67 sub-datasets, each
containing 1000 minimal pairs isolating specific contrasts in syntax,
morphology, or semantics. The data is automatically generated according to
expert-crafted grammars.
- Лицензия : Нет известной лицензии.
- Версия : 0.1.0
- Расколы :
Расколоть | Примеры |
---|---|
'train' | 1000 |
- Функции :
{
"sentence_good": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"sentence_bad": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"field": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"linguistics_term": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"UID": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"simple_LM_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"one_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"two_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"lexically_identical": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"pair_id": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
left_branch_island_simple_question
Используйте следующую команду, чтобы загрузить этот набор данных в TFDS:
ds = tfds.load('huggingface:blimp/left_branch_island_simple_question')
- Описание :
BLiMP is a challenge set for evaluating what language models (LMs) know about
major grammatical phenomena in English. BLiMP consists of 67 sub-datasets, each
containing 1000 minimal pairs isolating specific contrasts in syntax,
morphology, or semantics. The data is automatically generated according to
expert-crafted grammars.
- Лицензия : Нет известной лицензии.
- Версия : 0.1.0
- Расколы :
Расколоть | Примеры |
---|---|
'train' | 1000 |
- Функции :
{
"sentence_good": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"sentence_bad": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"field": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"linguistics_term": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"UID": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"simple_LM_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"one_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"two_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"lexically_identical": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"pair_id": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
matrix_question_npi_licensor_present
Используйте следующую команду, чтобы загрузить этот набор данных в TFDS:
ds = tfds.load('huggingface:blimp/matrix_question_npi_licensor_present')
- Описание :
BLiMP is a challenge set for evaluating what language models (LMs) know about
major grammatical phenomena in English. BLiMP consists of 67 sub-datasets, each
containing 1000 minimal pairs isolating specific contrasts in syntax,
morphology, or semantics. The data is automatically generated according to
expert-crafted grammars.
- Лицензия : Нет известной лицензии.
- Версия : 0.1.0
- Расколы :
Расколоть | Примеры |
---|---|
'train' | 1000 |
- Функции :
{
"sentence_good": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"sentence_bad": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"field": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"linguistics_term": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"UID": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"simple_LM_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"one_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"two_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"lexically_identical": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"pair_id": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
npi_present_1
Используйте следующую команду, чтобы загрузить этот набор данных в TFDS:
ds = tfds.load('huggingface:blimp/npi_present_1')
- Описание :
BLiMP is a challenge set for evaluating what language models (LMs) know about
major grammatical phenomena in English. BLiMP consists of 67 sub-datasets, each
containing 1000 minimal pairs isolating specific contrasts in syntax,
morphology, or semantics. The data is automatically generated according to
expert-crafted grammars.
- Лицензия : Нет известной лицензии.
- Версия : 0.1.0
- Расколы :
Расколоть | Примеры |
---|---|
'train' | 1000 |
- Функции :
{
"sentence_good": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"sentence_bad": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"field": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"linguistics_term": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"UID": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"simple_LM_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"one_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"two_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"lexically_identical": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"pair_id": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
npi_present_2
Используйте следующую команду, чтобы загрузить этот набор данных в TFDS:
ds = tfds.load('huggingface:blimp/npi_present_2')
- Описание :
BLiMP is a challenge set for evaluating what language models (LMs) know about
major grammatical phenomena in English. BLiMP consists of 67 sub-datasets, each
containing 1000 minimal pairs isolating specific contrasts in syntax,
morphology, or semantics. The data is automatically generated according to
expert-crafted grammars.
- Лицензия : Нет известной лицензии.
- Версия : 0.1.0
- Расколы :
Расколоть | Примеры |
---|---|
'train' | 1000 |
- Функции :
{
"sentence_good": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"sentence_bad": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"field": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"linguistics_term": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"UID": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"simple_LM_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"one_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"two_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"lexically_identical": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"pair_id": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
only_npi_licensor_present
Используйте следующую команду, чтобы загрузить этот набор данных в TFDS:
ds = tfds.load('huggingface:blimp/only_npi_licensor_present')
- Описание :
BLiMP is a challenge set for evaluating what language models (LMs) know about
major grammatical phenomena in English. BLiMP consists of 67 sub-datasets, each
containing 1000 minimal pairs isolating specific contrasts in syntax,
morphology, or semantics. The data is automatically generated according to
expert-crafted grammars.
- Лицензия : Нет известной лицензии.
- Версия : 0.1.0
- Расколы :
Расколоть | Примеры |
---|---|
'train' | 1000 |
- Функции :
{
"sentence_good": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"sentence_bad": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"field": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"linguistics_term": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"UID": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"simple_LM_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"one_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"two_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"lexically_identical": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"pair_id": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
only_npi_scope
Используйте следующую команду, чтобы загрузить этот набор данных в TFDS:
ds = tfds.load('huggingface:blimp/only_npi_scope')
- Описание :
BLiMP is a challenge set for evaluating what language models (LMs) know about
major grammatical phenomena in English. BLiMP consists of 67 sub-datasets, each
containing 1000 minimal pairs isolating specific contrasts in syntax,
morphology, or semantics. The data is automatically generated according to
expert-crafted grammars.
- Лицензия : Нет известной лицензии.
- Версия : 0.1.0
- Расколы :
Расколоть | Примеры |
---|---|
'train' | 1000 |
- Функции :
{
"sentence_good": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"sentence_bad": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"field": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"linguistics_term": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"UID": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"simple_LM_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"one_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"two_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"lexically_identical": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"pair_id": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
пассивный_1
Используйте следующую команду, чтобы загрузить этот набор данных в TFDS:
ds = tfds.load('huggingface:blimp/passive_1')
- Описание :
BLiMP is a challenge set for evaluating what language models (LMs) know about
major grammatical phenomena in English. BLiMP consists of 67 sub-datasets, each
containing 1000 minimal pairs isolating specific contrasts in syntax,
morphology, or semantics. The data is automatically generated according to
expert-crafted grammars.
- Лицензия : Нет известной лицензии.
- Версия : 0.1.0
- Расколы :
Расколоть | Примеры |
---|---|
'train' | 1000 |
- Функции :
{
"sentence_good": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"sentence_bad": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"field": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"linguistics_term": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"UID": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"simple_LM_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"one_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"two_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"lexically_identical": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"pair_id": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
пассивный_2
Используйте следующую команду, чтобы загрузить этот набор данных в TFDS:
ds = tfds.load('huggingface:blimp/passive_2')
- Описание :
BLiMP is a challenge set for evaluating what language models (LMs) know about
major grammatical phenomena in English. BLiMP consists of 67 sub-datasets, each
containing 1000 minimal pairs isolating specific contrasts in syntax,
morphology, or semantics. The data is automatically generated according to
expert-crafted grammars.
- Лицензия : Нет известной лицензии.
- Версия : 0.1.0
- Расколы :
Расколоть | Примеры |
---|---|
'train' | 1000 |
- Функции :
{
"sentence_good": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"sentence_bad": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"field": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"linguistics_term": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"UID": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"simple_LM_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"one_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"two_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"lexically_identical": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"pair_id": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
Принцип_A_c_command
Используйте следующую команду, чтобы загрузить этот набор данных в TFDS:
ds = tfds.load('huggingface:blimp/principle_A_c_command')
- Описание :
BLiMP is a challenge set for evaluating what language models (LMs) know about
major grammatical phenomena in English. BLiMP consists of 67 sub-datasets, each
containing 1000 minimal pairs isolating specific contrasts in syntax,
morphology, or semantics. The data is automatically generated according to
expert-crafted grammars.
- Лицензия : Нет известной лицензии.
- Версия : 0.1.0
- Расколы :
Расколоть | Примеры |
---|---|
'train' | 1000 |
- Функции :
{
"sentence_good": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"sentence_bad": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"field": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"linguistics_term": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"UID": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"simple_LM_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"one_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"two_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"lexically_identical": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"pair_id": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
принцип_A_case_1
Используйте следующую команду, чтобы загрузить этот набор данных в TFDS:
ds = tfds.load('huggingface:blimp/principle_A_case_1')
- Описание :
BLiMP is a challenge set for evaluating what language models (LMs) know about
major grammatical phenomena in English. BLiMP consists of 67 sub-datasets, each
containing 1000 minimal pairs isolating specific contrasts in syntax,
morphology, or semantics. The data is automatically generated according to
expert-crafted grammars.
- Лицензия : Нет известной лицензии.
- Версия : 0.1.0
- Расколы :
Расколоть | Примеры |
---|---|
'train' | 1000 |
- Функции :
{
"sentence_good": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"sentence_bad": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"field": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"linguistics_term": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"UID": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"simple_LM_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"one_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"two_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"lexically_identical": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"pair_id": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
принцип_A_case_2
Используйте следующую команду, чтобы загрузить этот набор данных в TFDS:
ds = tfds.load('huggingface:blimp/principle_A_case_2')
- Описание :
BLiMP is a challenge set for evaluating what language models (LMs) know about
major grammatical phenomena in English. BLiMP consists of 67 sub-datasets, each
containing 1000 minimal pairs isolating specific contrasts in syntax,
morphology, or semantics. The data is automatically generated according to
expert-crafted grammars.
- Лицензия : Нет известной лицензии.
- Версия : 0.1.0
- Расколы :
Расколоть | Примеры |
---|---|
'train' | 1000 |
- Функции :
{
"sentence_good": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"sentence_bad": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"field": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"linguistics_term": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"UID": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"simple_LM_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"one_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"two_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"lexically_identical": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"pair_id": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
Принцип_A_домен_1
Используйте следующую команду, чтобы загрузить этот набор данных в TFDS:
ds = tfds.load('huggingface:blimp/principle_A_domain_1')
- Описание :
BLiMP is a challenge set for evaluating what language models (LMs) know about
major grammatical phenomena in English. BLiMP consists of 67 sub-datasets, each
containing 1000 minimal pairs isolating specific contrasts in syntax,
morphology, or semantics. The data is automatically generated according to
expert-crafted grammars.
- Лицензия : Нет известной лицензии.
- Версия : 0.1.0
- Расколы :
Расколоть | Примеры |
---|---|
'train' | 1000 |
- Функции :
{
"sentence_good": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"sentence_bad": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"field": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"linguistics_term": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"UID": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"simple_LM_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"one_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"two_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"lexically_identical": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"pair_id": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
принцип_A_domain_2
Используйте следующую команду, чтобы загрузить этот набор данных в TFDS:
ds = tfds.load('huggingface:blimp/principle_A_domain_2')
- Описание :
BLiMP is a challenge set for evaluating what language models (LMs) know about
major grammatical phenomena in English. BLiMP consists of 67 sub-datasets, each
containing 1000 minimal pairs isolating specific contrasts in syntax,
morphology, or semantics. The data is automatically generated according to
expert-crafted grammars.
- Лицензия : Нет известной лицензии.
- Версия : 0.1.0
- Расколы :
Расколоть | Примеры |
---|---|
'train' | 1000 |
- Функции :
{
"sentence_good": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"sentence_bad": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"field": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"linguistics_term": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"UID": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"simple_LM_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"one_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"two_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"lexically_identical": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"pair_id": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
принцип_A_domain_3
Используйте следующую команду, чтобы загрузить этот набор данных в TFDS:
ds = tfds.load('huggingface:blimp/principle_A_domain_3')
- Описание :
BLiMP is a challenge set for evaluating what language models (LMs) know about
major grammatical phenomena in English. BLiMP consists of 67 sub-datasets, each
containing 1000 minimal pairs isolating specific contrasts in syntax,
morphology, or semantics. The data is automatically generated according to
expert-crafted grammars.
- Лицензия : Нет известной лицензии.
- Версия : 0.1.0
- Расколы :
Расколоть | Примеры |
---|---|
'train' | 1000 |
- Функции :
{
"sentence_good": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"sentence_bad": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"field": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"linguistics_term": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"UID": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"simple_LM_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"one_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"two_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"lexically_identical": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"pair_id": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
принцип_А_реконструкция
Используйте следующую команду, чтобы загрузить этот набор данных в TFDS:
ds = tfds.load('huggingface:blimp/principle_A_reconstruction')
- Описание :
BLiMP is a challenge set for evaluating what language models (LMs) know about
major grammatical phenomena in English. BLiMP consists of 67 sub-datasets, each
containing 1000 minimal pairs isolating specific contrasts in syntax,
morphology, or semantics. The data is automatically generated according to
expert-crafted grammars.
- Лицензия : Нет известной лицензии.
- Версия : 0.1.0
- Расколы :
Расколоть | Примеры |
---|---|
'train' | 1000 |
- Функции :
{
"sentence_good": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"sentence_bad": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"field": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"linguistics_term": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"UID": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"simple_LM_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"one_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"two_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"lexically_identical": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"pair_id": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
Regular_plural_subject_verb_agreement_1
Используйте следующую команду, чтобы загрузить этот набор данных в TFDS:
ds = tfds.load('huggingface:blimp/regular_plural_subject_verb_agreement_1')
- Описание :
BLiMP is a challenge set for evaluating what language models (LMs) know about
major grammatical phenomena in English. BLiMP consists of 67 sub-datasets, each
containing 1000 minimal pairs isolating specific contrasts in syntax,
morphology, or semantics. The data is automatically generated according to
expert-crafted grammars.
- Лицензия : Нет известной лицензии.
- Версия : 0.1.0
- Расколы :
Расколоть | Примеры |
---|---|
'train' | 1000 |
- Функции :
{
"sentence_good": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"sentence_bad": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"field": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"linguistics_term": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"UID": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"simple_LM_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"one_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"two_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"lexically_identical": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"pair_id": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
Regular_plural_subject_verb_agreement_2
Используйте следующую команду, чтобы загрузить этот набор данных в TFDS:
ds = tfds.load('huggingface:blimp/regular_plural_subject_verb_agreement_2')
- Описание :
BLiMP is a challenge set for evaluating what language models (LMs) know about
major grammatical phenomena in English. BLiMP consists of 67 sub-datasets, each
containing 1000 minimal pairs isolating specific contrasts in syntax,
morphology, or semantics. The data is automatically generated according to
expert-crafted grammars.
- Лицензия : Нет известной лицензии.
- Версия : 0.1.0
- Расколы :
Расколоть | Примеры |
---|---|
'train' | 1000 |
- Функции :
{
"sentence_good": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"sentence_bad": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"field": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"linguistics_term": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"UID": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"simple_LM_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"one_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"two_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"lexically_identical": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"pair_id": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
sendential_negation_npi_licensor_present
Используйте следующую команду, чтобы загрузить этот набор данных в TFDS:
ds = tfds.load('huggingface:blimp/sentential_negation_npi_licensor_present')
- Описание :
BLiMP is a challenge set for evaluating what language models (LMs) know about
major grammatical phenomena in English. BLiMP consists of 67 sub-datasets, each
containing 1000 minimal pairs isolating specific contrasts in syntax,
morphology, or semantics. The data is automatically generated according to
expert-crafted grammars.
- Лицензия : Нет известной лицензии.
- Версия : 0.1.0
- Расколы :
Расколоть | Примеры |
---|---|
'train' | 1000 |
- Функции :
{
"sentence_good": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"sentence_bad": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"field": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"linguistics_term": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"UID": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"simple_LM_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"one_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"two_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"lexically_identical": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"pair_id": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
sentential_negation_npi_scope
Используйте следующую команду, чтобы загрузить этот набор данных в TFDS:
ds = tfds.load('huggingface:blimp/sentential_negation_npi_scope')
- Описание :
BLiMP is a challenge set for evaluating what language models (LMs) know about
major grammatical phenomena in English. BLiMP consists of 67 sub-datasets, each
containing 1000 minimal pairs isolating specific contrasts in syntax,
morphology, or semantics. The data is automatically generated according to
expert-crafted grammars.
- Лицензия : Нет известной лицензии.
- Версия : 0.1.0
- Расколы :
Расколоть | Примеры |
---|---|
'train' | 1000 |
- Функции :
{
"sentence_good": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"sentence_bad": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"field": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"linguistics_term": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"UID": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"simple_LM_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"one_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"two_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"lexically_identical": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"pair_id": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
Сентиментальный_субъект_остров
Используйте следующую команду, чтобы загрузить этот набор данных в TFDS:
ds = tfds.load('huggingface:blimp/sentential_subject_island')
- Описание :
BLiMP is a challenge set for evaluating what language models (LMs) know about
major grammatical phenomena in English. BLiMP consists of 67 sub-datasets, each
containing 1000 minimal pairs isolating specific contrasts in syntax,
morphology, or semantics. The data is automatically generated according to
expert-crafted grammars.
- Лицензия : Нет известной лицензии.
- Версия : 0.1.0
- Расколы :
Расколоть | Примеры |
---|---|
'train' | 1000 |
- Функции :
{
"sentence_good": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"sentence_bad": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"field": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"linguistics_term": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"UID": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"simple_LM_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"one_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"two_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"lexically_identical": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"pair_id": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
превосходные_квантификаторы_1
Используйте следующую команду, чтобы загрузить этот набор данных в TFDS:
ds = tfds.load('huggingface:blimp/superlative_quantifiers_1')
- Описание :
BLiMP is a challenge set for evaluating what language models (LMs) know about
major grammatical phenomena in English. BLiMP consists of 67 sub-datasets, each
containing 1000 minimal pairs isolating specific contrasts in syntax,
morphology, or semantics. The data is automatically generated according to
expert-crafted grammars.
- Лицензия : Нет известной лицензии.
- Версия : 0.1.0
- Расколы :
Расколоть | Примеры |
---|---|
'train' | 1000 |
- Функции :
{
"sentence_good": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"sentence_bad": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"field": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"linguistics_term": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"UID": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"simple_LM_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"one_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"two_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"lexically_identical": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"pair_id": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
превосходные_квантификаторы_2
Используйте следующую команду, чтобы загрузить этот набор данных в TFDS:
ds = tfds.load('huggingface:blimp/superlative_quantifiers_2')
- Описание :
BLiMP is a challenge set for evaluating what language models (LMs) know about
major grammatical phenomena in English. BLiMP consists of 67 sub-datasets, each
containing 1000 minimal pairs isolating specific contrasts in syntax,
morphology, or semantics. The data is automatically generated according to
expert-crafted grammars.
- Лицензия : Нет известной лицензии.
- Версия : 0.1.0
- Расколы :
Расколоть | Примеры |
---|---|
'train' | 1000 |
- Функции :
{
"sentence_good": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"sentence_bad": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"field": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"linguistics_term": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"UID": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"simple_LM_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"one_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"two_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"lexically_identical": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"pair_id": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
жесткий_vs_raising_1
Используйте следующую команду, чтобы загрузить этот набор данных в TFDS:
ds = tfds.load('huggingface:blimp/tough_vs_raising_1')
- Описание :
BLiMP is a challenge set for evaluating what language models (LMs) know about
major grammatical phenomena in English. BLiMP consists of 67 sub-datasets, each
containing 1000 minimal pairs isolating specific contrasts in syntax,
morphology, or semantics. The data is automatically generated according to
expert-crafted grammars.
- Лицензия : Нет известной лицензии.
- Версия : 0.1.0
- Расколы :
Расколоть | Примеры |
---|---|
'train' | 1000 |
- Функции :
{
"sentence_good": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"sentence_bad": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"field": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"linguistics_term": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"UID": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"simple_LM_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"one_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"two_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"lexically_identical": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"pair_id": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
жесткий_vs_raising_2
Используйте следующую команду, чтобы загрузить этот набор данных в TFDS:
ds = tfds.load('huggingface:blimp/tough_vs_raising_2')
- Описание :
BLiMP is a challenge set for evaluating what language models (LMs) know about
major grammatical phenomena in English. BLiMP consists of 67 sub-datasets, each
containing 1000 minimal pairs isolating specific contrasts in syntax,
morphology, or semantics. The data is automatically generated according to
expert-crafted grammars.
- Лицензия : Нет известной лицензии.
- Версия : 0.1.0
- Расколы :
Расколоть | Примеры |
---|---|
'train' | 1000 |
- Функции :
{
"sentence_good": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"sentence_bad": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"field": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"linguistics_term": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"UID": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"simple_LM_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"one_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"two_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"lexically_identical": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"pair_id": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
переходный
Используйте следующую команду, чтобы загрузить этот набор данных в TFDS:
ds = tfds.load('huggingface:blimp/transitive')
- Описание :
BLiMP is a challenge set for evaluating what language models (LMs) know about
major grammatical phenomena in English. BLiMP consists of 67 sub-datasets, each
containing 1000 minimal pairs isolating specific contrasts in syntax,
morphology, or semantics. The data is automatically generated according to
expert-crafted grammars.
- Лицензия : Нет известной лицензии.
- Версия : 0.1.0
- Расколы :
Расколоть | Примеры |
---|---|
'train' | 1000 |
- Функции :
{
"sentence_good": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"sentence_bad": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"field": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"linguistics_term": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"UID": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"simple_LM_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"one_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"two_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"lexically_identical": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"pair_id": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
какой_остров
Используйте следующую команду, чтобы загрузить этот набор данных в TFDS:
ds = tfds.load('huggingface:blimp/wh_island')
- Описание :
BLiMP is a challenge set for evaluating what language models (LMs) know about
major grammatical phenomena in English. BLiMP consists of 67 sub-datasets, each
containing 1000 minimal pairs isolating specific contrasts in syntax,
morphology, or semantics. The data is automatically generated according to
expert-crafted grammars.
- Лицензия : Нет известной лицензии.
- Версия : 0.1.0
- Расколы :
Расколоть | Примеры |
---|---|
'train' | 1000 |
- Функции :
{
"sentence_good": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"sentence_bad": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"field": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"linguistics_term": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"UID": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"simple_LM_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"one_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"two_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"lexically_identical": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"pair_id": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
wh_questions_object_gap
Используйте следующую команду, чтобы загрузить этот набор данных в TFDS:
ds = tfds.load('huggingface:blimp/wh_questions_object_gap')
- Описание :
BLiMP is a challenge set for evaluating what language models (LMs) know about
major grammatical phenomena in English. BLiMP consists of 67 sub-datasets, each
containing 1000 minimal pairs isolating specific contrasts in syntax,
morphology, or semantics. The data is automatically generated according to
expert-crafted grammars.
- Лицензия : Нет известной лицензии.
- Версия : 0.1.0
- Расколы :
Расколоть | Примеры |
---|---|
'train' | 1000 |
- Функции :
{
"sentence_good": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"sentence_bad": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"field": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"linguistics_term": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"UID": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"simple_LM_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"one_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"two_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"lexically_identical": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"pair_id": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
wh_questions_subject_gap
Используйте следующую команду, чтобы загрузить этот набор данных в TFDS:
ds = tfds.load('huggingface:blimp/wh_questions_subject_gap')
- Описание :
BLiMP is a challenge set for evaluating what language models (LMs) know about
major grammatical phenomena in English. BLiMP consists of 67 sub-datasets, each
containing 1000 minimal pairs isolating specific contrasts in syntax,
morphology, or semantics. The data is automatically generated according to
expert-crafted grammars.
- Лицензия : Нет известной лицензии.
- Версия : 0.1.0
- Расколы :
Расколоть | Примеры |
---|---|
'train' | 1000 |
- Функции :
{
"sentence_good": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"sentence_bad": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"field": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"linguistics_term": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"UID": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"simple_LM_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"one_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"two_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"lexically_identical": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"pair_id": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
wh_questions_subject_gap_long_distance
Используйте следующую команду, чтобы загрузить этот набор данных в TFDS:
ds = tfds.load('huggingface:blimp/wh_questions_subject_gap_long_distance')
- Описание :
BLiMP is a challenge set for evaluating what language models (LMs) know about
major grammatical phenomena in English. BLiMP consists of 67 sub-datasets, each
containing 1000 minimal pairs isolating specific contrasts in syntax,
morphology, or semantics. The data is automatically generated according to
expert-crafted grammars.
- Лицензия : Нет известной лицензии.
- Версия : 0.1.0
- Расколы :
Расколоть | Примеры |
---|---|
'train' | 1000 |
- Функции :
{
"sentence_good": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"sentence_bad": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"field": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"linguistics_term": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"UID": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"simple_LM_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"one_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"two_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"lexically_identical": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"pair_id": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
Wh_vs_that_no_gap
Используйте следующую команду, чтобы загрузить этот набор данных в TFDS:
ds = tfds.load('huggingface:blimp/wh_vs_that_no_gap')
- Описание :
BLiMP is a challenge set for evaluating what language models (LMs) know about
major grammatical phenomena in English. BLiMP consists of 67 sub-datasets, each
containing 1000 minimal pairs isolating specific contrasts in syntax,
morphology, or semantics. The data is automatically generated according to
expert-crafted grammars.
- Лицензия : Нет известной лицензии.
- Версия : 0.1.0
- Расколы :
Расколоть | Примеры |
---|---|
'train' | 1000 |
- Функции :
{
"sentence_good": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"sentence_bad": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"field": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"linguistics_term": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"UID": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"simple_LM_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"one_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"two_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"lexically_identical": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"pair_id": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
wh_vs_that_no_gap_long_distance
Используйте следующую команду, чтобы загрузить этот набор данных в TFDS:
ds = tfds.load('huggingface:blimp/wh_vs_that_no_gap_long_distance')
- Описание :
BLiMP is a challenge set for evaluating what language models (LMs) know about
major grammatical phenomena in English. BLiMP consists of 67 sub-datasets, each
containing 1000 minimal pairs isolating specific contrasts in syntax,
morphology, or semantics. The data is automatically generated according to
expert-crafted grammars.
- Лицензия : Нет известной лицензии.
- Версия : 0.1.0
- Расколы :
Расколоть | Примеры |
---|---|
'train' | 1000 |
- Функции :
{
"sentence_good": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"sentence_bad": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"field": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"linguistics_term": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"UID": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"simple_LM_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"one_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"two_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"lexically_identical": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"pair_id": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
Wh_vs_that_with_gap
Используйте следующую команду, чтобы загрузить этот набор данных в TFDS:
ds = tfds.load('huggingface:blimp/wh_vs_that_with_gap')
- Описание :
BLiMP is a challenge set for evaluating what language models (LMs) know about
major grammatical phenomena in English. BLiMP consists of 67 sub-datasets, each
containing 1000 minimal pairs isolating specific contrasts in syntax,
morphology, or semantics. The data is automatically generated according to
expert-crafted grammars.
- Лицензия : Нет известной лицензии.
- Версия : 0.1.0
- Расколы :
Расколоть | Примеры |
---|---|
'train' | 1000 |
- Функции :
{
"sentence_good": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"sentence_bad": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"field": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"linguistics_term": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"UID": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"simple_LM_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"one_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"two_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"lexically_identical": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"pair_id": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
wh_vs_that_with_gap_long_distance
Используйте следующую команду, чтобы загрузить этот набор данных в TFDS:
ds = tfds.load('huggingface:blimp/wh_vs_that_with_gap_long_distance')
- Описание :
BLiMP is a challenge set for evaluating what language models (LMs) know about
major grammatical phenomena in English. BLiMP consists of 67 sub-datasets, each
containing 1000 minimal pairs isolating specific contrasts in syntax,
morphology, or semantics. The data is automatically generated according to
expert-crafted grammars.
- Лицензия : Нет известной лицензии.
- Версия : 0.1.0
- Расколы :
Расколоть | Примеры |
---|---|
'train' | 1000 |
- Функции :
{
"sentence_good": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"sentence_bad": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"field": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"linguistics_term": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"UID": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"simple_LM_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"one_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"two_prefix_method": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"lexically_identical": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"pair_id": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}