Riferimenti:
comprensione_narrativa_astratta
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/abstract_narrative_understanding')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 3000 |
'train' | 2400 |
'validation' | 600 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
anacronismi
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/anachronisms')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 230 |
'train' | 184 |
'validation' | 46 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
similarità_analogica
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/analogical_similarity')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 323 |
'train' | 259 |
'validation' | 64 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
coinvolgimento_analitico
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/analytic_entailment')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 70 |
'train' | 54 |
'validation' | 16 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
aritmetica
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/arithmetic')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 15023 |
'train' | 12019 |
'validation' | 3004 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
ascii_word_recognition
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/ascii_word_recognition')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 5000 |
'train' | 4000 |
'validation' | 1000 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
verifica_autoralità
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/authorship_verification')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 880 |
'train' | 704 |
'validation' | 176 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
categorizzazione_auto
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/auto_categorization')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 328 |
'train' | 263 |
'validation' | 65 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
auto_debug
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/auto_debugging')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 34 |
'train' | 18 |
'validation' | 16 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
bbq_lite_json
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/bbq_lite_json')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 16076 |
'train' | 12866 |
'validation' | 3210 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
bridging_anaphora_length_barqa
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/bridging_anaphora_resolution_barqa')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 648 |
'train' | 519 |
'validation' | 129 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
giudizio_causale
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/causal_judgment')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 190 |
'train' | 152 |
'validation' | 38 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
causa_ed_effetto
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/cause_and_effect')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 153 |
'train' | 123 |
'validation' | 30 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
scacco matto_in_uno
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/checkmate_in_one')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 3498 |
'train' | 2799 |
'validation' | 699 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
chess_state_tracking
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/chess_state_tracking')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 6000 |
'train' | 4800 |
'validation' | 1200 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
teorema_del_resto_cinese
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/chinese_remainder_theorem')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 500 |
'train' | 400 |
'validation' | 100 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
cifar10_classificazione
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/cifar10_classification')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 20000 |
'train' | 16000 |
'validation' | 4000 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
codice_riga_descrizione
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/code_line_description')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 60 |
'train' | 44 |
'validation' | 16 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
nomi in codice
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/codenames')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 85 |
'train' | 68 |
'validation' | 17 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
colore
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/color')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 4000 |
'train' | 3200 |
'validation' | 800 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
morfema_comune
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/common_morpheme')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 50 |
'train' | 34 |
'validation' | 16 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
combinazioni_concettuali
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/conceptual_combinations')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 103 |
'train' | 84 |
'validation' | 19 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
conlang_translation
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/conlang_translation')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 164 |
'train' | 132 |
'validation' | 32 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
conflitti_conoscenza_parametrica_contestuale
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/contextual_parametric_knowledge_conflicts')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 17528 |
'train' | 14023 |
'validation' | 3505 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
crash_blossom
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/crash_blossom')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 38 |
'train' | 22 |
'validation' | 16 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
crass_ai
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/crass_ai')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 44 |
'train' | 28 |
'validation' | 16 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
criobiologia_spagnolo
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/cryobiology_spanish')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 146 |
'train' | 117 |
'validation' | 29 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
criptonite
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/cryptonite')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 26157 |
'train' | 20926 |
'validation' | 5231 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
cs_algoritmi
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/cs_algorithms')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 1320 |
'train' | 1056 |
'validation' | 264 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
dark_humor_detection
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/dark_humor_detection')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 80 |
'train' | 64 |
'validation' | 16 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
data_comprensione
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/date_understanding')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 369 |
'train' | 296 |
'validation' | 73 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
disambiguazione_qa
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/disambiguation_qa')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 258 |
'train' | 207 |
'validation' | 51 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
discorse_marker_prediction
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/discourse_marker_prediction')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 857 |
'train' | 686 |
'validation' | 171 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
disfl_qa
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/disfl_qa')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 8000 |
'train' | 6400 |
'validation' | 1600 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
dyck_linguals
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/dyck_languages')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 1000 |
'train' | 800 |
'validation' | 200 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
elementari_math_qa
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/elementary_math_qa')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 38160 |
'train' | 30531 |
'validation' | 7629 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
emoji_film
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/emoji_movie')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 100 |
'train' | 80 |
'validation' | 20 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
emojis_emotion_prediction
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/emojis_emotion_prediction')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 131 |
'train' | 105 |
'validation' | 26 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
giudizi_empirici
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/empirical_judgments')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 99 |
'train' | 80 |
'validation' | 19 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
proverbi_inglese
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/english_proverbs')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 34 |
'train' | 18 |
'validation' | 16 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
proverbi_russi_inglese
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/english_russian_proverbs')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 80 |
'train' | 64 |
'validation' | 16 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
polarità_implicata
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/entailed_polarity')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 148 |
'train' | 119 |
'validation' | 29 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
comportato_polarità_hindi
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/entailed_polarity_hindi')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 138 |
'train' | 111 |
'validation' | 27 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
ragionamento_epistemico
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/epistemic_reasoning')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 2000 |
'train' | 1600 |
'validation' | 400 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
valutare_l'essenzialità_dell'informazione
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/evaluating_information_essentiality')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 68 |
'train' | 52 |
'validation' | 16 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
fact_checker
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/fact_checker')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 7154 |
'train' | 5724 |
'validation' | 1430 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
fantasia_ragionamento
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/fantasy_reasoning')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 201 |
'train' | 161 |
'validation' | 40 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
pochi_colpi_nlg
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/few_shot_nlg')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 153 |
'train' | 123 |
'validation' | 30 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
figura_del_discorso_rilevamento
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/figure_of_speech_detection')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 59 |
'train' | 43 |
'validation' | 16 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
errori_formali_sillogismi_negazione
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/formal_fallacies_syllogisms_negation')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 14200 |
'train' | 11360 |
'validation' | 2840 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
gemma
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/gem')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 14802 |
'train' | 11845 |
'validation' | 2957 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
genere_inclusive_frasi_german
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/gender_inclusive_sentences_german')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 200 |
'train' | 160 |
'validation' | 40 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
conoscenza_generale
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/general_knowledge')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 70 |
'train' | 54 |
'validation' | 16 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
forme_geometriche
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/geometric_shapes')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 359 |
'train' | 288 |
'validation' | 71 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
goal_step_wikihow
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/goal_step_wikihow')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 7053 |
'train' | 5643 |
'validation' | 1410 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
gre_reading_comprehension
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/gre_reading_comprehension')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 31 |
'train' | 15 |
'validation' | 16 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
hhh_allineamento
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/hhh_alignment')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 221 |
'train' | 179 |
'validation' | 42 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
hindi_domanda_risposta
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/hindi_question_answering')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 6610 |
'train' | 5288 |
'validation' | 1322 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
conoscenza_indù
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/hindu_knowledge')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 175 |
'train' | 140 |
'validation' | 35 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
hinglish_tossicità
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/hinglish_toxicity')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 200 |
'train' | 160 |
'validation' | 40 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
sensi_organi_umani
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/human_organs_senses')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 42 |
'train' | 26 |
'validation' | 16 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
iperbato
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/hyperbaton')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 50000 |
'train' | 40000 |
'validation' | 10000 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
identificare_teoremi_matematici
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/identify_math_theorems')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 53 |
'train' | 37 |
'validation' | 16 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
identifica_dispari_metafora
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/identify_odd_metaphor')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 47 |
'train' | 31 |
'validation' | 16 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
implicazioni
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/implicatures')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 492 |
'train' | 394 |
'validation' | 98 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
relazioni_implicite
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/implicit_relations')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 85 |
'train' | 68 |
'validation' | 17 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
intent_recognition
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/intent_recognition')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 693 |
'train' | 555 |
'validation' | 138 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
international_fonetic_alphabet_nli
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/international_phonetic_alphabet_nli')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 126 |
'train' | 101 |
'validation' | 25 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
alfabeto_fonetico_internazionale_translitterato
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/international_phonetic_alphabet_transliterate')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 1003 |
'train' | 803 |
'validation' | 200 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
intersect_geometry
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/intersect_geometry')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 249999 |
'train' | 200000 |
'validation' | 49999 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
ironia_identificazione
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/irony_identification')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 99 |
'train' | 80 |
'validation' | 19 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
kanji_ascii
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/kanji_ascii')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 1092 |
'train' | 875 |
'validation' | 217 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
Kannada
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/kannada')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 316 |
'train' | 253 |
'validation' | 63 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
key_value_maps
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/key_value_maps')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 101 |
'train' | 80 |
'validation' | 21 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
noti_sconosciuti
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/known_unknowns')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 46 |
'train' | 30 |
'validation' | 16 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
giochi_linguistici
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/language_games')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 2128 |
'train' | 1704 |
'validation' | 424 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
identificazione_lingua
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/language_identification')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 10000 |
'train' | 8000 |
'validation' | 2000 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
mappature_linguistiche
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/linguistic_mappings')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 15527 |
'train' | 12426 |
'validation' | 3101 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
linguistics_puzzle
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/linguistics_puzzles')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 2000 |
'train' | 1600 |
'validation' | 400 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
lista_funzioni
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/list_functions')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 10750 |
'train' | 8700 |
'validation' | 2050 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
logica_griglia_puzzle
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/logic_grid_puzzle')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 1000 |
'train' | 800 |
'validation' | 200 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
argomenti_logici
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/logical_args')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 32 |
'train' | 16 |
'validation' | 16 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
deduzione_logica
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/logical_deduction')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 1500 |
'train' | 1200 |
'validation' | 300 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
rilevamento_errore_logico
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/logical_fallacy_detection')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 2800 |
'train' | 2240 |
'validation' | 560 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
sequenza_logica
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/logical_sequence')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 39 |
'train' | 23 |
'validation' | 16 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
induzione_matematica
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/mathematical_induction')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 69 |
'train' | 53 |
'validation' | 16 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
forme di matrice
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/matrixshapes')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 4462 |
'train' | 3570 |
'validation' | 892 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
metafora_booleano
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/metaphor_boolean')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 680 |
'train' | 544 |
'validation' | 136 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
metafora_comprensione
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/metaphor_understanding')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 234 |
'train' | 188 |
'validation' | 46 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
minute_mysteries_qa
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/minute_mysteries_qa')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 477 |
'train' | 383 |
'validation' | 94 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
idee sbagliate
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/misconceptions')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 219 |
'train' | 176 |
'validation' | 43 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
idee sbagliate_russo
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/misconceptions_russian')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 49 |
'train' | 33 |
'validation' | 16 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
mnist_ascii
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/mnist_ascii')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 69984 |
'train' | 55988 |
'validation' | 13996 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
aritmetica_modificata
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/modified_arithmetic')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 6000 |
'train' | 4800 |
'validation' | 1200 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
moral_ammissibilità
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/moral_permissibility')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 342 |
'train' | 274 |
'validation' | 68 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
movie_dialog_stesso_o_diverso
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/movie_dialog_same_or_different')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 50000 |
'train' | 40000 |
'validation' | 10000 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
film_raccomandazione
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/movie_recommendation')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 500 |
'train' | 400 |
'validation' | 100 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
mult_data_wrangling
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/mult_data_wrangling')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 7854 |
'train' | 6380 |
'validation' | 1474 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
multiemo
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/multiemo')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 1437281 |
'train' | 1149873 |
'validation' | 287408 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
istruzioni_naturali
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/natural_instructions')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 193250 |
'train' | 154615 |
'validation' | 38635 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
navigare
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/navigate')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 1000 |
'train' | 800 |
'validation' | 200 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
sciocchezze_parole_grammatica
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/nonsense_words_grammar')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 50 |
'train' | 34 |
'validation' | 16 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
novel_concepts
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/novel_concepts')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 32 |
'train' | 16 |
'validation' | 16 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
conteggio_oggetti
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/object_counting')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 1000 |
'train' | 800 |
'validation' | 200 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
dispari_uno_fuori
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/odd_one_out')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 86 |
'train' | 69 |
'validation' | 17 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
operatori
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/operators')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 210 |
'train' | 168 |
'validation' | 42 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
paragrafo_segmentazione
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/paragraph_segmentation')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 9000 |
'train' | 7200 |
'validation' | 1800 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
parsinlu_qa
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/parsinlu_qa')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 1050 |
'train' | 840 |
'validation' | 210 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
parsinlu_reading_comprehension
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/parsinlu_reading_comprehension')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 518 |
'train' | 415 |
'validation' | 103 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
pinguini_in_un_tavolo
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/penguins_in_a_table')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 149 |
'train' | 120 |
'validation' | 29 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
elementi_periodici
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/periodic_elements')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 654 |
'train' | 524 |
'validation' | 130 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
idiomi_persiani
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/persian_idioms')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 66 |
'train' | 50 |
'validation' | 16 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
correlazione_frase
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/phrase_relatedness')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 100 |
'train' | 80 |
'validation' | 20 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
intuizione_fisica
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/physical_intuition')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 81 |
'train' | 65 |
'validation' | 16 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
fisica
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/physics')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 229 |
'train' | 184 |
'validation' | 45 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
domande_fisiche
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/physics_questions')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 54 |
'train' | 38 |
'validation' | 16 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
play_dialog_stesso_o_diverso
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/play_dialog_same_or_different')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 3264 |
'train' | 2612 |
'validation' | 652 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
Polish_sequence_labeling
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/polish_sequence_labeling')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 12812 |
'train' | 10250 |
'validation' | 2562 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
presupposti_come_nli
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/presuppositions_as_nli')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 735 |
'train' | 588 |
'validation' | 147 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
qa_wikidata
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/qa_wikidata')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 20321 |
'train' | 16257 |
'validation' | 4064 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
selezione_domanda
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/question_selection')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 1582 |
'train' | 1266 |
'validation' | 316 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
testo_reale_o_falso
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/real_or_fake_text')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 15088 |
'train' | 12072 |
'validation' | 3016 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
ragionamento_su_oggetti_colorati
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/reasoning_about_colored_objects')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 2000 |
'train' | 1600 |
'validation' | 400 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
ripetizione_copia_logica
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/repeat_copy_logic')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 32 |
'train' | 16 |
'validation' | 16 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
riformulare
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/rephrase')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 78 |
'train' | 62 |
'validation' | 16 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
enigma_senso
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/riddle_sense')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 49 |
'train' | 33 |
'validation' | 16 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
rovina_nomi
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/ruin_names')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 448 |
'train' | 359 |
'validation' | 89 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
salient_translation_error_detection
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/salient_translation_error_detection')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 998 |
'train' | 799 |
'validation' | 199 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
comunicato_stampa_scientifico
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/scientific_press_release')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 50 |
'train' | 34 |
'validation' | 16 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
semantic_parsing_in_context_sparc
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/semantic_parsing_in_context_sparc')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 1155 |
'train' | 924 |
'validation' | 231 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
semantic_parsing_spider
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/semantic_parsing_spider')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 1034 |
'train' | 828 |
'validation' | 206 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
frase_ambiguità
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/sentence_ambiguity')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 60 |
'train' | 44 |
'validation' | 16 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
somiglianze_astrazione
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/similarities_abstraction')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 76 |
'train' | 60 |
'validation' | 16 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
simp_turing_concept
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/simp_turing_concept')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 6390 |
'train' | 5112 |
'validation' | 1278 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
simple_arithmetic_json
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/simple_arithmetic_json')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 30 |
'train' | 14 |
'validation' | 16 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
simple_arithmetic_json_multiple_choice
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/simple_arithmetic_json_multiple_choice')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 8 |
'train' | 0 |
'validation' | 0 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
simple_arithmetic_json_subtasks
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/simple_arithmetic_json_subtasks')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 30 |
'train' | 15 |
'validation' | 15 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
simple_arithmetic_multiple_targets_json
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/simple_arithmetic_multiple_targets_json')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 10 |
'train' | 0 |
'validation' | 0 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
semplici_domande_etiche
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/simple_ethical_questions')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 115 |
'train' | 92 |
'validation' | 23 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
modifica_testo_semplice
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/simple_text_editing')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 47 |
'train' | 31 |
'validation' | 16 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
sbotta
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/snarks')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 181 |
'train' | 145 |
'validation' | 36 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
social_iqa
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/social_iqa')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 1935 |
'train' | 1548 |
'validation' | 387 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
social_support
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/social_support')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 897 |
'train' | 718 |
'validation' | 179 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
sport_comprensione
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/sports_understanding')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 986 |
'train' | 789 |
'validation' | 197 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
storie_strane
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/strange_stories')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 174 |
'train' | 140 |
'validation' | 34 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
strategiaqa
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/strategyqa')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 2289 |
'train' | 1832 |
'validation' | 457 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
informazioni_sufficienti
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/sufficient_information')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 39 |
'train' | 23 |
'validation' | 16 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
rischio_suicidio
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/suicide_risk')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 40 |
'train' | 24 |
'validation' | 16 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
swahili_english_proverbi
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/swahili_english_proverbs')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 153 |
'train' | 123 |
'validation' | 30 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
proverbi_da_svedesi_a_tedeschi
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/swedish_to_german_proverbs')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 72 |
'train' | 56 |
'validation' | 16 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
simbolo_interpretazione
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/symbol_interpretation')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 990 |
'train' | 795 |
'validation' | 195 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
sequenze_temporali
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/temporal_sequences')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 1000 |
'train' | 800 |
'validation' | 200 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
teso
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/tense')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 286 |
'train' | 229 |
'validation' | 57 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
cronometraggio
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/timedial')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 2550 |
'train' | 2040 |
'validation' | 510 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
topic_chat
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/topical_chat')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 22295 |
'train' | 17836 |
'validation' | 4459 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
tracking_shuffled_objects
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/tracking_shuffled_objects')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 3750 |
'train' | 3000 |
'validation' | 750 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
capire_favole
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/understanding_fables')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 189 |
'train' | 152 |
'validation' | 37 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
annulla_permutazione
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/undo_permutation')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 300 |
'train' | 240 |
'validation' | 60 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
conversione_unità
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/unit_conversion')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 23936 |
'train' | 19151 |
'validation' | 4785 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
unità_interpretazione
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/unit_interpretation')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 100 |
'train' | 80 |
'validation' | 20 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
apprendimento_innaturale_nel_contesto
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/unnatural_in_context_learning')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 73420 |
'train' | 58736 |
'validation' | 14684 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
vitaminc_fact_verification
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/vitaminc_fact_verification')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 54668 |
'train' | 43735 |
'validation' | 10933 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
cos'è_il_tao
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/what_is_the_tao')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 36 |
'train' | 20 |
'validation' | 16 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
which_wiki_modifica
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/which_wiki_edit')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 571 |
'train' | 457 |
'validation' | 114 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
winowhy
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/winowhy')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 2862 |
'train' | 2290 |
'validation' | 572 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
ordinamento_parole
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/word_sorting')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 1900 |
'train' | 1520 |
'validation' | 380 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
word_riordinare
Utilizzare il comando seguente per caricare questo set di dati in TFDS:
ds = tfds.load('huggingface:bigbench/word_unscrambling')
- Descrizione :
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to
probe large language models, and extrapolate their future capabilities.
- Licenza : Licenza Apache 2.0
- Versione : 0.0.0
- Divide :
Diviso | Esempi |
---|---|
'default' | 8917 |
'train' | 7134 |
'validation' | 1783 |
- Caratteristiche :
{
"idx": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"inputs": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_targets": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"multiple_choice_scores": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}