tab_fact

مراجع:

tab_fact

برای بارگذاری این مجموعه داده در TFDS از دستور زیر استفاده کنید:

ds = tfds.load('huggingface:tab_fact/tab_fact')
  • توضیحات :
The problem of verifying whether a textual hypothesis holds the truth based on the given evidence, also known as fact verification, plays an important role in the study of natural language understanding and semantic representation. However, existing studies are restricted to dealing with unstructured textual evidence (e.g., sentences and passages, a pool of passages), while verification using structured forms of evidence, such as tables, graphs, and databases, remains unexplored. TABFACT is large scale dataset with 16k Wikipedia tables as evidence for 118k human annotated statements designed for fact verification with semi-structured evidence. The statements are labeled as either ENTAILED or REFUTED. TABFACT is challenging since it involves both soft linguistic reasoning and hard symbolic reasoning.
  • مجوز : مجوز شناخته شده ای وجود ندارد
  • نسخه : 1.0.0
  • تقسیم ها :
تقسیم کنید نمونه ها
'test' 12779
'train' 92283
'validation' 12792
  • ویژگی ها :
{
    "id": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "table_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "table_text": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "table_caption": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "statement": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "label": {
        "num_classes": 2,
        "names": [
            "refuted",
            "entailed"
        ],
        "names_file": null,
        "id": null,
        "_type": "ClassLabel"
    }
}

blind_test

برای بارگذاری این مجموعه داده در TFDS از دستور زیر استفاده کنید:

ds = tfds.load('huggingface:tab_fact/blind_test')
  • توضیحات :
The problem of verifying whether a textual hypothesis holds the truth based on the given evidence, also known as fact verification, plays an important role in the study of natural language understanding and semantic representation. However, existing studies are restricted to dealing with unstructured textual evidence (e.g., sentences and passages, a pool of passages), while verification using structured forms of evidence, such as tables, graphs, and databases, remains unexplored. TABFACT is large scale dataset with 16k Wikipedia tables as evidence for 118k human annotated statements designed for fact verification with semi-structured evidence. The statements are labeled as either ENTAILED or REFUTED. TABFACT is challenging since it involves both soft linguistic reasoning and hard symbolic reasoning.
  • مجوز : مجوز شناخته شده ای وجود ندارد
  • نسخه : 1.0.0
  • تقسیم ها :
تقسیم کنید نمونه ها
'test' 9750
  • ویژگی ها :
{
    "id": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "table_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "table_text": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "table_caption": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "statement": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "test_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}