fake_news_english

References:

Use the following command to load this dataset in TFDS:

import tensorflow_datasets as tfds
ds = tfds.load('huggingface:fake_news_english')
  • Description:
Fake news has become a major societal issue and a technical challenge for social media companies to identify. This content is difficult to identify because the term "fake news" covers intentionally false, deceptive stories as well as factual errors, satire, and sometimes, stories that a person just does not like. Addressing the problem requires clear definitions and examples. In this work, we present a dataset of fake news and satire stories that are hand coded, verified, and, in the case of fake news, include rebutting stories. We also include a thematic content analysis of the articles, identifying major themes that include hyperbolic support or condemnation of a figure, conspiracy theories, racist themes, and discrediting of reliable sources. In addition to releasing this dataset for research use, we analyze it and show results based on language that are promising for classification purposes. Overall, our contribution of a dataset and initial analysis are designed to support future work by fake news researchers.
  • License: No known license
  • Version: 1.1.0
  • Splits:
Split    Examples
'train'  492
  • Features:
{
    "article_number": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "url_of_article": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "fake_or_satire": {
        "num_classes": 2,
        "names": [
            "Satire",
            "Fake"
        ],
        "names_file": null,
        "id": null,
        "_type": "ClassLabel"
    },
    "url_of_rebutting_article": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}
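
In the schema above, `fake_or_satire` is a `ClassLabel`, so each record stores an integer id rather than the label text, with the id given by the position in `names`. The following is a minimal sketch of that mapping in plain Python. The helper names `int2str` and `str2int` and the sample record are assumptions made for illustration; the URLs are placeholders, not rows from the dataset.

```python
# Label names copied from the "fake_or_satire" ClassLabel in the schema above.
# The position in the list is the integer id stored in each record.
LABEL_NAMES = ["Satire", "Fake"]


def int2str(label_id: int) -> str:
    """Map an integer class id to its label name."""
    if not 0 <= label_id < len(LABEL_NAMES):
        raise ValueError(f"unknown label id: {label_id}")
    return LABEL_NAMES[label_id]


def str2int(name: str) -> int:
    """Map a label name back to its integer class id."""
    return LABEL_NAMES.index(name)


# Hypothetical record shaped like the feature dict (placeholder values only).
example = {
    "article_number": 1,
    "url_of_article": "https://example.com/article",
    "fake_or_satire": 1,
    "url_of_rebutting_article": "https://example.com/rebuttal",
}

label = int2str(example["fake_or_satire"])
print(label)
```

Under this ordering, id 0 corresponds to "Satire" and id 1 to "Fake", so a check such as `record["fake_or_satire"] == 1` selects the fake news stories.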