báo chí

Tài liệu tham khảo:

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:newspop')
  • Sự miêu tả :
This is a large data set of news items and their respective social feedback on multiple platforms: Facebook, Google+ and LinkedIn.
The collected data relates to a period of 8 months, between November 2015 and July 2016, accounting for about 100,000 news items on four different topics: economy, microsoft, obama and palestine.
This data set is tailored for evaluative comparisons in predictive analytics tasks, although allowing for tasks in other research areas such as topic detection and tracking, sentiment analysis in short text, first story detection or news recommendation.
  • Giấy phép : Giấy phép Quốc tế Creative Commons Ghi công 4.0 (CC-BY)
  • Phiên bản : 0.0.0
  • Chia tách :
Tách ra Ví dụ
'train' 93239
  • Đặc trưng :
{
    "id": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "title": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "headline": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "source": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "topic": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "publish_date": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "facebook": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "google_plus": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "linked_in": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    }
}