TFDS hiện hỗ trợ định dạng Croissant 🥐 ! Đọc tài liệu để biết thêm.

Trang này được dịch bởi Cloud Translation API.

tweet_eval

Tài liệu tham khảo:

biểu tượng cảm xúc

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:tweet_eval/emoji')

Sự miêu tả :

TweetEval consists of seven heterogenous tasks in Twitter, all framed as multi-class tweet classification. All tasks have been unified into the same benchmark, with each dataset presented in the same format and with fixed training, validation and test splits.

Giấy phép : Không có giấy phép được biết đến
Phiên bản : 1.1.0
Chia tách :

Tách ra	Ví dụ
`'test'`	50000
`'train'`	45000
`'validation'`	5000

Đặc trưng :

{
    "text": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "label": {
        "num_classes": 20,
        "names": [
            "\u2764",
            "\ud83d\ude0d",
            "\ud83d\ude02",
            "\ud83d\udc95",
            "\ud83d\udd25",
            "\ud83d\ude0a",
            "\ud83d\ude0e",
            "\u2728",
            "\ud83d\udc99",
            "\ud83d\ude18",
            "\ud83d\udcf7",
            "\ud83c\uddfa\ud83c\uddf8",
            "\u2600",
            "\ud83d\udc9c",
            "\ud83d\ude09",
            "\ud83d\udcaf",
            "\ud83d\ude01",
            "\ud83c\udf84",
            "\ud83d\udcf8",
            "\ud83d\ude1c"
        ],
        "names_file": null,
        "id": null,
        "_type": "ClassLabel"
    }
}

cảm xúc

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:tweet_eval/emotion')

Sự miêu tả :

TweetEval consists of seven heterogenous tasks in Twitter, all framed as multi-class tweet classification. All tasks have been unified into the same benchmark, with each dataset presented in the same format and with fixed training, validation and test splits.

Giấy phép : Không có giấy phép được biết đến
Phiên bản : 1.1.0
Chia tách :

Tách ra	Ví dụ
`'test'`	1421
`'train'`	3257
`'validation'`	374

Đặc trưng :

{
    "text": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "label": {
        "num_classes": 4,
        "names": [
            "anger",
            "joy",
            "optimism",
            "sadness"
        ],
        "names_file": null,
        "id": null,
        "_type": "ClassLabel"
    }
}

ghét

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:tweet_eval/hate')

Sự miêu tả :

TweetEval consists of seven heterogenous tasks in Twitter, all framed as multi-class tweet classification. All tasks have been unified into the same benchmark, with each dataset presented in the same format and with fixed training, validation and test splits.

Giấy phép : Không có giấy phép được biết đến
Phiên bản : 1.1.0
Chia tách :

Tách ra	Ví dụ
`'test'`	2970
`'train'`	9000
`'validation'`	1000

Đặc trưng :

{
    "text": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "label": {
        "num_classes": 2,
        "names": [
            "non-hate",
            "hate"
        ],
        "names_file": null,
        "id": null,
        "_type": "ClassLabel"
    }
}

sự mỉa mai

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:tweet_eval/irony')

Sự miêu tả :

TweetEval consists of seven heterogenous tasks in Twitter, all framed as multi-class tweet classification. All tasks have been unified into the same benchmark, with each dataset presented in the same format and with fixed training, validation and test splits.

Giấy phép : Không có giấy phép được biết đến
Phiên bản : 1.1.0
Chia tách :

Tách ra	Ví dụ
`'test'`	784
`'train'`	2862
`'validation'`	955

Đặc trưng :

{
    "text": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "label": {
        "num_classes": 2,
        "names": [
            "non_irony",
            "irony"
        ],
        "names_file": null,
        "id": null,
        "_type": "ClassLabel"
    }
}

xúc phạm

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:tweet_eval/offensive')

Sự miêu tả :

TweetEval consists of seven heterogenous tasks in Twitter, all framed as multi-class tweet classification. All tasks have been unified into the same benchmark, with each dataset presented in the same format and with fixed training, validation and test splits.

Giấy phép : Không có giấy phép được biết đến
Phiên bản : 1.1.0
Chia tách :

Tách ra	Ví dụ
`'test'`	860
`'train'`	11916
`'validation'`	1324

Đặc trưng :

{
    "text": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "label": {
        "num_classes": 2,
        "names": [
            "non-offensive",
            "offensive"
        ],
        "names_file": null,
        "id": null,
        "_type": "ClassLabel"
    }
}

tình cảm

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:tweet_eval/sentiment')

Sự miêu tả :

TweetEval consists of seven heterogenous tasks in Twitter, all framed as multi-class tweet classification. All tasks have been unified into the same benchmark, with each dataset presented in the same format and with fixed training, validation and test splits.

Giấy phép : Không có giấy phép được biết đến
Phiên bản : 1.1.0
Chia tách :

Tách ra	Ví dụ
`'test'`	12284
`'train'`	45615
`'validation'`	2000

Đặc trưng :

{
    "text": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "label": {
        "num_classes": 3,
        "names": [
            "negative",
            "neutral",
            "positive"
        ],
        "names_file": null,
        "id": null,
        "_type": "ClassLabel"
    }
}

lập trường_phá thai

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:tweet_eval/stance_abortion')

Sự miêu tả :

TweetEval consists of seven heterogenous tasks in Twitter, all framed as multi-class tweet classification. All tasks have been unified into the same benchmark, with each dataset presented in the same format and with fixed training, validation and test splits.

Giấy phép : Không có giấy phép được biết đến
Phiên bản : 1.1.0
Chia tách :

Tách ra	Ví dụ
`'test'`	280
`'train'`	587
`'validation'`	66

Đặc trưng :

{
    "text": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "label": {
        "num_classes": 3,
        "names": [
            "none",
            "against",
            "favor"
        ],
        "names_file": null,
        "id": null,
        "_type": "ClassLabel"
    }
}

lập trường_vô thần

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:tweet_eval/stance_atheism')

Sự miêu tả :

TweetEval consists of seven heterogenous tasks in Twitter, all framed as multi-class tweet classification. All tasks have been unified into the same benchmark, with each dataset presented in the same format and with fixed training, validation and test splits.

Giấy phép : Không có giấy phép được biết đến
Phiên bản : 1.1.0
Chia tách :

Tách ra	Ví dụ
`'test'`	220
`'train'`	461
`'validation'`	52

Đặc trưng :

{
    "text": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "label": {
        "num_classes": 3,
        "names": [
            "none",
            "against",
            "favor"
        ],
        "names_file": null,
        "id": null,
        "_type": "ClassLabel"
    }
}

lập trường_khí hậu

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:tweet_eval/stance_climate')

Sự miêu tả :

TweetEval consists of seven heterogenous tasks in Twitter, all framed as multi-class tweet classification. All tasks have been unified into the same benchmark, with each dataset presented in the same format and with fixed training, validation and test splits.

Giấy phép : Không có giấy phép được biết đến
Phiên bản : 1.1.0
Chia tách :

Tách ra	Ví dụ
`'test'`	169
`'train'`	355
`'validation'`	40

Đặc trưng :

{
    "text": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "label": {
        "num_classes": 3,
        "names": [
            "none",
            "against",
            "favor"
        ],
        "names_file": null,
        "id": null,
        "_type": "ClassLabel"
    }
}

lập trường_nữ quyền

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:tweet_eval/stance_feminist')

Sự miêu tả :

TweetEval consists of seven heterogenous tasks in Twitter, all framed as multi-class tweet classification. All tasks have been unified into the same benchmark, with each dataset presented in the same format and with fixed training, validation and test splits.

Giấy phép : Không có giấy phép được biết đến
Phiên bản : 1.1.0
Chia tách :

Tách ra	Ví dụ
`'test'`	285
`'train'`	597
`'validation'`	67

Đặc trưng :

{
    "text": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "label": {
        "num_classes": 3,
        "names": [
            "none",
            "against",
            "favor"
        ],
        "names_file": null,
        "id": null,
        "_type": "ClassLabel"
    }
}

lập trường_hillary

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:tweet_eval/stance_hillary')

Sự miêu tả :

TweetEval consists of seven heterogenous tasks in Twitter, all framed as multi-class tweet classification. All tasks have been unified into the same benchmark, with each dataset presented in the same format and with fixed training, validation and test splits.

Giấy phép : Không có giấy phép được biết đến
Phiên bản : 1.1.0
Chia tách :

Tách ra	Ví dụ
`'test'`	295
`'train'`	620
`'validation'`	69

Đặc trưng :

{
    "text": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "label": {
        "num_classes": 3,
        "names": [
            "none",
            "against",
            "favor"
        ],
        "names_file": null,
        "id": null,
        "_type": "ClassLabel"
    }
}