hebrew_sentiment

참고자료:

토큰

TFDS에 이 데이터세트를 로드하려면 다음 명령어를 사용하세요.

ds = tfds.load('huggingface:hebrew_sentiment/token')
  • 설명 :
HebrewSentiment is a data set consists of 12,804 user comments to posts on the official Facebook page of Israels
president, Mr. Reuven Rivlin. In October 2015, we used the open software application Netvizz (Rieder,
2013) to scrape all the comments to all of the presidents posts in the period of June  August 2014,
the first three months of Rivlins presidency.2 While the presidents posts aimed at reconciling tensions
and called for tolerance and empathy, the sentiment expressed in the comments to the presidents posts
was polarized between citizens who warmly thanked the president, and citizens that fiercely critiqued his
policy. Of the 12,804 comments, 370 are neutral; 8,512 are positive, 3,922 negative.

Data Annotation: A trained researcher examined each comment and determined its sentiment value,
where comments with an overall positive sentiment were assigned the value 1, comments with an overall
negative sentiment were assigned the value -1, and comments that are off-topic to the posts content
were assigned the value 0. We validated the coding scheme by asking a second trained researcher to
code the same data. There was substantial agreement between raters (N of agreements: 10623, N of
disagreements: 2105, Coehns Kappa = 0.697, p = 0).
  • 라이센스 : 알려진 라이센스 없음
  • 버전 : 1.0.0
  • 분할 :
나뉘다
'test' 2560
'train' 10244
  • 특징 :
{
    "text": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "label": {
        "num_classes": 3,
        "names": [
            "pos",
            "neg",
            "off-topic"
        ],
        "names_file": null,
        "id": null,
        "_type": "ClassLabel"
    }
}

변형

TFDS에 이 데이터세트를 로드하려면 다음 명령어를 사용하세요.

ds = tfds.load('huggingface:hebrew_sentiment/morph')
  • 설명 :
HebrewSentiment is a data set consists of 12,804 user comments to posts on the official Facebook page of Israels
president, Mr. Reuven Rivlin. In October 2015, we used the open software application Netvizz (Rieder,
2013) to scrape all the comments to all of the presidents posts in the period of June  August 2014,
the first three months of Rivlins presidency.2 While the presidents posts aimed at reconciling tensions
and called for tolerance and empathy, the sentiment expressed in the comments to the presidents posts
was polarized between citizens who warmly thanked the president, and citizens that fiercely critiqued his
policy. Of the 12,804 comments, 370 are neutral; 8,512 are positive, 3,922 negative.

Data Annotation: A trained researcher examined each comment and determined its sentiment value,
where comments with an overall positive sentiment were assigned the value 1, comments with an overall
negative sentiment were assigned the value -1, and comments that are off-topic to the posts content
were assigned the value 0. We validated the coding scheme by asking a second trained researcher to
code the same data. There was substantial agreement between raters (N of agreements: 10623, N of
disagreements: 2105, Coehns Kappa = 0.697, p = 0).
  • 라이센스 : 알려진 라이센스 없음
  • 버전 : 1.0.0
  • 분할 :
나뉘다
'test' 2555
'train' 10221
  • 특징 :
{
    "text": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "label": {
        "num_classes": 3,
        "names": [
            "pos",
            "neg",
            "off-topic"
        ],
        "names_file": null,
        "id": null,
        "_type": "ClassLabel"
    }
}