더 미세한

참고자료:

더 미세한

TFDS에 이 데이터세트를 로드하려면 다음 명령어를 사용하세요.

ds = tfds.load('huggingface:finer/finer')
  • 설명 :
The directory data contains a corpus of Finnish technology related news articles with a manually prepared
named entity annotation (digitoday.2014.csv). The text material was extracted from the archives of Digitoday,
a Finnish online technology news source (www.digitoday.fi). The corpus consists of 953 articles
(193,742 word tokens) with six named entity classes (organization, location, person, product, event, and date).
The corpus is available for research purposes and can be readily used for development of NER systems for Finnish.
  • 라이센스 : 알려진 라이센스 없음
  • 버전 : 1.0.0
  • 분할 :
나뉘다
'test' 3512
'test_wikipedia' 3360
'train' 13497
'validation' 986
  • 특징 :
{
    "id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "tokens": {
        "feature": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        },
        "length": -1,
        "id": null,
        "_type": "Sequence"
    },
    "ner_tags": {
        "feature": {
            "num_classes": 13,
            "names": [
                "O",
                "B-DATE",
                "B-EVENT",
                "B-LOC",
                "B-ORG",
                "B-PER",
                "B-PRO",
                "I-DATE",
                "I-EVENT",
                "I-LOC",
                "I-ORG",
                "I-PER",
                "I-PRO"
            ],
            "names_file": null,
            "id": null,
            "_type": "ClassLabel"
        },
        "length": -1,
        "id": null,
        "_type": "Sequence"
    },
    "nested_ner_tags": {
        "feature": {
            "num_classes": 13,
            "names": [
                "O",
                "B-DATE",
                "B-EVENT",
                "B-LOC",
                "B-ORG",
                "B-PER",
                "B-PRO",
                "I-DATE",
                "I-EVENT",
                "I-LOC",
                "I-ORG",
                "I-PER",
                "I-PRO"
            ],
            "names_file": null,
            "id": null,
            "_type": "ClassLabel"
        },
        "length": -1,
        "id": null,
        "_type": "Sequence"
    }
}