msr_genomics_kbcomp

참고자료:

TFDS에 이 데이터세트를 로드하려면 다음 명령어를 사용하세요.

ds = tfds.load('huggingface:msr_genomics_kbcomp')
  • 설명 :
The database is derived from the NCI PID Pathway Interaction Database, and the textual mentions are extracted from cooccurring pairs of genes in PubMed abstracts, processed and annotated by Literome (Poon et al. 2014). This dataset was used in the paper “Compositional Learning of Embeddings for Relation Paths in Knowledge Bases and Text” (Toutanova, Lin, Yih, Poon, and Quirk, 2016).
  • 라이센스 : 알려진 라이센스 없음
  • 버전 : 1.1.0
  • 분할 :
나뉘다
'test' 2784
'train' 12160
'validation' 1315
  • 특징 :
{
    "GENE1": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "relation": {
        "num_classes": 3,
        "names": [
            "Positive_regulation",
            "Negative_regulation",
            "Family"
        ],
        "names_file": null,
        "id": null,
        "_type": "ClassLabel"
    },
    "GENE2": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}