so_stacksample

مراجع:

پاسخ ها

برای بارگذاری این مجموعه داده در TFDS از دستور زیر استفاده کنید:

ds = tfds.load('huggingface:so_stacksample/Answers')
  • توضیحات :
Dataset with the text of 10% of questions and answers from the Stack Overflow programming Q&A website.

This is organized as three tables:

Questions contains the title, body, creation date, closed date (if applicable), score, and owner ID for all non-deleted Stack Overflow questions whose Id is a multiple of 10.
Answers contains the body, creation date, score, and owner ID for each of the answers to these questions. The ParentId column links back to the Questions table.
Tags contains the tags on each of these questions
  • مجوز : همه مشارکت‌های کاربر Stack Overflow تحت مجوز CC-BY-SA 3.0 با ذکر منبع مورد نیاز هستند.
  • نسخه : 1.1.0
  • تقسیم ها :
تقسیم کنید نمونه ها
'Answers' 2014516
  • ویژگی ها :
{
    "Id": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "OwnerUserId": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "CreationDate": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "ParentId": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "Score": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "Body": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}

سوالات

برای بارگذاری این مجموعه داده در TFDS از دستور زیر استفاده کنید:

ds = tfds.load('huggingface:so_stacksample/Questions')
  • توضیحات :
Dataset with the text of 10% of questions and answers from the Stack Overflow programming Q&A website.

This is organized as three tables:

Questions contains the title, body, creation date, closed date (if applicable), score, and owner ID for all non-deleted Stack Overflow questions whose Id is a multiple of 10.
Answers contains the body, creation date, score, and owner ID for each of the answers to these questions. The ParentId column links back to the Questions table.
Tags contains the tags on each of these questions
  • مجوز : همه مشارکت‌های کاربر Stack Overflow تحت مجوز CC-BY-SA 3.0 با ذکر منبع مورد نیاز هستند.
  • نسخه : 1.1.0
  • تقسیم ها :
تقسیم کنید نمونه ها
'Questions' 1264216
  • ویژگی ها :
{
    "Id": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "OwnerUserId": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "CreationDate": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "ClosedDate": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "Score": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "Title": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "Body": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}

برچسب ها

برای بارگذاری این مجموعه داده در TFDS از دستور زیر استفاده کنید:

ds = tfds.load('huggingface:so_stacksample/Tags')
  • توضیحات :
Dataset with the text of 10% of questions and answers from the Stack Overflow programming Q&A website.

This is organized as three tables:

Questions contains the title, body, creation date, closed date (if applicable), score, and owner ID for all non-deleted Stack Overflow questions whose Id is a multiple of 10.
Answers contains the body, creation date, score, and owner ID for each of the answers to these questions. The ParentId column links back to the Questions table.
Tags contains the tags on each of these questions
  • مجوز : همه مشارکت‌های کاربر Stack Overflow تحت مجوز CC-BY-SA 3.0 با ذکر منبع مورد نیاز هستند.
  • نسخه : 1.1.0
  • تقسیم ها :
تقسیم کنید نمونه ها
'Tags' 3750994
  • ویژگی ها :
{
    "Id": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "Tag": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}