参考文献:
書籍_v1_01
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:amazon_us_reviews/Books_v1_01')
- 説明:
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- ライセンス: 既知のライセンスはありません
- バージョン: 0.1.0
- 分割:
スプリット | 例 |
---|---|
'train' | 6106719 |
- 特徴:
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
ウォッチ_v1_00
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:amazon_us_reviews/Watches_v1_00')
- 説明:
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- ライセンス: 既知のライセンスはありません
- バージョン: 0.1.0
- 分割:
スプリット | 例 |
---|---|
'train' | 960872 |
- 特徴:
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
Personal_Care_Appliances_v1_00
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:amazon_us_reviews/Personal_Care_Appliances_v1_00')
- 説明:
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- ライセンス: 既知のライセンスはありません
- バージョン: 0.1.0
- 分割:
スプリット | 例 |
---|---|
'train' | 85981 |
- 特徴:
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
モバイル_エレクトロニクス_v1_00
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:amazon_us_reviews/Mobile_Electronics_v1_00')
- 説明:
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- ライセンス: 既知のライセンスはありません
- バージョン: 0.1.0
- 分割:
スプリット | 例 |
---|---|
'train' | 104975 |
- 特徴:
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
デジタルビデオゲーム_v1_00
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:amazon_us_reviews/Digital_Video_Games_v1_00')
- 説明:
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- ライセンス: 既知のライセンスはありません
- バージョン: 0.1.0
- 分割:
スプリット | 例 |
---|---|
'train' | 145431 |
- 特徴:
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
デジタル_ソフトウェア_v1_00
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:amazon_us_reviews/Digital_Software_v1_00')
- 説明:
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- ライセンス: 既知のライセンスはありません
- バージョン: 0.1.0
- 分割:
スプリット | 例 |
---|---|
'train' | 102084 |
- 特徴:
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
メジャー_アプライアンス_v1_00
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:amazon_us_reviews/Major_Appliances_v1_00')
- 説明:
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- ライセンス: 既知のライセンスはありません
- バージョン: 0.1.0
- 分割:
スプリット | 例 |
---|---|
'train' | 96901 |
- 特徴:
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
ギフト_カード_v1_00
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:amazon_us_reviews/Gift_Card_v1_00')
- 説明:
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- ライセンス: 既知のライセンスはありません
- バージョン: 0.1.0
- 分割:
スプリット | 例 |
---|---|
'train' | 149086 |
- 特徴:
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
ビデオ_v1_00
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:amazon_us_reviews/Video_v1_00')
- 説明:
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- ライセンス: 既知のライセンスはありません
- バージョン: 0.1.0
- 分割:
スプリット | 例 |
---|---|
'train' | 380604 |
- 特徴:
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
荷物_v1_00
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:amazon_us_reviews/Luggage_v1_00')
- 説明:
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- ライセンス: 既知のライセンスはありません
- バージョン: 0.1.0
- 分割:
スプリット | 例 |
---|---|
'train' | 348657 |
- 特徴:
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
ソフトウェア_v1_00
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:amazon_us_reviews/Software_v1_00')
- 説明:
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- ライセンス: 既知のライセンスはありません
- バージョン: 0.1.0
- 分割:
スプリット | 例 |
---|---|
'train' | 341931 |
- 特徴:
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
ビデオ_ゲーム_v1_00
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:amazon_us_reviews/Video_Games_v1_00')
- 説明:
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- ライセンス: 既知のライセンスはありません
- バージョン: 0.1.0
- 分割:
スプリット | 例 |
---|---|
'train' | 1785997 |
- 特徴:
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
家具_v1_00
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:amazon_us_reviews/Furniture_v1_00')
- 説明:
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- ライセンス: 既知のライセンスはありません
- バージョン: 0.1.0
- 分割:
スプリット | 例 |
---|---|
'train' | 792113 |
- 特徴:
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
ミュージカル_楽器_v1_00
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:amazon_us_reviews/Musical_Instruments_v1_00')
- 説明:
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- ライセンス: 既知のライセンスはありません
- バージョン: 0.1.0
- 分割:
スプリット | 例 |
---|---|
'train' | 904765 |
- 特徴:
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
デジタル_ミュージック_購入_v1_00
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:amazon_us_reviews/Digital_Music_Purchase_v1_00')
- 説明:
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- ライセンス: 既知のライセンスはありません
- バージョン: 0.1.0
- 分割:
スプリット | 例 |
---|---|
'train' | 1688884 |
- 特徴:
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
書籍_v1_02
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:amazon_us_reviews/Books_v1_02')
- 説明:
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- ライセンス: 既知のライセンスはありません
- バージョン: 0.1.0
- 分割:
スプリット | 例 |
---|---|
'train' | 3105520 |
- 特徴:
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
ホーム_エンターテイメント_v1_00
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:amazon_us_reviews/Home_Entertainment_v1_00')
- 説明:
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- ライセンス: 既知のライセンスはありません
- バージョン: 0.1.0
- 分割:
スプリット | 例 |
---|---|
'train' | 705889 |
- 特徴:
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
食料品_v1_00
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:amazon_us_reviews/Grocery_v1_00')
- 説明:
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- ライセンス: 不明なライセンス
- バージョン: 0.1.0
- 分割:
スプリット | 例 |
---|---|
'train' | 2402458 |
- 特徴:
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
アウトドア_v1_00
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:amazon_us_reviews/Outdoors_v1_00')
- 説明:
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- ライセンス: 不明なライセンス
- バージョン: 0.1.0
- 分割:
スプリット | 例 |
---|---|
'train' | 2302401 |
- 特徴:
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
ペット_製品_v1_00
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:amazon_us_reviews/Pet_Products_v1_00')
- 説明:
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- ライセンス: 既知のライセンスはありません
- バージョン: 0.1.0
- 分割:
スプリット | 例 |
---|---|
'train' | 2643619 |
- 特徴:
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
ビデオ_DVD_v1_00
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:amazon_us_reviews/Video_DVD_v1_00')
- 説明:
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- ライセンス: 既知のライセンスはありません
- バージョン: 0.1.0
- 分割:
スプリット | 例 |
---|---|
'train' | 5069140 |
- 特徴:
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
アパレル_v1_00
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:amazon_us_reviews/Apparel_v1_00')
- 説明:
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- ライセンス: 既知のライセンスはありません
- バージョン: 0.1.0
- 分割:
スプリット | 例 |
---|---|
'train' | 5906333 |
- 特徴:
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
PC_v1_00
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:amazon_us_reviews/PC_v1_00')
- 説明:
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- ライセンス: 不明なライセンス
- バージョン: 0.1.0
- 分割:
スプリット | 例 |
---|---|
'train' | 6908554 |
- 特徴:
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
ツール_v1_00
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:amazon_us_reviews/Tools_v1_00')
- 説明:
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- ライセンス: 不明なライセンス
- バージョン: 0.1.0
- 分割:
スプリット | 例 |
---|---|
'train' | 1741100 |
- 特徴:
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
ジュエリー_v1_00
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:amazon_us_reviews/Jewelry_v1_00')
- 説明:
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- ライセンス: 既知のライセンスはありません
- バージョン: 0.1.0
- 分割:
スプリット | 例 |
---|---|
'train' | 1767753 |
- 特徴:
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
ベイビー_v1_00
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:amazon_us_reviews/Baby_v1_00')
- 説明:
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- ライセンス: 不明なライセンス
- バージョン: 0.1.0
- 分割:
スプリット | 例 |
---|---|
'train' | 1752932 |
- 特徴:
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
ホーム_改善_v1_00
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:amazon_us_reviews/Home_Improvement_v1_00')
- 説明:
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- ライセンス: 不明なライセンス
- バージョン: 0.1.0
- 分割:
スプリット | 例 |
---|---|
'train' | 2634781 |
- 特徴:
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
カメラ_v1_00
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:amazon_us_reviews/Camera_v1_00')
- 説明:
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- ライセンス: 既知のライセンスはありません
- バージョン: 0.1.0
- 分割:
スプリット | 例 |
---|---|
'train' | 1801974 |
- 特徴:
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
芝生と庭_v1_00
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:amazon_us_reviews/Lawn_and_Garden_v1_00')
- 説明:
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- ライセンス: 既知のライセンスはありません
- バージョン: 0.1.0
- 分割:
スプリット | 例 |
---|---|
'train' | 2557288 |
- 特徴:
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
Office_製品_v1_00
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:amazon_us_reviews/Office_Products_v1_00')
- 説明:
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- ライセンス: 既知のライセンスはありません
- バージョン: 0.1.0
- 分割:
スプリット | 例 |
---|---|
'train' | 2642434 |
- 特徴:
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
エレクトロニクス_v1_00
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:amazon_us_reviews/Electronics_v1_00')
- 説明:
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- ライセンス: 不明なライセンス
- バージョン: 0.1.0
- 分割:
スプリット | 例 |
---|---|
'train' | 3093869 |
- 特徴:
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
自動車_v1_00
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:amazon_us_reviews/Automotive_v1_00')
- 説明:
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- ライセンス: 既知のライセンスはありません
- バージョン: 0.1.0
- 分割:
スプリット | 例 |
---|---|
'train' | 3514942 |
- 特徴:
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
デジタルビデオダウンロード_v1_00
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:amazon_us_reviews/Digital_Video_Download_v1_00')
- 説明:
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- ライセンス: 既知のライセンスはありません
- バージョン: 0.1.0
- 分割:
スプリット | 例 |
---|---|
'train' | 4057147 |
- 特徴:
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
モバイル_アプリ_v1_00
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:amazon_us_reviews/Mobile_Apps_v1_00')
- 説明:
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- ライセンス: 既知のライセンスはありません
- バージョン: 0.1.0
- 分割:
スプリット | 例 |
---|---|
'train' | 5033376 |
- 特徴:
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
靴_v1_00
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:amazon_us_reviews/Shoes_v1_00')
- 説明:
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- ライセンス: 既知のライセンスはありません
- バージョン: 0.1.0
- 分割:
スプリット | 例 |
---|---|
'train' | 4366916 |
- 特徴:
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
おもちゃ_v1_00
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:amazon_us_reviews/Toys_v1_00')
- 説明:
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- ライセンス: 既知のライセンスはありません
- バージョン: 0.1.0
- 分割:
スプリット | 例 |
---|---|
'train' | 4864249 |
- 特徴:
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
スポーツ_v1_00
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:amazon_us_reviews/Sports_v1_00')
- 説明:
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- ライセンス: 不明なライセンス
- バージョン: 0.1.0
- 分割:
スプリット | 例 |
---|---|
'train' | 4850360 |
- 特徴:
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
キッチン_v1_00
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:amazon_us_reviews/Kitchen_v1_00')
- 説明:
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- ライセンス: 既知のライセンスはありません
- バージョン: 0.1.0
- 分割:
スプリット | 例 |
---|---|
'train' | 4880466 |
- 特徴:
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
ビューティー_v1_00
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:amazon_us_reviews/Beauty_v1_00')
- 説明:
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- ライセンス: 既知のライセンスはありません
- バージョン: 0.1.0
- 分割:
スプリット | 例 |
---|---|
'train' | 5115666 |
- 特徴:
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
ミュージック_v1_00
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:amazon_us_reviews/Music_v1_00')
- 説明:
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- ライセンス: 既知のライセンスはありません
- バージョン: 0.1.0
- 分割:
スプリット | 例 |
---|---|
'train' | 4751577 |
- 特徴:
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
Health_Personal_Care_v1_00
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:amazon_us_reviews/Health_Personal_Care_v1_00')
- 説明:
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- ライセンス: 既知のライセンスはありません
- バージョン: 0.1.0
- 分割:
スプリット | 例 |
---|---|
'train' | 5331449 |
- 特徴:
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
デジタル_電子ブック_購入_v1_01
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:amazon_us_reviews/Digital_Ebook_Purchase_v1_01')
- 説明:
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- ライセンス: 不明なライセンス
- バージョン: 0.1.0
- 分割:
スプリット | 例 |
---|---|
'train' | 5101693 |
- 特徴:
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
ホーム_v1_00
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:amazon_us_reviews/Home_v1_00')
- 説明:
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- ライセンス: 既知のライセンスはありません
- バージョン: 0.1.0
- 分割:
スプリット | 例 |
---|---|
'train' | 6221559 |
- 特徴:
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
ワイヤレス_v1_00
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:amazon_us_reviews/Wireless_v1_00')
- 説明:
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- ライセンス: 不明なライセンス
- バージョン: 0.1.0
- 分割:
スプリット | 例 |
---|---|
'train' | 9002021 |
- 特徴:
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
書籍_v1_00
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:amazon_us_reviews/Books_v1_00')
- 説明:
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- ライセンス: 既知のライセンスはありません
- バージョン: 0.1.0
- 分割:
スプリット | 例 |
---|---|
'train' | 10319090 |
- 特徴:
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
デジタル_電子ブック_購入_v1_00
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:amazon_us_reviews/Digital_Ebook_Purchase_v1_00')
- 説明:
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- ライセンス: 既知のライセンスはありません
- バージョン: 0.1.0
- 分割:
スプリット | 例 |
---|---|
'train' | 12520722 |
- 特徴:
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}