Références :
Livres_v1_01
Utilisez la commande suivante pour charger cet ensemble de données dans TFDS :
ds = tfds.load('huggingface:amazon_us_reviews/Books_v1_01')
- Description :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- Licence : Aucune licence connue
- Version : 0.1.0
- Divisions :
Diviser | Exemples |
---|---|
'train' | 6106719 |
- Caractéristiques :
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
Montres_v1_00
Utilisez la commande suivante pour charger cet ensemble de données dans TFDS :
ds = tfds.load('huggingface:amazon_us_reviews/Watches_v1_00')
- Description :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- Licence : Aucune licence connue
- Version : 0.1.0
- Divisions :
Diviser | Exemples |
---|---|
'train' | 960872 |
- Caractéristiques :
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
Personal_Care_Appliances_v1_00
Utilisez la commande suivante pour charger cet ensemble de données dans TFDS :
ds = tfds.load('huggingface:amazon_us_reviews/Personal_Care_Appliances_v1_00')
- Description :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- Licence : Aucune licence connue
- Version : 0.1.0
- Divisions :
Diviser | Exemples |
---|---|
'train' | 85981 |
- Caractéristiques :
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
Mobile_Electronics_v1_00
Utilisez la commande suivante pour charger cet ensemble de données dans TFDS :
ds = tfds.load('huggingface:amazon_us_reviews/Mobile_Electronics_v1_00')
- Description :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- Licence : Aucune licence connue
- Version : 0.1.0
- Divisions :
Diviser | Exemples |
---|---|
'train' | 104975 |
- Caractéristiques :
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
Jeux_vidéo_numériques_v1_00
Utilisez la commande suivante pour charger cet ensemble de données dans TFDS :
ds = tfds.load('huggingface:amazon_us_reviews/Digital_Video_Games_v1_00')
- Description :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- Licence : Aucune licence connue
- Version : 0.1.0
- Divisions :
Diviser | Exemples |
---|---|
'train' | 145431 |
- Caractéristiques :
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
Logiciel_numérique_v1_00
Utilisez la commande suivante pour charger cet ensemble de données dans TFDS :
ds = tfds.load('huggingface:amazon_us_reviews/Digital_Software_v1_00')
- Description :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- Licence : Aucune licence connue
- Version : 0.1.0
- Divisions :
Diviser | Exemples |
---|---|
'train' | 102084 |
- Caractéristiques :
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
Major_Appliances_v1_00
Utilisez la commande suivante pour charger cet ensemble de données dans TFDS :
ds = tfds.load('huggingface:amazon_us_reviews/Major_Appliances_v1_00')
- Description :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- Licence : Aucune licence connue
- Version : 0.1.0
- Divisions :
Diviser | Exemples |
---|---|
'train' | 96901 |
- Caractéristiques :
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
Gift_Card_v1_00
Utilisez la commande suivante pour charger cet ensemble de données dans TFDS :
ds = tfds.load('huggingface:amazon_us_reviews/Gift_Card_v1_00')
- Description :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- Licence : Aucune licence connue
- Version : 0.1.0
- Divisions :
Diviser | Exemples |
---|---|
'train' | 149086 |
- Caractéristiques :
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
Vidéo_v1_00
Utilisez la commande suivante pour charger cet ensemble de données dans TFDS :
ds = tfds.load('huggingface:amazon_us_reviews/Video_v1_00')
- Description :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- Licence : Aucune licence connue
- Version : 0.1.0
- Divisions :
Diviser | Exemples |
---|---|
'train' | 380604 |
- Caractéristiques :
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
Bagages_v1_00
Utilisez la commande suivante pour charger cet ensemble de données dans TFDS :
ds = tfds.load('huggingface:amazon_us_reviews/Luggage_v1_00')
- Description :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- Licence : Aucune licence connue
- Version : 0.1.0
- Divisions :
Diviser | Exemples |
---|---|
'train' | 348657 |
- Caractéristiques :
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
Logiciel_v1_00
Utilisez la commande suivante pour charger cet ensemble de données dans TFDS :
ds = tfds.load('huggingface:amazon_us_reviews/Software_v1_00')
- Description :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- Licence : Aucune licence connue
- Version : 0.1.0
- Divisions :
Diviser | Exemples |
---|---|
'train' | 341931 |
- Caractéristiques :
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
Jeux_vidéo_v1_00
Utilisez la commande suivante pour charger cet ensemble de données dans TFDS :
ds = tfds.load('huggingface:amazon_us_reviews/Video_Games_v1_00')
- Description :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- Licence : Aucune licence connue
- Version : 0.1.0
- Divisions :
Diviser | Exemples |
---|---|
'train' | 1785997 |
- Caractéristiques :
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
Meubles_v1_00
Utilisez la commande suivante pour charger cet ensemble de données dans TFDS :
ds = tfds.load('huggingface:amazon_us_reviews/Furniture_v1_00')
- Description :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- Licence : Aucune licence connue
- Version : 0.1.0
- Divisions :
Diviser | Exemples |
---|---|
'train' | 792113 |
- Caractéristiques :
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
Instruments_musicaux_v1_00
Utilisez la commande suivante pour charger cet ensemble de données dans TFDS :
ds = tfds.load('huggingface:amazon_us_reviews/Musical_Instruments_v1_00')
- Description :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- Licence : Aucune licence connue
- Version : 0.1.0
- Divisions :
Diviser | Exemples |
---|---|
'train' | 904765 |
- Caractéristiques :
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
Digital_Music_Purchase_v1_00
Utilisez la commande suivante pour charger cet ensemble de données dans TFDS :
ds = tfds.load('huggingface:amazon_us_reviews/Digital_Music_Purchase_v1_00')
- Description :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- Licence : Aucune licence connue
- Version : 0.1.0
- Divisions :
Diviser | Exemples |
---|---|
'train' | 1688884 |
- Caractéristiques :
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
Livres_v1_02
Utilisez la commande suivante pour charger cet ensemble de données dans TFDS :
ds = tfds.load('huggingface:amazon_us_reviews/Books_v1_02')
- Description :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- Licence : Aucune licence connue
- Version : 0.1.0
- Divisions :
Diviser | Exemples |
---|---|
'train' | 3105520 |
- Caractéristiques :
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
Accueil_Divertissement_v1_00
Utilisez la commande suivante pour charger cet ensemble de données dans TFDS :
ds = tfds.load('huggingface:amazon_us_reviews/Home_Entertainment_v1_00')
- Description :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- Licence : Aucune licence connue
- Version : 0.1.0
- Divisions :
Diviser | Exemples |
---|---|
'train' | 705889 |
- Caractéristiques :
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
Épicerie_v1_00
Utilisez la commande suivante pour charger cet ensemble de données dans TFDS :
ds = tfds.load('huggingface:amazon_us_reviews/Grocery_v1_00')
- Description :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- Licence : Aucune licence connue
- Version : 0.1.0
- Divisions :
Diviser | Exemples |
---|---|
'train' | 2402458 |
- Caractéristiques :
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
Extérieur_v1_00
Utilisez la commande suivante pour charger cet ensemble de données dans TFDS :
ds = tfds.load('huggingface:amazon_us_reviews/Outdoors_v1_00')
- Description :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- Licence : Aucune licence connue
- Version : 0.1.0
- Divisions :
Diviser | Exemples |
---|---|
'train' | 2302401 |
- Caractéristiques :
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
Produits_pouranimaux_v1_00
Utilisez la commande suivante pour charger cet ensemble de données dans TFDS :
ds = tfds.load('huggingface:amazon_us_reviews/Pet_Products_v1_00')
- Description :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- Licence : Aucune licence connue
- Version : 0.1.0
- Divisions :
Diviser | Exemples |
---|---|
'train' | 2643619 |
- Caractéristiques :
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
Vidéo_DVD_v1_00
Utilisez la commande suivante pour charger cet ensemble de données dans TFDS :
ds = tfds.load('huggingface:amazon_us_reviews/Video_DVD_v1_00')
- Description :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- Licence : Aucune licence connue
- Version : 0.1.0
- Divisions :
Diviser | Exemples |
---|---|
'train' | 5069140 |
- Caractéristiques :
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
Vêtements_v1_00
Utilisez la commande suivante pour charger cet ensemble de données dans TFDS :
ds = tfds.load('huggingface:amazon_us_reviews/Apparel_v1_00')
- Description :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- Licence : Aucune licence connue
- Version : 0.1.0
- Divisions :
Diviser | Exemples |
---|---|
'train' | 5906333 |
- Caractéristiques :
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
PC_v1_00
Utilisez la commande suivante pour charger cet ensemble de données dans TFDS :
ds = tfds.load('huggingface:amazon_us_reviews/PC_v1_00')
- Description :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- Licence : Aucune licence connue
- Version : 0.1.0
- Divisions :
Diviser | Exemples |
---|---|
'train' | 6908554 |
- Caractéristiques :
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
Outils_v1_00
Utilisez la commande suivante pour charger cet ensemble de données dans TFDS :
ds = tfds.load('huggingface:amazon_us_reviews/Tools_v1_00')
- Description :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- Licence : Aucune licence connue
- Version : 0.1.0
- Divisions :
Diviser | Exemples |
---|---|
'train' | 1741100 |
- Caractéristiques :
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
Bijoux_v1_00
Utilisez la commande suivante pour charger cet ensemble de données dans TFDS :
ds = tfds.load('huggingface:amazon_us_reviews/Jewelry_v1_00')
- Description :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- Licence : Aucune licence connue
- Version : 0.1.0
- Divisions :
Diviser | Exemples |
---|---|
'train' | 1767753 |
- Caractéristiques :
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
Bébé_v1_00
Utilisez la commande suivante pour charger cet ensemble de données dans TFDS :
ds = tfds.load('huggingface:amazon_us_reviews/Baby_v1_00')
- Description :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- Licence : Aucune licence connue
- Version : 0.1.0
- Divisions :
Diviser | Exemples |
---|---|
'train' | 1752932 |
- Caractéristiques :
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
Accueil_Improvement_v1_00
Utilisez la commande suivante pour charger cet ensemble de données dans TFDS :
ds = tfds.load('huggingface:amazon_us_reviews/Home_Improvement_v1_00')
- Description :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- Licence : Aucune licence connue
- Version : 0.1.0
- Divisions :
Diviser | Exemples |
---|---|
'train' | 2634781 |
- Caractéristiques :
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
Caméra_v1_00
Utilisez la commande suivante pour charger cet ensemble de données dans TFDS :
ds = tfds.load('huggingface:amazon_us_reviews/Camera_v1_00')
- Description :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- Licence : Aucune licence connue
- Version : 0.1.0
- Divisions :
Diviser | Exemples |
---|---|
'train' | 1801974 |
- Caractéristiques :
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
Pelouse_et_Garden_v1_00
Utilisez la commande suivante pour charger cet ensemble de données dans TFDS :
ds = tfds.load('huggingface:amazon_us_reviews/Lawn_and_Garden_v1_00')
- Description :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- Licence : Aucune licence connue
- Version : 0.1.0
- Divisions :
Diviser | Exemples |
---|---|
'train' | 2557288 |
- Caractéristiques :
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
Office_Products_v1_00
Utilisez la commande suivante pour charger cet ensemble de données dans TFDS :
ds = tfds.load('huggingface:amazon_us_reviews/Office_Products_v1_00')
- Description :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- Licence : Aucune licence connue
- Version : 0.1.0
- Divisions :
Diviser | Exemples |
---|---|
'train' | 2642434 |
- Caractéristiques :
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
Électronique_v1_00
Utilisez la commande suivante pour charger cet ensemble de données dans TFDS :
ds = tfds.load('huggingface:amazon_us_reviews/Electronics_v1_00')
- Description :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- Licence : Aucune licence connue
- Version : 0.1.0
- Divisions :
Diviser | Exemples |
---|---|
'train' | 3093869 |
- Caractéristiques :
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
Automobile_v1_00
Utilisez la commande suivante pour charger cet ensemble de données dans TFDS :
ds = tfds.load('huggingface:amazon_us_reviews/Automotive_v1_00')
- Description :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- Licence : Aucune licence connue
- Version : 0.1.0
- Divisions :
Diviser | Exemples |
---|---|
'train' | 3514942 |
- Caractéristiques :
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
Téléchargement_vidéo_numérique_v1_00
Utilisez la commande suivante pour charger cet ensemble de données dans TFDS :
ds = tfds.load('huggingface:amazon_us_reviews/Digital_Video_Download_v1_00')
- Description :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- Licence : Aucune licence connue
- Version : 0.1.0
- Divisions :
Diviser | Exemples |
---|---|
'train' | 4057147 |
- Caractéristiques :
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
Applications_mobiles_v1_00
Utilisez la commande suivante pour charger cet ensemble de données dans TFDS :
ds = tfds.load('huggingface:amazon_us_reviews/Mobile_Apps_v1_00')
- Description :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- Licence : Aucune licence connue
- Version : 0.1.0
- Divisions :
Diviser | Exemples |
---|---|
'train' | 5033376 |
- Caractéristiques :
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
Chaussures_v1_00
Utilisez la commande suivante pour charger cet ensemble de données dans TFDS :
ds = tfds.load('huggingface:amazon_us_reviews/Shoes_v1_00')
- Description :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- Licence : Aucune licence connue
- Version : 0.1.0
- Divisions :
Diviser | Exemples |
---|---|
'train' | 4366916 |
- Caractéristiques :
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
Jouets_v1_00
Utilisez la commande suivante pour charger cet ensemble de données dans TFDS :
ds = tfds.load('huggingface:amazon_us_reviews/Toys_v1_00')
- Description :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- Licence : Aucune licence connue
- Version : 0.1.0
- Divisions :
Diviser | Exemples |
---|---|
'train' | 4864249 |
- Caractéristiques :
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
Sports_v1_00
Utilisez la commande suivante pour charger cet ensemble de données dans TFDS :
ds = tfds.load('huggingface:amazon_us_reviews/Sports_v1_00')
- Description :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- Licence : Aucune licence connue
- Version : 0.1.0
- Divisions :
Diviser | Exemples |
---|---|
'train' | 4850360 |
- Caractéristiques :
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
Cuisine_v1_00
Utilisez la commande suivante pour charger cet ensemble de données dans TFDS :
ds = tfds.load('huggingface:amazon_us_reviews/Kitchen_v1_00')
- Description :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- Licence : Aucune licence connue
- Version : 0.1.0
- Divisions :
Diviser | Exemples |
---|---|
'train' | 4880466 |
- Caractéristiques :
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
Beauté_v1_00
Utilisez la commande suivante pour charger cet ensemble de données dans TFDS :
ds = tfds.load('huggingface:amazon_us_reviews/Beauty_v1_00')
- Description :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- Licence : Aucune licence connue
- Version : 0.1.0
- Divisions :
Diviser | Exemples |
---|---|
'train' | 5115666 |
- Caractéristiques :
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
Musique_v1_00
Utilisez la commande suivante pour charger cet ensemble de données dans TFDS :
ds = tfds.load('huggingface:amazon_us_reviews/Music_v1_00')
- Description :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- Licence : Aucune licence connue
- Version : 0.1.0
- Divisions :
Diviser | Exemples |
---|---|
'train' | 4751577 |
- Caractéristiques :
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
Santé_Personal_Care_v1_00
Utilisez la commande suivante pour charger cet ensemble de données dans TFDS :
ds = tfds.load('huggingface:amazon_us_reviews/Health_Personal_Care_v1_00')
- Description :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- Licence : Aucune licence connue
- Version : 0.1.0
- Divisions :
Diviser | Exemples |
---|---|
'train' | 5331449 |
- Caractéristiques :
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
Digital_Ebook_Purchase_v1_01
Utilisez la commande suivante pour charger cet ensemble de données dans TFDS :
ds = tfds.load('huggingface:amazon_us_reviews/Digital_Ebook_Purchase_v1_01')
- Description :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- Licence : Aucune licence connue
- Version : 0.1.0
- Divisions :
Diviser | Exemples |
---|---|
'train' | 5101693 |
- Caractéristiques :
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
Accueil_v1_00
Utilisez la commande suivante pour charger cet ensemble de données dans TFDS :
ds = tfds.load('huggingface:amazon_us_reviews/Home_v1_00')
- Description :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- Licence : Aucune licence connue
- Version : 0.1.0
- Divisions :
Diviser | Exemples |
---|---|
'train' | 6221559 |
- Caractéristiques :
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
Sans fil_v1_00
Utilisez la commande suivante pour charger cet ensemble de données dans TFDS :
ds = tfds.load('huggingface:amazon_us_reviews/Wireless_v1_00')
- Description :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- Licence : Aucune licence connue
- Version : 0.1.0
- Divisions :
Diviser | Exemples |
---|---|
'train' | 9002021 |
- Caractéristiques :
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
Livres_v1_00
Utilisez la commande suivante pour charger cet ensemble de données dans TFDS :
ds = tfds.load('huggingface:amazon_us_reviews/Books_v1_00')
- Description :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- Licence : Aucune licence connue
- Version : 0.1.0
- Divisions :
Diviser | Exemples |
---|---|
'train' | 10319090 |
- Caractéristiques :
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
Digital_Ebook_Purchase_v1_00
Utilisez la commande suivante pour charger cet ensemble de données dans TFDS :
ds = tfds.load('huggingface:amazon_us_reviews/Digital_Ebook_Purchase_v1_00')
- Description :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns:
- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
- Licence : Aucune licence connue
- Version : 0.1.0
- Divisions :
Diviser | Exemples |
---|---|
'train' | 12520722 |
- Caractéristiques :
{
"marketplace": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"customer_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_parent": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"product_category": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"star_rating": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"helpful_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"total_votes": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"vine": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"verified_purchase": {
"num_classes": 2,
"names": [
"N",
"Y"
],
"id": null,
"_type": "ClassLabel"
},
"review_headline": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_body": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"review_date": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}