code_search_net

Références :

tous

Utilisez la commande suivante pour charger cet ensemble de données dans TFDS :

ds = tfds.load('huggingface:code_search_net/all')
  • Description :
CodeSearchNet corpus contains about 6 million functions from open-source code spanning six programming languages (Go, Java, JavaScript, PHP, Python, and Ruby). The CodeSearchNet Corpus also contains automatically generated query-like natural language for 2 million functions, obtained from mechanically scraping and preprocessing associated function documentation.
  • Licence : Divers
  • Version : 1.0.0
  • Divisions :
Diviser Exemples
'test' 100529
'train' 1880853
'validation' 89154
  • Caractéristiques :
{
    "repository_name": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "func_path_in_repository": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "func_name": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "whole_func_string": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "language": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "func_code_string": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "func_code_tokens": {
        "feature": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        },
        "length": -1,
        "id": null,
        "_type": "Sequence"
    },
    "func_documentation_string": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "func_documentation_tokens": {
        "feature": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        },
        "length": -1,
        "id": null,
        "_type": "Sequence"
    },
    "split_name": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "func_code_url": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}

Java

Utilisez la commande suivante pour charger cet ensemble de données dans TFDS :

ds = tfds.load('huggingface:code_search_net/java')
  • Description :
CodeSearchNet corpus contains about 6 million functions from open-source code spanning six programming languages (Go, Java, JavaScript, PHP, Python, and Ruby). The CodeSearchNet Corpus also contains automatically generated query-like natural language for 2 million functions, obtained from mechanically scraping and preprocessing associated function documentation.
  • Licence : Divers
  • Version : 1.0.0
  • Divisions :
Diviser Exemples
'test' 26909
'train' 454451
'validation' 15328
  • Caractéristiques :
{
    "repository_name": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "func_path_in_repository": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "func_name": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "whole_func_string": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "language": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "func_code_string": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "func_code_tokens": {
        "feature": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        },
        "length": -1,
        "id": null,
        "_type": "Sequence"
    },
    "func_documentation_string": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "func_documentation_tokens": {
        "feature": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        },
        "length": -1,
        "id": null,
        "_type": "Sequence"
    },
    "split_name": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "func_code_url": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}

aller

Utilisez la commande suivante pour charger cet ensemble de données dans TFDS :

ds = tfds.load('huggingface:code_search_net/go')
  • Description :
CodeSearchNet corpus contains about 6 million functions from open-source code spanning six programming languages (Go, Java, JavaScript, PHP, Python, and Ruby). The CodeSearchNet Corpus also contains automatically generated query-like natural language for 2 million functions, obtained from mechanically scraping and preprocessing associated function documentation.
  • Licence : Divers
  • Version : 1.0.0
  • Divisions :
Diviser Exemples
'test' 14291
'train' 317832
'validation' 14242
  • Caractéristiques :
{
    "repository_name": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "func_path_in_repository": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "func_name": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "whole_func_string": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "language": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "func_code_string": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "func_code_tokens": {
        "feature": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        },
        "length": -1,
        "id": null,
        "_type": "Sequence"
    },
    "func_documentation_string": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "func_documentation_tokens": {
        "feature": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        },
        "length": -1,
        "id": null,
        "_type": "Sequence"
    },
    "split_name": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "func_code_url": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}

python

Utilisez la commande suivante pour charger cet ensemble de données dans TFDS :

ds = tfds.load('huggingface:code_search_net/python')
  • Description :
CodeSearchNet corpus contains about 6 million functions from open-source code spanning six programming languages (Go, Java, JavaScript, PHP, Python, and Ruby). The CodeSearchNet Corpus also contains automatically generated query-like natural language for 2 million functions, obtained from mechanically scraping and preprocessing associated function documentation.
  • Licence : Divers
  • Version : 1.0.0
  • Divisions :
Diviser Exemples
'test' 22176
'train' 412178
'validation' 23107
  • Caractéristiques :
{
    "repository_name": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "func_path_in_repository": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "func_name": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "whole_func_string": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "language": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "func_code_string": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "func_code_tokens": {
        "feature": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        },
        "length": -1,
        "id": null,
        "_type": "Sequence"
    },
    "func_documentation_string": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "func_documentation_tokens": {
        "feature": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        },
        "length": -1,
        "id": null,
        "_type": "Sequence"
    },
    "split_name": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "func_code_url": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}

javascript

Utilisez la commande suivante pour charger cet ensemble de données dans TFDS :

ds = tfds.load('huggingface:code_search_net/javascript')
  • Description :
CodeSearchNet corpus contains about 6 million functions from open-source code spanning six programming languages (Go, Java, JavaScript, PHP, Python, and Ruby). The CodeSearchNet Corpus also contains automatically generated query-like natural language for 2 million functions, obtained from mechanically scraping and preprocessing associated function documentation.
  • Licence : Divers
  • Version : 1.0.0
  • Divisions :
Diviser Exemples
'test' 6483
'train' 123889
'validation' 8253
  • Caractéristiques :
{
    "repository_name": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "func_path_in_repository": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "func_name": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "whole_func_string": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "language": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "func_code_string": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "func_code_tokens": {
        "feature": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        },
        "length": -1,
        "id": null,
        "_type": "Sequence"
    },
    "func_documentation_string": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "func_documentation_tokens": {
        "feature": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        },
        "length": -1,
        "id": null,
        "_type": "Sequence"
    },
    "split_name": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "func_code_url": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}

rubis

Utilisez la commande suivante pour charger cet ensemble de données dans TFDS :

ds = tfds.load('huggingface:code_search_net/ruby')
  • Description :
CodeSearchNet corpus contains about 6 million functions from open-source code spanning six programming languages (Go, Java, JavaScript, PHP, Python, and Ruby). The CodeSearchNet Corpus also contains automatically generated query-like natural language for 2 million functions, obtained from mechanically scraping and preprocessing associated function documentation.
  • Licence : Divers
  • Version : 1.0.0
  • Divisions :
Diviser Exemples
'test' 2279
'train' 48791
'validation' 2209
  • Caractéristiques :
{
    "repository_name": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "func_path_in_repository": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "func_name": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "whole_func_string": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "language": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "func_code_string": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "func_code_tokens": {
        "feature": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        },
        "length": -1,
        "id": null,
        "_type": "Sequence"
    },
    "func_documentation_string": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "func_documentation_tokens": {
        "feature": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        },
        "length": -1,
        "id": null,
        "_type": "Sequence"
    },
    "split_name": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "func_code_url": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}

php

Utilisez la commande suivante pour charger cet ensemble de données dans TFDS :

ds = tfds.load('huggingface:code_search_net/php')
  • Description :
CodeSearchNet corpus contains about 6 million functions from open-source code spanning six programming languages (Go, Java, JavaScript, PHP, Python, and Ruby). The CodeSearchNet Corpus also contains automatically generated query-like natural language for 2 million functions, obtained from mechanically scraping and preprocessing associated function documentation.
  • Licence : Divers
  • Version : 1.0.0
  • Divisions :
Diviser Exemples
'test' 28391
'train' 523712
'validation' 26015
  • Caractéristiques :
{
    "repository_name": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "func_path_in_repository": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "func_name": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "whole_func_string": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "language": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "func_code_string": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "func_code_tokens": {
        "feature": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        },
        "length": -1,
        "id": null,
        "_type": "Sequence"
    },
    "func_documentation_string": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "func_documentation_tokens": {
        "feature": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        },
        "length": -1,
        "id": null,
        "_type": "Sequence"
    },
    "split_name": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "func_code_url": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}