tf.keras.preprocessing.text.text_to_word_sequence

Converts a text to a sequence of words (or tokens).

Compat aliases for migration

See Migration guide for more details.

tf.keras.preprocessing.text.text_to_word_sequence(
    input_text,
    filters='!"#$%&()*+,-./:;<=>?@[\\]^_`{|}~\t\n',
    lower=True, split=' '
)

This function transforms a string of text into a list of words while ignoring filters which include punctuations by default.

sample_text = 'This is a sample sentence.'
tf.keras.preprocessing.text.text_to_word_sequence(sample_text)
['this', 'is', 'a', 'sample', 'sentence']

Args
`input_text`	Input text (string).
`filters`	list (or concatenation) of characters to filter out, such as punctuation. Default: '!"#$%&()*+,-./:;<=>?@[\]^_`{\|}~\t\n', includes basic punctuation, tabs, and newlines.
`lower`	boolean. Whether to convert the input to lowercase.
`split`	str. Separator for word splitting.

Returns
A list of words (or tokens).

Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 4.0 License, and code samples are licensed under the Apache 2.0 License. For details, see the Google Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates. Some content is licensed under the numpy license.

Last updated 2021-08-16 UTC.

tf.keras.preprocessing.text.text_to_word_sequence Stay organized with collections Save and categorize content based on your preferences.