tf.raw_ops.UnicodeEncode
Stay organized with collections
Save and categorize content based on your preferences.
Encode a tensor of ints into unicode strings.
tf.raw_ops.UnicodeEncode(
input_values,
input_splits,
output_encoding,
errors='replace',
replacement_char=65533,
name=None
)
Returns a vector of strings, where output[i]
is constructed by encoding the
Unicode codepoints in input_values[input_splits[i]:input_splits[i+1]]
using output_encoding
.
Example:
input_values = [72, 101, 108, 108, 111, 87, 111, 114, 108, 100]
input_splits = [0, 5, 10]
output_encoding = 'UTF-8'
output = ['Hello', 'World']
Args |
input_values
|
A Tensor of type int32 .
A 1D tensor containing the unicode codepoints that should be encoded.
|
input_splits
|
A Tensor . Must be one of the following types: int32 , int64 .
A 1D tensor specifying how the unicode codepoints should be split into strings.
In particular, output[i] is constructed by encoding the codepoints in the
slice input_values[input_splits[i]:input_splits[i+1]] .
|
output_encoding
|
A string from: "UTF-8", "UTF-16-BE", "UTF-32-BE" .
Unicode encoding of the output strings. Valid encodings are: "UTF-8",
"UTF-16-BE", and "UTF-32-BE" .
|
errors
|
An optional string from: "ignore", "replace", "strict" . Defaults to "replace" .
Error handling policy when there is invalid formatting found in the input.
The value of 'strict' will cause the operation to produce a InvalidArgument
error on any invalid input formatting. A value of 'replace' (the default) will
cause the operation to replace any invalid formatting in the input with the
replacement_char codepoint. A value of 'ignore' will cause the operation to
skip any invalid formatting in the input and produce no corresponding output
character.
|
replacement_char
|
An optional int . Defaults to 65533 .
The replacement character codepoint to be used in place of any invalid
formatting in the input when errors='replace' . Any valid unicode codepoint may
be used. The default value is the default unicode replacement character is
0xFFFD (U+65533).
|
name
|
A name for the operation (optional).
|
Returns |
A Tensor of type string .
|
Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 4.0 License, and code samples are licensed under the Apache 2.0 License. For details, see the Google Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates. Some content is licensed under the numpy license.
Last updated 2024-04-26 UTC.
[[["Easy to understand","easyToUnderstand","thumb-up"],["Solved my problem","solvedMyProblem","thumb-up"],["Other","otherUp","thumb-up"]],[["Missing the information I need","missingTheInformationINeed","thumb-down"],["Too complicated / too many steps","tooComplicatedTooManySteps","thumb-down"],["Out of date","outOfDate","thumb-down"],["Samples / code issue","samplesCodeIssue","thumb-down"],["Other","otherDown","thumb-down"]],["Last updated 2024-04-26 UTC."],[],[]]