An RNN backed by cuDNN.
Computes the RNN from the input and initial states, with respect to the params buffer. Compared with CudnnRNN, it accepts one extra input, "sequence_lengths".
- rnn_mode: Indicates the type of the RNN model.
- input_mode: Indicates whether there is a linear projection between the input and the actual computation before the first layer. 'skip_input' is only allowed when input_size == num_units; 'auto_select' implies 'skip_input' when input_size == num_units; otherwise, it implies 'linear_input'.
- direction: Indicates whether a bidirectional model will be used. Should be "unidirectional" or "bidirectional".
- dropout: Dropout probability. When set to 0.0, dropout is disabled.
- seed: The 1st part of a seed to initialize dropout.
- seed2: The 2nd part of a seed to initialize dropout.
- input: If time_major is true, this is a 3-D tensor with the shape of [seq_length, batch_size, input_size]. If time_major is false, the shape is [batch_size, seq_length, input_size].
- input_h: If time_major is true, this is a 3-D tensor with the shape of [num_layer * dir, batch_size, num_units]. If time_major is false, the shape is [batch_size, num_layer * dir, num_units].
- input_c: For LSTM, a 3-D tensor with the shape of [num_layer * dir, batch_size, num_units]. For other models, it is ignored.
- params: A 1-D tensor that contains the weights and biases in an opaque layout. The size must be created through CudnnRNNParamsSize, and initialized separately. Note that they might not be compatible across different generations. So it is a good idea to save and restore
- sequence_lengths: A vector containing the length of each input sequence.
- output: If time_major is true, this is a 3-D tensor with the shape of [seq_length, batch_size, dir * num_units]. If time_major is false, the shape is [batch_size, seq_length, dir * num_units].
- output_h: The same shape as input_h.
- output_c: The same shape as input_c for LSTM. An empty tensor for other models.
- is_training: Indicates whether this operation is used for inference or training.
- time_major: Indicates whether the input/output format is time major or batch major.
- reserve_space: An opaque tensor that can be used in backprop calculation. It is only produced if is_training is true.
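The shape and input_mode rules above can be sketched in plain Java. This is a minimal illustration, not part of the TensorFlow API: the class `CudnnRnnShapes` and all of its method names are hypothetical, and it assumes that `dir` is 1 for "unidirectional" and 2 for "bidirectional".

```java
import java.util.Arrays;

/** Hypothetical helper, not part of TensorFlow: mirrors the shape rules described above. */
final class CudnnRnnShapes {
    private CudnnRnnShapes() {}

    /** Assumed mapping: "unidirectional" gives 1, "bidirectional" gives 2. */
    static int dirCount(String direction) {
        switch (direction) {
            case "unidirectional":
                return 1;
            case "bidirectional":
                return 2;
            default:
                throw new IllegalArgumentException("Unknown direction: " + direction);
        }
    }

    /** Shape of input: time major or batch major. */
    static long[] inputShape(boolean timeMajor, long seqLength, long batchSize, long inputSize) {
        return timeMajor
                ? new long[] {seqLength, batchSize, inputSize}
                : new long[] {batchSize, seqLength, inputSize};
    }

    /** Shape of input_h (and therefore output_h, which has the same shape). */
    static long[] hiddenStateShape(
            boolean timeMajor, long numLayers, int dir, long batchSize, long numUnits) {
        return timeMajor
                ? new long[] {numLayers * dir, batchSize, numUnits}
                : new long[] {batchSize, numLayers * dir, numUnits};
    }

    /** Shape of output: the last dimension is dir * num_units. */
    static long[] outputShape(
            boolean timeMajor, long seqLength, long batchSize, int dir, long numUnits) {
        return timeMajor
                ? new long[] {seqLength, batchSize, dir * numUnits}
                : new long[] {batchSize, seqLength, dir * numUnits};
    }

    /** Resolves input_mode to the mode that is effectively used. */
    static String resolveInputMode(String inputMode, long inputSize, long numUnits) {
        switch (inputMode) {
            case "linear_input":
                return "linear_input";
            case "skip_input":
                // 'skip_input' is only allowed when input_size == num_units.
                if (inputSize != numUnits) {
                    throw new IllegalArgumentException(
                            "skip_input requires input_size == num_units");
                }
                return "skip_input";
            case "auto_select":
                return inputSize == numUnits ? "skip_input" : "linear_input";
            default:
                throw new IllegalArgumentException("Unknown input_mode: " + inputMode);
        }
    }

    public static void main(String[] args) {
        int dir = dirCount("bidirectional");
        // Batch-major output for seq_length=10, batch_size=4, num_units=8.
        System.out.println(Arrays.toString(outputShape(false, 10, 4, dir, 8))); // prints [4, 10, 16]
        System.out.println(resolveInputMode("auto_select", 8, 8)); // prints skip_input
    }
}
```

A helper like this can be used to check tensor shapes before building the op, since a mismatch between time_major and the actual layout of input is easy to miss.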
Nested Classes
class | CudnnRNN.Options | Optional attributes for CudnnRNN |
Constants
String | OP_NAME | The name of this op, as known by the TensorFlow core engine |
Public Methods
static <T extends TNumber> CudnnRNN<T> | create(Scope scope, Operand<T> input, Operand<T> inputH, Operand<T> inputC, Operand<T> params, Operand<TInt32> sequenceLengths, Options... options) |
static CudnnRNN.Options | direction(String direction) |
static CudnnRNN.Options | dropout(Float dropout) |
Output<?> | |
static CudnnRNN.Options | inputMode(String inputMode) |
static CudnnRNN.Options | isTraining(Boolean isTraining) |
static CudnnRNN.Options | numProj(Long numProj) |
Output<T> | output() |
Output<T> | outputC() |
Output<T> | outputH() |
Output<T> | |
static CudnnRNN.Options | rnnMode(String rnnMode) |
static CudnnRNN.Options | seed(Long seed) |
static CudnnRNN.Options | seed2(Long seed2) |
static CudnnRNN.Options | timeMajor(Boolean timeMajor) |
Constants
public static final String OP_NAME
The name of this op, as known by the TensorFlow core engine
Public Methods
public static <T extends TNumber> CudnnRNN<T> create(Scope scope, Operand<T> input, Operand<T> inputH, Operand<T> inputC, Operand<T> params, Operand<TInt32> sequenceLengths, Options... options)
Factory method to create a class wrapping a new CudnnRNN operation.
Parameters
scope | current scope |
options | carries optional attribute values |
Returns
- a new instance of CudnnRNN