tfm.nlp.tasks.ElectraPretrainTask

ELECTRA Pretrain Task (Masked LM + Replaced Token Detection).

Inherits From: Task

tfm.nlp.tasks.ElectraPretrainTask(
    params, logging_dir: Optional[str] = None, name: Optional[str] = None
)

Args
`params`	the task configuration instance, which can be any of dataclass, ConfigDict, namedtuple, etc.
`logging_dir`	a string pointing to where the model, summaries etc. will be saved. You can also write additional stuff in this directory.
`name`	the task name.

Attributes
`logging_dir`
`task_config`

Attributes

logging_dir

task_config

Methods

`aggregate_logs`

View source

aggregate_logs(
    state, step_logs
)

Optional aggregation over logs returned from a validation step.

Given step_logs from a validation step, this function aggregates the logs after each eval_step() (see eval_reduce() function in official/core/base_trainer.py). It runs on CPU and can be used to aggregate metrics during validation, when there are too many metrics that cannot fit into TPU memory. Note that this may increase latency due to data transfer between TPU and CPU. Also, the step output from a validation step may be a tuple with elements from replicas, and a concatenation of the elements is needed in such case.

Args
`state`	The current state of training, for example, it can be a sequence of metrics.
`step_logs`	Logs from a validation step. Can be a dictionary.

`build_inputs`

View source

build_inputs(
    params, input_context=None
)

Returns tf.data.Dataset for pretraining.

`build_losses`

View source

build_losses(
    labels, model_outputs, metrics, aux_losses=None
) -> tf.Tensor

Standard interface to compute losses.

Args
`labels`	optional label tensors.
`model_outputs`	a nested structure of output tensors.
`aux_losses`	auxiliary loss tensors, i.e. `losses` in keras.Model.

Returns
The total loss tensor.

`build_metrics`

View source

build_metrics(
    training=None
)

Gets streaming metrics for training/validation.

`build_model`

View source

build_model()

[Optional] Creates model architecture.

Returns
A model instance.

`create_optimizer`

View source

@classmethod
create_optimizer(
    optimizer_config: tfm.optimization.OptimizationConfig,
    runtime_config: Optional[tfm.core.base_task.RuntimeConfig] = None,
    dp_config: Optional[tfm.core.base_task.DifferentialPrivacyConfig] = None
)

Creates an TF optimizer from configurations.

Args
`optimizer_config`	the parameters of the Optimization settings.
`runtime_config`	the parameters of the runtime.
`dp_config`	the parameter of differential privacy.

Returns
A tf.optimizers.Optimizer object.

`inference_step`

View source

inference_step(
    inputs, model: tf.keras.Model
)

Performs the forward step.

With distribution strategies, this method runs on devices.

Args
`inputs`	a dictionary of input tensors.
`model`	the keras.Model.

Returns
Model outputs.

`initialize`

View source

initialize(
    model: tf.keras.Model
)

[Optional] A callback function used as CheckpointManager's init_fn.

This function will be called when no checkpoint is found for the model. If there is a checkpoint, the checkpoint will be loaded and this function will not be called. You can use this callback function to load a pretrained checkpoint, saved under a directory other than the model_dir.

Args
`model`	The keras.Model built or used by this task.

`process_compiled_metrics`

View source

process_compiled_metrics(
    compiled_metrics, labels, model_outputs
)

Process and update compiled_metrics.

call when using compile/fit API.

Args
`compiled_metrics`	the compiled metrics (model.compiled_metrics).
`labels`	a tensor or a nested structure of tensors.
`model_outputs`	a tensor or a nested structure of tensors. For example, output of the keras model built by self.build_model.

`process_metrics`

View source

process_metrics(
    metrics, labels, model_outputs
)

Process and update metrics.

Called when using custom training loop API.

Args
`metrics`	a nested structure of metrics objects. The return of function self.build_metrics.
`labels`	a tensor or a nested structure of tensors.
`model_outputs`	a tensor or a nested structure of tensors. For example, output of the keras model built by self.build_model.
`**kwargs`	other args.

`reduce_aggregated_logs`

View source

reduce_aggregated_logs(
    aggregated_logs, global_step: Optional[tf.Tensor] = None
)

Optional reduce of aggregated logs over validation steps.

This function reduces aggregated logs at the end of validation, and can be used to compute the final metrics. It runs on CPU and in each eval_end() in base trainer (see eval_end() function in official/core/base_trainer.py).

Args
`aggregated_logs`	Aggregated logs over multiple validation steps.
`global_step`	An optional variable of global step.

Returns
A dictionary of reduced results.

`train_step`

View source

train_step(
    inputs,
    model: tf.keras.Model,
    optimizer: tf.keras.optimizers.Optimizer,
    metrics
)

Does forward and backward.

Args
`inputs`	a dictionary of input tensors.
`model`	the model, forward pass definition.
`optimizer`	the optimizer for this training step.
`metrics`	a nested structure of metrics objects.

Returns
A dictionary of logs.

`validation_step`

View source

validation_step(
    inputs, model: tf.keras.Model, metrics
)

Validatation step.

Args
`inputs`	a dictionary of input tensors.
`model`	the keras.Model.
`metrics`	a nested structure of metrics objects.

Returns
A dictionary of logs.

Class Variables
loss	`'loss'`

tfm.nlp.tasks.ElectraPretrainTask Stay organized with collections Save and categorize content based on your preferences.

Args

Attributes

Methods

aggregate_logs

build_inputs

build_losses

build_metrics

build_model

create_optimizer

inference_step

initialize

process_compiled_metrics

process_metrics

reduce_aggregated_logs

train_step

validation_step

Class Variables

tfm.nlp.tasks.ElectraPretrainTask

`aggregate_logs`

`build_inputs`

`build_losses`

`build_metrics`

`build_model`

`create_optimizer`

`inference_step`

`initialize`

`process_compiled_metrics`

`process_metrics`

`reduce_aggregated_logs`

`train_step`

`validation_step`