tf.AggregationMethod
Stay organized with collections
Save and categorize content based on your preferences.
A class listing aggregation methods used to combine gradients.
Computing partial derivatives can require aggregating gradient
contributions. This class lists the various methods that can
be used to combine gradients in the graph.
The following aggregation methods are part of the stable API for
aggregating gradients:
ADD_N
: All of the gradient terms are summed as part of one
operation using the "AddN" op (see tf.add_n
). This
method has the property that all gradients must be ready and
buffered separately in memory before any aggregation is performed.
DEFAULT
: The system-chosen default aggregation method.
The following aggregation methods are experimental and may not
be supported in future releases:
EXPERIMENTAL_TREE
: Gradient terms are summed in pairs using
using the "AddN" op. This method of summing gradients may reduce
performance, but it can improve memory utilization because the
gradients can be released earlier.
Class Variables
ADD_N = 0
DEFAULT = 0
EXPERIMENTAL_ACCUMULATE_N = 2
EXPERIMENTAL_TREE = 1
Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 4.0 License, and code samples are licensed under the Apache 2.0 License. For details, see the Google Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.
Last updated 2020-10-01 UTC.
[[["Easy to understand","easyToUnderstand","thumb-up"],["Solved my problem","solvedMyProblem","thumb-up"],["Other","otherUp","thumb-up"]],[["Missing the information I need","missingTheInformationINeed","thumb-down"],["Too complicated / too many steps","tooComplicatedTooManySteps","thumb-down"],["Out of date","outOfDate","thumb-down"],["Samples / code issue","samplesCodeIssue","thumb-down"],["Other","otherDown","thumb-down"]],["Last updated 2020-10-01 UTC."],[],[]]