PR #62817: [onednn] Enable auto_mixed_precision for fp16 on cpu
Imported from GitHub PR https://github.com/tensorflow/tensorflow/pull/62817

This PR enables AMP FP16 on supporting Intel Xeon CPUs. If there is no GPU available to run FP16, it checks if FP16 can be executed on CPU. Use config `auto_mixed_precision` to enable FP16 on CPUs.

Copybara import of the project:

-- fc1ee3bc by Gauri1 Deshpande <gauri1.deshpande@intel.com>:
[onednn] Enable auto_mixed_precision for fp16 on cpu

-- 9a3c8d59 by Gauri1 Deshpande <gauri1.deshpande@intel.com>:
Address review comments - update comments as per guidelines.

-- 27b1a962 by Gauri1 Deshpande <gauri1.deshpande@intel.com>:
Address review comments

-- 9a3ef902 by Gauri1 Deshpande <gauri1.deshpande@intel.com>:
Address review comments

-- 0ab5b80f by Gauri1 Deshpande <gauri1.deshpande@intel.com>:
minor change

-- 182a1738 by Gauri1 Deshpande <gauri1.deshpande@intel.com>:
Address review comments

-- 9780d75a by Gauri1 Deshpande <gauri1.deshpande@intel.com>:
Address re-review comments

Merging this change closes #62817

FUTURE_COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/tensorflow/pull/62817 from Intel-tensorflow:gaurides/amp_fp16 9780d75a
PiperOrigin-RevId: 615915563
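The description says to use the `auto_mixed_precision` config but does not show how. A minimal sketch of setting it from Python follows, assuming the existing `tf.config.optimizer.set_experimental_options` API and a TensorFlow build that includes this change. The helper name `enable_auto_mixed_precision` and the constant `AMP_OPTIONS` are hypothetical and not part of this PR. Whether FP16 is actually used still depends on the CPU supporting it.

```python
# Grappler option named in the PR description. Per the description, when no
# GPU is available to run FP16, the optimizer checks whether the CPU can.
AMP_OPTIONS = {"auto_mixed_precision": True}


def enable_auto_mixed_precision():
    """Turn on the Grappler auto_mixed_precision pass (illustrative helper)."""
    # Imported lazily so this module can be loaded without TensorFlow.
    import tensorflow as tf

    tf.config.optimizer.set_experimental_options(AMP_OPTIONS)
    # Return the options currently in effect so the caller can inspect them.
    return tf.config.optimizer.get_experimental_options()
```

Calling `enable_auto_mixed_precision()` before building or tracing a graph should apply the option to graphs optimized afterwards; the option has no effect on graphs that were already optimized.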
Changed files:
- tensorflow/BUILD: 7 additions, 6 deletions
- tensorflow/compiler/mlir/tf2xla/api/v1/BUILD: 0 additions, 7 deletions
- tensorflow/compiler/mlir/tf2xla/api/v1/compile_mlir_util.cc: 0 additions, 4 deletions
- tensorflow/compiler/mlir/tf2xla/api/v1/compile_mlir_util_test.cc: 1 addition, 74 deletions
- tensorflow/core/BUILD: 4 additions, 4 deletions
- tensorflow/core/api_def/BUILD: 2 additions, 2 deletions
- tensorflow/core/common_runtime/BUILD: 6 additions, 6 deletions
- tensorflow/core/common_runtime/eager/BUILD: 2 additions, 2 deletions
- tensorflow/core/graph/BUILD: 2 additions, 2 deletions
- tensorflow/core/grappler/optimizers/BUILD: 1 addition, 1 deletion
- tensorflow/core/grappler/optimizers/auto_mixed_precision.cc: 26 additions, 14 deletions
- tensorflow/core/grappler/optimizers/auto_mixed_precision.h: 8 additions, 4 deletions
- tensorflow/core/grappler/optimizers/auto_mixed_precision_lists.h: 55 additions, 38 deletions
- tensorflow/core/grappler/optimizers/auto_mixed_precision_test.cc: 116 additions, 74 deletions
- tensorflow/core/grappler/optimizers/meta_optimizer.cc: 4 additions, 0 deletions
- tensorflow/core/kernels/BUILD: 5 additions, 5 deletions
- tensorflow/core/kernels/mkl/BUILD: 1 addition, 1 deletion
- tensorflow/core/ops/BUILD: 4 additions, 4 deletions
- tensorflow/core/platform/build_config.default.bzl: 5 additions, 5 deletions
- tensorflow/core/protobuf/rewriter_config.proto: 2 additions, 2 deletions