- Sort Score
- Result 10 results
- Languages All
Results 1 - 10 of 16 for XlaLaunch (0.15 sec)
-
tensorflow/compiler/jit/encapsulate_xla_computations_pass.h
// functions contain the computations to be passed to XlaLaunch. During // encapsulation, we sort the arguments into the order expected by // XlaLaunch. static Status Encapsulate(std::unique_ptr<Graph>* graph, FunctionLibraryDefinition* flib_def); // b) we rewrite the function calls generated in phase (a) into XlaLaunch // operators. We also convert the XlaClusterOutput output nodes of the
Registered: Sun Jun 16 05:45:23 UTC 2024 - Last Modified: Thu Feb 22 06:59:07 UTC 2024 - 3.6K bytes - Viewed (0) -
tensorflow/compiler/jit/encapsulate_xla_computations_pass.cc
// the arguments into the order expected by XlaLaunch computations: // 1) arguments // 2) resource variable arguments // See the documentation of EncapsulateSubgraphsInFunctions for the meaning // of the arguments. // // TODO(b/113166435): Ordering constraints on XlaLaunch op can be relaxed. Status RewriteSubgraph(const std::vector<OutputTensor>& arg_source_tensors,
Registered: Sun Jun 16 05:45:23 UTC 2024 - Last Modified: Tue Mar 12 06:33:33 UTC 2024 - 15.1K bytes - Viewed (0) -
tensorflow/compiler/jit/xla_platform_info.h
// xla_device_metadata_ lives in the tensorflow::DeviceBase in which the // XlaLaunch/_XlaCompile/_XlaRun op is placed and thus does not die before the // XlaLaunch/_XlaCompile/_XlaRun OpKernel. const XlaDevice::Metadata* xla_device_metadata_; // pjrt_device_metadata_ lives in tensorflow::PjRtBaseDevice in which the // XlaLaunch/XlaCompileOnDemand op is placed and thus does not die before the // op kernel.
Registered: Sun Jun 16 05:45:23 UTC 2024 - Last Modified: Wed Feb 21 09:53:30 UTC 2024 - 7.2K bytes - Viewed (0) -
tensorflow/compiler/mlir/tensorflow/transforms/tf_device_passes.td
This pass rewrites `tf.PartitionedCall` and `tf.StatefulPartitionedCall` operations with `_xla_compile_device_type` attribute in a `tf_device.cluster` into `tf.XlaLaunch` operations. This makes the attached function execute with XLA. `tf.XlaLaunch` requires resource-type arguments come at the end, so this pass rewrites the called function if necessary. This pass assumes there are no nested `tf_device.cluster`s so we don't end
Registered: Sun Jun 16 05:45:23 UTC 2024 - Last Modified: Wed Apr 17 18:52:57 UTC 2024 - 12.5K bytes - Viewed (0) -
tensorflow/compiler/jit/ops/xla_ops.cc
#include "absl/status/status.h" #include "tensorflow/core/framework/op.h" #include "tensorflow/core/framework/shape_inference.h" namespace tensorflow { using shape_inference::InferenceContext; REGISTER_OP("XlaLaunch") .Input("constants: Tconstants") .Attr("Tconstants: list(type) >= 0") .Input("args: Targs") .Attr("Targs: list(type) >= 0") .Input("resources: Nresources * resource")
Registered: Sun Jun 16 05:45:23 UTC 2024 - Last Modified: Sat Apr 06 09:08:06 UTC 2024 - 4.5K bytes - Viewed (0) -
tensorflow/compiler/jit/xla_compile_util.h
const NodeDef& node_def, absl::Span<const XlaArgument> args, absl::Span<const DataType> result_types); // Checks if single device compilation and execution with PJRT is enabled for // `device_type` in either the XlaLaunch op or the XlaCompileOnDemand op. bool UsePjRtForSingleDeviceCompilation(const DeviceType& device_type); // Gets the resource name of the PjRt DeviceCompiler for `device_type`.
Registered: Sun Jun 16 05:45:23 UTC 2024 - Last Modified: Wed Feb 21 09:53:30 UTC 2024 - 2.4K bytes - Viewed (0) -
tensorflow/compiler/jit/flags.h
public: // Allow using Device API (PjRt) for `device_type` in the XlaLaunch op. // Please note that `enabled_for_xla_launch_` needs to be true in addition // to the `device_type` being allowed in order to use the Device API for // single device compilation and execution in the XlaLaunch op. void AllowForDeviceInXlaLaunch(const DeviceType& device_type) {
Registered: Sun Jun 16 05:45:23 UTC 2024 - Last Modified: Wed Apr 17 18:52:57 UTC 2024 - 14.5K bytes - Viewed (0) -
tensorflow/compiler/jit/kernels/xla_ops.cc
if (ctx->has_input(i) || ctx->has_input(++i)) { ctx->set_output(0, ctx->input(i)); } } REGISTER_KERNEL_BUILDER(Name("XlaLaunch").Device(DEVICE_CPU), XlaLocalLaunchOp); REGISTER_KERNEL_BUILDER(Name("XlaLaunchV2").Device(DEVICE_CPU), XlaLaunchV2Op); REGISTER_KERNEL_BUILDER(Name("XlaLaunch") .Device(DEVICE_GPU) .HostMemory("constants")
Registered: Sun Jun 16 05:45:23 UTC 2024 - Last Modified: Fri May 17 22:46:36 UTC 2024 - 41.4K bytes - Viewed (0) -
tensorflow/compiler/mlir/tensorflow/transforms/host_runtime/lower_cluster_to_runtime_ops.cc
Registered: Sun Jun 16 05:45:23 UTC 2024 - Last Modified: Wed Apr 17 18:52:57 UTC 2024 - 9.4K bytes - Viewed (0) -
tensorflow/compiler/mlir/tfrt/tests/mlrt/tf_to_mlrt.mlir
%unused = "tf.TestAsyncIdentity"(%x) {__op_key = 0: i32, T = i32} : (tensor<i32>) -> tensor<i32> // CHECK: mlrt.await_all_control [[unused]] return %x : tensor<i32> } // ----- // Test for XlaLaunch func.func private @xla_func_0(%arg0: tensor<1x3xf32>, %arg1: tensor<1x3xf32>) -> tensor<1x3xf32> attributes {tf._XlaMustCompile = true, tf._noinline = true, tf._original_func_name = "should_not_be_used"} {
Registered: Sun Jun 16 05:45:23 UTC 2024 - Last Modified: Fri May 31 20:44:15 UTC 2024 - 24.7K bytes - Viewed (0)