Results 1 - 2 of 2 for peers (0.13 sec)
tensorflow/c/c_api_experimental.h
//
// This is intended to be used when a peer failure is detected.
TF_CAPI_EXPORT extern void TFE_AbortCollectiveOps(TFE_Context* ctx, TF_Status* status);

// Checks the health of collective ops peers. Explicit health check is needed in
// multi worker collective ops to detect failures in the cluster. If a peer is
// down, collective ops may hang.
C - Registered: Tue Apr 30 12:39:09 GMT 2024 - Last Modified: Thu Apr 27 21:07:00 GMT 2023 - 15.1K bytes
RELEASE.md
is now a stable API and is no longer considered experimental. Some of the major improvements involve handling peer failure and many bug fixes. Please check out the detailed tutorial on [Multi-worker training with Keras](https://www.tensorflow.org/tutorials/distribute/multi_worker_with_keras).
Plain Text - Registered: Tue May 07 12:40:20 GMT 2024 - Last Modified: Mon Apr 29 19:17:57 GMT 2024 - 727.7K bytes