About 13,200 results
Open links in new tab
  1. Should we set non_blocking to True? - PyTorch Forums

    Feb 26, 2019 · I’ve read the document saying that if we have pinned memory, we could set non_blocking to true. Will this result in anything bad in our code? Like in my code, after doing data …

  2. gpu_tensor.to("cpu", non_blocking=True) is blocking #39694

    Jun 9, 2020 · 🐛 Bug >>> a = torch.tensor(100000, device="cuda") >>> b = a.to("cpu", non_blocking=True) >>> b.is_pinned() False The cpu dst memory is created as pageable, so the ...

  3. python - Proper Usage of PyTorch's non_blocking=True for Data ...

    Aug 18, 2020 · I am looking into prefetching data into the GPU from the CPU when the model is being trained on the GPU. Overlapping CPU-to-GPU data transfer with GPU model training appears to …

  4. Unveiling the Magic of PyTorch Non-Blocking — codegenes.net

    Nov 14, 2025 · In the realm of deep learning, optimizing the training and inference processes is crucial for achieving high-performance results. PyTorch, a popular deep learning framework, offers a feature …

  5. Does non_blocking=True use my stream or a new one?

    Jan 4, 2023 · I’m using stream context manager (with torch.cuda.stream (s)) to write my custom autograd function. If I do some data transfer from host to device using to (device=d, …

  6. CUDA Non-Blocking in PyTorch: A Comprehensive Guide

    Nov 13, 2025 · In the realm of deep learning, PyTorch has emerged as one of the most popular frameworks due to its dynamic computational graph and ease of use. When dealing with large-scale …

  7. Should `pin_memory` always used with `non_blocking=True`?

    Oct 12, 2022 · Simplest case: import torch import torch.cuda.nvtx as nvtx from contextlib import contextmanager @contextmanager def stream_wrapper(stream): yield …

  8. Is synchronization necessary before calling run if non ... - GitHub

    The question In the FlashInfer documentation, there is always a warning that if non_blocking=True is passed, a synchronization operation must be called before run. However, if there is only host-to...

  9. Should we set non_blocking to True? - Page 2 - PyTorch Forums

    Nov 10, 2021 · Sorry for picking up on old thread but I found your statement interesting. Without any in-depth knowledge of how GPUs work, I assume that this means: The data transfer to the GPU can …

  10. I have a question about tensor.to (non_blocking=True)

    Jan 19, 2024 · I saw topics on [Should we set non_blocking to True? - PyTorch Forums] . I read answers and i saw post [[How to Overlap Data Transfers in CUDA C/C++ | NVIDIA Technical Blog]] How to …