Intro Torch Torch Distributed

TryHard-LL/--pytorch-distributed

与 DataParallel 的单进程控制多 GPU 不同，在 distributed 的帮助下，我们只需要编写一份代码，torch 就会自动将其分配给个进程，分别在个 GPU 上运行。在 API 层面，pytorch 为我们提供了 torch.distributed.launch 启动器，用于在命令行分布式地执行 python 文件。

GitHub

Bug: torch.distributed.pipelining produces incorrect graph when model contains nn.Embedding

When splitting a simple model that contains an nn.Embedding layer into pipeline stages with the torch.distributed.pipelining.pipeline API, the pipeline representation incorrectly calls the embedding ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

TryHard-LL/--pytorch-distributed

Bug: torch.distributed.pipelining produces incorrect graph when model contains nn.Embedding

Trending now