Examples
================

- :doc:`ping_pong.py `: Demonstrates the basics of Monarch's Actor/endpoint API with a ping-pong communication example
- :doc:`crawler.py `: Demonstrates Monarch's actor API and many-to-one communications with a web crawler example
- :doc:`spmd_ddp.py `: Shows how to run PyTorch's Distributed Data Parallel (DDP) using SPMDActor
- :doc:`Interactive SPMD Job `: Shows how to use ``serve()`` and ``run_spmd()`` for interactive SPMD training with job caching and debugging
- :doc:`kubernetes_ddp.py `: Extends the DDP example to run on Kubernetes using MonarchMesh CRD and operator
- :doc:`grpo_actor.py `: Implements a distributed PPO-like reinforcement learning algorithm using the Monarch actor framework
- :doc:`distributed_tensors.py `: Shows how to dispatch tensors and tensor-level operations to a distributed mesh of workers and GPUs
- :doc:`debugging.py `: Shows how to use the Monarch debugger to debug a distributed program
- `Multinode Slurm Tutorial `_: Multinode distributed training tutorial using Monarch and Slurm to run an SPMD training job.
- `Running on Kubernetes using SkyPilot `_: Run Monarch on Kubernetes and cloud VMs via SkyPilot.

.. toctree::
   :hidden:

   /generated/examples/ping_pong
   /generated/examples/crawler
   /generated/examples/ddp/spmd_ddp
   /generated/examples/ddp/spmd_job
   /generated/examples/ddp/kubernetes_ddp
   /generated/examples/grpo_actor
   /generated/examples/distributed_tensors
   /generated/examples/debugging
   Multinode Slurm Tutorial