Loading…
Tuesday August 5, 2025 9:25am - 9:40am GMT+07
As we continue to push the model sizes for LLMs, distributed training has become an essential component of our AI infrastructure. However, as we build clusters that consume power beyond the capability of a single datacenter region, new networking challenges emerge. This talk will explore the complexities of networking in a multi-region distributed training environment, where data is transmitted across long distances between datacenters. We will discuss the current state of distributed training, the limitations of traditional networking approaches, and the innovative solutions being developed to address these challenges.
Speakers
OB

Omar Baldonado

Director, DC Networking, Meta
Tuesday August 5, 2025 9:25am - 9:40am GMT+07
TaiNEX 2 - 701 CD

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

Share Modal

Share this link via

Or copy link