signal insight

OpenAI opens its MRC supercomputer networking protocol after deploying it on frontier training clusters

OpenAI described Multipath Reliable Connection, a network protocol it says is already deployed on its largest NVIDIA GB200 training supercomputers and has been used to train multiple OpenAI models. The company is also contributing the MRC specification to the Open Compute Project as part of a broader push to make large-scale Ethernet training fabrics more resilient.

Published May 5, 2026 Updated May 8, 2026 2 sources

OpenAIMRCai infrastructureinfrastructure releasehigh impact

ai-infrastructuretrainingnetworkingperformanceinfrastructure release

Impact: high
Confidence: 96%
Change type: infrastructure release
First seen: May 5, 2026
Last updated: May 8, 2026
Audience: ai infrastructure engineershyperscale platform teamsfrontier model operators
Status: Published

Summary

What changed

OpenAI publicly detailed MRC, said it is already in production on major training clusters, and released the specification as an Open Compute Project contribution.

Why it matters

This is a real infrastructure signal, not a paper-only research note. OpenAI is effectively saying networking reliability is now a first-order constraint on frontier training, and it is trying to shape the shared protocol stack for clusters that run well beyond 100,000 GPUs.

Evidence excerpt

OpenAI says MRC is already deployed across its largest NVIDIA GB200 supercomputers, has been used to train multiple OpenAI models, and is now available as an Open Compute Project contribution.