Recent advances in large-scale AI models have placed increasing pressure on the underlying compute architecture to deliver not only raw performance but also programmability and efficiency at scale. This talk introduces the Tensor Contraction Processor (TCP), a novel architecture that elevates tensor contraction, rather than matrix multiplication, to the central computational primitive, enabling a broader class of operations to execute natively on the hardware. We will present the motivation behind this architectural shift, its implications for compiler design and runtime scheduling, and our findings on performance and energy efficiency. The discussion will also explore how exposing tensor contraction at the hardware level opens opportunities for more expressive execution strategies, potentially reducing data movement and improving utilization. We will share key learnings from scaling the chip across servers and racks, highlight intersections with relevant OCP Project areas, and discuss how these insights are informing our product roadmap.
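As background for the primitive the talk centers on: a tensor contraction sums over one or more shared indices of multi-dimensional operands, and matrix multiplication is the special case with a single contracted index between two 2-D operands. The sketch below illustrates this generalization in Python with numpy.einsum; the shapes and index expressions are illustrative only and are not TCP's programming interface.

```python
import numpy as np

# Matrix multiplication as a tensor contraction:
# C[m, n] = sum_k A[m, k] * B[k, n]  -- one contracted index, k.
A = np.random.rand(64, 128)
B = np.random.rand(128, 32)
C = np.einsum("mk,kn->mn", A, B)
assert np.allclose(C, A @ B)

# A batched, attention-style contraction: sum over the head
# dimension d while batch b and sequence indices i, j stay free.
Q = np.random.rand(8, 512, 64)   # [batch, seq, head_dim]
K = np.random.rand(8, 512, 64)   # [batch, seq, head_dim]
scores = np.einsum("bid,bjd->bij", Q, K)

# A contraction over three indices (h, w, c) at once --
# expressed directly, with no reshaping into 2-D matrices first.
X = np.random.rand(16, 4, 4, 32)
W = np.random.rand(4, 4, 32, 10)
Y = np.einsum("nhwc,hwco->no", X, W)

print(C.shape, scores.shape, Y.shape)  # (64, 32) (8, 512, 512) (16, 10)
```

On matmul-centric hardware, the second and third patterns are typically lowered to batched or reshaped matrix multiplications; the talk's premise is that exposing contraction itself as the hardware primitive can avoid some of that lowering and the data movement it implies.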