Untether Launches 2-PFLOPS AI Chip and Announces Edge Roadmap

By Sally Ward-Foxton, EETimes (August 23, 2022)

At Hot Chips this week, Untether unveiled its second-generation architecture for AI inference and the first chip built on it, along with plans to expand into edge and endpoint accelerators.

Untether’s new architecture, internally codenamed Boqueria, addresses trends for very large neural networks, including transformer networks in natural language processing and beyond, endpoint applications that require power efficiency, and applications that require performance and power efficiency combined with prediction accuracy.

The first chip to use the Boqueria architecture, SpeedAI, is a data center inference accelerator capable of 2 PFLOPS of FP8 performance when running at its peak power consumption of 66 W, or 30 TFLOPS/W in a more typical 30-35 W power envelope. (Untether’s first-generation chip, RunAI, could handle 500 TOPS of INT8.)
