
At GTC 2026, NVIDIA unveiled three new systems: the Groq LPX inference rack, the Vera ETL256 CPU rack, and the STX storage reference architecture. Together, the announcements signal a strategic shift toward becoming a full-stack AI infrastructure platform provider.
Analysis from SemiAnalysis indicates that these launches reflect NVIDIA’s expansion beyond GPUs into broader domains, including inference optimization, high-density CPU deployment, and storage orchestration, positioning the company to address end-to-end AI infrastructure demands.
The Groq LPX inference rack represents a rapid commercialization effort following NVIDIA’s reported $20 billion investment in Groq-related intellectual property and talent. Built around the LP30 chip, fabricated on Samsung’s 4nm process, the system sidesteps TSMC’s constrained N3 capacity and requires no high-bandwidth memory (HBM), giving it a differentiated supply and cost position.
From an architectural perspective, the LPX rack pairs LP30 chips with NVIDIA GPUs and introduces an “Attention and Feed-Forward Decoupling” (AFD) approach: attention workloads run on the GPUs while feed-forward network (FFN) processing runs on the LPUs, significantly reducing inference latency for interactive large language model (LLM) applications. SemiAnalysis notes that LPUs are particularly effective in latency-sensitive scenarios, reinforcing their role within decoupled architectures.
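To make the split concrete, the sketch below walks a token batch through an AFD-style decoder layer: attention on the GPU side with a growing KV cache, FFN on the LPU side with a fixed shape. All function names and the stand-in arithmetic are illustrative assumptions, not a published NVIDIA or Groq API.

```python
# Illustrative AFD split: attention on the GPU (dynamic KV cache),
# FFN on the LPU (static schedule). Names and math are placeholders.

def attention_on_gpu(hidden: list[float], kv_cache: list) -> list[float]:
    """GPU stage: attention reads and grows the KV cache held in HBM."""
    kv_cache.append(list(hidden))           # dynamic, per-token cache growth
    scale = 1.0 / len(kv_cache)             # stand-in for attention math
    return [h * scale for h in hidden]

def ffn_on_lpu(hidden: list[float]) -> list[float]:
    """LPU stage: a fixed-shape FFN that a static schedule can pin to SRAM."""
    return [2.0 * h + 0.5 for h in hidden]  # stand-in for the FFN matmuls

def decoupled_layer(hidden: list[float], kv_cache: list) -> list[float]:
    attn_out = attention_on_gpu(hidden, kv_cache)
    # In the rack, tokens would cross the GPU->LPU link here (All-to-All).
    return ffn_on_lpu(attn_out)

kv_cache: list = []
hidden = [0.1, 0.2, 0.3]
for _ in range(4):                          # e.g., four decoder layers
    hidden = decoupled_layer(hidden, kv_cache)
print(hidden)
```

The point of the split is that the FFN has a fixed, known shape at every step, which suits the LPU’s static scheduling, while attention’s working set grows with the KV cache, which suits the GPU’s HBM.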
The Vera ETL256 CPU rack packs 256 CPUs into a single liquid-cooled rack and uses a copper interconnect topology to provide full intra-rack connectivity, helping to relieve CPU supply constraints as AI workloads scale. Meanwhile, the STX storage reference architecture extends NVIDIA’s reach into the storage layer, complementing its compute and networking capabilities to round out a full-stack AI infrastructure framework.
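The announcement does not detail the topology, so the quick calculation below only bounds the wiring: a full copper mesh across 256 CPUs, which is an assumption of mine rather than a confirmed design, would need n(n-1)/2 point-to-point links.

```python
# Link count if all 256 CPUs were fully meshed. Full mesh is an
# assumption used for sizing, not a detail from the announcement.
n = 256
full_mesh_links = n * (n - 1) // 2
print(f"{n} CPUs -> {full_mesh_links} point-to-point links")  # 32640
```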
NVIDIA also announced ecosystem support for the STX standard from major technology companies, including Dell Technologies, Hewlett Packard Enterprise, IBM, NetApp, Supermicro, and VAST Data, reinforcing its strategy of expanding industry influence through partnerships.
On the technical front, the LP30 chip adopts a monolithic die design with 500MB of on-chip SRAM and delivers up to 1.2 PFLOPS at FP8 precision, a significant improvement over Groq’s first-generation LPU, driven in part by the move from GlobalFoundries’ GF16 process to Samsung’s SF4 node.
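Taking the quoted figures at face value, two back-of-the-envelope calculations put the specs in context; the one-byte-per-FP8-weight and two-FLOPs-per-weight assumptions are mine, not from the announcement.

```python
# Capacity and peak-throughput math from the quoted LP30 figures.
sram_bytes = 500 * 1024**2               # 500MB on-chip SRAM (from the article)
fp8_weight_bytes = 1                     # assumption: 1 byte per FP8 weight
resident_params = sram_bytes // fp8_weight_bytes
print(f"~{resident_params / 1e6:.0f}M FP8 weights fit on-die")  # ~524M

peak_flops = 1.2e15                      # 1.2 PFLOPS at FP8 (from the article)
dense_pass_flops = 2 * resident_params   # assumption: 2 FLOPs per weight
print(f"~{dense_pass_flops / peak_flops * 1e6:.2f} us per pass at peak")
```

In other words, roughly half a billion FP8 parameters can stay resident on-die, suggesting larger models would be sharded across many LP30s rather than paged in from external memory.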
Within the AFD framework, GPUs handle the attention computations, which require dynamic access to the key-value (KV) cache and therefore benefit from HBM capacity and bandwidth, while LPUs execute statically scheduled FFN workloads for minimal latency. The two sides exchange tokens through an All-to-All mechanism for distribution and aggregation, and a ping-pong pipeline overlaps this communication with computation, further reducing effective latency and improving overall inference efficiency.
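The sketch below models the ping-pong idea with alternating micro-batches: while one micro-batch’s tokens are in flight over the GPU-LPU link, the next is computing attention, hiding the transfer latency. The micro-batch scheme, names, and timings are illustrative assumptions; a single-worker thread pool stands in for the interconnect.

```python
# Ping-pong pipeline sketch: overlap the All-to-All token exchange with
# compute on the next micro-batch. Timings and names are illustrative.
import time
from concurrent.futures import ThreadPoolExecutor

def attn(mb: str) -> str:            # stand-in for the GPU attention kernel
    time.sleep(0.01)
    return mb + ":attn"

def ffn(mb: str) -> str:             # stand-in for the LPU FFN kernel
    time.sleep(0.01)
    return mb + ":ffn"

def all_to_all(mb: str) -> str:      # stand-in for GPU<->LPU token exchange
    time.sleep(0.01)
    return mb

link = ThreadPoolExecutor(max_workers=1)   # models the interconnect
in_flight = None
done = []
for mb in ["A", "B", "C", "D"]:            # micro-batches alternate
    out = attn(mb)                         # this compute overlaps with the
    if in_flight is not None:              # previous batch's transfer
        done.append(ffn(in_flight.result()))
    in_flight = link.submit(all_to_all, out)
done.append(ffn(in_flight.result()))
link.shutdown()
print(done)   # ['A:attn:ffn', 'B:attn:ffn', 'C:attn:ffn', 'D:attn:ffn']
```

The sleeps are fake, but the structure is the point: each `all_to_all` submission returns immediately, so the next micro-batch’s `attn` call executes while the previous transfer completes in the background.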