REBEL™ and Beyond — Towards a Unified Chiplet Ecosystem
November 13, 2024 -- As Moore’s Law approaches its practical limits, chiplet architecture is gaining attention for its unique advantages: bypassing reticle constraints, nearly doubling time-to-market speed, and achieving higher yields with fewer defects per unit area compared to monolithic designs. This shift has spurred industry interest in building a robust ecosystem for standardized chiplets, such as high-speed I/O or analog-to-digital converters. A thriving chiplet ecosystem would bring transformative benefits to chip manufacturers, end-users, and the entire semiconductor industry by fostering innovation, ensuring interoperability, and enhancing cost efficiency.
As an active proponent of this ecosystem, Rebellions has mapped out a roadmap for future REBEL™ products, designed to embrace unified standards and interoperable IPs within the chiplet framework.
REBEL™
REBEL™ is Rebellions’ chiplet-based AI accelerator engineered specifically for data center-scale workloads. Designed from the ground up for exceptional efficiency, high hardware utilization, low latency, and seamless scalability, REBEL™ sets a new standard in AI performance.
Equipped with 144GB of HBM3e memory, it delivers an impressive 1 PFLOPS of FP16 compute power within a 350W power envelope. REBEL™ also leverages the UCIe-Advanced specification for ultra-high bandwidth density, near-zero latency, and remarkable energy efficiency. Following REBEL™, Rebellions is introducing advanced derivatives that leverage its chiplet-based architecture for seamless heterogeneous integration. These solutions add I/O dies, enhanced memory capacity, and integrated CPU functionality, further extending the REBEL™ architecture with specialized options for diverse AI infrastructure needs.
REBEL™-I/O
REBEL™-I/O introduces dedicated interface dies that handle data communication between multiple processing chips. By offloading peripheral functions to the I/O Ethernet interface, the main chiplet dies can focus exclusively on computation and significantly enhance its power.
This modular approach mitigates manufacturing challenges: unlike monolithic chips, REBEL™-I/O allows components with distinct process node requirements to be fabricated independently, promoting reuse and cutting cost. Additionally, the Ethernet interface offers future-proof scalability for both chip-to-chip (scale-up) and server-to-server (scale-out) connectivity, with far less communication overhead, enabling seamless expansion as demands grow.
REBEL™-CPU
While a single chiplet in REBEL™ contains a CPU, REBEL™-CPU goes further by heterogeneously integrating Arm’s 64-core CPU cluster Compute Subsystem (CSS). This improves resource utilization by allowing the main chiplet to focus exclusively on workload acceleration.
The synergy between CPU and accelerators enables each component to operate at peak efficiency, maximizing compute power and minimizing latency as systems scale across servers and racks. This integration delivers up to 10x higher efficiency in server and rack-level solutions, and also lays the optimal foundation for AI training.With planned support for reliability, availability, and serviceability (RAS) features, REBEL™-CPU is positioned as a robust solution for demanding AI applications.
REBEL™-MEM
Recognizing the increasing memory demands driven by growing model sizes, REBEL™-MEM is designed to enhance REBEL™’s memory capacity through additional external memory dies. This added memory, combined with HBM3e’s industry-leading bandwidth, ensures REBEL™ can meet the challenges for any large-scale AI workloads.
With an external memory expander, REBEL™-MEM gains up to 300GB/s bandwidth and 200GB additional capacity via LPDDR or GDDR memory. This enables the chiplet to efficiently handle memory-centric AI models, namely LLMs, Video Language Models (VLMs), and Deep Learning Recommendation Models (DLRMs). These applications have substantial memory requirements due to large model sizes, extended sequence lengths, or significant intermediate data storage.
Stay tuned!
Rebellions is developing innovative chiplets–including REBEL™, REBEL™-I/O, REBEL™-CPU, and REBEL™-MEM–that are inline with our roadmap, pushing the next frontier in chiplet architecture. We will continue to pave the way to simplify integration complexity and drive cost-efficient, scalable AI acceleration, with highly interoperable, industry-leading solutions that set new standards in AI performance and affordability.
Related News
- ADTechnology Partners with Arm, Samsung Foundry, and Rebellions on AI CPU Chiplet Platform
- Rebellions Partners with Arm, Samsung Foundry, and ADTechnology on Next-Gen AI Computing Chiplet Technology
- Alchip Technologies Demonstrates IC Ecosystem at Chiplet Summit
- Arm CSA and AMBA release provide boost for chiplet ecosystem
Latest News
- Eliyan Ports Industry’s Highest Performing PHY to Samsung Foundry SF4X Process Node, Achieving up to 40 Gbps Bandwidth at Unprecedented Power Levels with UCIe-Compliant Chiplet Interconnect Technology
- GlobalFoundries and U.S. Department of Commerce Announce Award Agreement on CHIPS Act Funding for Essential Chip Manufacturing
- One Chip Vs. Many Chiplets
- Cadence Unveils Arm-Based System Chiplet
- Chiplets: Revolutionizing Semiconductor Design and Manufacturing