REBEL™ and Beyond — Towards a Unified Chiplet Ecosystem

November 13, 2024 -- As Moore’s Law approaches its practical limits, chiplet architecture is gaining attention for its unique advantages: bypassing reticle constraints, nearly doubling time-to-market speed, and achieving higher yields with fewer defects per unit area compared to monolithic designs. This shift has spurred industry interest in building a robust ecosystem for standardized chiplets, such as high-speed I/O or analog-to-digital converters. A thriving chiplet ecosystem would bring transformative benefits to chip manufacturers, end-users, and the entire semiconductor industry by fostering innovation, ensuring interoperability, and enhancing cost efficiency.

As an active proponent of this ecosystem, Rebellions has mapped out a roadmap for future REBEL™ products, designed to embrace unified standards and interoperable IPs within the chiplet framework.


REBEL™

REBEL™ is Rebellions’ chiplet-based AI accelerator engineered specifically for data center-scale workloads. Designed from the ground up for exceptional efficiency, high hardware utilization, low latency, and seamless scalability, REBEL™ sets a new standard in AI performance.

Equipped with 144GB of HBM3e memory, it delivers an impressive 1 PFLOPS of FP16 compute power within a 350W power envelope. REBEL™ also leverages the UCIe-Advanced specification for ultra-high bandwidth density, near-zero latency, and remarkable energy efficiency. Following REBEL™, Rebellions is introducing advanced derivatives that leverage its chiplet-based architecture for seamless heterogeneous integration. These solutions add I/O dies, enhanced memory capacity, and integrated CPU functionality, further extending the REBEL™ architecture with specialized options for diverse AI infrastructure needs.


REBEL™-I/O

REBEL™-I/O introduces dedicated interface dies that handle data communication between multiple processing chips. By offloading peripheral functions to the I/O Ethernet interface, the main chiplet dies can focus exclusively on computation and significantly enhance its power. 

This modular approach mitigates manufacturing challenges: unlike monolithic chips, REBEL™-I/O allows components with distinct process node requirements to be fabricated independently, promoting reuse and cutting cost. Additionally, the Ethernet interface offers future-proof scalability for both chip-to-chip (scale-up) and server-to-server (scale-out) connectivity, with far less communication overhead, enabling seamless expansion as demands grow.


REBEL™-CPU

While a single chiplet in REBEL™ contains a CPU, REBEL™-CPU goes further by heterogeneously integrating Arm’s 64-core CPU cluster Compute Subsystem (CSS). This improves resource utilization by allowing the main chiplet to focus exclusively on workload acceleration.

The synergy between CPU and accelerators enables each component to operate at peak efficiency, maximizing compute power and minimizing latency as systems scale across servers and racks. This integration delivers up to 10x higher efficiency in server and rack-level solutions, and also lays the optimal foundation for AI training.With planned support for reliability, availability, and serviceability (RAS) features, REBEL™-CPU is positioned as a robust solution for demanding AI applications.


REBEL™-MEM

Recognizing the increasing memory demands driven by growing model sizes, REBEL™-MEM is designed to enhance REBEL™’s memory capacity through additional external memory dies. This added memory, combined with HBM3e’s industry-leading bandwidth, ensures REBEL™ can meet the challenges for any large-scale AI workloads.

With an external memory expander, REBEL™-MEM gains up to 300GB/s bandwidth and 200GB additional capacity via LPDDR or GDDR memory. This enables the chiplet to efficiently handle memory-centric AI models, namely LLMs, Video Language Models (VLMs), and Deep Learning Recommendation Models (DLRMs). These applications have substantial memory requirements due to large model sizes, extended sequence lengths, or significant intermediate data storage.


Stay tuned!

Rebellions is developing innovative chiplets–including REBEL™, REBEL™-I/O, REBEL™-CPU, and REBEL™-MEM–that are inline with our roadmap, pushing the next frontier in chiplet architecture. We will continue to pave the way to simplify integration complexity and drive cost-efficient, scalable AI acceleration, with highly interoperable, industry-leading solutions that set new standards in AI performance and affordability.