Pyramid: Accelerating LLM Inference with Cross-Level Processing-in-Memory
Authors: L. Yan, X. Lu, X. Chen, Y. Han, X.-H. Sun
Date: April, 2025
Venue: IEEE Computer Architecture Letters (CAL), April 2025
Type: Journal
Tags
LLM InferenceProcessing-in-MemoryHardware Acceleration