Publications

You can also find my articles on my Google Scholar profile.

Conference Papers

H$^2$-LLM: Hardware-Dataflow Co-Exploration for Heterogeneous Hybrid-Bonding-based Low-Batch LLM Inference

AIM: Software and Hardware Co-design for Architecture-level IR-drop Mitigation in High-performance PIM

CtXnL: A Software-Hardware Co-Designed Solution for Efficient CXL-Based Transaction Processing

UniNDP: A Unified Compilation and Simulation Tool for Near DRAM Processing Architectures

SpecPIM: Accelerating Speculative Inference on PIM-Enabled System via Architecture-Dataflow Co-Exploration

PIM-DL: Expanding the Applicability of Commodity DRAM-PIMs for Deep Learning via Algorithm-System Co-Optimization

NMExplorer: An Efficient Exploration Framework for DIMM-based Near-Memory Tensor Reduction

DIMM-Link: Enabling Efficient Inter-DIMM Communication for Near-Memory Processing

GNNear: Accelerating Full-Batch Training of Graph Neural Networks with Near-Memory Processing