About me

Center for Energy-efficient Computing and Applications (CECA)
School of Integrated Circuits
Peking University
Room 512, Science Building #5, 5 Yiheyuan Road, Beijing, China, 100871

I am a third-year Ph.D. candidate at Peking University, supervised by Prof. Guangyu Sun. My research mainly focus on alleviating the memory wall problem via architecture and system innovations, especialy in deep learning scenarios (e.g., LLM inference & serving, large embedding model inference, emerging algorithms, etc.). I have published as the (co-)first author in top-tier computer architecture/system conferences including ISCA, ASPLOS, HPCA (won best paper award), DAC, PACT.

Education

  • Ph.D. candidate in Integrated Science and Engineering
    • School of Integrated Circuits, Peking University, 2022-now
  • B.Sc. in computer science
    • School of EECS, Peking University, 2018-2022
    • Double Major: Economics

Research Interests

  • Near-Data Proccessing: Architecting domain-specific accelerators to alleviate memory wall issues in LLM inference, large embedding model inference, generic computation, etc.
  • LLM Serving System Optimization: Enhancing LLM serving quality via system-level (scheduling, operator) optimizations.

Industrial Experience

  • ByteDance Seed Team:
    • Research Intern (Jul. 2024 - Now)
    • Mentor: Shufan Liu
    • Topic: LLM Serving System Optimization

Awards and Honors

  • HPCA Best Paper Award: 2023 (2 positions)
  • HPCA Best Paper Honorable Mention: 2025 (2 positions)
  • China National Scholarship: 2024 (top 2%)
  • President Award of Peking University: 2023, 2024 (top 2%)
  • Huawei Scholarship: 2023
  • Excellent Graduate, Peking University: 2022
  • Yang Xin Lotus Virtue Awards, Peking University: 2021
  • Shenzhen Stock Exchange Scholarship, Peking University: 2020
  • Founder Scholarship, Peking University: 2019
  • Merit Student, Peking University: 2019, 2020, 2021 (Undergraduate), 2023, 2024 (Graduate)