Raja Gond

Hi! I'm Raja...

I'm a Research Assistant in the AI-Infrastructure team at Microsoft Research India (MSR-I). In 2023, I graduated from the Undergraduate Programmes at the IIT Bombay, where I earned my B.Tech (with Honors) in Computer Science. I have broad interests in making computer systems more efficient and am currently working on Systems for Machine Learning.

At MSR-I, I work on optimizing systems for efficient inference of Large Language Models (LLMs). During my undergrad, I worked with Prof. Purushottam (Puru) Kulkarni on CXL and persistent memory.

When the deadlines are quiet, and time slows, you may find me taking an infinite walk or reading a book (mostly Hindi or Urdu poetry). Sometimes, as the sun sets in the western sky, I steal moments from the treasure of time to weave my thoughts into words (check the poetry below!).

Research

Preprints
- TokenWeave: Efficient Compute-Communication Overlap for Distributed LLM Inference |
  Raja Gond, Nipun Kwatra, Ramchandran Ramjee | undersubmission
- emucxl: an emulation framework for CXL-based disaggregated memory applications |
  Raja Gond, Puru Kulkarni

Service

Artifact Evaluation Committee
- ATC'25 / OSDI'25
- SOSP'25

Last updated July 2025. Thanks Jon Barron for the template!