Publications

Dissertations & Theses

PhD Thesis: Compression-Driven Memory-Efficient and High-Throughput GPU Systems for LLM Inference

Published in The University of Sydney (USYD), 2026

My doctoral dissertation focuses on alleviating the memory wall and communication bottlenecks in Large Language Model (LLM) inference through algorithm-system co-design, including low-bit quantization and unstructured sparsity acceleration.

Recommended citation: Haojun Xia. "Compression-Driven Memory-Efficient and High-Throughput GPU Systems for LLM Inference." PhD Thesis, The University of Sydney, 2026.

Master’s Thesis: The design and implementation of a lightweight automata processor

Published in University of Science and Technology of China (USTC), 2021

This thesis focuses on the architecture design, Verilog HDL prototyping, and hardware-software co-design of a high-performance, memory-efficient lightweight automata processing engine for large-scale pattern matching.

Recommended citation: Haojun Xia. "The design and implementation of a lightweight automata processor." Master's Thesis, University of Science and Technology of China, 2021.

Dr. Haojun Xia

Publications

Dissertations & Theses

PhD Thesis: Compression-Driven Memory-Efficient and High-Throughput GPU Systems for LLM Inference

Master’s Thesis: The design and implementation of a lightweight automata processor