News
- [May 2026] Vec-LUT selected as featured paper for the On-Device AI session of ACM MobiSys 2026. Paper / Code / Model
- [May 2026] OxyGen updated: released arXiv v2 and added PyTorch support (previously JAX-only) for on-board deployment (e.g., on Jetson AGX Thor). Paper / Code
- [May 2026] EmbodiSkill: Skill-Aware Reflection for Self-Evolving Embodied Agents released. Paper
Selected Publications
Latest Posts
Enhancing GPTQv2 Format Support in vLLM: Analysis and Implementation
Deep technical analysis of GPTQv2 format limitations in vLLM, and implementation of CUDA kernel adaptations to enable efficient low-bit/asymmetric quantization inference.
Vision-Language-Action (VLA) Models: A Review of Recent Progress
Recent VLAs have evolved from discrete to continuous action outputs, and from single-system (System 1 only) to dual-system designs.
Reading Notes of Dario Amodei's Blog