Posts

12th October 2025

Enhancing GPTQv2 Format Support in vLLM: Analysis and Implementation

Deep technical analysis of GPTQv2 format limitations in vLLM, and implementation of CUDA kernel adaptations to enable efficient low-bit/asymmetric quantization inference.

16th September 2025

Vision-Language-Action (VLA) Models: A Review of Recent Progress

Review VLA Embodied AI

Recent VLAs evolve from discrete to continuous, and from single-system (system 1 only) to dual-system.

2nd August 2025

Reading Notes of Dario Amodei's Blog

Essay Reading

Reading Notes of Dario Amodei’s Blog.

9th January 2025

Cheatsheet for Setting up Android Smartphones

Development Android Edge Device

Quickly setting up Android smartphones for development.

9th January 2025

Cheatsheet for Setting up Termux on Android Smartphones

Development Android Edge Device

Quickly setting up Termux on Android smartphones for development.

← Prev
1 of 2
Next →