Posts by Year

2025

Huhugui

September 19 2025

Read a middle school math problem, which is called 胡不归。The purpose is to find shortest travel time between points with different velocity on different segmen...

Deterministic in LLM

September 12 2025

Deterministic in LLM is a much harder problem than I initially thought. I was working with a major customer SN for a couple of months for this issues and eve...

Something about IRA

September 01 2025

I’m starting a new series, of Finance. Looked into IRA, backdoor, and mega backdoor. These CPA domain knowledge shouldn’t be too hard for STEM background ppl...

Ray continue on Two H200x8 nodes

August 28 2025

Issues and fixes GPU limitations Setting export CUDA_VISIBLE_DEVICES=0,1,... at the top of _start_ray.sh does NOT work Instead, add -e CU...

Quantum Computing

August 17 2025

I knew that Quantum Computing is NOT about parallelism but never truely understand how it works. Thanks for 3Blue1Brown’s great explanation video here, I fin...

VAE and MLE

August 10 2025

Now Im sure that all my VAE and ELBO related learning were NOT recorded in this blog. So it’s time to review some courses from Hung-yi.

DDIM and Classifer-free Guidance

August 09 2025

I looked up in my blog, and there is no record of VAE, ELBO, or flow models. These are the contents I spent quite some time on, but it’s right before I start...

Multi-node vLLM on Ray cluster

August 06 2025

It’s kind of funny that I barely started a Ray cluster without Anyscale. It actually a comlicated process but most current Enterprise are not aware of. Mainl...

Reward Model

July 30 2025

Read some latest publish from Cameron’s blog and got some knowledge refresh for Reward model, DPO (will be my next blog) and review on PPO (like always)

AniSora

July 25 2025

Got back from a short vacation and jump back into animation generation

Kijia and HiDream, VACE

July 18 2025

I am really into ComfyUI these days and try out couple of more models and pluggins

ComfyUI

July 17 2025

It’s time for a bit of fun after learning and coding with VLM for a while. ComfyUI and SB WebUI both has been very matured and quite some new models in the G...

Video support in Nemontron Nano VL

July 16 2025

The PR for Nemotron Nano VL is still on going. There are more modifications as below (Finally merged after I just finished this blog) 1 InternVL’s video exte...

Mixin and super()

July 13 2025

When working with Huggingface, there are lots of classes with suffix Mixin. I never thought about it’s actually meaning, but turn out to be an important feat...

Pytest Debug and vLLM Processing

July 12 2025

Debug Pytest for vLLM and get a closer look at the processing workflow. So working from the test case is a much clearly way to understand the workflow. 1 Pyt...

VLM Nemotron-Nano-VL-8B support

July 02 2025

I started working on this a month ago since Nemotron-Nano-VL-8B-V1 released. I thought it would be a good choice for me to add a model from scratch, and get ...

Back to Top ↑