In-depth articles, capability tours and field notes — written from actually running the models, tools and pipelines they're about.
A capability tour of HiDream-O1-Image, the open 8B Pixel-DiT model. Covers text-to-image generation, instruction-based editing, subject-driven personalization (IP), bounding-box layout conditioning and skeleton/OpenPose conditioning, plus nine diagram and infographic styles, all run end-to-end on an RTX 4090.
Read the article →

More articles
I'm actively writing about open-weights image & video models, agentic LLM pipelines, and local-GPU AI workflows. Check back, or follow on Medium.