pageshift.ai
Join us
Blog
Mission
Contact
Research
Articles
Research
February 2024
ATAC-Qwen-VL
Most image captioning models optimize for readability, not accuracy. This post shows how I built a GPT-4V-level captioning model for synthetic data generation on a single GPU.