pageshift.ai

Research

Articles

Research
February 2024
ATAC-Qwen-VL
Most image captioning models optimize for readability, not accuracy. This post shows how I built a GPT-4V-level captioning model for synthetic data generation on a single GPU.