Img-Diff: Contrastive Data Synthesis for Multimodal Large Language Models
Jiao, Qirui, Daoyuan Chen, Yilun Huang, Yaliang Li, and Ying Shen. "Img-Diff: Contrastive Data Synthesis for Multimodal Large Language Models." arXiv preprint arXiv:2408.04594 (2024).