Skip to main navigation Skip to search Skip to main content

RLAIF-V: Open-Source AI Feedback Leads to Super GPT-4V Trustworthiness

  • Tianyu Yu
  • , Haoye Zhang
  • , Qiming Li
  • , Qixin Xu
  • , Yuan Yao*
  • , Da Chen
  • , Xiaoman Lu
  • , Ganqu Cui
  • , Yunkai Dang
  • , Taiwen He
  • , Xiaocheng Feng
  • , Jun Song
  • , Bo Zheng
  • , Zhiyuan Liu*
  • , Tat Seng Chua
  • , Maosong Sun
  • *Corresponding author for this work
  • Tsinghua University
  • Harbin Institute of Technology
  • National University of Singapore
  • Peng Cheng Laboratory
  • Taobao & Tmall Group of Alibaba

Research output: Contribution to journalConference articlepeer-review

Abstract

Traditional feedback learning for hallucination reduction relies on labor-intensive manual labeling or expensive proprietary models. This leaves the community without foundational knowledge about how to build high-quality feedback with open-source MLLMs. In this work, we introduce RLAIF-V, a novel framework that aligns MLLMs in a fully open-source paradigm. RLAIF-V maximally explores open-source MLLMs from two perspectives, including high-quality feedback data generation for preference learning and self-feedback guidance for inference-time scaling. Extensive experiments on six benchmarks in both automatic and human evaluation show that RLAIF-V substantially enhances the trustworthiness of models at both preference learning and inference time. RLAIF-V 7B reduces object hallucination by 80.7% and overall hallucination by 33.7%. Remarkably, RLAIF-V 12B further reveals the self-alignment potential of open-source MLLMs, where the model can learn from feedback of itself to achieve super GPT-4V trustworthiness.

Original languageEnglish
Pages (from-to)19985-19995
Number of pages11
JournalProceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition
DOIs
StatePublished - 2025
Event2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2025 - Nashville, United States
Duration: 11 Jun 202515 Jun 2025

Fingerprint

Dive into the research topics of 'RLAIF-V: Open-Source AI Feedback Leads to Super GPT-4V Trustworthiness'. Together they form a unique fingerprint.

Cite this