How POPEN Boosts LVLM Reasoning Segmentation with Preference Optimization and Ensemble
The paper introduces POPEN, a new framework that uses preference‑based optimization and ensemble methods to reduce hallucinations and improve segmentation accuracy in large visual language models, achieving state‑of‑the‑art results on multiple benchmarks.
