Rethinking MLLM Itself as a Segmenter with a Single Segmentation Token — Quantapedia
Recent segmentation methods leveraging Multi-modal Large Language Models (MLLMs) have shown reliable object-level segmentation and enhanced spatial perception. However, almost all previous methods pre