We won third place in the ICCV EVQA-SnapUGC Challenge, with our model achieving the best single-modality performance.