New capability unifies image and language understanding, including text and voice-enabled queries, so retailers can interpret intent even when spoken…