xAI’s Grok chatbot can now answer questions about what’s visible through your smartphone’s camera, similar to the real-time vision features available in Google’s Gemini and ChatGPT.
On Tuesday, xAI announced the launch of Grok Vision, which lets users point their phones at objects such as products, signs, and documents and ask questions about them. Grok Vision is accessible in the Grok app for iOS, but not yet in the Grok app for Android.
Grok can now see what you see – literally
Grok’s voice mode now comes with camera access, so you can point your phone at something and ask, “What am I looking at?”
With the Vision feature on iOS, the chatbot can analyze real-world objects, text, and environments through https://t.co/cmtinp8yp6 pic.twitter.com/n1b6pcyzoi
– Mario Nawfal (@Marionawfal) April 20, 2025
Other new features launching for Grok today include multilingual audio and real-time search in Grok’s voice mode. Android Grok users can access them, but only if they subscribe to xAI’s $30-per-month SuperGrok plan.
Introducing Grok Vision, multilingual audio, and real-time search in voice mode. Available now.
Grok Habla Español
Grok Parle Français
Grok Türkçe Konuşuyor
Grok speaks Japanese
Grok speaks Hindi pic.twitter.com/lcasyty2n5
– Ebby Amir (@ebbyamir) April 22, 2025
Grok has been gaining new features at a steady clip. Earlier this month, xAI added a “memory” component to Grok, allowing the bot to draw on details from past conversations. Grok also gained a canvas-like tool for creating documents and apps.