xAI’s Grok chatbot can now ‘see’ the world around it



xAI’s Grok chatbot can now answer questions about what’s in view of your smartphone’s camera, similar to real-time vision features available for Google’s Gemini and ChatGPT. On Tuesday, xAI announced the launch of Grok Vision, which lets users point their phone at objects like products, signs, and documents and ask questions about them. Grok Vision is accessible from the Grok app for iOS, but not the Grok Android app just yet.

GROK CAN SEE WHAT YOU SEE—LITERALLY Grok’s voice mode comes with camera access, letting users point their phone at something and ask, “What am I looking at?” The Vision feature on iOS allows the chatbot to analyze real-world objects, text, and environments through your... https://t.co/cmtINP8yp6 pic.twitter.com/N1b6pcYZOi
Mario Nawfal (@MarioNawfal), April 20, 2025

Other new capabilities launching for Grok today include multilingual audio and real-time search in Grok’s voice mode. Grok users on Android can use those features, but only if they’re subscribed to xAI’s $30-per-month SuperGrok plan.

Introducing Grok Vision, multilingual audio, and realtime search in Voice Mode. Available now. Grok habla español Grok parle français Grok Türkçe konuşuyor グロクは日本語を話す ग्रोक हिंदी बोलता है pic.twitter.com/lcaSyty2n5
Ebby Amir (@ebbyamir), April 22, 2025

Grok has been gaining new features at a steady clip. Earlier this month, xAI added a “memory” component to Grok that lets the bot draw on details from past conversations. Grok also got a canvas-like tool for creating docs and apps.
