Ticker

6/recent/ticker-posts

Elon Musk's AI sees, speaks and understands... but not yet on all smartphones

Elon Musk's AI sees, speaks and understands... but not yet on all smartphones

Grok, the artificial intelligence developed by xAI, has reached a new milestone. The assistant can now analyze in real time what your phone's camera sees. And that's not all: it's also becoming multilingual and more interactive.

Elon Musk's AI sees, speaks and understands... but not yet on all smartphones

For several months, large technology companies have been accelerating in the field of artificial intelligence. OpenAI with ChatGPT, Google with Gemini, and now xAI with Grok are making more and more announcements. These assistants are no longer content to just answer questions. They are gaining new abilities: seeing, speaking, listening, remembering. The goal is clear: to create assistants capable of interacting in real time with the real world.

The latest development, Grok can now "see" what your smartphone's camera is filming. Called Grok Vision, this feature allows you to analyze an object, document, or scene live, to instantly answer your questions. The option is available on the iOS app, but not yet on Android. It works on a variety of elements: street signs, business cards, products, packaging, or printed text. For example, it can explain the meaning of a symbol, help you translate a poster, or identify an object in a store.

Grok becomes visual, vocal, and multilingual with new interactive features

In addition to vision, xAI is rolling out new voice features. Grok now understands multiple languages and can respond to oral, a bit like Gemini Live or ChatGPT's voice mode. This voice interaction also allows for real-time searches, simply by speaking. These new features are accessible via the SuperGrok plan, billed at 45.60 euros per month, with the exception of Grok Vision, which remains free for all iOS users.

Grok also integrates a memory, capable of retaining past exchanges with the user to offer more personalized responses. A “studio” function also allows documents or applications to be generated by voice or visual command. With these additions, xAI seeks to position itself as a concrete alternative to dominant AI. Real-time vision, combined with seamless interaction, brings it closer to a true intelligent assistant, capable of understanding context, objects, and language.

Post a Comment

0 Comments