Google I/O Unveils AI: HD Video Creation via Text Commands, Fraud Prevention, Advanced Math-Physics Problem Solving

Google’s annual developer conference, Google I/O 2024, took place on Tuesday, May 14th. Unlike previous years, the event did not witness the launch of any new devices. Instead, the company’s CEO, Sundar Pichai, inaugurated the event, emphasizing the primary focus on AI features. 

He stated that the newly introduced Gemini 1.5 Pro is now available globally for developers and consumers alike. Alongside this announcement, Google unveiled a range of features, including AI-powered search, on-device AI, real-time scam protection, the AI video model Veo, and the image generator Imagen 3. These advancements signal Google’s commitment to integrating AI to enhance user experience and security on a global scale.

Six Key Features Launched at the Event

Let’s delve into the six special features launched at this event…

AI-Powered Search: Solving Math and Physics Questions

Google’s Circle to Search feature for Android continues to evolve. It lets you circle any object on your Android phone’s screen and search for it on Google. Beyond searching for objects, it can now also suggest how to solve math and physics questions.

Real-Time Call Scam Protection: Assisting in Avoiding Fraudulent Calls

Google is currently testing a feature that provides users with warnings about potential scams. If you receive a suspicious call, the system will help you avoid potential fraud by providing real-time warnings.

Generative AI Video Model: Creating Videos from Text Prompts

Google has unveiled the generative AI video model, Veo. It is Google’s latest text-to-video generation model capable of generating cinematic quality videos in HD. The company invites filmmakers and creators to experiment with this model.

Previously, OpenAI released a model called ‘Sora’ capable of generating 60-second-long videos. Google claims that its model can generate videos longer than 60 seconds. Veo understands terms like aerial shots and timelapses.

Ask Photos: Finding Your Photos with AI Assistance

Google’s new feature, Ask Photos, will be rolled out shortly, with additional capabilities to follow. It uses AI to let Google Photos answer requests such as “Show me my daughter’s swimming progress.” To do so, it searches your photos for matches to the query and assembles them into a collection.

Imagen 3: Updated Version of Text-to-Image Generator

The company has also launched Imagen 3, an updated version of the text-to-image generator. Compared with the previous version, Imagen 3 creates photorealistic images with far fewer artifacts. It will soon be available on Google’s Vertex AI platform.

Imagen 3 better understands the nuances of natural language, improving its ability to create lifelike images. Sign-ups for Imagen 3 in the ImageFX service have opened today, with availability for developers and enterprise customers expected shortly.

Project Astra Announcement: AI Assistant to Aid Daily Life

Google calls Project Astra the “future of AI assistants.” It’s a universal AI agent designed to assist in daily life. After activating your phone camera, you can point it at any object and promptly receive detailed information about it. Demis Hassabis, CEO of Google DeepMind, mentioned that conversational speed and natural quality are crucial for Project Astra.

Gemini AI 1.5 Pro: Enhanced Multilingual Support

Google has integrated its latest AI, Gemini 1.5 Pro, into the sidebar of productivity apps such as Docs, Sheets, Slides, Drive, and Gmail. This virtual assistant will provide access to detailed information about all your saved data. In addition to this, Gemini AI will support 68 languages in Google Meet.

A new learning coach, Gem Learn, will be introduced in Google Gemini in the coming months. It provides a step-by-step study guide to help you understand a topic rather than just providing answers.

Google Gemini: Unveiling New Learning Coach and Multimodal Capabilities

In the upcoming months, Google Gemini will introduce new features such as a Learning Coach, which provides step-by-step guidance instead of just answers, enhancing your understanding through structured practice. This year’s version of Gemini, to be available on Pixel devices by the end of the year, will come with multimodal capabilities.

Google has also announced a new open language model, Gemma 2, scheduled for release in June. Gemini-driven features will soon be available in Google Workspace, where Google is promoting the Gemini-powered sidebar.

Furthermore, Google Search will provide AI Overviews, AI-generated summaries of search results that offer detailed insights into specific topics. To support expanded functionality with minimal delays, Gemini 1.5 Flash has been launched with a context window of 1 million tokens. CEO Sundar Pichai stated that Google has been investing more time in AI lately and is making significant strides in AI advancements.

Gemini 1.5 Pro has now been made available to developers in the global market. Advanced Gemini users will receive 2 million tokens instead of the previous 1 million tokens.

Google Event at Shoreline Amphitheater

Google’s recent event was held at the Shoreline Amphitheater in Mountain View, California. This event commenced at 10:30 PM IST. The live streaming of this event was available on the company’s YouTube channel and social media handles.

Ahead of the event, it was speculated that Google might launch the Google Pixel Fold 2, Wear OS 5, and Google TV, but Google did not unveil these products.

Google I/O takes place every year. The company’s Gemini assistant was previously known as Bard. Google is gradually expanding Gemini AI into all of its applications.

Google’s First I/O Event in 2008

Google hosted its first I/O event in 2008, marking the beginning of an annual tradition. Through this event, the company introduces many new gadgets and showcases cutting-edge technology to the public.

Over the years, Google has unveiled various innovative products at this event. At last year’s Google I/O 2023, Google introduced its first foldable phone, the Pixel Fold, along with Gemini AI tools, the Pixel 7a smartphone, and the Pixel Tablet.

Arvind Amble

My name is Arvind Amble. As a tech enthusiast and writer, I'm fascinated by the ever-evolving world of technology, AI, iOS, Android, Software & Apps, and Digital Marketing. With a keen eye for emerging trends and a passion for innovation, I bring a fresh perspective to my writing, blending technical expertise with a creative flair.