New feature allows users to create, animate, and edit videos from text prompts without the need for separate tools.
Google has turned AI-powered video generation into a native feature of Gemini. Dubbed Gemini Omni, the functionality eliminates the need for external apps or platforms, allowing users to create and manipulate audiovisual content directly within the assistant's interface.
According to released information, the system can perform three main operations from a single text prompt. The tool can generate videos from scratch, animate static images, and edit existing footage.
This integration marks a shift in how generative AI video tools are accessed. By centralizing these capabilities within its core model, Google removes the need to switch between different specialized software for audiovisual production.
The feature was tested across all three operating modes, with results detailed in technical analyses published online. The functionality demonstrates the evolution of multimodal models in understanding and synthesizing visual data in motion.
The arrival of Gemini Omni represents a broader consolidation trend in the artificial intelligence market. By embedding advanced video creation and editing capabilities into its standard assistant, Google signals a transition for these technologies—from standalone tools to utilities integrated directly into productivity ecosystems.
Gemini Omni is a new native feature of Google's Gemini assistant that allows users to generate, animate, and edit videos directly within the platform using text prompts, eliminating the need for external apps.
The system can perform three main operations from a single text prompt: generating videos from scratch, animating static images, and editing existing footage.
Google integrated these tools to centralize audiovisual production within its core model, removing the need to switch between specialized software and signaling a shift toward consolidating AI utilities into productivity ecosystems.