AI Against Humanity
Safety 📅 May 19, 2026

Google’s Gemini Omni turns images, audio, and text into video — and that’s just the start

Google's Gemini Omni generates video from several types of input, raising concerns about deepfakes and misinformation. Google has implemented safeguards to counter these risks.

Google has launched Gemini Omni, a multimodal AI model that generates high-quality video from images, audio, and text and is aimed at users with no video editing experience. Content is created and edited through simple text commands, which makes the tool accessible for producing educational videos and other formats. Those same generative capabilities raise significant concerns about misuse, including deepfakes and misinformation. To address them, Google has implemented a verification system that uses digital watermarks, along with a user onboarding process meant to promote responsible use. Even with these measures, the potential for unintended consequences and ethical dilemmas remains, particularly for the creative industries and for broader public trust in digital content. As the technology evolves and becomes more integrated into daily life, its implications will need careful consideration to mitigate the risks that come with broad accessibility and the potential for abuse.

Why This Matters

This article highlights the potential risks associated with the deployment of advanced AI tools like Gemini Omni. As AI becomes more capable of generating realistic content, the possibility of misuse increases, impacting trust and authenticity in digital media. Understanding these risks is vital for developing frameworks that ensure responsible AI use and protect society from potential harms.

Original Source

Google’s Gemini Omni turns images, audio, and text into video — and that’s just the start

Read the original source at techcrunch.com ↗
