Google Gemini Omni | Video creation and AI agents expand workflows

Google has published nine demos showing Gemini Omni and Gemini 3.5 Flash in action, highlighting how its latest models connect creative generation with agentic execution. Published on May 29, 2026, the article shows Gemini Omni creating and editing video from mixed inputs, while Gemini 3.5 Flash powers coding, agents, generative UI, Search experiences, and personal AI workflows.

Google Gemini Omni and Gemini 3.5 Flash AI video and agent workflow demos

{getToc} $title={Table of Contents}

Google shows how Gemini Omni connects video creation with model reasoning

Gemini Omni is presented as a model that combines Gemini's reasoning with multimodal creation. Google says it can use images, audio, video, and text as input, then generate high-quality videos grounded in Gemini's real-world knowledge.

For designers, video creators, and product teams, the most important shift is conversational editing. Instead of treating generated video as a single static output, Gemini Omni lets users keep refining the same scene through natural language while preserving character consistency, physical behavior, and previous scene context.

{getCard} $type={post} $title={More Google news} $info={AI tools, video generation, and design workflow updates} $icon={}

How Gemini Omni changes AI video editing

Google's demos show Gemini Omni editing videos through conversation. Users can ask the model to change materials, reimagine action, add objects, transform environments, shift camera angles, or continue refining a scene across multiple turns without losing the original creative thread.

This matters for visual production because many AI video workflows break when a creator asks for a second or third revision. Google's examples emphasize multi-turn control, where the video becomes a starting point that can be transformed step by step instead of regenerated from scratch.

New workflow options with Gemini 3.5 Flash

Gemini 3.5 Flash is positioned around frontier intelligence with action. Google says the model is built for complex, long-horizon agentic tasks, including coding workflows, asset organization, and multi-step execution through tools such as Antigravity.

The demos also show Gemini 3.5 Flash generating richer web UIs and graphics in AI Studio, supporting information agents in Search, and building custom generative UI such as dashboards, trackers, mini apps, simulations, and visual tools tailored to a user's question.

For designers and developers, the practical implication is that AI tools are moving beyond isolated answers. Gemini 3.5 Flash points toward systems that can plan, build, organize, and maintain interactive experiences while still requiring human supervision for quality, usability, accessibility, and final product decisions.

Availability and creative use

Google says Gemini Omni Flash is rolling out globally to Google AI Plus, Pro, and Ultra subscribers through the Gemini app and Google Flow. It is also rolling out at no cost to users on YouTube Shorts and the YouTube Create app, with developer and enterprise API access planned in the coming weeks.

Gemini 3.5 Flash is generally available through Google Antigravity, the Gemini API in Google AI Studio, Android Studio, Gemini Enterprise Agent Platform, Gemini Enterprise, AI Mode in Search, and the Gemini app globally. For production teams, the best use case is controlled experimentation across video editing, UI generation, agent workflows, and creative prototyping before committing to client-facing output.

Daisuki's Take: What This Means for Designers

We see Gemini Omni as important because it pushes AI video closer to an editable creative workflow instead of a one-shot generation process. For designers and video creators, the strongest value is multi-turn control: being able to change materials, adjust scenes, add elements, or refine camera direction without starting from zero every time.

The Gemini 3.5 Flash demos also matter because they connect creative work with agentic execution. Generative UI, coding assistance, Search experiences, dashboards, simulations, and asset organization all point toward tools that can help teams move from idea to working interface or prototype faster. That can be useful for design exploration, product mockups, campaign planning, and interactive visual concepts.

The limitation is that more capable models also make review more important. Designers still need to check continuity, visual accuracy, usability, accessibility, brand fit, licensing, and final output quality before treating generated video or UI as production-ready. Used carefully, these Gemini workflows can speed up experimentation while keeping final creative decisions in human hands.

{getCard} $type={custom} $title={What's New?} $info={More new design releases, industry news and creative updates} $icon={}

Sources and Recommended Links

9 demos of Gemini Omni and Gemini 3.5 in action | Google Blog (Official)
Gemini | Google (Official)
Google AI Studio | Google (Official)

Google Gemini Omni | Video creation and AI agents expand workflows

Google shows how Gemini Omni connects video creation with model reasoning

How Gemini Omni changes AI video editing

New workflow options with Gemini 3.5 Flash

Availability and creative use

Daisuki's Take: What This Means for Designers

Sources and Recommended Links

ElevenLabs Flows Agent | Conversational Creative Pipeline Builder

Categories

Stay Informed

Google Gemini Omni | Video creation and AI agents expand workflows

Google shows how Gemini Omni connects video creation with model reasoning

How Gemini Omni changes AI video editing

New workflow options with Gemini 3.5 Flash

Availability and creative use

Daisuki's Take: What This Means for Designers

Sources and Recommended Links

You might like