Google Flow and Veo 3.1: audio arrives on all functions, editing gets sharper
Google just extended Veo 3.1 and Flow with native audio on Ingredients to Video, Frames to Video and Extend, plus new precise editing tools. What it changes for AI video creators.

For a long time, audio in Flow stayed confined to a few specific modes. The recent update of Veo 3.1 in Google Flow changes that: native audio is now available across all the main functions, including Ingredients to Video, Frames to Video and Extend.
It is a discreet change in name, but a significant one in practice.
What changed in Flow with Veo 3.1
Veo 3.1 is not a new version of the model in the sense of an overhaul. It is an evolution of Veo 3 with three main axes according to Google: better prompt adherence, reinforced audio, and increased realism on textures and movements.
What it concretely gives in Flow:
Audio on all functions. Before this update, native audio (sound effects, ambience, synchronized dialogue) worked mainly in Text-to-Video. From now on, Ingredients to Video, Frames to Video and Extend also generate audio in a synchronized way. A shot that starts from a reference image can now include its sound ambience right from generation.
Precise clip editing. Flow integrates tools to insert new elements into a scene or remove unwanted objects. The model handles the details like shadows and lighting during these modifications, which avoids the usual visual inconsistencies in this type of editing.
Gemini Omni Flash in Flow. The Gemini Omni Flash model is integrated into Flow and available to Gemini AI Plus, Pro and Ultra subscribers. It lets you mix real references with generated content and iterate conversationally on the result.
💡 Frank's Cut: Audio on Ingredients to Video is the real news here. Building a scene from several reference elements and directly getting a coherent sound ambience is a step that used to take post-production time. Now it is in the first export.
Why native audio in Flow changes the workflow
Until now, AI video generation and sound lived in separate steps. You generated the clips, then added the audio in post-production: sound effects in a DAW, ambience in Premiere, synchronized dialogue by hand.
Veo 3.1 integrates these two dimensions right from creation. It is not perfect for every case (the precise art direction of sound stays in the specialized tools), but it speeds up prototyping and the delivery of drafts.
For creators who make short films, ads or branded content, having a clip with its coherent sound ambience right from the first export lets you present something convincing to the client without going through a preliminary audio editing session.
It is particularly useful on the Frames to Video and Extend functions, where you start from existing content. A real-shot drone clip, extended by Veo 3.1, that recovers its sound ambience in coherence with the context: that is a hybrid workflow that used to get complicated.
Veo 3.1 in the market context
Google launched Veo 3 at Google I/O 2025 with native audio as the big promise. Veo 3.1 (January 2026) extended the capabilities with 4K, vertical videos and multi-shot consistency. This Flow update completes the picture by making audio accessible from every entry point of the workflow.
In the AI video market in June 2026, Veo 3.1's positioning stays solid: better prompt adherence and very high visual quality, but less direct access than tools like Kling or Seedance. Flow is available for personal Google accounts at 10 free generations per month, and for Gemini Pro and Ultra subscribers with larger quotas.
The Veo 3.1 API is accessible via Google AI Studio and Vertex AI for developers who want to integrate video generation into their own pipelines.
Compared to the Kling/Seedance ecosystem, which is progressing fast on volume and cost, Flow stays stronger on raw quality and integration into the Google workspace. Not the same market, not the same main use.
To go further on how to organize an AI video production pipeline, the A-to-Z guide from script to shot list lays the foundations in a practical way.
What these updates mean for independent creators
A few points to keep in mind to decide whether this changes your workflow:
- If you already use Flow on a Gemini subscription, audio on all functions is a direct improvement at no extra cost.
- If you make hybrid content (real footage + AI), Frames to Video with audio is now much more useful.
- If your pipeline is already dialed in around Kling or Seedance, this update does not justify a switch. But if you are in the Google ecosystem, it is worth testing.
- The precise editing tools (insertion/removal of elements) are still young and deserve testing before being included in a client production flow.
Google's direction is coherent: making Flow a complete video creation environment, not just a clip generator. They are getting there progressively.
Use the cinematic prompt generator to prepare prompts suited to the new audio and reference capabilities in Veo 3.1.
FAQ
Foire aux questions
Réponses rapides aux questions les plus fréquentes sur cet article.
Is Veo 3.1 with audio accessible for free?
Yes, partially. Personal Google accounts have access to 10 generations per month via Google Vids and Flow. The quotas increase with the Gemini AI Plus, Pro and Ultra subscriptions. The Veo 3.1 API is paid via AI Studio and Vertex AI.
What is the difference between Veo 3 and Veo 3.1?
Veo 3 came out at Google I/O 2025 with native audio as the major new feature. Veo 3.1 (January 2026) added 4K, vertical video, multi-shot consistency and better prompt adherence. The mid-2026 Flow update extends native audio to all the workflow functions.
Can Flow replace a classic editing tool like Premiere?
No, not at this stage. Flow is a tool for creating and prototyping AI clips, not a full NLE. For the final edit, color grading and professional audio mixing, the classic tools remain necessary. Flow and Premiere are more complementary.
Is the audio generated by Veo 3.1 royalty-free?
The audio generated via Google tools is covered by Google's terms of use for AI-generated creations. For commercial use, read the Gemini and Flow terms. Google has clarified that creations from its generative tools remain the property of the user for standard uses.
Can you use Veo 3.1 from France?
Yes. Veo 3 and 3.1 are available in most European countries via Gemini and Google Vids since the early 2026 expansion. Access may vary slightly depending on the app versions and the regional settings, but web access works.
Frames to Video, what is it exactly?
It is a Flow feature that generates a video from several key images you provide. You define the start image and the end image (or several intermediate frames), and Veo 3.1 creates the movement and the transitions between them. With the recent update, this generation now includes synchronized audio.