The Prohuman
Posts
Google brings Gemini into video creation

Google brings Gemini into video creation

Plus: Figma puts agents on the canvas

Rimsha Bhardwaj, Hasan Toor & Ihtesham Haider
May 21, 2026

In partnership with

Hello, Prohuman

Today, we will talk about these stories:

Gemini Omni starts with AI video
Figma moves AI inside the workflow
Stability AI’s music bet gets serious

Accio Work: your agentic team for real business

Meet Accio Work—the agentic workspace for business owners and solopreneurs. Our specialized agent team manages operations for you—sourcing products, negotiating with suppliers, managing stores, and launching marketing—all on autopilot.

With verified skills and business tool APIs, Accio Work takes action while you stay in control. Powered by Alibaba.com’s 1B+ products and global supplier data, it handles product selection and execution seamlessly. No setup required—just results.

Try Accio Work Now!

Google wants video editing to feel conversational

Image Credits: Google

Google says Gemini Omni can take images, audio, video, and text, then generate or edit a video from them.

The first model, Gemini Omni Flash, is rolling out to Gemini app, Google Flow, and YouTube Shorts. It starts with video, supports multi-turn edits, and includes SynthID watermarking for videos made with the model.

The most important part is the editing flow. A prompt box is less interesting than a system that remembers the scene, keeps characters consistent, and lets people revise without starting over.

This is Google putting AI video closer to everyday creation, especially through Shorts, where casual users already have a camera roll full of clips and background noise.

The avatar feature will draw attention because it lets users create videos with their own voice and likeness. Google is moving carefully on speech editing, which makes sense given how quickly trust can break around synthetic video.

The real test is whether people use Omni to finish videos, or just make impressive demos.

Figma wants AI inside the design room

Image Credits: Figma

Figma’s new AI agent can sit inside the same collaborative canvas where teams already argue over buttons, flows, and edge cases.

The assistant lets users generate designs, edit existing work, and run multiple agents at once through text prompts. It follows Figma’s partnerships with Anthropic and OpenAI, and arrives as the company reports $333.4 million in first-quarter revenue, up 46% from a year earlier.

This feels like the right place for Figma to push. Designers do not need another blank prompt box as much as they need help inside the messy file where work already happens.

The real test is control. If the agent understands layout, components, and product context, it could reduce grunt work without making every screen feel machine-made.

Competition from Canva, Adobe, Flora, Krea, and Dessn gives Figma a clear reason to move fast. The cursor, the side panel, and the canvas are becoming the new AI battleground.

The question is whether teams will trust an agent enough to let it touch the source file.

Stability AI wants full songs, not clips

Image Credits: Stability AI

Stability AI now says its top audio model can generate music that runs 6 minutes and 20 seconds.

The company released Stable Audio 3.0, a four-model family for sound effects, short music, and full compositions. Three models come with open weights, while the 2.7B-parameter large model sits behind paid API and self-hosting access.

That split matters. Stability is giving developers enough to experiment, while keeping the most commercially useful model under tighter control.

The timing is also practical, because music AI is moving from demo clips toward products that labels, publishers, and working musicians might actually have to judge.

The licensing claim is the key detail here, especially while Suno and Udio are still fighting lawsuits. A studio screen, a pair of headphones, and a six-minute generated track now feel less like a toy test.

The open question is whether musicians see this as useful software or another licensing fight waiting nearby.

Prohuman team

Hasan Toor

Covers emerging technology, AI models, and the people building the next layer of the internet.

Founder

Ihtesham Haider

Writes about how new interfaces, reasoning models, and automation are reshaping human work.

Founder

Free Guides

Explore our free guides and products to get into AI and master it.

Ultimate Advanced ChatGPT Mastery Course

Ultimate Advanced ChatGPT Mastery CourseGPT-4 is the most advanced language model yet, and it has the potential to revolutionize the way we communicate and interact with computers.With the help of this course, you'll gain a deep understanding of GPT-4 and its capabilities, and you'll be well-equipped to start using it for your own projects and applicationsThis Guide includes :80+ New Chapters2000+ New GPT-4 Prompts1000+ AI ToolsSave 700+ hours on researchPS: If you like 💚 my content, follow me on Twitter for more such resources & tools .Drop a 👋 on https://twitter.com/hasantoxr

hasantoxr.gumroad.com/l/adc

The Ultimate Midjourney Guide

The Ultimate Midjourney GuideThis Ultimate Midjourney guide is created to teach you all you need to know from beginner to advanced levels so you can use Midjourney to level up your life.This Guide includes :How to get started with Midjourney?Some Basics For Beginners.Advance Level Commands.How to Create Structured Prompts.MidJourney Cheat Sheet For The Best Results.List of Midjourney Commands.How to use your own image as a source for Midjourney?How to I merge a few existing images into one? How to use a consistent character across multiple images? How to ask Midjourney to slightly modify an existing image.How to create an image that mimics a certain artist's style? Best Prompt Generators.GitHub Repository for Midjourney.Best Resources for Midjourney.PS: If you like 💚 my content, follow me on Twitter for more such resources & tools . Drop a 👋 on https://twitter.com/hasantoxr

hasantoxr.gumroad.com/l/mid

Ultimate Prompt Engineering Guide

Ultimate Advanced ChatGPT Mastery CourseGet ready to dive into the world of prompts with the 'Ultimate Prompt Engineering Guide for Beginners'. This easy-to-understand guide is perfect for those who are just starting out or want to learn more. By the end of it, you'll be amazed at what you've learned and what you can do. So whether you're wanting to start a new hobby, or aiming to be a pro, this guide is the perfect starting point.This Guide includes :20+ New Chapters2000+ New GPT-4 Prompts1000+ AI Tools10+ Curated Cheat SheetsSave 1000+ hours on researchPS: If you like 💚 my content, follow me on Twitter for more such resources & tools .Drop a 👋 on https://twitter.com/hasantoxr

hasantoxr.gumroad.com/l/pb

All of them are free to access and would stay free for you.

Feeling generous?

You know someone who loves breakthroughs as much as you do.

Share The Prohuman it’s how smart people stay one update ahead.