Mochi AI: A new SOTA in open-source video generation models
Mochi 1 preview is an open state-of-the-art video generation model with high-fidelity motion and strong prompt adherence. The new model dramatically closes the gap between closed and open video generation systems.
Mochi 1 DiT Introduction
Mochi 1 demonstrates dramatic improvements in quality of motion as well as extremely strong prompt adherence.
Mochi AI: Try Mochi 1 for free at Mochi-AI.com
Mochi 1 is the first step toward building world simulators that can imagine anything, whether possible or impossible.
Mochi AI - Building frontier models for video generation
Video is the ultimate medium for human-AI interaction, seamlessly integrating text, audio, images, and 3D into one unified experience.
What Users Say About Mochi AI
Sarah Mitchell
Fashion Blogger
Embracing the Mochi AI tool has transformed my content approach. It allows me to swiftly produce top-tier videos for my campaigns, captivating my audience in unprecedented ways. The outcomes have far surpassed my initial hopes!
Emma Hill
Frequent Shopper
Mochi AI has revolutionized my social media content creation process. The user-friendly tool enables me to craft captivating videos within minutes, leading to a substantial increase in our engagement metrics.
Lisa Green
Style Consultant
Mochi AI has been a transformative addition to my projects. Its simplicity and ability to produce impressive videos have saved me countless hours in the editing suite. I wholeheartedly recommend it to anyone seeking to elevate their video content game.
John Doe
Retail Manager
I've found Mochi AI to be an invaluable asset in elevating our marketing strategy. The tool's ease of use and quick video creation have not only saved us time but also significantly increased our customer engagement.
Michael Brown
Tech Enthusiast
I'm always on the lookout for innovative tools that can streamline my content creation. Mochi AI has exceeded my expectations. It's intuitive, efficient, and the quality of the videos it produces is top-notch.
Tom White
Entrepreneur
Mochi AI has been a phenomenal tool for my entrepreneurial ventures. It's intuitive, efficient, and the quality of videos it produces has taken my social media presence to new heights.
FAQ About Mochi AI
What is Mochi 1 DiT (Mochi AI)?
Mochi 1 preview is an open state-of-the-art video generation model with high-fidelity motion and strong prompt adherence. Our new model dramatically closes the gap between closed and open video generation systems.
What specifically about Mochi AI allows it to perform so competitively with the leading closed models?
1. Prompt Adherence: Mochi 1 aligns closely with textual prompts, so generated videos accurately reflect the given instructions. This gives users detailed control over characters, settings, and actions. We benchmark prompt adherence with an automated metric that uses a vision language model as a judge, following the protocol from OpenAI's DALL-E 3. The judge model used to evaluate generated videos is Gemini-1.5-Pro-002.
2. Motion Quality: Mochi 1 generates smooth videos at 30 frames per second for durations up to 5.4 seconds, with high temporal coherence and realistic motion dynamics. It simulates physical phenomena such as fluid dynamics and the movement of fur and hair, and it renders consistent, fluid human action that is beginning to cross the uncanny valley. To evaluate motion quality, raters were instructed to focus on motion rather than frame-level aesthetics (criteria include interestingness of the motion, physical plausibility, and fluidity). Elo scores are computed following the LMSYS Chatbot Arena protocol.
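To illustrate how Elo scores can be derived from pairwise rater preferences, here is a minimal Python sketch of a standard online Elo update. It is not the evaluation code used for Mochi 1. The K-factor of 32, the starting rating of 1000, the function names, and the model names in the usage lines are all assumptions chosen for illustration.

```python
# Minimal online Elo sketch (illustrative only, not the Mochi 1 evaluation code).
# Assumptions: K-factor of 32, starting rating of 1000, hypothetical model names.

def expected_score(rating_a, rating_b):
    """Probability that A is preferred over B under the Elo model."""
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400.0))


def update_elo(ratings, winner, loser, k=32.0, initial=1000.0):
    """Update the ratings dict in place after a rater prefers `winner` over `loser`."""
    r_win = ratings.get(winner, initial)
    r_lose = ratings.get(loser, initial)
    # The winner gains what the loser gives up, scaled by how surprising the result was.
    delta = k * (1.0 - expected_score(r_win, r_lose))
    ratings[winner] = r_win + delta
    ratings[loser] = r_lose - delta
    return ratings


# Usage: one rater prefers the video from "model_a" over the one from "model_b".
ratings = {}
update_elo(ratings, "model_a", "model_b")
print(ratings)  # {'model_a': 1016.0, 'model_b': 984.0}
```

Each comparison moves the two ratings by the same amount in opposite directions, so a win over a higher-rated model counts for more than a win over a lower-rated one.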
Is Mochi AI accessible to the general public?
We are thrilled to announce a research preview of Mochi 1, our latest open-source video generation model. Mochi 1 demonstrates dramatic improvements in quality of motion as well as extremely strong prompt adherence. Licensed under the Apache 2.0 license, a preview of Mochi 1 is freely available for personal and commercial use.
What are the current limitations of Mochi AI?
Under the research preview, Mochi 1 is a living and evolving checkpoint with a few known limitations. The initial release generates videos at 480p. In some edge cases with extreme motion, minor warping and distortions can occur. Mochi 1 is also optimized for photorealistic styles, so it does not perform well with animated content. We anticipate that the community will fine-tune the model to suit various aesthetic preferences. Additionally, we have implemented robust safety moderation protocols in the playground to ensure that all video generations remain safe and aligned with ethical guidelines.
What is coming next?
Today, we are releasing the Mochi 1 preview, showcasing the capabilities of our 480p base model. But this is just the beginning. Before the end of the year, we will release the full version of Mochi 1, which includes Mochi 1 HD. Mochi 1 HD will support 720p video generation with enhanced fidelity and even smoother motion, addressing edge cases such as warping in complex scenes. Looking beyond this release, we are working on image-to-video capabilities. Additionally, we are focused on improving the controllability and steerability of the models to give our users even more precise control over their outputs.
Future vision?
The Mochi 1 preview has limitations, including a 480p resolution chosen for computational efficiency on end-user devices. Looking forward, we will continue to advance the SOTA in video generation with support for high-resolution, long video generation as well as image-to-video synthesis.
Can't wait to try Mochi AI?
Let's give it a try now!