Researchers from Meta and the University of Waterloo have introduced MoCha, an artificial intelligence system that generates fully animated characters with synchronized speech and natural movements. The system is built on a 30-billion-parameter diffusion transformer and produces HD video clips of roughly five seconds at 24 frames per second.

For precise lip synchronization, MoCha uses a "Speech and Video Window Attention" mechanism and was trained on 300 hours of carefully filtered video content. The system can also generate multi-character scenes via a simplified prompt system that lets users reference characters through simple tags, and it focuses on close-ups and medium shots.

Independent experts have judged the generated videos to be realistic, highlighting the quality of the natural movements and lip synchronization. Potential applications include digital assistants, virtual avatars, advertising, and educational content.
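The core idea behind a window attention mechanism for lip sync is that each video frame attends only to the audio tokens near its own moment in time, rather than to the whole speech track. The sketch below is a minimal illustration of that idea, not MoCha's actual implementation: the function name, the linear frame-to-audio alignment, and the window size are all assumptions made for the example.

```python
import numpy as np

def speech_video_window_mask(num_frames, num_audio_tokens, window=2):
    """Boolean mask: frame f may attend to audio tokens within
    +/- `window` of its aligned position (illustrative sketch only)."""
    mask = np.zeros((num_frames, num_audio_tokens), dtype=bool)
    for f in range(num_frames):
        # Assume a simple linear alignment between frames and audio tokens.
        center = round(f * (num_audio_tokens - 1) / max(num_frames - 1, 1))
        lo = max(0, center - window)
        hi = min(num_audio_tokens, center + window + 1)
        mask[f, lo:hi] = True
    return mask

mask = speech_video_window_mask(num_frames=6, num_audio_tokens=12, window=2)
print(mask.shape)  # → (6, 12)
```

In a cross-attention layer, such a mask would zero out (or set to -inf before softmax) all audio positions outside each frame's window, keeping mouth movements tied to the locally spoken sounds.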