Zhang Peng, CEO of Zhipu AI, announced at Zhipu Open Day that the AI video generation model Ying was launched on Zhipu Qingyan , which only takes 30 seconds to generate a 6-second video. From now on, all C-end users can experience the AI video generation capabilities of text and image through Ying.
According to reports, after entering a piece of text (Prompt), users can choose the style they want to generate, including cartoon 3D, black and white, oil painting, movie feeling, etc., and add the music that comes to finally generate a video clip full of AI imagination. In addition, Ying also brings more new ways to play, including emojis, advertising production, plot creation, short video creation, etc.
Zhang Peng, CEO of Zhipu AI , said: "With the continuous iteration of algorithms and data, I believe that Scaling Law will continue to play a powerful role." It is reported that the video generation model of the Ying base is CogVideoX , which can integrate the three dimensions of text, time and space, and refers to the algorithm design of Sora . At the same time, it is also a DiT architecture. By strengthening its complex instruction compliance ability, content coherence, and large-scale screen scheduling optimization, CogVideoX has increased its reasoning speed by 6 times compared with the previous generation ( CogVideo ).