This is a new family of open-source video models designed to take on Gen-2. There is a 576x320 model that runs in under 8 GB of VRAM and a 1024x576 model that runs in under 16 GB. The recommended workflow is to render with the 576 model, then upscale to 1024x576 with vid2vid via the text2video extension for the AUTOMATIC1111 web UI. This gives better compositions overall and lets you explore ideas quickly before committing to a high-res render; a rough sketch of the two-stage workflow is below.
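
For reference, here is a minimal sketch of that two-stage workflow using Hugging Face diffusers instead of the A1111 extension. It assumes the models in question are the Zeroscope v2 checkpoints on the Hub (cerspense/zeroscope_v2_576w and cerspense/zeroscope_v2_XL), which match the resolutions and VRAM figures above; the repo names, frame count, and denoising strength are assumptions for illustration, not details from the post.

```python
# Sketch: low-res render -> vid2vid upscale, following the diffusers
# text-to-video pipelines. Repo names, num_frames, and strength are
# illustrative assumptions, not values from the original post.
import torch
from diffusers import DiffusionPipeline
from diffusers.utils import export_to_video
from PIL import Image

# Stage 1: explore compositions at 576x320 (fits in under 8 GB of VRAM).
pipe = DiffusionPipeline.from_pretrained(
    "cerspense/zeroscope_v2_576w", torch_dtype=torch.float16
)
pipe.enable_model_cpu_offload()  # offload idle modules to keep VRAM low
pipe.enable_vae_slicing()

prompt = "a golden retriever surfing a wave at sunset"
frames = pipe(prompt, num_frames=24, width=576, height=320).frames

# Stage 2: once the composition looks right, upscale via vid2vid at 1024x576.
pipe = DiffusionPipeline.from_pretrained(
    "cerspense/zeroscope_v2_XL", torch_dtype=torch.float16
)
pipe.enable_model_cpu_offload()
pipe.enable_vae_slicing()

# Resize the low-res frames to the target resolution before denoising.
video = [Image.fromarray(f).resize((1024, 576)) for f in frames]
# Lower strength preserves more of the original composition.
frames = pipe(prompt, video=video, strength=0.6).frames

print("Saved:", export_to_video(frames, output_video_path="upscaled.mp4"))
```

The point of the two passes is the same as in the A1111 workflow: the cheap 576x320 stage is where you iterate on prompts, and the strength parameter in the second stage controls how much of the low-res composition survives the upscale.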

    • Martineski@lemmy.fmhy.ml (OP, mod) · edited · 1 year ago

      Just tried it and the result was goofy, but the frame consistency and smoothness of movement are insane! I can’t wait to see how things develop.

    • Martineski@lemmy.fmhy.ml (OP, mod) · 1 year ago

      The speed of progress we are making with AI is crazy. I will be able to watch quality AI-generated movies sooner than I expected. I thought it would take five years or more in the best case, but now I think it will take less than that.

      • lynx@sh.itjust.works · 1 year ago

        At the current rate of progress, fully generated films will probably be possible next year. The audio and video parts are not good enough yet, but the quality will probably reach Midjourney v5 level within the next half year. GPT-4 can write the script for a full movie; it still needs a lot of help to get a good result, but with better fine-tuning that shouldn’t be a problem. The audio and video parts can then be combined using the ChatGPT code interpreter, which already works quite well.
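
        As a toy illustration of that last "combine the parts" step: muxing a generated soundtrack onto a generated clip is a single ffmpeg invocation, which is exactly the kind of glue code a tool like the code interpreter tends to produce. The file names here are placeholders.

        ```python
        # Toy example of the glue step: mux a generated audio track onto a
        # generated video clip with ffmpeg. Paths are placeholders.
        import subprocess

        subprocess.run(
            [
                "ffmpeg",
                "-i", "generated_video.mp4",  # clip from the video model
                "-i", "generated_audio.wav",  # track from the audio model
                "-c:v", "copy",               # keep video as-is, no re-encode
                "-c:a", "aac",                # encode audio to AAC for MP4
                "-shortest",                  # stop at the shorter stream
                "movie_scene.mp4",
            ],
            check=True,
        )
        ```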