

Outside of computing costs, there is no limit to the length of video generated, but keeping the output coherent and contextual for more than a few seconds is a completely different puzzle to solve.
Same reason ChatGPT 3.0 could make realistic Reddit comments half a decade ago but the latest models still can’t generate more than a paragraph or two before losing the thread.
I don’t really get the concern. Anyone who cares about understanding reality isn’t looking to social media for it, and the people who don’t care about reality don’t need AI to believe whatever nonsense is convenient for them.
Are we doomed? Yes, but AI isn’t changing that.