Yuli-Ban t1_ir7950h wrote

We've already seen it.

https://plai.cs.ubc.ca/2022/05/20/flexible-diffusion-modeling-of-long-videos/

> Dr. Wood says “This is simply the most impressive AI result I have personally seen in my career. Long range coherence is a challenge even for modern language models with massive parameter counts. Will, Saeid, Vaden, and Christian have taken a huge step forward by being able to stably generate coherent, photo-realistic 1hour+ long videos; 70x’s longer than their longest training video, and more than 2000x’s longer than the maximum of 20 frames they ever look at at once during training. There is something very special in the training procedure they have developed and the architecture they employ. Never have we been closer to being able to formulate AI agents that plan visually in domains with life-like complexity.”

15

Wassux t1_ir7ry2s wrote

Did they just invent short term memory?

7