Viewing a single comment thread. View all comments

Talkat t1_itag4t9 wrote

Really do wonder where we will be at in five years.

We will certainly have very accurate video clips (vs still inaccurate images now).

We will be able to feed language models into image generators, but I assume we will have a single integrated model for that.

And we are getting close to voice and music, so certainly in five years we will have mastered that.

Meaning we will have a model that will be able to generate videos with voices, sound tracks and sfx, shots, dialogue, and a story. Potentially we will be able to make longer form content (perhaps episodes or movies by then)

Tesla bot will be able to take commands and do activities. Plugged the net above will have voice.

Deepmind will continue their work with a general purpose model which will be able to take in problems and solve them. The question is will they be able to recursively improve an AI model. That is the biggest unknown. If so that will outshine everyone else's work.

Right now we have a few companies making images (Facebook, openai, midjourney, openai). I expect there to many companies doing it and very good models you can run on your own GPU at home that makes decent output.

Expect search to have largely changed so instead of googling for websites you ask an AI a question and it generates an answer.

Expect a lot more voice input/discussion with AI. Instead of giving it commands you give guidance (eg play some good music vs play red hot chilli peppers)

Talking of music I think a majority of music in five years will be AI generated and will be fantastic (mentioned above)

Self driving will be solved by Tesla. Don't think anyone else will have it. They may license it out to other companies.

With self driving solved Tesla will focus on other areas, potentially general purpose AI.

Good set of predictions here. Interested to see how accurate they are :)

3