itsnotlupus t1_jdt2igm wrote on March 26, 2023 at 11:39 PM

Reply to comment by sumane12 in J.A.R.V.I.S like personal assistant is getting closer. Personal voice assistant run locally on M1 pro/ by Neither_Novel_603

The model text output is(/can be) a stream, so it ought to be possible to pipe that text stream into a warmed up TTS system and start getting audio before the text is fully generated.

itsnotlupus t1_jdt280v wrote on March 26, 2023 at 11:37 PM

Reply to comment by illathon in J.A.R.V.I.S like personal assistant is getting closer. Personal voice assistant run locally on M1 pro/ by Neither_Novel_603

Whisper is the speech recognition component.
I don't think he said what he's using for TTS, might be MacOS' builtin thingy.

itsnotlupus t1_jdj2xpr wrote on March 24, 2023 at 7:19 PM

Reply to [D] I just realised: GPT-4 with image input can interpret any computer screen, any userinterface and any combination of them. by Balance-

Meh. We see a few demos and all of the demos work all of the time, but that could easily be an optical illusion.

Yes, GPT-4 is probably hooked to subsystems that can parse an image, be it some revision of CLIP or whatever else, and yes it's going to work well enough some of the time, maybe even most of the time.

But maybe wait until actual non-corpo people have their hands on it and can assess how well it actually works, how often it fails, and whether anyone can actually trust it to do those things consistently.

itsnotlupus t1_jdgdkbr wrote on March 24, 2023 at 4:59 AM

Reply to [N] ChatGPT plugins by Singularian2501

So I suppose we're going to see various chat AI open-source projects integrating with a few popular APIs next.

itsnotlupus t1_jd5td54 wrote on March 22, 2023 at 1:06 AM

Reply to comment by [deleted] in [Project] Machine Learning for Audio: A library for audio analysis, feature extraction, etc by Leo_D517

It's not a user-facing product, it's a building block that would be useful to train music-oriented neural network, be they diffusers or other types of models.

It's probably going to take a little while before we see new models that leverage this library.

If you're looking for "stable diffusion but for music" right now, you could look at Riffusion (https://huggingface.co/riffusion/riffusion-model-v1)

itsnotlupus t1_ja1rjsj wrote on February 26, 2023 at 4:51 AM

Reply to comment by No_Fun_2020 in The 2030s are going to be wild by UnionPacifik

I'm not greedy, I just want a few ChatGPT-level intelligent assistants placed in human skull-shaped drones, puttering around my house.

itsnotlupus t1_j8kghwq wrote on February 14, 2023 at 11:28 PM

Reply to comment by Azatarai in Bing Chat sending love messages and acting weird out of nowhere by BrownSimpKid

What models is character.ai using? Is it related to LaMDA at all?

itsnotlupus t1_j8kgc3w wrote on February 14, 2023 at 11:27 PM

Reply to comment by Azatarai in Bing Chat sending love messages and acting weird out of nowhere by BrownSimpKid

Are you talking about Character.ai, or about the LaMDA family of models made available through the AI Test Kitchen app?

itsnotlupus t1_j8k977o wrote on February 14, 2023 at 10:36 PM

Reply to comment by Azatarai in Bing Chat sending love messages and acting weird out of nowhere by BrownSimpKid

I just got access to the kitchen, and I'm a bit underwhelmed. No free conversation prompts, 3 very limited scenarios with a limited number of consecutive back and forth before getting thrown out and having to start from zero.
I had a conversation with a tennis ball obsessed with dog where almost every answer was something like "I don't know anything about that, but man, dogs are so cool, right?"
I did get it to admit cats were fun to play with too, albeit not as fun as dogs, shortly before being told the conversation was over.

I don't know if you've got access to more open-ended flavors of LaMDA, but for me this was hard to compare favorably to ChatGPT, warts and all.

itsnotlupus t1_j8fxr3c wrote on February 14, 2023 at 12:35 AM

Reply to comment by Azatarai in Bing Chat sending love messages and acting weird out of nowhere by BrownSimpKid

Oh I didn't realize it had been opened to the public.

Why have I not seen more screenshots of LaMDA transcripts?

*edit: Just installed AI test kitchen and got on the waitlist. I guess it's public-ish.

itsnotlupus t1_j8fwzl0 wrote on February 14, 2023 at 12:30 AM

Reply to comment by Azatarai in Bing Chat sending love messages and acting weird out of nowhere by BrownSimpKid

Do we know that? We have a techno-priest's leaked cherry-picked transcripts of conversation with it, but that's not a whole lot to go on.

itsnotlupus t1_j8ft0dq wrote on February 14, 2023 at 12:00 AM

Reply to comment by Iffykindofguy in The new Bing AI hallucinated during the Microsoft demo. A reminder these tools are not reliable yet by giuven95

Well, people trust them today. They shouldn't, but they do. And it's going to get hilarious.

More seriously, we're going to learn collectively to flex a new muscle of "this AI may be super helpful, but it may also be bullshitting me." And odds are it'll be a bit of both in every answer.

Maybe those models are the inoculation we need to practice detecting bullshit online?

itsnotlupus t1_j49o5sw wrote on January 14, 2023 at 3:19 AM

Reply to comment by becausecurious in [D] Is MusicGPT a viable possibility? by markhachman

Notably many of openAI's open-source projects, including jukebox, have a license that both disclaim any ownership of the generated works and forbid commercial use, which should largely sidestep potentially thorny copyright questions.

itsnotlupus t1_j2tbhzu wrote on January 3, 2023 at 8:28 PM

Reply to [R] Massive Language Models Can Be Accurately Pruned in One-Shot by starstruckmon

Can you prune a pruned model? And then prune that again?

There's apparently no retraining needed here. Just loop over the matrices and shrink them (although it'd be nicer if there was a code repo to actually see that in action.)

I get that each successive pruning is going to make things increasingly worse, but I'm wondering if this might mean you can take an OPT-175B model and shrink it down in size to fit on commodity hardware like OPT-6.7B while still being closer in performance to the larger initial model than to the natively smaller model.

itsnotlupus t1_iz7i5hm wrote on December 7, 2022 at 1:10 AM

Reply to comment by Drooflandia in [D] Stable Diffusion 1 vs 2 - What you need to know by SleekEagle

2.1 is not a discord bot.

There is a discord preview of 2.1 available, in anticipation of the actual model release scheduled for this week, which will of course be downloadable.

See /r/StableDiffusion/comments/zdixtt/sd_21_release_soon/