Akimbo333 t1_jck73ph wrote on March 17, 2023 at 1:02 PM

Reply to comment by Hands0L0 in Those who know... by Destiny_Knight

Well you could always ask it to continue the sentence

Hands0L0 t1_jck7ifi wrote on March 17, 2023 at 1:06 PM

Not if there is a token limit.

I'm sorry, I don't think I was being clear. The token limit is tied to VRAM. You can load the 30b on a 3090 but it shallows up 20/24 gb of VRAM for the model and prompt alone. That gives you 4gb for returns

Akimbo333 t1_jcka9ef wrote on March 17, 2023 at 1:28 PM

Oh ok. So you can't make it keep talking?

Hands0L0 t1_jckbm7h wrote on March 17, 2023 at 1:39 PM

No, because the predictive text needs the entire conversation history context to predict what to say next, and the only way to store the conversation history is in RAM. If you run out of RAM you run out of room for returns.

Akimbo333 t1_jckc9iu wrote on March 17, 2023 at 1:44 PM

Damn! There's gotta be a better way to store conversations!!! Maybe one day

Hands0L0 t1_jcknz03 wrote on March 17, 2023 at 3:06 PM

Study CS and come up with a solution and you can be very rich

Akimbo333 t1_jckt4js wrote on March 17, 2023 at 3:40 PM

Oh yeah, I bet, lol!!!