Submitted by besabestin t3_10lp3g4 in MachineLearning
suntehnik t1_j5ykwei wrote
Reply to comment by besabestin in Few questions about scalability of chatGPT [D] by besabestin
Just speculation here: maybe they store generated text in a buffer and when they run out of memory buffer can be flushed to get allocation back for other tasks.
Viewing a single comment thread. View all comments