Timdegreat t1_jaj3gpr wrote on March 1, 2023 at 8:15 PM

Will we be able to generate embeddings using the ChatGPT API?

visarga t1_jaj4lxx wrote on March 1, 2023 at 8:22 PM

Not this time. Still text-embedding-ada-002

NoLifeGamer2 t1_jaj9i1b wrote on March 1, 2023 at 8:52 PM

Gotta love getting those "Model currently busy" errors for only a single request

sebzim4500 t1_jan01xr wrote on March 2, 2023 at 4:26 PM

Would you even want to? Sounds like overkill to me, but maybe I am missing some use case of the embeddings.

Timdegreat t1_jan7sel wrote on March 2, 2023 at 5:16 PM

You can use the embeddings to search through documents. First, create embeddings of your documents. Then create an embedding of your search query. Do a similarity measurement between the document embeddings and the search embedding. Surface the top N documents.

sebzim4500 t1_jan85s7 wrote on March 2, 2023 at 5:18 PM

Yeah, I get that's that embeddings are used for semantic search but would you really want to use a model as big as ChatGPT to compute the embeddings? (Given how cheap and effective Ada is)

Timdegreat t1_jangbi7 wrote on March 2, 2023 at 6:10 PM

You got a point there! I haven't given it too much thought really -- I def need to check out ada.

But wouldn't the ChatGPT embeddings still be better? Given that they're cheap, why not use the better option?

farmingvillein t1_japqcq1 wrote on March 3, 2023 at 3:58 AM

> But wouldn't the ChatGPT embeddings still be better? Given that they're cheap, why not use the better option?

Usually, to get the best embeddings, you need to train them somewhat differently than you do a "normal" LLM. So ChatGPT may not(?) be "best" right now, for that application.