_Arsenie_Boca_ t1_jdbsl4b wrote on March 23, 2023 at 7:24 AM

#2,314,580

What are the costs for all the services? I assume GPT-4 is billed per request and Pinecone per hour?

Icko_ t1_jdc09e5 wrote on March 23, 2023 at 9:20 AM

#2,314,986

Replying to _Arsenie_Boca_ (#2,314,580)

You could use faiss instead of pinecone and alpaca instead of gpt-4

_Arsenie_Boca_ t1_jdc0ko2 wrote on March 23, 2023 at 9:24 AM

#2,314,997

Replying to Icko_ (#2,314,986)

True, but im not sure how much cheaper that would really be.

Individual-Road-5784 t1_jdc0z0j wrote on March 23, 2023 at 9:30 AM

#2,315,022

Replying to _Arsenie_Boca_ (#2,314,997)

Instead of FAISS, you can also use a truly vector search database like Qdrant. It's open-source and also offers a generous free tier offering in the cloud https://qdrant.tech

Different_Prune_3529 t1_jdccrmr wrote on March 23, 2023 at 11:52 AM

#2,315,895

Can it have good performance as openai’s GPT?

dancingnightly t1_jdcnhuh wrote on March 23, 2023 at 1:24 PM

#2,317,019

Will you add semantic chunking?

localhost80 t1_jdcrfd0 wrote on March 23, 2023 at 1:53 PM

#2,317,504

Replying to _Arsenie_Boca_ (#2,314,580)

GPT charges per token so it depends on the length of the document

localhost80 t1_jdct42q wrote on March 23, 2023 at 2:05 PM

#2,317,736

Replying to Different_Prune_3529 (#2,315,895)

It will have better performance relative to the knowledge in the documents. It's the comparison of GPT-4 with global knowledge vs GPT-4 with local knowledge.

Smallpaul t1_jdd8q6m wrote on March 23, 2023 at 3:48 PM

#2,319,401

Replying to Different_Prune_3529 (#2,315,895)

It *is* OpenAI's GPT. Through an API.

edthewellendowed t1_jddoq57 wrote on March 23, 2023 at 5:29 PM

#2,320,950

Replying to Icko_ (#2,314,986)

Can you give me a little bit more info on this ? I'm interested but also very slow

Icko_ t1_jdecnjx wrote on March 23, 2023 at 8:00 PM

#2,323,344

Replying to edthewellendowed (#2,320,950)

Sure:

Suppose you had 1 million embeddings of sentences, and one vector you want the closest sentence to. If the vectors were a single number, you could just do a binary search, and you'd be done. If they are higher dimensionality, it's a lot more involved. Pinecone is a paid product doing this. Faiss is a library by facebook, which is very good too, but is free.
Recently, Facebook released the LLama models. They are large language models. ChatGPT is also a LLM, but after pretraining on a text corpus, you train it with human instructions, which is costly and time-consuming. Stanford took the LLama models, and trained them with ChatGPT. The result is pretty good not AS good, but pretty good. They called it "Alpaca".

edthewellendowed t1_jdewxml wrote on March 23, 2023 at 10:10 PM

#2,325,476

Replying to Icko_ (#2,323,344)

So If I had a pdf, I could use faiss to make am it into an embedding, and then llama / alpaca to use the pdf as a base for a chatbot ?

saintshing t1_jdgwgt7 wrote on March 24, 2023 at 9:15 AM

#2,332,219

Replying to Icko_ (#2,323,344)

I heard of people talking about using ANNOY for approximate nearest neighbor search. How is ANNOY compared to pinecone and faiss? Are pinecone and faiss self-hostable?

Icko_ t1_jdh2pja wrote on March 24, 2023 at 10:39 AM

#2,332,723

Replying to saintshing (#2,332,219)

Idk, I've never heard of it.

Always1Max t1_jdkn18v wrote on March 25, 2023 at 2:03 AM

#2,346,964

could there be something like this, but for code?

fletchertyler914 t1_je9o94n wrote on March 30, 2023 at 1:00 PM

#2,482,423

I just found this a few days ago and actually used it as a prototype base to learn the ropes, so thanks op! I ended up gutting the ingest flow in favor of an additional upload api route to make it more flexible, but overall it was a good example/guide to follow. Nice work.

[P] Open-source GPT4 & LangChain Chatbot for large PDF docs

Comments