Im_Unlucky
Im_Unlucky t1_je6hn68 wrote
Reply to comment by LetGoAndBeReal in [D] The best way to train an LLM on company data by jaxolingo
This is the way ^
- shove data into vector db
- embed incoming query
- return topk matches from vector db
- inject those into prompt
- make api call to openai
- display response
- ???
- you are now a ML engineer
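
For anyone who wants to see those steps as code, here is a rough sketch, not a production setup. Everything in it is a stand-in I made up for illustration: `embed` is a toy bag-of-words counter instead of a real embedding model, `TinyVectorDB` is an in-memory list instead of a real vector db, and `llm` is any callable you pass in (that is where the actual API call to your model provider would go).

```python
import math
import re
from collections import Counter


def embed(text):
    """Toy embedding (assumption): word counts instead of a real embedding model."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


class TinyVectorDB:
    """Hypothetical in-memory stand-in for a vector db."""

    def __init__(self):
        self.rows = []  # list of (text, embedding) pairs

    def add(self, text):
        # "shove data into vector db"
        self.rows.append((text, embed(text)))

    def top_k(self, query, k=2):
        # "embed incoming query" and "return topk matches"
        q = embed(query)
        ranked = sorted(self.rows, key=lambda row: cosine(q, row[1]), reverse=True)
        return [text for text, _ in ranked[:k]]


def build_prompt(question, snippets):
    # "inject those into prompt"
    context = "\n".join(f"- {s}" for s in snippets)
    return (
        "Answer using only the context below.\n"
        f"Context:\n{context}\n\n"
        f"Question: {question}\nAnswer:"
    )


def answer(question, db, llm, k=2):
    # "make api call" is delegated to `llm`, a callable taking a prompt string
    # and returning the model's text; swap in your real client here.
    return llm(build_prompt(question, db.top_k(question, k)))
```

To try it without any API key, pass a fake `llm` such as `lambda prompt: prompt` and check that the retrieved snippets show up in the prompt you would have sent.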
Im_Unlucky t1_je6vjqd wrote
Reply to comment by visarga in [D] The best way to train an LLM on company data by jaxolingo
This is true. Good prompt engineering has definitely gone a long way for me in improving RAG performance, though. I don’t think shoving snippets into the prompt is the long-term solution, but at this point in time it’s the best option in my experience.