DonutListen2Me t1_je7rr2p wrote on March 30, 2023 at 12:50 AM Reply to [D] The best way to train an LLM on company data by jaxolingo You don't want to input tabular data to a LLM. You should use a model specifically for tabular data, like XGBoost. What you want to do is embed your company's documents and use a vector search on the embeddings to feed as context to an LLM. Permalink 6
DonutListen2Me t1_je7rr2p wrote
Reply to [D] The best way to train an LLM on company data by jaxolingo
You don't want to input tabular data to a LLM. You should use a model specifically for tabular data, like XGBoost.
What you want to do is embed your company's documents and use a vector search on the embeddings to feed as context to an LLM.