Submitted by DreamyPen t3_yrjjql in MachineLearning
Hi all,
I will be careful not the use the term "confidence" to keep the goal clear and not confuse with confidence interval or predictions.
I have two sources of data. One very reliable (experimental), and another source less so, but still carries useful information.
Is it possible to feed the entirety of the data to an algorithm while specifying a certain "trust" or "reliability" in the data source? The goal being putting more weight on the reliable source, while still picking up some hidden patterns from the second source?
ResponsibilityNo7189 t1_ivtxak2 wrote
You can maybe use the less reliable data in pretraining, then only use your trusted data for finetuning.