SulszBachFramed
SulszBachFramed t1_j6wa7ii wrote
Reply to comment by znihilist in [R] Extracting Training Data from Diffusion Models by pm_me_your_pay_slips
You can make the same argument about lossy compression. Am I really infringing on copyright if I record an episode of House, re-encode it and redistribute it? It's not the 'original' episode, but a lossy copy of it. What if I compress it in a zip file and distribute that? In that case, I am only sharing something that can imperfectly recreate the original. The zip file itself does not resemble a video at all.
SulszBachFramed t1_j0g22xz wrote
Reply to comment by farmingvillein in [P] Medical question-answering without hallucinating by tmblweeds
> (...) a new state of the art performance of 50.3% accuracy on the MedQA biomedical question answering task.
Oof, the fact that the accuracy is only 50% does not inspire confidence.
SulszBachFramed t1_j6wp97b wrote
Reply to comment by Ronny_Jotten in [R] Extracting Training Data from Diffusion Models by pm_me_your_pay_slips
Right, hence why its relevant to large models trained on huge datasets. If the model can reconstruct data such that it is substantially similar to the original, then we have a problem. Whether from the viewpoint of copyright infringement or privacy law (gdpr).