iidealized
iidealized t1_j9kws3r wrote
Reply to [P] MIT Introduction to Data-Centric AI by anishathalye
Cool to see these topics being taught. Definitely agree these are important concepts that most ML classes skip for some reason
iidealized t1_j9kwe6h wrote
Are adversarial examples (eg minimally perturbed versions of images) considered violation of copyright? Or are they a sufficient “remix”?
iidealized t1_j655guq wrote
Paper that seems relevant:
iidealized t1_j45abq3 wrote
Reply to [D] Has ML become synonymous with AI? by Valachio
There are still many search-based advances/breakthroughs coming out that utilize ML but GOFAI as well, eg. Cicero AI for Diplomacy
iidealized t1_iyryrc5 wrote
Reply to [Discussion] - "data sourcing will be more important than model building in the era of foundational model fine-tuning" by fourcornerclub
I think we’ll need way more tools for quality data curation along the lines of:
https://github.com/snorkel-team/snorkel
https://github.com/cleanlab/cleanlab
iidealized t1_jabkowf wrote
Reply to comment by cthorrez in [D] Best Way to Measure LLM Uncertainty? by _atswi_
I’ve heard you can even ask the LLM: what fraction of your uncertainty is aleatoric vs epistemic, and how would the uncertainty estimates changed if you used bootstrap vs MC dropout :)