iidealized t1_jabkowf wrote on February 28, 2023 at 6:25 AM

Reply to comment by cthorrez in [D] Best Way to Measure LLM Uncertainty? by _atswi_

I’ve heard you can even ask the LLM: what fraction of your uncertainty is aleatoric vs epistemic, and how would the uncertainty estimates changed if you used bootstrap vs MC dropout :)

iidealized t1_j9kws3r wrote on February 22, 2023 at 6:53 PM

Reply to [P] MIT Introduction to Data-Centric AI by anishathalye

Cool to see these topics being taught. Definitely agree these are important concepts that most ML classes skip for some reason

iidealized t1_j9kwe6h wrote on February 22, 2023 at 6:51 PM

Reply to [R] Provable Copyright Protection for Generative Models by vyasnikhil96

Are adversarial examples (eg minimally perturbed versions of images) considered violation of copyright? Or are they a sufficient “remix”?

iidealized t1_j655guq wrote on January 27, 2023 at 7:58 PM

Reply to [D] Quantitative measure for smoothness of NLP autoencoder latent space by Blutorangensaft

Paper that seems relevant:

https://arxiv.org/abs/1905.12777

iidealized t1_j45abq3 wrote on January 13, 2023 at 6:47 AM

Reply to [D] Has ML become synonymous with AI? by Valachio

There are still many search-based advances/breakthroughs coming out that utilize ML but GOFAI as well, eg. Cicero AI for Diplomacy

iidealized t1_iyryrc5 wrote on December 3, 2022 at 6:24 PM

Reply to [Discussion] - "data sourcing will be more important than model building in the era of foundational model fine-tuning" by fourcornerclub

I think we’ll need way more tools for quality data curation along the lines of:

https://github.com/snorkel-team/snorkel

https://github.com/cleanlab/cleanlab

https://github.com/ydataai/pandas-profiling

https://github.com/OpenRefine/OpenRefine