xtof54 t1_jd467f3 wrote on March 21, 2023 at 6:33 PM

Reply to [D] Running an LLM on "low" compute power machines? by Qwillbehr

There are several. either collaboratively (look at together.computer hivemind petals) or on single no gpu machine with pipeline parallelism, but it requires reimplementing for every model, see e.g slowLLM on github for bloom176b

xtof54 t1_j28kaju wrote on December 30, 2022 at 12:39 PM

Reply to [D] What do you want from a PDF viewer designed for reading research papers? by highergraphic

The best for me would be an opensource mobile-compliant PDF reader that nicely reflows the text of scientific PDFs for small screens... But I'm aware this is very challenging ;-)

Thank you anyway for opensourcing!

xtof54 t1_iyq2oxi wrote on December 3, 2022 at 7:05 AM

Reply to [D] In an optimal world, how would you wish variance between runs based on different random seeds was reported in papers? by optimized-adam

good question but it depends on whether this source of randomness occurs between both models been compared at test time. or in other words what kind of generalization you want to support.

this contrasts with variability due to sampling data because we all assume data are iid, and so a confidence interval is usually computed.

one way is to fix the seed, compare the models with same seed, report significance for data sampling, and restart, and globally report proportion of significance across seeds.

but we shouldn't pay too much attention to stat significance, too many use it as a 'flag of truth', while all experiments are biased anyway, so better to always be suspicious and build confidence over time

xtof54 t1_iy7kgtw wrote on November 29, 2022 at 9:41 AM

Reply to Neural Networks are just a bunch of Decision Trees by Difficult-Race-1188

Researchers know that, but it does not help in any way to better understand DNN. A bunch of DT is not more explainable than a DNN