GraciousReformer
GraciousReformer OP t1_j9m1o15 wrote
Reply to comment by activatedgeek in [D] "Deep learning is the only thing that currently works at scale" by GraciousReformer
So it is like an ability to capture "correlations" that cannot be done with random forests.
GraciousReformer OP t1_j9lwe7i wrote
Reply to comment by activatedgeek in [D] "Deep learning is the only thing that currently works at scale" by GraciousReformer
Still, why is it that DL has better inductive biases than others?
GraciousReformer OP t1_j9ljjqu wrote
Reply to comment by activatedgeek in [D] "Deep learning is the only thing that currently works at scale" by GraciousReformer
>inductive biases
Then why does DL have inductive biases and others do not?
GraciousReformer OP t1_j9k4974 wrote
Reply to comment by yldedly in [D] "Deep learning is the only thing that currently works at scale" by GraciousReformer
So it works for images but not for tabular data?
GraciousReformer OP t1_j9k1srq wrote
Reply to comment by yldedly in [D] "Deep learning is the only thing that currently works at scale" by GraciousReformer
But then what is the difference from the result that NN works better for ImageNet?
GraciousReformer OP t1_j9jrhjd wrote
Reply to comment by yldedly in [D] "Deep learning is the only thing that currently works at scale" by GraciousReformer
This is a great point. Thank you. So do you mean that DL work for language models only when they get a large amount of data?
GraciousReformer OP t1_j9jr4i4 wrote
Reply to comment by yldedly in [D] "Deep learning is the only thing that currently works at scale" by GraciousReformer
"for example on tabular data where discontinuities are common, DL performs worse than alternatives, even if with more data it would eventually approximate a discontinuity." True. Is there references on this issue?
GraciousReformer OP t1_j9jpu94 wrote
Reply to comment by LowLook in [D] "Deep learning is the only thing that currently works at scale" by GraciousReformer
?
GraciousReformer OP t1_j9jppu7 wrote
Reply to comment by terminal_object in [D] "Deep learning is the only thing that currently works at scale" by GraciousReformer
"Artificial neural networks are often (demeneangly) called "glorified regressions". The main difference between ANNs and multiple / multivariate linear regression is of course, that the ANN models nonlinear relationships."
GraciousReformer OP t1_j9jib2n wrote
Reply to comment by Mefaso in [D] "Deep learning is the only thing that currently works at scale" by GraciousReformer
Then why DL?
GraciousReformer OP t1_j9ji7t1 wrote
Reply to comment by hpstring in [D] "Deep learning is the only thing that currently works at scale" by GraciousReformer
But why DL beats the curse? Why is DL the only class?
GraciousReformer OP t1_j9jh7zm wrote
Reply to comment by BoiElroy in [D] "Deep learning is the only thing that currently works at scale" by GraciousReformer
Yes a very finite grid size will approximate any digital image. But this is an approximation of an image in grids. How will it lead to approximation by NN?
GraciousReformer OP t1_j9jgdmc wrote
Reply to comment by VirtualHat in [D] "Deep learning is the only thing that currently works at scale" by GraciousReformer
But DL is not a linear model. Then what will be the limit of DL?
GraciousReformer OP t1_j9jg13p wrote
Reply to comment by 1bir in [D] "Deep learning is the only thing that currently works at scale" by GraciousReformer
Then why not use decision trees instead of DL?
GraciousReformer OP t1_j9jfxvh wrote
Reply to comment by yldedly in [D] "Deep learning is the only thing that currently works at scale" by GraciousReformer
Yes but is DL the unique mechanism? Why DL?
GraciousReformer OP t1_j9jfvfy wrote
Reply to comment by chief167 in [D] "Deep learning is the only thing that currently works at scale" by GraciousReformer
Then what will be the limitation of transformers?
GraciousReformer OP t1_j9j8bsl wrote
Reply to comment by VirtualHat in [D] "Deep learning is the only thing that currently works at scale" by GraciousReformer
Thank you. I understand the math. But I meant a real world example that "the solution is not in the model class."
GraciousReformer OP t1_j9j7iwm wrote
Reply to comment by VirtualHat in [D] "Deep learning is the only thing that currently works at scale" by GraciousReformer
>Large linear models tend not to scale well to large datasets if the solution is not in the model class
Will you provide me a reference?
Submitted by GraciousReformer t3_118pof6 in MachineLearning
GraciousReformer OP t1_j9oeeo3 wrote
Reply to comment by currentscurrents in [D] "Deep learning is the only thing that currently works at scale" by GraciousReformer
What are the situations that the bias for the hierarchy is not helpful?