GraciousReformer OP t1_j9jr4i4 wrote on February 22, 2023 at 1:53 PM

"for example on tabular data where discontinuities are common, DL performs worse than alternatives, even if with more data it would eventually approximate a discontinuity." True. Is there references on this issue?

yldedly t1_j9jr821 wrote on February 22, 2023 at 1:53 PM

This one is pretty good: https://arxiv.org/abs/2207.08815

GraciousReformer OP t1_j9jrhjd wrote on February 22, 2023 at 1:55 PM

This is a great point. Thank you. So do you mean that DL works for language models only when it gets a large amount of data?

GraciousReformer OP t1_j9k1srq wrote on February 22, 2023 at 3:37 PM

But then what is the difference from the result that NNs work better on ImageNet?

yldedly t1_j9k3orr wrote on February 22, 2023 at 3:52 PM

Not sure what you're asking. CNNs have inductive biases suited for images.

GraciousReformer OP t1_j9k4974 wrote on February 22, 2023 at 3:56 PM

So it works for images but not for tabular data?

yldedly t1_j9k5n8n wrote on February 22, 2023 at 4:06 PM

It depends a lot on what you mean by "works". You can get a low test error with NNs on tabular data if you have enough of it. For smaller datasets, you'll get a lower test error using tree ensembles. If you need low out-of-distribution error, neither will work.
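
To make the small-data point concrete, here is a rough toy sketch, not a benchmark. It assumes a single made-up feature with a jump at 0.5, and it uses a straight-line least-squares fit as a stand-in for "a smooth model". That stand-in is not a neural network, and a one-split decision stump is only the simplest building block of a tree ensemble. All names (`step`, `fit_stump`, `fit_line`) are hypothetical and written just for this illustration.

```python
# Toy illustration only: a piecewise-constant learner vs. a smooth (linear) fit
# on a target with a discontinuity. Pure standard library, no ML packages.

def step(x):
    # Hypothetical target: jumps from 0 to 1 at x = 0.5.
    return 1.0 if x > 0.5 else 0.0

def fit_stump(xs, ys):
    """One-split regression stump: choose the threshold with the lowest
    squared error and predict the mean target on each side."""
    pairs = sorted(zip(xs, ys))
    best = None
    for i in range(1, len(pairs)):
        t = (pairs[i - 1][0] + pairs[i][0]) / 2  # candidate threshold
        left = [y for x, y in pairs if x <= t]
        right = [y for x, y in pairs if x > t]
        ml = sum(left) / len(left)
        mr = sum(right) / len(right)
        err = sum((y - ml) ** 2 for y in left) + sum((y - mr) ** 2 for y in right)
        if best is None or err < best[0]:
            best = (err, t, ml, mr)
    _, t, ml, mr = best
    return lambda x: ml if x <= t else mr

def fit_line(xs, ys):
    """Ordinary least-squares line, standing in for a smooth model."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return lambda x: my + slope * (x - mx)

def mse(model, xs, ys):
    return sum((model(x) - y) ** 2 for x, y in zip(xs, ys)) / len(xs)

# Small training set: 20 evenly spaced feature values (an assumption).
train_x = [(i + 0.5) / 20 for i in range(20)]
train_y = [step(x) for x in train_x]

# Dense test grid over the same range (in-distribution only).
test_x = [i / 1000 for i in range(1001)]
test_y = [step(x) for x in test_x]

stump_mse = mse(fit_stump(train_x, train_y), test_x, test_y)
line_mse = mse(fit_line(train_x, train_y), test_x, test_y)
print(f"stump test MSE: {stump_mse:.4f}")
print(f"line  test MSE: {line_mse:.4f}")
```

The stump can place its split right at the jump, so 20 points are enough for it, while the line has to smear the jump across the whole range. This says nothing about out-of-distribution inputs: outside [0, 1] the stump just repeats its edge value and the line keeps extrapolating.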

[deleted] t1_j9jt2vp wrote on February 22, 2023 at 2:07 PM

>the bias of DL towards low-dimensional smooth manifolds

What is this? Got all the rest but that

yldedly t1_j9jtuzy wrote on February 22, 2023 at 2:13 PM

I'll link you to an old comment: https://www.reddit.com/r/MachineLearning/comments/z12zxj/comment/ix9t149/?utm_source=share&utm_medium=web2x&context=3

[D] "Deep learning is the only thing that currently works at scale"

yldedly t1_j9jpuky wrote on February 22, 2023 at 1:43 PM