Viewing a single comment thread. View all comments

ThisIsMyStonerAcount t1_iza1qzy wrote

What nonlinearity would solve the issue? The usual ones we use today certainly wouldn't. Are you thinking a 2nd order polynomial? I'm not sure that's a generally applicable function, with being non-monotonical and all?

(Or do you mean a hidden layer? If so: yeah, that's absolutely hindsight bias).

11

Blutorangensaft t1_iza9zrt wrote

I see. I meant the combination of a nonlinear activation function and another hidden layer. Was curious what people thought, thanks for your comment.

4

chaosmosis t1_izawfsz wrote

Non-monotonic activation functions can allow for single layers to solve xor, but they take forever to converge.

2