pommedeterresautee t1_iyza0g2 wrote

I know very little about CPUs, but I'm wondering why you think more cache would help?

Intuitively, I would expect that to be the case if training were memory-bandwidth limited most of the time, but the issue with CPUs (vs. GPUs) is that during training the model is compute-bound.
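
A rough way to sanity-check the compute-bound vs. memory-bound intuition is a roofline-style estimate for a single matrix multiply. This is only a sketch: the function names are made up for illustration, the peak FLOP/s and bandwidth figures are hypothetical rather than measurements of any real CPU, and the byte count assumes the ideal case where each matrix is read or written exactly once.

```python
def matmul_arithmetic_intensity(m, n, k, bytes_per_elem=4):
    """FLOPs per byte for C[m,n] = A[m,k] @ B[k,n], assuming each
    matrix crosses the memory bus exactly once (ideal reuse)."""
    flops = 2 * m * n * k  # one multiply and one add per inner-product term
    bytes_moved = bytes_per_elem * (m * k + k * n + m * n)
    return flops / bytes_moved


def is_compute_bound(intensity, peak_flops, peak_bandwidth):
    """Compute-bound if intensity exceeds the machine's ridge point
    (peak FLOP/s divided by peak bytes/s)."""
    return intensity > peak_flops / peak_bandwidth


# Hypothetical machine: 1 TFLOP/s peak compute, 50 GB/s memory bandwidth.
PEAK_FLOPS = 1e12
PEAK_BANDWIDTH = 50e9

big = matmul_arithmetic_intensity(1024, 1024, 1024)  # large square matmul
thin = matmul_arithmetic_intensity(1, 1024, 1024)    # vector-matrix product

print(f"square matmul: {big:.1f} FLOP/byte, "
      f"compute-bound: {is_compute_bound(big, PEAK_FLOPS, PEAK_BANDWIDTH)}")
print(f"vector-matrix: {thin:.2f} FLOP/byte, "
      f"compute-bound: {is_compute_bound(thin, PEAK_FLOPS, PEAK_BANDWIDTH)}")
```

Under these assumed numbers, the large matmul sits well above the ridge point and the vector-matrix product sits well below it, so whether extra cache matters depends a lot on the shapes and batch sizes involved.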

7

PresentGrapefruit451 OP t1_iyzemhl wrote

I thought so because more L3 cache can hold more CPU instructions, and fast CPU processing might help during the preprocessing and data loading stages. I'm not sure the improvement would be significant, though.

2