zveroboy152
zveroboy152 OP t1_j41q48e wrote
Reply to comment by CrashTimeV in [R] AMD Instinct MI25 | Machine Learning Setup on the Cheap! by zveroboy152
I ordered one three days ago for $170. ;-) I hope to do some testing and post write-ups on it soon.
zveroboy152 OP t1_j41idsx wrote
Reply to comment by CrashTimeV in [R] AMD Instinct MI25 | Machine Learning Setup on the Cheap! by zveroboy152
I am not, but that sounds like a very cool thing to run on it. :-) (I'm a big fan of Craft Computing.)
zveroboy152 OP t1_j2v9jj8 wrote
Reply to comment by currentscurrents in [R] AMD Instinct MI25 | Machine Learning Setup on the Cheap! by zveroboy152
Agreed! 16GB of HBM memory is impressive for the price. :-)
zveroboy152 OP t1_j2v9h5g wrote
Reply to comment by SnooHesitations8849 in [R] AMD Instinct MI25 | Machine Learning Setup on the Cheap! by zveroboy152
The driver is a bit awkward to deal with, but it isn't terrible. Working inside a Docker container for GPU workloads isn't terrible either if you know your way around containerization.
But I do agree: NVIDIA's drivers are easier to deal with (see my K80 driver install experience: https://www.zb-c.tech/2022/12/11/how-to-install-drivers-on-ubuntu-for-the-nvidia-tesla-k80/ ).
zveroboy152 OP t1_j2uuiy1 wrote
Reply to comment by gradientpenalty in [R] AMD Instinct MI25 | Machine Learning Setup on the Cheap! by zveroboy152
Those are coming soon. I'm collecting a few sub-$100 GPUs and running them through a suite of benchmarks from PyTorch's repo:
https://github.com/pytorch/benchmark
I'll be sure to follow up and post some numbers. :-)
Submitted by zveroboy152 t3_102n6qp in MachineLearning
zveroboy152 OP t1_j20ult3 wrote
Reply to comment by yaosio in [R] PyTorch | Budget GPU Benchmarking by zveroboy152
You're right, I didn't include that data. I wasn't sure how to calculate it. I'll work on updating the article to include it.
I appreciate the constructive criticism; it really helps. :-)
zveroboy152 OP t1_j1xrwio wrote
Reply to comment by learn-deeply in [R] PyTorch | Budget GPU Benchmarking by zveroboy152
It sounds like I have a lot to learn about PyTorch, then. :-)
I'll rework the script to produce better numbers. If you have any pointers or ideas, I'd love to hear them.
zveroboy152 OP t1_j1xi8km wrote
Reply to comment by Tom_Neverwinter in [R] PyTorch | Budget GPU Benchmarking by zveroboy152
Hi Tom,
I ran into that problem too. I ended up getting one of these for my Teslas and AMD MI25s:
SUPERMICRO 1027GR-TRF
(not sponsored)
https://www.theserverstore.com/Supermicro-SuperServer-1027GR-TRFT-1U-GPU-Server
It's been great for my GPU workloads.
Submitted by zveroboy152 t3_zwtgqw in MachineLearning
zveroboy152 OP t1_j472zg8 wrote
Reply to comment by CrashTimeV in [R] AMD Instinct MI25 | Machine Learning Setup on the Cheap! by zveroboy152
That sounds like a pretty sick machine! I'll check out GPU Direct Storage and see if I can get it working. :-)