Hope you are all doing well! I need a suggestion on hardware for running ML algorithms on such a huge dataset. I have tried tree-based algorithms on machines with 8GB and 12GB of RAM, and I still get a memory error every time I run the code.
Could anyone please advise on the hardware configuration required, or on whether there is a cloud platform where we can access higher-spec hardware?
Google Colab is really good, and it’s free. The free tier only has 12GB of RAM, but if you pay $10 a month you get access to higher-memory (24GB) VMs too. I’ve been using the free tier (0.8784 on the leaderboard atm) and found 12GB of RAM is enough, although I did run out of memory at times, as you have. I’d focus on working out whether there are any hyperparameters you can change to reduce how much memory the algorithm requires. If not, then think about using a different algorithm.
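If it helps, here is a rough back-of-the-envelope sketch for checking whether the data and the model are likely to fit in 12GB before you run anything. The row and column counts, the number of trees, and the helper names `dataset_gib` and `max_tree_nodes` are all made up for illustration and not taken from this competition. It only covers the raw dense array and an upper bound on tree size, so real usage will be higher once you add library overhead and temporary copies.

```python
def dataset_gib(n_rows: int, n_cols: int, bytes_per_value: int) -> float:
    """Approximate size in GiB of a dense numeric table held in memory."""
    return n_rows * n_cols * bytes_per_value / 2**30


def max_tree_nodes(max_depth: int) -> int:
    """Upper bound on nodes in one binary tree whose depth is capped.

    Depth is counted in edges from the root, so max_depth=0 is a single node.
    """
    return 2 ** (max_depth + 1) - 1


# Hypothetical dataset shape, purely for illustration.
rows, cols = 10_000_000, 100

# Storing values as 32-bit floats instead of 64-bit halves the footprint.
print(f"float64: {dataset_gib(rows, cols, 8):.2f} GiB")
print(f"float32: {dataset_gib(rows, cols, 4):.2f} GiB")

# Capping depth bounds how large each tree can grow; fewer trees scale it down further.
n_trees = 500  # hypothetical ensemble size
print(f"at most {max_tree_nodes(10) * n_trees:,} nodes with max_depth=10")
print(f"at most {max_tree_nodes(20) * n_trees:,} nodes with max_depth=20")
```

The point is that the memory-related knobs are usually things like tree depth, number of trees, and the numeric type of the input, so those are worth trying before paying for a bigger VM.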