For those of you who use the R language to produce models, I have created an R script equivalent to produce the benchmark model.
All the text is copy-paste of the wonderful benchmark walkthrough provided by DrivenData, I did not add any feature engineering/insights/analysis of any kind.
The only things that’s changed is the code itself in order to match the R language.
Just go into this link to check it out
Oh… and good luck!
I am doing this in Python; is the original benchmark in Python? How can I find it? Thanks!
Original benchmark in Python is here:
And here’s the full ipynb file:
Thank you! I am currently researching time series analysis for this project, have you come across any good resources you would recommend?