Bubble display of functionality by region: https://public.tableau.com/views/DataMiningtheWaterTableDrivenData_com/Bubblestatusquantity?:embed=y&:showTabs=y&:display_count=yes
Map with regions based on the geographical coordinates: https://public.tableau.com/views/DataMiningtheWaterTableDrivenData_com/MAPBasin?:embed=y&:showTabs=y&:display_count=yes
Click through the sheets. This is just scratching the surface.
Very cool... I haven't run my models yet, but does date_recorded correlate strongly with the result? That could be a serious methodological issue if it does...
Yes, there is a slight Kuznets-curve relationship between date_recorded and the percentage of functional/non functional pumps. It also makes intuitive sense: older pumps break more often. What doesn't make logical sense is that most of the oldest recorded pumps are still functional, but there are probably other variables besides date_recorded at play.
Correction: the x-axis is wrong, it should be construction_year: https://github.com/uioreanu/R-Scripts/blob/master/DrivenData%20-%20Pump%20it%20Up%20Data%20Mining%20the%20Water%20Table/construction_year_U-shaped.png
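For anyone who wants to check the shape themselves, here is a rough sketch of how the share of functional pumps per construction_year could be computed. It is not the script behind the plot linked above (that one lives in an R repo). It assumes each row has a `construction_year` field and a `status_group` field whose value is "functional" for working pumps, and that a year of 0 means "unknown"; the function name and the toy rows are made up for illustration.

```python
from collections import defaultdict


def functional_share_by_year(rows, year_key="construction_year",
                             status_key="status_group"):
    """Return {year: fraction of pumps labelled 'functional'}, ordered by year.

    The column names and the 'functional' label are assumptions about the
    data layout; adjust them to match your own files.
    """
    totals = defaultdict(int)
    functional = defaultdict(int)
    for row in rows:
        year = int(row[year_key])
        if year == 0:
            # Assumption: 0 is a placeholder for a missing construction year.
            continue
        totals[year] += 1
        if row[status_key] == "functional":
            functional[year] += 1
    return {year: functional[year] / totals[year] for year in sorted(totals)}


# Made-up toy rows, only to show the call; not real competition data.
sample = [
    {"construction_year": 1975, "status_group": "functional"},
    {"construction_year": 1975, "status_group": "functional"},
    {"construction_year": 1990, "status_group": "non functional"},
    {"construction_year": 1990, "status_group": "functional"},
    {"construction_year": 2010, "status_group": "functional"},
    {"construction_year": 0, "status_group": "non functional"},
]

shares = functional_share_by_year(sample)
print(shares)
```

With real data, the rows could come from `csv.DictReader` once the feature columns and the labels are in one table. Plotting the resulting shares against the year is then enough to see whether the curve bends the way the chart suggests.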
Thanks for the reply, there is certainly something interesting there! I should have time to run some models within the next two weeks, and I would love some feedback once I post them.
Super!!! Thank you very much for the visualization