Kindly help -Approach for this problem

meetu · April 22, 2019, 5:13pm

Hi,
For practice, I am trying to do this problem. So as per my understanding, my approach is -
1.dummy code the phase variable
2. get unique values of different stages group by process id …so I got 7 different stages here - 11111…11011…and so on
3. now I am grouping dataset based on these stages, meaning i will combine all rows into 1 dataset, that belong to 11111 phases i.e. all those process Ids that have gone through all 5phases are collected together in 1 dataset. and this way i have 7 different datasets
4. return flow values set to 0 if its negative
5. predict return_flow and then predict return_turbidity …and then calculate the final_turbidity by using the formula
is this approach correct? kindly guide.
TIA!

Topic		Replies	Views
Preprocessing question Sustainable Industry: Rinse Over Run	3	1059	January 16, 2019
Rows with phase=='final_rinse' and target_time_period==false Sustainable Industry: Rinse Over Run	4	936	January 14, 2019
Asked to predict into the future Sustainable Industry: Rinse Over Run	4	835	January 16, 2019
Calculating Target Variable - np.maximum & return_flow Sustainable Industry: Rinse Over Run	2	817	January 22, 2019
Data not starting with the Pre-rinse Sustainable Industry: Rinse Over Run	7	1101	February 2, 2019

Kindly help -Approach for this problem

Related topics