
Help required for feature engineering


#1

Can anyone help me engineer features and select the best ones, instead of applying a model to so many features?


#2

I think one really good way to do feature selection is to train a Random Forest or Extreme Gradient Boosting model on the data and then examine the feature importances from those models to pick out the most important features.

Here’s a link to scikit-learn’s implementation with Random Forest: http://scikit-learn.org/stable/auto_examples/ensemble/plot_forest_importances.html
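A minimal sketch of the idea, using synthetic data as a stand-in for the competition dataset (the number of features to keep, 5, is just an illustrative choice):

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier

# Synthetic stand-in for the real data: 20 features, only 5 informative.
X, y = make_classification(n_samples=500, n_features=20,
                           n_informative=5, random_state=0)

forest = RandomForestClassifier(n_estimators=100, random_state=0)
forest.fit(X, y)

# Rank features by importance (highest first) and keep, say, the top 5.
ranking = np.argsort(forest.feature_importances_)[::-1]
top_features = ranking[:5]
X_selected = X[:, top_features]
print(X_selected.shape)  # (500, 5)
```

You could then refit your actual model on `X_selected` only. scikit-learn also ships `SelectFromModel`, which automates this thresholding if you prefer not to pick the cutoff by hand.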


#3

Thanks RonL. That’s a great suggestion. What do you think about using the Two-Step cluster analysis available in SPSS?


#4

I’m not really familiar with that method, but another quite popular way to reduce dimensionality would be through Principal Component Analysis.

PCA has a pretty handy parameter where you can set how much of the variance you preserve when you project the data to the principal components. This helps you to strike a balance between the number of dimensions and the amount of information they retain.

Here’s a pretty good write-up on that method: https://towardsdatascience.com/pca-using-python-scikit-learn-e653f8989e60
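As a rough sketch of that variance parameter (using the digits dataset purely as example data, and 0.95 as an arbitrary variance threshold):

```python
from sklearn.datasets import load_digits
from sklearn.decomposition import PCA
from sklearn.preprocessing import StandardScaler

X, _ = load_digits(return_X_y=True)  # 64 features
X_scaled = StandardScaler().fit_transform(X)

# Passing a float in (0, 1) to n_components keeps the smallest number of
# principal components whose cumulative explained variance reaches that fraction.
pca = PCA(n_components=0.95)
X_reduced = pca.fit_transform(X_scaled)
print(X_reduced.shape[1])  # fewer than the original 64 dimensions
```

Note that PCA is sensitive to feature scales, hence the `StandardScaler` step before projecting.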


#5

https://machinelearningmastery.com/an-introduction-to-feature-selection/ - could be a useful starting point


#6

This thread has a possibly useful SPSS screenshot for feature selection: https://stats.stackexchange.com/questions/66478/correlation-and-categorical-variables