How should features be selected for logistic regression?
Instructor solution
Often, features for logistic regression will be set by hand, with the implementer relying on linguistic intuition to select features that could be significant in determining a class. For instance, in the task of sentiment analysis, it seems logical to include count of positive lexicons as a feature and count of negative lexicons as a feature. Features can be adjusted when constructing a model, as an implementer could see features with weights close to 0 to potentially remove as noise and add or test different ideas for features.
Unsupervised learning could also be used to cluster examples in the training data and surface features.
Think you've got it?
What does a weight of
- A.
The feature may be irrelevant for determining class.
- B.
The feature does not exist in any training examples.
- C.
The feature may or may not have a strong positive or negative impact on determining the class.
- D.
Both A and B.
You may exit out of this review and return later without penalty.

CLOSE REVIEW