Active Feature-Value Acquisition for Classifier Induction

Prem Melville
Raymond Mooney
Foster Provost
Maytal Saar-Tsechansky

Venue: Fourth IEEE International Conference on Data Mining (ICDM2004)
2004
Type: Selected Conference Paper

Many induction problems include missing data that can be acquired at a cost. For building accurate predictive models, acquiring complete information for all instances is often expensive or unnecessary, while acquiring information for a random subset of instances may not be most effective. Active feature-value acquisition tries to reduce the cost of achieving a desired model accuracy by identifying instances for which obtaining complete information is most informative. We present an approach in which instances are selected for acquisition based on the current model’s accuracy and its confidence in the prediction. Experimental results demonstrate that our approach can induce accurate models using substantially fewer feature-value acquisitions as compared to alternative policies.

Active Feature-Value Acquisition for Classifier Induction

Related Files:

Foster Provost