### Abstract

We investigate the problem of supervised feature selection within the filtering framework. In our approach, applicable to two-class problems, the strength of a feature is inversely proportional to the p-value of the null hypothesis that its class-conditional densities, p(X|Y = 0) and p(X|Y = 1), are identical. To estimate the p-values, we use Fisher's permutation test with four simple filtering criteria serving as test statistics: sample mean difference, symmetric Kullback-Leibler distance, information gain, and chi-square statistic. Experiments with a naive Bayes classifier and support vector machines strongly indicate that the permutation test improves these filters and can be used effectively when the sample size is relatively small and the number of features is relatively large.
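The idea in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names (`mean_difference`, `permutation_p_value`, `rank_features`), the number of permutations, and the add-one correction are assumptions made for illustration, and only the sample mean difference is shown out of the four test statistics. Each feature's p-value is estimated by repeatedly shuffling the class labels and counting how often the shuffled statistic is at least as large as the observed one; features are then ranked by ascending p-value.

```python
import numpy as np


def mean_difference(x, y):
    """Absolute difference between the two class-conditional sample means."""
    return abs(x[y == 1].mean() - x[y == 0].mean())


def permutation_p_value(x, y, statistic=mean_difference,
                        n_permutations=1000, rng=None):
    """Estimate the p-value of H0: p(X|Y=0) == p(X|Y=1) for one feature.

    x: 1-D array of feature values; y: 1-D array of 0/1 labels.
    n_permutations and the add-one correction are illustrative choices.
    """
    rng = np.random.default_rng(rng)
    observed = statistic(x, y)
    count = 0
    for _ in range(n_permutations):
        # Shuffling labels simulates the null hypothesis of identical densities.
        permuted = rng.permutation(y)
        if statistic(x, permuted) >= observed:
            count += 1
    # Add-one correction keeps the estimate strictly positive.
    return (count + 1) / (n_permutations + 1)


def rank_features(X, y, **kwargs):
    """Return feature indices sorted by ascending p-value, and the p-values."""
    p_values = np.array([
        permutation_p_value(X[:, j], y, **kwargs) for j in range(X.shape[1])
    ])
    order = np.argsort(p_values, kind="stable")
    return order, p_values


if __name__ == "__main__":
    # Synthetic data (an assumption for the demo): feature 0 is shifted
    # between classes, the remaining features are pure noise.
    rng = np.random.default_rng(0)
    y = np.repeat([0, 1], 30)
    X = rng.normal(size=(60, 5))
    X[y == 1, 0] += 3.0
    order, p_values = rank_features(X, y, n_permutations=1000, rng=1)
    print("ranking:", order)
    print("p-values:", np.round(p_values, 4))
```

Any of the other criteria named in the abstract could be swapped in through the `statistic` argument, provided larger values indicate a stronger difference between the classes.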

Original language | English |
---|---|
Title of host publication | Lecture Notes in Artificial Intelligence (Subseries of Lecture Notes in Computer Science) |
Editors | J.-F. Boulicaut, F. Esposito, D. Pedreschi, F. Giannotti |
Pages | 334-346 |
Number of pages | 13 |
Volume | 3201 |
State | Published - 2004 |
Event | 15th European Conference on Machine Learning, ECML 2004 - Pisa, Italy. Duration: Sep 20 2004 → Sep 24 2004 |

### Other

Other | 15th European Conference on Machine Learning, ECML 2004 |
---|---|
Country | Italy |
City | Pisa |
Period | 9/20/04 → 9/24/04 |

### ASJC Scopus subject areas

- Hardware and Architecture
- Computer Science (all)
- Theoretical Computer Science

### Cite this

*Lecture Notes in Artificial Intelligence (Subseries of Lecture Notes in Computer Science)* (Vol. 3201, pp. 334-346)

**Feature selection filters based on the permutation test.** / Radivojac, Predrag; Obradovic, Zoran; Dunker, A.; Vucetic, Slobodan.

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

*Lecture Notes in Artificial Intelligence (Subseries of Lecture Notes in Computer Science)*, vol. 3201, pp. 334-346, 15th European Conference on Machine Learning, ECML 2004, Pisa, Italy, 9/20/04.

TY - GEN

T1 - Feature selection filters based on the permutation test

AU - Radivojac, Predrag

AU - Obradovic, Zoran

AU - Dunker, A.

AU - Vucetic, Slobodan

PY - 2004

Y1 - 2004

UR - http://www.scopus.com/inward/record.url?scp=22944454584&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=22944454584&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:22944454584

VL - 3201

SP - 334

EP - 346

BT - Lecture Notes in Artificial Intelligence (Subseries of Lecture Notes in Computer Science)

A2 - Boulicaut, J.-F.

A2 - Esposito, F.

A2 - Pedreschi, D.

A2 - Giannotti, F.

ER -