Home

Tackling the Poor Assumptions of Naive Bayes Text Classifiers


Author(s) : David Karger Jason D. M. Rennie, 
Publisher : N/A
Publication Date : 2003
ISSN : N/A
Abstract : Naive Bayes is often used as a baseline in text classication because it is fast and easy to implement. Its severe assumptions make such eciency possible but also adversely affect the quality of its results. In this paper we propose simple, heuristic solutions to some of the problems with Naive Bayes classiers, addressing both systemic issues as well as problems that arise because text is not actually generated according to a multinomial model. We nd that our simple corrections result in a fast algorithm that is competitive with stateof-the-art text classication algorithms such as the Support Vector Machine.,