Automated document preprocessing for text categorization

This study deals with the document preprocessing step of Text Categorization System (TCS) done by previous researcher. Previously, the document used for TCS was entered manually by key-in the abstract of the document into database.So it will burden if it involves a large volume of articles.In this s...

全面介绍

Saved in:
书目详细资料
Main Authors: Abd Rahman, Suraya, Sainin, Mohd Shamrie
格式: Conference or Workshop Item
语言:English
出版: 2006
主题:
在线阅读:http://repo.uum.edu.my/9590/1/Sur.pdf
http://repo.uum.edu.my/9590/
标签: 添加标签
没有标签, 成为第一个标记此记录!
实物特征
总结:This study deals with the document preprocessing step of Text Categorization System (TCS) done by previous researcher. Previously, the document used for TCS was entered manually by key-in the abstract of the document into database.So it will burden if it involves a large volume of articles.In this study, the extraction based approach was applied in order to automate the document preprocessing. One module was added into the prototype of text categorization system that is used to add document into database.