Automated document preprocessing for text categorization

This study deals with the document preprocessing step of Text Categorization System (TCS) done by previous researcher. Previously, the document used for TCS was entered manually by key-in the abstract of the document into database.So it will burden if it involves a large volume of articles.In this s...

Full description

Saved in:
Bibliographic Details
Main Authors: Abd Rahman, Suraya, Sainin, Mohd Shamrie
Format: Conference or Workshop Item
Language:English
Published: 2006
Subjects:
Online Access:http://repo.uum.edu.my/9590/1/Sur.pdf
http://repo.uum.edu.my/9590/
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:This study deals with the document preprocessing step of Text Categorization System (TCS) done by previous researcher. Previously, the document used for TCS was entered manually by key-in the abstract of the document into database.So it will burden if it involves a large volume of articles.In this study, the extraction based approach was applied in order to automate the document preprocessing. One module was added into the prototype of text categorization system that is used to add document into database.