Cover Image for System.Linq.Enumerable+EnumerablePartition`1[System.Char]

English Article Style Recognition and Matching by Using Web Semantics

OAI: oai:igi-global.com:293751 DOI: 10.4018/IJMCMC.293751
Published by: IGI Global

Abstract

With the explosion of internet information, people feel helpless and difficult to choose in the face of massive information. However, the traditional method to organize a huge set of original documents is not only time-consuming and laborious, but also not ideal. The automatic text classification can liberate users from the tedious document processing work, recognize and distinguish different document contents more conveniently, make a large number of complicated documents institutionalized and systematized, and greatly improve the utilization rate of information. This paper adopts termed-based model to extract the features in web semantics to represent document. The extracted web semantics features are used to learn a reduced support vector machine. The experimental results show that the proposed method can correctly identify most of the writing styles.