Cover Image for System.Linq.Enumerable+EnumerablePartition`1[System.Char]

Multi-Class Classification of Agricultural Data Based on Random Forest and Feature Selection

OAI: oai:igi-global.com:298618 DOI: 10.4018/JITR.298618
Published by: IGI Global

Abstract

Agricultural production and operation produce a large amount of data, which hides valuable knowledge. Data mining technology can effectively explore the connection between various factors from the massive agricultural data. Classification prediction is one of the most valuable agricultural data mining techniques. This paper presents a new algorithm consisting of machine learning algorithms, feature ranking method and instance filter, which aims to enhance the capability of the random forest algorithm and better solve the problem of agricultural multi-class classification. The performance of the new algorithm was tested by using four standard agricultural multi-class datasets, and the experimental results showed that the newly proposed method performed well on all datasets. Among them, substantial rise in classification accuracy is observed for Eucalyptus dataset. Applying random forest algorithm on Eucalyptus dataset results in classification accuracy as 53.4% and after applying the new algorithm (rough set) the classification accuracy significantly increases to 83.7%.