Using SVMs for Classification of Cross-Document Relationships

Jaya Kumar, Yogan (2013) Using SVMs for Classification of Cross-Document Relationships. Pertanika Journal of Science & Technology, 21 (1). pp. 239-246. ISSN 0128-7680

Abstract

Cross-document Structure Theory (CST) has recently been proposed to facilitate tasks related to multi-document analysis. Identifying and classifying the CST relationships between sentences across topically related documents has since proven necessary. However, few studies in the literature have addressed the automatic identification of these CST relationships. In this study, a supervised machine learning technique, Support Vector Machines (SVMs), was applied to identify four types of CST relationships, namely “Identity”, “Overlap”, “Subsumption”, and “Description”, on datasets obtained from the CSTBank corpus. The performance of the SVM classification was measured using Precision, Recall, and F-measure. In addition, the results obtained using SVMs were compared with those reported in previous literature that used a boosting classification algorithm. It was found that SVMs yielded better results in classifying the four CST relationships.
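
As an illustration of the evaluation measures named in the abstract, the following minimal Python sketch computes per-class Precision, Recall, and F-measure for the four CST relationship labels. It is not the author's code: the function name `per_class_prf` and the toy gold and predicted labels are assumptions made up for this example, and the numbers it produces are not results from the paper.

```python
from collections import Counter

# The four CST relationship types named in the abstract.
LABELS = ["Identity", "Overlap", "Subsumption", "Description"]


def per_class_prf(gold, predicted, labels=LABELS):
    """Return {label: (precision, recall, f_measure)} for paired label lists.

    Hypothetical helper for illustration only.
    """
    if len(gold) != len(predicted):
        raise ValueError("gold and predicted must have the same length")

    tp, fp, fn = Counter(), Counter(), Counter()
    for g, p in zip(gold, predicted):
        if g == p:
            tp[g] += 1   # correct prediction for class g
        else:
            fp[p] += 1   # p was predicted but is wrong
            fn[g] += 1   # g was the true class but was missed

    scores = {}
    for label in labels:
        predicted_total = tp[label] + fp[label]
        gold_total = tp[label] + fn[label]
        precision = tp[label] / predicted_total if predicted_total else 0.0
        recall = tp[label] / gold_total if gold_total else 0.0
        denom = precision + recall
        f_measure = 2 * precision * recall / denom if denom else 0.0
        scores[label] = (precision, recall, f_measure)
    return scores


# Toy sentence-pair labels, invented for this example (not CSTBank data).
gold = ["Identity", "Overlap", "Overlap", "Subsumption", "Description", "Description"]
predicted = ["Identity", "Overlap", "Subsumption", "Subsumption", "Description", "Overlap"]

scores = per_class_prf(gold, predicted)
for label in LABELS:
    p, r, f = scores[label]
    print(f"{label:12s} P={p:.2f} R={r:.2f} F={f:.2f}")
```

The F-measure here is the balanced harmonic mean of Precision and Recall; whether the paper uses this weighting or another is not stated in the abstract.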

Item Type: Article
Subjects: T Technology > T Technology (General)
Divisions: Faculty of Information and Communication Technology > Department of Industrial Computing
Depositing User: YOGAN JAYA KUMAR
Date Deposited: 19 Apr 2013 16:49
Last Modified: 28 May 2015 03:44
URI: http://eprints.utem.edu.my/id/eprint/6707