Information Extraction from Heterogeneous WWW Resources

Sulong, Muhammad Suhaizan and Meziane, Farid (2004) Information Extraction from Heterogeneous WWW Resources. In: 7th International Conference on Work with Computing Systems (WWCS), 29 Jun - 2 Jul 2004, Kuala Lumpur, Malaysia.

wwcs2004.pdf - Published Version

Download (265kB)


The information available on the WWW is growing very fast. However, a fundamental problem with the information on the WWW is its lack of structure making its exploitation very difficult. As a result, the desired information is getting more difficult to retrieve and extract. To overcome this problem many tools and techniques are being developed and used for locating the web pages of interest and extracting the desired information from these pages. In this paper we present the first prototype of an Information Extraction (IE) system that attempts to extract information on different Computer Science related courses offered by British Universities.

Item Type: Conference or Workshop Item (Paper)
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Divisions: Faculty of Information and Communication Technology > Department of Software Engineeering
Depositing User: Hj. Muhammad Suhaizan Sulong
Date Deposited: 23 Mar 2012 14:39
Last Modified: 28 May 2015 02:25
Statistic Details: View Download Statistic

Actions (login required)

View Item View Item