Login

Login
Welcome:
Guest

Search for:


Browse:

Bannner: Aslib individual membership.
 
Journal search
Journal cover: International Journal of Web Information Systems

International Journal of Web Information Systems

ISSN: 1744-0084

Online from: 2005

Subject Area: Information and Knowledge Management

Content: Latest Issue | icon: RSS Latest Issue RSS | Previous Issues

Options: To add Favourites and Table of Contents Alerts please take a Emerald profile

Icon: .Table of Contents.Next article.Icon: .

Mining and visualising information from RSS feeds: a case study


Document Information:
Title:Mining and visualising information from RSS feeds: a case study
Author(s):Martin O'Shea, (Department of Computer Science and Information Systems, Birkbeck, University of London, London, UK), Mark Levene, (Department of Computer Science and Information Systems, Birkbeck, University of London, London, UK)
Citation:Martin O'Shea, Mark Levene, (2011) "Mining and visualising information from RSS feeds: a case study", International Journal of Web Information Systems, Vol. 7 Iss: 2, pp.105 - 129
Keywords:Cluster analysis, Data analysis, Data mining, Data visualisation, Experiment, Extensible markup language, RSS feeds, Social data
Article type:Research paper
DOI:10.1108/17440081111141763 (Permanent URL)
Publisher:Emerald Group Publishing Limited
Abstract:

Purpose – Recent years have seen “really simple syndication” or “rich site summary”(RSS) syndication of frequently updated content become ubiquitous across the internet. RSS's XML-based format allows these data to be stored in a semi-structured format but, despite the presence of online aggregators and readers, and the related work in clustering feeds and mining subjects by keywords, much potentially useful information present in RSS may remain undiscovered. This paper aims to address this issue in an experimental setting.

Design/methodology/approach – This paper presents two distinct technologies which employ the semi-structured nature of RSS content to allow users to mine information directly from raw RSS feeds: occurrence mining counts occurrences of text strings in feeds, whilst value mining mines structured ticker tape numeric data. It describes both technologies and their implementation in an experiment, where 35 students mined small numbers of RSS feeds and visualised the data mined.

Findings – This paper analyses the results of the experiment and cites examples of data mined and visualisations produced. The subject matter of data mined is also explored and potential applications of the technologies are considered.

Research limitations/implications – The mining technologies proposed in this paper have been developed to mine textual and numeric data directly from feeds, but can be extended to mine other data types present in RSS and to include other variants like Atom.

Originality/value – These technologies are seen to be applicable to data mining, the role of data and visualisations in social data analysis, issue tracking in news mining and time series analysis.



Fulltext Options:

Login

Login

Existing customers: login
to access this document

Login


- Forgot password?

- Athens/Institutional login

Purchase

Purchase

Downloadable; Printable; Owned
HTML, PDF (334kb)Purchase

To purchase this item please login or register.

Login


- Forgot password?

Recommend to your librarian

Complete and print this form to request this document from your librarian


Marked list

Bookmark & share

Reprints & permissions

© Emerald Group Publishing Limited  |  Copyright information  |  Site policies  |  Cookie information
.