Login

Login
Welcome:
Guest
Bannner:Try our mobile site beta
 
Journal search
Journal cover: International Journal of Web Information Systems

International Journal of Web Information Systems

ISSN: 1744-0084

Online from: 2005

Subject Area: Information and Knowledge Management

Content: Latest Issue | icon: RSS Latest Issue RSS | Previous Issues

Options: To add Favourites and Table of Contents Alerts please take a Emerald profile

Previous article.Icon: Print.Table of Contents.Next article.Icon: .

Using web search logs to identify query classification terms


Document Information:
Title:Using web search logs to identify query classification terms
Author(s):Isak Taksa, (Baruch College, City University of New York, New York, USA), Sarah Zelikovitz, (College of Staten Island, City University of New York, Staten Island, New York, USA), Amanda Spink, (Faculty of Information Technology, Queensland University of Technology, Brisbane, Australia)
Citation:Isak Taksa, Sarah Zelikovitz, Amanda Spink, (2007) "Using web search logs to identify query classification terms", International Journal of Web Information Systems, Vol. 3 Iss: 4, pp.315 - 327
Keywords:Classification schemes, Computer networks, Information retrieval, Man-machine systems, User interfaces
Article type:Research paper
DOI:10.1108/17440080710848107 (Permanent URL)
Publisher:Emerald Group Publishing Limited
Abstract:

Purpose – The work presented in this paper aims to provide an approach to classifying web logs by personal properties of users.

Design/methodology/approach – The authors describe an iterative system that begins with a small set of manually labeled terms, which are used to label queries from the log. A set of background knowledge related to these labeled queries is acquired by combining web search results on these queries. This background set is used to obtain many terms that are related to the classification task. The system then ranks each of the related terms, choosing those that most fit the personal properties of the users. These terms are then used to begin the next iteration.

Findings – The authors identify the difficulties of classifying web logs, by approaching this problem from a machine learning perspective. By applying the approach developed, the authors are able to show that many queries in a large query log can be classified.

Research limitations/implications – Testing results in this type of classification work is difficult, as the true personal properties of web users are unknown. Evaluation of the classification results in terms of the comparison of classified queries to well known age-related sites is a direction that is currently being exploring.

Practical implications – This research is background work that can be incorporated in search engines or other web-based applications, to help marketing companies and advertisers.

Originality/value – This research enhances the current state of knowledge in short-text classification and query log learning.



Fulltext Options:

Login

Login

Existing customers: login
to access this document

Login


- Forgot password?

- Athens/Institutional login

Purchase

Purchase

Downloadable; Printable; Owned
HTML, PDF (97kb)Purchase

To purchase this item please login or register.

Login


- Forgot password?

Order

Fill in an Order form to request this document from your librarian


Marked list

Bookmark & share

Reprints & permissions

© Emerald Group Publishing Limited  |  Copyright info  |  Site Policies
.