Yayın:
Automatic new topic identification using multiple linear regression

Placeholder

Tarih

Akademik Birimler

Kurum Yazarları

Özmutlu, Seda

Yazarlar

Danışman

Dil

Türü

Yayıncı:

Elsevier Science

Dergi Başlığı

Dergi ISSN

Cilt Başlığı

Özet

The purpose of this study is to provide automatic new topic identification of search engine query logs, and estimate the effect of statistical characteristics of search engine queries on new topic identification. By applying multiple linear regression and multi-factor ANOVA on a sample data log from the Excite search engine, we demonstrated that the statistical characteristics of Web search queries, such as time interval, search pattern and position of a query in a user session, are effective on shifting to a new topic. Multiple linear regression is also a successful tool for estimating topic shifts and continuations. The findings of this study provide statistical proof for the relationship between the non-semantic characteristics of Web search queries and the occurrence of topic shifts and continuations.

Açıklama

Kaynak:

Anahtar Kelimeler:

Konusu

Information science & library science, Information analysis, Topic identification, Information retrievals, Search engine, Regression analysis, Regression, Search engines, Information retrieval, Semantic, ANOVA, Multiple linear regression, FMSS, Topic identification, Minimizing mean flowtime, Web search queries, Life, Identification (control systems), Users, ReaL-time methodology, Information-seeking, Trends, Users, Automatic programming, Data reduction

Alıntı

Özmutlu, S. (2006). ''Automatic new topic identification using multiple linear regression''. Automatic new topic identification using multiple linear regression, 42(4), 934-950.

Endorsement

Review

Supplemented By

Referenced By

1

Views

0

Downloads

View PlumX Details