Analysis of large data logs: an application of Poisson sampling on excite web queries

dc.contributor.buuauthorÖzmutlu, H. Cenk
dc.contributor.buuauthorSpink, A.
dc.contributor.buuauthorÖzmutlu, Seda
dc.contributor.departmentUludağ Üniversitesi/Mühendislik Fakültesi.tr_TR
dc.contributor.researcheridAAH-4480-2021tr_TR
dc.contributor.researcheridABH-5209-2020tr_TR
dc.date.accessioned2021-07-06T08:57:29Z
dc.date.available2021-07-06T08:57:29Z
dc.date.issued2002-07
dc.description.abstractSearch engines are the gateway for users to retrieve information from the Web. There is a crucial need for tools that allow effective analysis of search engine queries to provide a greater understanding of Web users' information seeking behavior. The objective of the study is to develop an effective strategy for the selection of samples from large-scale data sets. Millions of queries are submitted to Web search engines daily, and new sampling techniques are required to bring these databases to a manageable size while preserving the statistically representative characteristics of the entire data set. This paper reports results from a study using data logs from the Excite Web search engine. We use Poisson sampling to develop a sampling strategy and show how sample sets selected by Poisson sampling effectively represent the statistical characteristics of the entire data set. In addition, this paper discusses the use of Poisson sampling in continuous monitoring of stochastic processes, such as Web site dynamics.en_US
dc.identifier.citationÖzmutlu, H. C. et al. (2002). "Analysis of large data logs: an application of Poisson sampling on excite web queries". Information Processing & Management, 38(4), 473-490.tr_TR
dc.identifier.endpage490tr_TR
dc.identifier.issn0306-4573
dc.identifier.issue4tr_TR
dc.identifier.scopus2-s2.0-0036643012tr_TR
dc.identifier.startpage473tr_TR
dc.identifier.urihttps://doi.org/10.1016/S0306-4573(01)00043-7
dc.identifier.urihttps://www.sciencedirect.com/science/article/pii/S0306457301000437
dc.identifier.urihttp://hdl.handle.net/11452/21118
dc.identifier.volume38tr_TR
dc.identifier.wos000175479100002tr_TR
dc.indexed.scopusScopusen_US
dc.indexed.wosSCIEen_US
dc.indexed.wosSSCIen_US
dc.language.isoenen_US
dc.publisherPergamon-Elsevier Scienceen_US
dc.relation.journalInformation Processing & Managementen_US
dc.relation.publicationcategoryArticle - International Refereed Journaltr_TR
dc.rightsinfo:eu-repo/semantics/closedAccessen_US
dc.subjectComputer scienceen_US
dc.subjectInformation science & library scienceen_US
dc.subjectPoisson samplingen_US
dc.subjectUsersen_US
dc.subjectLarge-scale in depth data analysisen_US
dc.subjectWeb user modelingen_US
dc.subjectSearch engine queriesen_US
dc.subjectData miningen_US
dc.subject.wosComputer scienceen_US
dc.subject.wosInformation systemsen_US
dc.subject.wosInformation science & library scienceen_US
dc.titleAnalysis of large data logs: an application of Poisson sampling on excite web queriesen_US
dc.typeArticle
dc.wos.quartileQ1tr_TRen_US
