Votre recherche
Résultats 10 ressources
-
The Probabilistic Relevance Framework (PRF) is a formal framework for document retrieval, grounded in work done in the 1970–1980s, which led to the development of one of the most successful text-retrieval algorithms, BM25. In recent years, research in the PRF has yielded new retrieval models capable of taking into account document meta-data (especially structure and link-graph information). Again, this has led to one of the most successful Web-search and corporate-search algorithms, BM25F....
-
In this paper, we report ongoing efforts in a large scale research project to develop methods for profiling individual Web search engine users by leveraging data recorded in the transaction logs of search engines. Our research aim is to investigate how completely one can profile a Web searcher using log data. Taking a broad brush approach, we present an array of profiling attributes to illustrate the spectrum of user characteristics possible from log data. Specifically, we present ongoing...
-
This report summarises a workshop organised as a part of the EU-funded TrebleCLEF project entitled "Query Log Analysis: From Research to Best Practice" held on 27-28th May 2009 at the British Computer Science Offices in London, UK. The event involved 12 invited speakers from various academic and commercial institutions from around the world who are all involved, in some way, with query log analysis. A number of other people attended the event including local businesses and academic...
-
Query reformulation is a key user behavior during Web search. Our research goal is to develop predictive models of query reformulation during Web searching. This article reports results from a study in which we automatically classified the query-reformulation patterns for 964,780 Web searching sessions, composed of 1,523,072 queries, to predict the next query reformulation. We employed an n-gram modeling approach to describe the probability of users transitioning from one query-reformulation...
-
With the emergence of Internet various institutions started making available important legal materials in centralized online databases. Depending on the previous classification of data, available resources, degree of disclosure, each organization adopts its own way to present materials online. Oftentimes institutions providing similar data organize it in different ways (different titles, categories, search criteria, search engines, websites etc.) Additionally, some may do it differently due...
Explorer
Revue de littérature
Méthodologie
- Analyse de logs (3)
- Méthodes de recherche (2)