The Probabilistic Relevance Framework: BM25 and Beyond
Type de ressource
Auteurs/contributeurs
- Robertson, Stephen (Auteur)
- Zaragoza, Hugo (Auteur)
Titre
The Probabilistic Relevance Framework: BM25 and Beyond
Résumé
The Probabilistic Relevance Framework (PRF) is a formal framework for document retrieval, grounded in work done in the 1970–1980s, which led to the development of one of the most successful text-retrieval algorithms, BM25. In recent years, research in the PRF has yielded new retrieval models capable of taking into account document meta-data (especially structure and link-graph information). Again, this has led to one of the most successful Web-search and corporate-search algorithms, BM25F. This work presents the PRF from a conceptual point of view, describing the probabilistic modelling assumptions behind the framework and the different ranking algorithms that result from its application: the binary independence model, relevance feedback models, BM25 and BM25F. It also discusses the relation between the PRF and other statistical models for IR, and covers some related topics, such as the use of non-textual features, and parameter optimisation for models with free parameters.
Publication
Foundations and Trends® in Information Retrieval
Volume
3
Numéro
4
Pages
333-389
Date
2009/12/17
Abrév. de revue
INR
Langue
English
ISSN
1554-0669, 1554-0677
Titre abrégé
The Probabilistic Relevance Framework
Consulté le
2019-01-18 15 h 09
Catalogue de bibl.
Référence
Robertson, S. et Zaragoza, H. (2009). The Probabilistic Relevance Framework: BM25 and Beyond. Foundations and Trends® in Information Retrieval, 3(4), 333‑389. https://doi.org/10.1561/1500000019
Recherches connexes
Lien vers cette notice