Publications

Neural Embedding-based Metrics for Pre-Retrieval Query Performance Prediction

Negar Arabzadeh and Fattane Zarrinkalam and Jelena Jovanovic and Ebrahim Bagheri
Reference:
Negar Arabzadeh; Fattane Zarrinkalam; Jelena Jovanovic and Ebrahim Bagheri Neural Embedding-based Metrics for Pre-Retrieval Query Performance Prediction. In 42nd European Conference on IR Research (ECIR 2020), 2020.
Links to Publication:
Abstract:
Query Performance Prediction (QPP) is concerned with estimating the effectiveness of a query within the context of a retrieval model. It allows for operations such as query routing and segmentation, leading to improved retrieval performance. Pre-retrieval QPP methods are oblivious to the performance of the retrieval model as they predict query difficulty prior to observing the set of documents retrieved for the query. Since neural embedding-based models are showing wider adoption in the information retrieval community, in this paper, we propose a set of pre-retrieval QPP metrics based on the properties of pre-trained neural embeddings and show that such metrics are more effective for performance prediction compared to the widely known QPP metrics such as SCQ, PMI and SCS. We report our findings based on Robust04, ClueWeb09 and Gov2 corpora and their associated TREC topics.
Bibtex Entry:
@inproceedings{ecir2020c, author = {Negar Arabzadeh and Fattane Zarrinkalam and Jelena Jovanovic and Ebrahim Bagheri}, title = {Neural Embedding-based Metrics for Pre-Retrieval Query Performance Prediction}, booktitle = {42nd European Conference on IR Research (ECIR 2020)}, year = {2020}, abstract = {Query Performance Prediction (QPP) is concerned with estimating the effectiveness of a query within the context of a retrieval model. It allows for operations such as query routing and segmentation, leading to improved retrieval performance. Pre-retrieval QPP methods are oblivious to the performance of the retrieval model as they predict query difficulty prior to observing the set of documents retrieved for the query. Since neural embedding-based models are showing wider adoption in the information retrieval community, in this paper, we propose a set of pre-retrieval QPP metrics based on the properties of pre-trained neural embeddings and show that such metrics are more effective for performance prediction compared to the widely known QPP metrics such as SCQ, PMI and SCS. We report our findings based on Robust04, ClueWeb09 and Gov2 corpora and their associated TREC topics. } }




Powered by WordPress