LexiSemIR: A Two-Stage Re-ranking Framework with BM25 and Zero-Shot Bi-Encoder

Gupta, Swati, Nath, Tanusree, Gupta, Vedika ORCID: https://orcid.org/0000-0002-8109-498X and Gupta, Manjari (2025) LexiSemIR: A Two-Stage Re-ranking Framework with BM25 and Zero-Shot Bi-Encoder. In: 2025 - Forum for Information Retrieval Evaluation, 17 December 2025 - 20 December 2025, Varanasi.

[thumbnail of LexiSemIR.pdf]
Preview
Text
LexiSemIR.pdf - Published Version
Available under License Creative Commons Attribution.

Download (1MB) | Preview

Abstract

Most standard Information Retrieval (IR) models primarily rely on keyword matching, which can be inadequate when a deeper contextual understanding is required. In such cases, it becomes essential to capture both lexical and semantic relationships between query-document pairs. To address this limitation, our team CodeWeavers proposes LexiSemIR, a two-stage re-ranking-based model developed for the CMIR-2025 (Code-Mixed Information Retrieval) shared task on Bengali-English code-mixed text. In the first stage, the top k documents are retrieved using a lexical bag-of-words model (BM25). These are then re-ranked in the second stage using a zero-shot bi-encoder, which computes semantic similarity between query and document embeddings. The proposed approach balances simplicity and performance, while minimizing trainable parameters due to its zero-shot design. LexiSemIR secured 3rd place in the CMIR-2025 shared task, achieving MAP = 0.1546 and P@5 = 0.38, thereby outperforming the BM25 baseline in early precision. The results highlight the model’s ability to effectively combine lexical and semantic retrieval strategies for robust performance in code-mixed IR settings.

Item Type: Conference or Workshop Item (Paper)
Uncontrolled Keywords: Bi-Encoder | BM25 | Code-mixed language | Information retrieval
Subjects: Physical, Life and Health Sciences > Computer Science
Divisions: Jindal Global Business School
Depositing User: Mr. Syed Anas
Date Deposited: 08 Jun 2026 06:43
Last Modified: 08 Jun 2026 06:43
Official URL: https://ceur-ws.org/Vol-4173/T3-4.pdf
URI: https://pure.jgu.edu.in/id/eprint/11536

Downloads

Downloads per month over past year

Actions (login required)

View Item
View Item