Gupta, Swati, Nath, Tanusree, Gupta, Vedika (ORCID: https://orcid.org/0000-0002-8109-498X) and Gupta, Manjari (2025) LexiSemIR: A Two-Stage Re-ranking Framework with BM25 and Zero-Shot Bi-Encoder.
In: 2025 - Forum for Information Retrieval Evaluation, 17 December 2025 - 20 December 2025, Varanasi.
LexiSemIR.pdf - Published Version
Available under License Creative Commons Attribution.
Abstract
Most standard Information Retrieval (IR) models primarily rely on keyword matching, which can be inadequate when a deeper contextual understanding is required. In such cases, it becomes essential to capture both lexical and semantic relationships between query-document pairs. To address this limitation, our team CodeWeavers proposes LexiSemIR, a two-stage re-ranking-based model developed for the CMIR-2025 (Code-Mixed Information Retrieval) shared task on Bengali-English code-mixed text. In the first stage, the top k documents are retrieved using a lexical bag-of-words model (BM25). These are then re-ranked in the second stage using a zero-shot bi-encoder, which computes semantic similarity between query and document embeddings. The proposed approach balances simplicity and performance, while minimizing trainable parameters due to its zero-shot design. LexiSemIR secured 3rd place in the CMIR-2025 shared task, achieving MAP = 0.1546 and P@5 = 0.38, thereby outperforming the BM25 baseline in early precision. The results highlight the model’s ability to effectively combine lexical and semantic retrieval strategies for robust performance in code-mixed IR settings.
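The two-stage flow described in the abstract (BM25 retrieval of the top k documents, then re-ranking by embedding similarity) can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the whitespace tokenizer, the BM25 parameters `k1` and `b`, the helper names, and the sample documents are all assumptions. In particular, `toy_encode` is a hypothetical stand-in (hashed character trigrams) for the pretrained zero-shot bi-encoder, which is not named in this record.

```python
import math
import zlib
from collections import Counter
from typing import Callable, List, Sequence, Tuple

Vector = List[float]


def tokenize(text: str) -> List[str]:
    # Assumption: lowercase whitespace tokens; the paper's preprocessing may differ.
    return text.lower().split()


def bm25_scores(query: str, docs: Sequence[str],
                k1: float = 1.5, b: float = 0.75) -> List[float]:
    """Okapi BM25 score of every document for the query (k1, b are assumed defaults)."""
    tokenized = [tokenize(d) for d in docs]
    n_docs = len(tokenized)
    avg_len = sum(len(t) for t in tokenized) / max(n_docs, 1)
    doc_freq = Counter(term for toks in tokenized for term in set(toks))
    query_terms = set(tokenize(query))
    scores = []
    for toks in tokenized:
        tf = Counter(toks)
        score = 0.0
        for term in query_terms:
            if term not in tf:
                continue
            idf = math.log(1 + (n_docs - doc_freq[term] + 0.5) / (doc_freq[term] + 0.5))
            norm = tf[term] + k1 * (1 - b + b * len(toks) / avg_len)
            score += idf * tf[term] * (k1 + 1) / norm
        scores.append(score)
    return scores


def toy_encode(text: str, dim: int = 64) -> Vector:
    # Hypothetical stand-in for a pretrained bi-encoder: hashed character trigrams.
    vec = [0.0] * dim
    padded = f" {text.lower()} "
    for i in range(len(padded) - 2):
        vec[zlib.crc32(padded[i:i + 3].encode("utf-8")) % dim] += 1.0
    return vec


def cosine(u: Vector, v: Vector) -> float:
    dot = sum(x * y for x, y in zip(u, v))
    norm = math.sqrt(sum(x * x for x in u)) * math.sqrt(sum(y * y for y in v))
    return dot / norm if norm else 0.0


def two_stage_rank(query: str, docs: Sequence[str],
                   encode: Callable[[str], Vector], k: int = 3) -> List[Tuple[int, float]]:
    """Stage 1: keep the BM25 top-k. Stage 2: re-rank them by embedding cosine similarity."""
    lexical = bm25_scores(query, docs)
    candidates = sorted(range(len(docs)), key=lambda i: lexical[i], reverse=True)[:k]
    q_vec = encode(query)  # query and documents are encoded independently (bi-encoder)
    reranked = [(i, cosine(q_vec, encode(docs[i]))) for i in candidates]
    return sorted(reranked, key=lambda pair: pair[1], reverse=True)


# Illustrative code-mixed style inputs (invented for this sketch).
docs = [
    "ami kolkata te thaki",
    "weather in kolkata today",
    "best biryani recipe",
    "kolkata weather ajke kemon",
]
for doc_id, score in two_stage_rank("kolkata weather", docs, toy_encode, k=3):
    print(doc_id, round(score, 3), docs[doc_id])
```

Because the encoder is passed in as a function, a real sentence-embedding model could replace `toy_encode` without changing the pipeline. Documents outside the BM25 top k are never re-scored, which keeps the second stage cheap but means it cannot recover documents that the lexical stage missed.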
| Item Type: | Conference or Workshop Item (Paper) |
|---|---|
| Uncontrolled Keywords: | Bi-Encoder; BM25; Code-mixed language; Information retrieval |
| Subjects: | Physical, Life and Health Sciences > Computer Science |
| Divisions: | Jindal Global Business School |
| Depositing User: | Mr. Syed Anas |
| Date Deposited: | 08 Jun 2026 06:43 |
| Last Modified: | 08 Jun 2026 06:43 |
| Official URL: | https://ceur-ws.org/Vol-4173/T3-4.pdf |
| URI: | https://pure.jgu.edu.in/id/eprint/11536 |
