Biswas, Meghna (2025) Semi-Automated class number prediction of bibliographical resources : a framework deploying ANNIF. In: Advancing Library and Information Science: innovations, practices, and future directions. Vyom Hans, Punjab, India, pp. 83-104. ISBN 9788198181466
9_Chapter-Meghna_Biswas.pdf - Published Version
Abstract
This study investigates an AI/ML-based semi-automated indexing system for libraries to efficiently process large document collections. Using supervised learning within Python's Annif framework, we trained models on manually classified MARC bibliographic records organized by Dewey Decimal Classification (DDC) standards. The implementation involved collecting and processing records containing titles, summaries, DDC numbers and subject descriptors, then dividing them into training and test datasets. We evaluated four algorithms (TF-IDF, Omikuji, FastText and NN Ensemble) using standard retrieval metrics (F1@5 and NDCG), finding that Omikuji and NN Ensemble significantly outperformed the others in indexing accuracy. The complete open-source framework demonstrates the viability of machine learning for library classification tasks, offering an efficient alternative to manual indexing while maintaining accuracy. These results suggest promising applications for AI in knowledge organization systems, with potential for expansion to other classification schemes and larger datasets to further enhance performance.
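The abstract names F1@5 and NDCG as the evaluation metrics. As a rough illustration of what these measure for a single record, the sketch below scores one ranked list of suggested class numbers against a gold-standard set. It is not taken from the chapter or from Annif's own evaluation code: the function names `f1_at_k` and `ndcg`, the binary relevance assumption, and the sample DDC numbers are all hypothetical, and a real evaluation would average such scores over the whole test set.

```python
import math

def f1_at_k(ranked, gold, k=5):
    """F1 over the top-k suggestions (assumes no duplicate suggestions)."""
    top = ranked[:k]
    hits = sum(1 for label in top if label in gold)
    if hits == 0:
        return 0.0
    precision = hits / len(top)
    recall = hits / len(gold)
    return 2 * precision * recall / (precision + recall)

def ndcg(ranked, gold, k=None):
    """NDCG with binary relevance: 1 if a suggestion is in gold, else 0."""
    top = ranked if k is None else ranked[:k]
    # Discounted gain: a hit at rank i (0-based) contributes 1 / log2(i + 2).
    dcg = sum(1.0 / math.log2(i + 2)
              for i, label in enumerate(top) if label in gold)
    # Ideal ordering places every reachable gold label at the top.
    ideal_hits = min(len(gold), len(top))
    idcg = sum(1.0 / math.log2(i + 2) for i in range(ideal_hits))
    return dcg / idcg if idcg > 0 else 0.0

# Hypothetical example: five suggested DDC numbers, two of them correct.
ranked = ["025.04", "006.31", "020", "025.3", "004"]
gold = {"006.31", "025.3"}

print(round(f1_at_k(ranked, gold, k=5), 4))
print(round(ndcg(ranked, gold), 4))
```

F1@5 rewards getting the correct class numbers anywhere in the top five suggestions, while NDCG additionally rewards placing them nearer the top of the ranking, which is why the two metrics are often reported together.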
Item Type: | Book Section |
---|---|
Keywords: | Supervised Machine Learning; Semi-Automated Classification; Automated Subject Indexing; DDC; Annif; Ensemble approach |
Subjects: | Social Sciences and humanities > Decision Sciences > Information Systems and Management; Physical, Life and Health Sciences > Computer Science; Social Sciences and humanities > Social Sciences > Library and Information Science |
JGU School/Centre: | Global Library |
Depositing User: | Mr. Gautam Kumar |
Date Deposited: | 24 Jul 2025 06:05 |
Last Modified: | 24 Jul 2025 06:05 |
Official URL: | https://doi.org/10.34256/vadlibs.25.9.83 |
URI: | https://pure.jgu.edu.in/id/eprint/9881 |