Semi-Automated class number prediction of bibliographical resources : a framework deploying ANNIF

Biswas, Meghna (2025) Semi-Automated class number prediction of bibliographical resources : a framework deploying ANNIF. In: Advancing Library and Information Science: innovations, practices, and future directions. Vyom Hans, Punjab, India, pp. 83-104. ISBN 9788198181466

[thumbnail of 9_Chapter-Meghna_Biswas.pdf] Text
9_Chapter-Meghna_Biswas.pdf - Published Version

Download (1MB)

Abstract

This study investigates an AI/ML-based semi-automated indexing system for libraries to efficiently process large document collections. Using supervised learning within Python's Annif framework, we trained models on manually classified MARC bibliographic records organized by Dewey Decimal Classification (DDC) standards. The implementation involved collecting and processing records containing titles, summaries, DDC numbers and subject descriptors, then dividing them into training and test datasets. We evaluated four algorithms (TF-IDF, Omikuji, FastText and NN Ensemble) using standard retrieval metrics (F1@5 and NDCG), finding that Omikuji and NN Ensemble significantly outperformed the others in indexing accuracy. The complete open-source framework demonstrates the viability of machine learning for library classification tasks, offering an efficient alternative to manual indexing while maintaining accuracy. These results suggest promising applications for AI in knowledge organization systems, with potential for expansion to other classification schemes and larger datasets to further enhance performance.

Item Type: Book Section
Keywords: Supervised Machine Learning | Semi-Automated Classification | Automated Subject Indexing | DDC | Annif | Ensemble approach
Subjects: Social Sciences and humanities > Decision Sciences > Information Systems and Management
Physical, Life and Health Sciences > Computer Science
Social Sciences and humanities > Social Sciences > Library and Information Science
JGU School/Centre: Global Library
Depositing User: Mr. Gautam Kumar
Date Deposited: 24 Jul 2025 06:05
Last Modified: 24 Jul 2025 06:05
Official URL: https://doi.org/10.34256/vadlibs.25.9.83
URI: https://pure.jgu.edu.in/id/eprint/9881

Downloads

Downloads per month over past year

Actions (login required)

View Item
View Item