AutoMSC: automatic assignment of Mathematics Subject Classification labels
From MaRDI portal
Abstract: Authors of research papers in mathematics and other math-heavy disciplines commonly employ the Mathematics Subject Classification (MSC) scheme to search for relevant literature. The MSC is a hierarchical alphanumerical classification scheme that allows librarians to specify one or multiple codes for publications. Digital libraries in mathematics, as well as reviewing services such as zbMATH and Mathematical Reviews (MR), rely on these MSC labels in their workflows to organize the abstracting and reviewing process. In particular, the coarse-grained classification determines the subject editor who is responsible for the actual reviewing process. In this paper, we investigate the feasibility of automatically assigning a coarse-grained primary classification using the MSC scheme by treating the problem as a multi-class classification machine learning task. We find that our method achieves an \(F_1\)-score of over 77%, which is remarkably close to the agreement of zbMATH and MR (\(F_1\)-score of 81%). Moreover, we find that the method's confidence score allows for reducing the effort by 86% compared to the manual coarse-grained classification effort while maintaining a precision of 81% for automatically classified articles.
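The trade-off between reduced manual effort and precision mentioned in the abstract can be illustrated with a small sketch. It is not the authors' implementation: the function name `auto_classify_stats`, the toy predictions, and the threshold value are all hypothetical, and the only assumption is that each prediction carries a confidence score, with articles below the threshold left for manual classification.

```python
from typing import List, Tuple

# (predicted top-level MSC code, true code, classifier confidence); all values are made up.
Prediction = Tuple[str, str, float]


def auto_classify_stats(preds: List[Prediction], threshold: float) -> Tuple[float, float]:
    """Return (coverage, precision) for predictions accepted at the given threshold.

    coverage  = share of articles classified automatically (confidence >= threshold)
    precision = share of those automatically classified articles that are correct
    """
    accepted = [p for p in preds if p[2] >= threshold]
    if not accepted:
        return 0.0, 0.0
    correct = sum(1 for predicted, true, _ in accepted if predicted == true)
    coverage = len(accepted) / len(preds)
    precision = correct / len(accepted)
    return coverage, precision


# Hypothetical toy data, not taken from the paper.
toy = [
    ("05", "05", 0.95),
    ("11", "11", 0.90),
    ("68", "03", 0.85),
    ("35", "35", 0.80),
    ("60", "62", 0.40),
]

coverage, precision = auto_classify_stats(toy, threshold=0.75)
print(f"coverage={coverage:.2f} precision={precision:.2f}")
```

Raising the threshold typically lowers coverage and raises precision, so in a setting like the one described the threshold would be chosen to meet a target precision while automating as many articles as possible.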
Recommendations
- Automated Classification and Categorization of Mathematical Knowledge
- NLP-based detection of Mathematics Subject Classification
- Reimplementing the mathematics subject classification (MSC) as a linked open dataset
- scientific article; zbMATH DE number 1338276
- Mathematical document classification via symbol frequency analysis
Cited in (7)
- 10 years later: the Mathematics Subject Classification and Linked Open Data
- Automated Classification and Categorization of Mathematical Knowledge
- AutoMSC
- NLP-based detection of Mathematics Subject Classification
- Evaluation and domain adaptation of similarity models for short mathematical texts
- Using general large language models to classify mathematical documents
- A retrieval and ranking method of mathematical documents based on CA-YOLOv5 and HFS