PGRdup




Identifiers: swMATH 18761, CRAN PGRdup, MaRDI QID Q30592

Discover Probable Duplicates in Plant Genetic Resources Collections

J. Radhamani, Rishi Kumar Tyagi, Kalyani Srinivasan, J. Aravind, B. Ananda Subhash

Last update: 31 August 2023

Copyright license: GNU General Public License, version 3.0, GNU General Public License, version 2.0

Software version identifier: 0.2, 0.2.1, 0.2.2, 0.2.2.1, 0.2.3, 0.2.3.1, 0.2.3.2, 0.2.3.3, 0.2.3.4, 0.2.3.5, 0.2.3.6, 0.2.3.7, 0.2.3.8, 0.2.3.9

Source code repository: https://github.com/cran/PGRdup

Provides functions to aid the identification of probable or possible duplicates in Plant Genetic Resources (PGR) collections using 'passport databases', which comprise information records for each constituent sample. These include methods for cleaning the data, creating a searchable Key Word in Context (KWIC) index of the keywords associated with sample records, and identifying nearly identical records with similar information through fuzzy, phonetic and semantic matching of keywords.
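The workflow described above (clean the data, build a keyword index, match keywords approximately) can be illustrated with a minimal Python sketch. This is not PGRdup's implementation or API: PGRdup is an R package, and the function names `clean`, `kwic_index` and `probable_duplicates`, the sample records, and the 0.8 similarity threshold are all hypothetical choices made for illustration. Only fuzzy matching is shown, using the standard library's `difflib`; phonetic and semantic matching are omitted.

```python
import re
from collections import defaultdict
from difflib import SequenceMatcher


def clean(text):
    """Upper-case the text and replace runs of non-alphanumerics with one space."""
    return re.sub(r"[^A-Za-z0-9]+", " ", text.upper()).strip()


def kwic_index(records):
    """Map each keyword to the (record id, cleaned field) contexts it occurs in."""
    index = defaultdict(list)
    for rec_id, name in records.items():
        cleaned = clean(name)
        for keyword in cleaned.split():
            index[keyword].append((rec_id, cleaned))
    return dict(index)


def probable_duplicates(index, threshold=0.8):
    """Return sorted pairs of distinct record ids that share a similar keyword.

    Two keywords count as similar when their difflib ratio reaches the
    threshold (an assumed value, not one taken from PGRdup).
    """
    keywords = sorted(index)
    pairs = set()
    for i, kw_a in enumerate(keywords):
        for kw_b in keywords[i:]:
            if SequenceMatcher(None, kw_a, kw_b).ratio() < threshold:
                continue
            for id_a, _ in index[kw_a]:
                for id_b, _ in index[kw_b]:
                    if id_a != id_b:
                        pairs.add(tuple(sorted((id_a, id_b))))
    return sorted(pairs)


# Hypothetical passport records: accession id -> cultivar name as entered.
records = {
    "A1": "Mota Company",
    "A2": "MOTA-COMPANY",
    "A3": "Virginia Bunch",
    "A4": "Virgina bunch",
}

index = kwic_index(records)
duplicates = probable_duplicates(index)
print(duplicates)
```

In this sketch, cleaning makes "Mota Company" and "MOTA-COMPANY" identical, while the fuzzy comparison links the misspelled "VIRGINA" to "VIRGINIA". Comparing every keyword pair is quadratic in the number of keywords, which is acceptable for an illustration but would need blocking or indexing on a real collection.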




