PsychWordVec

From MaRDI portal
Software:72395



CRANPsychWordVecMaRDI QIDQ72395

Word Embedding Research Framework for Psychological Science

Han-Wu-Shuang Bao

Last update: 27 September 2023

Software version identifier: 0.3.2, 0.1.0, 0.1.2, 0.2.0, 0.3.0, 0.3.1, 2023.8, 2023.9


Copyright license: GNU General Public License, version 3.0

An integrative toolbox of word embedding research that provides: (1) a collection of 'pre-trained' static word vectors in the '.RData' compressed format <https://psychbruce.github.io/WordVector_RData.pdf>; (2) a series of functions to process, analyze, and visualize word vectors; (3) a range of tests to examine conceptual associations, including the Word Embedding Association Test <doi:10.1126/science.aal4230> and the Relative Norm Distance <doi:10.1073/pnas.1720347115>, with permutation test of significance; (4) a set of training methods to locally train (static) word vectors from text corpora, including 'Word2Vec' <arXiv:1301.3781>, 'GloVe' <doi:10.3115/v1/D14-1162>, and 'FastText' <arXiv:1607.04606>; (5) a group of functions to download 'pre-trained' language models (e.g., 'GPT', 'BERT') and extract contextualized (dynamic) word vectors (based on the R package 'text').