Authorship attribution using principal component analysis and competitive neural networks (Q1649294)

From MaRDI portal
scientific article
Language Label Description Also known as
English
Authorship attribution using principal component analysis and competitive neural networks
scientific article

    Statements

    Authorship attribution using principal component analysis and competitive neural networks (English)
    0 references
    0 references
    5 July 2018
    0 references
    Summary: Feature extraction is a common problem in statistical pattern recognition. It refers to a process whereby a data space is transformed into a feature space that, in theory, has exactly the same dimension as the original data space. However, the transformation is designed in such a way that the data set may be represented by a reduced number of ``effective'' features and yet retain most of the intrinsic information content of the data; in other words, the data set undergoes a dimensionality reduction. Principal component analysis is one of these processes. In this paper the data collected by counting selected syntactic characteristics in around a thousand paragraphs of each of the sample books underwent a principal component analysis. Authors of texts identified by the competitive neural networks, which use these effective features.
    0 references
    0 references
    principal components
    0 references
    authorship attribution
    0 references
    stylometry
    0 references
    text categorization
    0 references
    stylistic features
    0 references
    syntactic characteristics
    0 references
    multilayer preceptor
    0 references
    competitive learning
    0 references
    artificial neural network
    0 references
    0 references
    0 references