Manifold valued data analysis of samples of networks, with applications in corpus linguistics

From MaRDI portal
Publication:2135359

DOI10.1214/21-AOAS1480zbMATH Open1498.62351arXiv1902.08290OpenAlexW2948985192MaRDI QIDQ2135359FDOQ2135359


Authors: Katie E. Severn, Ian L. Dryden, Simon Preston Edit this on Wikidata


Publication date: 6 May 2022

Published in: The Annals of Applied Statistics (Search for Journal in Brave)

Abstract: Networks arise in many applications, such as in the analysis of text documents, social interactions and brain activity. We develop a general framework for extrinsic statistical analysis of samples of networks, motivated by networks representing text documents in corpus linguistics. We identify networks with their graph Laplacian matrices, for which we define metrics, embeddings, tangent spaces, and a projection from Euclidean space to the space of graph Laplacians. This framework provides a way of computing means, performing principal component analysis and regression, and carrying out hypothesis tests, such as for testing for equality of means between two samples of networks. We apply the methodology to the set of novels by Jane Austen and Charles Dickens.


Full work available at URL: https://arxiv.org/abs/1902.08290




Recommendations




Cites Work


Cited In (1)

Uses Software





This page was built for publication: Manifold valued data analysis of samples of networks, with applications in corpus linguistics

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q2135359)