Core concepts in data analysis. Summarization, correlation and visualization. (Q625112)

From MaRDI portal
scientific article
Language Label Description Also known as
English
Core concepts in data analysis. Summarization, correlation and visualization.
scientific article

    Statements

    Core concepts in data analysis. Summarization, correlation and visualization. (English)
    0 references
    14 February 2011
    0 references
    As it is well known, data analysis is the process of exploring data with the goal of highlighting useful information in order to eventually support decision making. There are many approaches under this term, ranging from data mining, statistical data analysis, machine learning, etc., with various applications. This textbook follows an unconventional way to present the main aspects regarding data analysis. Thus, it starts with introducing the notion of the data analysis ``core'', also presenting some simple illustrating examples. Basically, two ways of studying data analysis are considered: {\parindent=7mm \begin{itemize}\item[(a)]summarization -- for developing and augmenting concepts, and \item[(b)]correlation -- for enhancing and establishing relations. \end{itemize}} Thus, the author mixes elements of statistical data analysis, data mining, and computational intelligence to accomplish his task to demonstrate that data analysis should help in enhancing and augmenting knowledge about the corresponding data. Then, the reader is led in a friendly way through different data analysis areas, such as: summarization and visualization, correlation, multivariate correlation, linear regression, linear discrimination, decision trees, naïve Bayes model, principal component analysis, clustering techniques, etc. The final appendix summarizes knowledge regarding basic linear algebra, basic optimization, basic Matlab, etc. Many concrete examples illustrate the theory exposed in the book. As an overall conclusion, this book represents an exciting text, covering the main topics of the data analysis area. It can be successfully used as a textbook for BS and MS students in computer science, on the one hand, and for researchers in data mining and related fields, on the other hand.
    0 references
    summarization
    0 references
    visualization
    0 references
    correlation
    0 references
    naive Bayes
    0 references
    linear regression
    0 references
    linear discrimination
    0 references
    decision trees
    0 references
    neural network
    0 references
    principal component analysis
    0 references
    clustering algorithms
    0 references
    0 references

    Identifiers