Core concepts in data analysis. Summarization, correlation and visualization. (Q625112)

From MaRDI portal





scientific article; zbMATH DE number 5851575
Language Label Description Also known as
default for all languages
No label defined
    English
    Core concepts in data analysis. Summarization, correlation and visualization.
    scientific article; zbMATH DE number 5851575

      Statements

      Core concepts in data analysis. Summarization, correlation and visualization. (English)
      0 references
      14 February 2011
      0 references
      As it is well known, data analysis is the process of exploring data with the goal of highlighting useful information in order to eventually support decision making. There are many approaches under this term, ranging from data mining, statistical data analysis, machine learning, etc., with various applications. This textbook follows an unconventional way to present the main aspects regarding data analysis. Thus, it starts with introducing the notion of the data analysis ``core'', also presenting some simple illustrating examples. Basically, two ways of studying data analysis are considered: {\parindent=7mm \begin{itemize}\item[(a)]summarization -- for developing and augmenting concepts, and \item[(b)]correlation -- for enhancing and establishing relations. \end{itemize}} Thus, the author mixes elements of statistical data analysis, data mining, and computational intelligence to accomplish his task to demonstrate that data analysis should help in enhancing and augmenting knowledge about the corresponding data. Then, the reader is led in a friendly way through different data analysis areas, such as: summarization and visualization, correlation, multivariate correlation, linear regression, linear discrimination, decision trees, naïve Bayes model, principal component analysis, clustering techniques, etc. The final appendix summarizes knowledge regarding basic linear algebra, basic optimization, basic Matlab, etc. Many concrete examples illustrate the theory exposed in the book. As an overall conclusion, this book represents an exciting text, covering the main topics of the data analysis area. It can be successfully used as a textbook for BS and MS students in computer science, on the one hand, and for researchers in data mining and related fields, on the other hand.
      0 references
      summarization
      0 references
      visualization
      0 references
      correlation
      0 references
      naive Bayes
      0 references
      linear regression
      0 references
      linear discrimination
      0 references
      decision trees
      0 references
      neural network
      0 references
      principal component analysis
      0 references
      clustering algorithms
      0 references
      0 references

      Identifiers