Sparse learning of the disease severity score for high-dimensional data (Q1693802)

From MaRDI portal





scientific article; zbMATH DE number 6833051
Language Label Description Also known as
default for all languages
No label defined
    English
    Sparse learning of the disease severity score for high-dimensional data
    scientific article; zbMATH DE number 6833051

      Statements

      Sparse learning of the disease severity score for high-dimensional data (English)
      0 references
      0 references
      0 references
      0 references
      31 January 2018
      0 references
      Summary: Learning disease severity scores automatically from collected measurements may aid in the quality of both healthcare and scientific understanding. Some steps in that direction have been taken and machine learning algorithms for extracting scoring functions from data have been proposed. Given the rapid increase in both quantity and diversity of data measured and stored, the large amount of information is becoming one of the challenges for learning algorithms. In this work, we investigate the direction of the problem where the dimensionality of measured variables is large. Learning the severity score in such cases brings the issue of which of measured features are relevant. We propose a novel approach by combining desirable properties of existing formulations, which compares favorably to alternatives in accuracy and especially in the robustness of the learned scoring function. The proposed formulation has a nonsmooth penalty that induces sparsity. This problem is solved by addressing a dual formulation which is smooth and allows an efficient optimization. The proposed approach might be used as an effective and reliable tool for both scoring function learning and biomarker discovery, as demonstrated by identifying a stable set of genes related to influenza symptoms' severity, which are enriched in immune-related processes.
      0 references
      machine learning algorithms
      0 references
      scoring functions
      0 references
      large dimensionality of measured variables
      0 references
      robustness
      0 references

      Identifiers

      0 references
      0 references
      0 references
      0 references
      0 references
      0 references