On mining complex sequential data by means of FCA and pattern structures
From MaRDI portal
Publication:2817080
DOI10.1080/03081079.2015.1072925zbMATH Open1365.68412arXiv1504.02255OpenAlexW2221985475WikidataQ99455694 ScholiaQ99455694MaRDI QIDQ2817080FDOQ2817080
Authors: Aleksey Buzmakov, Elias Egho, Nicolas Jay, Sergei O. Kuznetsov, Amedeo Napoli, Chedy Raïssi
Publication date: 29 August 2016
Published in: International Journal of General Systems (Search for Journal in Brave)
Abstract: Nowadays data sets are available in very complex and heterogeneous ways. Mining of such data collections is essential to support many real-world applications ranging from healthcare to marketing. In this work, we focus on the analysis of "complex" sequential data by means of interesting sequential patterns. We approach the problem using the elegant mathematical framework of Formal Concept Analysis (FCA) and its extension based on "pattern structures". Pattern structures are used for mining complex data (such as sequences or graphs) and are based on a subsumption operation, which in our case is defined with respect to the partial order on sequences. We show how pattern structures along with projections (i.e., a data reduction of sequential structures), are able to enumerate more meaningful patterns and increase the computing efficiency of the approach. Finally, we show the applicability of the presented method for discovering and analyzing interesting patient patterns from a French healthcare data set on cancer. The quantitative and qualitative results (with annotations and analysis from a physician) are reported in this use case which is the main motivation for this work. Keywords: data mining; formal concept analysis; pattern structures; projections; sequences; sequential data.
Full work available at URL: https://arxiv.org/abs/1504.02255
Recommendations
- Revisiting pattern structure projections
- \textsc{RCA-seq}: an original approach for enhancing the analysis of sequential data based on hierarchies of multilevel closed partially-ordered patterns
- Fitting pattern structures to knowledge discovery in big data
- scientific article; zbMATH DE number 1808289
- Using pattern structures for analyzing ontology-based annotations of biomedical data
Cites Work
- SPADE: An efficient algorithm for mining frequent sequences
- Sequential pattern mining -- approaches and algorithms
- ON SUCCINCT REPRESENTATION OF KNOWLEDGE COMMUNITY TAXONOMIES WITH FORMAL CONCEPT ANALYSIS
- On stability of a formal concept
- The Efficient Computation of Complete and Concise Substring Scales with Suffix Trees
Cited In (14)
- Title not available (Why is that?)
- Formal concept analysis: from knowledge discovery to knowledge processing
- Using Formal Concept Analysis for Mining and Interpreting Patient Flows within a Healthcare Network
- Finding sequential patterns with TF-IDF metrics in health-care databases
- Revisiting pattern structure projections
- Non-homogeneous Markov models for sequential pattern mining of healthcare data
- Title not available (Why is that?)
- \textsc{RCA-seq}: an original approach for enhancing the analysis of sequential data based on hierarchies of multilevel closed partially-ordered patterns
- Using pattern structures for analyzing ontology-based annotations of biomedical data
- Hierarchies of weighted closed partially-ordered patterns for enhancing sequential data analysis
- Combining sequence and itemset mining to discover named entities in biomedical texts: a new type of pattern
- Stage division and pattern discovery of complex patient care processes
- Fitting pattern structures to knowledge discovery in big data
- Steps towards causal Formal Concept Analysis
This page was built for publication: On mining complex sequential data by means of FCA and pattern structures
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q2817080)