Estimating sequence similarity from read sets for clustering next-generation sequencing data
DOI10.1007/S10618-018-0584-8zbMATH Open1458.62130arXiv1705.06125OpenAlexW2963017196WikidataQ129411252 ScholiaQ129411252MaRDI QIDQ2218320FDOQ2218320
Authors: Petr Ryšavý, Filip Železný
Publication date: 15 January 2021
Published in: Data Mining and Knowledge Discovery (Search for Journal in Brave)
Full work available at URL: https://arxiv.org/abs/1705.06125
Recommendations
- Algorithms for indexing highly similar DNA sequences
- Better greedy sequence clustering with fast banded alignment
- A heuristic clustering method based on neighbor-seeds for 454 sequencing data
- Distance measures for biological sequences: some recent approaches
- A novel method for sequence similarity analysis based on the relative frequency of dual nucleo\-tides
Classification and discrimination; cluster analysis (statistical aspects) (62H30) Protein sequences, DNA sequences (92D20) Computational methods for problems pertaining to biology (92-08)
Cites Work
Cited In (6)
- \textit{De novo} clustering of long-read transcriptome data using a greedy, quality-value based algorithm
- Subset Clustering of Binary Sequences, with an Application to Genomic Abnormality Data
- A heuristic clustering method based on neighbor-seeds for 454 sequencing data
- Numeric Lyndon-based feature embedding of sequencing reads for machine learning approaches
- Better greedy sequence clustering with fast banded alignment
- Can we replace reads by numeric signatures? Lyndon fingerprints as representations of sequencing reads for machine learning
Uses Software
This page was built for publication: Estimating sequence similarity from read sets for clustering next-generation sequencing data
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q2218320)