Non-parametric detection of meaningless distances in high dimensional data
From MaRDI portal
Publication:746216
DOI: 10.1007/S11222-011-9229-0
zbMATH Open: 1322.62145
DBLP: journals/sac/Kaban12a
OpenAlex: W2160406974
Wikidata: Q56140630
Scholia: Q56140630
MaRDI QID: Q746216
FDO: Q746216
Authors: Ata Kabán
Publication date: 16 October 2015
Published in: Statistics and Computing
Full work available at URL: https://doi.org/10.1007/s11222-011-9229-0
Recommendations
- When is `nearest neighbour' meaningful: A converse theorem and implications
- On the distance concentration awareness of certain data reduction techniques
- scientific article; zbMATH DE number 2080487
- High-dimensional \(p\)-norms
- On the behavior of intrinsically high-dimensional spaces: distances, direct and reverse nearest neighbors, and hubness
Keywords: curse of dimensionality, high dimensional data, statistical test, nearest neighbour, distance concentration, Chebyshev bound
Cites Work
- Hubs in space: popular nearest neighbors in high-dimensional data
- Title not available
- When is `nearest neighbour' meaningful: A converse theorem and implications
- On the distance concentration awareness of certain data reduction techniques
- New instability results for high-dimensional nearest neighbor search
- Selecting marker genes for cancer classification using supervised weighted kernel clustering and the support vector machine
Cited In (11)
- Title not available
- Rigid transformations for stabilized lower dimensional space to support subsurface uncertainty quantification and interpretation
- Identifying consistent statements about numerical data with dispersion-corrected subgroup discovery
- High-dimensional \(p\)-norms
- On the behavior of intrinsically high-dimensional spaces: distances, direct and reverse nearest neighbors, and hubness
- Instability results for Euclidean distance, nearest neighbor search on high dimensional Gaussian data
- The hubness phenomenon: fact or artifact?
- Smoothed Quantiles for Measuring Discrete Risks
- Measuring Discrete Risks on Infinite Domains: Theoretical Foundations, Conditional Five Number Summaries, and Data Analyses
- Intrinsic dimension of geometric data sets
- Efficiency of the pMST and RDELA location and scatter estimators