Partitioned K-nearest neighbor local depth for scalable comparison-based learning

From MaRDI portal
Publication:6375628

arXiv2108.08864MaRDI QIDQ6375628FDOQ6375628


Authors: Jacob D. Baron, Richard W. R. Darling, J. Laylon Davis, Regina Pettit Edit this on Wikidata


Publication date: 19 August 2021

Abstract: A triplet comparison oracle on a set S takes an object xinS and for any pair y,zsubsetSsetminusx declares which of y and z is more similar to x. Partitioned Local Depth (PaLD) supplies a principled non-parametric partitioning of S under such triplet comparisons but needs O(n2logn) oracle calls and O(n3) post-processing steps. We introduce Partitioned Nearest Neighbors Local Depth (PaNNLD), a computationally tractable variant of PaLD leveraging the K-nearest neighbors digraph on S. PaNNLD needs only O(nKlogn) oracle calls, by replacing an oracle call by a coin flip when neither y nor z is adjacent to x in the undirected version of the K-nearest neighbors digraph. By averaging over randomizations, PaNNLD subsequently requires (at best) only O(nK2) post-processing steps. Concentration of measure shows that the probability of randomization-induced error delta in PaNNLD is no more than 2edelta2K2.













This page was built for publication: Partitioned K-nearest neighbor local depth for scalable comparison-based learning

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q6375628)