Efficient Local Intrinsic Dimensionality Estimation in Evolving Deep Representations

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

Local intrinsic dimensionality (LID) provides insight into the behavior of individual training points in deep neural networks, with applications including adversarial detection, prevention of dimensional collapse in self-supervised learning, and identification of untruthful responses from large language models (LLMs). In such contexts, efficient LID estimation has depended on the use of mini-batches, due to the high cost of computing neighborhoods in latent space. However, estimation with respect to small subsets of the training data usually reflects the dimensionality of the global manifold structure rather than the intended local distribution around each point. In this paper, we propose the Nearest Distance Cache (NDC), a method that improves the locality of LID estimation by reusing nearest-neighbor distances observed in past mini-batches. This strategy faces two key challenges: representations evolve over time, and limited memory prevents storing all past distances. To address these, NDC maintains a compact cache of nearest distances per example and uses window-based change detection to discard outdated samples affected by distributional drift. We also evaluate NDC on two tasks: an autoencoder trained on synthetic data with known ground-truth LID, and a ResNet trained on CIFAR-10. Results show that NDC captures local properties of deep representations not revealed by single mini-batch estimates.
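The caching idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the class name `NearestDistanceCacheSketch`, its parameters (`k`, `max_age`), and the simple age-based eviction rule are all assumptions made for illustration, and the eviction rule is only a crude stand-in for the window-based change detection the abstract mentions. The LID estimate uses the commonly used maximum-likelihood estimator computed from neighbor distances.

```python
import numpy as np


def lid_mle(distances):
    """Maximum-likelihood LID estimate from a set of positive neighbor distances."""
    r = np.sort(np.asarray(distances, dtype=float))
    r = r[r > 0]
    if r.size < 2:
        return float("nan")  # not enough neighbors to estimate anything
    mean_log = np.log(r / r[-1]).mean()
    if mean_log == 0:
        return float("nan")  # all distances equal: estimator undefined
    return -1.0 / mean_log


class NearestDistanceCacheSketch:
    """Hypothetical per-example cache of the k smallest distances seen so far."""

    def __init__(self, n_examples, k, max_age):
        self.k = k
        self.max_age = max_age  # assumed staleness threshold, in training steps
        self.dist = np.full((n_examples, k), np.inf)
        self.step = np.full((n_examples, k), -1)

    def update(self, idx, feats, step):
        """Merge distances from one mini-batch (example ids `idx`) into the cache."""
        d = np.linalg.norm(feats[:, None, :] - feats[None, :, :], axis=-1)
        np.fill_diagonal(d, np.inf)  # ignore self-distances
        batch_steps = np.full(len(idx), step)
        for row, i in enumerate(idx):
            # Stand-in for drift handling: forget distances that are too old.
            stale = (step - self.step[i]) > self.max_age
            self.dist[i, stale] = np.inf
            merged_d = np.concatenate([self.dist[i], d[row]])
            merged_s = np.concatenate([self.step[i], batch_steps])
            keep = np.argsort(merged_d)[: self.k]
            self.dist[i] = merged_d[keep]
            self.step[i] = merged_s[keep]

    def lid(self, i):
        """LID estimate for example i from its cached (finite) distances."""
        d = self.dist[i]
        return lid_mle(d[np.isfinite(d)])
```

In this sketch, each call to `update` can only shrink the cached distances of an example until they are evicted, so the neighborhood used for estimation becomes more local than one drawn from a single mini-batch. A simplification to keep in mind is that the same neighbor may contribute more than one distance if it reappears in later batches.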

Original language: English (US)
Title of host publication: Similarity Search and Applications - 18th International Conference, SISAP 2025, Proceedings
Editors: Giuseppe Amato, Vladimir Mic, Agma Traina, Nicola Messina, Laurent Amsaleg, Gylfi Þór Guðmundsson, Björn Þór Jónsson, Lucia Vadicamo
Publisher: Springer Science and Business Media Deutschland GmbH
Pages: 41-55
Number of pages: 15
ISBN (Print): 9783032060686
State: Published - 2026
Externally published: Yes
Event: 18th International Conference on Similarity Search and Applications, SISAP 2025 - Reykjavik, Iceland
Duration: Oct 1 2025 to Oct 3 2025

Publication series

Name: Lecture Notes in Computer Science
Volume: 16134 LNCS
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Conference

Conference: 18th International Conference on Similarity Search and Applications, SISAP 2025
Country/Territory: Iceland
City: Reykjavik
Period: 10/1/25 to 10/3/25

All Science Journal Classification (ASJC) codes

  • Theoretical Computer Science
  • General Computer Science

Keywords

  • Deep Representations
  • Distributional Drift Detection
  • Local Intrinsic Dimensionality
  • Nearest Distance Cache
