Auditing data streams for correlated glitches

Ji Meng Loh, Tamraparni Dasu

Research output: Contribution to journalArticlepeer-review


Cellular networks carry massive volumes of voice, text and data traffic every second. The networks are monitored constantly to measure network performance, detect traffic congestion, identify anomalies, and to serve other customer service and network support functions. Data collected from mobility networks is used to make many critical decisions. The quality of the information plays an important role in the effectiveness of these decisions. Therefore, it is important to ensure that the data collected from cellular networks meet quality standards. In particular, identifying glitches that are correlated can help in isolating root causes and facilitate more efficient problem solving in the network, as well as quicker data repairs. In this paper, we present a methodology for automated auditing of massive, complex data streams with a focus on correlated glitches, and a case study that illustrates the application of this methodology. The methodology has two main components: a set of logical constraints that embody domain specific information, and statistical methods for identifying correlated glitches to enable automated quantitative cleaning of data. Together, the two components provide a comprehensive yet customisable set of criteria for evaluating information quality as a function of time and network topology.

Original languageEnglish (US)
Pages (from-to)85-106
Number of pages22
JournalInternational Journal of Information Quality
Issue number2
StatePublished - 2013

All Science Journal Classification (ASJC) codes

  • Information Systems
  • Computer Science Applications


  • Automated detection
  • Correlated glitches
  • Data quality
  • Hierarchical data
  • Spatio-temporal analysis
  • Stream mining


Dive into the research topics of 'Auditing data streams for correlated glitches'. Together they form a unique fingerprint.

Cite this