Abstract
Cellular networks carry massive volumes of voice, text and data traffic every second. The networks are monitored constantly to measure network performance, detect traffic congestion, identify anomalies, and to serve other customer service and network support functions. Data collected from mobility networks is used to make many critical decisions. The quality of the information plays an important role in the effectiveness of these decisions. Therefore, it is important to ensure that the data collected from cellular networks meet quality standards. In particular, identifying glitches that are correlated can help in isolating root causes and facilitate more efficient problem solving in the network, as well as quicker data repairs. In this paper, we present a methodology for automated auditing of massive, complex data streams with a focus on correlated glitches, and a case study that illustrates the application of this methodology. The methodology has two main components: a set of logical constraints that embody domain specific information, and statistical methods for identifying correlated glitches to enable automated quantitative cleaning of data. Together, the two components provide a comprehensive yet customisable set of criteria for evaluating information quality as a function of time and network topology.
Original language | English (US) |
---|---|
Pages (from-to) | 85-106 |
Number of pages | 22 |
Journal | International Journal of Information Quality |
Volume | 3 |
Issue number | 2 |
DOIs | |
State | Published - 2013 |
All Science Journal Classification (ASJC) codes
- Information Systems
- Computer Science Applications
Keywords
- Automated detection
- Correlated glitches
- Data quality
- Hierarchical data
- Spatio-temporal analysis
- Stream mining