Effect of data repair on mining network streams

Ji Meng Loh, Tamraparni Dasu

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

1 Scopus citation

Abstract

Data quality issues have special implications in network data. Data glitches are propagated rapidly along pathways dictated by the hierarchy and topology of the network. In this paper, we use temporal data from a vast data network to study data glitches and their effect on network monitoring tasks such as anomaly detection. We demonstrate the consequences of cleaning the data, and develop targeted and customized cleaning strategies by exploiting the network hierarchy.

Original language: English (US)
Title of host publication: Proceedings - 12th IEEE International Conference on Data Mining Workshops, ICDMW 2012
Pages: 226-233
Number of pages: 8
State: Published - Dec 1 2012
Externally published: Yes
Event: 12th IEEE International Conference on Data Mining Workshops, ICDMW 2012 - Brussels, Belgium
Duration: Dec 10 2012 → Dec 10 2012

Publication series

Name: Proceedings - 12th IEEE International Conference on Data Mining Workshops, ICDMW 2012

Other

Other: 12th IEEE International Conference on Data Mining Workshops, ICDMW 2012
Country: Belgium
City: Brussels
Period: 12/10/12 → 12/10/12

All Science Journal Classification (ASJC) codes

  • Software

Keywords

  • Big data
  • Data glitches
  • Earth mover distance
  • Missing values
  • Network analysis
  • Outliers
