Cost-Effective and Latency-minimized Data Placement Strategy for Spatial Crowdsourcing in Multi-Cloud Environment

Pengwei Wang, Zhen Chen, Mengchu Zhou, Zhaohui Zhang, Abdullah Abusorrah, Ahmed Chiheb Ammari

Research output: Contribution to journalArticlepeer-review

5 Scopus citations

Abstract

As an increasingly mature business model, crowdsourcing, especially spatial crowdsourcing, has played an important role in data collection, disaster response, urban planning and other fields. However, the rapid growth of user scale and massive data collected inevitably brings serious challenges to computing and storage resources. The emergence of cloud computing provides an opportunity to handle such challenges. Its nearly unlimited resource provision capability can provide reliable services for different crowdsourcing applications. Nevertheless, considering the risks of privacy leakage and vendor lock-in using only a single cloud, as well as the additional restrictions caused by the wide geographical distribution of data and associations among workers, the use of multi-cloud seems to be a better choice. In this paper, we define a problem to find an effective data placement scheme for spatial crowdsourcing in multi-cloud environment to achieve the cost-effectiveness and minimal latency. We take full account of the interval pricing strategy. Then we analyze the geographical distribution characteristics of data centers through a clustering algorithm, and propose an effective data initialization strategy. Finally, we use a genetic algorithm to further optimize the results. Through experiments on real-world data from cloud providers, the efficiency and effectiveness of our proposed method is verified.

Original languageEnglish (US)
JournalIEEE Transactions on Cloud Computing
DOIs
StateAccepted/In press - 2021

All Science Journal Classification (ASJC) codes

  • Software
  • Information Systems
  • Hardware and Architecture
  • Computer Science Applications
  • Computer Networks and Communications

Keywords

  • Cloud computing
  • Costs
  • Crowdsourcing
  • Data centers
  • Memory
  • Reliability
  • Task analysis
  • data placement
  • density clustering
  • interval pricing
  • multi-cloud
  • spatial crowdsourcing

Fingerprint

Dive into the research topics of 'Cost-Effective and Latency-minimized Data Placement Strategy for Spatial Crowdsourcing in Multi-Cloud Environment'. Together they form a unique fingerprint.

Cite this