Abstract
As an increasingly mature business model, crowdsourcing, especially spatial crowdsourcing, has played an important role in data collection, disaster response, urban planning and other fields. However, the rapid growth of user scale and massive data collected inevitably brings serious challenges to computing and storage resources. The emergence of cloud computing provides an opportunity to handle such challenges. Its nearly unlimited resource provision capability can provide reliable services for different crowdsourcing applications. Nevertheless, considering the risks of privacy leakage and vendor lock-in using only a single cloud, as well as the additional restrictions caused by the wide geographical distribution of data and associations among workers, the use of multi-cloud seems to be a better choice. In this paper, we define a problem to find an effective data placement scheme for spatial crowdsourcing in multi-cloud environment to achieve the cost-effectiveness and minimal latency. We take full account of the interval pricing strategy. Then we analyze the geographical distribution characteristics of data centers through a clustering algorithm, and propose an effective data initialization strategy. Finally, we use a genetic algorithm to further optimize the results. Through experiments on real-world data from cloud providers, the efficiency and effectiveness of our proposed method is verified.
Original language | English (US) |
---|---|
Journal | IEEE Transactions on Cloud Computing |
DOIs | |
State | Accepted/In press - 2021 |
All Science Journal Classification (ASJC) codes
- Software
- Information Systems
- Hardware and Architecture
- Computer Science Applications
- Computer Networks and Communications
Keywords
- Cloud computing
- Costs
- Crowdsourcing
- Data centers
- Memory
- Reliability
- Task analysis
- data placement
- density clustering
- interval pricing
- multi-cloud
- spatial crowdsourcing