Authors
Christine Niyizamwiyitira and Lars Lundberg, Blekinge Institute of Technology, Sweden
Abstract
In this study, we evaluate the performance of trajectory queries that are handled by Cassandra, MongoDB, and PostgreSQL. The evaluation is conducted on a multiprocessor and a cluster. Telecommunication companies collect a lot of data from their mobile users. These data must be analysed in order to support business decisions, such as infrastructure planning. The optimal choice of hardware platform and database can be different from a query to another. We use data collected from Telenor Sverige, a telecommunication company that operates in Sweden. These data are collected every five minutes for an entire week in a medium sized city. The execution time results show that Cassandra performs much better than MongoDB and PostgreSQL for queries that do not have spatial features. Statio’s Cassandra Lucene index incorporates a geospatial index into Cassandra, thus making Cassandra to perform similarly as MongoDB to handle spatial queries. In four use cases, namely, distance query, k-nearest neigbhor query, range query, and region query, Cassandra performs much better than MongoDB and PostgreSQL for two cases, namely range query and region query. The scalability is also good for these two use cases.
Keywords
Databases evaluation, Trajectory queries, Multiprocessor and cluster, NoSQL database, Cassandra, MongoDB, PostgreSQL.