Comparative Analysis of Relational (Oracle) and Non-Relational (Cassandra) Databases for Business IntelligenceDownload PDF
The need for business intelligence systems (BI) cannot be overemphasised because of the huge data constantly being generated in the daily operations of business organisations and the opportunity provided to discover new insights for the improvement of organisational effectiveness and efficiency from the data. This study attempts to carry out performance related tests on Oracle and Cassandra in order to propose a suitable database for business intelligence. Firstly, the extract, transform and load (ETL) processes was used to move data into Oracle and Cassandra virtual machines. Secondly, SQL and NoSQL queries were run on the data in three iterations to test for performance in selected workloads (Create and load process, read, update, delete and join operations) both before and after query optimisation. To create a common ground for comparison, similar queries were run on similar datasets on both databases. Then the results from the tests were statistically analysed using Microsoft Excel. Experimental results show that the latency values of Oracle are observed to be lower than that of Cassandra, accuracy values of Cassandra are observed to be nearly the same with that of Oracle in the create and load process, while their accuracy values are observed to be slightly different in the remaining tested workload, and the throughput values of Cassandra are observed to be higher than that of Oracle. Also, the extent to which these performance outcomes support data analytics for BI is hereby presented.
Keywords: SQL, NoSQL, Cassandra, Oracle, Business Intelligence and CQL.