Blog Archive for Raqsoft during January 2014

Columnar storage works well, especially when a table has many fields, which is quite common. A query that touches only a few fields then has far less data to traverse than it would with row storage, and less data to traverse means a lighter I/O workload and a faster query. This matters because, without columnar storage, a Hadoop application spends most of its time on hard disk I/O. Both Hive and Impala support columnar storage, but...
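
The I/O argument above can be illustrated with a minimal Python sketch. It is a toy in-memory model, not how Hive or Impala actually lay out their files: the table size, the 8-byte integer fields, and the function names `sum_field_row_store` and `sum_field_col_store` are all assumptions made for the example. It sums one field out of twenty and counts how many bytes each layout has to traverse.

```python
import struct

# Assumed toy dimensions: a table with many fields, as in the text.
NUM_ROWS = 1000
NUM_FIELDS = 20
FIELD_SIZE = 8  # every field is stored as an 8-byte signed integer

# Toy data: the value in row r, field f is r * NUM_FIELDS + f.
rows = [[r * NUM_FIELDS + f for f in range(NUM_FIELDS)] for r in range(NUM_ROWS)]

# Row layout: all fields of one record are stored next to each other.
row_store = b"".join(struct.pack(f"<{NUM_FIELDS}q", *row) for row in rows)

# Column layout: all values of one field are stored next to each other.
col_store = [
    struct.pack(f"<{NUM_ROWS}q", *(row[f] for row in rows))
    for f in range(NUM_FIELDS)
]


def sum_field_row_store(field):
    """Sum one field from the row layout; every whole record is traversed."""
    record_size = NUM_FIELDS * FIELD_SIZE
    total = 0
    bytes_read = 0
    for offset in range(0, len(row_store), record_size):
        record = row_store[offset:offset + record_size]
        bytes_read += len(record)
        total += struct.unpack_from("<q", record, field * FIELD_SIZE)[0]
    return total, bytes_read


def sum_field_col_store(field):
    """Sum one field from the column layout; only that column is traversed."""
    block = col_store[field]
    values = struct.unpack(f"<{NUM_ROWS}q", block)
    return sum(values), len(block)


row_total, row_bytes = sum_field_row_store(3)
col_total, col_bytes = sum_field_col_store(3)
print(row_total == col_total)   # both layouts give the same answer
print(row_bytes, col_bytes)     # bytes traversed by each layout
```

In this sketch the row layout traverses every field of every record to answer a single-field query, while the column layout traverses only the one column it needs, so the gap grows with the number of fields in the table.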