Recent Posts

Streaming Data Processing - Storm Vs Spark.

6 minute read

Apache Spark is an in-memory distributed data analysis platform– primarily targeted at speeding up batch analysis jobs, iterative machine learning jobs, inte...

KVM Installation on CentOS 6.x.

17 minute read

KVM is a kernel-based Virutal Machine which grows quickly in maturity and popularity in the Linux server market. Red Hat officially dropped Xen in favor of K...

Performance Tuning HBase and Hadoop.

24 minute read

Using HBase in production often requires that you turn many knobs to make it hum as expected. More Here http://hbase.apache.org/0.94/book/performance.html