Recent Posts

Python Getting Started - Learning Notes.

66 minute read

Python is a clear and powerful object-oriented programming language, comparable to Perl, Ruby, Scheme, or Java. This is basic documentation for getting star...

getmerge Operation not Permitted

1 minute read

getmerge command takes a source directory and a destination file as input and concatenates files in src into the destination local file.

Cloud IAM - GCP Learning Notes.

8 minute read

Google Cloud offers IAM, which lets you give more granular access to specific Google Cloud resources and prevents unwanted access to other resources.

Long Running Jobs in YARN distcp.

1 minute read

Usually distcp and other batch jobs run for a very long time. This is fine if you are running a HADOOP environment without kerberos. When we kerberize a clus...

Checking HDFS health using fsck.

3 minute read

When we have large data sets on the cluster, there will be corruptions of blocks. This could be due to disk or any other.