Cloud Storge Bucket - GCP Learning Notes.
Cloud Storage is Google Cloud’s object storage service, and it allows worldwide storage and retrieval of any amount of data at anytime.
Cloud Storage is Google Cloud’s object storage service, and it allows worldwide storage and retrieval of any amount of data at anytime.
YARN
distcp
.
Usually distcp and other batch jobs run for a very long time. This is fine if you are running a HADOOP environment without kerberos. When we kerberize a clus...
distcp
.
When you see this error default thing to do is the set the HADOOP_CLIENT_OPTS config on your env. In My case did a typo HADDOP_CLIENT_OPTS which caused us sp...
doBulkLoad
- LoadIncrementalHfiles
failed. This is recoverable. Hbase Encryption Zone.
We were working with a team who were uploading data into HBASE using the doBulkLoad, also we had moved the /hbase to encryption zone. The issue started after...
HDFS
health using fsck
.
When we have large data sets on the cluster, there will be corruptions of blocks. This could be due to disk or any other.
HBase Error Could not seek StoreFileScanner
Invalid HFile block magic
We ran into an issue which was causing HBase to fail. When the job was rerun then it would work again. In the logs we got 2 errors.
neo4j_doc_manager
.
We had a requirement where we wanted to have all the data which is in mongodb to be replicated on neo4j to show few graphs. Here is quick way to demonstrate ...
pyyaml
[Python]
PyYAML - The next generation YAML parser and emitter for Python.
What is Kitchen? Kitchen provides a test harness to execute infrastructure code on one or more platforms in isolation.
I was working on getting post headers for my post on this blog. I had couple of images from unsplash. But the header for the post need to be a little more ho...