Securely Manage Secrets with Google Cloud Secret Manager and Python
Managing secrets securely is crucial for any application, especially in cloud environments. Google Cloud Secret Manager provides a centralized and secure way...
Handling secrets like API keys and database passwords is a critical aspect of any CI/CD pipeline. Storing these secrets as plaintext in your code or configur...
Infrastructure as Code (IaC) has become essential for managing cloud resources efficiently and consistently. Google Cloud Platform (GCP) offers several IaC t...
In today’s cloud environments, maintaining consistent security and configuration across a large organization can be a daunting task. Human error and unintent...
Handling secrets like API keys and database passwords is a critical aspect of any CI/CD pipeline. Storing these secrets as plaintext in your code or configur...
Infrastructure as Code (IaC) has become essential for managing cloud resources efficiently and consistently. Google Cloud Platform (GCP) offers several IaC t...
Managing secrets securely is crucial for any application, especially in cloud environments. Google Cloud Secret Manager provides a centralized and secure way...
In today’s complex cloud environments, gaining a holistic view of your application’s health is crucial for ensuring performance, reliability, and cost-effect...
In today’s cloud environments, maintaining consistent security and configuration across a large organization can be a daunting task. Human error and unintent...
Handling secrets like API keys and database passwords is a critical aspect of any CI/CD pipeline. Storing these secrets as plaintext in your code or configur...
In today’s complex cloud environments, gaining a holistic view of your application’s health is crucial for ensuring performance, reliability, and cost-effect...
In today’s cloud environments, maintaining consistent security and configuration across a large organization can be a daunting task. Human error and unintent...
Apache Kafka is a distributed streaming platform. Kafka 2.0 supports Kerberos authentication, Enabling Kerberos Authentication Using the Wizard on cloudera m...
Infrastructure as Code (IaC) has become essential for managing cloud resources efficiently and consistently. Google Cloud Platform (GCP) offers several IaC t...
Managing secrets securely is crucial for any application, especially in cloud environments. Google Cloud Secret Manager provides a centralized and secure way...
This comprehensive guide explains how to assign multiple IP addresses to a single network interface in Linux. This technique, often called IP aliasing or cre...
Infrastructure as Code (IaC) has become essential for managing cloud resources efficiently and consistently. Google Cloud Platform (GCP) offers several IaC t...
This post outlines how to send JSON data from a client (browser or curl) through a NodeJS server and ultimately into a Kafka topic. This architecture is usef...
Not Enough Physical Memory Error in VMware Workstation 10 on Windows 8.1
Are you encountering the frustrating “Not enough physical memory is available to power this virtual machine” error in VMware Workstation 10 after a Windows u...
In today’s complex cloud environments, gaining a holistic view of your application’s health is crucial for ensuring performance, reliability, and cost-effect...
YARN - A Detailed Guide for Ubuntu 12.04 LTS
HBase is a powerful open-source, non-relational, distributed database designed for handling massive datasets. Inspired by Google’s BigTable, HBase is written...
In today’s complex cloud environments, gaining a holistic view of your application’s health is crucial for ensuring performance, reliability, and cost-effect...
In today’s cloud environments, maintaining consistent security and configuration across a large organization can be a daunting task. Human error and unintent...
In today’s complex cloud environments, gaining a holistic view of your application’s health is crucial for ensuring performance, reliability, and cost-effect...
Infrastructure as Code (IaC) has become essential for managing cloud resources efficiently and consistently. Google Cloud Platform (GCP) offers several IaC t...
Managing secrets securely is crucial for any application, especially in cloud environments. Google Cloud Secret Manager provides a centralized and secure way...
Handling secrets like API keys and database passwords is a critical aspect of any CI/CD pipeline. Storing these secrets as plaintext in your code or configur...
Managing secrets securely is crucial for any application, especially in cloud environments. Google Cloud Secret Manager provides a centralized and secure way...
Handling secrets like API keys and database passwords is a critical aspect of any CI/CD pipeline. Storing these secrets as plaintext in your code or configur...
Managing secrets securely is crucial for any application, especially in cloud environments. Google Cloud Secret Manager provides a centralized and secure way...
In today’s cloud environments, maintaining consistent security and configuration across a large organization can be a daunting task. Human error and unintent...
Infrastructure as Code (IaC) has become essential for managing cloud resources efficiently and consistently. Google Cloud Platform (GCP) offers several IaC t...
In today’s complex cloud environments, gaining a holistic view of your application’s health is crucial for ensuring performance, reliability, and cost-effect...
Infrastructure as Code (IaC) has become essential for managing cloud resources efficiently and consistently. Google Cloud Platform (GCP) offers several IaC t...
Apache HBase is a powerful NoSQL database, but out-of-the-box, it’s a bit like a house with no locks on the doors. Any user who can connect to the cluster ca...
If using access_provider = ldap, this option is mandatory. It specifies an LDAP search filter criteria that must be met for the user to be granted access on ...
There are inherent structural differences between how Windows and Linux handle system users. The user schemas used in Active Directory and standard LDAPv3 di...
Apache Hive is a powerful data warehousing application built on top of Hadoop; it enables you to access your data using Hive QL, a language that is similar t...
There are inherent structural differences between how Windows and Linux handle system users. The user schemas used in Active Directory and standard LDAPv3 di...
Terraform
Cloud Functions
Cloud Monitoring
.
This blog post demonstrates how to automate Firestore health checks using Terraform to deploy a Cloud Function (Gen 1) and set up a complete monitoring and a...
Application logs are a treasure trove of information. Beyond debugging and troubleshooting, they can provide valuable insights into your application’s perfor...
Are you ready to dive into the world of automation? Ansible is a powerful tool that allows you to manage and configure systems with ease. This blog post will...
Using Ansible within the initialization script of a Google Compute Engine (GCE) instance can streamline and automate the configuration and setup of your virt...
ansible on RHEL 6.6.
Ansible is a radically simple IT automation engine that automates cloud provisioning, configuration management, application deployment, intra-service orchest...
tarball
.
This is a simple ZooKeeper playbook to quickly start ZooKeeper on one or more nodes in clustered mode.
This is a simple Storm Cluster Setup. We are using a dedicated Zookeeper Cluster/Node, instead of the standalone zkserver. Below is how we will deploy our c...
This is a simple Kafka setup. In this setup we are running Kafka over a dedicated ZooKeeper service (not the standalone ZooKeeper that comes with Kafka).
tarball
.
This post sets up Hadoop using Ansible; we will be using the CDH5 tarball to install the cluster.
Are you ready to dive into the world of automation? Ansible is a powerful tool that allows you to manage and configure systems with ease. This blog post will...
This guide demonstrates how to build a Node.js application that receives JSON data via HTTP and routes it to different Apache Kafka topics based on the URL p...
Unable to locate package oracle-java7-installer on Ubuntu
I recently ran into a frustrating issue while trying to install Java 7 on an older Ubuntu system (Ubuntu 13, in this case). The seemingly simple command sud...
The System Security Services Daemon (SSSD) provides a set of daemons to manage access to remote directories and authentication mechanisms. It provides Name S...
Apache HBase is a powerful NoSQL database, but out-of-the-box, it’s a bit like a house with no locks on the doors. Any user who can connect to the cluster ca...
Are you ready to dive into the world of automation? Ansible is a powerful tool that allows you to manage and configure systems with ease. This blog post will...
Infrastructure as Code (IaC) has revolutionized how we manage and provision infrastructure. Tools like Terraform allow us to define our infrastructure in cod...
This script provides a simple way to create users on CentOS/RHEL 6.5 systems from the command line. It allows you to create both normal and sudo (administrat...
postgres and mysql.
After the Zabbix setup is done, we would like to take a backup of the database. Here we are doing a complete backup. But if we are looking for only configurati...
This script provides a simple way to create users on CentOS/RHEL 6.5 systems from the command line. It allows you to create both normal and sudo (administrat...
Kernel 2.6, x86_64 - A Comprehensive Guide
Siege is a powerful, open-source HTTP load testing and benchmarking utility. It’s designed to help web developers and system administrators measure the perfo...
The SpagoBI project is a free software/open source initiative by the SpagoBI Labs of Engineering Group. It aims to realize the most complete 100% open source...
mod_jk and httpd on CentOS - In-Memory Session Replication - A Deep Dive
In a previous post, we explored a basic session-based replication setup for SpagoBI. However, that configuration lacked true session redundancy.
This post details how to set up a Tomcat cluster for SpagoBI 5.1, building upon a previous guide for installing SpagoBI with MySQL (post). This configuration...
YARN - A Detailed Guide for Ubuntu 12.04 LTS
HBase is a powerful open-source, non-relational, distributed database designed for handling massive datasets. Inspired by Google’s BigTable, HBase is written...
BigQuery, Google’s fully-managed and serverless data warehouse, empowers organizations to analyze massive datasets with remarkable speed and efficiency. But ...
Google BigQuery is a powerful and cost-effective data warehouse that enables you to analyze massive datasets quickly. However, to get the most out of BigQuer...
Kernel-based Virtual Machine (KVM) has become a cornerstone of Linux server virtualization, known for its maturity and performance. Red Hat’s decision to fav...
The Apache Cassandra database is the right choice when you need scalability and high availability without compromising performance. Linear scalability and pr...
This cookbook can currently be used to set up a Cloudera Manager Server (Management Server) running on a MySQL/Postgres database. But the intended u...
Apache Kafka is a distributed streaming platform. Kafka 2.0 supports Kerberos authentication, Enabling Kerberos Authentication Using the Wizard on cloudera m...
A deployment error occurs when setting up ZooKeeper on a new cluster using the Cloudera Manager UI, when an earlier cluster containing ZooKeeper was built using Cl...
We were deploying a cluster in our lab environment, which is used by everyone, so the lab has its own share of stale information.
We had a cluster environment in which we needed to update the /etc/hosts file, which would help the servers communicate over a private network. Our serve...
Data Bags are a way to store information on the Chef server that all the cookbooks can access. A few additional advantages are that we can encrypt the da...
This is a quick reference for a few of the commands that I use often.
GNU find searches the directory tree rooted at each given file name by evaluating the given expression from left to right, according to the rules of preceden...
While working on a Nagios monitoring system, we were migrating from a 32-bit Nagios to a 64-bit architecture. Most of the graphs are not working as the R...
Migrating an old Nagios backup to a new Nagios XI server. Migration is simple in Nagios XI: you take a backup and restore it. Once that is done we need to update/a...
HTTPS is a protocol for secure communication over a computer network which is widely used on the Internet. HTTPS consists of communication over Hypertext Tra...
The standalone installation of Chef server creates a working installation on a single server. This installation is also useful when you are installing Chef s...
Pandoc is an open-source utility to create documents from Markdown. We can create PDF, Doc, doc, HTML, and other formats. It can also be used to convert html ...
Nagios monitors your entire IT infrastructure to ensure systems, applications, services, and business processes are functioning properly. In the event of a f...
RPM (Red Hat Package Manager) is the most popular package utility and is used mostly on RHEL, CentOS, and Fedora. RPM helps users/admins to build, install, que...
While working on a monitoring project, we needed to create a large file to test notifications. Here is how we can do that.
This cookbook can currently be used to set up a Cloudera Manager Server (Management Server) running on a MySQL/Postgres database. But the intended u...
Checking whether a port is available on a remote machine using the nc command instead of telnet. The same command can be used to check on a remote server as well; change the 127....
Linux Unified Key Setup-on-disk-format (or LUKS) allows you to encrypt partitions on your Linux computer. This is particularly important when it comes to mo...
How to move an existing data directory in MySQL to a new location. We were running out of space and had to move the existing data directory to a new drive. B...
We were running out of disk space on one of the database servers, so we needed to get information on the current table/database usage. Below are a few comm...
python 2.7.x on CentOS 6.5/6.6
By default, CentOS comes with Python 2.6. In most cases we might need Python 2.7 or later to be installed. Below are a few ways to install Python 2.7 on ...
postgres and mysql.
After the Zabbix setup is done, we would like to take a backup of the database. Here we are doing a complete backup. But if we are looking for only configurati...
There might be situations where you have a remote/zabbix server which does not have internet connectivity, due to security or other reasons. So we create a ...
In Zabbix we don't have a better way to capture SNMP traps. We have to manually create an item and a corresponding trigger to handle a trap arriving from the de...
zabbix-extra on Zabbix Version 2.4.
SNMP Builder/Extra is an add-in for Zabbix. It provides new web interface components to browse MIB trees and values. SNMP OIDs can then be automatically conv...
Receiving SNMP traps is the opposite of querying SNMP-enabled devices. In this case the information is sent from an SNMP-enabled device and is collected or “t...
Setting up yum repos on RHEL using httpd. We will set up httpd and a yum repo on top of it, so that we can access yum over HTTP.
If using access_provider = ldap, this option is mandatory. It specifies an LDAP search filter criteria that must be met for the user to be granted access on ...
There are inherent structural differences between how Windows and Linux handle system users. The user schemas used in Active Directory and standard LDAPv3 di...
ansible on RHEL 6.6.
Ansible is a radically simple IT automation engine that automates cloud provisioning, configuration management, application deployment, intra-service orchest...
You can log in to a Linux system without entering a password using the steps below. We will be creating an SSH key which will be shared between the servers, which will ...
The Apache Cassandra database is the right choice when you need scalability and high availability without compromising performance. Linear scalability and pr...
The SpagoBI project is a free software/open source initiative by the SpagoBI Labs of Engineering Group. It aims to realize the most complete 100% open source...
Kernel-based Virtual Machine (KVM) has become a cornerstone of Linux server virtualization, known for its maturity and performance. Red Hat’s decision to fav...
zabbix-java-gateway on CentOS 6.5
Zabbix 2.0 introduced native support for monitoring JMX applications through the Zabbix Java gateway. This Java-based daemon acts as an intermediary, allowin...
HAProxy is a robust, high-performance TCP/HTTP load balancer, perfectly suited for ensuring high availability and efficient traffic distribution across your ...
Node.js is an open-source, cross-platform JavaScript runtime environment that executes JavaScript code server-side. It’s incredibly popular for building scal...
This script provides a simple way to create users on CentOS/RHEL 6.5 systems from the command line. It allows you to create both normal and sudo (administrat...
This guide walks you through configuring a High Availability (HA) Apache web server using Heartbeat on CentOS/RHEL 6. HA ensures your web server remains acce...
Zabbix is the ultimate enterprise-level software designed for monitoring availability and performance of IT infrastructure components. Zabbix is open source ...
What is Kitchen? Kitchen provides a test harness to execute infrastructure code on one or more platforms in isolation.
We had a cluster environment in which we needed to update the /etc/hosts file, which would help the servers communicate over a private network. Our serve...
Kitchen-Vagrant has the capability to spin up a Windows instance for testing. To make it work you will need the vagrant-winrm to be installed on the worksta...
Data Bags are a way to store information on the Chef server that all the cookbooks can access. A few additional advantages are that we can encrypt the da...
knife-windows plugin adds additional functionality to the Chef Knife CLI tool for configuring / interacting with nodes running Microsoft Windows.
The standalone installation of Chef server creates a working installation on a single server. This installation is also useful when you are installing Chef s...
This cookbook can currently be used to set up a Cloudera Manager Server (Management Server) running on a MySQL/Postgres database. But the intended u...
The Chef Development Kit (ChefDK) brings the best-of-breed development tools built by the awesome Chef community to your workstation with just a few clicks. ...
Cloud VPN securely connects your peer network to your Virtual Private Cloud (VPC) network through an IPsec VPN connection.
Stackdriver provides monitoring, logging, error reporting, tracing, and debugging services. It is now known as Google Cloud’s operations suite.
Cloud SQL is a fully managed service that supports MySQL, PostgreSQL, and Microsoft SQL Server. For horizontal scalability, consider using Cloud Spanner.
Google Cloud offers IAM, which lets you give more granular access to specific Google Cloud resources and prevents unwanted access to other resources.
Cloud Storage is Google Cloud’s object storage service, and it allows worldwide storage and retrieval of any amount of data at any time.
Ensuring the health and availability of your Firestore database is crucial for maintaining a reliable application. This post provides a Python script that se...
Application logs are a treasure trove of information. Beyond debugging and troubleshooting, they can provide valuable insights into your application’s perfor...
YARN
distcp
.
Usually distcp and other batch jobs run for a very long time. This is fine if you are running a Hadoop environment without Kerberos. When we kerberize a clus...
doBulkLoad - LoadIncrementalHfiles failed. This is recoverable. HBase Encryption Zone.
We were working with a team that was uploading data into HBase using doBulkLoad, and we had also moved /hbase to an encryption zone. The issue started after...
HDFS health using fsck.
When we have large data sets on the cluster, some blocks will become corrupted. This could be due to disk failures or other causes.
HBase Error Could not seek StoreFileScanner
Invalid HFile block magic
We ran into an issue that was causing HBase to fail. When the job was rerun, it would work again. In the logs we got two errors.
Apache Kafka is a distributed streaming platform. Kafka 2.0 supports Kerberos authentication, Enabling Kerberos Authentication Using the Wizard on cloudera m...
A deployment error occurs when setting up ZooKeeper on a new cluster using the Cloudera Manager UI, when an earlier cluster containing ZooKeeper was built using Cl...
We were deploying a cluster in our lab environment, which is used by everyone, so the lab has its own share of stale information.
The Python API for Cloudera is really nice: apart from getting the cluster set up, we can also do configuration and automation. We use a lot of automation using C...
This is the second follow-up post. In the earlier post, Setting Up Cloudera Manager Services Using Cloudera API [Part 1], we installed the Cloudera management se...
The Cloudera API is a very convenient way to set up a cluster and do more.
These are the basic steps to get connected with Cloudera Manager.
This is a set of tests that we can run on a Hadoop environment. These are basic tests to make sure the environment is set up correctly.
0.0.0.0
[Cloudera]
We were working on setting up a cluster, but the Hue URL was set to a private IP of the server. As we had set up all the nodes to access each other using a pr...
Upgrading Cloudera Manager 5 to the latest Cloudera Manager: in most cases it is possible to complete the following upgrade without shutting down most CDH se...
Apache Hive is a powerful data warehousing application built on top of Hadoop; it enables you to access your data using Hive QL, a language that is similar t...
Pentaho Data Integration (PDI) is a suite of open source Business Intelligence (BI) products which provide data integration, OLAP services, reporting, dashb...
This cookbook can currently be used to set up a Cloudera Manager Server (Management Server) running on a MySQL/Postgres database. But the intended u...
Terraform
Cloud Functions
Cloud Monitoring
.
This blog post demonstrates how to automate Firestore health checks using Terraform to deploy a Cloud Function (Gen 1) and set up a complete monitoring and a...
Apache Kafka is a distributed streaming platform. Kafka 2.0 supports Kerberos authentication, Enabling Kerberos Authentication Using the Wizard on cloudera m...
A deployment error occurs when setting up ZooKeeper on a new cluster using the Cloudera Manager UI, when an earlier cluster containing ZooKeeper was built using Cl...
We were deploying a cluster in our lab environment, which is used by everyone, so the lab has its own share of stale information.
We had a cluster environment in which we needed to update the /etc/hosts file, which would help the servers communicate over a private network. Our serve...
Nginx with Heartbeat - A Step-by-Step Guide
In today’s digital landscape, ensuring high availability for your web applications is paramount. Downtime can lead to lost revenue, damaged reputation, and ...
mod_jk and httpd on CentOS - In-Memory Session Replication - A Deep Dive
In a previous post, we explored a basic session-based replication setup for SpagoBI. However, that configuration lacked true session redundancy.
CMAN v3 is a Corosync plugin that monitors the names and number of active cluster nodes in order to deliver membership and quorum information to clients (such...
GNU find searches the directory tree rooted at each given file name by evaluating the given expression from left to right, according to the rules of preceden...
Swappiness is a crucial Linux kernel parameter that dictates how aggressively the system uses swap space. Understanding and configuring swappiness can signif...
npm Behind a Corporate Proxy - A Comprehensive Guide
npm, short for Node Package Manager, is the cornerstone of JavaScript development within the Node.js ecosystem. It simplifies the process of managing projec...
Apache HBase is a powerful NoSQL database, but out-of-the-box, it’s a bit like a house with no locks on the doors. Any user who can connect to the cluster ca...
CouchDB is a database that completely embraces the web. Store your data with JSON documents. Access your documents and query your indexes with your web brows...
This script provides a simple way to create users on CentOS/RHEL 6.5 systems from the command line. It allows you to create both normal and sudo (administrat...
This script can be used to monitor NameNode parameters. It can generate a Zabbix import XML or send monitoring data to the Zabbix server.
This section describes how to get monitoring data from Zabbix server through the command line and export these to a CSV formatted file.
This script creates an XML import file from a CSV file. We use two CSV files for this.
This module is used to generate Zabbix import XML for SNMP traps. Below are the steps to create a Zabbix import for traps. These traps are captured from the snmptrap fun...
After a long time I was on my Windows machine and had to make it feel more like my Linux machine. So I installed what everyone else does: Cygwin. Surpise...
Application logs are a treasure trove of information. Beyond debugging and troubleshooting, they can provide valuable insights into your application’s perfor...
An easy way to collect files recursively from an SFTP server is to connect to the server over scp and run scp -r. The problem was that the device we were connecting ...
Pentaho Data Integration (PDI) is a suite of open source Business Intelligence (BI) products which provide data integration, OLAP services, reporting, dashb...
Apache Spark is an in-memory distributed data analysis platform– primarily targeted at speeding up batch analysis jobs, iterative machine learning jobs, inte...
TL;DR: In this post we will set up a scheduled job to back up a Bigtable table in Avro format.
Kerberos authentication in Google Dataproc clusters offers enhanced security and user isolation. It’s a crucial step for multi-tenancy, encryption, and user ...
Welcome back to our series on datastream workflows! In our previous article, we explored the basics of setting up a datastream workflow.
This guide provides detailed instructions on setting up a datastream connection from Cloud SQL (MySQL) to BigQuery. We aim to simplify the process by using d...
Are you ready to dive into the world of automation? Ansible is a powerful tool that allows you to manage and configure systems with ease. This blog post will...
Linux Unified Key Setup-on-disk-format (or LUKS) allows you to encrypt partitions on your Linux computer. This is particularly important when it comes to mo...
YARN
distcp
.
Usually distcp and other batch jobs run for a very long time. This is fine if you are running a Hadoop environment without Kerberos. When we kerberize a clus...
distcp
.
When you see this error, the default thing to do is to set the HADOOP_CLIENT_OPTS config in your env. In my case, I made a typo, HADDOP_CLIENT_OPTS, which caused us sp...
I was setting up Jekyll using Docker, but was not able to install Ruby on the Docker machine as apt-add-repository was not working. Below is my Dockerfile when...
Pandoc is an open-source utility to create documents from Markdown. We can create PDF, Doc, doc, HTML, and other formats. It can also be used to convert html ...
Changing the timezone on your Ubuntu server is a crucial task for ensuring accurate timekeeping, especially when dealing with scheduled tasks, logs, and appl...
Linux Unified Key Setup-on-disk-format (or LUKS) allows you to encrypt partitions on your Linux computer. This is particularly important when it comes to mo...
We were working on Nagios and found that after our migration, the service group summary was not working. You might get the error below on the screen and the solution...
We have two domain forests in our environment, ABC and XYZ. We were not able to authenticate normal users from either of the domains. Most of the information ...
Not Enough Physical Memory Error in VMware Workstation 10 on Windows 8.1
Are you encountering the frustrating “Not enough physical memory is available to power this virtual machine” error in VMware Workstation 10 after a Windows u...
null_resource with local-exec and remote-exec in Terraform
Terraform, a widely adopted Infrastructure as Code (IaC) tool, offers a rich array of resources and provisioners for efficiently managing and configuring inf...
In this blog post, we will venture into exporting data from Firestore and Datastore modes. Find the code on GitHub.
Nginx with Heartbeat - A Step-by-Step Guide
In today’s digital landscape, ensuring high availability for your web applications is paramount. Downtime can lead to lost revenue, damaged reputation, and ...
Terraform
Cloud Functions
Cloud Monitoring
.
This blog post demonstrates how to automate Firestore health checks using Terraform to deploy a Cloud Function (Gen 1) and set up a complete monitoring and a...
Ensuring the health and availability of your Firestore database is crucial for maintaining a reliable application. This post provides a Python script that se...
An easy way to collect files recursively from an SFTP server is to connect to the server over scp and run scp -r. The problem was that the device we were connecting ...
sysctl.conf for Web and FTP Servers
The sysctl interface provides a powerful way to modify the behavior of a running Linux kernel. By editing the /etc/sysctl.conf file, you can persistently co...
Ensuring consistency and streamlining infrastructure provisioning is crucial for effective cloud management. Creating custom virtual machine (VM) images on G...
Using Ansible within the initialization script of a Google Compute Engine (GCE) instance can streamline and automate the configuration and setup of your virt...
Infrastructure as Code (IaC) has revolutionized how we manage and provision infrastructure. Tools like Terraform allow us to define our infrastructure in cod...
Terraform
Cloud Functions
Cloud Monitoring
.
This blog post demonstrates how to automate Firestore health checks using Terraform to deploy a Cloud Function (Gen 1) and set up a complete monitoring and a...
Cloud VPN securely connects your peer network to your Virtual Private Cloud (VPC) network through an IPsec VPN connection.
Stackdriver provides monitoring, logging, error reporting, tracing, and debugging services. It is now known as Google Cloud’s operations suite.
Cloud SQL is a fully managed service that supports MySQL, PostgreSQL, and Microsoft SQL Server. For horizontal scalability, consider using Cloud Spanner.
Google Cloud offers IAM, which lets you give more granular access to specific Google Cloud resources and prevents unwanted access to other resources.
Cloud Storage is Google Cloud’s object storage service, and it allows worldwide storage and retrieval of any amount of data at any time.
Security is paramount when setting up resources in the cloud or on-premises. It encompasses various layers of protection, including network security, encrypt...
null_resource with local-exec and remote-exec in Terraform
Terraform, a widely adopted Infrastructure as Code (IaC) tool, offers a rich array of resources and provisioners for efficiently managing and configuring inf...
The getmerge command takes a source directory and a destination file as input and concatenates the files in src into the destination local file.
An easy way to collect files recursively from an SFTP server is to connect to the server over scp and run scp -r. The problem was that the device we were connecting ...
Infrastructure as Code (IaC) has revolutionized how we manage and provision infrastructure. Tools like Terraform allow us to define our infrastructure in cod...
Ensuring the health and availability of your Firestore database is crucial for maintaining a reliable application. This post provides a Python script that se...
parted
.
GUID Partition Table (GPT) is a standard for the layout of the partition table on a physical hard disk, using globally unique identifiers (GUID). Although it...
CMAN v3 is a Corosync plugin that monitors the names and number of active cluster nodes in order to deliver membership and quorum information to clients (such...
HAProxy is a robust, high-performance TCP/HTTP load balancer, perfectly suited for ensuring high availability and efficient traffic distribution across your ...
The getmerge command takes a source directory and a destination file as input and concatenates the files in src into the destination local file.
YARN
distcp
.
Usually distcp and other batch jobs run for a very long time. This is fine if you are running a Hadoop environment without Kerberos. When we kerberize a clus...
distcp
.
When you see this error, the default thing to do is to set the HADOOP_CLIENT_OPTS config in your environment. In my case, I made a typo, HADDOP_CLIENT_OPTS, which caused us sp...
doBulkLoad - LoadIncrementalHfiles failed. This is recoverable. Hbase Encryption Zone.
We were working with a team who were uploading data into HBASE using the doBulkLoad, also we had moved the /hbase to encryption zone. The issue started after...
HDFS health using fsck.
When we have large data sets on the cluster, some blocks will become corrupted. This could be due to disk problems or other causes.
HBase Error Could not seek StoreFileScanner
Invalid HFile block magic
We ran into an issue which was causing HBase to fail. When the job was rerun, it would work again. In the logs we found two errors.
We were deploying one of the clusters in our lab environment, which is used by everyone, so the lab has its own share of stale information.
We had a cluster environment in which we needed to update the /etc/hosts file, which would help the servers communicate over a private network. Our serve...
The Python API for Cloudera is really nice: apart from getting the cluster set up, we can also do configuration and automation. We use a lot of automation using C...
This is the second follow-up post. In the earlier post
This is the second follow-up post. In the earlier post, Setting Up Cloudera Manager Services Using Cloudera API [Part 1], we installed the Cloudera management se...
The Cloudera API is a very convenient way to set up a cluster and do more.
These are the basic steps to connect to Cloudera Manager.
This is a set of tests that we can run on a Hadoop environment. They are basic tests to make sure the environment is set up correctly.
0.0.0.0
[Cloudera]
We were working on setting up a cluster, but the Hue URL was set to a private IP of the server. As we had set up all the nodes to access each other using a pr...
Setting up yum repos on RHEL using httpd. We will set up httpd and a yum repo on top of it so that we can access yum over HTTP.
If using access_provider = ldap, this option is mandatory. It specifies an LDAP search filter criteria that must be met for the user to be granted access on ...
Upgrading Cloudera Manager 5 to the latest Cloudera Manager. In most cases it is possible to complete the following upgrade without shutting down most CDH se...
Apache Hive is a powerful data warehousing application built on top of Hadoop; it enables you to access your data using Hive QL, a language that is similar t...
There are inherent structural differences between how Windows and Linux handle system users. The user schemas used in Active Directory and standard LDAPv3 di...
There are inherent structural differences between how Windows and Linux handle system users. The user schemas used in Active Directory and standard LDAPv3 di...
Pentaho Data Integration (PDI) is a suite of open source Business Intelligence (BI) products which provide data integration, OLAP services, reporting, dashb...
We have two domain forests in our environment, ABC and XYZ. We were not able to authenticate normal users from either of the domains. Most of the information ...
eth1 to eth0 on EDGE node.
Adding a route so that all the slaves residing on a private network can communicate directly with an external server through an EDGE node, using interface forwarding.
ansible on RHEL 6.6.
Ansible is a radically simple IT automation engine that automates cloud provisioning, configuration management, application deployment, intra-service orchest...
tarball
.
This is a simple ZooKeeper playbook to quickly start ZooKeeper running on one or more nodes in clustered mode.
This is a simple Storm Cluster Setup. We are using a dedicated Zookeeper Cluster/Node, instead of the standalone zkserver. Below is how we will deploy our c...
This is a simple Kafka setup. In this setup we are running Kafka over a dedicated ZooKeeper service (not the standalone ZooKeeper that comes with Kafka).
tarball
.
Setting up Hadoop using Ansible. We will use the CDH5 tarball to install the cluster.
The Apache Cassandra database is the right choice when you need scalability and high availability without compromising performance. Linear scalability and pr...
Getting HBase to perform well in a production environment is an iterative process that involves tweaking various configuration parameters. This post dives de...
YARN - A Detailed Guide for Ubuntu 12.04 LTS
HBase is a powerful open-source, non-relational, distributed database designed for handling massive datasets. Inspired by Google’s BigTable, HBase is written...
Apache Kafka is often described as a distributed, fault-tolerant, high-throughput publish-subscribe messaging system. Think of it as a central nervous system...
sysctl.conf
This post delves into optimizing Hadoop performance at the kernel level using sysctl. The sysctl interface provides a way to dynamically modify a running Lin...
Swappiness is a crucial Linux kernel parameter that dictates how aggressively the system uses swap space. Understanding and configuring swappiness can signif...
First, let's set up the HBase configuration files. For pseudo-distributed mode, replace with 'localhost'
Apache HBase is a powerful NoSQL database, but out-of-the-box, it’s a bit like a house with no locks on the doors. Any user who can connect to the cluster ca...
This script can be used to monitor NameNode parameters. It can generate a Zabbix import XML or send monitoring data to a Zabbix server.
First, let's set up the HBase configuration files. For pseudo-distributed mode, replace with 'localhost'
CMAN v3 is a Corosync plugin that monitors the names and number of active cluster nodes in order to deliver membership and quorum information to clients (such...
HAProxy is a robust, high-performance TCP/HTTP load balancer, perfectly suited for ensuring high availability and efficient traffic distribution across your ...
We will walk you through the process of setting up HashiCorp Vault to effectively manage your secrets. This guide covers getting started with Vault with terra...
doBulkLoad - LoadIncrementalHfiles failed. This is recoverable. Hbase Encryption Zone.
We were working with a team who were uploading data into HBASE using the doBulkLoad, also we had moved the /hbase to encryption zone. The issue started after...
HBase Error Could not seek StoreFileScanner
Invalid HFile block magic
We ran into an issue which was causing HBase to fail. When the job was rerun, it would work again. In the logs we found two errors.
Getting HBase to perform well in a production environment is an iterative process that involves tweaking various configuration parameters. This post dives de...
YARN - A Detailed Guide for Ubuntu 12.04 LTS
HBase is a powerful open-source, non-relational, distributed database designed for handling massive datasets. Inspired by Google’s BigTable, HBase is written...
First, let's set up the HBase configuration files. For pseudo-distributed mode, replace with 'localhost'
Apache HBase is a powerful NoSQL database, but out-of-the-box, it’s a bit like a house with no locks on the doors. Any user who can connect to the cluster ca...
First, let's set up the HBase configuration files. For pseudo-distributed mode, replace with 'localhost'
The getmerge command takes a source directory and a destination file as input and concatenates the files in src into the destination local file.
HDFS health using fsck.
When we have large data sets on the cluster, some blocks will become corrupted. This could be due to disk problems or other causes.
HBase Error Could not seek StoreFileScanner
Invalid HFile block magic
We ran into an issue which was causing HBase to fail. When the job was rerun, it would work again. In the logs we found two errors.
This is the second follow-up post. In the earlier post
First, let's set up the HBase configuration files. For pseudo-distributed mode, replace with 'localhost'
Ensuring the health and availability of your Firestore database is crucial for maintaining a reliable application. This post provides a Python script that se...
CMAN v3 is a Corosync plugin that monitors the names and number of active cluster nodes in order to deliver membership and quorum information to clients (such...
Nginx with Heartbeat - A Step-by-Step Guide
In today’s digital landscape, ensuring high availability for your web applications is paramount. Downtime can lead to lost revenue, damaged reputation, and ...
This guide walks you through configuring a High Availability (HA) Apache web server using Heartbeat on CentOS/RHEL 6. HA ensures your web server remains acce...
CMAN v3 is a Corosync plugin that monitors the names and number of active cluster nodes in order to deliver membership and quorum information to clients (such...
HAProxy is a robust, high-performance TCP/HTTP load balancer, perfectly suited for ensuring high availability and efficient traffic distribution across your ...
Nginx with Heartbeat - A Step-by-Step Guide
In today’s digital landscape, ensuring high availability for your web applications is paramount. Downtime can lead to lost revenue, damaged reputation, and ...
mod_jk and httpd on CentOS - In-Memory Session Replication - A Deep Dive
In a previous post, we explored a basic session-based replication setup for SpagoBI. However, that configuration lacked true session redundancy.
NGINX is a powerful and versatile open-source software that excels as an HTTP server, reverse proxy, and IMAP/POP3 proxy server. Renowned for its high perfo...
This guide walks you through configuring a High Availability (HA) Apache web server using Heartbeat on CentOS/RHEL 6. HA ensures your web server remains acce...
Apache Hive is a powerful data warehousing application built on top of Hadoop; it enables you to access your data using Hive QL, a language that is similar t...
Setting up yum repos on RHEL using httpd. We will set up httpd and a yum repo on top of it so that we can access yum over HTTP.
Kernel 2.6, x86_64 - A Comprehensive Guide
Siege is a powerful, open-source HTTP load testing and benchmarking utility. It’s designed to help web developers and system administrators measure the perfo...
Nginx with Heartbeat - A Step-by-Step Guide
In today’s digital landscape, ensuring high availability for your web applications is paramount. Downtime can lead to lost revenue, damaged reputation, and ...
mod_jk and httpd on CentOS - In-Memory Session Replication - A Deep Dive
In a previous post, we explored a basic session-based replication setup for SpagoBI. However, that configuration lacked true session redundancy.
This post details how to set up a Tomcat cluster for SpagoBI 5.1, building upon a previous guide for installing SpagoBI with MySQL (post). This configuration...
NGINX is a powerful and versatile open-source software that excels as an HTTP server, reverse proxy, and IMAP/POP3 proxy server. Renowned for its high perfo...
This guide walks you through configuring a High Availability (HA) Apache web server using Heartbeat on CentOS/RHEL 6. HA ensures your web server remains acce...
Setting up yum repos on RHEL using httpd. We will set up httpd and a yum repo on top of it so that we can access yum over HTTP.
Nginx with Heartbeat - A Step-by-Step Guide
In today’s digital landscape, ensuring high availability for your web applications is paramount. Downtime can lead to lost revenue, damaged reputation, and ...
mod_jk and httpd on CentOS - In-Memory Session Replication - A Deep Dive
In a previous post, we explored a basic session-based replication setup for SpagoBI. However, that configuration lacked true session redundancy.
NGINX is a powerful and versatile open-source software that excels as an HTTP server, reverse proxy, and IMAP/POP3 proxy server. Renowned for its high perfo...
This guide walks you through configuring a High Availability (HA) Apache web server using Heartbeat on CentOS/RHEL 6. HA ensures your web server remains acce...
HTTPS is a protocol for secure communication over a computer network which is widely used on the Internet. HTTPS consists of communication over Hypertext Tra...
Kernel 2.6, x86_64 - A Comprehensive Guide
Siege is a powerful, open-source HTTP load testing and benchmarking utility. It’s designed to help web developers and system administrators measure the perfo...
Infrastructure as Code (IaC) has revolutionized how we manage and provision infrastructure. Tools like Terraform allow us to define our infrastructure in cod...
Security is paramount when setting up resources in the cloud or on-premises. It encompasses various layers of protection, including network security, encrypt...
Google Cloud offers IAM, which lets you give more granular access to specific Google Cloud resources and prevents unwanted access to other resources.
This comprehensive guide explains how to assign multiple IP addresses to a single network interface in Linux. This technique, often called IP aliasing or cre...
I was working on getting post headers for my post on this blog. I had a couple of images from Unsplash, but the header for the post needed to be a little more ho...
python 2.7.x on Centos 6.5/6.6
By default, CentOS comes with Python 2.6. In most cases we might need Python 2.7 or later. Below are a few ways to install Python 2.7 on ...
The Java Development Kit (JDK) is an essential tool for Java developers, providing the necessary components to compile, debug, and run Java applications. Whi...
Node.js is an open-source, cross-platform JavaScript runtime environment that executes JavaScript code server-side. It’s incredibly popular for building scal...
Apache Kafka is often described as a distributed, fault-tolerant, high-throughput publish-subscribe messaging system. Think of it as a central nervous system...
This blog post provides a comprehensive, step-by-step guide to integrating NodeJS with Kafka. Whether you’re building real-time data pipelines, implementing ...
iptables - Port forwarding from one interface to another.
iptables is a user-space application program that allows a system administrator to configure the tables provided by the Linux kernel firewall (implemented as...
eth1 to eth0 on EDGE node.
Adding a route so that all the slaves residing on a private network can communicate directly with an external server through an EDGE node, using interface forwarding.
iptables - Port forwarding from one interface to another.
iptables is a user-space application program that allows a system administrator to configure the tables provided by the Linux kernel firewall (implemented as...
This section describes how to get monitoring data from Zabbix server through the command line and export these to a CSV formatted file.
This script creates an XML import file from a CSV file. We use two CSV files for this.
A module used to generate Zabbix import XML for SNMP traps. Below are the steps to create a Zabbix import for traps. These traps are captured from the snmptrap fun...
Getting HBase to perform well in a production environment is an iterative process that involves tweaking various configuration parameters. This post dives de...
Unable to locate package oracle-java7-installer on Ubuntu
I recently ran into a frustrating issue while trying to install Java 7 on an older Ubuntu system (Ubuntu 13, in this case). The seemingly simple command sud...
zabbix-java-gateway on Centos 6.5
Zabbix 2.0 introduced native support for monitoring JMX applications through the Zabbix Java gateway. This Java-based daemon acts as an intermediary, allowin...
The Java Development Kit (JDK) is an essential tool for Java developers, providing the necessary components to compile, debug, and run Java applications. Whi...
Unable to locate package oracle-java7-installer on Ubuntu
I recently ran into a frustrating issue while trying to install Java 7 on an older Ubuntu system (Ubuntu 13, in this case). The seemingly simple command sud...
The Java Development Kit (JDK) is an essential tool for Java developers, providing the necessary components to compile, debug, and run Java applications. Whi...
I was setting up Jekyll using Docker but was not able to install Ruby on the Docker machine because apt-add-repository was not working. Below is my Dockerfile when...
This guide demonstrates how to build a Node.js application that receives JSON data via HTTP and routes it to different Apache Kafka topics based on the URL p...
Getting HBase to perform well in a production environment is an iterative process that involves tweaking various configuration parameters. This post dives de...
Apache Kafka is a distributed streaming platform. Kafka 2.0 supports Kerberos authentication. Enabling Kerberos Authentication Using the Wizard on Cloudera m...
A deployment error occurs when setting up ZooKeeper on a new cluster using the Cloudera Manager UI, when an earlier cluster containing ZooKeeper was built using Cl...
This is a simple Kafka setup. In this setup we are running Kafka over a dedicated ZooKeeper service (not the standalone ZooKeeper that comes with Kafka).
This blog post provides a comprehensive, step-by-step guide to integrating NodeJS with Kafka. Whether you’re building real-time data pipelines, implementing ...
This guide demonstrates how to build a Node.js application that receives JSON data via HTTP and routes it to different Apache Kafka topics based on the URL p...
kafka-node
Ready to integrate your NodeJS application with Kafka? This post provides a comprehensive guide to setting up a Kafka producer using the kafka-node library. ...
This post outlines how to send JSON data from a client (browser or curl) through a NodeJS server and ultimately into a Kafka topic. This architecture is usef...
Apache Kafka is often described as a distributed, fault-tolerant, high-throughput publish-subscribe messaging system. Think of it as a central nervous system...
This blog post provides a comprehensive, step-by-step guide to integrating NodeJS with Kafka. Whether you’re building real-time data pipelines, implementing ...
This guide demonstrates how to build a Node.js application that receives JSON data via HTTP and routes it to different Apache Kafka topics based on the URL p...
kafka-node
Ready to integrate your NodeJS application with Kafka? This post provides a comprehensive guide to setting up a Kafka producer using the kafka-node library. ...
This post outlines how to send JSON data from a client (browser or curl) through a NodeJS server and ultimately into a Kafka topic. This architecture is usef...
Kerberos authentication in Google Dataproc clusters offers enhanced security and user isolation. It’s a crucial step for multi-tenancy, encryption, and user ...
Apache Kafka is a distributed streaming platform. Kafka 2.0 supports Kerberos authentication. Enabling Kerberos Authentication Using the Wizard on Cloudera m...
The Python API for Cloudera is really nice: apart from getting the cluster set up, we can also do configuration and automation. We use a lot of automation using C...
Apache Hive is a powerful data warehousing application built on top of Hadoop; it enables you to access your data using Hive QL, a language that is similar t...
We have two domain forests in our environment, ABC and XYZ. We were not able to authenticate normal users from either of the domains. Most of the information ...
Node.js, Nginx
This post provides a detailed sysctl.conf file, optimized for Linux systems running high-performance web servers like Nginx and Node.js applications. The sy...
sysctl.conf for Web and FTP Servers
The sysctl interface provides a powerful way to modify the behavior of a running Linux kernel. By editing the /etc/sysctl.conf file, you can persistently co...
sysctl.conf
This post delves into optimizing Hadoop performance at the kernel level using sysctl. The sysctl interface provides a way to dynamically modify a running Lin...
What is Kitchen? Kitchen provides a test harness to execute infrastructure code on one or more platforms in isolation.
Kitchen-Vagrant has the capability to spin up a Windows instance for testing. To make it work you will need vagrant-winrm to be installed on the worksta...
Data Bags are a way to store information on the chef-server which all the cookbooks can access. A few additional advantages are that we can encrypt the da...
We have two domain forests in our environment, ABC and XYZ. We were not able to authenticate normal users from either of the domains. Most of the information ...
Kernel-based Virtual Machine (KVM) has become a cornerstone of Linux server virtualization, known for its maturity and performance. Red Hat’s decision to fav...
Apache Hive is a powerful data warehousing application built on top of Hadoop; it enables you to access your data using Hive QL, a language that is similar t...
There are inherent structural differences between how Windows and Linux handle system users. The user schemas used in Active Directory and standard LDAPv3 di...
In this blog post we will set up a Postgres database on Docker using a Dockerfile.
A Python virtual environment creates an isolated workspace for Python work. This helps in creating project-specific virtual environments without worrying about ...
In this blog post we will set up a Postgres database on Docker.
List comprehensions provide an easy, functional way to create lists in Python. We can write in a single line of code what would otherwise take a few lines. l...
Python is a clear and powerful object-oriented programming language, comparable to Perl, Ruby, Scheme, or Java. This is basic documentation for getting star...
Kernel-based Virtual Machine (KVM) has become a cornerstone of Linux server virtualization, known for its maturity and performance. Red Hat’s decision to fav...
A deployment error occurs when setting up ZooKeeper on a new cluster using the Cloudera Manager UI, when an earlier cluster containing ZooKeeper was built using Cl...
We were deploying one of the clusters in our lab environment, which is used by everyone, so the lab has its own share of stale information.
We had a cluster environment in which we needed to update the /etc/hosts file, which would help the servers communicate over a private network. Our serve...
The Python API for Cloudera is really nice: apart from getting the cluster set up, we can also do configuration and automation. We use a lot of automation using C...
This is the second follow-up post. In the earlier post
This is the second follow-up post. In the earlier post, Setting Up Cloudera Manager Services Using Cloudera API [Part 1], we installed the Cloudera management se...
The Cloudera API is a very convenient way to set up a cluster and do more.
These are the basic steps to connect to Cloudera Manager.
This is a set of tests that we can run on a Hadoop environment. They are basic tests to make sure the environment is set up correctly.
0.0.0.0
[Cloudera]
We were working on setting up a cluster, but the Hue URL was set to a private IP of the server. As we had set up all the nodes to access each other using a pr...
We were working on Nagios and found that, after our migration, the service group summary was not working. You might get the error below on the screen and the solution...
The Zabbix history table gets really big, and if you are in a situation where you want to clean it up, you can do so using the steps below.
After a long time I was on my Windows machine and had to make it feel more like my Linux machine, so I installed what everyone else does: Cygwin. Surprise...
Monitoring MongoDB using Nagios XI is straightforward, but you might hit some issues while setting it up. Here are a few issues which might come up usin...
CouchDB is a database that completely embraces the web. Store your data with JSON documents. Access your documents and query your indexes with your web brows...
MongoDB is an open-source document database and a leading NoSQL database. MongoDB is written in C++. Below is a brief document about installing MongoDB on a...
Data Bags are a way to store information on the chef-server which all the cookbooks can access. A few additional advantages are that we can encrypt the da...
GNU find searches the directory tree rooted at each given file name by evaluating the given expression from left to right, according to the rules of preceden...
While working on a Nagios monitoring system, we were migrating from a 32-bit Nagios to a 64-bit architecture. Most of the graphs were not working as the R...
HTTPS is a protocol for secure communication over a computer network which is widely used on the Internet. HTTPS consists of communication over Hypertext Tra...
The standalone installation of Chef server creates a working installation on a single server. This installation is also useful when you are installing Chef s...
Pandoc is an open-source utility for creating documents from Markdown. It can create PDF, Doc, doc, HTML, and other formats, and can also be used to convert html ...
Checking whether a port is available on a remote machine using the nc command instead of telnet. The same command can be used to check a remote server as well; change the 127....
Linux Unified Key Setup-on-disk-format (or LUKS) allows you to encrypt partitions on your Linux computer. This is particularly important when it comes to mo...
How to move an existing MySQL data directory to a new location. We were running out of space and had to move the existing data directory to a new drive. B...
We were running out of disk space on one of the database servers and needed to find out the current table/database usage. Below are a few comm...
python 2.7.x on Centos 6.5/6.6
By default, CentOS comes with Python 2.6. In most cases we might need Python 2.7 or later. Below are a few ways to install Python 2.7 on ...
postgres and mysql.
After the Zabbix setup is done, we would like to take a backup of the database. Here we are doing a complete backup, but if we are looking for only configurati...
There might be situations where you have a remote/zabbix server which does not have internet connectivity, due to security or other reasons. So we create a ...
In Zabbix we don’t have a better way to capture SNMP traps. We have to manually create an item and a corresponding trigger to handle a trap arriving from the de...
zabbix-extra on Zabbix Version 2.4.
SNMP Builder/Extra is an add-in for Zabbix. It provides new web interface components to browse MIB trees and values. SNMP OIDs can then be automatically conv...
Receiving SNMP traps is the opposite of querying SNMP-enabled devices. In this case the information is sent from a SNMP-enabled device and is collected or “t...
The System Security Services Daemon (SSSD) provides a set of daemons to manage access to remote directories and authentication mechanisms. It provides Name S...
Setting up yum repos on RHEL using httpd. We will set up httpd and a yum repo on top of it so that we can access yum over HTTP.
If using access_provider = ldap, this option is mandatory. It specifies an LDAP search filter criteria that must be met for the user to be granted access on ...
Upgrading Cloudera Manager 5 to the latest Cloudera Manager. In most cases it is possible to complete the following upgrade without shutting down most CDH se...
Apache Hive is a powerful data warehousing application built on top of Hadoop; it enables you to access your data using Hive QL, a language that is similar t...
There are inherent structural differences between how Windows and Linux handle system users. The user schemas used in Active Directory and standard LDAPv3 di...
There are inherent structural differences between how Windows and Linux handle system users. The user schemas used in Active Directory and standard LDAPv3 di...
iptables - Port forwarding from one interface to another.
iptables is a user-space application program that allows a system administrator to configure the tables provided by the Linux kernel firewall (implemented as...
Pentaho Data Integration (PDI) is a suite of open source Business Intelligence (BI) products which provide data integration, OLAP services, reporting, dashb...
squid as a sibling to an already existing Parent squid.
Squid is a caching and forwarding web proxy. It has a wide variety of uses, from speeding up a web server by caching repeated requests; to caching web, DNS a...
CMAN v3 is a Corosync plugin that monitors the names and number of active cluster nodes in order to deliver membership and quorum information to clients (such...
eth1 to eth0 on EDGE node.
Adding a route so that all the slaves residing on a private network can communicate directly with an external server through an EDGE node, using interface forwarding.
ansible on RHEL 6.6.
Ansible is a radically simple IT automation engine that automates cloud provisioning, configuration management, application deployment, intra-service orchest...
parted.
GUID Partition Table (GPT) is a standard for the layout of the partition table on a physical hard disk, using globally unique identifiers (GUID). Although it...
A Network File System (NFS) allows remote hosts to mount file systems over a network and interact with those file systems as though they are mounted locally....
tarball.
This is a simple ZooKeeper playbook to quickly start ZooKeeper on one or more nodes in clustered mode.
This is a simple Storm Cluster Setup. We are using a dedicated Zookeeper Cluster/Node, instead of the standalone zkserver. Below is how we will deploy our c...
This is a simple Kafka setup. In this setup we are running kafka over a dedicated zookeeper service. (NOT the standalone zookeeper which comes with kafka)
tarball.
Setting up Hadoop using Ansible. We will use the CDH5 tarball to install the cluster.
Apache Spark is an in-memory distributed data analysis platform, primarily targeted at speeding up batch analysis jobs, iterative machine learning jobs, inte...
You can log in to a Linux system without entering a password using the steps below. We will be creating an SSH key which will be shared between the servers, which will ...
The Apache Cassandra database is the right choice when you need scalability and high availability without compromising performance. Linear scalability and pr...
The SpagoBI project is a free software/open source initiative by the SpagoBI Labs of Engineering Group. It aims to realize the most complete 100% open source...
Kernel-based Virtual Machine (KVM) has become a cornerstone of Linux server virtualization, known for its maturity and performance. Red Hat’s decision to fav...
Getting HBase to perform well in a production environment is an iterative process that involves tweaking various configuration parameters. This post dives de...
YARN - A Detailed Guide for Ubuntu 12.04 LTS
HBase is a powerful open-source, non-relational, distributed database designed for handling massive datasets. Inspired by Google’s BigTable, HBase is written...
Unable to locate package oracle-java7-installer on Ubuntu
I recently ran into a frustrating issue while trying to install Java 7 on an older Ubuntu system (Ubuntu 13, in this case). The seemingly simple command sud...
zabbix-java-gateway on CentOS 6.5
Zabbix 2.0 introduced native support for monitoring JMX applications through the Zabbix Java gateway. This Java-based daemon acts as an intermediary, allowin...
HAProxy is a robust, high-performance TCP/HTTP load balancer, perfectly suited for ensuring high availability and efficient traffic distribution across your ...
npm Behind a Corporate Proxy - A Comprehensive Guide
npm, short for Node Package Manager, is the cornerstone of JavaScript development within the Node.js ecosystem. It simplifies the process of managing projec...
This blog post provides a comprehensive, step-by-step guide to integrating NodeJS with Kafka. Whether you’re building real-time data pipelines, implementing ...
This guide demonstrates how to build a Node.js application that receives JSON data via HTTP and routes it to different Apache Kafka topics based on the URL p...
kafka-node
Ready to integrate your NodeJS application with Kafka? This post provides a comprehensive guide to setting up a Kafka producer using the kafka-node library. ...
This post outlines how to send JSON data from a client (browser or curl) through a NodeJS server and ultimately into a Kafka topic. This architecture is usef...
Node.js is an open-source, cross-platform JavaScript runtime environment that executes JavaScript code server-side. It’s incredibly popular for building scal...
Apache Kafka is often described as a distributed, fault-tolerant, high-throughput publish-subscribe messaging system. Think of it as a central nervous system...
sudo effective uid is not 0 Error on Linux
Encountered a frustrating issue on a Linux server while trying to use sudo? Specifically, the error message: sudo: effective uid is not 0, is sudo installed ...
Node.js, Nginx
This post provides a detailed sysctl.conf file, optimized for Linux systems running high-performance web servers like Nginx and Node.js applications. The sy...
sysctl.conf for Web and FTP Servers
The sysctl interface provides a powerful way to modify the behavior of a running Linux kernel. By editing the /etc/sysctl.conf file, you can persistently co...
This comprehensive guide explains how to assign multiple IP addresses to a single network interface in Linux. This technique, often called IP aliasing or cre...
Changing the timezone on your Ubuntu server is a crucial task for ensuring accurate timekeeping, especially when dealing with scheduled tasks, logs, and appl...
sysctl.conf
This post delves into optimizing Hadoop performance at the kernel level using sysctl. The sysctl interface provides a way to dynamically modify a running Lin...
The Java Development Kit (JDK) is an essential tool for Java developers, providing the necessary components to compile, debug, and run Java applications. Whi...
This script provides a simple way to create users on CentOS/RHEL 6.5 systems from the command line. It allows you to create both normal and sudo (administrat...
nginx
Nginx (pronounced “engine x”) is a powerful and versatile web server renowned for its high concurrency, exceptional performance, and efficient memory utiliza...
Swappiness is a crucial Linux kernel parameter that dictates how aggressively the system uses swap space. Understanding and configuring swappiness can signif...
Nginx with Heartbeat - A Step-by-Step Guide
In today’s digital landscape, ensuring high availability for your web applications is paramount. Downtime can lead to lost revenue, damaged reputation, and ...
mod_jk and httpd on CentOS - In-Memory Session Replication - A Deep Dive
In a previous post, we explored a basic session-based replication setup for SpagoBI. However, that configuration lacked true session redundancy.
This post details how to set up a Tomcat cluster for SpagoBI 5.1, building upon a previous guide for installing SpagoBI with MySQL (post). This configuration...
NGINX is a powerful and versatile open-source software that excels as an HTTP server, reverse proxy, and IMAP/POP3 proxy server. Renowned for its high perfo...
This guide walks you through configuring a High Availability (HA) Apache web server using Heartbeat on CentOS/RHEL 6. HA ensures your web server remains acce...
This script can be used to monitor NameNode parameters. It can either generate a Zabbix import XML or send monitoring data to a Zabbix server.
This section describes how to get monitoring data from the Zabbix server through the command line and export it to a CSV-formatted file.
This script creates an XML import file from a CSV file. We use two CSV files for this.
Send Zabbix metrics by reading from an XML file. This needs the zbxsend package from https://github.com/ahmedzbyr/zbxsend
Module used to generate Zabbix import XML for SNMP traps. Below are steps to create a Zabbix Import for Traps. These traps are captured from the snmptrap fun...
An easy way to collect files recursively over an SFTP server is to connect to the server over scp and do scp -r. The problem was that the device we were connecting ...
HAProxy is a robust, high-performance TCP/HTTP load balancer, perfectly suited for ensuring high availability and efficient traffic distribution across your ...
nginx
Nginx (pronounced “engine x”) is a powerful and versatile web server renowned for its high concurrency, exceptional performance, and efficient memory utiliza...
Nginx with Heartbeat - A Step-by-Step Guide
In today’s digital landscape, ensuring high availability for your web applications is paramount. Downtime can lead to lost revenue, damaged reputation, and ...
mod_jk and httpd on CentOS - In-Memory Session Replication - A Deep Dive
In a previous post, we explored a basic session-based replication setup for SpagoBI. However, that configuration lacked true session redundancy.
NGINX is a powerful and versatile open-source software that excels as an HTTP server, reverse proxy, and IMAP/POP3 proxy server. Renowned for its high perfo...
Tsung (formerly known as idx-Tsunami) is a powerful, open-source, multi-protocol distributed load testing tool. Written in Erlang, it’s licensed under the GP...
Kernel 2.6, x86_64 - A Comprehensive Guide
Siege is a powerful, open-source HTTP load testing and benchmarking utility. It’s designed to help web developers and system administrators measure the perfo...
Application logs are a treasure trove of information. Beyond debugging and troubleshooting, they can provide valuable insights into your application’s perfor...
Linux Unified Key Setup-on-disk-format (or LUKS) allows you to encrypt partitions on your Linux computer. This is particularly important when it comes to mo...
Upgrading Cloudera Manager 5 to the latest Cloudera Manager. In most cases it is possible to complete the following upgrade without shutting down most CDH se...
Pentaho Data Integration (PDI) is a suite of open source Business Intelligence (BI) products which provide data integration, OLAP services, reporting, dashb...
Not Enough Physical Memory Error in VMware Workstation 10 on Windows 8.1
Are you encountering the frustrating “Not enough physical memory is available to power this virtual machine” error in VMware Workstation 10 after a Windows u...
kafka-node
Ready to integrate your NodeJS application with Kafka? This post provides a comprehensive guide to setting up a Kafka producer using the kafka-node library. ...
Apache Kafka is often described as a distributed, fault-tolerant, high-throughput publish-subscribe messaging system. Think of it as a central nervous system...
Application logs are a treasure trove of information. Beyond debugging and troubleshooting, they can provide valuable insights into your application’s perfor...
This script can be used to monitor NameNode parameters. It can either generate a Zabbix import XML or send monitoring data to a Zabbix server.
This section describes how to get monitoring data from the Zabbix server through the command line and export it to a CSV-formatted file.
This script creates an XML import file from a CSV file. We use two CSV files for this.
Send Zabbix metrics by reading from an XML file. This needs the zbxsend package from https://github.com/ahmedzbyr/zbxsend
Module used to generate Zabbix import XML for SNMP traps. Below are steps to create a Zabbix Import for Traps. These traps are captured from the snmptrap fun...
mod_jk and httpd on CentOS - In-Memory Session Replication - A Deep Dive
In a previous post, we explored a basic session-based replication setup for SpagoBI. However, that configuration lacked true session redundancy.
Monitoring MongoDB using Nagios XI is straightforward, but you might run into some issues during setup. Here are a few issues which might come up usin...
MongoDB is an open-source document database and a leading NoSQL database. MongoDB is written in C++. Below is a brief document about installing a mongodb on a...
neo4j_doc_manager.
We had a requirement to replicate all the data in MongoDB to Neo4j in order to show a few graphs. Here is a quick way to demonstrate ...
Monitoring MongoDB using Nagios XI is straightforward, but you might run into some issues during setup. Here are a few issues which might come up usin...
MongoDB is an open-source document database and a leading NoSQL database. MongoDB is written in C++. Below is a brief document about installing a mongodb on a...
Terraform, Cloud Functions, Cloud Monitoring.
This blog post demonstrates how to automate Firestore health checks using Terraform to deploy a Cloud Function (Gen 1) and set up a complete monitoring and a...
Ensuring the health and availability of your Firestore database is crucial for maintaining a reliable application. This post provides a Python script that se...
Application logs are a treasure trove of information. Beyond debugging and troubleshooting, they can provide valuable insights into your application’s perfor...
Cloud VPN securely connects your peer network to your Virtual Private Cloud (VPC) network through an IPsec VPN connection.
Stackdriver provides monitoring, logging, error reporting, tracing and debugging services. It is currently known as Google Cloud’s operations suite.
We were working on Nagios and found that after our migration, the service group summary was not working. You might get the below error on the screen and the solution...
The Zabbix history table can get really big. If you are in a situation where you want to clean it up, you can do so using the steps below.
This is a quick reference for a few of the commands which I use often.
Migrating an old Nagios backup to a new Nagios XI server. Migration is simple in Nagios XI: you take a backup and restore it. Once that is done we need to update/a...
HTTPS is a protocol for secure communication over a computer network which is widely used on the Internet. HTTPS consists of communication over Hypertext Tra...
Nagios monitors your entire IT infrastructure to ensure systems, applications, services, and business processes are functioning properly. In the event of a f...
postgres and mysql.
After the Zabbix setup is done, we would like to take a backup of the database. Here we are doing a complete backup. But if we are looking for only configurati...
There might be situations where you have a remote/zabbix server which does not have internet connectivity, due to security or other reasons. So we create a ...
In Zabbix we don't have a better way to capture SNMP traps. We have to manually create an item and a corresponding trigger to handle a trap arriving from the de...
zabbix-extra on Zabbix Version 2.4.
SNMP Builder/Extra is an add-in for Zabbix. It provides new web interface components to browse MIB trees and values. SNMP OIDs can then be automatically conv...
zabbix-java-gateway on CentOS 6.5
Zabbix 2.0 introduced native support for monitoring JMX applications through the Zabbix Java gateway. This Java-based daemon acts as an intermediary, allowin...
Zabbix is the ultimate enterprise-level software designed for monitoring availability and performance of IT infrastructure components. Zabbix is open source ...
This script can be used to monitor NameNode parameters. It can either generate a Zabbix import XML or send monitoring data to a Zabbix server.
This section describes how to get monitoring data from the Zabbix server through the command line and export it to a CSV-formatted file.
This script creates an XML import file from a CSV file. We use two CSV files for this.
Send Zabbix metrics by reading from an XML file. This needs the zbxsend package from https://github.com/ahmedzbyr/zbxsend
Module used to generate Zabbix import XML for SNMP traps. Below are steps to create a Zabbix Import for Traps. These traps are captured from the snmptrap fun...
parted.
GUID Partition Table (GPT) is a standard for the layout of the partition table on a physical hard disk, using globally unique identifiers (GUID). Although it...
A Network File System (NFS) allows remote hosts to mount file systems over a network and interact with those file systems as though they are mounted locally....
This comprehensive guide explains how to assign multiple IP addresses to a single network interface in Linux. This technique, often called IP aliasing or cre...
How to move an existing data directory in mysql to a new location. We were running out of space and had to move the existing data directory to a new drive. B...
We were running out of disk space on one of the database servers and needed to find out the current table/database usage. Below are a few comm...
postgres and mysql.
After the Zabbix setup is done, we would like to take a backup of the database. Here we are doing a complete backup. But if we are looking for only configurati...
The SpagoBI project is a free software/open source initiative by the SpagoBI Labs of Engineering Group. It aims to realize the most complete 100% open source...
We were working on Nagios and found that after our migration, the service group summary was not working. You might get the below error on the screen and the solution...
The Zabbix history table can get really big. If you are in a situation where you want to clean it up, you can do so using the steps below.
Monitoring MongoDB using Nagios XI is straightforward, but you might run into some issues during setup. Here are a few issues which might come up usin...
CouchDB is a database that completely embraces the web. Store your data with JSON documents. Access your documents and query your indexes with your web brows...
MongoDB is an open-source document database and a leading NoSQL database. MongoDB is written in C++. Below is a brief document about installing a mongodb on a...
This is a quick reference for a few of the commands which I use often.
While working on a Nagios monitoring system, we migrated from a 32-bit Nagios to a 64-bit architecture. Most of the graphs were not working as the R...
Migrating an old Nagios backup to a new Nagios XI server. Migration is simple in Nagios XI: you take a backup and restore it. Once that is done we need to update/a...
HTTPS is a protocol for secure communication over a computer network which is widely used on the Internet. HTTPS consists of communication over Hypertext Tra...
Nagios monitors your entire IT infrastructure to ensure systems, applications, services, and business processes are functioning properly. In the event of a f...
postgres and mysql.
After the Zabbix setup is done, we would like to take a backup of the database. Here we are doing a complete backup. But if we are looking for only configurati...
There might be situations where you have a remote/zabbix server which does not have internet connectivity, due to security or other reasons. So we create a ...
In Zabbix we don't have a better way to capture SNMP traps. We have to manually create an item and a corresponding trigger to handle a trap arriving from the de...
zabbix-extra on Zabbix Version 2.4.
SNMP Builder/Extra is an add-in for Zabbix. It provides new web interface components to browse MIB trees and values. SNMP OIDs can then be automatically conv...
zabbix-java-gateway on CentOS 6.5
Zabbix 2.0 introduced native support for monitoring JMX applications through the Zabbix Java gateway. This Java-based daemon acts as an intermediary, allowin...
Zabbix is the ultimate enterprise-level software designed for monitoring availability and performance of IT infrastructure components. Zabbix is open source ...
This script can be used to monitor NameNode parameters. It can either generate a Zabbix import XML or send monitoring data to a Zabbix server.
This section describes how to get monitoring data from the Zabbix server through the command line and export it to a CSV-formatted file.
This script creates an XML import file from a CSV file. We use two CSV files for this.
Send Zabbix metrics by reading from an XML file. This needs the zbxsend package from https://github.com/ahmedzbyr/zbxsend
Module used to generate Zabbix import XML for SNMP traps. Below are steps to create a Zabbix Import for Traps. These traps are captured from the snmptrap fun...
We were working on Nagios and found that after our migration, the service group summary was not working. You might get the below error on the screen and the solution...
The Zabbix history table can get really big. If you are in a situation where you want to clean it up, you can do so using the steps below.
Monitoring MongoDB using Nagios XI is straightforward, but you might run into some issues during setup. Here are a few issues which might come up usin...
Migrating an old Nagios backup to a new Nagios XI server. Migration is simple in Nagios XI: you take a backup and restore it. Once that is done we need to update/a...
HTTPS is a protocol for secure communication over a computer network which is widely used on the Internet. HTTPS consists of communication over Hypertext Tra...
Nagios monitors your entire IT infrastructure to ensure systems, applications, services, and business processes are functioning properly. In the event of a f...
Checking whether a port is available on a remote machine using the nc command instead of telnet. The same command can be used to check a remote server as well; change the 127....
neo4j_doc_manager.
We had a requirement to replicate all the data in MongoDB to Neo4j in order to show a few graphs. Here is a quick way to demonstrate ...
iptables - Port forwarding from one interface to another.
iptables is a user-space application program that allows a system administrator to configure the tables provided by the Linux kernel firewall (implemented as...
eth1 to eth0 on EDGE node.
Adding a route so that all the slaves residing on a private network can communicate directly with an external server through an EDGE node, using interface forwarding.
This comprehensive guide explains how to assign multiple IP addresses to a single network interface in Linux. This technique, often called IP aliasing or cre...
Kernel-based Virtual Machine (KVM) has become a cornerstone of Linux server virtualization, known for its maturity and performance. Red Hat’s decision to fav...
npm Behind a Corporate Proxy - A Comprehensive Guide
npm, short for Node Package Manager, is the cornerstone of JavaScript development within the Node.js ecosystem. It simplifies the process of managing projec...
A Network File System (NFS) allows remote hosts to mount file systems over a network and interact with those file systems as though they are mounted locally....
Are you ready to dive into the world of automation? Ansible is a powerful tool that allows you to manage and configure systems with ease. This blog post will...
HAProxy is a robust, high-performance TCP/HTTP load balancer, perfectly suited for ensuring high availability and efficient traffic distribution across your ...
Node.js, Nginx
This post provides a detailed sysctl.conf file, optimized for Linux systems running high-performance web servers like Nginx and Node.js applications. The sy...
nginx
Nginx (pronounced “engine x”) is a powerful and versatile web server renowned for its high concurrency, exceptional performance, and efficient memory utiliza...
Nginx with Heartbeat - A Step-by-Step Guide
In today’s digital landscape, ensuring high availability for your web applications is paramount. Downtime can lead to lost revenue, damaged reputation, and ...
NGINX is a powerful and versatile open-source software that excels as an HTTP server, reverse proxy, and IMAP/POP3 proxy server. Renowned for its high perfo...
npm Behind a Corporate Proxy - A Comprehensive Guide
npm, short for Node Package Manager, is the cornerstone of JavaScript development within the Node.js ecosystem. It simplifies the process of managing projec...
This blog post provides a comprehensive, step-by-step guide to integrating NodeJS with Kafka. Whether you’re building real-time data pipelines, implementing ...
This guide demonstrates how to build a Node.js application that receives JSON data via HTTP and routes it to different Apache Kafka topics based on the URL p...
kafka-node
Ready to integrate your NodeJS application with Kafka? This post provides a comprehensive guide to setting up a Kafka producer using the kafka-node library. ...
This post outlines how to send JSON data from a client (browser or curl) through a NodeJS server and ultimately into a Kafka topic. This architecture is usef...
Node.js is an open-source, cross-platform JavaScript runtime environment that executes JavaScript code server-side. It’s incredibly popular for building scal...
Node.js, Nginx
This post provides a detailed sysctl.conf file, optimized for Linux systems running high-performance web servers like Nginx and Node.js applications. The sy...
npm Behind a Corporate Proxy - A Comprehensive Guide
npm, short for Node Package Manager, is the cornerstone of JavaScript development within the Node.js ecosystem. It simplifies the process of managing projec...
null_resource with local-exec and remote-exec in Terraform
Terraform, a widely adopted Infrastructure as Code (IaC) tool, offers a rich array of resources and provisioners for efficiently managing and configuring inf...
Application logs are a treasure trove of information. Beyond debugging and troubleshooting, they can provide valuable insights into your application’s perfor...
There might be situations where you have a remote/zabbix server which does not have internet connectivity, due to security or other reasons. So we create a ...
In Zabbix we don't have a better way to capture SNMP traps. We have to manually create an item and a corresponding trigger to handle a trap arriving from the de...
This is a quick reference for a few of the commands which I use often.
Migrating an old Nagios backup to a new Nagios XI server. Migration is simple in Nagios XI: you take a backup and restore it. Once that is done we need to update/a...
Node.js, Nginx
This post provides a detailed sysctl.conf file, optimized for Linux systems running high-performance web servers like Nginx and Node.js applications. The sy...
sysctl.conf
This post delves into optimizing Hadoop performance at the kernel level using sysctl. The sysctl interface provides a way to dynamically modify a running Lin...
CMAN v3 is a Corosync plugin that monitors the names and number of active cluster nodes in order to deliver membership and quorum information to clients (such...
Unable to locate package oracle-java7-installer on Ubuntu
I recently ran into a frustrating issue while trying to install Java 7 on an older Ubuntu system (Ubuntu 13, in this case). The seemingly simple command sud...
Ensuring consistency and streamlining infrastructure provisioning is crucial for effective cloud management. Creating custom virtual machine (VM) images on G...
The System Security Services Daemon (SSSD) provides a set of daemons to manage access to remote directories and authentication mechanisms. It provides Name S...
parted.
GUID Partition Table (GPT) is a standard for the layout of the partition table on a physical hard disk, using globally unique identifiers (GUID). Although it...
Pentaho Data Integration (PDI) is a suite of open source Business Intelligence (BI) products which provide data integration, OLAP services, reporting, dashb...
Node.js, Nginx
This post provides a detailed sysctl.conf file, optimized for Linux systems running high-performance web servers like Nginx and Node.js applications. The sy...
sysctl.conf for Web and FTP Servers
The sysctl interface provides a powerful way to modify the behavior of a running Linux kernel. By editing the /etc/sysctl.conf file, you can persistently co...
sysctl.conf
This post delves into optimizing Hadoop performance at the kernel level using sysctl. The sysctl interface provides a way to dynamically modify a running Lin...
nginx
Nginx (pronounced “engine x”) is a powerful and versatile web server renowned for its high concurrency, exceptional performance, and efficient memory utiliza...
Tsung (formerly known as idx-Tsunami) is a powerful, open-source, multi-protocol distributed load testing tool. Written in Erlang, it’s licensed under the GP...
Kernel 2.6, x86_64 - A Comprehensive Guide
Siege is a powerful, open-source HTTP load testing and benchmarking utility. It’s designed to help web developers and system administrators measure the perfo...
Getting HBase to perform well in a production environment is an iterative process that involves tweaking various configuration parameters. This post dives de...
sudo effective uid is not 0 Error on Linux
Encountered a frustrating issue on a Linux server while trying to use sudo? Specifically, the error message: sudo: effective uid is not 0, is sudo installed ...
Data is invaluable, and as any developer would testify, ensuring its security and availability in the event of unintended deletion or modification is paramou...
Are you ready to dive into the world of automation? Ansible is a powerful tool that allows you to manage and configure systems with ease. This blog post will...
iptables - Port forwarding from one interface to another.
iptables is a user-space application program that allows a system administrator to configure the tables provided by the Linux kernel firewall (implemented as...
In this blog post we will be setting up a postgres database on docker using Dockerfile.
In this blog post we will be setting up a postgres database on docker.
postgres and mysql.
After the Zabbix setup is done, we would like to take a backup of the database. Here we are doing a complete backup. But if we are looking for only configurati...
kafka-node
Ready to integrate your NodeJS application with Kafka? This post provides a comprehensive guide to setting up a Kafka producer using the kafka-node library. ...
Welcome back to our series on datastream workflows! In our previous article, we explored the basics of setting up a datastream workflow.
squid as a sibling to an already existing Parent squid.
Squid is a caching and forwarding web proxy. It has a wide variety of uses, from speeding up a web server by caching repeated requests; to caching web, DNS a...
HAProxy is a robust, high-performance TCP/HTTP load balancer, perfectly suited for ensuring high availability and efficient traffic distribution across your ...
npm Behind a Corporate Proxy - A Comprehensive Guide
npm, short for Node Package Manager, is the cornerstone of JavaScript development within the Node.js ecosystem. It simplifies the process of managing projec...
Ensuring the health and availability of your Firestore database is crucial for maintaining a reliable application. This post provides a Python script that se...
In this blog post, we will venture into exporting data from Firestore and Datastore modes. Find the code on Github.
Data is invaluable, and as any developer would testify, ensuring its security and availability in the event of unintended deletion or modification is paramou...
Security is paramount when setting up resources in the cloud or on-premises. It encompasses various layers of protection, including network security, encrypt...
null_resource with local-exec and remote-exec in Terraform
Terraform, a widely adopted Infrastructure as Code (IaC) tool, offers a rich array of resources and provisioners for efficiently managing and configuring inf...
BigQuery, Google’s fully-managed and serverless data warehouse, empowers organizations to analyze massive datasets with remarkable speed and efficiency. But ...
Google BigQuery is a powerful and cost-effective data warehouse that enables you to analyze massive datasets quickly. However, to get the most out of BigQuer...
Managing resources in your Google Cloud Platform (GCP) projects is a complex task, especially when dealing with multiple projects. Fortunately, Google offers...
In this blog post we will be setting up a postgres database on docker using Dockerfile.
A Python virtual environment creates an isolated workspace for Python work. This helps in creating project-specific virtual environments without worrying about ...
In this blog post we will be setting up a postgres database on docker.
List comprehensions provide an easy and functional way to create lists in Python. We can write in a single line of code what would otherwise take a few lines. l...
Python is a clear and powerful object-oriented programming language, comparable to Perl, Ruby, Scheme, or Java. This is basic documentation for getting star...
neo4j_doc_manager
.
We had a requirement to replicate all the data in MongoDB to Neo4j in order to show a few graphs. Here is a quick way to demonstrate ...
pyyaml
[Python]
PyYAML - The next generation YAML parser and emitter for Python.
I was working on getting post headers for my posts on this blog. I had a couple of images from Unsplash. But the header for the post needs to be a little more ho...
python 2.7.x
on Centos 6.5/6.6
By default, CentOS comes with Python 2.6. In most cases we might need Python 2.7 or later to be installed. Below are a few ways to install Python 2.7 on ...
This script can be used to monitor Namenode Parameters. This script can be used to Generate Zabbix Import XML or Send monitoring data to Zabbix server.
This section describes how to get monitoring data from Zabbix server through the command line and export these to a CSV formatted file.
This script creates an XML import file from a CSV file. We use 2 CSV files for this.
Send Zabbix metrics by reading from an XML file. This needs the zbxsend package from: https://github.com/ahmedzbyr/zbxsend
An easy way to collect files recursively from an SFTP server is to connect to the server over scp and do scp -r. Problem was that the device we were connecting ...
Apache Kafka is often described as a distributed, fault-tolerant, high-throughput publish-subscribe messaging system. Think of it as a central nervous system...
parted
.
GUID Partition Table (GPT) is a standard for the layout of the partition table on a physical hard disk, using globally unique identifiers (GUID). Although it...
In this blog post, we will venture into exporting data from Firestore and Datastore modes. Find the code on Github.
Data is invaluable, and as any developer would testify, ensuring its security and availability in the event of unintended deletion or modification is paramou...
Apache Kafka is a distributed streaming platform. Kafka 2.0 supports Kerberos authentication. Enabling Kerberos Authentication Using the Wizard on cloudera m...
Deployment error while setting up ZooKeeper on a new cluster using the Cloudera Manager UI, when an earlier cluster containing ZooKeeper was built using Cl...
We were deploying one of the clusters in our lab environment, which is used by everyone. So the lab has its own share of stale information on it.
We had a cluster environment in which we needed to update the /etc/hosts file, which would help the servers communicate over a private network. Our serve...
Nginx
with Heartbeat - A Step-by-Step Guide
In today’s digital landscape, ensuring high availability for your web applications is paramount. Downtime can lead to lost revenue, damaged reputation, and ...
mod_jk
and httpd
on CentOS - In-Memory Session Replication - A Deep Dive
In a previous post, we explored a basic session-based replication setup for SpagoBI. However, that configuration lacked true session redundancy.
NGINX is a powerful and versatile open-source software that excels as an HTTP server, reverse proxy, and IMAP/POP3 proxy server. Renowned for its high perfo...
This is a quick reference for few of the commands which I use often.
Migrating an old Nagios backup to a new Nagios XI server. Migration is simple in Nagios XI: you take a backup and restore it. Once that is done we need to update/a...
Nagios monitors your entire IT infrastructure to ensure systems, applications, services, and business processes are functioning properly. In the event of a f...
RPM (Redhat Package Manager) is the most popular package utility and is used mostly on RHEL, Centos and Fedora. RPM helps user/admins to build, install, que...
Setting up yum repos on RHEL using httpd. We will be setting up httpd and a yum repo on top of it, so that we can access yum over HTTP.
If using access_provider = ldap, this option is mandatory. It specifies an LDAP search filter criteria that must be met for the user to be granted access on ...
There are inherent structural differences between how Windows and Linux handle system users. The user schemas used in Active Directory and standard LDAPv3 di...
There are inherent structural differences between how Windows and Linux handle system users. The user schemas used in Active Directory and standard LDAPv3 di...
ansible
on RHEL 6.6.
Ansible is a radically simple IT automation engine that automates cloud provisioning, configuration management, application deployment, intra-service orchest...
You can log in to a Linux system without entering a password using the steps below. We will be creating an SSH key which will be shared between the servers, which will ...
The Apache Cassandra database is the right choice when you need scalability and high availability without compromising performance. Linear scalability and pr...
Kernel-based Virtual Machine (KVM) has become a cornerstone of Linux server virtualization, known for its maturity and performance. Red Hat’s decision to fav...
zabbix-java-gateway
on Centos 6.5
Zabbix 2.0 introduced native support for monitoring JMX applications through the Zabbix Java gateway. This Java-based daemon acts as an intermediary, allowin...
Node.js is an open-source, cross-platform JavaScript runtime environment that executes JavaScript code server-side. It’s incredibly popular for building scal...
This script provides a simple way to create users on CentOS/RHEL 6.5 systems from the command line. It allows you to create both normal and sudo (administrat...
This guide walks you through configuring a High Availability (HA) Apache web server using Heartbeat on CentOS/RHEL 6. HA ensures your web server remains acce...
sudo effective uid is not 0
Error on Linux
Encountered a frustrating issue on a Linux server while trying to use sudo? Specifically, the error message: sudo: effective uid is not 0, is sudo installed ...
This guide demonstrates how to build a Node.js application that receives JSON data via HTTP and routes it to different Apache Kafka topics based on the URL p...
RPM (Redhat Package Manager) is the most popular package utility and is used mostly on RHEL, Centos and Fedora. RPM helps user/admins to build, install, que...
While working on a Nagios monitoring system, we were migrating from a 32-bit Nagios to a 64-bit architecture. Most of the graphs were not working as the R...
I was setting up Jekyll using Docker, but was not able to install Ruby on the Docker machine as apt-add-repository was not working. Below is my Dockerfile when...
TL;DR In this post we will be setting up a scheduled job to take backups of a Bigtable table in Avro format.
This script provides a simple way to create users on CentOS/RHEL 6.5 systems from the command line. It allows you to create both normal and sudo (administrat...
An easy way to collect files recursively from an SFTP server is to connect to the server over scp and do scp -r. Problem was that the device we were connecting ...
The System Security Services Daemon (SSSD) provides a set of daemons to manage access to remote directories and authentication mechanisms. It provides Name S...
If using access_provider = ldap, this option is mandatory. It specifies an LDAP search filter criteria that must be met for the user to be granted access on ...
Apache Hive is a powerful data warehousing application built on top of Hadoop; it enables you to access your data using Hive QL, a language that is similar t...
There are inherent structural differences between how Windows and Linux handle system users. The user schemas used in Active Directory and standard LDAPv3 di...
There are inherent structural differences between how Windows and Linux handle system users. The user schemas used in Active Directory and standard LDAPv3 di...
sudo effective uid is not 0
Error on Linux
Encountered a frustrating issue on a Linux server while trying to use sudo? Specifically, the error message: sudo: effective uid is not 0, is sudo installed ...
sysctl.conf
for Web and FTP Servers
The sysctl interface provides a powerful way to modify the behavior of a running Linux kernel. By editing the /etc/sysctl.conf file, you can persistently co...
Apache HBase is a powerful NoSQL database, but out-of-the-box, it’s a bit like a house with no locks on the doors. Any user who can connect to the cluster ca...
Changing the timezone on your Ubuntu server is a crucial task for ensuring accurate timekeeping, especially when dealing with scheduled tasks, logs, and appl...
sudo effective uid is not 0
Error on Linux
Encountered a frustrating issue on a Linux server while trying to use sudo? Specifically, the error message: sudo: effective uid is not 0, is sudo installed ...
An easy way to collect files recursively from an SFTP server is to connect to the server over scp and do scp -r. Problem was that the device we were connecting ...
Tsung (formerly known as idx-Tsunami) is a powerful, open-source, multi-protocol distributed load testing tool. Written in Erlang, it’s licensed under the GP...
Kernel 2.6
, x86_64
- A Comprehensive Guide
Siege is a powerful, open-source HTTP load testing and benchmarking utility. It’s designed to help web developers and system administrators measure the perfo...
postgres
and mysql
.
After the Zabbix setup is done, we would like to take a backup of the database. Here we are doing a complete backup. But if we are looking for only configurati...
There might be situations where you have a remote/zabbix server which does not have internet connectivity, due to security or other reasons. So we create a ...
In Zabbix we don't have a better way to capture SNMP traps. We have to manually create an item and a corresponding trigger to handle a trap arriving from the de...
zabbix-extra
on Zabbix Version 2.4.
SNMP Builder/Extra is an add-in for Zabbix. It provides new web interface components to browse MIB trees and values. SNMP OIDs can then be automatically conv...
Receiving SNMP traps is the opposite to querying SNMP-enabled devices. In this case the information is sent from a SNMP-enabled device and is collected or “t...
The SpagoBI project is a free software/open source initiative by the SpagoBI Labs of Engineering Group. It aims to realize the most complete 100% open source...
This post details how to set up a Tomcat cluster for SpagoBI 5.1, building upon a previous guide for installing SpagoBI with MySQL (post). This configuration...
mod_jk
and httpd
on CentOS - In-Memory Session Replication - A Deep Dive
In a previous post, we explored a basic session-based replication setup for SpagoBI. However, that configuration lacked true session redundancy.
This post details how to set up a Tomcat cluster for SpagoBI 5.1, building upon a previous guide for installing SpagoBI with MySQL (post). This configuration...
The SpagoBI project is a free software/open source initiative by the SpagoBI Labs of Engineering Group. It aims to realize the most complete 100% open source...
Apache Spark is an in-memory distributed data analysis platform– primarily targeted at speeding up batch analysis jobs, iterative machine learning jobs, inte...
Cloud SQL is a fully managed service that supports MySQL, PostgreSQL, and Microsoft SQL Server. For horizontal scalability, consider using Cloud Spanner.
squid
as a sibling to an already existing Parent squid
.
Squid is a caching and forwarding web proxy. It has a wide variety of uses, from speeding up a web server by caching repeated requests; to caching web, DNS a...
HTTPS is a protocol for secure communication over a computer network which is widely used on the Internet. HTTPS consists of communication over Hypertext Tra...
The System Security Services Daemon (SSSD) provides a set of daemons to manage access to remote directories and authentication mechanisms. It provides Name S...
If using access_provider = ldap, this option is mandatory. It specifies an LDAP search filter criteria that must be met for the user to be granted access on ...
There are inherent structural differences between how Windows and Linux handle system users. The user schemas used in Active Directory and standard LDAPv3 di...
There are inherent structural differences between how Windows and Linux handle system users. The user schemas used in Active Directory and standard LDAPv3 di...
The System Security Services Daemon (SSSD) provides a set of daemons to manage access to remote directories and authentication mechanisms. It provides Name S...
This comprehensive guide explains how to assign multiple IP addresses to a single network interface in Linux. This technique, often called IP aliasing or cre...
Cloud Storage is Google Cloud’s object storage service, and it allows worldwide storage and retrieval of any amount of data at any time.
This is a simple Storm Cluster Setup. We are using a dedicated Zookeeper Cluster/Node, instead of the standalone zkserver. Below is how we will deploy our c...
Apache Spark is an in-memory distributed data analysis platform– primarily targeted at speeding up batch analysis jobs, iterative machine learning jobs, inte...
sudo effective uid is not 0
Error on Linux
Encountered a frustrating issue on a Linux server while trying to use sudo? Specifically, the error message: sudo: effective uid is not 0, is sudo installed ...
Swappiness is a crucial Linux kernel parameter that dictates how aggressively the system uses swap space. Understanding and configuring swappiness can signif...
Swappiness is a crucial Linux kernel parameter that dictates how aggressively the system uses swap space. Understanding and configuring swappiness can signif...
Getting HBase to perform well in a production environment is an iterative process that involves tweaking various configuration parameters. This post dives de...
Node.js
, Nginx
This post provides a detailed sysctl.conf file, optimized for Linux systems running high-performance web servers like Nginx and Node.js applications. The sy...
sysctl.conf
for Web and FTP Servers
The sysctl interface provides a powerful way to modify the behavior of a running Linux kernel. By editing the /etc/sysctl.conf file, you can persistently co...
sysctl.conf
This post delves into optimizing Hadoop performance at the kernel level using sysctl. The sysctl interface provides a way to dynamically modify a running Lin...
Swappiness is a crucial Linux kernel parameter that dictates how aggressively the system uses swap space. Understanding and configuring swappiness can signif...
Changing the timezone on your Ubuntu server is a crucial task for ensuring accurate timekeeping, especially when dealing with scheduled tasks, logs, and appl...
tarball
.
Setting up Hadoop using Ansible. We will be using the CDH5 tarball to install the cluster.
Infrastructure as Code (IaC) has revolutionized how we manage and provision infrastructure. Tools like Terraform allow us to define our infrastructure in cod...
Terraform
Cloud Functions
Cloud Monitoring
.
This blog post demonstrates how to automate Firestore health checks using Terraform to deploy a Cloud Function (Gen 1) and set up a complete monitoring and a...
Welcome back to our series on datastream workflows! In our previous article, we explored the basics of setting up a datastream workflow.
This guide provides detailed instructions on setting up a datastream connection from Cloud SQL (MySQL) to BigQuery. We aim to simplify the process by using d...
We will walk you through the process of setting up HashiCorp Vault to effectively manage your secrets. This guide covers getting started with Vault with terra...
Data is invaluable, and as any developer would testify, ensuring its security and availability in the event of unintended deletion or modification is paramou...
Ensuring consistency and streamlining infrastructure provisioning is crucial for effective cloud management. Creating custom virtual machine (VM) images on G...
Using Ansible within the initialization script of a Google Compute Engine (GCE) instance can streamline and automate the configuration and setup of your virt...
Security is paramount when setting up resources in the cloud or on-premises. It encompasses various layers of protection, including network security, encrypt...
null_resource
with local-exec
and remote-exec
in Terraform
Terraform, a widely adopted Infrastructure as Code (IaC) tool, offers a rich array of resources and provisioners for efficiently managing and configuring inf...
BigQuery, Google’s fully-managed and serverless data warehouse, empowers organizations to analyze massive datasets with remarkable speed and efficiency. But ...
Google BigQuery is a powerful and cost-effective data warehouse that enables you to analyze massive datasets quickly. However, to get the most out of BigQuer...
Managing resources in your Google Cloud Platform (GCP) projects is a complex task, especially when dealing with multiple projects. Fortunately, Google offers...
null_resource
When working with Terraform, it’s crucial to ensure that your infrastructure follows certain conditions or constraints. While Terraform provides validation m...
Validation in Terraform is an essential practice to detect and prevent errors early in the infrastructure provisioning process. By incorporating robust valid...
Kerberos authentication in Google Dataproc clusters offers enhanced security and user isolation. It’s a crucial step for multi-tenancy, encryption, and user ...
TL;DR In this post we will be setting up a scheduled job to take backups of a Bigtable table in Avro format.
Infrastructure as Code (IaC) has revolutionized how we manage and provision infrastructure. Tools like Terraform allow us to define our infrastructure in cod...
Infrastructure as Code (IaC) has revolutionized how we manage and provision infrastructure. Tools like Terraform allow us to define our infrastructure in cod...
This is a set of tests which we can run on a Hadoop environment. These are basic tests to make sure the environment is set up correctly.
Kitchen-Vagrant has the capability to spin up a Windows instance for testing. To make it work you will need vagrant-winrm to be installed on the worksta...
Tsung (formerly known as idx-Tsunami) is a powerful, open-source, multi-protocol distributed load testing tool. Written in Erlang, it’s licensed under the GP...
Kernel 2.6
, x86_64
- A Comprehensive Guide
Siege is a powerful, open-source HTTP load testing and benchmarking utility. It’s designed to help web developers and system administrators measure the perfo...
Changing the timezone on your Ubuntu server is a crucial task for ensuring accurate timekeeping, especially when dealing with scheduled tasks, logs, and appl...
The SpagoBI project is a free software/open source initiative by the SpagoBI Labs of Engineering Group. It aims to realize the most complete 100% open source...
mod_jk
and httpd
on CentOS - In-Memory Session Replication - A Deep Dive
In a previous post, we explored a basic session-based replication setup for SpagoBI. However, that configuration lacked true session redundancy.
This post details how to set up a Tomcat cluster for SpagoBI 5.1, building upon a previous guide for installing SpagoBI with MySQL (post). This configuration...
postgres
and mysql
.
After the Zabbix setup is done, we would like to take a backup of the database. Here we are doing a complete backup. But if we are looking for only configurati...
There might be situations where you have a remote/zabbix server which does not have internet connectivity, due to security or other reasons. So we create a ...
In Zabbix we don't have a better way to capture SNMP traps. We have to manually create an item and a corresponding trigger to handle a trap arriving from the de...
zabbix-extra
on Zabbix Version 2.4.
SNMP Builder/Extra is an add-in for Zabbix. It provides new web interface components to browse MIB trees and values. SNMP OIDs can then be automatically conv...
Receiving SNMP traps is the opposite to querying SNMP-enabled devices. In this case the information is sent from a SNMP-enabled device and is collected or “t...
Not Enough Physical Memory
Error in VMware Workstation 10 on Windows 8.1
Are you encountering the frustrating “Not enough physical memory is available to power this virtual machine” error in VMware Workstation 10 after a Windows u...
Unable to locate package oracle-java7-installer
on Ubuntu
I recently ran into a frustrating issue while trying to install Java 7 on an older Ubuntu system (Ubuntu 13, in this case). The seemingly simple command sud...
sudo effective uid is not 0
Error on Linux
Encountered a frustrating issue on a Linux server while trying to use sudo? Specifically, the error message: sudo: effective uid is not 0, is sudo installed ...
Tsung (formerly known as idx-Tsunami) is a powerful, open-source, multi-protocol distributed load testing tool. Written in Erlang, it’s licensed under the GP...
Node.js
, Nginx
This post provides a detailed sysctl.conf file, optimized for Linux systems running high-performance web servers like Nginx and Node.js applications. The sy...
sysctl.conf
for Web and FTP Servers
The sysctl interface provides a powerful way to modify the behavior of a running Linux kernel. By editing the /etc/sysctl.conf file, you can persistently co...
sysctl.conf
This post delves into optimizing Hadoop performance at the kernel level using sysctl. The sysctl interface provides a way to dynamically modify a running Lin...
nginx
Nginx (pronounced “engine x”) is a powerful and versatile web server renowned for its high concurrency, exceptional performance, and efficient memory utiliza...
Node.js is an open-source, cross-platform JavaScript runtime environment that executes JavaScript code server-side. It’s incredibly popular for building scal...
Kitchen-Vagrant has the capability to spin up a Windows instance for testing. To make it work you will need vagrant-winrm to be installed on the worksta...
Monitoring MongoDB using Nagios XI is straightforward, but you might run into some issues during setup. Here are a few issues which might come up usin...
CouchDB is a database that completely embraces the web. Store your data with JSON documents. Access your documents and query your indexes with your web brows...
MongoDB is an open-source document database, and leading NoSQL database. MongoDB is written in C++. Below is a brief document about installing MongoDB on a...
Data Bags are a way to store information on the chef-server which all the cookbooks can access. A few additional advantages are that we can encrypt the da...
GNU find searches the directory tree rooted at each given file name by evaluating the given expression from left to right, according to the rules of preceden...
While working on a Nagios monitoring system, we were migrating from a 32-bit Nagios to a 64-bit architecture. Most of the graphs were not working as the R...
YARN
- A Detailed Guide for Ubuntu 12.04 LTS
HBase is a powerful open-source, non-relational, distributed database designed for handling massive datasets. Inspired by Google’s BigTable, HBase is written...
Unable to locate package oracle-java7-installer
on Ubuntu
I recently ran into a frustrating issue while trying to install Java 7 on an older Ubuntu system (Ubuntu 13, in this case). The seemingly simple command sud...
Changing the timezone on your Ubuntu server is a crucial task for ensuring accurate timekeeping, especially when dealing with scheduled tasks, logs, and appl...
The Java Development Kit (JDK) is an essential tool for Java developers, providing the necessary components to compile, debug, and run Java applications. Whi...
Upgrading Cloudera Manager 5 to the latest Cloudera Manager. In most cases it is possible to complete the following upgrade without shutting down most CDH se...
Ensuring the health and availability of your Firestore database is crucial for maintaining a reliable application. This post provides a Python script that se...
This script provides a simple way to create users on CentOS/RHEL 6.5 systems from the command line. It allows you to create both normal and sudo (administrat...
We will walk you through the process of setting up HashiCorp Vault to effectively manage your secrets. This guide covers getting started with Vault with terra...
Kernel-based Virtual Machine (KVM) has become a cornerstone of Linux server virtualization, known for its maturity and performance. Red Hat’s decision to fav...
This comprehensive guide explains how to assign multiple IP addresses to a single network interface in Linux. This technique, often called IP aliasing or cre...
Kernel-based Virtual Machine (KVM) has become a cornerstone of Linux server virtualization, known for its maturity and performance. Red Hat’s decision to fav...
Not Enough Physical Memory
Error in VMware Workstation 10 on Windows 8.1
Are you encountering the frustrating “Not enough physical memory is available to power this virtual machine” error in VMware Workstation 10 after a Windows u...
Kernel-based Virtual Machine (KVM) has become a cornerstone of Linux server virtualization, known for its maturity and performance. Red Hat’s decision to fav...
Not Enough Physical Memory
Error in VMware Workstation 10 on Windows 8.1
Are you encountering the frustrating “Not enough physical memory is available to power this virtual machine” error in VMware Workstation 10 after a Windows u...
NGINX is a powerful and versatile open-source software that excels as an HTTP server, reverse proxy, and IMAP/POP3 proxy server. Renowned for its high perfo...
Are you ready to dive into the world of automation? Ansible is a powerful tool that allows you to manage and configure systems with ease. This blog post will...
Setting up yum repos on RHEL using httpd. We will be setting up httpd and a yum repo on top of it, so that we can access yum over HTTP.
This blog post provides a comprehensive, step-by-step guide to integrating NodeJS with Kafka. Whether you’re building real-time data pipelines, implementing ...
This guide demonstrates how to build a Node.js application that receives JSON data via HTTP and routes it to different Apache Kafka topics based on the URL p...
kafka-node
Ready to integrate your NodeJS application with Kafka? This post provides a comprehensive guide to setting up a Kafka producer using the kafka-node library. ...
This post outlines how to send JSON data from a client (browser or curl) through a NodeJS server and ultimately into a Kafka topic. This architecture is usef...
Node.js is an open-source, cross-platform JavaScript runtime environment that executes JavaScript code server-side. It’s incredibly popular for building scal...
Node.js
, Nginx
This post provides a detailed sysctl.conf file, optimized for Linux systems running high-performance web servers like Nginx and Node.js applications. The sy...
sysctl.conf
for Web and FTP Servers
The sysctl interface provides a powerful way to modify the behavior of a running Linux kernel. By editing the /etc/sysctl.conf file, you can persistently co...
Pandoc is an open-source utility to create documents from Markdown. We can create PDF, Doc, doc, HTML and other formats. It can also be used to convert html ...
Kitchen-Vagrant has the capability to spin up a Windows instance for testing. To make it work you will need vagrant-winrm to be installed on the worksta...
After a long time, I was on my Windows machine and had to make it feel more like my Linux machine. So I installed what everyone else does: Cygwin. Surpise...
knife-windows plugin adds additional functionality to the Chef Knife CLI tool for configuring / interacting with nodes running Microsoft Windows.
Host file contains IP followed by the FQDN which can be used to reach that IP address. Host file takes precedence over your DNS servers. In Microsoft operati...
I was working on a monitoring project and needed to create a large file to test notifications. Here is how we can do that.
The Chef Development Kit (ChefDK) brings the best-of-breed development tools built by the awesome Chef community to your workstation with just a few clicks. ...
Not Enough Physical Memory
Error in VMware Workstation 10 on Windows 8.1
Are you encountering the frustrating “Not enough physical memory is available to power this virtual machine” error in VMware Workstation 10 after a Windows u...
This script can be used to monitor Namenode Parameters. This script can be used to Generate Zabbix Import XML or Send monitoring data to Zabbix server.
Send Zabbix metrics by reading from an XML file. This needs the zbxsend package from: https://github.com/ahmedzbyr/zbxsend
YARN
distcp
.
Usually distcp and other batch jobs run for a very long time. This is fine if you are running a Hadoop environment without Kerberos. When we kerberize a clus...
distcp
.
When you see this error, the default thing to do is to set the HADOOP_CLIENT_OPTS config in your environment. In my case, I made a typo, HADDOP_CLIENT_OPTS, which caused us sp...
Getting HBase to perform well in a production environment is an iterative process that involves tweaking various configuration parameters. This post dives de...
YARN
- A Detailed Guide for Ubuntu 12.04 LTS
HBase is a powerful open-source, non-relational, distributed database designed for handling massive datasets. Inspired by Google’s BigTable, HBase is written...
Setting up yum repos on RHEL using httpd. We will be setting up httpd and a yum repo on top of it, so that we can access yum over HTTP.
We were working on Nagios and found that after our migration, the service group summary was not working. You might get the error below on the screen, and the solution...
The Zabbix history table gets really big. If you are in a situation where you want to clean it up, you can do so using the steps below.
postgres
and mysql
.
After the Zabbix setup is done, we would like to take a backup of the database. Here we are doing a complete backup. But if we are looking for only configurati...
There might be situations where you have a remote/zabbix server which does not have internet connectivity, due to security or other reasons. So we create a ...
In Zabbix we don't have a better way to capture SNMP traps. We have to manually create an item and a corresponding trigger to handle a trap arriving from the de...
zabbix-extra
on Zabbix Version 2.4.
SNMP Builder/Extra is an add-in for Zabbix. It provides new web interface components to browse MIB trees and values. SNMP OIDs can then be automatically conv...
Receiving SNMP traps is the opposite to querying SNMP-enabled devices. In this case the information is sent from a SNMP-enabled device and is collected or “t...
zabbix-java-gateway
on Centos 6.5
Zabbix 2.0 introduced native support for monitoring JMX applications through the Zabbix Java gateway. This Java-based daemon acts as an intermediary, allowin...
Zabbix is the ultimate enterprise-level software designed for monitoring availability and performance of IT infrastructure components. Zabbix is open source ...
This script can be used to monitor Namenode Parameters. This script can be used to Generate Zabbix Import XML or Send monitoring data to Zabbix server.
This section describes how to get monitoring data from Zabbix server through the command line and export these to a CSV formatted file.
This script creates an XML import file from a CSV file. We use 2 CSV files for this.
Send Zabbix metrics by reading from an XML file. This needs the zbxsend package from: https://github.com/ahmedzbyr/zbxsend
Module used to generate Zabbix import XML for SNMP traps. Below are the steps to create a Zabbix import for traps. These traps are captured from the snmptrap fun...
zabbix-java-gateway
on Centos 6.5
Zabbix 2.0 introduced native support for monitoring JMX applications through the Zabbix Java gateway. This Java-based daemon acts as an intermediary, allowin...
Send Zabbix metrics by reading from an XML file. This needs the zbxsend package from: https://github.com/ahmedzbyr/zbxsend
Deployment error while setting up ZooKeeper on a new cluster using the Cloudera Manager UI, when an earlier cluster containing ZooKeeper was built using Cl...
This is the second follow up post. In the earlier post Setting Up Cloudera Manager Services Using Cloudera API [Part 1] we install the cloudera management se...
tarball
.
This is a simple ZooKeeper playbook to quickly start ZooKeeper running on one or more nodes in clustered mode.
This is a simple Storm Cluster Setup. We are using a dedicated Zookeeper Cluster/Node, instead of the standalone zkserver. Below is how we will deploy our c...
This is a simple Kafka setup. In this setup we are running Kafka over a dedicated ZooKeeper service (NOT the standalone ZooKeeper which comes with Kafka).