Kafka Kerberos Enable and Testing.
Apache Kafka is a distributed streaming platform. Kafka 2.0 supports Kerberos authentication, Enabling Kerberos Authentication Using the Wizard on cloudera m...
Apache Kafka is a distributed streaming platform. Kafka 2.0 supports Kerberos authentication, Enabling Kerberos Authentication Using the Wizard on cloudera m...
If using access_provider = ldap, this option is mandatory. It specifies an LDAP search filter criteria that must be met for the user to be granted access on ...
There are inherent structural differences between how Windows and Linux handle system users. The user schemas used in Active Directory and standard LDAPv3 di...
There are inherent structural differences between how Windows and Linux handle system users. The user schemas used in Active Directory and standard LDAPv3 di...
Apache Hive is a powerful data warehousing application built on top of Hadoop; it enables you to access your data using Hive QL, a language that is similar t...
There are inherent structural differences between how Windows and Linux handle system users. The user schemas used in Active Directory and standard LDAPv3 di...
There are inherent structural differences between how Windows and Linux handle system users. The user schemas used in Active Directory and standard LDAPv3 di...
Using Ansible within the initialization script of a Google Compute Engine (GCE) instance can streamline and automate the configuration and setup of your virt...
ansible
on RHEL 6.6.
Ansible is a radically simple IT automation engine that automates cloud provisioning, configuration management, application deployment, intra-service orchest...
tarball
.
This is a simple zookeeper playbook, to quickly start zookeeper running on a single or more nodes, in a clustered mode.
This is a simple Storm Cluster Setup. We are using a dedicated Zookeeper Cluster/Node, instead of the standalone zkserver. Below is how we will deploy our c...
This is a simple Kafka setup. In this setup we are running kafka over a dedicated zookeeper service. (NOT the standalone zookeeper which comes with kafka)
tarball
.
Setting up Hadoop using Ansible, we will be using cdh5 tarball for installation of the cluster.
The System Security Services Daemon (SSSD) provides a set of daemons to manage access to remote directories and authentication mechanisms. It provides Name S...
The System Security Services Daemon (SSSD) provides a set of daemons to manage access to remote directories and authentication mechanisms. It provides Name S...
Enable Authorization on HBase.
postgres
and mysql
.
After the zabbix setup is done, we would like to take backup of the database. Here we are doing a complete backup. But if we are looking for only configurati...
The SpagoBI project is a free software/open source initiative by the SpagoBI Labs of Engineering Group. It aims to realize the most complete 100% open source...
mod_jk
and httpd
on Centos - In-Memory Session Replication.
In an earlier post we did a basic session based replication, but the session was not redundant.
Setting up SpagoBI 5.1 on Tomcat with MySQL post. Details on Tomcat Clustering.
BigQuery, Google’s fully-managed and serverless data warehouse, empowers organizations to analyze massive datasets with remarkable speed and efficiency. But ...
Google BigQuery is a powerful and cost-effective data warehouse that enables you to analyze massive datasets quickly. However, to get the most out of BigQuer...
The Apache Cassandra database is the right choice when you need scalability and high availability without compromising performance. Linear scalability and pr...
This cookbook [currently as of now] can be used to setup a Cloudera Manager Server (Management Server) running on MySQL/Postgres database. But the intended u...
Apache Kafka is a distributed streaming platform. Kafka 2.0 supports Kerberos authentication, Enabling Kerberos Authentication Using the Wizard on cloudera m...
Deployment error during setting zookeeper on a new cluster using Cloudera Manager UI, when an earlier cluster containing zookeeper earlier was built using Cl...
We were deploying one of the cluster on our lab environment which is used by everyone. So the lab has it own share of stale information on it.
We had a cluster environment which we needed to update the /etc/hosts file. Which would help communicate between the server over a private network. Our serve...
Data Bags are a way to store information on the chef-server which all the cookbooks can access. Few more additional advantages are that we can encrypt the da...
This is a quick reference for few of the commands which I use often.
GNU find searches the directory tree rooted at each given file name by evaluating the given expression from left to right, according to the rules of preceden...
When we were working on a nagios monitoring system we were migrating from a 32bit nagios to a 64bit Architecture. Most of the graphs are not working as the R...
Migrating an old Nagios backup to a new nagiosxi server. Migration is simple in Nagios XI, you a backup and restore it. Once that is done we need to update/a...
HTTPS is a protocol for secure communication over a computer network which is widely used on the Internet. HTTPS consists of communication over Hypertext Tra...
The standalone installation of Chef server creates a working installation on a single server. This installation is also useful when you are installing Chef s...
Pandoc is an opensource utility to create documents from markdown. We can create PDF, Doc, doc, html and other formats. And can be also used to convert html ...
Nagios monitors your entire IT infrastructure to ensure systems, applications, services, and business processes are functioning properly. In the event of a f...
RPM (Redhat Package Manager) is the most popular package utility and is used mostly on RHEL, Centos and Fedora. RPM helps user/admins to build, install, que...
Was working on a monitoring project, need to create a large file to test notifications. Here is how we can do that.
This cookbook [currently as of now] can be used to setup a Cloudera Manager Server (Management Server) running on MySQL/Postgres database. But the intended u...
Checking port available on a remote machine using nc command instead of telnet. Same command can be used to check on a remote server as well, change the 127....
Linux Unified Key Setup-on-disk-format (or LUKS) allows you to encrypt partitions on your Linux computer. This is particularly important when it comes to mo...
How to move an existing data directory in mysql to a new location. We were running out of space and had to move the existing data directory to a new drive. B...
We were running out of disk space on one of the databases server, we need to get information on what the current table/database usage was. Below are few comm...
python 2.7.x
on Centos 6.5/6.6
By default centos comes with python 2.6. In most of the cases we might need python 2.7 or later to be installed. Below are few ways to install python 2.7 on ...
postgres
and mysql
.
After the zabbix setup is done, we would like to take backup of the database. Here we are doing a complete backup. But if we are looking for only configurati...
There might be situations where you have a remote/zabbix server which does not have internet connectivity, due to security or other reasons. So we create a ...
In zabbix we dont have a better way to capture snmptraps. we have to manually create an item and corresponfding trigger to handle a trap arriving from the de...
zabbix-extra
on Zabbix Version 2.4.
SNMP Builder/Extra is an add-in for Zabbix. It provides new web interface components to browse MIB trees and values. SNMP OIDs can then be automatically conv...
Receiving SNMP traps is the opposite to querying SNMP-enabled devices. In this case the information is sent from a SNMP-enabled device and is collected or “t...
Setting up yum repos on RHEL using httpd. We will be setting up httpd and yum repo on top of it. So that we can access yum over http.
If using access_provider = ldap, this option is mandatory. It specifies an LDAP search filter criteria that must be met for the user to be granted access on ...
There are inherent structural differences between how Windows and Linux handle system users. The user schemas used in Active Directory and standard LDAPv3 di...
There are inherent structural differences between how Windows and Linux handle system users. The user schemas used in Active Directory and standard LDAPv3 di...
ansible
on RHEL 6.6.
Ansible is a radically simple IT automation engine that automates cloud provisioning, configuration management, application deployment, intra-service orchest...
You can enter a linux system without entring a password using below steps. We will be creating a ssh jey which will be share between the servers, which will ...
The Apache Cassandra database is the right choice when you need scalability and high availability without compromising performance. Linear scalability and pr...
The SpagoBI project is a free software/open source initiative by the SpagoBI Labs of Engineering Group. It aims to realize the most complete 100% open source...
KVM is a kernel-based Virutal Machine which grows quickly in maturity and popularity in the Linux server market. Red Hat officially dropped Xen in favor of K...
zabbix-java-gateway
on Centos 6.5
Zabbix 2.0 adds native support for monitoring JMX applications by introducing a new Zabbix daemon called Zabbix Java gateway. Zabbix Java gateway is a daemon...
Node.js is an open-source, cross-platform runtime environment for developing server-side web applications. Node.js applications are written in JavaScript and...
Default passwd is set as username@123 As we are setting the chage, user needs to change the passwd on first login.
Zabbix is the ultimate enterprise-level software designed for monitoring availability and performance of IT infrastructure components. Zabbix is open source ...
What is Kitchen? Kitchen provides a test harness to execute infrastructure code on one or more platforms in isolation.
We had a cluster environment which we needed to update the /etc/hosts file. Which would help communicate between the server over a private network. Our serve...
Kitchen-Vagrant has the capability to spin up a windows instance for testing. To make it work you will need the vagrant-winrm to be installted on the worksta...
Data Bags are a way to store information on the chef-server which all the cookbooks can access. Few more additional advantages are that we can encrypt the da...
knife-windows plugin adds additional functionality to the Chef Knife CLI tool for configuring / interacting with nodes running Microsoft Windows.
The standalone installation of Chef server creates a working installation on a single server. This installation is also useful when you are installing Chef s...
This cookbook [currently as of now] can be used to setup a Cloudera Manager Server (Management Server) running on MySQL/Postgres database. But the intended u...
The Chef Development Kit (ChefDK) brings the best-of-breed development tools built by the awesome Chef community to your workstation with just a few clicks. ...
knife-windows plugin adds additional functionality to the Chef Knife CLI tool for configuring / interacting with nodes running Microsoft Windows.
The standalone installation of Chef server creates a working installation on a single server. This installation is also useful when you are installing Chef s...
This cookbook [currently as of now] can be used to setup a Cloudera Manager Server (Management Server) running on MySQL/Postgres database. But the intended u...
The Chef Development Kit (ChefDK) brings the best-of-breed development tools built by the awesome Chef community to your workstation with just a few clicks. ...
Cloud VPN securely connects your peer network to your Virtual Private Cloud (VPC) network through an IPsec VPN connection.
Stackdriver driver provides monitoring, logging, error reporting, tracing and debugging services. Currently known as Google Cloud’s operations suite.
Cloud SQL is a fully managed service which includes MySQL, PostgreSQL, Microsoft SQL Server. And for horizontal scalability, consider using Cloud Spanner.
Google Cloud offers IAM, which lets you give more granular access to specific Google Cloud resources and prevents unwanted access to other resources.
Cloud Storage is Google Cloud’s object storage service, and it allows worldwide storage and retrieval of any amount of data at anytime.
YARN
distcp
.
Usually distcp and other batch jobs run for a very long time. This is fine if you are running a HADOOP environment without kerberos. When we kerberize a clus...
doBulkLoad
- LoadIncrementalHfiles
failed. This is recoverable. Hbase Encryption Zone.
We were working with a team who were uploading data into HBASE using the doBulkLoad, also we had moved the /hbase to encryption zone. The issue started after...
HDFS
health using fsck
.
When we have large data sets on the cluster, there will be corruptions of blocks. This could be due to disk or any other.
HBase Error Could not seek StoreFileScanner
Invalid HFile block magic
We ran into an issue which was causing HBase to fail. When the job was rerun then it would work again. In the logs we got 2 errors.
Apache Kafka is a distributed streaming platform. Kafka 2.0 supports Kerberos authentication, Enabling Kerberos Authentication Using the Wizard on cloudera m...
Deployment error during setting zookeeper on a new cluster using Cloudera Manager UI, when an earlier cluster containing zookeeper earlier was built using Cl...
We were deploying one of the cluster on our lab environment which is used by everyone. So the lab has it own share of stale information on it.
Python API for cloudera is really nice, apart from getting the cluster setup, we can also do configuration and automation. We use a lot of automation using C...
This is the second follow up post. In the earlier post
This is the second follow up post. In the earlier post Setting Up Cloudera Manager Services Using Cloudera API [Part 1] we install the cloudera management se...
Cloudera API is a very convenient way to setup a cluster and do more.
This is a basic steps to get connected with cloudera manager.
These are a set of testing which we can do on a Hadoop environment. These are basic testing to make sure the environment is setup correctly.
0.0.0.0
[Cloudera]
We were working on setting up a cluster, but the Hue URL was set to a private IP of the server. As we had setup all the nodes to access each other using a pr...
Upgrading Cloudera Manager 5 to the Latest Cloudera Manager, In most cases it is possible to complete the following upgrade without shutting down most CDH se...
Apache Hive is a powerful data warehousing application built on top of Hadoop; it enables you to access your data using Hive QL, a language that is similar t...
Pentaho Data Integration (PDI) is a suite of open source Business Intelligence (BI) products which provide data integration, OLAP services, reporting, dashb...
Python API for cloudera is really nice, apart from getting the cluster setup, we can also do configuration and automation. We use a lot of automation using C...
This is the second follow up post. In the earlier post
This is the second follow up post. In the earlier post Setting Up Cloudera Manager Services Using Cloudera API [Part 1] we install the cloudera management se...
Cloudera API is a very convenient way to setup a cluster and do more.
This is a basic steps to get connected with cloudera manager.
This cookbook [currently as of now] can be used to setup a Cloudera Manager Server (Management Server) running on MySQL/Postgres database. But the intended u...
Apache Kafka is a distributed streaming platform. Kafka 2.0 supports Kerberos authentication, Enabling Kerberos Authentication Using the Wizard on cloudera m...
Deployment error during setting zookeeper on a new cluster using Cloudera Manager UI, when an earlier cluster containing zookeeper earlier was built using Cl...
We were deploying one of the cluster on our lab environment which is used by everyone. So the lab has it own share of stale information on it.
We had a cluster environment which we needed to update the /etc/hosts file. Which would help communicate between the server over a private network. Our serve...
CMAN v3 is a Corsync plugin that monitors the names and number of active cluster nodes in order to deliver membership and quorum information to clients (such...
GNU find searches the directory tree rooted at each given file name by evaluating the given expression from left to right, according to the rules of preceden...
Swappiness is a Linux kernel parameter that controls the relative weight given to swapping out runtime memory, as opposed to dropping pages from the system p...
CouchDB is a database that completely embraces the web. Store your data with JSON documents. Access your documents and query your indexes with your web brows...
Default passwd is set as username@123 As we are setting the chage, user needs to change the passwd on first login.
This script can be used to monitor Namenode Parameters. This script can be used to Generate Zabbix Import XML or Send monitoring data to Zabbix server.
This section describes how to get monitoring data from Zabbix server through the command line and export these to a CSV formatted file.
This script is to create a xml import file from a csv file. We use 2 csv file for this.
Module used to generate Zabbix import xml for snmp traps, Below are steps to create a Zabbix Import for Traps. These traps are captured from the snmptrap fun...
After a longtime I was on my windows machine and had to make it feel more like my linux machine. So install the thing what everyone else does cygwin. Surpise...
Easy way to collect files recursively over a sftp server is to connect to the server over scp and do scp -r. Problem was that the device we were connecting ...
Pentaho Data Integration (PDI) is a suite of open source Business Intelligence (BI) products which provide data integration, OLAP services, reporting, dashb...
Apache Spark is an in-memory distributed data analysis platform– primarily targeted at speeding up batch analysis jobs, iterative machine learning jobs, inte...
TL;DR In this post we will be setting up a scheduled job to take backup for Bigtable table in avro format.
Kerberos authentication in Google Dataproc clusters offers enhanced security and user isolation. It’s a crucial step for multi-tenancy, encryption, and user ...
Welcome back to our series on datastream workflows! In our previous article, we explored the basics of setting up a datastream workflow.
This guide provides detailed instructions on setting up a datastream connection from Cloud SQL (MySQL) to BigQuery. We aim to simplify the process by using d...
Linux Unified Key Setup-on-disk-format (or LUKS) allows you to encrypt partitions on your Linux computer. This is particularly important when it comes to mo...
YARN
distcp
.
Usually distcp and other batch jobs run for a very long time. This is fine if you are running a HADOOP environment without kerberos. When we kerberize a clus...
distcp
.
When you see this error default thing to do is the set the HADOOP_CLIENT_OPTS config on your env. In My case did a typo HADDOP_CLIENT_OPTS which caused us sp...
Was setting up jekyll using docker, but was not able to install ruby on the docker machine as apt-add-repository was not working. Below is my Dockerfile when...
Pandoc is an opensource utility to create documents from markdown. We can create PDF, Doc, doc, html and other formats. And can be also used to convert html ...
Linux Unified Key Setup-on-disk-format (or LUKS) allows you to encrypt partitions on your Linux computer. This is particularly important when it comes to mo...
We were working on nagios and found that after our migration, service group summary was not working. You might get below error on the screen and the solution...
We have 2 domains forests in our environment, ABC and XYZ. We were not able to authenticate normal users from either of the domains. Most of the information ...
Not enough physical memory is available to power this virtual machine
If you are using VMWare Workstation (or VMWare player) on Windows 8.1 and have just update Windows, specifically KB2995388, you may receive this error messag...
null_resource
with local-exec
and remote-exec
in Terraform
Terraform, a widely adopted Infrastructure as Code (IaC) tool, offers a rich array of resources and provisioners for efficiently managing and configuring inf...
In this blog post, we will venture into exporting data from Firestore and Datastore modes. Find the code on Github.
Easy way to collect files recursively over a sftp server is to connect to the server over scp and do scp -r. Problem was that the device we were connecting ...
sysctl is an interface that allows you to make changes to a running Linux kernel. With /etc/sysctl.conf you can configure various Linux networking and system...
Ensuring consistency and streamlining infrastructure provisioning is crucial for effective cloud management. Creating custom virtual machine (VM) images on G...
Using Ansible within the initialization script of a Google Compute Engine (GCE) instance can streamline and automate the configuration and setup of your virt...
Cloud VPN securely connects your peer network to your Virtual Private Cloud (VPC) network through an IPsec VPN connection.
Stackdriver driver provides monitoring, logging, error reporting, tracing and debugging services. Currently known as Google Cloud’s operations suite.
Cloud SQL is a fully managed service which includes MySQL, PostgreSQL, Microsoft SQL Server. And for horizontal scalability, consider using Cloud Spanner.
Google Cloud offers IAM, which lets you give more granular access to specific Google Cloud resources and prevents unwanted access to other resources.
Cloud Storage is Google Cloud’s object storage service, and it allows worldwide storage and retrieval of any amount of data at anytime.
Security is paramount when setting up resources in the cloud or on-premises. It encompasses various layers of protection, including network security, encrypt...
null_resource
with local-exec
and remote-exec
in Terraform
Terraform, a widely adopted Infrastructure as Code (IaC) tool, offers a rich array of resources and provisioners for efficiently managing and configuring inf...
getmerge command takes a source directory and a destination file as input and concatenates files in src into the destination local file.
Easy way to collect files recursively over a sftp server is to connect to the server over scp and do scp -r. Problem was that the device we were connecting ...
parted
.
GUID Partition Table (GPT) is a standard for the layout of the partition table on a physical hard disk, using globally unique identifiers (GUID). Although it...
CMAN v3 is a Corsync plugin that monitors the names and number of active cluster nodes in order to deliver membership and quorum information to clients (such...
HAProxy is the Reliable, High Performance TCP/HTTP Load Balancer and it works nicely with Deveo Cluster setup.
getmerge command takes a source directory and a destination file as input and concatenates files in src into the destination local file.
YARN
distcp
.
Usually distcp and other batch jobs run for a very long time. This is fine if you are running a HADOOP environment without kerberos. When we kerberize a clus...
distcp
.
When you see this error default thing to do is the set the HADOOP_CLIENT_OPTS config on your env. In My case did a typo HADDOP_CLIENT_OPTS which caused us sp...
doBulkLoad
- LoadIncrementalHfiles
failed. This is recoverable. Hbase Encryption Zone.
We were working with a team who were uploading data into HBASE using the doBulkLoad, also we had moved the /hbase to encryption zone. The issue started after...
HDFS
health using fsck
.
When we have large data sets on the cluster, there will be corruptions of blocks. This could be due to disk or any other.
HBase Error Could not seek StoreFileScanner
Invalid HFile block magic
We ran into an issue which was causing HBase to fail. When the job was rerun then it would work again. In the logs we got 2 errors.
We were deploying one of the cluster on our lab environment which is used by everyone. So the lab has it own share of stale information on it.
We had a cluster environment which we needed to update the /etc/hosts file. Which would help communicate between the server over a private network. Our serve...
Python API for cloudera is really nice, apart from getting the cluster setup, we can also do configuration and automation. We use a lot of automation using C...
This is the second follow up post. In the earlier post
This is the second follow up post. In the earlier post Setting Up Cloudera Manager Services Using Cloudera API [Part 1] we install the cloudera management se...
Cloudera API is a very convenient way to setup a cluster and do more.
This is a basic steps to get connected with cloudera manager.
These are a set of testing which we can do on a Hadoop environment. These are basic testing to make sure the environment is setup correctly.
0.0.0.0
[Cloudera]
We were working on setting up a cluster, but the Hue URL was set to a private IP of the server. As we had setup all the nodes to access each other using a pr...
Setting up yum repos on RHEL using httpd. We will be setting up httpd and yum repo on top of it. So that we can access yum over http.
If using access_provider = ldap, this option is mandatory. It specifies an LDAP search filter criteria that must be met for the user to be granted access on ...
Upgrading Cloudera Manager 5 to the Latest Cloudera Manager, In most cases it is possible to complete the following upgrade without shutting down most CDH se...
Apache Hive is a powerful data warehousing application built on top of Hadoop; it enables you to access your data using Hive QL, a language that is similar t...
There are inherent structural differences between how Windows and Linux handle system users. The user schemas used in Active Directory and standard LDAPv3 di...
There are inherent structural differences between how Windows and Linux handle system users. The user schemas used in Active Directory and standard LDAPv3 di...
Pentaho Data Integration (PDI) is a suite of open source Business Intelligence (BI) products which provide data integration, OLAP services, reporting, dashb...
We have 2 domains forests in our environment, ABC and XYZ. We were not able to authenticate normal users from either of the domains. Most of the information ...
eth1
to eth0
on EDGE
node.
Adding route to all the slaves which reside on a private network to communicate with External Server directly using an EDGE node using Interface Forwarding.
ansible
on RHEL 6.6.
Ansible is a radically simple IT automation engine that automates cloud provisioning, configuration management, application deployment, intra-service orchest...
tarball
.
This is a simple zookeeper playbook, to quickly start zookeeper running on a single or more nodes, in a clustered mode.
This is a simple Storm Cluster Setup. We are using a dedicated Zookeeper Cluster/Node, instead of the standalone zkserver. Below is how we will deploy our c...
This is a simple Kafka setup. In this setup we are running kafka over a dedicated zookeeper service. (NOT the standalone zookeeper which comes with kafka)
tarball
.
Setting up Hadoop using Ansible, we will be using cdh5 tarball for installation of the cluster.
The Apache Cassandra database is the right choice when you need scalability and high availability without compromising performance. Linear scalability and pr...
Using HBase in production often requires that you turn many knobs to make it hum as expected. More Here http://hbase.apache.org/0.94/book/performance.html
HBase is an open source, non-relational, distributed database modeled after Google’s BigTable and written in Java. It is developed as part of Apache Software...
Apache Kafka is publish-subscribe messaging rethought as a distributed commit log. Kafka is a distributed, partitioned, replicated commit log service. It pro...
sysctl.conf
parameters.
Performace tuning Hadoop at kernel level. sysctl is an interface that allows you to make changes to a running Linux kernel. With /etc/sysctl.conf you can con...
Swappiness is a Linux kernel parameter that controls the relative weight given to swapping out runtime memory, as opposed to dropping pages from the system p...
First lets setup the Hbase Configuration Files. For pseudo-distributed replace with 'localhost'
This script can be used to monitor Namenode Parameters. This script can be used to Generate Zabbix Import XML or Send monitoring data to Zabbix server.
First lets setup the Hbase Configuration Files. For pseudo-distributed replace with 'localhost'
CMAN v3 is a Corsync plugin that monitors the names and number of active cluster nodes in order to deliver membership and quorum information to clients (such...
HAProxy is the Reliable, High Performance TCP/HTTP Load Balancer and it works nicely with Deveo Cluster setup.
We will walk you through the process of setting up HashiCorp Vault to effectively manage your secrets. This guide is an gettinf started with vault with terra...
doBulkLoad
- LoadIncrementalHfiles
failed. This is recoverable. Hbase Encryption Zone.
We were working with a team who were uploading data into HBASE using the doBulkLoad, also we had moved the /hbase to encryption zone. The issue started after...
HBase Error Could not seek StoreFileScanner
Invalid HFile block magic
We ran into an issue which was causing HBase to fail. When the job was rerun then it would work again. In the logs we got 2 errors.
Using HBase in production often requires that you turn many knobs to make it hum as expected. More Here http://hbase.apache.org/0.94/book/performance.html
HBase is an open source, non-relational, distributed database modeled after Google’s BigTable and written in Java. It is developed as part of Apache Software...
First lets setup the Hbase Configuration Files. For pseudo-distributed replace with 'localhost'
Enable Authorization on HBase.
First lets setup the Hbase Configuration Files. For pseudo-distributed replace with 'localhost'
getmerge command takes a source directory and a destination file as input and concatenates files in src into the destination local file.
HDFS
health using fsck
.
When we have large data sets on the cluster, there will be corruptions of blocks. This could be due to disk or any other.
HBase Error Could not seek StoreFileScanner
Invalid HFile block magic
We ran into an issue which was causing HBase to fail. When the job was rerun then it would work again. In the logs we got 2 errors.
This is the second follow up post. In the earlier post
First lets setup the Hbase Configuration Files. For pseudo-distributed replace with 'localhost'
CMAN v3 is a Corsync plugin that monitors the names and number of active cluster nodes in order to deliver membership and quorum information to clients (such...
heartbeat
for nginx
.
Heartbeat is a daemon that provides cluster infrastructure (communication and membership) services to its clients. This allows clients to know about the pres...
httpd is the Apache HyperText Transfer Protocol (HTTP) server program. It is designed to be run as a standalone daemon process. When used like this it will c...
CMAN v3 is a Corsync plugin that monitors the names and number of active cluster nodes in order to deliver membership and quorum information to clients (such...
heartbeat
for nginx
.
Heartbeat is a daemon that provides cluster infrastructure (communication and membership) services to its clients. This allows clients to know about the pres...
mod_jk
and httpd
on Centos - In-Memory Session Replication.
In an earlier post we did a basic session based replication, but the session was not redundant.
NGINX
for HTTP
load balancing.
NGINX is a free, open-source, high-performance HTTP server and reverse proxy, as well as an IMAP/POP3 proxy server. NGINX is known for its high performance, ...
httpd is the Apache HyperText Transfer Protocol (HTTP) server program. It is designed to be run as a standalone daemon process. When used like this it will c...
HAProxy is the Reliable, High Performance TCP/HTTP Load Balancer and it works nicely with Deveo Cluster setup.
Apache Hive is a powerful data warehousing application built on top of Hadoop; it enables you to access your data using Hive QL, a language that is similar t...
Setting up yum repos on RHEL using httpd. We will be setting up httpd and yum repo on top of it. So that we can access yum over http.
heartbeat
for nginx
.
Heartbeat is a daemon that provides cluster infrastructure (communication and membership) services to its clients. This allows clients to know about the pres...
mod_jk
and httpd
on Centos - In-Memory Session Replication.
In an earlier post we did a basic session based replication, but the session was not redundant.
Setting up SpagoBI 5.1 on Tomcat with MySQL post. Details on Tomcat Clustering.
NGINX
for HTTP
load balancing.
NGINX is a free, open-source, high-performance HTTP server and reverse proxy, as well as an IMAP/POP3 proxy server. NGINX is known for its high performance, ...
httpd is the Apache HyperText Transfer Protocol (HTTP) server program. It is designed to be run as a standalone daemon process. When used like this it will c...
Setting up yum repos on RHEL using httpd. We will be setting up httpd and yum repo on top of it. So that we can access yum over http.
heartbeat
for nginx
.
Heartbeat is a daemon that provides cluster infrastructure (communication and membership) services to its clients. This allows clients to know about the pres...
mod_jk
and httpd
on Centos - In-Memory Session Replication.
In an earlier post we did a basic session based replication, but the session was not redundant.
NGINX
for HTTP
load balancing.
NGINX is a free, open-source, high-performance HTTP server and reverse proxy, as well as an IMAP/POP3 proxy server. NGINX is known for its high performance, ...
httpd is the Apache HyperText Transfer Protocol (HTTP) server program. It is designed to be run as a standalone daemon process. When used like this it will c...
HTTPS is a protocol for secure communication over a computer network which is widely used on the Internet. HTTPS consists of communication over Hypertext Tra...
Security is paramount when setting up resources in the cloud or on-premises. It encompasses various layers of protection, including network security, encrypt...
Google Cloud offers IAM, which lets you give more granular access to specific Google Cloud resources and prevents unwanted access to other resources.
I was working on getting post headers for my post on this blog. I had couple of images from unsplash. But the header for the post need to be a little more ho...
python 2.7.x
on Centos 6.5/6.6
By default centos comes with python 2.6. In most of the cases we might need python 2.7 or later to be installed. Below are few ways to install python 2.7 on ...
The Java Development Kit (JDK) is an implementation of either one of the Java SE, Java EE or Java ME platforms released by Oracle Corporation in the form of ...
Below are the list required to integrate NodeJS and Kafka. This is a simple HOWTO to get started.
iptables
- Port
forwarding from one interface to another.
iptables is a user-space application program that allows a system administrator to configure the tables provided by the Linux kernel firewall (implemented as...
eth1
to eth0
on EDGE
node.
Adding route to all the slaves which reside on a private network to communicate with External Server directly using an EDGE node using Interface Forwarding.
iptables
- Port
forwarding from one interface to another.
iptables is a user-space application program that allows a system administrator to configure the tables provided by the Linux kernel firewall (implemented as...
This section describes how to get monitoring data from Zabbix server through the command line and export these to a CSV formatted file.
This script is to create a xml import file from a csv file. We use 2 csv file for this.
Module used to generate Zabbix import xml for snmp traps, Below are steps to create a Zabbix Import for Traps. These traps are captured from the snmptrap fun...
Was installing Java today, this is an easy install thanks to ppa:webupd8team/java, but when I tried it was not working, but has worked for me all this while.
zabbix-java-gateway
on Centos 6.5
Zabbix 2.0 adds native support for monitoring JMX applications by introducing a new Zabbix daemon called Zabbix Java gateway. Zabbix Java gateway is a daemon...
The Java Development Kit (JDK) is an implementation of either one of the Java SE, Java EE or Java ME platforms released by Oracle Corporation in the form of ...
Was installing Java today, this is an easy install thanks to ppa:webupd8team/java, but when I tried it was not working, but has worked for me all this while.
The Java Development Kit (JDK) is an implementation of either one of the Java SE, Java EE or Java ME platforms released by Oracle Corporation in the form of ...
Was setting up jekyll using docker, but was not able to install ruby on the docker machine as apt-add-repository was not working. Below is my Dockerfile when...
Kafka-node is a Node.js client with Zookeeper integration for Apache Kafka 0.8.1 and later. The Zookeeper integration does the following jobs:
Apache Kafka is a distributed streaming platform. Kafka 2.0 supports Kerberos authentication, Enabling Kerberos Authentication Using the Wizard on cloudera m...
Deployment error during setting zookeeper on a new cluster using Cloudera Manager UI, when an earlier cluster containing zookeeper earlier was built using Cl...
This is a simple Kafka setup. In this setup we are running kafka over a dedicated zookeeper service. (NOT the standalone zookeeper which comes with kafka)
Below are the list required to integrate NodeJS and Kafka. This is a simple HOWTO to get started.
Kafka-node is a Node.js client with Zookeeper integration for Apache Kafka 0.8.1 and later. The Zookeeper integration does the following jobs:
kafka-node
Now that we have Kafka and NodeJS ready. Lets some data to our Kafka Cluster.
What we are trying to achieve ?
Apache Kafka is publish-subscribe messaging rethought as a distributed commit log. Kafka is a distributed, partitioned, replicated commit log service. It pro...
Below are the list required to integrate NodeJS and Kafka. This is a simple HOWTO to get started.
Kafka-node is a Node.js client with Zookeeper integration for Apache Kafka 0.8.1 and later. The Zookeeper integration does the following jobs:
kafka-node
Now that we have Kafka and NodeJS ready. Lets some data to our Kafka Cluster.
What we are trying to achieve ?
Kerberos authentication in Google Dataproc clusters offers enhanced security and user isolation. It’s a crucial step for multi-tenancy, encryption, and user ...
Apache Kafka is a distributed streaming platform. Kafka 2.0 supports Kerberos authentication, Enabling Kerberos Authentication Using the Wizard on cloudera m...
Python API for cloudera is really nice, apart from getting the cluster setup, we can also do configuration and automation. We use a lot of automation using C...
Apache Hive is a powerful data warehousing application built on top of Hadoop; it enables you to access your data using Hive QL, a language that is similar t...
We have 2 domains forests in our environment, ABC and XYZ. We were not able to authenticate normal users from either of the domains. Most of the information ...
Have updated and explanation in the conf file below.
sysctl is an interface that allows you to make changes to a running Linux kernel. With /etc/sysctl.conf you can configure various Linux networking and system...
sysctl.conf
parameters.
Performace tuning Hadoop at kernel level. sysctl is an interface that allows you to make changes to a running Linux kernel. With /etc/sysctl.conf you can con...
What is Kitchen? Kitchen provides a test harness to execute infrastructure code on one or more platforms in isolation.
Kitchen-Vagrant has the capability to spin up a windows instance for testing. To make it work you will need the vagrant-winrm to be installted on the worksta...
Data Bags are a way to store information on the chef-server which all the cookbooks can access. Few more additional advantages are that we can encrypt the da...
We have 2 domains forests in our environment, ABC and XYZ. We were not able to authenticate normal users from either of the domains. Most of the information ...
KVM is a kernel-based Virutal Machine which grows quickly in maturity and popularity in the Linux server market. Red Hat officially dropped Xen in favor of K...
Apache Hive is a powerful data warehousing application built on top of Hadoop; it enables you to access your data using Hive QL, a language that is similar t...
There are inherent structural differences between how Windows and Linux handle system users. The user schemas used in Active Directory and standard LDAPv3 di...
In this blog post we will be setting up a postgres database on docker using Dockerfile.
Python virtual environment creates a isoloated workspace of python work. This helps in creating project specific virtual environment without worrying about ...
In this blog post we will be setting up a postgres database on docker.
List Comprehensions provides easy and functional way to create list in python. We could make a single line of code which otherwise would take a few lines. l...
Python is a clear and powerful object-oriented programming language, comparable to Perl, Ruby, Scheme, or Java. This is basic documentation for getting star...
Deployment error during setting zookeeper on a new cluster using Cloudera Manager UI, when an earlier cluster containing zookeeper earlier was built using Cl...
We were deploying one of the cluster on our lab environment which is used by everyone. So the lab has it own share of stale information on it.
We had a cluster environment which we needed to update the /etc/hosts file. Which would help communicate between the server over a private network. Our serve...
Python API for cloudera is really nice, apart from getting the cluster setup, we can also do configuration and automation. We use a lot of automation using C...
This is the second follow up post. In the earlier post
This is the second follow up post. In the earlier post Setting Up Cloudera Manager Services Using Cloudera API [Part 1] we install the cloudera management se...
Cloudera API is a very convenient way to setup a cluster and do more.
This is a basic steps to get connected with cloudera manager.
These are a set of testing which we can do on a Hadoop environment. These are basic testing to make sure the environment is setup correctly.
0.0.0.0
[Cloudera]
We were working on setting up a cluster, but the Hue URL was set to a private IP of the server. As we had setup all the nodes to access each other using a pr...
We were working on nagios and found that after our migration, service group summary was not working. You might get below error on the screen and the solution...
Zabbix history table gets really big, and if you are in a situation where you want to clean it up. Then we can do so, using the below steps.
After a longtime I was on my windows machine and had to make it feel more like my linux machine. So install the thing what everyone else does cygwin. Surpise...
Monitoring for mongodb using nagiosxi is straight forword but you might have some issues when we are setting up. Here are few issues which might come up usin...
CouchDB is a database that completely embraces the web. Store your data with JSON documents. Access your documents and query your indexes with your web brows...
MongoDB is an open-source document database, and leading NoSQL database. MongoDB is written in c++. Below is a brief document about installing a mongodb on a...
Data Bags are a way to store information on the chef-server which all the cookbooks can access. Few more additional advantages are that we can encrypt the da...
GNU find searches the directory tree rooted at each given file name by evaluating the given expression from left to right, according to the rules of preceden...
When we were working on a nagios monitoring system we were migrating from a 32bit nagios to a 64bit Architecture. Most of the graphs are not working as the R...
HTTPS is a protocol for secure communication over a computer network which is widely used on the Internet. HTTPS consists of communication over Hypertext Tra...
The standalone installation of Chef server creates a working installation on a single server. This installation is also useful when you are installing Chef s...
Pandoc is an opensource utility to create documents from markdown. We can create PDF, Doc, doc, html and other formats. And can be also used to convert html ...
Checking port available on a remote machine using nc command instead of telnet. Same command can be used to check on a remote server as well, change the 127....
Linux Unified Key Setup-on-disk-format (or LUKS) allows you to encrypt partitions on your Linux computer. This is particularly important when it comes to mo...
How to move an existing data directory in mysql to a new location. We were running out of space and had to move the existing data directory to a new drive. B...
We were running out of disk space on one of the databases server, we need to get information on what the current table/database usage was. Below are few comm...
python 2.7.x
on Centos 6.5/6.6
By default centos comes with python 2.6. In most of the cases we might need python 2.7 or later to be installed. Below are few ways to install python 2.7 on ...
postgres
and mysql
.
After the zabbix setup is done, we would like to take backup of the database. Here we are doing a complete backup. But if we are looking for only configurati...
There might be situations where you have a remote/zabbix server which does not have internet connectivity, due to security or other reasons. So we create a ...
In zabbix we dont have a better way to capture snmptraps. we have to manually create an item and corresponfding trigger to handle a trap arriving from the de...
zabbix-extra
on Zabbix Version 2.4.
SNMP Builder/Extra is an add-in for Zabbix. It provides new web interface components to browse MIB trees and values. SNMP OIDs can then be automatically conv...
Receiving SNMP traps is the opposite to querying SNMP-enabled devices. In this case the information is sent from a SNMP-enabled device and is collected or “t...
The System Security Services Daemon (SSSD) provides a set of daemons to manage access to remote directories and authentication mechanisms. It provides Name S...
Setting up yum repos on RHEL using httpd. We will be setting up httpd and yum repo on top of it. So that we can access yum over http.
If using access_provider = ldap, this option is mandatory. It specifies an LDAP search filter criteria that must be met for the user to be granted access on ...
Upgrading Cloudera Manager 5 to the Latest Cloudera Manager, In most cases it is possible to complete the following upgrade without shutting down most CDH se...
Apache Hive is a powerful data warehousing application built on top of Hadoop; it enables you to access your data using Hive QL, a language that is similar t...
There are inherent structural differences between how Windows and Linux handle system users. The user schemas used in Active Directory and standard LDAPv3 di...
There are inherent structural differences between how Windows and Linux handle system users. The user schemas used in Active Directory and standard LDAPv3 di...
iptables
- Port
forwarding from one interface to another.
iptables is a user-space application program that allows a system administrator to configure the tables provided by the Linux kernel firewall (implemented as...
Pentaho Data Integration (PDI) is a suite of open source Business Intelligence (BI) products which provide data integration, OLAP services, reporting, dashb...
squid
as a sibling to an already existing Parent squid
.
Squid is a caching and forwarding web proxy. It has a wide variety of uses, from speeding up a web server by caching repeated requests; to caching web, DNS a...
CMAN v3 is a Corsync plugin that monitors the names and number of active cluster nodes in order to deliver membership and quorum information to clients (such...
eth1
to eth0
on EDGE
node.
Adding route to all the slaves which reside on a private network to communicate with External Server directly using an EDGE node using Interface Forwarding.
ansible
on RHEL 6.6.
Ansible is a radically simple IT automation engine that automates cloud provisioning, configuration management, application deployment, intra-service orchest...
parted
.
GUID Partition Table (GPT) is a standard for the layout of the partition table on a physical hard disk, using globally unique identifiers (GUID). Although it...
A Network File System (NFS) allows remote hosts to mount file systems over a network and interact with those file systems as though they are mounted locally....
tarball
.
This is a simple zookeeper playbook, to quickly start zookeeper running on a single or more nodes, in a clustered mode.
This is a simple Storm Cluster Setup. We are using a dedicated Zookeeper Cluster/Node, instead of the standalone zkserver. Below is how we will deploy our c...
This is a simple Kafka setup. In this setup we are running kafka over a dedicated zookeeper service. (NOT the standalone zookeeper which comes with kafka)
tarball
.
Setting up Hadoop using Ansible, we will be using cdh5 tarball for installation of the cluster.
Apache Spark is an in-memory distributed data analysis platform– primarily targeted at speeding up batch analysis jobs, iterative machine learning jobs, inte...
You can enter a linux system without entring a password using below steps. We will be creating a ssh jey which will be share between the servers, which will ...
The Apache Cassandra database is the right choice when you need scalability and high availability without compromising performance. Linear scalability and pr...
The SpagoBI project is a free software/open source initiative by the SpagoBI Labs of Engineering Group. It aims to realize the most complete 100% open source...
KVM is a kernel-based Virutal Machine which grows quickly in maturity and popularity in the Linux server market. Red Hat officially dropped Xen in favor of K...
Using HBase in production often requires that you turn many knobs to make it hum as expected. More Here http://hbase.apache.org/0.94/book/performance.html
HBase is an open source, non-relational, distributed database modeled after Google’s BigTable and written in Java. It is developed as part of Apache Software...
Was installing Java today, this is an easy install thanks to ppa:webupd8team/java, but when I tried it was not working, but has worked for me all this while.
zabbix-java-gateway
on Centos 6.5
Zabbix 2.0 adds native support for monitoring JMX applications by introducing a new Zabbix daemon called Zabbix Java gateway. Zabbix Java gateway is a daemon...
HAProxy is the Reliable, High Performance TCP/HTTP Load Balancer and it works nicely with Deveo Cluster setup.
npm
behind a proxy
npm stands for Node Package Manager, and is the default package manager for the JavaScript runtime environment Node.js.
Below are the list required to integrate NodeJS and Kafka. This is a simple HOWTO to get started.
Kafka-node is a Node.js client with Zookeeper integration for Apache Kafka 0.8.1 and later. The Zookeeper integration does the following jobs:
kafka-node
Now that we have Kafka and NodeJS ready. Lets some data to our Kafka Cluster.
What we are trying to achieve ?
Node.js is an open-source, cross-platform runtime environment for developing server-side web applications. Node.js applications are written in JavaScript and...
Apache Kafka is publish-subscribe messaging rethought as a distributed commit log. Kafka is a distributed, partitioned, replicated commit log service. It pro...
Got into this issue on the server, when trying to install mysql-server.
Have updated and explanation in the conf file below.
sysctl is an interface that allows you to make changes to a running Linux kernel. With /etc/sysctl.conf you can configure various Linux networking and system...
Current IP of the server : 10.130.18.11. Virtual IPs being assigned to server : 10.130.18.22, 10.130.18.23, 10.130.18.24 to our server.. Go to network-scrip...
Changing Timezone in Ubuntu server.
sysctl.conf
parameters.
Performace tuning Hadoop at kernel level. sysctl is an interface that allows you to make changes to a running Linux kernel. With /etc/sysctl.conf you can con...
The Java Development Kit (JDK) is an implementation of either one of the Java SE, Java EE or Java ME platforms released by Oracle Corporation in the form of ...
Default passwd is set as username@123 As we are setting the chage, user needs to change the passwd on first login.
nginx
Nginx (pronounced engine x) is a web server with a strong focus on high concurrency, performance and low memory usage. It can also act as a reverse proxy ser...
Swappiness is a Linux kernel parameter that controls the relative weight given to swapping out runtime memory, as opposed to dropping pages from the system p...
heartbeat
for nginx
.
Heartbeat is a daemon that provides cluster infrastructure (communication and membership) services to its clients. This allows clients to know about the pres...
mod_jk
and httpd
on Centos - In-Memory Session Replication.
In an earlier post we did a basic session based replication, but the session was not redundant.
Setting up SpagoBI 5.1 on Tomcat with MySQL post. Details on Tomcat Clustering.
NGINX
for HTTP
load balancing.
NGINX is a free, open-source, high-performance HTTP server and reverse proxy, as well as an IMAP/POP3 proxy server. NGINX is known for its high performance, ...
httpd is the Apache HyperText Transfer Protocol (HTTP) server program. It is designed to be run as a standalone daemon process. When used like this it will c...
This script can be used to monitor Namenode Parameters. This script can be used to Generate Zabbix Import XML or Send monitoring data to Zabbix server.
This section describes how to get monitoring data from Zabbix server through the command line and export these to a CSV formatted file.
This script is to create a xml import file from a csv file. We use 2 csv file for this.
Send Zabbix metric by reading from a XML file. This needs package zbxsend from : https://github.com/ahmedzbyr/zbxsend
Module used to generate Zabbix import xml for snmp traps, Below are steps to create a Zabbix Import for Traps. These traps are captured from the snmptrap fun...
Easy way to collect files recursively over a sftp server is to connect to the server over scp and do scp -r. Problem was that the device we were connecting ...
HAProxy is the Reliable, High Performance TCP/HTTP Load Balancer and it works nicely with Deveo Cluster setup.
nginx
Nginx (pronounced engine x) is a web server with a strong focus on high concurrency, performance and low memory usage. It can also act as a reverse proxy ser...
heartbeat
for nginx
.
Heartbeat is a daemon that provides cluster infrastructure (communication and membership) services to its clients. This allows clients to know about the pres...
mod_jk
and httpd
on Centos - In-Memory Session Replication.
In an earlier post we did a basic session based replication, but the session was not redundant.
NGINX
for HTTP
load balancing.
NGINX is a free, open-source, high-performance HTTP server and reverse proxy, as well as an IMAP/POP3 proxy server. NGINX is known for its high performance, ...
Tsung (formerly known as idx-Tsunami) is a stress testing tool written in the Erlang language and distributed under the GPL license. It can currently stress ...
Siege is an http load testing and benchmarking utility. It was designed to let web developers measure their code under duress, to see how it will stand up to...
Linux Unified Key Setup-on-disk-format (or LUKS) allows you to encrypt partitions on your Linux computer. This is particularly important when it comes to mo...
Upgrading Cloudera Manager 5 to the Latest Cloudera Manager, In most cases it is possible to complete the following upgrade without shutting down most CDH se...
Pentaho Data Integration (PDI) is a suite of open source Business Intelligence (BI) products which provide data integration, OLAP services, reporting, dashb...
This script can be used to monitor Namenode Parameters. This script can be used to Generate Zabbix Import XML or Send monitoring data to Zabbix server.
This section describes how to get monitoring data from Zabbix server through the command line and export these to a CSV formatted file.
This script is to create a xml import file from a csv file. We use 2 csv file for this.
Send Zabbix metric by reading from a XML file. This needs package zbxsend from : https://github.com/ahmedzbyr/zbxsend
Module used to generate Zabbix import xml for snmp traps, Below are steps to create a Zabbix Import for Traps. These traps are captured from the snmptrap fun...
Monitoring for mongodb using nagiosxi is straight forword but you might have some issues when we are setting up. Here are few issues which might come up usin...
MongoDB is an open-source document database, and leading NoSQL database. MongoDB is written in c++. Below is a brief document about installing a mongodb on a...
neo4j_doc_manager
.
We had a requirement where we wanted to have all the data which is in mongodb to be replicated on neo4j to show few graphs. Here is quick way to demonstrate ...
Monitoring for mongodb using nagiosxi is straight forword but you might have some issues when we are setting up. Here are few issues which might come up usin...
MongoDB is an open-source document database, and leading NoSQL database. MongoDB is written in c++. Below is a brief document about installing a mongodb on a...
Cloud VPN securely connects your peer network to your Virtual Private Cloud (VPC) network through an IPsec VPN connection.
Stackdriver driver provides monitoring, logging, error reporting, tracing and debugging services. Currently known as Google Cloud’s operations suite.
We were working on nagios and found that after our migration, service group summary was not working. You might get below error on the screen and the solution...
Zabbix history table gets really big, and if you are in a situation where you want to clean it up. Then we can do so, using the below steps.
This is a quick reference for few of the commands which I use often.
Migrating an old Nagios backup to a new nagiosxi server. Migration is simple in Nagios XI, you a backup and restore it. Once that is done we need to update/a...
HTTPS is a protocol for secure communication over a computer network which is widely used on the Internet. HTTPS consists of communication over Hypertext Tra...
Nagios monitors your entire IT infrastructure to ensure systems, applications, services, and business processes are functioning properly. In the event of a f...
postgres
and mysql
.
After the zabbix setup is done, we would like to take backup of the database. Here we are doing a complete backup. But if we are looking for only configurati...
There might be situations where you have a remote/zabbix server which does not have internet connectivity, due to security or other reasons. So we create a ...
In zabbix we dont have a better way to capture snmptraps. we have to manually create an item and corresponfding trigger to handle a trap arriving from the de...
zabbix-extra
on Zabbix Version 2.4.
SNMP Builder/Extra is an add-in for Zabbix. It provides new web interface components to browse MIB trees and values. SNMP OIDs can then be automatically conv...
zabbix-java-gateway
on Centos 6.5
Zabbix 2.0 adds native support for monitoring JMX applications by introducing a new Zabbix daemon called Zabbix Java gateway. Zabbix Java gateway is a daemon...
Zabbix is the ultimate enterprise-level software designed for monitoring availability and performance of IT infrastructure components. Zabbix is open source ...
This script can be used to monitor Namenode Parameters. This script can be used to Generate Zabbix Import XML or Send monitoring data to Zabbix server.
This section describes how to get monitoring data from Zabbix server through the command line and export these to a CSV formatted file.
This script is to create a xml import file from a csv file. We use 2 csv file for this.
Send Zabbix metric by reading from a XML file. This needs package zbxsend from : https://github.com/ahmedzbyr/zbxsend
Module used to generate Zabbix import xml for snmp traps, Below are steps to create a Zabbix Import for Traps. These traps are captured from the snmptrap fun...
parted
.
GUID Partition Table (GPT) is a standard for the layout of the partition table on a physical hard disk, using globally unique identifiers (GUID). Although it...
A Network File System (NFS) allows remote hosts to mount file systems over a network and interact with those file systems as though they are mounted locally....
Current IP of the server : 10.130.18.11. Virtual IPs being assigned to server : 10.130.18.22, 10.130.18.23, 10.130.18.24 to our server.. Go to network-scrip...
How to move an existing data directory in mysql to a new location. We were running out of space and had to move the existing data directory to a new drive. B...
We were running out of disk space on one of the databases server, we need to get information on what the current table/database usage was. Below are few comm...
postgres
and mysql
.
After the zabbix setup is done, we would like to take backup of the database. Here we are doing a complete backup. But if we are looking for only configurati...
The SpagoBI project is a free software/open source initiative by the SpagoBI Labs of Engineering Group. It aims to realize the most complete 100% open source...
We were working on nagios and found that after our migration, service group summary was not working. You might get below error on the screen and the solution...
Zabbix history table gets really big, and if you are in a situation where you want to clean it up. Then we can do so, using the below steps.
Monitoring for mongodb using nagiosxi is straight forword but you might have some issues when we are setting up. Here are few issues which might come up usin...
CouchDB is a database that completely embraces the web. Store your data with JSON documents. Access your documents and query your indexes with your web brows...
MongoDB is an open-source document database, and leading NoSQL database. MongoDB is written in c++. Below is a brief document about installing a mongodb on a...
This is a quick reference for few of the commands which I use often.
When we were working on a nagios monitoring system we were migrating from a 32bit nagios to a 64bit Architecture. Most of the graphs are not working as the R...
Migrating an old Nagios backup to a new nagiosxi server. Migration is simple in Nagios XI, you a backup and restore it. Once that is done we need to update/a...
HTTPS is a protocol for secure communication over a computer network which is widely used on the Internet. HTTPS consists of communication over Hypertext Tra...
Nagios monitors your entire IT infrastructure to ensure systems, applications, services, and business processes are functioning properly. In the event of a f...
postgres
and mysql
.
After the zabbix setup is done, we would like to take backup of the database. Here we are doing a complete backup. But if we are looking for only configurati...
There might be situations where you have a remote/zabbix server which does not have internet connectivity, due to security or other reasons. So we create a ...
In zabbix we dont have a better way to capture snmptraps. we have to manually create an item and corresponfding trigger to handle a trap arriving from the de...
zabbix-extra
on Zabbix Version 2.4.
SNMP Builder/Extra is an add-in for Zabbix. It provides new web interface components to browse MIB trees and values. SNMP OIDs can then be automatically conv...
zabbix-java-gateway
on Centos 6.5
Zabbix 2.0 adds native support for monitoring JMX applications by introducing a new Zabbix daemon called Zabbix Java gateway. Zabbix Java gateway is a daemon...
Zabbix is the ultimate enterprise-level software designed for monitoring availability and performance of IT infrastructure components. Zabbix is open source ...
This script can be used to monitor Namenode Parameters. This script can be used to Generate Zabbix Import XML or Send monitoring data to Zabbix server.
This section describes how to get monitoring data from Zabbix server through the command line and export these to a CSV formatted file.
This script is to create a xml import file from a csv file. We use 2 csv file for this.
Send Zabbix metric by reading from a XML file. This needs package zbxsend from : https://github.com/ahmedzbyr/zbxsend
Module used to generate Zabbix import xml for snmp traps, Below are steps to create a Zabbix Import for Traps. These traps are captured from the snmptrap fun...
We were working on nagios and found that after our migration, service group summary was not working. You might get below error on the screen and the solution...
Zabbix history table gets really big, and if you are in a situation where you want to clean it up. Then we can do so, using the below steps.
Monitoring for mongodb using nagiosxi is straight forword but you might have some issues when we are setting up. Here are few issues which might come up usin...
Migrating an old Nagios backup to a new nagiosxi server. Migration is simple in Nagios XI, you a backup and restore it. Once that is done we need to update/a...
HTTPS is a protocol for secure communication over a computer network which is widely used on the Internet. HTTPS consists of communication over Hypertext Tra...
Nagios monitors your entire IT infrastructure to ensure systems, applications, services, and business processes are functioning properly. In the event of a f...
Checking port available on a remote machine using nc command instead of telnet. Same command can be used to check on a remote server as well, change the 127....
neo4j_doc_manager
.
We had a requirement where we wanted to have all the data which is in mongodb to be replicated on neo4j to show few graphs. Here is quick way to demonstrate ...
iptables
- Port
forwarding from one interface to another.
iptables is a user-space application program that allows a system administrator to configure the tables provided by the Linux kernel firewall (implemented as...
eth1
to eth0
on EDGE
node.
Adding route to all the slaves which reside on a private network to communicate with External Server directly using an EDGE node using Interface Forwarding.
Current IP of the server : 10.130.18.11. Virtual IPs being assigned to server : 10.130.18.22, 10.130.18.23, 10.130.18.24 to our server.. Go to network-scrip...
A Network File System (NFS) allows remote hosts to mount file systems over a network and interact with those file systems as though they are mounted locally....
HAProxy is the Reliable, High Performance TCP/HTTP Load Balancer and it works nicely with Deveo Cluster setup.
nginx
Nginx (pronounced engine x) is a web server with a strong focus on high concurrency, performance and low memory usage. It can also act as a reverse proxy ser...
heartbeat
for nginx
.
Heartbeat is a daemon that provides cluster infrastructure (communication and membership) services to its clients. This allows clients to know about the pres...
NGINX
for HTTP
load balancing.
NGINX is a free, open-source, high-performance HTTP server and reverse proxy, as well as an IMAP/POP3 proxy server. NGINX is known for its high performance, ...
npm
behind a proxy
npm stands for Node Package Manager, and is the default package manager for the JavaScript runtime environment Node.js.
Below are the list required to integrate NodeJS and Kafka. This is a simple HOWTO to get started.
Kafka-node is a Node.js client with Zookeeper integration for Apache Kafka 0.8.1 and later. The Zookeeper integration does the following jobs:
kafka-node
Now that we have Kafka and NodeJS ready. Lets some data to our Kafka Cluster.
What we are trying to achieve ?
Node.js is an open-source, cross-platform runtime environment for developing server-side web applications. Node.js applications are written in JavaScript and...
Have updated and explanation in the conf file below.
npm
behind a proxy
npm stands for Node Package Manager, and is the default package manager for the JavaScript runtime environment Node.js.
null_resource
with local-exec
and remote-exec
in Terraform
Terraform, a widely adopted Infrastructure as Code (IaC) tool, offers a rich array of resources and provisioners for efficiently managing and configuring inf...
There might be situations where you have a remote/zabbix server which does not have internet connectivity, due to security or other reasons. So we create a ...
In zabbix we dont have a better way to capture snmptraps. we have to manually create an item and corresponfding trigger to handle a trap arriving from the de...
This is a quick reference for few of the commands which I use often.
Migrating an old Nagios backup to a new nagiosxi server. Migration is simple in Nagios XI, you a backup and restore it. Once that is done we need to update/a...
CMAN v3 is a Corsync plugin that monitors the names and number of active cluster nodes in order to deliver membership and quorum information to clients (such...
Ensuring consistency and streamlining infrastructure provisioning is crucial for effective cloud management. Creating custom virtual machine (VM) images on G...
The System Security Services Daemon (SSSD) provides a set of daemons to manage access to remote directories and authentication mechanisms. It provides Name S...
parted
.
GUID Partition Table (GPT) is a standard for the layout of the partition table on a physical hard disk, using globally unique identifiers (GUID). Although it...
Pentaho Data Integration (PDI) is a suite of open source Business Intelligence (BI) products which provide data integration, OLAP services, reporting, dashb...
Have updated and explanation in the conf file below.
sysctl is an interface that allows you to make changes to a running Linux kernel. With /etc/sysctl.conf you can configure various Linux networking and system...
sysctl.conf
parameters.
Performace tuning Hadoop at kernel level. sysctl is an interface that allows you to make changes to a running Linux kernel. With /etc/sysctl.conf you can con...
nginx
Nginx (pronounced engine x) is a web server with a strong focus on high concurrency, performance and low memory usage. It can also act as a reverse proxy ser...
Tsung (formerly known as idx-Tsunami) is a stress testing tool written in the Erlang language and distributed under the GPL license. It can currently stress ...
Siege is an http load testing and benchmarking utility. It was designed to let web developers measure their code under duress, to see how it will stand up to...
Using HBase in production often requires that you turn many knobs to make it hum as expected. More Here http://hbase.apache.org/0.94/book/performance.html
Got into this issue on the server, when trying to install mysql-server.
Data is invaluable, and as any developer would testify, ensuring its security and availability in the event of unintended deletion or modification is paramou...
iptables
- Port
forwarding from one interface to another.
iptables is a user-space application program that allows a system administrator to configure the tables provided by the Linux kernel firewall (implemented as...
In this blog post we will be setting up a postgres database on docker using Dockerfile.
In this blog post we will be setting up a postgres database on docker.
postgres
and mysql
.
After the zabbix setup is done, we would like to take backup of the database. Here we are doing a complete backup. But if we are looking for only configurati...
Welcome back to our series on datastream workflows! In our previous article, we explored the basics of setting up a datastream workflow.
squid
as a sibling to an already existing Parent squid
.
Squid is a caching and forwarding web proxy. It has a wide variety of uses, from speeding up a web server by caching repeated requests; to caching web, DNS a...
HAProxy is the Reliable, High Performance TCP/HTTP Load Balancer and it works nicely with Deveo Cluster setup.
npm
behind a proxy
npm stands for Node Package Manager, and is the default package manager for the JavaScript runtime environment Node.js.
In this blog post, we will venture into exporting data from Firestore and Datastore modes. Find the code on Github.
Data is invaluable, and as any developer would testify, ensuring its security and availability in the event of unintended deletion or modification is paramou...
Security is paramount when setting up resources in the cloud or on-premises. It encompasses various layers of protection, including network security, encrypt...
null_resource
with local-exec
and remote-exec
in Terraform
Terraform, a widely adopted Infrastructure as Code (IaC) tool, offers a rich array of resources and provisioners for efficiently managing and configuring inf...
BigQuery, Google’s fully-managed and serverless data warehouse, empowers organizations to analyze massive datasets with remarkable speed and efficiency. But ...
Google BigQuery is a powerful and cost-effective data warehouse that enables you to analyze massive datasets quickly. However, to get the most out of BigQuer...
Managing resources in your Google Cloud Platform (GCP) projects is a complex task, especially when dealing with multiple projects. Fortunately, Google offers...
In this blog post we will be setting up a postgres database on docker using Dockerfile.
Python virtual environment creates a isoloated workspace of python work. This helps in creating project specific virtual environment without worrying about ...
In this blog post we will be setting up a postgres database on docker.
List Comprehensions provides easy and functional way to create list in python. We could make a single line of code which otherwise would take a few lines. l...
Python is a clear and powerful object-oriented programming language, comparable to Perl, Ruby, Scheme, or Java. This is basic documentation for getting star...
neo4j_doc_manager
.
We had a requirement where we wanted to have all the data which is in mongodb to be replicated on neo4j to show few graphs. Here is quick way to demonstrate ...
pyyaml
[Python]
PyYAML - The next generation YAML parser and emitter for Python.
I was working on getting post headers for my post on this blog. I had couple of images from unsplash. But the header for the post need to be a little more ho...
python 2.7.x
on Centos 6.5/6.6
By default centos comes with python 2.6. In most of the cases we might need python 2.7 or later to be installed. Below are few ways to install python 2.7 on ...
This script can be used to monitor Namenode Parameters. This script can be used to Generate Zabbix Import XML or Send monitoring data to Zabbix server.
This section describes how to get monitoring data from Zabbix server through the command line and export these to a CSV formatted file.
This script is to create a xml import file from a csv file. We use 2 csv file for this.
Send Zabbix metric by reading from a XML file. This needs package zbxsend from : https://github.com/ahmedzbyr/zbxsend
Easy way to collect files recursively over a sftp server is to connect to the server over scp and do scp -r. Problem was that the device we were connecting ...
Apache Kafka is publish-subscribe messaging rethought as a distributed commit log. Kafka is a distributed, partitioned, replicated commit log service. It pro...
parted
.
GUID Partition Table (GPT) is a standard for the layout of the partition table on a physical hard disk, using globally unique identifiers (GUID). Although it...
In this blog post, we will venture into exporting data from Firestore and Datastore modes. Find the code on Github.
Data is invaluable, and as any developer would testify, ensuring its security and availability in the event of unintended deletion or modification is paramou...
Apache Kafka is a distributed streaming platform. Kafka 2.0 supports Kerberos authentication, Enabling Kerberos Authentication Using the Wizard on cloudera m...
Deployment error during setting zookeeper on a new cluster using Cloudera Manager UI, when an earlier cluster containing zookeeper earlier was built using Cl...
We were deploying one of the cluster on our lab environment which is used by everyone. So the lab has it own share of stale information on it.
We had a cluster environment which we needed to update the /etc/hosts file. Which would help communicate between the server over a private network. Our serve...
heartbeat
for nginx
.
Heartbeat is a daemon that provides cluster infrastructure (communication and membership) services to its clients. This allows clients to know about the pres...
mod_jk
and httpd
on Centos - In-Memory Session Replication.
In an earlier post we did a basic session based replication, but the session was not redundant.
This is a quick reference for few of the commands which I use often.
Migrating an old Nagios backup to a new nagiosxi server. Migration is simple in Nagios XI, you a backup and restore it. Once that is done we need to update/a...
Nagios monitors your entire IT infrastructure to ensure systems, applications, services, and business processes are functioning properly. In the event of a f...
RPM (Redhat Package Manager) is the most popular package utility and is used mostly on RHEL, Centos and Fedora. RPM helps user/admins to build, install, que...
Setting up yum repos on RHEL using httpd. We will be setting up httpd and yum repo on top of it. So that we can access yum over http.
If using access_provider = ldap, this option is mandatory. It specifies an LDAP search filter criteria that must be met for the user to be granted access on ...
There are inherent structural differences between how Windows and Linux handle system users. The user schemas used in Active Directory and standard LDAPv3 di...
There are inherent structural differences between how Windows and Linux handle system users. The user schemas used in Active Directory and standard LDAPv3 di...
ansible
on RHEL 6.6.
Ansible is a radically simple IT automation engine that automates cloud provisioning, configuration management, application deployment, intra-service orchest...
You can enter a linux system without entring a password using below steps. We will be creating a ssh jey which will be share between the servers, which will ...
The Apache Cassandra database is the right choice when you need scalability and high availability without compromising performance. Linear scalability and pr...
KVM is a kernel-based Virutal Machine which grows quickly in maturity and popularity in the Linux server market. Red Hat officially dropped Xen in favor of K...
zabbix-java-gateway
on Centos 6.5
Zabbix 2.0 adds native support for monitoring JMX applications by introducing a new Zabbix daemon called Zabbix Java gateway. Zabbix Java gateway is a daemon...
Node.js is an open-source, cross-platform runtime environment for developing server-side web applications. Node.js applications are written in JavaScript and...
Default passwd is set as username@123 As we are setting the chage, user needs to change the passwd on first login.
Got into this issue on the server, when trying to install mysql-server.
RPM (Redhat Package Manager) is the most popular package utility and is used mostly on RHEL, Centos and Fedora. RPM helps user/admins to build, install, que...
When we were working on a nagios monitoring system we were migrating from a 32bit nagios to a 64bit Architecture. Most of the graphs are not working as the R...
Was setting up jekyll using docker, but was not able to install ruby on the docker machine as apt-add-repository was not working. Below is my Dockerfile when...
TL;DR In this post we will be setting up a scheduled job to take backup for Bigtable table in avro format.
Easy way to collect files recursively over a sftp server is to connect to the server over scp and do scp -r. Problem was that the device we were connecting ...
The System Security Services Daemon (SSSD) provides a set of daemons to manage access to remote directories and authentication mechanisms. It provides Name S...
If using access_provider = ldap, this option is mandatory. It specifies an LDAP search filter criteria that must be met for the user to be granted access on ...
Apache Hive is a powerful data warehousing application built on top of Hadoop; it enables you to access your data using Hive QL, a language that is similar t...
There are inherent structural differences between how Windows and Linux handle system users. The user schemas used in Active Directory and standard LDAPv3 di...
There are inherent structural differences between how Windows and Linux handle system users. The user schemas used in Active Directory and standard LDAPv3 di...
Tsung (formerly known as idx-Tsunami) is a stress testing tool written in the Erlang language and distributed under the GPL license. It can currently stress ...
Siege is an http load testing and benchmarking utility. It was designed to let web developers measure their code under duress, to see how it will stand up to...
Got into this issue on the server, when trying to install mysql-server.
Easy way to collect files recursively over a sftp server is to connect to the server over scp and do scp -r. Problem was that the device we were connecting ...
postgres
and mysql
.
After the zabbix setup is done, we would like to take backup of the database. Here we are doing a complete backup. But if we are looking for only configurati...
There might be situations where you have a remote/zabbix server which does not have internet connectivity, due to security or other reasons. So we create a ...
In zabbix we dont have a better way to capture snmptraps. we have to manually create an item and corresponfding trigger to handle a trap arriving from the de...
zabbix-extra
on Zabbix Version 2.4.
SNMP Builder/Extra is an add-in for Zabbix. It provides new web interface components to browse MIB trees and values. SNMP OIDs can then be automatically conv...
Receiving SNMP traps is the opposite to querying SNMP-enabled devices. In this case the information is sent from a SNMP-enabled device and is collected or “t...
The SpagoBI project is a free software/open source initiative by the SpagoBI Labs of Engineering Group. It aims to realize the most complete 100% open source...
Setting up SpagoBI 5.1 on Tomcat with MySQL post. Details on Tomcat Clustering.
mod_jk
and httpd
on Centos - In-Memory Session Replication.
In an earlier post we did a basic session based replication, but the session was not redundant.
Setting up SpagoBI 5.1 on Tomcat with MySQL post. Details on Tomcat Clustering.
The SpagoBI project is a free software/open source initiative by the SpagoBI Labs of Engineering Group. It aims to realize the most complete 100% open source...
Apache Spark is an in-memory distributed data analysis platform– primarily targeted at speeding up batch analysis jobs, iterative machine learning jobs, inte...
Cloud SQL is a fully managed service which includes MySQL, PostgreSQL, Microsoft SQL Server. And for horizontal scalability, consider using Cloud Spanner.
squid
as a sibling to an already existing Parent squid
.
Squid is a caching and forwarding web proxy. It has a wide variety of uses, from speeding up a web server by caching repeated requests; to caching web, DNS a...
HTTPS is a protocol for secure communication over a computer network which is widely used on the Internet. HTTPS consists of communication over Hypertext Tra...
The System Security Services Daemon (SSSD) provides a set of daemons to manage access to remote directories and authentication mechanisms. It provides Name S...
If using access_provider = ldap, this option is mandatory. It specifies an LDAP search filter criteria that must be met for the user to be granted access on ...
There are inherent structural differences between how Windows and Linux handle system users. The user schemas used in Active Directory and standard LDAPv3 di...
There are inherent structural differences between how Windows and Linux handle system users. The user schemas used in Active Directory and standard LDAPv3 di...
The System Security Services Daemon (SSSD) provides a set of daemons to manage access to remote directories and authentication mechanisms. It provides Name S...
Cloud Storage is Google Cloud’s object storage service, and it allows worldwide storage and retrieval of any amount of data at anytime.
This is a simple Storm Cluster Setup. We are using a dedicated Zookeeper Cluster/Node, instead of the standalone zkserver. Below is how we will deploy our c...
Apache Spark is an in-memory distributed data analysis platform– primarily targeted at speeding up batch analysis jobs, iterative machine learning jobs, inte...
Swappiness is a Linux kernel parameter that controls the relative weight given to swapping out runtime memory, as opposed to dropping pages from the system p...
Swappiness is a Linux kernel parameter that controls the relative weight given to swapping out runtime memory, as opposed to dropping pages from the system p...
Have updated and explanation in the conf file below.
sysctl is an interface that allows you to make changes to a running Linux kernel. With /etc/sysctl.conf you can configure various Linux networking and system...
sysctl.conf
parameters.
Performace tuning Hadoop at kernel level. sysctl is an interface that allows you to make changes to a running Linux kernel. With /etc/sysctl.conf you can con...
Swappiness is a Linux kernel parameter that controls the relative weight given to swapping out runtime memory, as opposed to dropping pages from the system p...
tarball
.
Setting up Hadoop using Ansible, we will be using cdh5 tarball for installation of the cluster.
Welcome back to our series on datastream workflows! In our previous article, we explored the basics of setting up a datastream workflow.
This guide provides detailed instructions on setting up a datastream connection from Cloud SQL (MySQL) to BigQuery. We aim to simplify the process by using d...
We will walk you through the process of setting up HashiCorp Vault to effectively manage your secrets. This guide is an gettinf started with vault with terra...
Data is invaluable, and as any developer would testify, ensuring its security and availability in the event of unintended deletion or modification is paramou...
Ensuring consistency and streamlining infrastructure provisioning is crucial for effective cloud management. Creating custom virtual machine (VM) images on G...
Using Ansible within the initialization script of a Google Compute Engine (GCE) instance can streamline and automate the configuration and setup of your virt...
Security is paramount when setting up resources in the cloud or on-premises. It encompasses various layers of protection, including network security, encrypt...
null_resource
with local-exec
and remote-exec
in Terraform
Terraform, a widely adopted Infrastructure as Code (IaC) tool, offers a rich array of resources and provisioners for efficiently managing and configuring inf...
BigQuery, Google’s fully-managed and serverless data warehouse, empowers organizations to analyze massive datasets with remarkable speed and efficiency. But ...
Google BigQuery is a powerful and cost-effective data warehouse that enables you to analyze massive datasets quickly. However, to get the most out of BigQuer...
Managing resources in your Google Cloud Platform (GCP) projects is a complex task, especially when dealing with multiple projects. Fortunately, Google offers...
null_resource
When working with Terraform, it’s crucial to ensure that your infrastructure follows certain conditions or constraints. While Terraform provides validation m...
Validation in Terraform is an essential practice to detect and prevent errors early in the infrastructure provisioning process. By incorporating robust valid...
Kerberos authentication in Google Dataproc clusters offers enhanced security and user isolation. It’s a crucial step for multi-tenancy, encryption, and user ...
TL;DR In this post we will be setting up a scheduled job to take backup for Bigtable table in avro format.
These are a set of testing which we can do on a Hadoop environment. These are basic testing to make sure the environment is setup correctly.
Kitchen-Vagrant has the capability to spin up a windows instance for testing. To make it work you will need the vagrant-winrm to be installted on the worksta...
Tsung (formerly known as idx-Tsunami) is a stress testing tool written in the Erlang language and distributed under the GPL license. It can currently stress ...
Siege is an http load testing and benchmarking utility. It was designed to let web developers measure their code under duress, to see how it will stand up to...
Changing Timezone in Ubuntu server.
The SpagoBI project is a free software/open source initiative by the SpagoBI Labs of Engineering Group. It aims to realize the most complete 100% open source...
mod_jk
and httpd
on Centos - In-Memory Session Replication.
In an earlier post we did a basic session based replication, but the session was not redundant.
Setting up SpagoBI 5.1 on Tomcat with MySQL post. Details on Tomcat Clustering.
postgres
and mysql
.
After the zabbix setup is done, we would like to take backup of the database. Here we are doing a complete backup. But if we are looking for only configurati...
There might be situations where you have a remote/zabbix server which does not have internet connectivity, due to security or other reasons. So we create a ...
In zabbix we dont have a better way to capture snmptraps. we have to manually create an item and corresponfding trigger to handle a trap arriving from the de...
zabbix-extra
on Zabbix Version 2.4.
SNMP Builder/Extra is an add-in for Zabbix. It provides new web interface components to browse MIB trees and values. SNMP OIDs can then be automatically conv...
Receiving SNMP traps is the opposite to querying SNMP-enabled devices. In this case the information is sent from a SNMP-enabled device and is collected or “t...
Tsung (formerly known as idx-Tsunami) is a stress testing tool written in the Erlang language and distributed under the GPL license. It can currently stress ...
Siege is an http load testing and benchmarking utility. It was designed to let web developers measure their code under duress, to see how it will stand up to...
Have updated and explanation in the conf file below.
sysctl is an interface that allows you to make changes to a running Linux kernel. With /etc/sysctl.conf you can configure various Linux networking and system...
sysctl.conf
parameters.
Performace tuning Hadoop at kernel level. sysctl is an interface that allows you to make changes to a running Linux kernel. With /etc/sysctl.conf you can con...
nginx
Nginx (pronounced engine x) is a web server with a strong focus on high concurrency, performance and low memory usage. It can also act as a reverse proxy ser...
Kitchen-Vagrant has the capability to spin up a windows instance for testing. To make it work you will need the vagrant-winrm to be installted on the worksta...
Monitoring for mongodb using nagiosxi is straight forword but you might have some issues when we are setting up. Here are few issues which might come up usin...
CouchDB is a database that completely embraces the web. Store your data with JSON documents. Access your documents and query your indexes with your web brows...
MongoDB is an open-source document database, and leading NoSQL database. MongoDB is written in c++. Below is a brief document about installing a mongodb on a...
Data Bags are a way to store information on the chef-server which all the cookbooks can access. Few more additional advantages are that we can encrypt the da...
GNU find searches the directory tree rooted at each given file name by evaluating the given expression from left to right, according to the rules of preceden...
When we were working on a nagios monitoring system we were migrating from a 32bit nagios to a 64bit Architecture. Most of the graphs are not working as the R...
HBase is an open source, non-relational, distributed database modeled after Google’s BigTable and written in Java. It is developed as part of Apache Software...
Was installing Java today, this is an easy install thanks to ppa:webupd8team/java, but when I tried it was not working, but has worked for me all this while.
Changing Timezone in Ubuntu server.
The Java Development Kit (JDK) is an implementation of either one of the Java SE, Java EE or Java ME platforms released by Oracle Corporation in the form of ...
Upgrading Cloudera Manager 5 to the Latest Cloudera Manager, In most cases it is possible to complete the following upgrade without shutting down most CDH se...
Default passwd is set as username@123 As we are setting the chage, user needs to change the passwd on first login.
We will walk you through the process of setting up HashiCorp Vault to effectively manage your secrets. This guide is an gettinf started with vault with terra...
Current IP of the server : 10.130.18.11. Virtual IPs being assigned to server : 10.130.18.22, 10.130.18.23, 10.130.18.24 to our server.. Go to network-scrip...
KVM is a kernel-based Virutal Machine which grows quickly in maturity and popularity in the Linux server market. Red Hat officially dropped Xen in favor of K...
Not enough physical memory is available to power this virtual machine
If you are using VMWare Workstation (or VMWare player) on Windows 8.1 and have just update Windows, specifically KB2995388, you may receive this error messag...
KVM is a kernel-based Virutal Machine which grows quickly in maturity and popularity in the Linux server market. Red Hat officially dropped Xen in favor of K...
Not enough physical memory is available to power this virtual machine
If you are using VMWare Workstation (or VMWare player) on Windows 8.1 and have just update Windows, specifically KB2995388, you may receive this error messag...
Setting up yum repos on RHEL using httpd. We will be setting up httpd and yum repo on top of it. So that we can access yum over http.
Below are the list required to integrate NodeJS and Kafka. This is a simple HOWTO to get started.
Kafka-node is a Node.js client with Zookeeper integration for Apache Kafka 0.8.1 and later. The Zookeeper integration does the following jobs:
kafka-node
Now that we have Kafka and NodeJS ready. Lets some data to our Kafka Cluster.
What we are trying to achieve ?
Node.js is an open-source, cross-platform runtime environment for developing server-side web applications. Node.js applications are written in JavaScript and...
Have updated and explanation in the conf file below.
sysctl is an interface that allows you to make changes to a running Linux kernel. With /etc/sysctl.conf you can configure various Linux networking and system...
Pandoc is an opensource utility to create documents from markdown. We can create PDF, Doc, doc, html and other formats. And can be also used to convert html ...
Kitchen-Vagrant has the capability to spin up a windows instance for testing. To make it work you will need the vagrant-winrm to be installted on the worksta...
After a longtime I was on my windows machine and had to make it feel more like my linux machine. So install the thing what everyone else does cygwin. Surpise...
knife-windows plugin adds additional functionality to the Chef Knife CLI tool for configuring / interacting with nodes running Microsoft Windows.
Host file contains IP followed by the FQDN which can be used to reach that IP address. Host file takes precedence over your DNS servers. In Microsoft operati...
Was working on a monitoring project, need to create a large file to test notifications. Here is how we can do that.
The Chef Development Kit (ChefDK) brings the best-of-breed development tools built by the awesome Chef community to your workstation with just a few clicks. ...
Not enough physical memory is available to power this virtual machine
If you are using VMWare Workstation (or VMWare player) on Windows 8.1 and have just update Windows, specifically KB2995388, you may receive this error messag...
This script can be used to monitor Namenode Parameters. This script can be used to Generate Zabbix Import XML or Send monitoring data to Zabbix server.
Send Zabbix metric by reading from a XML file. This needs package zbxsend from : https://github.com/ahmedzbyr/zbxsend
YARN
distcp
.
Usually distcp and other batch jobs run for a very long time. This is fine if you are running a HADOOP environment without kerberos. When we kerberize a clus...
distcp
.
When you see this error default thing to do is the set the HADOOP_CLIENT_OPTS config on your env. In My case did a typo HADDOP_CLIENT_OPTS which caused us sp...
HBase is an open source, non-relational, distributed database modeled after Google’s BigTable and written in Java. It is developed as part of Apache Software...
Setting up yum repos on RHEL using httpd. We will be setting up httpd and yum repo on top of it. So that we can access yum over http.
We were working on nagios and found that after our migration, service group summary was not working. You might get below error on the screen and the solution...
Zabbix history table gets really big, and if you are in a situation where you want to clean it up. Then we can do so, using the below steps.
postgres
and mysql
.
After the zabbix setup is done, we would like to take backup of the database. Here we are doing a complete backup. But if we are looking for only configurati...
There might be situations where you have a remote/zabbix server which does not have internet connectivity, due to security or other reasons. So we create a ...
In zabbix we dont have a better way to capture snmptraps. we have to manually create an item and corresponfding trigger to handle a trap arriving from the de...
zabbix-extra
on Zabbix Version 2.4.
SNMP Builder/Extra is an add-in for Zabbix. It provides new web interface components to browse MIB trees and values. SNMP OIDs can then be automatically conv...
Receiving SNMP traps is the opposite to querying SNMP-enabled devices. In this case the information is sent from a SNMP-enabled device and is collected or “t...
zabbix-java-gateway
on Centos 6.5
Zabbix 2.0 adds native support for monitoring JMX applications by introducing a new Zabbix daemon called Zabbix Java gateway. Zabbix Java gateway is a daemon...
Zabbix is the ultimate enterprise-level software designed for monitoring availability and performance of IT infrastructure components. Zabbix is open source ...
This script can be used to monitor Namenode Parameters. This script can be used to Generate Zabbix Import XML or Send monitoring data to Zabbix server.
This section describes how to get monitoring data from Zabbix server through the command line and export these to a CSV formatted file.
This script is to create a xml import file from a csv file. We use 2 csv file for this.
Send Zabbix metric by reading from a XML file. This needs package zbxsend from : https://github.com/ahmedzbyr/zbxsend
Module used to generate Zabbix import xml for snmp traps, Below are steps to create a Zabbix Import for Traps. These traps are captured from the snmptrap fun...
zabbix-java-gateway
on Centos 6.5
Zabbix 2.0 adds native support for monitoring JMX applications by introducing a new Zabbix daemon called Zabbix Java gateway. Zabbix Java gateway is a daemon...
Send Zabbix metric by reading from a XML file. This needs package zbxsend from : https://github.com/ahmedzbyr/zbxsend
Deployment error during setting zookeeper on a new cluster using Cloudera Manager UI, when an earlier cluster containing zookeeper earlier was built using Cl...
This is the second follow up post. In the earlier post
This is the second follow up post. In the earlier post Setting Up Cloudera Manager Services Using Cloudera API [Part 1] we install the cloudera management se...
tarball
.
This is a simple zookeeper playbook, to quickly start zookeeper running on a single or more nodes, in a clustered mode.
This is a simple Storm Cluster Setup. We are using a dedicated Zookeeper Cluster/Node, instead of the standalone zkserver. Below is how we will deploy our c...
This is a simple Kafka setup. In this setup we are running kafka over a dedicated zookeeper service. (NOT the standalone zookeeper which comes with kafka)