Tecmint: Linux Howtos, Tutorials & Guides

Hadoop

How to Install and Configure Hive with High Availability – Part 7

Hive is a data warehouse layer in the Hadoop ecosystem. It can serve as an ETL tool on top of Hadoop. Enabling High Availability (HA) on Hive is not the same as what we do in Master ...

How to Set Up High Availability for Resource Manager – Part 6

YARN is the processing layer of Hadoop. It consists of the Master (Resource Manager) and Slave (Node Manager) services that process the data. The Resource Manager (RM) is the critical component that is responsible for ...

How to Set Up High Availability for Namenode – Part 5

Hadoop has two core components: HDFS and YARN. HDFS stores the data, and YARN processes it. HDFS is the Hadoop Distributed File System, and it has Namenode as Master Service ...

How to Install CDH and Configure Service Placements on CentOS/RHEL 7 – Part 4

In an earlier article, we explained the installation of Cloudera Manager. In this article, you will learn how to install and configure CDH (Cloudera Distribution Hadoop) on RHEL/CentOS 7. While installing the CDH ...

How to Install and Configure Cloudera Manager on CentOS/RHEL 7 – Part 3

In this article, we describe the step-by-step process to install Cloudera Manager as per industry practices. In Part 2, we already went through the Cloudera prerequisites; make sure all the servers ...

Setting Up Hadoop Pre-requisites and Security Hardening – Part 2

Hadoop cluster building is a step-by-step process that starts with purchasing the required servers, mounting them in the rack, cabling, etc., and placing them in the datacentre. Then we need to install the ...

Best Practices for Deploying Hadoop Server on CentOS/RHEL 7 – Part 1

In this series of articles, we are going to cover the entire Cloudera Hadoop cluster building process with vendor- and industry-recommended best practices. Part 1: Best Practices for Deploying Hadoop Server on CentOS/RHEL ...

How to Install Hadoop Single Node Cluster (Pseudonode) on CentOS 7

Hadoop is an open-source framework that is widely used to deal with Big Data. Most Big Data and data analytics projects are built on top of the Hadoop ecosystem. It consists of two layers, one ...

How to Install and Configure Apache Hadoop on a Single Node in CentOS 7

Apache Hadoop is an open source framework built for distributed Big Data storage and processing across computer clusters. The project is based on the following components: Hadoop Common – it contains the Java ...

Install and Configure Apache Oozie Workflow Scheduler for CDH 4.X on RHEL/CentOS 6/5

Oozie is an open source scheduler for Hadoop that simplifies workflow and coordination between jobs. We can define dependencies between jobs for input data and hence automate job dependencies using the Oozie scheduler.


Tecmint: Linux Howtos, Tutorials & Guides © 2022. All Rights Reserved.

The material in this site cannot be republished either online or offline, without our permission.