I'm Kuldeep Kulkarni - Crazy about Linux, Hadoop etc open-source technologies!! By profession I'm Senior system engineer and hadoop administrator in well known IT industry since 2011. Always enthusiastic about sharing my knowledge via blogs :)
Oozie is an open source scheduler for Hadoop, it simplifies workflow and coordination between jobs. We can define dependency between jobs for an input data and hence can automate job dependency using ooze scheduler.
Hadoop is an open source programing framework developed by apache to process big data. It uses HDFS (Hadoop Distributed File System) to store the data across all the datanodes in the cluster in a distributive