Oozie4.3 一 簡介 1 官網 http://oozie.apache.org/ Apache Oozie Workflow Scheduler for Hadoop Hadoop生態的工作流調度器 Overview Oozie is a workflow ...
Azkaban . 一 簡介 官網 https: azkaban.github.io Azkaban was implemented at LinkedIn to solve the problem of Hadoop job dependencies. We had jobs that needed to run in order, from ETL jobs to data analytics ...
2018-11-02 11:09 0 680 推薦指數:
Oozie4.3 一 簡介 1 官網 http://oozie.apache.org/ Apache Oozie Workflow Scheduler for Hadoop Hadoop生態的工作流調度器 Overview Oozie is a workflow ...
概括 Azkaban是一個非常輕量的開源調度框架,適合二次開發,但是無法直接用於生產環境,存在致命缺陷(比如AzkabanWebServer是單點,1年多時間沒有修復),在一些情景下的行為簡單粗暴(比如重啟AzkabanExecutorServer會導致該server上正在運行的所有流程fail ...
https://drill.apache.org/ 一 簡介 Drill is an Apache open-source SQL query engine for Big Data exploration. Drill is designed from the ground ...
presto 0.217 官方:http://prestodb.github.io/ 一 簡介 Presto is an open source distributed SQL query engine for running interactive analytic ...
官方:http://ambari.apache.org/ The Apache Ambari project is aimed at making Hadoop management simpl ...
kudu 1.7 官方:https://kudu.apache.org/ 一 簡介 kudu有很多概念,有分布式文件系統(HDFS),有一致性算法(Zookeeper),有Table(Hive Table),有Tablet(Hive Table Partition),有列式存儲 ...
impala2.12 官方:http://impala.apache.org/ 一 簡介 Apache Impala is the open source, native analytic database for Apache Hadoop. Impala is shipped ...
Flink 1.7 官方:https://flink.apache.org/ 一 簡介 Apache Flink is an open source platform for distributed stream and batch data processing. ...