【原創】大叔經驗分享(99)Atlas簡介、安裝


 

官方:http://atlas.apache.org/

 

一 簡介

元數據管理:數據分類、數據血緣、數據治理

Atlas is a scalable and extensible set of core foundational governance services – enabling enterprises to effectively and efficiently meet their compliance requirements within Hadoop and allows integration with the whole enterprise data ecosystem.

Apache Atlas provides open metadata management and governance capabilities for organizations to build a catalog of their data assets, classify and govern these assets and provide collaboration capabilities around these data assets for data scientists, analysts and the data governance team.

二 安裝

1 下載

# wget https://www.apache.org/dyn/closer.cgi/atlas/2.0.0/apache-atlas-2.0.0-sources.tar.gz

2 編譯

# tar xvf apache-atlas-2.0.0-sources.tar.gz

# cd apache-atlas-2.0.0-sources

# mvn clean -DskipTests install

注意maven和java有最低版本限制

maven3.5.0
java1.8.151

編譯完成

[INFO] ------------------------------------------------------------------------
[INFO] Reactor Summary:
[INFO] 
[INFO] Apache Atlas Server Build Tools 1.0 ................ SUCCESS [  0.460 s]
[INFO] apache-atlas 2.0.0 ................................. SUCCESS [  4.248 s]
[INFO] Apache Atlas Test Utility Tools 2.0.0 .............. SUCCESS [  8.609 s]
[INFO] Apache Atlas Integration 2.0.0 ..................... SUCCESS [ 16.224 s]
[INFO] Apache Atlas Common 2.0.0 .......................... SUCCESS [  6.408 s]
[INFO] Apache Atlas Client 2.0.0 .......................... SUCCESS [  0.340 s]
[INFO] atlas-client-common 2.0.0 .......................... SUCCESS [  5.321 s]
[INFO] atlas-client-v1 2.0.0 .............................. SUCCESS [  5.371 s]
[INFO] Apache Atlas Server API 2.0.0 ...................... SUCCESS [  5.095 s]
[INFO] Apache Atlas Notification 2.0.0 .................... SUCCESS [ 14.133 s]
[INFO] atlas-client-v2 2.0.0 .............................. SUCCESS [ 11.227 s]
[INFO] Apache Atlas Graph Database Projects 2.0.0 ......... SUCCESS [  0.453 s]
[INFO] Apache Atlas Graph Database API 2.0.0 .............. SUCCESS [ 10.108 s]
[INFO] Graph Database Common Code 2.0.0 ................... SUCCESS [  7.904 s]
[INFO] Apache Atlas JanusGraph-HBase2 Module 2.0.0 ........ SUCCESS [  8.784 s]
[INFO] Apache Atlas JanusGraph DB Impl 2.0.0 .............. SUCCESS [ 13.466 s]
[INFO] Apache Atlas Graph Database Implementation Dependencies 2.0.0 SUCCESS [  1.949 s]
[INFO] Apache Atlas Authorization 2.0.0 ................... SUCCESS [  5.845 s]
[INFO] Apache Atlas Repository 2.0.0 ...................... SUCCESS [ 26.568 s]
[INFO] Apache Atlas UI 2.0.0 .............................. SUCCESS [04:17 min]
[INFO] Apache Atlas Web Application 2.0.0 ................. SUCCESS [08:44 min]
[INFO] Apache Atlas Documentation 2.0.0 ................... SUCCESS [01:47 min]
[INFO] Apache Atlas FileSystem Model 2.0.0 ................ SUCCESS [  2.820 s]
[INFO] Apache Atlas Plugin Classloader 2.0.0 .............. SUCCESS [  4.165 s]
[INFO] Apache Atlas Hive Bridge Shim 2.0.0 ................ SUCCESS [01:03 min]
[INFO] Apache Atlas Hive Bridge 2.0.0 ..................... SUCCESS [01:38 min]
[INFO] Apache Atlas Falcon Bridge Shim 2.0.0 .............. SUCCESS [ 53.172 s]
[INFO] Apache Atlas Falcon Bridge 2.0.0 ................... SUCCESS [ 12.770 s]
[INFO] Apache Atlas Sqoop Bridge Shim 2.0.0 ............... SUCCESS [ 29.425 s]
[INFO] Apache Atlas Sqoop Bridge 2.0.0 .................... SUCCESS [ 14.340 s]
[INFO] Apache Atlas Storm Bridge Shim 2.0.0 ............... SUCCESS [ 31.160 s]
[INFO] Apache Atlas Storm Bridge 2.0.0 .................... SUCCESS [  6.921 s]
[INFO] Apache Atlas Hbase Bridge Shim 2.0.0 ............... SUCCESS [  5.872 s]
[INFO] Apache Atlas Hbase Bridge 2.0.0 .................... SUCCESS [ 44.294 s]
[INFO] Apache HBase - Testing Util 2.0.0 .................. SUCCESS [  5.072 s]
[INFO] Apache Atlas Kafka Bridge 2.0.0 .................... SUCCESS [ 12.537 s]
[INFO] Apache Atlas Distribution 2.0.0 .................... SUCCESS [  6.095 s]
[INFO] ------------------------------------------------------------------------
[INFO] BUILD SUCCESS
[INFO] ------------------------------------------------------------------------
[INFO] Total time:  23:53 min
[INFO] Finished at: 2019-12-15T02:37:56+08:00
[INFO] ------------------------------------------------------------------------

 

如果使用內置的hbase和solor,修改編譯參數如下

# mvn clean -DskipTests package -Pdist,embedded-hbase-solr

編譯完成

[INFO] ------------------------------------------------------------------------
[INFO] Reactor Summary:
[INFO] 
[INFO] Apache Atlas Server Build Tools 1.0 ................ SUCCESS [  0.809 s]
[INFO] apache-atlas 2.0.0 ................................. SUCCESS [  7.569 s]
[INFO] Apache Atlas Test Utility Tools 2.0.0 .............. SUCCESS [  5.851 s]
[INFO] Apache Atlas Integration 2.0.0 ..................... SUCCESS [  5.331 s]
[INFO] Apache Atlas Common 2.0.0 .......................... SUCCESS [  1.877 s]
[INFO] Apache Atlas Client 2.0.0 .......................... SUCCESS [  0.180 s]
[INFO] atlas-client-common 2.0.0 .......................... SUCCESS [  0.871 s]
[INFO] atlas-client-v1 2.0.0 .............................. SUCCESS [  0.841 s]
[INFO] Apache Atlas Server API 2.0.0 ...................... SUCCESS [  1.282 s]
[INFO] Apache Atlas Notification 2.0.0 .................... SUCCESS [  2.491 s]
[INFO] atlas-client-v2 2.0.0 .............................. SUCCESS [  0.777 s]
[INFO] Apache Atlas Graph Database Projects 2.0.0 ......... SUCCESS [  0.074 s]
[INFO] Apache Atlas Graph Database API 2.0.0 .............. SUCCESS [  1.157 s]
[INFO] Graph Database Common Code 2.0.0 ................... SUCCESS [  0.924 s]
[INFO] Apache Atlas JanusGraph-HBase2 Module 2.0.0 ........ SUCCESS [  1.499 s]
[INFO] Apache Atlas JanusGraph DB Impl 2.0.0 .............. SUCCESS [  4.756 s]
[INFO] Apache Atlas Graph Database Implementation Dependencies 2.0.0 SUCCESS [  1.388 s]
[INFO] Apache Atlas Authorization 2.0.0 ................... SUCCESS [  1.554 s]
[INFO] Apache Atlas Repository 2.0.0 ...................... SUCCESS [  9.838 s]
[INFO] Apache Atlas UI 2.0.0 .............................. SUCCESS [01:08 min]
[INFO] Apache Atlas Web Application 2.0.0 ................. SUCCESS [ 52.064 s]
[INFO] Apache Atlas Documentation 2.0.0 ................... SUCCESS [  4.832 s]
[INFO] Apache Atlas FileSystem Model 2.0.0 ................ SUCCESS [  2.084 s]
[INFO] Apache Atlas Plugin Classloader 2.0.0 .............. SUCCESS [  0.717 s]
[INFO] Apache Atlas Hive Bridge Shim 2.0.0 ................ SUCCESS [  3.103 s]
[INFO] Apache Atlas Hive Bridge 2.0.0 ..................... SUCCESS [ 10.010 s]
[INFO] Apache Atlas Falcon Bridge Shim 2.0.0 .............. SUCCESS [  1.386 s]
[INFO] Apache Atlas Falcon Bridge 2.0.0 ................... SUCCESS [  1.348 s]
[INFO] Apache Atlas Sqoop Bridge Shim 2.0.0 ............... SUCCESS [  0.246 s]
[INFO] Apache Atlas Sqoop Bridge 2.0.0 .................... SUCCESS [  6.169 s]
[INFO] Apache Atlas Storm Bridge Shim 2.0.0 ............... SUCCESS [  0.335 s]
[INFO] Apache Atlas Storm Bridge 2.0.0 .................... SUCCESS [  1.968 s]
[INFO] Apache Atlas Hbase Bridge Shim 2.0.0 ............... SUCCESS [  1.834 s]
[INFO] Apache Atlas Hbase Bridge 2.0.0 .................... SUCCESS [  5.859 s]
[INFO] Apache HBase - Testing Util 2.0.0 .................. SUCCESS [  3.876 s]
[INFO] Apache Atlas Kafka Bridge 2.0.0 .................... SUCCESS [  1.670 s]
[INFO] Apache Atlas Distribution 2.0.0 .................... SUCCESS [03:31 min]
[INFO] ------------------------------------------------------------------------
[INFO] BUILD SUCCESS
[INFO] ------------------------------------------------------------------------
[INFO] Total time:  07:08 min
[INFO] Finished at: 2019-12-15T19:08:07+08:00
[INFO] ------------------------------------------------------------------------

3 安裝

# ls distro/target
apache-atlas-2.0.0-bin.tar.gz

拷貝到安裝服務器解壓

# ls

bin  conf  data  DISCLAIMER.txt  docs  hbase  hook  hook-bin  LICENSE logs  models  NOTICE  server  solr  tools

默認配置:使用hbase自帶zookeeper,使用內置的hbase和solr

# vi conf/atlas-env.sh
# indicates whether or not a local instance of HBase should be started for Atlas
export MANAGE_LOCAL_HBASE=true
# indicates whether or not a local instance of Solr should be started for Atlas
export MANAGE_LOCAL_SOLR=true

# vi hbase/conf/hbase-env.sh
# Tell HBase whether it should manage it's own instance of ZooKeeper or not.
# export HBASE_MANAGES_ZK=true

如果要修改,除了以上配置,還要修改

# vi conf/atlas-application.properties
## Server port configuration
atlas.server.http.port=21000

這個配置文件包含啟動端口以及zookeeper、hbase、solr相關配置

4 啟動

# bin/atlas_start.py

確認服務是否正常

# jps

4.1 zookeeper

# echo stat|nc localhost 2181

4.2 hbase

# hbase/bin/hbase shell

4.3 solr

# solr/bin/solr status

如果有個別組件啟動失敗,可以手工啟動

1)hbase

# hbase/bin/start-hbase.sh

2)solr

# solr/bin/solr start -c -z localhost:2181 -p 8983 -force
# solr/bin/solr create -c fulltext_index -force -d conf/solr/
# solr/bin/solr create -c edge_index -force -d conf/solr/
# solr/bin/solr create -c vertex_index -force -d conf/solr/

5 訪問

http://localhost:21000,賬號密碼:admin/admin

 

 

登錄之后

 

6 修改賬號密碼

# cat conf/users-credentials.properties 

#username=group::sha256-password

admin=ADMIN::8c6976e5b5410415bde908bd4dee15dfb167a9c873fc4bb8a81f6f2ab448a918

rangertagsync=RANGER_TAG_SYNC::e3f67240f5117d1753c940dae9eea772d36ed5fe9bd9c94a300e40413f1afb9d

加密地址:https://tool.oschina.net/encrypt?type=2

修改之后重啟;

 

ps:hbase可能啟動失敗,查看日志

hbase/logs/hbase*.log

2020-01-11 12:49:43,613 ERROR [Thread-22] master.HMaster: Failed to become active master
java.lang.IllegalStateException: The procedure WAL relies on the ability to hsync for proper operation during component failures, but the underlying filesystem does not support doin
g so. Please check the config value of 'hbase.procedure.store.wal.use.hsync' to set the desired level of robustness and ensure the config value of 'hbase.wal.dir' points to a FileSy
stem mount that can provide it.
    at org.apache.hadoop.hbase.procedure2.store.wal.WALProcedureStore.rollWriter(WALProcedureStore.java:1044)
    at org.apache.hadoop.hbase.procedure2.store.wal.WALProcedureStore.recoverLease(WALProcedureStore.java:383)
    at org.apache.hadoop.hbase.procedure2.ProcedureExecutor.init(ProcedureExecutor.java:649)
    at org.apache.hadoop.hbase.master.HMaster.createProcedureExecutor(HMaster.java:1282)
    at org.apache.hadoop.hbase.master.HMaster.finishActiveMasterInitialization(HMaster.java:842)
    at org.apache.hadoop.hbase.master.HMaster.startActiveMasterManager(HMaster.java:2086)
    at org.apache.hadoop.hbase.master.HMaster.lambda$run$0(HMaster.java:553)
    at java.lang.Thread.run(Thread.java:748)
2020-01-11 12:49:43,614 ERROR [Thread-22] master.HMaster: ***** ABORTING master dataone-004,61500,1578718180228: Unhandled exception. Starting shutdown. *****
java.lang.IllegalStateException: The procedure WAL relies on the ability to hsync for proper operation during component failures, but the underlying filesystem does not support doin
g so. Please check the config value of 'hbase.procedure.store.wal.use.hsync' to set the desired level of robustness and ensure the config value of 'hbase.wal.dir' points to a FileSy
stem mount that can provide it.
    at org.apache.hadoop.hbase.procedure2.store.wal.WALProcedureStore.rollWriter(WALProcedureStore.java:1044)
    at org.apache.hadoop.hbase.procedure2.store.wal.WALProcedureStore.recoverLease(WALProcedureStore.java:383)
    at org.apache.hadoop.hbase.procedure2.ProcedureExecutor.init(ProcedureExecutor.java:649)
    at org.apache.hadoop.hbase.master.HMaster.createProcedureExecutor(HMaster.java:1282)
    at org.apache.hadoop.hbase.master.HMaster.finishActiveMasterInitialization(HMaster.java:842)
    at org.apache.hadoop.hbase.master.HMaster.startActiveMasterManager(HMaster.java:2086)
    at org.apache.hadoop.hbase.master.HMaster.lambda$run$0(HMaster.java:553)
    at java.lang.Thread.run(Thread.java:748)

需要修改hbase/conf/hbase-site.xml,添加

  <property>
    <name>hbase.unsafe.stream.capability.enforce</name>
    <value>false</value>
  </property>

然后啟動成功。


免責聲明!

本站轉載的文章為個人學習借鑒使用,本站對版權不負任何法律責任。如果侵犯了您的隱私權益,請聯系本站郵箱yoyou2525@163.com刪除。



 
粵ICP備18138465號   © 2018-2025 CODEPRJ.COM