DTM initialization: failure during startup recovery, retry failed, check segment status (cdbtm.c:1603)


安裝greenplum集群出現以下錯誤:

20160315:13:49:16:025696 gpinitsystem:h95:jason-[INFO]:-Checking configuration parameters, please wait...
20160315:13:49:16:025696 gpinitsystem:h95:jason-[INFO]:-Reading Greenplum configuration file init_config
20160315:13:49:16:025696 gpinitsystem:h95:jason-[INFO]:-Locale has not been set in init_config, will set to default value
20160315:13:49:16:025696 gpinitsystem:h95:jason-[INFO]:-Locale set to en_US.utf8
20160315:13:49:16:025696 gpinitsystem:h95:jason-[INFO]:-No DATABASE_NAME set, will exit following template1 updates
20160315:13:49:16:025696 gpinitsystem:h95:jason-[INFO]:-MASTER_MAX_CONNECT not set, will set to default value 250
20160315:13:49:17:025696 gpinitsystem:h95:jason-[INFO]:-Checking configuration parameters, Completed
20160315:13:49:17:025696 gpinitsystem:h95:jason-[INFO]:-Commencing multi-home checks, please wait...
..
20160315:13:49:17:025696 gpinitsystem:h95:jason-[INFO]:-Configuring build for standard array
20160315:13:49:17:025696 gpinitsystem:h95:jason-[INFO]:-Commencing multi-home checks, Completed
20160315:13:49:17:025696 gpinitsystem:h95:jason-[INFO]:-Building primary segment instance array, please wait...
..................
20160315:13:49:24:025696 gpinitsystem:h95:jason-[INFO]:-Checking Master host
20160315:13:49:24:025696 gpinitsystem:h95:jason-[INFO]:-Checking new segment hosts, please wait...
..................
20160315:13:49:39:025696 gpinitsystem:h95:jason-[INFO]:-Checking new segment hosts, Completed
20160315:13:49:39:025696 gpinitsystem:h95:jason-[INFO]:-Building the Master instance database, please wait...
20160315:13:49:49:025696 gpinitsystem:h95:jason-[INFO]:-Starting the Master in admin mode
20160315:13:51:35:025696 gpinitsystem:h95:jason-[INFO]:-Commencing parallel build of primary segment instances
20160315:13:51:35:025696 gpinitsystem:h95:jason-[INFO]:-Spawning parallel processes    batch [1], please wait...
..................
20160315:13:51:36:025696 gpinitsystem:h95:jason-[INFO]:-Waiting for parallel processes batch [1], please wait...
..................................................
20160315:13:52:26:025696 gpinitsystem:h95:jason-[INFO]:------------------------------------------------
20160315:13:52:26:025696 gpinitsystem:h95:jason-[INFO]:-Parallel process exit status
20160315:13:52:26:025696 gpinitsystem:h95:jason-[INFO]:------------------------------------------------
20160315:13:52:26:025696 gpinitsystem:h95:jason-[INFO]:-Total processes marked as completed           = 18
20160315:13:52:26:025696 gpinitsystem:h95:jason-[INFO]:-Total processes marked as killed              = 0
20160315:13:52:26:025696 gpinitsystem:h95:jason-[INFO]:-Total processes marked as failed              = 0
20160315:13:52:26:025696 gpinitsystem:h95:jason-[INFO]:------------------------------------------------
20160315:13:52:27:025696 gpinitsystem:h95:jason-[INFO]:-Deleting distributed backout files
20160315:13:52:27:025696 gpinitsystem:h95:jason-[INFO]:-Removing back out file
20160315:13:52:27:025696 gpinitsystem:h95:jason-[INFO]:-No errors generated from parallel processes
20160315:13:52:27:025696 gpinitsystem:h95:jason-[INFO]:-Restarting the Greenplum instance in production mode
20160315:13:52:27:011100 gpstop:h95:jason-[INFO]:-Starting gpstop with args: -a -i -m -d /home/jason/gpdata/gpseg-1
20160315:13:52:27:011100 gpstop:h95:jason-[INFO]:-Gathering information and validating the environment...
20160315:13:52:27:011100 gpstop:h95:jason-[INFO]:-Obtaining Greenplum Master catalog information
20160315:13:52:27:011100 gpstop:h95:jason-[INFO]:-Obtaining Segment details from master...
20160315:13:52:27:011100 gpstop:h95:jason-[INFO]:-Greenplum Version: 'greenplum (Greenplum Database) 4.3.99.00 build dev'
20160315:13:52:27:011100 gpstop:h95:jason-[INFO]:-There are 0 connections to the database
20160315:13:52:27:011100 gpstop:h95:jason-[INFO]:-Commencing Master instance shutdown with mode='immediate'
20160315:13:52:27:011100 gpstop:h95:jason-[INFO]:-Master host=h95
20160315:13:52:27:011100 gpstop:h95:jason-[INFO]:-Commencing Master instance shutdown with mode=immediate
20160315:13:52:27:011100 gpstop:h95:jason-[INFO]:-Master segment instance directory=/home/jason/gpdata/gpseg-1
20160315:13:52:28:011100 gpstop:h95:jason-[INFO]:-Attempting forceful termination of any leftover master process
20160315:13:52:28:011100 gpstop:h95:jason-[INFO]:-Terminating processes for segment /home/jason/gpdata/gpseg-1
20160315:13:52:28:011100 gpstop:h95:jason-[ERROR]:-Failed to kill processes for segment /home/jason/gpdata/gpseg-1: ([Errno 3] No such process)
20160315:13:52:28:011187 gpstart:h95:jason-[INFO]:-Starting gpstart with args: -a -d /home/jason/gpdata/gpseg-1
20160315:13:52:28:011187 gpstart:h95:jason-[INFO]:-Gathering information and validating the environment...
20160315:13:52:28:011187 gpstart:h95:jason-[INFO]:-Greenplum Binary Version: 'greenplum (Greenplum Database) 4.3.99.00 build dev'
20160315:13:52:28:011187 gpstart:h95:jason-[INFO]:-Greenplum Catalog Version: '201310150'
20160315:13:52:28:011187 gpstart:h95:jason-[INFO]:-Starting Master instance in admin mode
20160315:13:52:29:011187 gpstart:h95:jason-[INFO]:-Obtaining Greenplum Master catalog information
20160315:13:52:29:011187 gpstart:h95:jason-[INFO]:-Obtaining Segment details from master...
20160315:13:52:30:011187 gpstart:h95:jason-[INFO]:-Setting new master era
20160315:13:52:30:011187 gpstart:h95:jason-[INFO]:-Master Started...
20160315:13:52:30:011187 gpstart:h95:jason-[INFO]:-Shutting down master
20160315:13:52:31:011187 gpstart:h95:jason-[INFO]:-Commencing parallel segment instance startup, please wait...
....... 
20160315:13:52:38:011187 gpstart:h95:jason-[INFO]:-Process results...
20160315:13:52:38:011187 gpstart:h95:jason-[INFO]:-----------------------------------------------------
20160315:13:52:38:011187 gpstart:h95:jason-[INFO]:-   Successful segment starts                                            = 18
20160315:13:52:38:011187 gpstart:h95:jason-[INFO]:-   Failed segment starts                                                = 0
20160315:13:52:38:011187 gpstart:h95:jason-[INFO]:-   Skipped segment starts (segments are marked down in configuration)   = 0
20160315:13:52:38:011187 gpstart:h95:jason-[INFO]:-----------------------------------------------------
20160315:13:52:38:011187 gpstart:h95:jason-[INFO]:-
20160315:13:52:38:011187 gpstart:h95:jason-[INFO]:-Successfully started 18 of 18 segment instances 
20160315:13:52:38:011187 gpstart:h95:jason-[INFO]:-----------------------------------------------------
20160315:13:52:38:011187 gpstart:h95:jason-[INFO]:-Starting Master instance h95 directory /home/jason/gpdata/gpseg-1 
20160315:13:52:39:011187 gpstart:h95:jason-[INFO]:-Command sys_ctl reports Master h95 instance active
20160315:13:54:33:011187 gpstart:h95:jason-[WARNING]:-FATAL:  DTM initialization: failure during startup recovery, retry failed, check segment status (cdbtm.c:1603)

20160315:13:54:33:011187 gpstart:h95:jason-[INFO]:-No standby master configured.  skipping...
20160315:13:54:33:011187 gpstart:h95:jason-[INFO]:-Check status of database with gpstate utility
20160315:13:54:37:025696 gpinitsystem:h95:jason-[INFO]:-Completed restart of Greenplum instance in production mode
20160315:13:54:37:025696 gpinitsystem:h95:jason-[INFO]:-Loading gp_toolkit...
psql: FATAL:  DTM initialization: failure during startup recovery, retry failed, check segment status (cdbtm.c:1603)
20160315:13:56:26:gpinitsystem:h95:jason-[FATAL]:-Failed to retrieve rolname. Script Exiting!

我的集群配置:兩台機器,32g內存16g交換分區。每台機器9個節點。集群按照完成之后,顯示segment啟動的18個,但是通過psql連接不上,報錯!

 主要錯誤信息:

DTM initialization: failure during startup recovery, retry failed, check segment status (cdbtm.c:1603)

去官網看了很多人遇到此類的問題,錯誤原因有很多,今天特地總結以下:

Q&A1:系統環境變量沒有設置正確,這個需要根據自己安裝版本的greenplum去設置一下環境變量,可以去官網相對應的版本install guide 那里設置一下!

Q&A2:shared_buffers設置太大,對於如何根據自己內存和segment節點個數分配shared_buffers,可以去官網找一下,通常出去2g的other,以及statement_mem * segment 個數,剩下的除以segment的個數即可。這種情況通常出現中安裝過程中就設置了shared_buffers,一般默認的125MB

Q&A3:防火牆是否關閉,這個情況最容易忽略,也是最容易出現的,通常有些人重啟機器之后就忘記了關閉,我就是這樣的,嘿嘿。你可以設置防火牆重啟后一樣生效!

。。。還有其他的原因歡迎來補充!謝謝,分享是一種美,希望能幫到你!

 


免責聲明!

本站轉載的文章為個人學習借鑒使用,本站對版權不負任何法律責任。如果侵犯了您的隱私權益,請聯系本站郵箱yoyou2525@163.com刪除。



 
粵ICP備18138465號   © 2018-2025 CODEPRJ.COM