Hive Metastore 連接報錯


背景

項目中需要通過一些自定義的組件來操控hive的元數據,於是使用了remote方式來存儲hive元數據,使用一個服務后台作為gateway,由它來控制hive元數據。

現象

在windows上連接hive metastore的時候,無端的會報NullPointerException,非常費解。

分析

看了代碼后發現,連接后會獲取本地用戶所在的用戶組信息(org.apache.hadoop.hive.metastore.HiveMetaStoreClient中的open方法):

          if (isConnected && !useSasl && conf.getBoolVar(ConfVars.METASTORE_EXECUTE_SET_UGI)){
            // Call set_ugi, only in unsecure mode.
            try {
              UserGroupInformation ugi = Utils.getUGI();
              client.set_ugi(ugi.getUserName(), Arrays.asList(ugi.getGroupNames()));
            } catch (LoginException e) {
              LOG.warn("Failed to do login. set_ugi() is not successful, " +
                       "Continuing without it.", e);
            } catch (IOException e) {
              LOG.warn("Failed to find ugi of client set_ugi() is not successful, " +
                  "Continuing without it.", e);
            } catch (TException e) {
              LOG.warn("set_ugi() not successful, Likely cause: new client talking to old server. "
                  + "Continuing without it.", e);
            }
          }
ugi.getGroupNames()會去調用本地命令在windows平台上會使用一個叫winutils的工具,但是作為客戶端開發的話不會在windows端安裝這些二進制文件,所以代碼流程就出錯了
  /**
   * a Unix command to get a given user's groups list.
   * If the OS is not WINDOWS, the command will get the user's primary group
   * first and finally get the groups list which includes the primary group.
   * i.e. the user's primary group will be included twice.
   */
  public static String[] getGroupsForUserCommand(final String user) {
    //'groups username' command return is non-consistent across different unixes
    return (WINDOWS)? new String[] { WINUTILS, "groups", "-F", "\"" + user + "\""}
                    : new String [] {"bash", "-c", "id -gn " + user
                                     + "&& id -Gn " + user};
WINUTILS的初始化在如下函數中,如果path中找不到的話會返回null
  /** a Windows utility to emulate Unix commands */
  public static final String WINUTILS = getWinUtilsPath();

  public static final String getWinUtilsPath() {
    String winUtilsPath = null;

    try {
      if (WINDOWS) {
        winUtilsPath = getQualifiedBinPath("winutils.exe");
      }
    } catch (IOException ioe) {
       LOG.error("Failed to locate the winutils binary in the hadoop binary path",
         ioe);
    }

    return winUtilsPath;
  }
在java.lang.ProcessBuilder.java中的start中有如下判斷:
public Process start() throws IOException {
        // Must convert to array first -- a malicious user-supplied
        // list might try to circumvent the security check.
        String[] cmdarray = command.toArray(new String[command.size()]);
        cmdarray = cmdarray.clone();

        for (String arg : cmdarray)
            if (arg == null)
                throw new NullPointerException();
        // Throws IndexOutOfBoundsException if command is empty
        String prog = cmdarray[0];

由於cmdarray中的第一個元素就是null,所以馬上甩出NullPointerException

toString() 中的null值檢測

另外在org.apache.hadoop.util.Shell中

ShellCommandExecutor

這個類中存在一個問題,就是toString方面沒有對成員為null的情況進行判斷如:

    /**
     * Returns the commands of this instance.
     * Arguments with spaces in are presented with quotes round; other
     * arguments are presented raw
     *
     * @return a string representation of the object.
     */
    @Override
    public String toString() {
      StringBuilder builder = new StringBuilder();
      String[] args = getExecString();
      for (String s : args) {
        if (s.indexOf(' ') >= 0) {
          builder.append('"').append(s).append('"');
        } else {
          builder.append(s);
        }
        builder.append(' ');
      }
      return builder.toString();
    }

即假如我們的命令args中有元素是null,那么這個toString也會拋出NullPointerException,因為在沒有判斷的情況下直接引用了對象方法(s.indexOf),記得這個問題似乎在Effective Java里看到過。一般並不會觸發這問題,可是在打開調試器的時候,它會去執行當前環境里對象的toString方法。所以每當debug到相關代碼段時,總是莫名其妙的就突然爆出個NullPointerException,着實費解了一些時間。

 


免責聲明!

本站轉載的文章為個人學習借鑒使用,本站對版權不負任何法律責任。如果侵犯了您的隱私權益,請聯系本站郵箱yoyou2525@163.com刪除。



 
粵ICP備18138465號   © 2018-2025 CODEPRJ.COM