深入理解JAVA集合系列三：HashMap的死循環解讀

本文轉載自查看原文 2016-06-20 22:40 16949 JAVA集合/ HashMap、死循環、CPU100%

由於在公司項目中偶爾會遇到HashMap死循環造成CPU100%，重啟后問題消失，隔一段時間又會反復出現。今天在這里來仔細剖析下多線程情況下HashMap所帶來的問題：

1、多線程put操作后，get操作導致死循環。

2、多線程put非null元素后，get操作得到null值。

3、多線程put操作，導致元素丟失。

死循環場景重現

下面我用一段簡單的DEMO模擬HashMap死循環:

 1 public class Test extends Thread
 2 {
 3     static HashMap<Integer, Integer> map = new HashMap<Integer, Integer>(2);
 4     static AtomicInteger at = new AtomicInteger();
 5     
 6     public void run()
 7     {
 8         while(at.get() < 100000)
 9         {
10             map.put(at.get(),at.get());
11             at.incrementAndGet();
12         }
13     }

其中map和at都是static的，即所有線程所共享的資源。接着5個線程並發操作該HashMap：

 1 public static void main(String[] args)
 2      {
 3          Test t0 = new Test();
 4          Test t1 = new Test();
 5          Test t2 = new Test();
 6          Test t3 = new Test();
 7          Test t4 = new Test();
 8          t0.start();
 9          t1.start();
10          t2.start();
11          t3.start();
12          t4.start();
13      }

反復執行幾次，出現這種情況則表示死循環了：

接下來我們去查看下CPU以及堆棧情況：

通過堆棧可以看到：Thread-3由於HashMap的擴容操作導致了死循環。

正常的擴容過程

我們先來看下單線程情況下，正常的rehash過程

1、假設我們的hash算法是簡單的key mod一下表的大小（即數組的長度）。

2、最上面是old hash表，其中HASH表的size=2，所以key=3,5,7在mod 2 以后都沖突在table[1]這個位置上了。

3、接下來HASH表擴容，resize=4，然后所有的<key,value>重新進行散列分布，過程如下：

在單線程情況下，一切看起來都很美妙，擴容過程也相當順利。接下來看下並發情況下的擴容。

並發情況下的擴容

1、首先假設我們有兩個線程，分別用紅色和藍色標注了。

2、擴容部分的源代碼：

 1 void transfer(Entry[] newTable) {
 2         Entry[] src = table;
 3         int newCapacity = newTable.length;
 4         for (int j = 0; j < src.length; j++) {
 5             Entry<K,V> e = src[j];
 6             if (e != null) {
 7                 src[j] = null;
 8                 do {
 9                     Entry<K,V> next = e.next;
10                     int i = indexFor(e.hash, newCapacity);
11                     e.next = newTable[i];
12                     newTable[i] = e;
13                     e = next;
14                 } while (e != null);
15             }
16         }
17     }