java統計字符串單詞的個數


在一些項目中可能需要對一段字符串中的單詞進行統計,我在這里寫了一個簡單的demo,有需要的同學可以拿去看一下。

本人沒怎么寫個播客,如果有啥說的不對的地方,歡迎大家指正。

查詢指定字符串出現次數可以直接使用:StringUtils.countMatches(str, sub)

統計所有單詞出現次數代碼:

        /** * 統計各個單詞出現的次數 * @param text */ public static void findEnglishNum(String text){ //找出所有的單詞 String[] array = {".", " ", "?", "!"}; for (int i = 0; i < array.length; i++) { text = text.replace(array[i],","); } String[] textArray = text.split(","); //遍歷 記錄 Map<String, Integer> map = new HashMap<String, Integer>(); for (int i = 0; i < textArray.length; i++) { String key = textArray[i]; //轉為小寫 String key_l = key.toLowerCase(); if(!"".equals(key_l)){ Integer num = map.get(key_l); if(num == null || num == 0){ map.put(key_l, 1); }else if(num > 0){ map.put(key_l, num+1); } } } //輸出到控制台 System.out.println("各個單詞出現的頻率為:"); Iterator<String> iter = map.keySet().iterator(); while(iter.hasNext()){ String key = iter.next(); Integer num = map.get(key); System.out.println(key + "\n\t\t" + num + "次\n-------------------"); } } 

測試代碼:

    public static void main(String[] args) { String text = "Welcome welcome to ADempiere, a commons-based peer-production of Open Source ERP Applications. This Wiki is for the global community to contribute and share know-how and domain expertise. We hope you can find as much open information and participate in making it most usable for everyone. This project has a bazaar of Citizens with a Community Council Team which work in theFunctional Team and Technical Team along the Software Development Procedure supported and funded by the foundation ADempiere"; findEnglishNum(text); } 

運行結果:

后面還有一些沒有全部截下來


本人代碼功力不夠深厚,如果代碼中有什么不對或不好的地方,歡迎各位大神指點!可憐


免責聲明!

本站轉載的文章為個人學習借鑒使用,本站對版權不負任何法律責任。如果侵犯了您的隱私權益,請聯系本站郵箱yoyou2525@163.com刪除。



 
粵ICP備18138465號   © 2018-2025 CODEPRJ.COM