Jsoup之解析URL、字符串、文件

本文轉載自查看原文 2020-06-13 21:27 649 爬蟲

package jsoup;

import org.apache.commons.io.FileUtils;
import org.jsoup.Jsoup;
import org.jsoup.nodes.Document;
import org.junit.Test;

import java.io.File;
import java.net.URL;

public class JsoupFirstTest {
    @Test
    public void testUrl() throws Exception{
        //解析URL地址
        Document doc =Jsoup.parse(new URL("http://www.itcast.cn"),1000);
        //使用標簽選擇器，獲取title標簽中的內容
        String title=doc.getElementsByTag("title").first().text();
        //將獲取到的信息進行打印
        System.out.println(title);
    }
    @Test
    public void testString() throws Exception{
        //使用工具類讀取文件，獲得字符串
       String content= FileUtils.readFileToString(new File("C:\\Users\\Administrator\\Desktop\\bolg_add.html"),"utf8");
        //解析字符串
        Document doc=Jsoup.parse(content);
        String string=doc.getElementsByTag("title").first().text();
        System.out.println(string);
    }
    @Test
    public void testFile() throws  Exception{
        Document doc=Jsoup.parse(new File("C:\\Users\\Administrator\\Desktop\\bolg_add.html"),"utf8");
        String title=doc.getElementsByTag("title").first().text();
        System.out.println(title);
    }
}
上面代碼就是利用jsoup對URL、字符串、文件的解析，這也是Jsoup的三大常用功能。
在進行以上代碼的驗證是，必須提前進行依賴的引入。

免責聲明！

本站轉載的文章為個人學習借鑒使用，本站對版權不負任何法律責任。如果侵犯了您的隱私權益，請聯系本站郵箱yoyou2525@163.com刪除。

猜您在找 querystring 解析url 查詢字符串【hive】解析url格式字符串 URLSearchParams生成和解析URL或者參數字符串 DEX文件解析--3、dex文件字符串解析 java Jsoup.clean 處理入參時，會將換行符解析成空字符串問題字符串解析 C/C++.【轉】解析URL的轉義字符百分比(%)字符串 JS解析XML文件和XML字符串 C# XELEMENT 解析xml文件(字符串) c++ 讀取文件字符串並且解析