PDFBox 解析PDF文件-解析服务器文件

本文转载自查看原文 2020-03-06 16:51 736 java

1.首先引进pom

<dependency>
<groupId>org.apache.pdfbox</groupId>
<artifactId>pdfbox</artifactId>
<version>2.0.4</version>
</dependency>

2.controller层直接代码

/**
 * PDF解析
 * @return
 */
@PostMapping("/getPdf")
public StringBuffer getPdf(@RequestBody JSONObject jsonObject) throws IOException {
    StringBuffer stringBuffer = null;

  //获取服务器地址
    ImportParams params = new ImportParams();
    params.setSaveUrl("/file");
    String filePath = jsonObject.getString("filePath");
    filePath = fileServer + "/" + filePath;
    URL url = new URL(filePath);
    URLConnection connection = url.openConnection();
    InputStream inputStream = connection.getInputStream();
    try {
        PDDocument document;
        PDFParser parser = new PDFParser(new RandomAccessBuffer(inputStream));
        parser.parse();
        document = parser.getPDDocument();
        document.getClass();
        if(!document.isEncrypted()) {
            PDFTextStripperByArea stripper = new PDFTextStripperByArea();
            stripper.setSortByPosition(true);
            PDFTextStripper textStripper = new PDFTextStripper();
            String exposeContent = textStripper.getText(document);
            String[] content = exposeContent.split("\\n");
             stringBuffer = new StringBuffer();
            for(String line:content) {
                stringBuffer.append(line);
            }
        }

    } catch (Exception e) {
        e.printStackTrace();

    }
    return stringBuffer;
}

免责声明！

本站转载的文章为个人学习借鉴使用，本站对版权不负任何法律责任。如果侵犯了您的隐私权益，请联系本站邮箱yoyou2525@163.com删除。

猜您在找 pdfBox 解析 pdf文件使用PDFBox解析PDF文件使用pdf.js在线预览 PDF （本地文件，服务器文件） linux服务器本地域名解析文件【pdf在线浏览】使用psf.js在浏览器查看服务器端pdf文件 java 用PDFBox 删除 PDF文件中的某一页 Java 使用PDFBox提取PDF文件中的图片 pdfbox pdf转图片 PDFBox –如何读取PDF的内容 java从远程服务器获取PDF文件并后台打印（使用pdfFox）