Java " />

扫描二维码下载沐宇APP

沐宇

微信扫码使用沐宇小程序

沐宇

Java PDFReader濡備綍鎻愬彇鏂囨湰鍐呭

扬州沐宇科技
2024-06-27 21:48:29
Java

瑕佸湪Java涓彁鍙朠DF鏂囨。鐨勬枃鏈唴瀹癸紝鍙互浣跨敤Apache PDFBox搴撱€備互涓嬫槸涓€涓畝鍗曠殑绀轰緥浠g爜锛屾紨绀哄浣曚娇鐢≒DFBox鎻愬彇鏂囨湰鍐呭锛?/p>

import org.apache.pdfbox.pdmodel.PDDocument;
import org.apache.pdfbox.text.PDFTextStripper;

import java.io.File;
import java.io.IOException;

public class PDFReader {
    public static void main(String[] args) {
        try {
            // Load PDF document
            PDDocument document = PDDocument.load(new File("example.pdf"));

            // Create PDFTextStripper
            PDFTextStripper pdfTextStripper = new PDFTextStripper();

            // Extract text
            String text = pdfTextStripper.getText(document);

            // Print extracted text
            System.out.println(text);

            // Close the document
            document.close();
        } catch (IOException e) {
            e.printStackTrace();
        }
    }
}

鍦ㄨ繖涓ず渚嬩腑锛屾垜浠姞杞戒竴涓悕涓篹xample.pdf鐨凱DF鏂囨。锛屽苟浣跨敤PDFBox鐨凱DFTextStripper绫绘彁鍙栨枃鏈唴瀹广€傛渶鍚庯紝鎴戜滑灏嗘彁鍙栫殑鏂囨湰鍐呭鎵撳嵃鍒版帶鍒跺彴涓娿€?/p>

璇锋敞鎰忥紝瑕佽繍琛屾绀轰緥浠g爜锛屾偍闇€瑕佸皢Apache PDFBox搴撴坊鍔犲埌鎮ㄧ殑椤圭洰涓€傛偍鍙互鍦∕aven涓坊鍔犱互涓嬩緷璧栭」鏉ュ寘鍚玃DFBox搴擄細

<dependency>
    <groupId>org.apache.pdfbox</groupId>
    <artifactId>pdfbox</artifactId>
    <version>2.0.24</version>
</dependency>

鎮ㄥ彲浠ラ€氳繃浠ヤ笅閾炬帴涓嬭浇Apache PDFBox搴擄細https://pdfbox.apache.org/

扫码添加客服微信