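【Question title】: Create PDF and merge with PDFBox
【Posted】: 2012-11-28 06:54:27
【Question】:

Here is what I want to do:

  1. Create 2 different PDF files with PDFBox

  2. Merge the two files with PDFMergerUtility

I know how to do this if I save the files from #1 to the server's local hard drive and load them for #2. What I want instead is to do it "directly from memory". I have searched through all the methods in PDFBox but still cannot find a way.

Here is my code, which works from local files.

Thanks.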

import java.io.BufferedOutputStream;
import java.io.FileOutputStream;
import java.io.IOException;
import java.io.OutputStream;

import org.apache.pdfbox.exceptions.COSVisitorException;
import org.apache.pdfbox.pdmodel.PDDocument;
import org.apache.pdfbox.pdmodel.PDPage;
import org.apache.pdfbox.pdmodel.edit.PDPageContentStream;
import org.apache.pdfbox.pdmodel.font.PDFont;
import org.apache.pdfbox.pdmodel.font.PDTrueTypeFont;
import org.apache.pdfbox.pdmodel.font.PDType1Font;
import org.apache.pdfbox.util.PDFMergerUtility;

/**
* This is an example that creates a simple document
* with a ttf-font.
*
* @author <a href="mailto:m.g.n@gmx.de">Michael Niedermair</a>
* @version $Revision: 1.2 $
*/
public class Test2
{

    /**
    * Creates a one-page PDF containing the given message and saves it to a file.
    *
    * @param file     The file to write the PDF to.
    * @param message    The message to write in the file.
    *
    * @throws IOException If there is an error writing the data.
    * @throws COSVisitorException If there is an error writing the PDF.
    */
    public void doIt(final String file, final String message) throws IOException, COSVisitorException
    {

        // the document
        PDDocument doc = null;
        try
        {
            doc = new PDDocument();

            PDPage page = new PDPage();
            doc.addPage(page);
            PDFont font = PDType1Font.HELVETICA_BOLD;


            PDPageContentStream contentStream = new PDPageContentStream(doc, page);
            contentStream.beginText();
            contentStream.setFont(font, 12);
            contentStream.moveTextPositionByAmount(100, 700);
            contentStream.drawString(message);
            contentStream.endText();
            contentStream.close();

            doc.save(file);

            System.out.println(file + " created!");
        }
        finally
        {
            if (doc != null)
            {
                doc.close();
            }
        }
    }

    /**
     * Creates two single-page PDF documents on disk and merges them
     * into a third file.
     *
     * @param args Command line arguments (unused).
     */
    public static void main(String[] args)
    {

        Test2 app = new Test2();
        Test2 app2 = new Test2();
        try {
            app.doIt("C:/here.pdf", "hello");
            app2.doIt("C:/here2.pdf", "helloagain");
            PDFMergerUtility merger = new PDFMergerUtility();
            merger.addSource("C:/here.pdf");
            merger.addSource("C:/here2.pdf");
            OutputStream bout2 = new BufferedOutputStream(new FileOutputStream("C:/hereisthefinal.pdf"));

            merger.setDestinationStream(bout2);
            merger.mergeDocuments();

        } catch (COSVisitorException e) {
            // TODO Auto-generated catch block
            e.printStackTrace();
        } catch (IOException e) {
            // TODO Auto-generated catch block
            e.printStackTrace();
        }
    }

}

【Comments】:

    Tags: java pdf merge pdfbox


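    【Solution 1】:

    You just need to use the PDFMergerUtility.addSource(InputStream) method to add each source from an input stream instead of a physical file.

    From a quick look at the API, you can use the PDDocument.save(OutputStream) method to write the document into an in-memory byte array, so something like this should work.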

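    // Needs java.io.ByteArrayInputStream and java.io.ByteArrayOutputStream
    // in addition to the imports from the question.
    static byte[] doIt(String message) throws IOException, COSVisitorException {
       PDDocument doc = new PDDocument();
       try {
          // add the message
          ByteArrayOutputStream baos = new ByteArrayOutputStream();
          // write the PDF into memory instead of a file
          doc.save(baos);
          return baos.toByteArray();
       } finally {
          doc.close();
       }
    }
    
    public static void main(String[] args) throws IOException, COSVisitorException {
       byte[] pdf1 = doIt("hello");
       byte[] pdf2 = doIt("world");
       PDFMergerUtility merger = new PDFMergerUtility();
       // each in-memory PDF becomes a merge source
       merger.addSource(new ByteArrayInputStream(pdf1));
       merger.addSource(new ByteArrayInputStream(pdf2));
       // do the rest with the merger
    }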
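
The handoff this answer relies on, writing to a ByteArrayOutputStream and then reading the same bytes back through a ByteArrayInputStream, can be tried without PDFBox on the classpath. The following is a minimal sketch using only the JDK; `InMemoryHandoff`, `StreamWriter`, `capture`, and `asSource` are hypothetical names introduced for illustration and are not part of PDFBox.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.OutputStream;

/** Hypothetical helper (not PDFBox API): captures written bytes in memory. */
final class InMemoryHandoff {

    /** Anything that can write itself to an OutputStream. */
    interface StreamWriter {
        void writeTo(OutputStream out) throws Exception;
    }

    /** Runs the writer against an in-memory buffer and returns the bytes it produced. */
    static byte[] capture(StreamWriter writer) {
        ByteArrayOutputStream buffer = new ByteArrayOutputStream();
        try {
            writer.writeTo(buffer);
        } catch (Exception e) {
            // Wrap checked exceptions so callers do not have to declare them.
            throw new IllegalStateException("Writing to memory failed", e);
        }
        return buffer.toByteArray();
    }

    /** Wraps previously captured bytes so they can be consumed as an input stream. */
    static ByteArrayInputStream asSource(byte[] bytes) {
        return new ByteArrayInputStream(bytes);
    }
}
```

With PDFBox available, a call such as `InMemoryHandoff.capture(out -> doc.save(out))` should yield the bytes to pass on via `merger.addSource(InMemoryHandoff.asSource(bytes))`, assuming `doc` is an open PDDocument.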
    

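    【Comments】:

    • Thank you so much :) This InputStream/OutputStream conversion stuff really wore me out :(. Thanks!
    【Solution 2】:

    I use this to merge several documents (InputStreams) and write the merged document to an HttpServletResponse.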

      PDFMergerUtility mergedDoc = new PDFMergerUtility();
      ByteArrayOutputStream colDocOutputstream = new ByteArrayOutputStream();
    
      for (int i = 0; i < documentCount; i++)
      {
        ByteArrayOutputStream tempZipOutstream = new ByteArrayOutputStream();
    ...
        mergedDoc.addSource(new ByteArrayInputStream(tempZipOutstream.toByteArray()));
      }
    
      mergedDoc.setDestinationStream(colDocOutputstream);
      mergedDoc.mergeDocuments();
    
      response.setContentLength(colDocOutputstream.size());
      response.setContentType("application/pdf");
      response.setHeader("Content-Disposition", "attachment; filename=mergedDocument.pdf");
      response.setHeader("Pragma", "public");
      response.setHeader("Cache-Control", "max-age=0");
      response.addDateHeader("Expires", 0);
      response.getOutputStream().write(colDocOutputstream.toByteArray());
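
If the merged document is large, `toByteArray()` makes a second full copy of the buffer before it is written out. `ByteArrayOutputStream.writeTo(OutputStream)` writes the buffered bytes straight to the target instead. A small JDK-only sketch follows; `BufferFlush` and `flushTo` are hypothetical names, and the `target` parameter stands in for `response.getOutputStream()`.

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.OutputStream;
import java.io.UncheckedIOException;

/** Hypothetical helper: sends an in-memory buffer to another stream. */
final class BufferFlush {

    /**
     * Writes the buffer's contents to the target without the extra array
     * that toByteArray() would allocate, and returns the number of bytes sent.
     */
    static int flushTo(ByteArrayOutputStream buffer, OutputStream target) {
        try {
            buffer.writeTo(target);
            target.flush();
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return buffer.size();
    }
}
```

The returned count is the same value the answer passes to `response.setContentLength(...)`, so it can be computed before the write with `buffer.size()` if the header has to be set first.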
    

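    【Comments】:

      【Solution 3】:

      You can also do it this way:
      1) Create a list of InputStreams
      2) Instantiate the PDFMergerUtility class
      3) Set the destination output stream
      4) Add all InputStreams to the PDFMergerUtility as the source files to be merged
      5) Call "PDFmerger.mergeDocuments();" to merge the documents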

         // Creating the list of InputStreams to merge
         List<InputStream> locations = new ArrayList<InputStream>();
         locations.add(new FileInputStream("E:/Filenet Project Support/MergePDFs_Sample_Code/Attorney_new_form.pdf"));
         locations.add(new FileInputStream("E:/Filenet Project Support/MergePDFs_Sample_Code/JH.pdf"));
         locations.add(new FileInputStream("E:/Filenet Project Support/MergePDFs_Sample_Code/Interpreter_new_form.pdf"));
         // Instantiating PDFMergerUtility class
         PDFMergerUtility PDFmerger = new PDFMergerUtility();
         // Creating the destination output stream
         OutputStream out = new FileOutputStream("E:/Filenet Project Support/MergePDFs_Sample_Code/merged.pdf");
         // Adding all InputStreams to PDFMerger as source files to be merged
         PDFmerger.addSources(locations);
         // Setting the destination output stream
         PDFmerger.setDestinationStream(out);
         // Merging the three documents
         PDFmerger.mergeDocuments();
         System.out.println("Documents merged");
      

      【Comments】:

        【Solution 4】:

        Using REST and PDFBox

        @RequestMapping(value = "/getMergePdf", method = RequestMethod.GET)
            public ResponseEntity<byte[]> getMergePdf(@RequestParam(value = "filePath", required = true) String filePath,
                    @RequestParam(value = "newFileName", required = true) String newFileName) throws IOException {
        
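                // Step 1: Listing the files in the source directory
                File file = new File(filePath);
                File[] listFile = file.listFiles();
                // listFiles() returns null when the path is not a readable directory
                if (listFile == null) {
                    throw new IOException(filePath + " is not a readable directory");
                }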
        
                // Step 2: Instantiating the PDFMergerUtility class
                PDFMergerUtility mergePdf = new PDFMergerUtility();
        
                // Step 3: Setting the source files
                for (File pdfName : listFile) {
                    mergePdf.addSource(pdfName);
                }
        
                // Step 4: Setting the destination file
                ByteArrayOutputStream pdfDocOutputstream = new ByteArrayOutputStream();
                mergePdf.setDestinationFileName(newFileName + ".pdf");
                mergePdf.setDestinationStream(pdfDocOutputstream);
                mergePdf.mergeDocuments(MemoryUsageSetting.setupTempFileOnly());
        
                // Step 5: write in Response
                HttpHeaders headers = new HttpHeaders();
                headers.setContentType(MediaType.APPLICATION_PDF);
        
                // Here you have to set the actual filename of your pdf
                headers.setContentDispositionFormData(mergePdf.getDestinationFileName(), mergePdf.getDestinationFileName());
                headers.setCacheControl("must-revalidate, post-check=0, pre-check=0");
                ResponseEntity<byte[]> response = new ResponseEntity<>(pdfDocOutputstream.toByteArray(), headers, HttpStatus.OK);
                return response;
        
        
            }
        

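        【Comments】:

        • Why read the PDFs into byte arrays and add those arrays as sources? There is an addSource overload that accepts a File argument. Using that overload most likely means a smaller memory footprint.
        • Thanks for the suggestion. I have updated the code based on your comment.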