【发布时间】:2018-01-08 19:53:42
【问题描述】:
以this pdf 为例。我可以用dumppdf.py -T 1707.09725.pdf 提取目录(TOC):
<outlines>
<outline level="1" title="1 Introduction">
<dest>
<list size="5">
<ref id="513"/>
<literal>XYZ</literal>
<number>99.213</number>
<number>742.911</number>
<null/>
</list>
</dest>
<pageno>14</pageno>
</outline>
<outline level="1" title="2 Convolutional Neural Networks">
<dest>
<list size="5">
<ref id="554"/>
<literal>XYZ</literal>
<number>99.213</number>
<number>742.911</number>
<null/>
</list>
</dest>
<pageno>16</pageno>
</outline>
...
我可以用 PyPDF2 做类似的事情吗?
【问题讨论】: