Python PdfFileReader.getOutlines方法代码示例 您所在的位置:网站首页 outlines怎么读 Python PdfFileReader.getOutlines方法代码示例

Python PdfFileReader.getOutlines方法代码示例

2024-07-11 12:25| 来源: 网络整理| 查看: 265

本文整理汇总了Python中PyPDF2.PdfFileReader.getOutlines方法的典型用法代码示例。如果您正苦于以下问题:Python PdfFileReader.getOutlines方法的具体用法?Python PdfFileReader.getOutlines怎么用?Python PdfFileReader.getOutlines使用的例子?那么恭喜您, 这里精选的方法代码示例或许可以为您提供帮助。您也可以进一步了解该方法所在PyPDF2.PdfFileReader的用法示例。

在下文中一共展示了PdfFileReader.getOutlines方法的4个代码示例,这些例子默认根据受欢迎程度排序。您可以为喜欢或者感觉有用的代码点赞,您的评价将有助于系统推荐出更棒的Python代码示例。

示例1: page_extract # 需要导入模块: from PyPDF2 import PdfFileReader [as 别名] # 或者: from PyPDF2.PdfFileReader import getOutlines [as 别名] def page_extract(): PDF_IN = PdfFileReader(open(PDF_DIR, 'rb')) pg_id_num_map = page_id_to_num(PDF_IN) outlines = PDF_IN.getOutlines() bmrks = bookmarks(outlines, pg_id_num_map) png_list = [] for i in range(int(start) - 1, int(end)): output = PdfFileWriter() output.addPage(PDF_IN.getPage(i)) base, name_ext = os.path.split(PDF_DIR) name, ext = os.path.splitext(name_ext) PDF_OUT = '{}{}'.format(TMP_DIR, '{}-{}{}'.format(name, str(i).zfill(6), ext)) with open(PDF_OUT, 'wb') as outputStream: output.write(outputStream) png_list.append(gs_pdf_to_png(PDF_OUT)) png_list.append(closest(bmrks, i+1)) os.remove(PDF_OUT) png_list = group(png_list, 4) for tup in png_list: make_cards(tup[0], tup[2], tup[3]) print "Current Tag Processed: " + tup[3]开发者ID:DrLulz,项目名称:AnkiFlash,代码行数:33,代码来源:flashcardPDF_anki.py 示例2: bookmarks # 需要导入模块: from PyPDF2 import PdfFileReader [as 别名] # 或者: from PyPDF2.PdfFileReader import getOutlines [as 别名] def bookmarks(PDF): def page_id_to_num(pdf, pages=None, _result=None, _num_pages=None): if _result is None: _result = {} if pages is None: _num_pages = [] pages = pdf.trailer["/Root"].getObject()["/Pages"].getObject() t = pages["/Type"] if t == "/Pages": for page in pages["/Kids"]: _result[page.idnum] = len(_num_pages) page_id_to_num(pdf, page.getObject(), _result, _num_pages) elif t == "/Page": _num_pages.append(1) return _result def bookmarks(outlines, pg_id_num_map, result=None): if result is None: result = [] if type(outlines) == list: for outline in outlines: result = bookmarks(outline, pg_id_num_map, result) elif type(outlines) == PyPDF2.pdf.Destination: result.append((pg_id_num_map[outlines.page.idnum]+1, outlines['/Title'])) return result PDF_IN = PdfFileReader(open(PDF, 'rb')) pg_id_num_map = page_id_to_num(PDF_IN) outlines = PDF_IN.getOutlines() outlines = [item for item in outlines if not type(item) == list] outlines = [item for item in outlines if not item['/Title'] in exclude] bmrks = bookmarks(outlines, pg_id_num_map) it = iter(bmrks[1:]) TOC = [] for x in bmrks: try: TOC.append( (x[0], (next(it)[0] - 1), x[1]) ) except: pass return TOC开发者ID:DrLulz,项目名称:AnkiFlash,代码行数:59,代码来源:pathophys_of_dz.py 示例3: extract_bookmarks # 需要导入模块: from PyPDF2 import PdfFileReader [as 别名] # 或者: from PyPDF2.PdfFileReader import getOutlines [as 别名] def extract_bookmarks(pdf_filename): file = open(pdf_filename, 'rb') pdf = PdfFileReader(file) map_ = _construct_page_id_to_page_number_map(pdf) outlines = pdf.getOutlines() list_ = [] _recursive_extract_bookmarks(outlines, map_, list_) file.close() return list_开发者ID:nathanielove,项目名称:pdf-server,代码行数:15,代码来源:pdf.py 示例4: type # 需要导入模块: from PyPDF2 import PdfFileReader [as 别名] # 或者: from PyPDF2.PdfFileReader import getOutlines [as 别名] result = [] if type(outlines) == list: for outline in outlines: result = bookmarks(outline, pg_id_num_map, result) elif type(outlines) == PyPDF2.pdf.Destination: result.append((pg_id_num_map[outlines.page.idnum]+1, outlines['/Title'])) return result PDF_IN = PdfFileReader(open(PDF, 'rb')) pg_id_num_map = page_id_to_num(PDF_IN) outlines = PDF_IN.getOutlines() outlines = [item for item in outlines if not type(item) == list] outlines = [item for item in outlines if not item['/Title'] in exclude] bmrks = bookmarks(outlines, pg_id_num_map) it = iter(bmrks[1:]) TOC = [] for x in bmrks: try: TOC.append( (x[0], (next(it)[0] - 1), x[1]) ) except: pass print TOC开发者ID:DrLulz,项目名称:AnkiFlash,代码行数:33,代码来源:check_bookmarks.py

注:本文中的PyPDF2.PdfFileReader.getOutlines方法示例由纯净天空整理自Github/MSDocs等开源代码及文档管理平台,相关代码片段筛选自各路编程大神贡献的开源项目,源码版权归原作者所有,传播和使用请参考对应项目的License;未经允许,请勿转载。



【本文地址】

公司简介

联系我们

今日新闻

    推荐新闻

    专题文章
      CopyRight 2018-2019 实验室设备网 版权所有