python如何使用最简单的方式将PDF转换成Word？-Toy模板网

这篇具有很好参考价值的文章主要介绍了python如何使用最简单的方式将PDF转换成Word？。希望对大家有所帮助。如果存在错误或未考虑完全的地方，请大家不吝赐教，您也可以点击"举报违法"按钮提交疑问。

由于PDF的文件大多都是只读文件，有时候为了满足可以编辑的需要通常可以将PDF文件直接转换成Word文件进行操作。

看了网络上面的python转换PDF文件为Word的相关文章感觉都比较复杂，并且关于一些图表的使用还要进行特殊的处理。

本篇文章主要讲解关于如何使用python是实现将PDF转换成Word的业务过程，这次没有使用GUI应用的操作。

由于可能存在版本冲突的问题，这里将开发过程中需要使用的python非标准库的版本列举出来。

python内核版本：3.6.8
PyMuPDF版本：1.18.17
pdf2docx版本：0.5.1

可以选择pip的方式对使用到的python非标准库进行安装。

pip install PyMuPDF==1.18.17
pip install pdf2docx==0.5.1

完成上述的python依赖库安装以后，将pdf2docx导入到我们的代码块中。

# Importing the Converter class from the pdf2docx module.
from pdf2docx import Converter

然后，编写业务函数的代码块，新建一个pdfToWord函数来处理转换逻辑，主要就几行代码可以实现比较简单。

def pdfToWord(pdf_file_path=None, word_file_path=None):    
"""    
It takes a pdf file path and a word file path as input, and converts the pdf file to a word file.    
:param pdf_file_path: The path to the PDF file you want to convert    
:param word_file_path: The path to the word file that you want to create    
"""    
# Creating a Converter object.    
converter_ = Converter(pdf_file_path)    
# The `convert` method takes the path to the word file that you want to create, and the start and end pages of the PDF    
# file that you want to convert.    
converter_.convert(word_file_path, start=0, end=None)    
converter_.close()

最后，使用main函数调用pdfToWord函数可以直接完成文档格式的转换。文章来源地址https://www.toymoban.com/news/detail-504878.html

# A special variable in Python that evaluates to `True` if the module is being run directly by the Python interpreter, and
# `False` if it has been imported by another module.
if __name__ == '__main__':    
     pdfToWord('D:/test-data-work/test_pdf.pdf', 'D:/test-data-work/test_pdf.docx')
# Parsing Page 2: 2/5...Ignore Line "∑" due to overlap
# Ignore Line "∑" due to overlap
# Ignore Line "ç" due to overlap
# Ignore Line "Ａ" due to overlap
# Ignore Line "ｉ ＝１" due to overlap
# Ignore Line "æ" due to overlap
# Parsing Page 5: 5/5...
# Creating Page 5: 5/5...
# --------------------------------------------------
# Terminated in 3.2503201s.

到了这里，关于python如何使用最简单的方式将PDF转换成Word？的文章就介绍完了。如果您还想了解更多内容，请在右上角搜索TOY模板网以前的文章或继续浏览下面的相关文章，希望大家以后多多支持TOY模板网！