반응형
Recent Posts
Recent Comments
«   2024/05   »
1 2 3 4
5 6 7 8 9 10 11
12 13 14 15 16 17 18
19 20 21 22 23 24 25
26 27 28 29 30 31
Archives
Today
Total
관리 메뉴

Synchronized's Database

Convert PDF to docx using python 본문

정보보안이야기/프로그래밍

Convert PDF to docx using python

커피를마시면잠이안와 2022. 8. 17. 20:55
반응형
>>> from pdf2docx import Converter
>>> pdf_file=('sample.pdf')
>>> docx_file='sample.docx'
>>> cv=Converter(pdf_file)
>>> cv.convert(docx_file)
[INFO] Start to convert sample.pdf
[INFO] [1/4] Opening document...
[INFO] [2/4] Analyzing document...
[WARNING] Ignore Line "e.g. novel attacks" due to overlap
[WARNING] Ignore Line "e.g. learning-based IDS" due to overlap
[WARNING] Ignore Line "P1" due to overlap
[WARNING] Ignore Line "P2" due to overlap
[WARNING] Ignore Line "P3" due to overlap
[WARNING] Ignore Line "P4" due to overlap
[WARNING] Ignore Line "P5" due to overlap
[WARNING] Ignore Line "P6" due to overlap
[WARNING] Ignore Line "P7" due to overlap
[WARNING] Ignore Line "P8" due to overlap
[WARNING] Ignore Line "P9" due to overlap
[WARNING] Ignore Line "P10" due to overlap
[WARNING] Ignore Line "Data collection " due to overlap
[WARNING] Ignore Line "and labeling" due to overlap
[WARNING] Ignore Line "Model design " due to overlap
[WARNING] Ignore Line "and learning" due to overlap
[WARNING] Ignore Line "Model " due to overlap
[WARNING] Ignore Line "operation" due to overlap
[INFO] [3/4] Parsing pages...
[INFO] (1/18) Page 1
[INFO] (2/18) Page 2
[INFO] (3/18) Page 3
[INFO] (4/18) Page 4
[INFO] (5/18) Page 5
[INFO] (6/18) Page 6
[INFO] (7/18) Page 7
[INFO] (8/18) Page 8
[INFO] (9/18) Page 9
[INFO] (10/18) Page 10
[INFO] (11/18) Page 11
[INFO] (12/18) Page 12
[INFO] (13/18) Page 13
[INFO] (14/18) Page 14
[INFO] (15/18) Page 15
[INFO] (16/18) Page 16
[INFO] (17/18) Page 17
[INFO] (18/18) Page 18
[INFO] [4/4] Creating pages...
[INFO] (1/18) Page 1
[INFO] (2/18) Page 2
[INFO] (3/18) Page 3
[INFO] (4/18) Page 4
[INFO] (5/18) Page 5
[INFO] (6/18) Page 6
[INFO] (7/18) Page 7
[INFO] (8/18) Page 8
[INFO] (9/18) Page 9
[INFO] (10/18) Page 10
[INFO] (11/18) Page 11
[ERROR] Ignore page 11 due to making page error: value must be in range 0 to 18446744073709551615 inclusive, got -1556
[INFO] (12/18) Page 12
[INFO] (13/18) Page 13
[INFO] (14/18) Page 14
[INFO] (15/18) Page 15
[INFO] (16/18) Page 16
[INFO] (17/18) Page 17
[INFO] (18/18) Page 18
[ERROR] Ignore page 18 due to making page error: list index out of range
[INFO] Terminated in 21.34s.
>>> cv.close()

 

Case of useage?

 

PDF Converted docx

 

반응형