bookscan_utils

eswyft/bookscan_utils

Fork 0

Go to file

rootless e5169586a0 initial

2023-10-10 02:41:47 +03:00

pagenum

initial

2023-10-10 02:41:47 +03:00

.gitignore

initial

2023-10-10 02:41:47 +03:00

gaps.py

initial

2023-10-10 02:41:47 +03:00

NOTES

initial

2023-10-10 02:41:47 +03:00

pagenum-mass.py

initial

2023-10-10 02:41:47 +03:00

pagenum-probe.py

initial

2023-10-10 02:41:47 +03:00

README

initial

2023-10-10 02:41:47 +03:00

rename16.py

initial

2023-10-10 02:41:47 +03:00

README

SUMMARY

	This is a collection of tools that helps me digitizing books.

	In particular, it helps assembling a bunch of random page scans into a book
	with correct page order, mainly by using OCR and text (number) recognition.

	I use it to prepare my book releases on torrents.


SYSTEM REQUIREMENTS

	Theoretically should work on any system that supports Python 3.9+ and has
	required dependencies, but might need some minor modifications in the code.

	Tested only on FreeBSD 13.


DEPENDENCIES

	System utilities:

	- tesseract
	- pdftoppm

	Python packages:

	- pytesseract
	- Pillow


AUTHORS

	rootless (c) 2023


LICENSE

	BSD-2-Clause