DOT-Files/Gentoo/gentooamd/home/user/.tldrc/tldr/pages/common/ocrmypdf.md

# ocrmypdf

> Generate a searchable PDF or PDF/A from a scanned PDF or an image of text.
> More information: <https://ocrmypdf.readthedocs.io/en/latest/cookbook.html>.

- Create a new searchable PDF/A file from a scanned PDF or image file:

`ocrmypdf {{path/to/input}} {{path/to/output.pdf}}`

- Skip pages of a mixed-format input PDF file that already contain text:

`ocrmypdf --skip-text {{path/to/input.pdf}} {{path/to/output.pdf}}`

- Clean, de-skew, and rotate pages of a poor scan:

`ocrmypdf --clean --deskew --rotate-pages {{path/to/input.pdf}} {{path/to/output.pdf}}`

- Perform lossy optimization on a PDF without performing any OCR:

`ocrmypdf --tesseract-timeout 0 --optimize 2 --skip-text {{path/to/input.pdf}} {{path/to/output.pdf}}`

- Set the metadata of a searchable PDF file:

`ocrmypdf --title "{{title}}" --author "{{author}}" --subject "{{subject}}" --keywords "{{keyword; key phrase; ...}}" {{path/to/input.pdf}} {{path/to/output.pdf}}`

- Display help:

`ocrmypdf --help`
Re-initialze to remove the 1GB binary lmstudio 2025-03-14 21:59:59 -06:00			`# ocrmypdf`

			`> Generate a searchable PDF or PDF/A from a scanned PDF or an image of text.`
			`> More information: <https://ocrmypdf.readthedocs.io/en/latest/cookbook.html>.`

			`- Create a new searchable PDF/A file from a scanned PDF or image file:`

Automated updated process of shell scripts and DOT Files (gentooamd) 2025-09-14 18:48:49 -06:00			`ocrmypdf {{path/to/input}} {{path/to/output.pdf}}`
Re-initialze to remove the 1GB binary lmstudio 2025-03-14 21:59:59 -06:00
			`- Skip pages of a mixed-format input PDF file that already contain text:`

			`ocrmypdf --skip-text {{path/to/input.pdf}} {{path/to/output.pdf}}`

			`- Clean, de-skew, and rotate pages of a poor scan:`

Automated updated process of shell scripts and DOT Files (gentooamd) 2025-09-14 18:48:49 -06:00			`ocrmypdf --clean --deskew --rotate-pages {{path/to/input.pdf}} {{path/to/output.pdf}}`

			`- Perform lossy optimization on a PDF without performing any OCR:`

			`ocrmypdf --tesseract-timeout 0 --optimize 2 --skip-text {{path/to/input.pdf}} {{path/to/output.pdf}}`
Re-initialze to remove the 1GB binary lmstudio 2025-03-14 21:59:59 -06:00
Automated updated process of shell scripts and DOT Files (gentooamd) 2025-09-14 18:48:49 -06:00			`- Set the metadata of a searchable PDF file:`
Re-initialze to remove the 1GB binary lmstudio 2025-03-14 21:59:59 -06:00
Automated updated process of shell scripts and DOT Files (gentooamd) 2025-09-14 18:48:49 -06:00			`ocrmypdf --title "{{title}}" --author "{{author}}" --subject "{{subject}}" --keywords "{{keyword; key phrase; ...}}" {{path/to/input.pdf}} {{path/to/output.pdf}}`
Re-initialze to remove the 1GB binary lmstudio 2025-03-14 21:59:59 -06:00
			`- Display help:`

			`ocrmypdf --help`