diff options
author | V3n3RiX <venerix@redcorelinux.org> | 2017-10-09 18:53:29 +0100 |
---|---|---|
committer | V3n3RiX <venerix@redcorelinux.org> | 2017-10-09 18:53:29 +0100 |
commit | 4f2d7949f03e1c198bc888f2d05f421d35c57e21 (patch) | |
tree | ba5f07bf3f9d22d82e54a462313f5d244036c768 /app-text/pdfsandwich/metadata.xml |
reinit the tree, so we can have metadata
Diffstat (limited to 'app-text/pdfsandwich/metadata.xml')
-rw-r--r-- | app-text/pdfsandwich/metadata.xml | 23 |
1 files changed, 23 insertions, 0 deletions
diff --git a/app-text/pdfsandwich/metadata.xml b/app-text/pdfsandwich/metadata.xml new file mode 100644 index 000000000000..0fb15c19e847 --- /dev/null +++ b/app-text/pdfsandwich/metadata.xml @@ -0,0 +1,23 @@ +<?xml version="1.0" encoding="UTF-8"?> +<!DOCTYPE pkgmetadata SYSTEM "http://www.gentoo.org/dtd/metadata.dtd"> +<pkgmetadata> + <!-- maintainer-needed --> + <longdescription> +pdfsandwich generates "sandwich" OCR pdf files, i.e. pdf files which +contain only images (no text) will be processed by optical character +recognition (OCR) and the text will be added to each page invisibly +"behind" the images. + +pdfsandwich is a command line tool which is supposed to be useful to +OCR scanned books or journals. It is able to recognize the page layout +even for multicolumn text. + +Essentially, pdfsandwich is a wrapper script which calls the following +binaries: convert, cuneiform, gs, and hocr2pdf. It is known to run on +Unix systems and has been tested on Linux and MacOS X. It supports +parallel processing on multiprocessor systems. +</longdescription> + <upstream> + <remote-id type="sourceforge">pdfsandwich</remote-id> + </upstream> +</pkgmetadata> |