[Erp5-report] r40927 nicolas - /erp5/trunk/products/PortalTransforms/transforms/pdf_to_text.py

nobody at svn.erp5.org nobody at svn.erp5.org
Tue Nov 30 14:56:47 CET 2010


Author: nicolas
Date: Tue Nov 30 14:56:47 2010
New Revision: 40927

URL: http://svn.erp5.org?rev=40927&view=rev
Log:
Even if manpage of pdftotext mention output encoding is by default utf-8,
It might happen that sometimes another codec is used.
So hardcode output encoding format.


Modified:
    erp5/trunk/products/PortalTransforms/transforms/pdf_to_text.py

Modified: erp5/trunk/products/PortalTransforms/transforms/pdf_to_text.py
URL: http://svn.erp5.org/erp5/trunk/products/PortalTransforms/transforms/pdf_to_text.py?rev=40927&r1=40926&r2=40927&view=diff
==============================================================================
--- erp5/trunk/products/PortalTransforms/transforms/pdf_to_text.py [utf8] (original)
+++ erp5/trunk/products/PortalTransforms/transforms/pdf_to_text.py [utf8] Tue Nov 30 14:56:47 2010
@@ -22,7 +22,7 @@ class pdf_to_text(subprocesstransform):
     __version__ = '2004-07-02.01'
 
     binaryName = "pdftotext"
-    binaryArgs = "-layout -nopgbrk %(infile)s -"
+    binaryArgs = "-enc UTF-8 -layout -nopgbrk %(infile)s -"
     useStdin = True
 
 class old_pdf_to_text(commandtransform):



More information about the Erp5-report mailing list