<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Recent changes to 20: Batch process and Viewing results</title><link>https://sourceforge.net/p/pdftohtml/support-requests/20/</link><description>Recent changes to 20: Batch process and Viewing results</description><atom:link href="https://sourceforge.net/p/pdftohtml/support-requests/20/feed.rss" rel="self"/><language>en</language><lastBuildDate>Tue, 07 Dec 2004 18:28:14 -0000</lastBuildDate><atom:link href="https://sourceforge.net/p/pdftohtml/support-requests/20/feed.rss" rel="self" type="application/rss+xml"/><item><title>Batch process and Viewing results</title><link>https://sourceforge.net/p/pdftohtml/support-requests/20/</link><description>&lt;div class="markdown_content"&gt;&lt;p&gt;The Google Search Appliance uses pdftohtml version &lt;br /&gt;
0.33a. 0.33a is unable to read some OCR'ed files &lt;br /&gt;
and therefore the appliance does not index them &lt;br /&gt;
(since they are blank).  We have approximately &lt;br /&gt;
10,000 files that we want to run thru the 0.33a.&lt;br /&gt;
Those that are blank will be re-scanned with a &lt;br /&gt;
different software.&lt;/p&gt;
&lt;p&gt;Do you know of a way to use your software to &lt;br /&gt;
process multiple files?  Additionally, how can you &lt;br /&gt;
tell if there are blank HTML files, other than &lt;br /&gt;
opening and viewing each converted file?&lt;/p&gt;
&lt;p&gt;Thank you,&lt;br /&gt;
wongn@metro.net&lt;/p&gt;&lt;/div&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Anonymous</dc:creator><pubDate>Tue, 07 Dec 2004 18:28:14 -0000</pubDate><guid>https://sourceforge.netdde7016b69f51d073f0dbc29352043a522e41593</guid></item></channel></rss>