<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Recent changes to feature-requests</title><link>https://sourceforge.net/p/pdftohtml/feature-requests/</link><description>Recent changes to feature-requests</description><atom:link href="https://sourceforge.net/p/pdftohtml/feature-requests/feed.rss" rel="self"/><language>en</language><lastBuildDate>Fri, 07 Oct 2011 00:15:23 -0000</lastBuildDate><atom:link href="https://sourceforge.net/p/pdftohtml/feature-requests/feed.rss" rel="self" type="application/rss+xml"/><item><title>Problems with converting some kinds of PDFs</title><link>https://sourceforge.net/p/pdftohtml/feature-requests/22/</link><description>&lt;div class="markdown_content"&gt;&lt;p&gt;Hi, I need help: &lt;br /&gt;
I have installed the tool on our Linux server and everything is fine. It already works with some kinds of PDF files, which are converted correctly. But some kinds of PDFs (apparently newer versions) cannot be converted. The following error appears:&lt;br /&gt;
Error (0): PDF file is damaged - attempting to reconstruct xref table...&lt;br /&gt;
Error: Couldn't find trailer dictionary&lt;br /&gt;
Error: Couldn't read xref table&lt;br /&gt;
But I CAN open the file, so the error doesn't make sense. I think the only requirements for the modification are:&lt;br /&gt;
1. The software should accept the PDF filename as an argument.&lt;br /&gt;
2. It should then output 1.html document.&lt;br /&gt;
That is the same as what the pdftohtml tool is doing now, but the new tool should be able to convert ALL PDF documents.&lt;br /&gt;
Does anybody have the same problem? Could anybody help me and modify the tool to convert all kinds of PDFs? Of course I will PAY for the help.&lt;br /&gt;
With regards, Markus&lt;/p&gt;&lt;/div&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Markus Schmitz</dc:creator><pubDate>Fri, 07 Oct 2011 00:15:23 -0000</pubDate><guid>https://sourceforge.nete699b89e86cd5bea1ba6556c6ba1618db9a01cdd</guid></item><item><title>Join words exceeding line</title><link>https://sourceforge.net/p/pdftohtml/feature-requests/21/</link><description>&lt;div class="markdown_content"&gt;&lt;p&gt;In &lt;a href="http://www.staff.amu.edu.pl/~insfil/problemy-dyskusje/tom3/7.pdf" rel="nofollow"&gt;http://www.staff.amu.edu.pl/~insfil/problemy-dyskusje/tom3/7.pdf&lt;/a&gt;&lt;br /&gt;
we have "konsek-" at the end of a line and "wencje" at the start of the next line; these should be joined as "konsekwencje".&lt;br /&gt;
Likewise, "sprzecz-" + "ności" should become "sprzeczności".&lt;/p&gt;&lt;/div&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">borneq</dc:creator><pubDate>Thu, 04 Oct 2007 14:22:05 -0000</pubDate><guid>https://sourceforge.net3df7d7b58ba90d44fbad65a1bf9475a11a5a4541</guid></item><item><title>CHM output</title><link>https://sourceforge.net/p/pdftohtml/feature-requests/20/</link><description>&lt;div class="markdown_content"&gt;&lt;p&gt;This might be a matter for a different program using&lt;br /&gt;
PdfToHtml, though I wish it could generate CHMs (using&lt;br /&gt;
LZM library from&lt;br /&gt;
&lt;a href="http://www.speakeasy.org/~russotto/chm/\" rel="nofollow"&gt;http://www.speakeasy.org/~russotto/chm/\&lt;/a&gt;). CHM is a very&lt;br /&gt;
usable format in a lot of cases.&lt;br /&gt;
Or maybe PdfToHtml could generate a set of files ready&lt;br /&gt;
for CHM packing.&lt;/p&gt;&lt;/div&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Victor Sergienko</dc:creator><pubDate>Wed, 27 Sep 2006 21:25:54 -0000</pubDate><guid>https://sourceforge.net04d7411158b5fad3ad502cd8f0d4f20bd3569105</guid></item><item><title>Character Spacing and Word Spacing Info</title><link>https://sourceforge.net/p/pdftohtml/feature-requests/19/</link><description>&lt;div class="markdown_content"&gt;&lt;p&gt;The text being retrieved is pretty good, but the width &lt;br /&gt;
could be accompanied by the scaling information that &lt;br /&gt;
should be applied to that text so that it fits within that width.&lt;/p&gt;
&lt;p&gt;It would be a great addition!!!&lt;/p&gt;
&lt;p&gt;Also, in some places I found that the function &lt;br /&gt;
state-&amp;gt;getHorizScaling() returns 1.0000, whereas in the actual &lt;br /&gt;
document you can see by eye &lt;br /&gt;
that there is some scaling, definitely less than 90%. So you &lt;br /&gt;
could also add this information to the text tags.&lt;/p&gt;
&lt;p&gt;I am attaching the file fw4.pdf. Look at the line "Add lines from &lt;br /&gt;
1 to G............" and notice how the scaling differs from &lt;br /&gt;
the lines adjacent to it..&lt;/p&gt;&lt;/div&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">soorajchirag</dc:creator><pubDate>Mon, 16 May 2005 13:07:50 -0000</pubDate><guid>https://sourceforge.net3fb1c9821cff5da39d9092a504e2af90041c2810</guid></item><item><title>text extraction, but still preserve pdf formatting</title><link>https://sourceforge.net/p/pdftohtml/feature-requests/18/</link><description>&lt;div class="markdown_content"&gt;&lt;p&gt;I use pdftohtml for text processing purposes, but the &lt;br /&gt;
&amp;lt;br&amp;gt; tags in the &amp;lt;div&amp;gt;s break up the &lt;br /&gt;
paragraphs.  I could just edit the HTML and remove the &lt;br /&gt;
&amp;lt;br&amp;gt; tags, but then I'd lose the original PDF layout, which I &lt;br /&gt;
don't want to do.&lt;/p&gt;
&lt;p&gt;Would it be possible to add an option that uses the width &lt;br /&gt;
property in style to set the width, rather than using &amp;lt;br&amp;gt;?&lt;/p&gt;
&lt;p&gt;Have a look here: www.jumpdemo.com to get an idea of &lt;br /&gt;
what I'm trying to achieve.&lt;/p&gt;
&lt;p&gt;cheers.&lt;/p&gt;&lt;/div&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">perspicuous</dc:creator><pubDate>Fri, 18 Mar 2005 13:40:48 -0000</pubDate><guid>https://sourceforge.netdcdd72bd3f4eb87e118fb6b5ecc37ba6f7675647</guid></item><item><title>Pure text support?</title><link>https://sourceforge.net/p/pdftohtml/feature-requests/17/</link><description>&lt;div class="markdown_content"&gt;&lt;p&gt;This program is the closest I've come to a pure PDF to&lt;br /&gt;
text converter. If I could just get the program to skip&lt;br /&gt;
the HTML formatting it would be perfect. Maybe a -text&lt;br /&gt;
command line argument can be added in the future?&lt;/p&gt;&lt;/div&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Anonymous</dc:creator><pubDate>Wed, 19 Jan 2005 20:44:48 -0000</pubDate><guid>https://sourceforge.net5f4491dd7e8d1ac75c109750dacc7510346b0316</guid></item><item><title>HTML 3.2 support</title><link>https://sourceforge.net/p/pdftohtml/feature-requests/16/</link><description>&lt;div class="markdown_content"&gt;&lt;p&gt;Many places require HTML documents in 3.2 format, and&lt;br /&gt;
it's impossible to find any software out there that&lt;br /&gt;
will save to 3.2 these days.  This would be an&lt;br /&gt;
excellent feature in this package.&lt;/p&gt;&lt;/div&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Anonymous</dc:creator><pubDate>Thu, 29 Jan 2004 18:12:13 -0000</pubDate><guid>https://sourceforge.nete5a346f93969897b91c7fb52771530ea33cac830</guid></item><item><title>Easier navigation of generated document</title><link>https://sourceforge.net/p/pdftohtml/feature-requests/15/</link><description>&lt;div class="markdown_content"&gt;&lt;p&gt;Some browsers, most notably Opera, can interpret link &lt;br /&gt;
tags in the header of a document to provide easy &lt;br /&gt;
navigation to the next and previous documents.&lt;/p&gt;
&lt;p&gt;It would help users to navigate large documents &lt;br /&gt;
produced in complex mode if these tags were added.&lt;/p&gt;
&lt;p&gt;For instance:&lt;/p&gt;
&lt;p&gt;&amp;lt;link rel="home" href="http://www.opera.com/" &lt;br /&gt;
title="Opera front page"/&amp;gt;&lt;/p&gt;
&lt;p&gt;Opera will put a button above the window with the label &lt;br /&gt;
Home.&lt;/p&gt;
&lt;p&gt;It can do the same for next and previous, etc.  See 12.1.2 Other link relationships &lt;br /&gt;
&amp;lt;http://www.w3.org/TR/REC-html40/struct/links.html#h-12.1.2&amp;gt; &lt;br /&gt;
for the W3C recommendations.&lt;/p&gt;
&lt;p&gt;Anyway, apart from always wanting more, I'm really &lt;br /&gt;
pleased with pdftohtml.  It is already saving a lot of time &lt;br /&gt;
and effort.&lt;/p&gt;&lt;/div&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Kevin Whitefoot</dc:creator><pubDate>Wed, 14 Jan 2004 11:04:08 -0000</pubDate><guid>https://sourceforge.net0f07df909d7a2f2a0e009bcdd37b28f501afee6f</guid></item><item><title>Vector graphics</title><link>https://sourceforge.net/p/pdftohtml/feature-requests/14/</link><description>&lt;div class="markdown_content"&gt;&lt;p&gt;Hi,&lt;br /&gt;
is it possible to enable the vector graphics processing?&lt;/p&gt;&lt;/div&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Anonymous</dc:creator><pubDate>Mon, 17 Nov 2003 20:50:16 -0000</pubDate><guid>https://sourceforge.net222e82ee50542407ab3f7da4f37ad1a3ca020259</guid></item><item><title>-noframes improvement</title><link>https://sourceforge.net/p/pdftohtml/feature-requests/13/</link><description>&lt;div class="markdown_content"&gt;&lt;p&gt;-noframes works very nice in 0.36 (since frames are &lt;br /&gt;
evil), but it would be even nicer to have "Previous" and &lt;br /&gt;
"Next" links at the top and/or bottom of each page.&lt;/p&gt;
&lt;p&gt;This would also help paging thru a doc even with icky &lt;br /&gt;
frames by having the links in a consistent place on &lt;br /&gt;
each page, instead of cascading down the left hand &lt;br /&gt;
frame.&lt;/p&gt;
&lt;p&gt;Thanks!&lt;/p&gt;&lt;/div&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Anonymous</dc:creator><pubDate>Wed, 29 Oct 2003 17:40:10 -0000</pubDate><guid>https://sourceforge.netccdd349b854b85b3d88e1e1d5ce79c25bbf3c259</guid></item></channel></rss>