<?xml version="1.0" encoding="utf-8"?>
<feed xml:lang="en" xmlns="http://www.w3.org/2005/Atom"><title>Recent changes to feature-requests</title><link href="https://sourceforge.net/p/pdftohtml/feature-requests/" rel="alternate"/><link href="https://sourceforge.net/p/pdftohtml/feature-requests/feed.atom" rel="self"/><id>https://sourceforge.net/p/pdftohtml/feature-requests/</id><updated>2011-10-07T00:15:23Z</updated><subtitle>Recent changes to feature-requests</subtitle><entry><title>Problems with converting some kind of PDF´s</title><link href="https://sourceforge.net/p/pdftohtml/feature-requests/22/" rel="alternate"/><published>2011-10-07T00:15:23Z</published><updated>2011-10-07T00:15:23Z</updated><author><name>Markus Schmitz</name><uri>https://sourceforge.net/u/maggus1/</uri></author><id>https://sourceforge.nete699b89e86cd5bea1ba6556c6ba1618db9a01cdd</id><summary type="html">&lt;div class="markdown_content"&gt;&lt;p&gt;Hi, I need help: &lt;br /&gt;
I have installed the tool on our Linux server, everthing is fine. Its works already with some koinds of PDF files - they will converted fine. Bute some kinds of PDF´s (seems new Versions) cannot be converted. There comes an error: Error (0): PDF file is damaged - attempting to reconstruct xref table... Error: Couldn't find trailer dictionary Error: Couldn't read xref table But i CAN open it so the error doesnt make sense I think the only requirements to modify are: 1. The software should be able to pass the filename to the pdf as an argument. 2. Should then be able to get 1.html document outputted. that is the same as what the pdftohtml tool is doin now but the new tool should be able to convert ALL pdf documents Has anybody the same problems? Could anybody help me and modify the tool for converting all kinds of PDF´s Of course I will PAY for the Help. with regards Markus&lt;/p&gt;&lt;/div&gt;</summary></entry><entry><title>Join words exceeding line</title><link href="https://sourceforge.net/p/pdftohtml/feature-requests/21/" rel="alternate"/><published>2007-10-04T14:22:05Z</published><updated>2007-10-04T14:22:05Z</updated><author><name>borneq</name><uri>https://sourceforge.net/u/borneq/</uri></author><id>https://sourceforge.net3df7d7b58ba90d44fbad65a1bf9475a11a5a4541</id><summary type="html">&lt;div class="markdown_content"&gt;&lt;p&gt;In &lt;a href="http://www.staff.amu.edu.pl/~insfil/problemy-dyskusje/tom3/7.pdf" rel="nofollow"&gt;http://www.staff.amu.edu.pl/~insfil/problemy-dyskusje/tom3/7.pdf&lt;/a&gt;&lt;br /&gt;
we have "konsek-" at end line, "wencje" at start next line, must be "konsekwencje"&lt;br /&gt;
"sprzecz-" + "ności" must be "sprzeczności"&lt;/p&gt;&lt;/div&gt;</summary></entry><entry><title>CHM output</title><link href="https://sourceforge.net/p/pdftohtml/feature-requests/20/" rel="alternate"/><published>2006-09-27T21:25:54Z</published><updated>2006-09-27T21:25:54Z</updated><author><name>Victor Sergienko</name><uri>https://sourceforge.net/u/singalen/</uri></author><id>https://sourceforge.net04d7411158b5fad3ad502cd8f0d4f20bd3569105</id><summary type="html">&lt;div class="markdown_content"&gt;&lt;p&gt;This might be a matter of different program using&lt;br /&gt;
PdfToHtml, though I wish it could generate CHMs (using&lt;br /&gt;
LZM library from&lt;br /&gt;
&lt;a href="http://www.speakeasy.org/~russotto/chm/\" rel="nofollow"&gt;http://www.speakeasy.org/~russotto/chm/\&lt;/a&gt;). CHM is a very&lt;br /&gt;
usable format in a lot of cases.&lt;br /&gt;
Or maybe PdfToHtml could generate a set of files ready&lt;br /&gt;
for CHM packing.&lt;/p&gt;&lt;/div&gt;</summary></entry><entry><title>Charcter Spacing and Word Spacing Info..</title><link href="https://sourceforge.net/p/pdftohtml/feature-requests/19/" rel="alternate"/><published>2005-05-16T13:07:50Z</published><updated>2005-05-16T13:07:50Z</updated><author><name>soorajchirag</name><uri>https://sourceforge.net/u/soorajchirag/</uri></author><id>https://sourceforge.net3fb1c9821cff5da39d9092a504e2af90041c2810</id><summary type="html">&lt;div class="markdown_content"&gt;&lt;p&gt;The text being retrieved is pretty good .. but the Width &lt;br /&gt;
can be acompanied with the scaling Information that &lt;br /&gt;
should be applied to that text so that it fits within that....&lt;/p&gt;
&lt;p&gt;It would be a great addition!!!&lt;/p&gt;
&lt;p&gt;Also at some places i found that the function state-&lt;br /&gt;
&amp;gt;getHorizScaling() returns 1.0000 whereas in the actual &lt;br /&gt;
document even by the look of the eye u can make out &lt;br /&gt;
that there is soe scaling definitely less than 90%... so u &lt;br /&gt;
could also add this info with the text tags...&lt;/p&gt;
&lt;p&gt;Attaching the file fw4.pdf.... look at line "Add lines from &lt;br /&gt;
1 to G............" and notice how the scaling differs from &lt;br /&gt;
the lines adjacent to it..&lt;/p&gt;&lt;/div&gt;</summary></entry><entry><title>text extraction, but still preserve pdf formatting</title><link href="https://sourceforge.net/p/pdftohtml/feature-requests/18/" rel="alternate"/><published>2005-03-18T13:40:48Z</published><updated>2005-03-18T13:40:48Z</updated><author><name>perspicuous</name><uri>https://sourceforge.net/u/perspicuous/</uri></author><id>https://sourceforge.netdcdd72bd3f4eb87e118fb6b5ecc37ba6f7675647</id><summary type="html">&lt;div class="markdown_content"&gt;&lt;p&gt;I use pdftohtml for text processing purposes, but the &lt;br /&gt;
&amp;lt;br&amp;gt; in the &amp;lt;div&amp;gt;s causes me discontinuity for the &lt;br /&gt;
paragraphs.  I could just edit the html and remove the &lt;br /&gt;
&amp;lt;br&amp;gt;, but then I'd lose the orginal pdf layout, which I &lt;br /&gt;
don't want to do.&lt;/p&gt;
&lt;p&gt;Would it be possible set an option to use the width &lt;br /&gt;
property in style to set the width, rather than use &amp;lt;br&amp;gt;?&lt;/p&gt;
&lt;p&gt;Have a look here: www.jumpdemo.com to get an idea of &lt;br /&gt;
what I'm trying to acheive.&lt;/p&gt;
&lt;p&gt;cheers.&lt;/p&gt;&lt;/div&gt;</summary></entry><entry><title>Pure text support?</title><link href="https://sourceforge.net/p/pdftohtml/feature-requests/17/" rel="alternate"/><published>2005-01-19T20:44:48Z</published><updated>2005-01-19T20:44:48Z</updated><author><name>Anonymous</name><uri>https://sourceforge.net/u/userid-None/</uri></author><id>https://sourceforge.net5f4491dd7e8d1ac75c109750dacc7510346b0316</id><summary type="html">&lt;div class="markdown_content"&gt;&lt;p&gt;This program is the closest I've comed to a pure pdf to&lt;br /&gt;
text converter. If I could just get the program to skip&lt;br /&gt;
the HTML formatting it would be perfect. Maybe a -text&lt;br /&gt;
command line argument can be added in the future?&lt;/p&gt;&lt;/div&gt;</summary></entry><entry><title>HTML 3.2 support</title><link href="https://sourceforge.net/p/pdftohtml/feature-requests/16/" rel="alternate"/><published>2004-01-29T18:12:13Z</published><updated>2004-01-29T18:12:13Z</updated><author><name>Anonymous</name><uri>https://sourceforge.net/u/userid-None/</uri></author><id>https://sourceforge.nete5a346f93969897b91c7fb52771530ea33cac830</id><summary type="html">&lt;div class="markdown_content"&gt;&lt;p&gt;Many places require HTML documents in 3.2 format, and&lt;br /&gt;
it's impossible to find any software out there that&lt;br /&gt;
will save to 3.2 these days.  This would be an&lt;br /&gt;
excellent feature in this package.&lt;/p&gt;&lt;/div&gt;</summary></entry><entry><title>Easier navigation of generated document</title><link href="https://sourceforge.net/p/pdftohtml/feature-requests/15/" rel="alternate"/><published>2004-01-14T11:04:08Z</published><updated>2004-01-14T11:04:08Z</updated><author><name>Kevin Whitefoot</name><uri>https://sourceforge.net/u/kwhitefoot/</uri></author><id>https://sourceforge.net0f07df909d7a2f2a0e009bcdd37b28f501afee6f</id><summary type="html">&lt;div class="markdown_content"&gt;&lt;p&gt;Some browsers, most notably Opera, can interpret link &lt;br /&gt;
tags in the header of a document to provide easiy &lt;br /&gt;
navigation to the next and previous documents.&lt;/p&gt;
&lt;p&gt;It would help users to navigate large documents &lt;br /&gt;
produced in complex mode if these tags were added.&lt;/p&gt;
&lt;p&gt;For instance:&lt;/p&gt;
&lt;p&gt;&amp;lt;link rel="home" href="http://www.opera.com/" &lt;br /&gt;
title="Opera front page"/&amp;gt;&lt;/p&gt;
&lt;p&gt;Opera will put a button above the window with the label &lt;br /&gt;
Home.&lt;/p&gt;
&lt;p&gt;It can do the same for next and previous, etc.  See: 12.&lt;br /&gt;
1.2 Other link relationships &amp;lt;http://www.w3.&lt;br /&gt;
org/TR/REC-html40/struct/links.html#h-12.1.2&lt;br /&gt;
&amp;gt;&lt;/p&gt;
&lt;p&gt;for the W3C recommendations.&lt;/p&gt;
&lt;p&gt;Anyway, apart from always wanting more, I'm really &lt;br /&gt;
pleased with pdftohtml.  It is already saving a lot of time &lt;br /&gt;
and effort.&lt;/p&gt;&lt;/div&gt;</summary></entry><entry><title>Vector graphics</title><link href="https://sourceforge.net/p/pdftohtml/feature-requests/14/" rel="alternate"/><published>2003-11-17T20:50:16Z</published><updated>2003-11-17T20:50:16Z</updated><author><name>Anonymous</name><uri>https://sourceforge.net/u/userid-None/</uri></author><id>https://sourceforge.net222e82ee50542407ab3f7da4f37ad1a3ca020259</id><summary type="html">&lt;div class="markdown_content"&gt;&lt;p&gt;Hi,&lt;br /&gt;
is it possible to enable the vector graphics processing?&lt;/p&gt;&lt;/div&gt;</summary></entry><entry><title>-noframes improvement</title><link href="https://sourceforge.net/p/pdftohtml/feature-requests/13/" rel="alternate"/><published>2003-10-29T17:40:10Z</published><updated>2003-10-29T17:40:10Z</updated><author><name>Anonymous</name><uri>https://sourceforge.net/u/userid-None/</uri></author><id>https://sourceforge.netccdd349b854b85b3d88e1e1d5ce79c25bbf3c259</id><summary type="html">&lt;div class="markdown_content"&gt;&lt;p&gt;-noframes works very nice in 0.36 (since frames are &lt;br /&gt;
evil), but it would be even nicer to have &amp;amp;quot;Previous&amp;amp;quot; and &lt;br /&gt;
&amp;amp;quot;Next&amp;amp;quot; links at the top and/or bottom of each page.&lt;/p&gt;
&lt;p&gt;This would also help paging thru a doc even with icky &lt;br /&gt;
frames by having the links in a consistent place on &lt;br /&gt;
each page, instead of cascading down the left hand &lt;br /&gt;
frame.&lt;/p&gt;
&lt;p&gt;Thanks!&lt;/p&gt;&lt;/div&gt;</summary></entry></feed>