<?xml version="1.0" encoding="utf-8"?>
<feed xml:lang="en" xmlns="http://www.w3.org/2005/Atom"><title>Recent changes to support-requests</title><link href="https://sourceforge.net/p/gscan2pdf/support-requests/" rel="alternate"/><link href="https://sourceforge.net/p/gscan2pdf/support-requests/feed.atom" rel="self"/><id>https://sourceforge.net/p/gscan2pdf/support-requests/</id><updated>2026-01-27T15:41:05.253000Z</updated><subtitle>Recent changes to support-requests</subtitle><entry><title>Starting scan creates error</title><link href="https://sourceforge.net/p/gscan2pdf/support-requests/71/" rel="alternate"/><published>2026-01-27T15:41:05.253000Z</published><updated>2026-01-27T15:41:05.253000Z</updated><author><name>Chris Deuchar</name><uri>https://sourceforge.net/u/chrisnd/</uri></author><id>https://sourceforge.net0179f190a16ca7093f467a995b39874cf80fac76</id><summary type="html">&lt;div class="markdown_content"&gt;&lt;p&gt;Hi, I had been using gscan2pdf successfully under Ubuntu linux for some time but for the last few months I have been getting an error when clicking the 'Scan' button.  Otherwise, the program starts fine and the scanner is correctly recognised (Epson Perfection 1250/photo)&lt;br/&gt;
The error is:&lt;br/&gt;
"open of device plustek:libusb:002:005 failed: Error during device I/O"&lt;/p&gt;
&lt;p&gt;From thereon nothing happens and I so have just gone away and used a different program(!)&lt;br/&gt;
But I would like to get this fixed please.&lt;/p&gt;
&lt;p&gt;Chris D&lt;br/&gt;
Ps. I have tried doing a complete removal and reinstall from the Ubuntu installer but without any change.&lt;/p&gt;&lt;/div&gt;</summary></entry><entry><title>#70 gscan2pdf add CR after each word using Tesseract</title><link href="https://sourceforge.net/p/gscan2pdf/support-requests/70/?limit=25#d089" rel="alternate"/><published>2025-11-07T13:08:28.219000Z</published><updated>2025-11-07T13:08:28.219000Z</updated><author><name>Jeffrey Ratcliffe</name><uri>https://sourceforge.net/u/ra28145/</uri></author><id>https://sourceforge.nete1fbf9d5716d8b0854087a08d7228ab64a341ff8</id><summary type="html">&lt;div class="markdown_content"&gt;&lt;p&gt;OK. But it is Okular that is inserting the CR characters, not gscan2pdf.&lt;/p&gt;
&lt;p&gt;The difference is the formatting. When Writer created the PDF, it created a single text box. Okular can see that it is all one text box, and gives you the text you expect  This was lost when converting to JPG. OCR created a box per word, in order to get the word positions correct. OCR does not give much hint of the fonts used, so these must be guessed.&lt;/p&gt;
&lt;p&gt;It would be possible to embed the text in the PDF differently, but then the positions would be wrong.&lt;/p&gt;&lt;/div&gt;</summary></entry><entry><title>#70 gscan2pdf add CR after each word using Tesseract</title><link href="https://sourceforge.net/p/gscan2pdf/support-requests/70/?limit=25#11b9" rel="alternate"/><published>2025-11-05T14:04:26.216000Z</published><updated>2025-11-05T14:04:26.216000Z</updated><author><name>Pascal </name><uri>https://sourceforge.net/u/pascal06/</uri></author><id>https://sourceforge.net8d70ce1674298a6468aace4f49695d90af0457ed</id><summary type="html">&lt;div class="markdown_content"&gt;&lt;p&gt;Hello,&lt;br/&gt;
to be clearer I think it would be better to make small test files. So here they are. I wrote a small text with Writer and then exported it to pdf: Test_gscan2pdf.pdf&lt;br/&gt;
Then, I converted it to jpg file (with The Gimp): Test_gscan2pdf.jpg&lt;br/&gt;
Then, imported into gscan2pdf, did OCR, then exported again to pdf: Pascal06_Test_gscan2pdf.odt_2025-11-05.pdf&lt;br/&gt;
Then, open the original pdf (Test_gscan2pdf.pdf ), select first paragraph and past here:&lt;/p&gt;
&lt;div class="codehilite"&gt;&lt;pre&gt;&lt;span&gt;&lt;/span&gt;&lt;code&gt;“A Hare one day ridiculed the short feet and slow pace of the Tortoise, who replied, laughing:
“Though you be swift as the wind, I will beat you in a race.”
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;

&lt;p&gt;Finally, open the generated pdf (Pascal06_Test_gscan2pdf.odt_2025-11-05.pdf), select first paragraph and past here:&lt;/p&gt;
&lt;div class="codehilite"&gt;&lt;pre&gt;&lt;span&gt;&lt;/span&gt;&lt;code&gt;“A Hare
 one
 day ridiculed
 the short
 feet and slow
 pace
 of the Tortoise,
 who
 replied, laughing:
“Though you be swift
 as
 the wind, I will
 beat
 you
 in
 a
 race.”
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;

&lt;p&gt;For me and IMHO, the two texts should be identical...&lt;/p&gt;&lt;/div&gt;</summary></entry><entry><title>#70 gscan2pdf add CR after each word using Tesseract</title><link href="https://sourceforge.net/p/gscan2pdf/support-requests/70/?limit=25#349c" rel="alternate"/><published>2025-11-02T11:48:21.278000Z</published><updated>2025-11-02T11:48:21.278000Z</updated><author><name>Pascal </name><uri>https://sourceforge.net/u/pascal06/</uri></author><id>https://sourceforge.net3ceb36eda3e55ca9d1e479a7799e6164cb947697</id><summary type="html">&lt;div class="markdown_content"&gt;&lt;p&gt;Hello,&lt;br/&gt;
IMHO, I think the problem doesn't come from Okular.&lt;br/&gt;
Following my given example, if you look into the gscan2pdf OCR recognition tab:&lt;br/&gt;
&lt;a href="https://2plz.fr/lutim/gallery#TggvwUqP/DKILVETY.png" rel="nofollow"&gt;https://2plz.fr/lutim/gallery#TggvwUqP/DKILVETY.png&lt;/a&gt;&lt;br/&gt;
Each word are "separated" so that gives a line feed after each word. It is definitely not the same thing as the raw text output&lt;/p&gt;&lt;/div&gt;</summary></entry><entry><title>#70 gscan2pdf add CR after each word using Tesseract</title><link href="https://sourceforge.net/p/gscan2pdf/support-requests/70/?limit=25#a666" rel="alternate"/><published>2025-11-02T09:26:54.282000Z</published><updated>2025-11-02T09:26:54.282000Z</updated><author><name>Jeffrey Ratcliffe</name><uri>https://sourceforge.net/u/ra28145/</uri></author><id>https://sourceforge.netaf58bc7cec7bff3a7eefd1aa845fe6e73499ec81</id><summary type="html">&lt;div class="markdown_content"&gt;&lt;p&gt;OK. I misunderstood. But it is going to be difficult for me to influence how Okular formats text it places into the clipboard.&lt;/p&gt;&lt;/div&gt;</summary></entry><entry><title>#70 gscan2pdf add CR after each word using Tesseract</title><link href="https://sourceforge.net/p/gscan2pdf/support-requests/70/?limit=25#2da6" rel="alternate"/><published>2025-11-01T20:01:19.660000Z</published><updated>2025-11-01T20:01:19.660000Z</updated><author><name>Pascal </name><uri>https://sourceforge.net/u/pascal06/</uri></author><id>https://sourceforge.net602cb6cc1617d8c45e04a4d50b00c183556e164c</id><summary type="html">&lt;div class="markdown_content"&gt;&lt;p&gt;Thanks, but it's not a solution for me. The best would be that when I select all the text in a pdf and then past in an other document, it would keep the same number of line feed...&lt;/p&gt;&lt;/div&gt;</summary></entry><entry><title>#70 gscan2pdf add CR after each word using Tesseract</title><link href="https://sourceforge.net/p/gscan2pdf/support-requests/70/?limit=25#6505" rel="alternate"/><published>2025-11-01T17:44:32.308000Z</published><updated>2025-11-01T17:44:32.308000Z</updated><author><name>Jeffrey Ratcliffe</name><uri>https://sourceforge.net/u/ra28145/</uri></author><id>https://sourceforge.net57036eeb2e2f66fe7e1a412998b7f22f297eac59</id><summary type="html">&lt;div class="markdown_content"&gt;&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;status&lt;/strong&gt;: open --&amp;gt; closed&lt;/li&gt;
&lt;/ul&gt;&lt;/div&gt;</summary></entry><entry><title>#70 gscan2pdf add CR after each word using Tesseract</title><link href="https://sourceforge.net/p/gscan2pdf/support-requests/70/?limit=25#2c41/faf5" rel="alternate"/><published>2025-11-01T17:17:27.599000Z</published><updated>2025-11-01T17:17:27.599000Z</updated><author><name>Pascal </name><uri>https://sourceforge.net/u/pascal06/</uri></author><id>https://sourceforge.netaba69556ba272af3fa07e30a0d04afb9f7fd208b</id><summary type="html">&lt;div class="markdown_content"&gt;&lt;p&gt;Hello Jeffrey,&lt;br/&gt;
thanks a lot for your reply. No worry :)&lt;br/&gt;
Fatality I just scanned a bunch of documents now and I tested to "save as text". Indeed, the job is great compared to the same document in pdf. Very weird...&lt;/p&gt;&lt;/div&gt;</summary></entry><entry><title>#70 gscan2pdf add CR after each word using Tesseract</title><link href="https://sourceforge.net/p/gscan2pdf/support-requests/70/?limit=25#2c41" rel="alternate"/><published>2025-10-21T20:20:06.557000Z</published><updated>2025-10-21T20:20:06.557000Z</updated><author><name>Jeffrey Ratcliffe</name><uri>https://sourceforge.net/u/ra28145/</uri></author><id>https://sourceforge.net73bd1e539bd3fe3be6a0707f21e03fa9661285e0</id><summary type="html">&lt;div class="markdown_content"&gt;&lt;p&gt;Apologies for the lack of response.&lt;/p&gt;
&lt;p&gt;gscan2pdf also offers a "Save as text" option. Does that do a better job?&lt;/p&gt;&lt;/div&gt;</summary></entry><entry><title>#70 gscan2pdf add CR after each word using Tesseract</title><link href="https://sourceforge.net/p/gscan2pdf/support-requests/70/?limit=25#120c" rel="alternate"/><published>2025-10-21T16:29:03.642000Z</published><updated>2025-10-21T16:29:03.642000Z</updated><author><name>Pascal </name><uri>https://sourceforge.net/u/pascal06/</uri></author><id>https://sourceforge.nete536e60858b6e1de6a8d38e520f05f517f88b151</id><summary type="html">&lt;div class="markdown_content"&gt;&lt;p&gt;Hello,&lt;br/&gt;
nobody here ?&lt;/p&gt;&lt;/div&gt;</summary></entry></feed>