Recent changes to support-requests

Starting scan creates error

2026-01-27T15:41:05.253000Z

Hi, I had been using gscan2pdf successfully under Ubuntu linux for some time but for the last few months I have been getting an error when clicking the 'Scan' button. Otherwise, the program starts fine and the scanner is correctly recognised (Epson Perfection 1250/photo)
The error is:
"open of device plustek:libusb:002:005 failed: Error during device I/O"

From thereon nothing happens and I so have just gone away and used a different program(!)
But I would like to get this fixed please.

Chris D
Ps. I have tried doing a complete removal and reinstall from the Ubuntu installer but without any change.

#70 gscan2pdf add CR after each word using Tesseract

2025-11-07T13:08:28.219000Z

OK. But it is Okular that is inserting the CR characters, not gscan2pdf.

The difference is the formatting. When Writer created the PDF, it created a single text box. Okular can see that it is all one text box, and gives you the text you expect This was lost when converting to JPG. OCR created a box per word, in order to get the word positions correct. OCR does not give much hint of the fonts used, so these must be guessed.

It would be possible to embed the text in the PDF differently, but then the positions would be wrong.

#70 gscan2pdf add CR after each word using Tesseract

2025-11-05T14:04:26.216000Z

Hello,
to be clearer I think it would be better to make small test files. So here they are. I wrote a small text with Writer and then exported it to pdf: Test_gscan2pdf.pdf
Then, I converted it to jpg file (with The Gimp): Test_gscan2pdf.jpg
Then, imported into gscan2pdf, did OCR, then exported again to pdf: Pascal06_Test_gscan2pdf.odt_2025-11-05.pdf
Then, open the original pdf (Test_gscan2pdf.pdf ), select first paragraph and past here:

“A Hare one day ridiculed the short feet and slow pace of the Tortoise, who replied, laughing:
“Though you be swift as the wind, I will beat you in a race.”

Finally, open the generated pdf (Pascal06_Test_gscan2pdf.odt_2025-11-05.pdf), select first paragraph and past here:

“A Hare
 one
 day ridiculed
 the short
 feet and slow
 pace
 of the Tortoise,
 who
 replied, laughing:
“Though you be swift
 as
 the wind, I will
 beat
 you
 in
 a
 race.”

For me and IMHO, the two texts should be identical...

#70 gscan2pdf add CR after each word using Tesseract

2025-11-02T11:48:21.278000Z

Hello,
IMHO, I think the problem doesn't come from Okular.
Following my given example, if you look into the gscan2pdf OCR recognition tab:
https://2plz.fr/lutim/gallery#TggvwUqP/DKILVETY.png
Each word are "separated" so that gives a line feed after each word. It is definitely not the same thing as the raw text output

#70 gscan2pdf add CR after each word using Tesseract

2025-11-02T09:26:54.282000Z

OK. I misunderstood. But it is going to be difficult for me to influence how Okular formats text it places into the clipboard.

#70 gscan2pdf add CR after each word using Tesseract

2025-11-01T20:01:19.660000Z

Thanks, but it's not a solution for me. The best would be that when I select all the text in a pdf and then past in an other document, it would keep the same number of line feed...

#70 gscan2pdf add CR after each word using Tesseract

2025-11-01T17:44:32.308000Z

status: open --> closed

#70 gscan2pdf add CR after each word using Tesseract

2025-11-01T17:17:27.599000Z

Hello Jeffrey,
thanks a lot for your reply. No worry :)
Fatality I just scanned a bunch of documents now and I tested to "save as text". Indeed, the job is great compared to the same document in pdf. Very weird...

#70 gscan2pdf add CR after each word using Tesseract

2025-10-21T20:20:06.557000Z

Apologies for the lack of response.

gscan2pdf also offers a "Save as text" option. Does that do a better job?

#70 gscan2pdf add CR after each word using Tesseract

2025-10-21T16:29:03.642000Z

Hello,
nobody here ?