Reopening #2858015 https://sourceforge.net/tracker/?func=detail&aid=2858015&group_id=200724&atid=974678 with files attached
using version 1.2.6 precompiled for linux-32-bit
Description
font-name change on subsequent glyphs within a token is not (always) encoded in xml.
evince 2.30 based on xpdf 3.x is showing the font-name change correctly.
Sample files:
IIA_p269.pdf
The @font-name on
<token sid="p1_s408" id="user-content-p1_w368" font-name="advgsi" ..="">(Philippe
is not properly detected: it should read: @font-name="advgsrsc" for "Philippe"
like in <token sid="p1_s410" id="user-content-p1_w370" font-name="advgsrsc" ...="">Kapser</token></token>
IIB51_p301
@font-name
<token sid="p1_s626" id="user-content-p1_w614" font-name="advgsi" ..="">(Zedler
should encode ... font-name="advgsrsc" ...>Zedler
as in
<TOKEN sid="p1_s487" id="p1_w475" font-name="advgsrsc" ...</token>
DuMont
The font-name "advgsrsc" stands for smallcaps, "advgsi" for italic and with my samples it seems regularly not correct on
token starting with "(" and "|:".
Anonymous
2 sample pages with xml