Sanskrit words like au+ya cause trouble because
Unicode wants a-chen, subscribed ya, vowel marker,
whereas EWTS presumably has vowel, +, y. Actually
I'm not really sure about the EWTS. Fixing this seems
like more trouble than it's worth at this point -- it would
hair up the parser considerably.
TMW has a single achen+y character, which finesses
the problem.