Posted on 2007-11-27 07:32:26-08 by gulbrain
marc8_to_utf8 appears to be producing ucs codes, not utf8

I have a MARC8 file that contains, amongst other characters, an E2.

The LoC codetables.xml has the following lines, within a <code> tag:-

<isCombining>true</isCombining> <marc>E2</marc> <ucs>0301</ucs> <utf-8>CC81</utf-8> <name>ACUTE / COMBINING ACUTE ACCENT (Oxia)</name>

marc8_to_utf8 is converting the E2 to a 301 (before writing to file).
    This is found by examining the Perl scalar value, not by examining the file that is produced, so the problem appears to arise before the writing to file.

Did I miss some configuration or setting?

Thanks,
Tim

Direct Responses: Write a response
Perl Weekly newsletter
A free weekly newsletter for people who are busy to read all the blogs. click here to check it out.