Posted on 2008-11-13 05:37:06-08 by wecacuee
To what format is unicode converted by FmtDefault
Hi,
Pardon me, if I am wrong somewhere since I am a newbie.

I was trying to parse a string which looks like 'Kaye R. O'Leary' in Excel File.
It results in:
Cell->{_Value} = "Kaye R. O\x{2019}Leary" (#I know this is unicode)
Cell->{Val} = "\c@K\c@a\c@y\c@e\c@ \c@R\c@.\c@ \c@O\cY\c@L\c@e\c@a\c@r\c@y" (#don't know abt this)

Can anyone please tell me,
what is character encoding Cell-$gt;{Val} is using?
Thanks for your time
Vikas
Direct Responses: 9291 | 9292 | Write a response
Posted on 2008-11-13 08:17:26-08 by jmcnamara in response to 9290
Re: To what format is unicode converted by FmtDefault

what is character encoding Cell-$gt;{Val} is using?

It is UTF-16LE, the Unicode encoding used internally in Excel.

John.
--
Direct Responses: Write a response
Posted on 2008-11-13 08:18:42-08 by wecacuee in response to 9290
Re: To what format is unicode converted by FmtDefault
This question do not concerns Spreadsheet::ParseExcel
Cell->{Val} in not in any kind of encoding.(it may be 'latin1' or any other )
\c is called control character.
Cell->{_Value} is not necessirily a unicode string. It just contains a wide character which is denoted in hexadecimal format.
look perlop for more details
Thanks,
vikas
Direct Responses: 9305 | Write a response
Posted on 2008-11-14 05:12:10-08 by wecacuee in response to 9292
Re: To what format is unicode converted by FmtDefault
according to perlunitut
Text strings, or character strings are made of characters. Bytes are irrelevant here, and so are encodings. Each character is just that: the character. Text strings are also called Unicode strings, because in Perl, every text string is a Unicode string.
Cell->{Val}, Cell->{_Value} both are unicode strings
Thanks
Vikas
Direct Responses: Write a response
Perl Weekly newsletter
A free weekly newsletter for people who are busy to read all the blogs. click here to check it out.