[Ocaml-pxp-users] pxp and charsets
Anastasia Gornostaeva
ermine at ermine.pp.ru
Tue Aug 19 03:52:20 PDT 2003
Hello.
>> I want to parse RSS from websites. These RSS files can be any charset encodin
>> (not only ascii or latin letters).
>> I want to put them into pxp and receive UTF-8 data at output.
>> How do it quickly and easily?
> Simply select UTF-8 as internal encoding, e.g.
> let config = { default_config with encoding = `Enc_utf8 }
> Then pass this config value to the parsing function. The effect is that
> PXP can represent all characters that are assigned in Unicode.
It seems it does not works. All non-latin letters are replaced to spaces.
It is not interesting :-(
BTW, in netstring recode_string works perfectly right.
So, can anybody help me?
ermine
More information about the Ocaml-pxp-users
mailing list