[Ocaml-pxp-users] pxp and charsets

Anastasia Gornostaeva ermine at ermine.pp.ru
Tue Aug 19 03:52:20 PDT 2003


Hello.

 >> I want to parse RSS from websites. These RSS files can be any charset encodin
 >> (not only ascii or latin letters).                                           
 >> I want to put them into pxp and receive UTF-8 data at output.                
 >> How do it quickly and easily?                                                
 > Simply select UTF-8 as internal encoding, e.g.                                 
 > let config = { default_config with encoding = `Enc_utf8 }                      
                                                                                
 > Then pass this config value to the parsing function. The effect is that        
 > PXP can represent all characters that are assigned in Unicode.                 

It seems it does not works. All non-latin letters are replaced to spaces.
It is not interesting :-(

BTW, in netstring  recode_string works perfectly right.
So, can anybody help me?

ermine



More information about the Ocaml-pxp-users mailing list