Re: the map file between gb18030 and utf8 is error

Started by Tatsuo Ishiiabout 23 years ago1 messages
#1Tatsuo Ishii
t-ishii@sra.co.jp
1 attachment(s)

hello,
recently i downloaded postgresql-7.3b5,i found the conversion between
gb18030 and utf was mistake because the map file bwteen them wasn't
obviously right.the first byte of utf8 encoding with two bytes shoule
between 0xc0 with 0xfd,the map file didn't accord with this condition
.please check it ,i wished that postgresql-7.3 can support the GB18030 and
can run in China.
best regards
jenny wang

Thanks for testing GB18030 support. Yes, the map file is completely
broken. I have attached fixed map generation perl script. Please test
it (I do not understand Chinese).

(1) save the perl script in the postgresql-7.3b5 source tree as:
src/backend/utils/mb/Unicode/UCS_to_GB18030.pl.

(2) run it.

cd src/backend/utils/mb/Unicode
./UCS_to_GB18030.pl

(3) the script will generate
src/backend/utils/mb/Unicode/utf8_to_gb18030.map
and
src/backend/utils/mb/Unicode/gb18030_to_utf8.map

(4) If they look good, rebuild PostgreSQL and test it.
--
Tatsuo Ishii

Attachments:

UCS_to_GB18030.pltext/plain; charset=us-asciiDownload