BUG #2400: '�' considered invalid UTF-8 character

Started by Yusuf Siddiquialmost 20 years ago4 messagesbugs
Jump to latest
#1Yusuf Siddiqui
ysiddiqui@i3.com

The following bug has been logged online:

Bug reference: 2400
Logged by: Yusuf Siddiqui
Email address: ysiddiqui@i3.com
PostgreSQL version: 8.1
Operating system: Linux
Description: 'Æ' considered invalid UTF-8 character
Details:

The character 'Æ' is rejected as an invalid UTF-8 character.

Here are the steps used to recreate it:

create table test (text_field text);
insert into test (text_field) values ('Æ');

Returned error:
invalid UTF-8 byte sequence detected near byte 0x92

This statement also does not work:
insert into test (text_field) values ('\Æ');

#2tomas@tuxteam.de
tomas@tuxteam.de
In reply to: Yusuf Siddiqui (#1)
Re: BUG #2400: 'Ã

On Tue, Apr 18, 2006 at 11:34:53PM +0000, Yusuf Siddiqui wrote:

The following bug has been logged online:

Bug reference: 2400
Logged by: Yusuf Siddiqui
Email address: ysiddiqui@i3.com
PostgreSQL version: 8.1
Operating system: Linux
Description: 'Æ' considered invalid UTF-8 character
Details:

The character 'Æ' is rejected as an invalid UTF-8 character.

Well, maybe it is :-)

Here are the steps used to recreate it:

create table test (text_field text);
insert into test (text_field) values ('Æ');

Returned error:
invalid UTF-8 byte sequence detected near byte 0x92

[...]

I'd need to know more. I gather from your mail that you are entering the
character into psql from a console. Several factors are relevant here:

- which character encoding does your console have?
(if it is, e.g. iso-8859-x then this will be probably the culprit)
- which client encoding is set? (in psql type SHOW CLIENT_ENCODING;)
- which encoding is the server using (I'd guess utf-8; it doesn't need
to be the same as the client's, since it will try to convert).

HTH
-- tomás

#3Peter Eisentraut
peter_e@gmx.net
In reply to: Yusuf Siddiqui (#1)
Re: BUG #2400: 'Ã' considered invalid UTF-8 character

Am Mittwoch, 19. April 2006 01:34 schrieb Yusuf Siddiqui:

The character 'Æ' is rejected as an invalid UTF-8 character.

Please show the output of

$ psql -c 'show client_encoding'
$ locale

--
Peter Eisentraut
http://developer.postgresql.org/~petere/

#4SunWuKung
Balazs.Klein@t-online.hu
In reply to: Peter Eisentraut (#3)
Re: BUG #2400: 'Ã' considered invalid UTF-8 character

Aren't you using EMS Postgresql Manager?
I kept getting this message for any non-standard UTF8 character that I
wanted to insert or import.
I wrote to them and they said EMS PgManager does not yet support
unicode.

Balázs