How to store unicode?

Started by Michael Meskesover 25 years ago2 messages
#1Michael Meskes
meskes@postgresql.org

I've just been asked how to store unicode text in a postgresql database. The
problem as I understand it is that unicode strings may contain binary 0s
which might break string handling. Since I never tried, I think it's better
to ask here before answering that question.

The application uses ecpg so it's not only a backend question.

Michael
--
Michael Meskes
Michael@Fam-Meskes.De
Go SF 49ers! Go Rhein Fire!
Use Debian GNU/Linux! Use PostgreSQL!

#2Noname
t-ishii@sra.co.jp
In reply to: Michael Meskes (#1)
Re: How to store unicode?

I've just been asked how to store unicode text in a postgresql database. The
problem as I understand it is that unicode strings may contain binary 0s
which might break string handling. Since I never tried, I think it's better
to ask here before answering that question.

I guess you are talking about UCS encoding. There is another encoding
for Unicode, called UTF-8. It does not 0s, so you could use it with
PostgreSQL. Actually we have some unicode(utf-8) regression tests in
src/test/mb and they seem working.

The application uses ecpg so it's not only a backend question.

I guess ecpg is ok as long as using UTF-8.
--
Tatsuo Ishii