OK, lets talk portability.

Started by mlwabout 24 years ago31 messageshackers

markw@mohawksoft.com

about 24 years ago

As I set out to do the Windows semaphore thing, I notice it can get quite ugly.

In the current CVS directory, there is pgsql/src/backend/port directory.

I propose that this become a separate subproject and library. The reason I want
this is because the semaphore support, specifically multiple semaphores
identified by a single key, has to be implemented with shared memory and
multiple semaphores. (Under Windows)

I also have to look at "ownership" issues with Windows processes and shared
objects. They may need to be own be a more persistent/independent module, a
DLL.

By creating a library like "libpgport.a" (or on Windows pgport.dll/pgport.lib)
little has to be changed for various ports. If we don't have the library then
the discrete object filed would need to be specified. The library will give the
port writer a greater amount of flexibility.

Comments?

Tom Lane

tgl@sss.pgh.pa.us

about 24 years ago

In reply to: mlw (#1)

Re: OK, lets talk portability.

mlw <markw@mohawksoft.com> writes:

In the current CVS directory, there is pgsql/src/backend/port directory.

I propose that this become a separate subproject and library.

Right offhand, that seems a pointless exercise in relabeling code that's
going to be the same either way. What's the actual value?

The reason I want this is because the semaphore support, specifically
multiple semaphores identified by a single key, has to be implemented
with shared memory and multiple semaphores. (Under Windows)

I think you are confusing issues that are now private to the SysV sema
implementation with things that you really need to do for Windows.
Take a look at port/posix_sema.c for a less cluttered view of the
semantics you actually need to support. (I don't suppose there's any
chance that Gates & Co support POSIX semas, leaving you with no work?)

BTW, I have been able to test the named-semas variant of posix_sema.c
on OS X, and it works. I don't have access to any platforms that
support unnamed POSIX semas, which is too bad because that seems much
the preferable variant. Can anyone check it out?

regards, tom lane

PS: there's a trivial little test program in port/ipc_test.c; if you
want a "smoke test" that's simpler than a full Postgres build, try that.

mlw

markw@mohawksoft.com

about 24 years ago

In reply to: mlw (#1)

Re: OK, lets talk portability.

Tom Lane wrote:

BTW, I have been able to test the named-semas variant of posix_sema.c
on OS X, and it works. I don't have access to any platforms that
support unnamed POSIX semas, which is too bad because that seems much
the preferable variant. Can anyone check it out?

I did, and yes I was confused. Sorry. Your posix implementation assumes that
only a single process will have access to the semaphore list for deletion is
this correct? I guess I need to know how much access the child processes need
to have to the internal control structures, none? Some? All?

As I embark on my journey back to the dark side, here are my concerns for a
native Windows PostgreSQL. I think it is a whole lot more than originally
thought.

(The Matrix: Do not try to implement the fork() that would be impossible,
instead only try to realize the truth, there is no fork())

Cygwin does a lot of trickery to implement fork(), but it is not an OS level
thing. Without fork() we would have to have a way to start a postgres backend
and send it information about what it should be doing, and what it should be
doing it with.

With no fork(), information that would normally be copied on fork() is not
copied. Therefore, we need to know what that information is and how to
propagate it to the child process (under windows)

Files, Windows does not have a native open,close,read,write ,lseek support.
Yes, they have some notion of low I/O for compatibility, _open, _close, etc.
but the flags and permissions are not identical. The low file API for Windows
is CreateFile.

Semaphores, and shared memory are easy enough (If have written it in different
forms before), depending on the level of opaqueness to child processes.

The voice in the back of my head, says we need to define what the portability
issues are:

process control (fork()/spawn() etc.)
file operations (read, right, open, close, seek)
IPC constructs (shared memory, semaphores)
System interface (sync() etc)

Any others?

We should either use Apache's APR and augment it with semaphores, or come up
with a doc which defines how these various things are handled. Obviously, it
will grow as we find things that don't work.

Tom Lane

tgl@sss.pgh.pa.us

about 24 years ago

In reply to: mlw (#3)

Re: OK, lets talk portability.

mlw <markw@mohawksoft.com> writes:

I did, and yes I was confused. Sorry. Your posix implementation assumes that
only a single process will have access to the semaphore list for deletion is
this correct? I guess I need to know how much access the child processes need
to have to the internal control structures, none? Some? All?

None. The postmaster creates the semas, and the postmaster deletes 'em.
The children only use them via the PGSemaphore structs they find in
shared memory. (The SysV implementation assumes that too btw.)

With no fork(), information that would normally be copied on fork() is not
copied. Therefore, we need to know what that information is and how to
propagate it to the child process (under windows)

This will be a royal mess. Three or four years ago, when PG actually
did fork/exec to start a backend, we were careful to arrange to pass
everything the backend needed to know as command-line parameters (or
else keep it in shared memory). We have been busily breaking that
separation ever since we went over to fork-no-exec, however. In the
current scheme of things there is no chance whatever of a backend
working unless it inherits the global/static variables of the
postmaster.

And no, I don't want to undo those changes. Especially not if the
only reason for it is to not have to use Cygwin on Windows. Most
of these changes made the startup code substantially simpler,
faster, and more reliable.

regards, tom lane

mlw

markw@mohawksoft.com

about 24 years ago

In reply to: mlw (#1)

Re: OK, lets talk portability.

Tom Lane wrote:

And no, I don't want to undo those changes. Especially not if the
only reason for it is to not have to use Cygwin on Windows. Most
of these changes made the startup code substantially simpler,
faster, and more reliable.

Then I think the notion of a pure Windows version is dead in the water. Writing
a fork()-like API for Windows is, of course, doable as evidenced by cygwin, and
from a general theory seems like a pretty straight forward thing to do (with a
few low level tricks of course) but the details are pretty scary.

Has anyone done a profile of PostgreSQL running on a windows box and identified
cygwin bottlenecks which we could augment with native code?

The Hermit Hacker

scrappy@hub.org

about 24 years ago

In reply to: mlw (#5)

Re: OK, lets talk portability.

On Tue, 7 May 2002, mlw wrote:

Tom Lane wrote:

And no, I don't want to undo those changes. Especially not if the
only reason for it is to not have to use Cygwin on Windows. Most
of these changes made the startup code substantially simpler,
faster, and more reliable.

Then I think the notion of a pure Windows version is dead in the water.
Writing a fork()-like API for Windows is, of course, doable as evidenced
by cygwin, and from a general theory seems like a pretty straight
forward thing to do (with a few low level tricks of course) but the
details are pretty scary.

How is Apache doing this? I believe they do allow the pre-forked model to
work, so how are they getting around those limitations?

mlw

markw@mohawksoft.com

about 24 years ago

In reply to: The Hermit Hacker (#6)

Re: OK, lets talk portability.

"Marc G. Fournier" wrote:

On Tue, 7 May 2002, mlw wrote:

Tom Lane wrote:

And no, I don't want to undo those changes. Especially not if the
only reason for it is to not have to use Cygwin on Windows. Most
of these changes made the startup code substantially simpler,
faster, and more reliable.

Then I think the notion of a pure Windows version is dead in the water.
Writing a fork()-like API for Windows is, of course, doable as evidenced
by cygwin, and from a general theory seems like a pretty straight
forward thing to do (with a few low level tricks of course) but the
details are pretty scary.

How is Apache doing this? I believe they do allow the pre-forked model to
work, so how are they getting around those limitations?

Apache and PostgreSQL are quite different in their requirements of shared
memory. Apache (2.x) simply uses CreateProcess and passes duplicate file
handles.

The Hermit Hacker

scrappy@hub.org

about 24 years ago

In reply to: mlw (#7)

Re: OK, lets talk portability.

On 7 May 2002, Hannu Krosing wrote:

On Tue, 2002-05-07 at 15:31, Tom Lane wrote:

mlw <markw@mohawksoft.com> writes:

In the current CVS directory, there is pgsql/src/backend/port directory.

I propose that this become a separate subproject and library.

Right offhand, that seems a pointless exercise in relabeling code that's
going to be the same either way. What's the actual value?

The reason I want this is because the semaphore support, specifically
multiple semaphores identified by a single key, has to be implemented
with shared memory and multiple semaphores. (Under Windows)

I think you are confusing issues that are now private to the SysV sema
implementation with things that you really need to do for Windows.
Take a look at port/posix_sema.c for a less cluttered view of the
semantics you actually need to support. (I don't suppose there's any
chance that Gates & Co support POSIX semas, leaving you with no work?)

A quick google search came up with

http://sources.redhat.com/pthreads-win32/announcement.html

Unfortunately it seems to be the "wrong kind of free" software:

Pthreads-win32 is free software, distributed under the GNU Lesser
General Public License (LGPL).

Or can we accept dependancies on LGPL libs for some ports.

What someone installs on their Windows box is their problem ... doesn't
mean we can't make use of it :) Its not something that will be part of
the distribution itself, only something that needs to be availble :)

Import Notes

Reply to msg id not found: 1020790840.14750.95.camel@taru.tm.ee | Resolved by subject fallback

Hannu Krosing

hannu@tm.ee

about 24 years ago

In reply to: Tom Lane (#2)

Re: OK, lets talk portability.

On Tue, 2002-05-07 at 15:31, Tom Lane wrote:

mlw <markw@mohawksoft.com> writes:

In the current CVS directory, there is pgsql/src/backend/port directory.

I propose that this become a separate subproject and library.

Right offhand, that seems a pointless exercise in relabeling code that's
going to be the same either way. What's the actual value?

The reason I want this is because the semaphore support, specifically
multiple semaphores identified by a single key, has to be implemented
with shared memory and multiple semaphores. (Under Windows)

I think you are confusing issues that are now private to the SysV sema
implementation with things that you really need to do for Windows.
Take a look at port/posix_sema.c for a less cluttered view of the
semantics you actually need to support. (I don't suppose there's any
chance that Gates & Co support POSIX semas, leaving you with no work?)

A quick google search acme up with

http://sources.redhat.com/pthreads-win32/announcement.html

--------------
Hannu

#10

Hannu Krosing

hannu@tm.ee

about 24 years ago

In reply to: Tom Lane (#2)

Re: OK, lets talk portability.

On Tue, 2002-05-07 at 15:31, Tom Lane wrote:

mlw <markw@mohawksoft.com> writes:

In the current CVS directory, there is pgsql/src/backend/port directory.

I propose that this become a separate subproject and library.

Right offhand, that seems a pointless exercise in relabeling code that's
going to be the same either way. What's the actual value?

The reason I want this is because the semaphore support, specifically
multiple semaphores identified by a single key, has to be implemented
with shared memory and multiple semaphores. (Under Windows)

I think you are confusing issues that are now private to the SysV sema
implementation with things that you really need to do for Windows.
Take a look at port/posix_sema.c for a less cluttered view of the
semantics you actually need to support. (I don't suppose there's any
chance that Gates & Co support POSIX semas, leaving you with no work?)

A quick google search came up with

http://sources.redhat.com/pthreads-win32/announcement.html

Unfortunately it seems to be the "wrong kind of free" software:

Pthreads-win32 is free software, distributed under the GNU Lesser
General Public License (LGPL).

Or can we accept dependancies on LGPL libs for some ports.

--------------
Hannu

#11

Dann Corbit

DCorbit@connx.com

about 24 years ago

In reply to: Hannu Krosing (#10)

Re: OK, lets talk portability.

-----Original Message-----
From: mlw [mailto:markw@mohawksoft.com]
Sent: Tuesday, May 07, 2002 7:03 AM
To: Tom Lane
Cc: Marc G. Fournier; PostgreSQL-development
Subject: Re: [HACKERS] OK, lets talk portability.

Tom Lane wrote:

BTW, I have been able to test the named-semas variant of

posix_sema.c

on OS X, and it works. I don't have access to any platforms that
support unnamed POSIX semas, which is too bad because that

seems much

the preferable variant. Can anyone check it out?

I did, and yes I was confused. Sorry. Your posix
implementation assumes that
only a single process will have access to the semaphore list
for deletion is
this correct? I guess I need to know how much access the
child processes need
to have to the internal control structures, none? Some? All?

As I embark on my journey back to the dark side, here are my
concerns for a
native Windows PostgreSQL. I think it is a whole lot more
than originally
thought.

(The Matrix: Do not try to implement the fork() that would be
impossible,
instead only try to realize the truth, there is no fork())

Cygwin does a lot of trickery to implement fork(), but it is
not an OS level
thing. Without fork() we would have to have a way to start a
postgres backend
and send it information about what it should be doing, and
what it should be
doing it with.

With no fork(), information that would normally be copied on
fork() is not
copied. Therefore, we need to know what that information is and how to
propagate it to the child process (under windows)

Files, Windows does not have a native open,close,read,write
,lseek support.
Yes, they have some notion of low I/O for compatibility,
_open, _close, etc.
but the flags and permissions are not identical. The low file
API for Windows
is CreateFile.

Semaphores, and shared memory are easy enough (If have
written it in different
forms before), depending on the level of opaqueness to child
processes.

The voice in the back of my head, says we need to define what
the portability
issues are:

process control (fork()/spawn() etc.)
file operations (read, right, open, close, seek)
IPC constructs (shared memory, semaphores)
System interface (sync() etc)

Any others?

We should either use Apache's APR and augment it with
semaphores, or come up
with a doc which defines how these various things are
handled. Obviously, it
will grow as we find things that don't work.

A native port to Win32 has already been accomplished by several groups
(including CONNX Solutions Inc., where I work).
There is also one from Japan:
http://hp.vector.co.jp/authors/VA023283/PostgreSQLe.html
I saw some others when I looked around.

Trying to implement fork() is a bad idea, I agree. We used
CreateProcess instead to launch a new server.

My idea of how portability ought to be accomplished is to leverage what
others have done and use that.

I have done a port of Pthreads to NT, which might be useful for a
threaded version of the server, but I think a better approach would be
to use an OS compatibility layer like ACE. ACE might also be useful for
web servers and things of that nature. It may also be possible to
create a server core from ACE that would outperform the PosgreSQL
connection engine. JAWS (for instance) is a freely available web server
that grotesqely outperforms Apache and the other free ones (which makes
me puzzle about why JAWS is not more popular). Anyway, here is the ACE
link:
http://www.cs.wustl.edu/~schmidt/ACE.html

Import Notes

Resolved by subject fallback

#12

The Hermit Hacker

scrappy@hub.org

about 24 years ago

In reply to: Hannu Krosing (#9)

Re: OK, lets talk portability.

On 7 May 2002, Hannu Krosing wrote:

On Tue, 2002-05-07 at 15:31, Tom Lane wrote:

mlw <markw@mohawksoft.com> writes:

In the current CVS directory, there is pgsql/src/backend/port directory.

I propose that this become a separate subproject and library.

Right offhand, that seems a pointless exercise in relabeling code that's
going to be the same either way. What's the actual value?

The reason I want this is because the semaphore support, specifically
multiple semaphores identified by a single key, has to be implemented
with shared memory and multiple semaphores. (Under Windows)

I think you are confusing issues that are now private to the SysV sema
implementation with things that you really need to do for Windows.
Take a look at port/posix_sema.c for a less cluttered view of the
semantics you actually need to support. (I don't suppose there's any
chance that Gates & Co support POSIX semas, leaving you with no work?)

A quick google search acme up with

http://sources.redhat.com/pthreads-win32/announcement.html

Damn ... doesn't implement fork(), but does implement semaphores :)
Sooooo close :)

Semaphores
---------------------------
sem_init
sem_destroy
sem_post
sem_wait
sem_trywait
sem_timedwait
sem_open (returns an error ENOSYS)
sem_close (returns an error ENOSYS)
sem_unlink (returns an error ENOSYS)

#13

mlw

markw@mohawksoft.com

about 24 years ago

In reply to: The Hermit Hacker (#12)

Re: OK, lets talk portability.

"Marc G. Fournier" wrote:

On 7 May 2002, Hannu Krosing wrote:

On Tue, 2002-05-07 at 15:31, Tom Lane wrote:

mlw <markw@mohawksoft.com> writes:

In the current CVS directory, there is pgsql/src/backend/port directory.

I propose that this become a separate subproject and library.

Right offhand, that seems a pointless exercise in relabeling code that's
going to be the same either way. What's the actual value?

The reason I want this is because the semaphore support, specifically
multiple semaphores identified by a single key, has to be implemented
with shared memory and multiple semaphores. (Under Windows)

I think you are confusing issues that are now private to the SysV sema
implementation with things that you really need to do for Windows.
Take a look at port/posix_sema.c for a less cluttered view of the
semantics you actually need to support. (I don't suppose there's any
chance that Gates & Co support POSIX semas, leaving you with no work?)

A quick google search acme up with

http://sources.redhat.com/pthreads-win32/announcement.html

Damn ... doesn't implement fork(), but does implement semaphores :)
Sooooo close :)

Windows has semaphores, and looking at Tom's API, this is the least of our
problems.

If we can come up with a fork() free PostgreSQL, the rest is easy.

We need to come up with a set of macros and function API that handle:

Native file operations.
Process control
Shared memory, semaphores, and other IPC
Sockets (Some Windows specific crap will need to be written, but the berkeley
API is fine)
System interface, sync(), fdatasync() and such.

#14

Dann Corbit

DCorbit@connx.com

about 24 years ago

In reply to: mlw (#13)

Re: OK, lets talk portability.

-----Original Message-----
From: mlw [mailto:markw@mohawksoft.com]
Sent: Tuesday, May 07, 2002 7:44 AM
To: Tom Lane
Cc: Marc G. Fournier; PostgreSQL-development
Subject: Re: [HACKERS] OK, lets talk portability.

Tom Lane wrote:

And no, I don't want to undo those changes. Especially not if the
only reason for it is to not have to use Cygwin on Windows. Most
of these changes made the startup code substantially simpler,
faster, and more reliable.

Then I think the notion of a pure Windows version is dead in
the water. Writing
a fork()-like API for Windows is, of course, doable as
evidenced by cygwin, and
from a general theory seems like a pretty straight forward
thing to do (with a
few low level tricks of course) but the details are pretty scary.

Has anyone done a profile of PostgreSQL running on a windows
box and identified
cygwin bottlenecks which we could augment with native code?

PW32:
http://pw32.sourceforge.net/
has a better fork than Cygwin(), but it is still awful. Much better to
start a new server with CreateProcess() on Win32 [absurdly faster].

The idea of fork() translation is to copy the heap and auto data. But
the heap can split, so it isn't even really guaranteed to work.
Quite frankly, fork() in Win32 is a very, very bad idea. Not just for
efficiency reasons but also for reliability reasons.

Import Notes

Resolved by subject fallback

#15

Dann Corbit

DCorbit@connx.com

about 24 years ago

In reply to: Dann Corbit (#14)

Re: OK, lets talk portability.

-----Original Message-----
From: mlw [mailto:markw@mohawksoft.com]
Sent: Tuesday, May 07, 2002 7:58 AM
To: Marc G. Fournier
Cc: Tom Lane; PostgreSQL-development
Subject: Re: [HACKERS] OK, lets talk portability.

"Marc G. Fournier" wrote:

On Tue, 7 May 2002, mlw wrote:

Tom Lane wrote:

And no, I don't want to undo those changes. Especially

not if the

only reason for it is to not have to use Cygwin on

Windows. Most

of these changes made the startup code substantially simpler,
faster, and more reliable.

Then I think the notion of a pure Windows version is dead

in the water.

Writing a fork()-like API for Windows is, of course,

doable as evidenced

by cygwin, and from a general theory seems like a pretty straight
forward thing to do (with a few low level tricks of

course) but the

details are pretty scary.

How is Apache doing this? I believe they do allow the

pre-forked model to

work, so how are they getting around those limitations?

Apache and PostgreSQL are quite different in their
requirements of shared
memory. Apache (2.x) simply uses CreateProcess and passes
duplicate file
handles.

The way to make CreateProcess() work for PostgreSQL is very simple.

By the time of the fork(), not much has been done. Some needed
calculations can simply be stored into shared memory (which is trivial
to implement). Some other tasks can simply be executed by the cloned
process, exactly as they were executed in the server.

Using fork() on Win32 is pointless, hopless, awful. Don't even think
about it. It's a death warrant.

Import Notes

Resolved by subject fallback

#16

Hannu Krosing

hannu@tm.ee

about 24 years ago

In reply to: mlw (#5)

Re: OK, lets talk portability.

On Tue, 2002-05-07 at 19:44, mlw wrote:

Tom Lane wrote:

And no, I don't want to undo those changes. Especially not if the
only reason for it is to not have to use Cygwin on Windows. Most
of these changes made the startup code substantially simpler,
faster, and more reliable.

Then I think the notion of a pure Windows version is dead in the water. Writing
a fork()-like API for Windows is, of course, doable as evidenced by cygwin, and
from a general theory seems like a pretty straight forward thing to do (with a
few low level tricks of course) but the details are pretty scary.

There is still another way - use threads.

There you have of course the opposite problem - to determine what to
_not_ share, but AFAIK this has been done already at least once.

And there seems to be some consensus that doing things that would
eventually make it easier to use threaded model will probably increase
code quality in general.

---------------
Hannu

#17

Dann Corbit

DCorbit@connx.com

about 24 years ago

In reply to: Hannu Krosing (#16)

Re: OK, lets talk portability.

Translation from UNIX to Win32:
http://www.byte.com/art/9410/sec14/art3.htm

Import Notes

Resolved by subject fallback

#18

Dann Corbit

DCorbit@connx.com

about 24 years ago

In reply to: Dann Corbit (#17)

Re: OK, lets talk portability.

A nice paper on one alternative to consider:
http://www.infy.com/knowledge_capital/thought-papers/porting.pdf

Import Notes

Resolved by subject fallback

#19

mlw

markw@mohawksoft.com

about 24 years ago

In reply to: Dann Corbit (#11)

Re: OK, lets talk portability.

Dann Corbit wrote:

A native port to Win32 has already been accomplished by several groups
(including CONNX Solutions Inc., where I work).
There is also one from Japan:
http://hp.vector.co.jp/authors/VA023283/PostgreSQLe.html
I saw some others when I looked around.

It is the license issues, I think.

Trying to implement fork() is a bad idea, I agree. We used
CreateProcess instead to launch a new server.

That's my big issue, fork() is important to postgres because it assumes all the
child's global and static variables will be a copy of what was in the parent,
as well as system resources being cleaned up.

Think about file handles, they have to be tracked and handled on a process
level. The entire dynamic and static data memory area would have to be copied.
Memory pointers allocated with "malloc()" would also have to be valid in the
child, which means that the heap would have to be copied too. So, in short, the
whole data area.

I think abandoning cygwin will require more work than is justified, we would
just end up rewriting it. So, if we are going to require cygwin or something
like it, then I think we should spend our efforts to profile and optimize the
cygwin version.

I guess it comes down to the reason why we intend to get rid of the requirement
of cygwin on Windows. If it is performance, we may be able to spot optimize the
code under cygwin, and improve performance. If it is a license issue, then that
is not a technical discussion, it is a legal one. Actions we take to remove a
cygwin requirement, in the license case, are probably of limited technical
merit, but of creating code (which probably already exists) for the PostgreSQL
project with a license we can live with.

#20

Igor Kovalenko

Igor.Kovalenko@motorola.com

about 24 years ago

In reply to: mlw (#1)

Re: OK, lets talk portability.

mlw <markw@mohawksoft.com> writes:

In the current CVS directory, there is pgsql/src/backend/port directory.

BTW, I have been able to test the named-semas variant of posix_sema.c
on OS X, and it works. I don't have access to any platforms that
support unnamed POSIX semas, which is too bad because that seems much
the preferable variant. Can anyone check it out?

They are supported on QNX, so I will check it. They are also faster than
named ones.

-- igor

#21

mlw

markw@mohawksoft.com

about 24 years ago

In reply to: Dann Corbit (#15)

#22

Dann Corbit

DCorbit@connx.com

about 24 years ago

In reply to: mlw (#21)