Weird PostgreSQL crashes on FC2/FC3 64-bit

Started by William Yuabout 21 years ago4 messagesgeneral
Jump to latest
#1William Yu
wyu@talisys.com

Since I've upgraded my development server to FC2/FC3 64-bit, I
occasionally get these weird PostgreSQL semi-lockups. Once in a while, a
query (usually a big update of some type) will just stop dead. Doing a
ps -ef | grep postgres, I will see something like:

root 17034 1 0 21:41 ? 00:00:00 gdb -q -x /dev/stdin
postgres 9131
postgres 9131 2712 0 20:31 ? 00:00:00 postgres: postgres
netdata [local] VACUUM

Other connections continue to run (unless the dead process causes a
pileup behind it) and I can make new connections to the DB.

The strange thing is I cannot kill the dead process. I can type "kill -9
9131" all day long (yes I know it's bad) but the process will not go
away. Doing kill -9 on all the other postmasters will make them go away
but the bad one will still stay forever. The gdb process is killable though.

At this point, this dead process is still on my machine. Are there any
commands for me to figure out what's going on here?

#2Tom Lane
tgl@sss.pgh.pa.us
In reply to: William Yu (#1)
Re: Weird PostgreSQL crashes on FC2/FC3 64-bit

William Yu <wyu@talisys.com> writes:

Doing a ps -ef | grep postgres, I will see something like:

root 17034 1 0 21:41 ? 00:00:00 gdb -q -x /dev/stdin
postgres 9131
postgres 9131 2712 0 20:31 ? 00:00:00 postgres: postgres
netdata [local] VACUUM

So where did the gdb come from?

The strange thing is I cannot kill the dead process.

It's not dead --- gdb has got it by the throat.

regards, tom lane

#3William Yu
wyu@talisys.com
In reply to: Tom Lane (#2)
Re: Weird PostgreSQL crashes on FC2/FC3 64-bit

Tom Lane wrote:

William Yu <wyu@talisys.com> writes:

Doing a ps -ef | grep postgres, I will see something like:

root 17034 1 0 21:41 ? 00:00:00 gdb -q -x /dev/stdin postgres 9131
postgres 9131 2712 0 20:31 ? 00:00:00 postgres: postgres netdata [local] VACUUM

So where did the gdb come from?

To be honest, I do not know. I see the parent process is 1 so something
in the FCx kernel is triggering it. I just don't know where the logs for
something like this would be.

Show quoted text

The strange thing is I cannot kill the dead process.

It's not dead --- gdb has got it by the throat.

#4Martijn van Oosterhout
kleptog@svana.org
In reply to: William Yu (#3)
Re: Weird PostgreSQL crashes on FC2/FC3 64-bit

On Tue, Feb 01, 2005 at 10:32:15PM -0800, William Yu wrote:

Tom Lane wrote:

William Yu <wyu@talisys.com> writes:

Doing a ps -ef | grep postgres, I will see something like:

root 17034 1 0 21:41 ? 00:00:00 gdb -q -x /dev/stdin
postgres 9131
postgres 9131 2712 0 20:31 ? 00:00:00 postgres: postgres
netdata [local] VACUUM

So where did the gdb come from?

To be honest, I do not know. I see the parent process is 1 so something
in the FCx kernel is triggering it. I just don't know where the logs for
something like this would be.

Parent process 1 just means that it's either spawned by init or it's
real parent process has died. I wouldn't put any stock in it. Try using
"ps aux" to get the process state. Chances are it's in T (trace) state.
You can try to kill -CONT it. Maybe strace it.
--
Martijn van Oosterhout <kleptog@svana.org> http://svana.org/kleptog/

Show quoted text

Patent. n. Genius is 5% inspiration and 95% perspiration. A patent is a
tool for doing 5% of the work and then sitting around waiting for someone
else to do the other 95% so you can sue them.