PostgreSQL downloads compressed with bzip2 instead of gzip ?

Started by Joel Rodriguesabout 23 years ago11 messagesgeneral
Jump to latest
#1Joel Rodrigues
borgempath@Phreaker.net

Hi,

I know that for many people, file size is not much of an issue,
but for many others I suspect it is. Whether that is for reasons
of storage space on smaller drives, like on current laptops, or
older machines. Plus, it can cut down on bandwidth usage. And if
that's not enough reason, then how about simply because there is
a better, more efficient way to compress than Gzip ?

Therefore, I think it would be nice to have PostgreSQL downloads
available compressed with bzip2 instead of gzip. Starting with
version 7.3.3 perhaps ?

Cheers,
Joel

#2Daniel R. Anderson
dan@mathjunkies.com
In reply to: Joel Rodrigues (#1)
Re: PostgreSQL downloads compressed with bzip2 instead of

<snip>

I know that for many people, file size is not much of an issue,
but for many others I suspect it is. Whether that is for reasons
of storage space on smaller drives, like on current laptops, or
older machines.

Point of Information:

Improved compression efficiency does not help people with small drives.
Case in point: imagine sometime in the future a "supertarball" algorithm
is discovered. This wondrous new algorithm compresses terabytes of data
into a single bit. (Don't ask how -- the deranged computer scientist
who discovered it, went crazy doing so. You might be next!)

So Joe User boots up his trusty 386 with a 500MB hard drive, and dials
into AOL with a 9600 baud modem. He grabs postgresql-7.3.3.SUPERTARBALL
off the 'net, and types into his console (since he doesn't have enough
processing power to run a modern version of windows):

$ supertarball -xvf postgresql-7.3.3.SUPERTARBALL

And Joe User runs out of disk space.

Moral of the Story:

A smaller file to download (via better compression) only decreases the
time it takes to download.

:-D

--
Daniel R. Anderson
Great Lakes Industries, Inc.
80 Pineview Ave.
Buffalo, NY 14218
(716) 691-5900 x218

"Never let your schooling interfere with your education"
-- Mark Twain

#3Greg Sabino Mullane
greg@turnstep.com
In reply to: Daniel R. Anderson (#2)
Re: PostgreSQL downloads compressed with bzip2 instead of

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

Moral of the Story:

A smaller file to download (via better compression) only decreases the
time it takes to download.

I think the real argument in favor of providing a bz2 alternative to gz
*is* the time to download. Loading a Postgres tarball over a dialup line
is much faster with bz2 rather than gz. This has been covered on the list
before: I recall that most everyone agreed it was a good idea, but nobody
on the website appeared ready to follow through with it and actually
set up the bz2 files.

- --
Greg Sabino Mullane greg@turnstep.com
PGP Key: 0x14964AC8 200303201036
-----BEGIN PGP SIGNATURE-----
Comment: http://www.turnstep.com/pgp.html

iD8DBQE+eeCfvJuQZxSWSsgRAvy+AJ9wWoOgXezn3p01/D4jed2sNqT6DQCfXkz5
0uvv7khQJ4LAXMYydMnpWdM=
=Lg8Y
-----END PGP SIGNATURE-----

#4Dennis Gearon
gearond@cvc.net
In reply to: Joel Rodrigues (#1)
Re: PostgreSQL downloads compressed with bzip2 instead

How about both, until the tools become more readily availabe or bzip is
built into Winzip. With windows versoin 'around the corner' it will be
imiportant for the ease of installation for the usual windows joe to not
have to hunt around the web for a command line only version of bzip and
not use a gui (current situation for bzip).

Joel Rodrigues wrote:

Show quoted text

Hi,

I know that for many people, file size is not much of an issue, but for
many others I suspect it is. Whether that is for reasons of storage
space on smaller drives, like on current laptops, or older machines.
Plus, it can cut down on bandwidth usage. And if that's not enough
reason, then how about simply because there is a better, more efficient
way to compress than Gzip ?

Therefore, I think it would be nice to have PostgreSQL downloads
available compressed with bzip2 instead of gzip. Starting with version
7.3.3 perhaps ?

Cheers,
Joel

---------------------------(end of broadcast)---------------------------
TIP 5: Have you checked our extensive FAQ?

http://www.postgresql.org/docs/faqs/FAQ.html

#5Daniel R. Anderson
dan@mathjunkies.com
In reply to: Greg Sabino Mullane (#3)
Re: PostgreSQL downloads compressed with bzip2 instead of

I agree that bz2 might very well speed up download over a dialup line.
I was, however, pointing out that one of the arguments as to why
downloads should be available in bz2 format -- to minimize required disk
space -- was specious.

<snip>

I think the real argument in favor of providing a bz2 alternative to gz
*is* the time to download. Loading a Postgres tarball over a dialup line
is much faster with bz2 rather than gz.

<snip>
--
Daniel R. Anderson
Great Lakes Industries, Inc.
80 Pineview Ave.
Buffalo, NY 14218
(716) 691-5900 x218

"Never let your schooling interfere with your education"
-- Mark Twain

#6Joel Rodrigues
joelrodrigues@Phreaker.net
In reply to: Daniel R. Anderson (#5)
Re: PostgreSQL downloads compressed with bzip2 instead of

I keep all downloads as bzip2 compressed archives on my drive in
case (!:-) I ever need to re-install/re-compile. So, you see it
*does* minimize required disk space.

To suggest that an untarred, decompressed bzip2 archive would
need less space than if it had been compressed using any other
algorithm, would be an odd suggestion indeed.

- Joel

On Thursday, March 20, 2003, at 10:17 , Daniel R. Anderson wrote:

Show quoted text

I agree that bz2 might very well speed up download over a dialup line.
I was, however, pointing out that one of the arguments as to why
downloads should be available in bz2 format -- to minimize
required disk
space -- was specious.

<snip>

I think the real argument in favor of providing a bz2
alternative to gz
*is* the time to download. Loading a Postgres tarball over a
dialup line
is much faster with bz2 rather than gz.

<snip>
--

#7Joel Rodrigues
borgempath@Phreaker.net
In reply to: Daniel R. Anderson (#5)
Re: PostgreSQL downloads compressed with bzip2 instead of

I keep all downloads as bzip2 compressed archives on my drive in
case (!:-) I ever need to re-install/re-compile. So, you see it
*does* minimize required disk space.

To suggest that an untarred, decompressed bzip2 archive would
need less space than if it had been compressed using any other
algorithm, would be an odd suggestion indeed.

- Joel

On Thursday, March 20, 2003, at 10:17 , Daniel R. Anderson wrote:

Show quoted text

I agree that bz2 might very well speed up download over a dialup line.
I was, however, pointing out that one of the arguments as to why
downloads should be available in bz2 format -- to minimize
required disk
space -- was specious.

<snip>

I think the real argument in favor of providing a bz2
alternative to gz
*is* the time to download. Loading a Postgres tarball over a
dialup line
is much faster with bz2 rather than gz.

<snip>
--

#8Daniel R. Anderson
dan@mathjunkies.com
In reply to: Joel Rodrigues (#7)
Re: PostgreSQL downloads compressed with bzip2 instead of

<snip>

I keep all downloads as bzip2 compressed archives on my drive in
case (!:-) I ever need to re-install/re-compile. So, you see it
*does* minimize required disk space.

To suggest that an untarred, decompressed bzip2 archive would
need less space than if it had been compressed using any other
algorithm, would be an odd suggestion indeed.

</snip>

I wasn't suggesting that an untarred, decompressed bzip2 archive would
need less space then a compressed one; I was simply pointing out that if
you're downloading it to install you need so much disk space. AND, if I
remember correctly bunzip2 -- like when you gunzip a *.gz -- removes the
extension.[0]I'm not overly familiar with bzip2. I would assume you could use some extension when you call bzip2 to keep a copy of the file in bzip2 format. But, then again you'd still need to rm -Rf the directory created. So you'd have to recompress the tar file /anyways/ --
thus making it a negligible savings.

As far as saving files on your hard disk goes I'm not sure what benefit
you're getting. It's been my experience that saved sources go out of
date long before they're needed[1]Since you may need the sources to uninstall a program, in that context I could understand keeping said code. But, again, in my experience download time of old versions is negligible and most sites have old code available. Then again, I am using a broadband connection. :-D. You're much better off setting up
CVS.[2]Yes, I realize that you save the sources for when you feel the need to reinstall Debian/slackware/whatever distro you happen to have. But if you're reinstalling every week you should stop doing everything while root. ;-P

[0]: I'm not overly familiar with bzip2. I would assume you could use some extension when you call bzip2 to keep a copy of the file in bzip2 format. But, then again you'd still need to rm -Rf the directory created.
some extension when you call bzip2 to keep a copy of the file in bzip2
format. But, then again you'd still need to rm -Rf the directory
created.

[1]: Since you may need the sources to uninstall a program, in that context I could understand keeping said code. But, again, in my experience download time of old versions is negligible and most sites have old code available. Then again, I am using a broadband connection. :-D
context I could understand keeping said code. But, again, in my
experience download time of old versions is negligible and most sites
have old code available. Then again, I am using a broadband
connection. :-D

[2]: Yes, I realize that you save the sources for when you feel the need to reinstall Debian/slackware/whatever distro you happen to have. But if you're reinstalling every week you should stop doing everything while root. ;-P
to reinstall Debian/slackware/whatever distro you happen to have. But
if you're reinstalling every week you should stop doing everything while
root. ;-P

--
Daniel R. Anderson
Chief Lab Rat and Helper Monkey
Great Lakes Industries, Inc.
80 Pineview Ave.
Buffalo, NY 14218
(716) 691-5900 x218

"Never let your schooling interfere with your education"
-- Mark Twain

#9Joel Rodrigues
borgempath@Phreaker.net
In reply to: Daniel R. Anderson (#8)
Re: PostgreSQL downloads compressed with bzip2 instead of

On Monday, March 24, 2003, at 03:12 , Daniel R. Anderson wrote:

AND, if I
remember correctly bunzip2 -- like when you gunzip a *.gz --
removes the
extension.[0] So you'd have to recompress the tar file /anyways/ --
thus making it a negligible savings.

Not necessarily so.

Here are the aliases I've set up (I use Mac OS X) to make things
easier :

This will tar & compress a directory/file :

alias bztar "gnutar --use=bzip2 -cvf"

These will untar & decompress gzip or bzip2 files and leave no
detritus :

alias bzuntar "gnutar --use=bzip2 -xvf"

alias gzuntar "tar -zxvf"

Whether one uses broadband or whatever, it's nice to be able to
conserve bandwidth, whether that is to save yourself money or
simply reduce the load on servers. That would help other users
and reduce the costs for all those nice open source people.

Cheers,
Joel

#10List Subscriptions
listsub@mwwm.net
In reply to: Daniel R. Anderson (#8)
Re: PostgreSQL downloads compressed with bzip2 instead of

FWIW, the current version of tar uses
-z for gzip compression
-j for bzip2 compression

There is no need to specifically un{b}zip tars as the effective processes
are

tar -xjf xyz --becomes--> bunzip2 < xyz | tar -xf -
tar -xjf xyz ... --becomes--> tar -cf - ... | bzip2 > xyz

The primary advantage to providing the archive in bzip2 (instead of gzip)
format is the reduction of download time; saving storage space is a much
less significant issue.

Whether or not you save space with the archive is a separate issue, IMO
fairly insignificant given the difficulty in finding a new drive UNDER
40GB.

(Said by someone who just installed a 7.3.2 demo on a 16MB P-90 laptop
running RH70)

On 23 Mar 2003, Daniel R. Anderson wrote:

Show quoted text

<snip>

I wasn't suggesting that an untarred, decompressed bzip2 archive would
need less space then a compressed one; I was simply pointing out that if
you're downloading it to install you need so much disk space. AND, if I
remember correctly bunzip2 -- like when you gunzip a *.gz -- removes the
extension.[0] So you'd have to recompress the tar file /anyways/ --
thus making it a negligible savings.

#11Joel Rodrigues
borgempath@Phreaker.net
In reply to: List Subscriptions (#10)
Re: PostgreSQL downloads compressed with bzip2 instead of

On Monday, March 24, 2003, at 09:08 , List Subscriptions wrote:

The primary advantage to providing the archive in bzip2
(instead of gzip)
format is the reduction of download time; saving storage space
is a much
less significant issue.

Whether or not you save space with the archive is a separate issue, IMO
fairly insignificant given the difficulty in finding a new drive UNDER
40GB.

(Said by someone who just installed a 7.3.2 demo on a 16MB P-90 laptop
running RH70)

Mac OS X v10.1.5 on G3 233MHz desktop, 160MB RAM, 2MB VRAM,
4(four)GB hard drive.

Postgresql 7.2.2, awaiting 7.3.3

Still up to 6 months (or more) away from upgrading to a 15.x" G4
PowerBook.

C'est la vie.

- Joel