Data sets for download

Started by Jayadevan Mover 13 years ago4 messagesgeneral
Jump to latest
#1Jayadevan M
Jayadevan.Maymala@ibsplc.com

Hello all,
Does anyone know of reasonably-sized data dumps (csv or excel or xml..) that can be used for learning/teaching about performance tuning. Say - a set of 6-7 tables, may be two of them with a few million records etc? Total data volume would be in a few GB range. There are tools which generate data, but most of them seem to generate junk data. I came across this one (pretty good) -
http://www.ourairports.com/data/
If there were schedule and bookings tables to go with this, it would have been great.
There is http://www.imdb.com/interfaces also. But the data extraction process not simple.
Anything similar - the typical warehouse/customer/order tables or emp/dept/project ?
Regards,
Jayadevan

DISCLAIMER: "The information in this e-mail and any attachment is intended only for the person to whom it is addressed and may contain confidential and/or privileged material. If you have received this e-mail in error, kindly contact the sender and destroy all copies of the original communication. IBS makes no warranty, express or implied, nor guarantees the accuracy, adequacy or completeness of the information contained in this email or any attachment and is not liable for any errors, defects, omissions, viruses or for resultant loss or damage, if any, direct or indirect."

#2Thomas Kellerer
spam_eater@gmx.net
In reply to: Jayadevan M (#1)
Re: Data sets for download

Jayadevan M, 25.10.2012 05:15:

There are tools which generate data, but most of them seem to
generate junk data.

Have a look a Benerator. It can create quite reasonable test data (e.g. valid addresses, "real" looking names and so on).

It has a bit steep learning curve, but I'm quite happy with the results
http://databene.org/databene-benerator

Another option might be the Dell DVD Store Loadtest:
http://linux.dell.com/dvdstore/

It can generate testdata with a specific scale and it works well with Postgres.

Regards
Thomas

#3Jayadevan M
Jayadevan.Maymala@ibsplc.com
In reply to: Thomas Kellerer (#2)
Re: Data sets for download

Have a look a Benerator. It can create quite reasonable test data (e.g. valid
addresses, "real" looking names and so on).

It has a bit steep learning curve, but I'm quite happy with the results
http://databene.org/databene-benerator

Another option might be the Dell DVD Store Loadtest:
http://linux.dell.com/dvdstore/

It can generate testdata with a specific scale and it works well with Postgres.

Thank you. Will try these.
Regards,
Jayadevan

DISCLAIMER: "The information in this e-mail and any attachment is intended only for the person to whom it is addressed and may contain confidential and/or privileged material. If you have received this e-mail in error, kindly contact the sender and destroy all copies of the original communication. IBS makes no warranty, express or implied, nor guarantees the accuracy, adequacy or completeness of the information contained in this email or any attachment and is not liable for any errors, defects, omissions, viruses or for resultant loss or damage, if any, direct or indirect."

#4Thomas Boussekey
thomas.boussekey@gmail.com
In reply to: Jayadevan M (#3)
Re: Data sets for download

Hi,

I'm using Dell DVD store for training purposes, and I met some problems
with it!
Once they are corrected it works well (except the load test config on my
environment, problem encountered with a RSA fingerprint!)

The following slideshow tracks down the problems:
http://jkshah.blogspot.fr/2012/09/pgopen-2012-dvdstore-benchmark-and.html

Have fun,

-- Thomas BOUSSEKEY

2012/10/25 Jayadevan M <jayadevan.maymala@ibsplc.com>

Show quoted text

Have a look a Benerator. It can create quite reasonable test data (e.g.

valid

addresses, "real" looking names and so on).

It has a bit steep learning curve, but I'm quite happy with the results
http://databene.org/databene-benerator

Another option might be the Dell DVD Store Loadtest:
http://linux.dell.com/dvdstore/

It can generate testdata with a specific scale and it works well with

Postgres.

Thank you. Will try these.
Regards,
Jayadevan

DISCLAIMER: "The information in this e-mail and any attachment is
intended only for the person to whom it is addressed and may contain
confidential and/or privileged material. If you have received this e-mail
in error, kindly contact the sender and destroy all copies of the original
communication. IBS makes no warranty, express or implied, nor guarantees
the accuracy, adequacy or completeness of the information contained in this
email or any attachment and is not liable for any errors, defects,
omissions, viruses or for resultant loss or damage, if any, direct or
indirect."
--
Sent via pgsql-general mailing list (pgsql-general@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-general