Data sets for download
Hello all,
Does anyone know of reasonably-sized data dumps (csv or excel or xml..) that can be used for learning/teaching about performance tuning. Say - a set of 6-7 tables, may be two of them with a few million records etc? Total data volume would be in a few GB range. There are tools which generate data, but most of them seem to generate junk data. I came across this one (pretty good) -
http://www.ourairports.com/data/
If there were schedule and bookings tables to go with this, it would have been great.
There is http://www.imdb.com/interfaces also. But the data extraction process not simple.
Anything similar - the typical warehouse/customer/order tables or emp/dept/project ?
Regards,
Jayadevan
DISCLAIMER: "The information in this e-mail and any attachment is intended only for the person to whom it is addressed and may contain confidential and/or privileged material. If you have received this e-mail in error, kindly contact the sender and destroy all copies of the original communication. IBS makes no warranty, express or implied, nor guarantees the accuracy, adequacy or completeness of the information contained in this email or any attachment and is not liable for any errors, defects, omissions, viruses or for resultant loss or damage, if any, direct or indirect."
Jayadevan M, 25.10.2012 05:15:
There are tools which generate data, but most of them seem to
generate junk data.
Have a look a Benerator. It can create quite reasonable test data (e.g. valid addresses, "real" looking names and so on).
It has a bit steep learning curve, but I'm quite happy with the results
http://databene.org/databene-benerator
Another option might be the Dell DVD Store Loadtest:
http://linux.dell.com/dvdstore/
It can generate testdata with a specific scale and it works well with Postgres.
Regards
Thomas
Have a look a Benerator. It can create quite reasonable test data (e.g. valid
addresses, "real" looking names and so on).It has a bit steep learning curve, but I'm quite happy with the results
http://databene.org/databene-beneratorAnother option might be the Dell DVD Store Loadtest:
http://linux.dell.com/dvdstore/It can generate testdata with a specific scale and it works well with Postgres.
Thank you. Will try these.
Regards,
Jayadevan
DISCLAIMER: "The information in this e-mail and any attachment is intended only for the person to whom it is addressed and may contain confidential and/or privileged material. If you have received this e-mail in error, kindly contact the sender and destroy all copies of the original communication. IBS makes no warranty, express or implied, nor guarantees the accuracy, adequacy or completeness of the information contained in this email or any attachment and is not liable for any errors, defects, omissions, viruses or for resultant loss or damage, if any, direct or indirect."
Hi,
I'm using Dell DVD store for training purposes, and I met some problems
with it!
Once they are corrected it works well (except the load test config on my
environment, problem encountered with a RSA fingerprint!)
The following slideshow tracks down the problems:
http://jkshah.blogspot.fr/2012/09/pgopen-2012-dvdstore-benchmark-and.html
Have fun,
-- Thomas BOUSSEKEY
2012/10/25 Jayadevan M <jayadevan.maymala@ibsplc.com>
Show quoted text
Have a look a Benerator. It can create quite reasonable test data (e.g.
valid
addresses, "real" looking names and so on).
It has a bit steep learning curve, but I'm quite happy with the results
http://databene.org/databene-beneratorAnother option might be the Dell DVD Store Loadtest:
http://linux.dell.com/dvdstore/It can generate testdata with a specific scale and it works well with
Postgres.
Thank you. Will try these.
Regards,
JayadevanDISCLAIMER: "The information in this e-mail and any attachment is
intended only for the person to whom it is addressed and may contain
confidential and/or privileged material. If you have received this e-mail
in error, kindly contact the sender and destroy all copies of the original
communication. IBS makes no warranty, express or implied, nor guarantees
the accuracy, adequacy or completeness of the information contained in this
email or any attachment and is not liable for any errors, defects,
omissions, viruses or for resultant loss or damage, if any, direct or
indirect."
--
Sent via pgsql-general mailing list (pgsql-general@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-general