PostgreSQL 8.3 Released

jadavis writes "The release of the long-awaited PostgreSQL version 8.3 has been announced. The new feature list includes HOT, which dramatically improves performance for databases with high update activity; asynchronous commit; built-in full text search; large database features such as synchronized scans and reduced storage overhead; built-in SQL/XML support; spread checkpoints; and many more (too many major new features to list here). See the release notes for full details."
  • by Anonymous Coward on Monday February 04, 2008 @05:10PM (#22297600)
    Would that be POSTGR~1.SQL?
  • I will probably wait for a while before I update but this looks great.
    HOT and the full text search are two features that I could use.
    Postgres is a good reliable database server. I just wish more projects supported it as an equal to MySQL.
    • Re:Nice. (Score:5, Funny)

      by Seumas ( 6865 ) on Monday February 04, 2008 @05:35PM (#22298096)
      8.3 had me at "full-text search".

      Now, please excuse me while Postgres 8.3 and I go take a little alone-time in a dark closet.
    • by ianare ( 1132971 )
      Agreed. For an in-house project tracking system we decided to go with MySQL, not because it is better (we found Postgres superior), but because of the lack of support from various software components.
      • Re: (Score:3, Insightful)

        by tacocat ( 527354 )

        There are two very serious flaws in your thinking about how and why you chose MySQL. In general it's rampant with PHB thinking.

        There is the obvious discussion you didn't have when you decided MySQL was better for your company. Why would you take something as important as your company database and leave it to a vendor to support? You have no in-house knowledge of your database. You have no back-up in the event that vendor gets into a contract dispute with you. And every time you need their suppor

        • Re: (Score:3, Interesting)

          by LWATCDR ( 28044 )
          The real advantage that MySQL has right now is the number of packages that are MySQL centric.
          You need a Wiki? Odds are that it supports MySQL and can be made to work with Postgres. Need a content management system? Odds are that it was written for MySQL and might work with Postgres. So you probably already have MySQL in house, and people tend to go with MySQL even when Postgres is a better solution, if for no other reason than to avoid maintaining two databases.
          The only reason why we use Postgres for several of o
  • by Foofoobar ( 318279 ) on Monday February 04, 2008 @05:11PM (#22297636)
    The one thing that has stopped me from picking up PostgreSQL yet is that I can't do cross-database joins on the same server. Should a user have the same permissions on two separate databases on the same server, a properly constructed query should be able to join across multiple DBs, but as far as I am aware this still isn't implemented.
    • by geniusj ( 140174 ) on Monday February 04, 2008 @05:15PM (#22297704) Homepage
      You should consider using schemas as opposed to entirely separate databases.
      • Use dblink:
        http://www.postgresql.org/docs/faqs.FAQ.html#item4.17 [postgresql.org]

        Not as slick as doing it entirely in SQL, but it works.
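        For illustration, a minimal sketch of what a dblink query can look like (this assumes the dblink contrib module is installed; the database, table, and column names are made up):

        SELECT local_o.id, remote_u.name
        FROM dblink('dbname=otherdb', 'SELECT id, name FROM users')
             AS remote_u(id integer, name text)
        JOIN orders local_o ON local_o.user_id = remote_u.id;

        The column definition list after AS is required because dblink returns generic records.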
    • by dfetter ( 2035 ) <david@fetter.org> on Monday February 04, 2008 @05:31PM (#22298020) Homepage Journal
      There are several ways to do cross-database JOINs in Postgres including dblink [postgresql.org], and even to other DBMSs via dblink-tds [pgfoundry.org] and DBI-Link [pgfoundry.org], but try schemas first, as another poster mentioned.
    • by poet ( 8021 )
      Look at plproxy or use schemas.
    • Are you thinking like you'd do in SQL server (IIRC) or MySQL, where you have a db reference in the table list (i.e., SELECT * from db1.table1, db2.table2 WHERE [join clause])?

      you'd probably just want to use a schema for that; the concept maps more or less the same way.
        A schema is an unnecessary step though. Nothing is added to make that a needed step in this case (except in large-scale implementations where it may offer an easier level of maintenance). It just adds an extra layer of complexity and slows down what could be a far speedier query and development process.

        Not that I'm doing a bunch of cross-database joins, but I like to separate out my databases for future scalability on the network, and schemas answer 70% of the solutions but leave others in the lurch as


        • Ok, so first you say:

          A schema is an unnecessary step though. Nothing is added to make that a needed step in this case (except in large-scale implementations where it may offer an easier level of maintenance).

          Which is fine, as far as it goes, I guess. If you don't need it, then it doesn't serve a purpose for you.

          Not that I'm doing a bunch of cross-database joins, but I like to separate out my databases for future scalability on the network

          See, but whereas before you were complaining about how schemas add an unnece

            Not planning for scaling is a common failure of a bad developer. A good developer plans for scaling. Schemas aren't always a catchall solution. For instance, say I have a user table in one database but 16 other databases have tables with keys that reference that table. If I were to build schemas for EVERY function, I would have a SHITLOAD of schemas to manage. If I were to use ORM, I would have a bunch of bloated queries. Instead I can just build the queries as cross-database joins and just manage the qu
            • Re: (Score:3, Informative)

              by ahodgson ( 74077 )
              A PostgreSQL schema is just a namespace qualifier; it functions just like MySQL's cross-database joins and is conceptually similar. It isn't a full copy of your database DDL.
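              To make that concrete, here is a minimal sketch of schema-qualified joins (the schema and table names are made up for illustration):

              CREATE SCHEMA sales;
              CREATE SCHEMA billing;
              CREATE TABLE sales.orders (id serial PRIMARY KEY, customer text);
              CREATE TABLE billing.invoices (order_id integer REFERENCES sales.orders(id), amount numeric);

              SELECT o.customer, i.amount
              FROM sales.orders o
              JOIN billing.invoices i ON i.order_id = o.id;

              The dotted names read just like MySQL's db.table references, but everything lives in one database and one connection.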
  • Will it be used? (Score:4, Informative)

    by Anonymous Coward on Monday February 04, 2008 @05:13PM (#22297666)
    I'm a postgresql fan, I've considered it a superior database for years.

    However, every client I come into contact with (I am a freelance software designer) seems to believe MySQL is the only open source database available, and certainly the best one for all jobs.

    MySQL is great (or at least, was great) for fast connection times and speed, but for a stable, feature-rich, excellent database, PostgreSQL has always been ideal.

    It's just a shame no one seems to be aware of it.

    • Re: (Score:3, Informative)

      by cheater512 ( 783349 )
      People like myself who design software requiring a database usually prefer speed over features.
      That's why MySQL is usually chosen.
      • by ByteSlicer ( 735276 ) on Monday February 04, 2008 @06:35PM (#22299052)
        That, and people are more likely to remember 'My' than 'Postgre'.
      • Re:Will it be used? (Score:4, Interesting)

        by jadavis ( 473492 ) on Monday February 04, 2008 @07:30PM (#22299872)
        People like myself who design software requiring a database usually prefer speed over features.

        Keep in mind that PostgreSQL may have more stable performance for a varied workload. That may mean fewer surprise slowdowns for you.

        I don't know your specific situation, but you may want to re-evaluate postgresql for your needs, especially if you care about performance -- PostgreSQL made leaps and bounds in this area in 8.3. I'm not sure what the last version you tried was, but 8.2 was a good performance boost as well.

        And if it still doesn't hold up to your expectations, please post your benchmarks to pgsql-performance, so that others can either help you meet the needs, or improve postgresql for the future.

        I would think also, as a developer, you might like the data integrity available in PostgreSQL that can help catch tough bugs early. Also, MySQL has many configurable options that may make your application not work and your customers unhappy (including table type -- changing table types from MyISAM to InnoDB or vice-versa may break applications). PostgreSQL's options (for the most part) don't affect the application.
        • Stability isn't critical for my applications.
          Raw speed is however. A decrease in speed would be rather bad.

          I'm talking about 600 queries per second averaged over the day.
          I've never bothered figuring out what the peak is. I'm scared what it could be. :)

          All I really use is basic SELECT, INSERT and UPDATE.
          Nothing fancy. Just straight basic SQL.

          MySQL fits these requirements perfectly.
          PostgreSQL not so much.
      • Re: (Score:3, Informative)

        by LurkerXXX ( 667952 )
        You people who design software should read up on this thing called data integrity, and enforcement of foreign key constraints.

        MySQL is pretty bad at those, but if you use an InnoDB table and try to use them, you find it's no faster than PostgreSQL. And it's still missing many, many features that PostgreSQL gives you.
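        As a rough illustration of the kind of enforcement being talked about (the table names are hypothetical):

        CREATE TABLE customers (id serial PRIMARY KEY, name text NOT NULL);
        CREATE TABLE orders (
            id          serial PRIMARY KEY,
            customer_id integer NOT NULL REFERENCES customers(id),
            total       numeric CHECK (total >= 0)
        );

        -- Fails immediately instead of silently storing an orphaned row:
        INSERT INTO orders (customer_id, total) VALUES (9999, 10.00);

        PostgreSQL rejects that insert because customer 9999 doesn't exist; on a MyISAM table the REFERENCES clause is parsed but never enforced.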
  • And then... (Score:3, Interesting)

    by n3tcat ( 664243 ) on Monday February 04, 2008 @05:18PM (#22297782)
    someone will make a comment regarding how sad the story of Postgres's popularity is, and how they've seen German folk music with more of a following.
  • by stoolpigeon ( 454276 ) * <bittercode@gmail> on Monday February 04, 2008 @05:27PM (#22297942) Homepage Journal
    this was a new feature for Oracle with 10g R2 also - and as a DBA I can only shake my head and ask "why?" Why would you want to drop the durability part of ACID? Why would you risk losing data for speed? There are so many ways to tune things and speed things up without taking such drastic measures. I know I'd fight tooth & nail before I'd turn this on in anything I managed. I just hate to think that someone with less understanding is going to think of it as a 'go-faster' button and then blame postgres when they lose something important.
    • by Wesley Felter ( 138342 ) <wesley@felter.org> on Monday February 04, 2008 @05:32PM (#22298028) Homepage
      Haven't you heard? In Web 2.0, data integrity doesn't matter.
    • by nuzak ( 959558 ) on Monday February 04, 2008 @05:40PM (#22298186) Journal
      > Why would you want to drop the durability part of ACID?

      SQL already allows you to drop to READ_UNCOMMITTED if you really really want to -- though the DB is actually under no obligation to drop to that level, you're just specifying that you don't care. That removes A, C, and I all at once. Why not make the D part optional too?

      Not all databases are commerce. My company processes several billion rows of data a day, and if we accidentally lose some, it just degrades effectiveness a little bit and means our statistics have to interpolate a smidge. In fact, we deliberately drop a lot of it anyway.
      • by nuzak ( 959558 )
        I'm smoking the discount crack today, what with the underscore in READ UNCOMMITTED and saying it removes Atomicity. Atomicity isn't lost, though running with autocommit on pretty much does the same thing (okay, not technically, but effectively). Seriously, a DB has to make ACID available, and sensibly speaking, the default. It doesn't mean that the user can't override it if they explicitly say that they don't care.
      • by RelliK ( 4466 ) on Monday February 04, 2008 @05:56PM (#22298418)

        SQL already allows you to drop to READ_UNCOMMITTED if you really really want to -- though the DB is actually under no obligation to drop to that level, you're just specifying that you don't care. That removes A, C, and I all at once. Why not make the D part optional too?
        False. The SQL standard explicitly specifies that writing to the database under READ UNCOMMITTED isolation is not allowed. You can only do read-only queries. Further, PostgreSQL doesn't even support READ UNCOMMITTED. There is no need for it. PostgreSQL implements MVCC [wikipedia.org] such that each transaction gets a private snapshot of the database. With that you get READ COMMITTED for free.

        I'm with the original poster here. Asynchronous transactions seem like a bad idea. But then it's not PostgreSQL's responsibility to enforce good software design. And maybe in some corner cases people can find use for them.

    • Re: (Score:3, Insightful)

      by Simon (S2) ( 600188 )
      Because sometimes you don't really care if the data will be there after the commit or not, you just need to do it fast. For example say you have a sensor that counts how many nails go in a package. You have to fill the package with 1000 nails, but it is not really important if there are 999 or 1001 nails in the package, the important thing is that the counter goes fast, say 1000 counts in 1/100 of a second.
      It's not a feature you will use in your web or c/s app, but it has its uses, and it's good to have it
      • Re: (Score:3, Interesting)

        by stoolpigeon ( 454276 ) *
        I have a limited frame of reference - my experience has primarily been in the support of mission critical business processes - where data loss is the end of one's job. And from the replies I guess I can see that circumstances exist where this might be desirable, though part of me wonders whether, in such cases, a database is the right tool.
         
        My other concern still stands - I hope the documentation makes the ramifications of choosing this option clear.
        • by Simon (S2) ( 600188 ) on Monday February 04, 2008 @06:02PM (#22298532) Homepage
          Sure. Like so many times in software development "only you can decide whether data loss is acceptable to you. But there are many classes of applications, including high-speed data ingest programs whose only goal is to get the stream of data into the database, where commit-but-don't-wait is not only acceptable but very desirable performancewise." (Tom Kyte)
        • by LWATCDR ( 28044 )
          Databases can be a good tool for this because of the ease of extracting the data with standard reporting tools. You are correct that many times databases are being used to replace flat files. While it is often not the optimum choice, it is often the right choice. Modern databases are fast enough, and thanks to FOSS cheap enough, that it is just easier to use a database you know than to write your own file handling code. It also means that if you expand the application in the future, the data is already in a database
      • Re: (Score:3, Insightful)

        by Ruzty ( 46204 )
        Such a count does not require a DB write per transaction ( nail++ ). Such minor amounts of data are better left memory resident if they are wiped after a quantifier is reached. DB writes are for the purpose of keeping state. In your example, the only reason to keep state is that, should the machine fail with a box partially filled, the remaining count needed to complete the box must be known. That is better done with a physical weight measurement than a DB query.

        Asynch writes are useful for keeping
    • by Anonymous Coward

      Asynchronous commit is very useful in applications where the thing that's important about the data is its statistical distribution, and not the individual data points per se.

    • In the line of work I'm usually in, data warehousing, we have large data loading jobs that want to load say 100M rows of data.
      For one thing, you usually can't do that in one transaction. To get over that problem you slam commits in between.
      You basically don't care if the commits work immediately or not. What you care about is whether or not the 100M rows of data end up in the target table.

      As such, the question then becomes: when am I going to be certain that the data is written to disk? Obviously not whe
    • Re: (Score:3, Informative)

      by ivoras ( 455934 )
      Are you sure this is such a disaster? As far as I can tell this only means that executing "COMMIT" doesn't block (wait) until the commit has actually happened but returns immediately, and the actual operation is performed "later". The data still goes through the journal (WAL), is still fsynced when needed, etc.
    • Why would you want to drop the durability part of ACID? Why would you risk losing data for speed?

      We run an hourly job to copy legacy FoxPro data to PostgreSQL [honeypot.net]. It's squirreled away in its own schema, and should that schema get totally destroyed, it only takes about 20 minutes to do a full rebuild.

      I would happily trade integrity for speed on that schema, and anything that gives me that option is welcome.

    • by greg1104 ( 461138 ) <gsmith@gregsmith.com> on Monday February 04, 2008 @06:46PM (#22299212) Homepage

      Why would you risk losing data for speed? There are so many ways to tune things and speed things up without taking such drastic measures.


      The new async commit feature bypasses the requirement that records physically hit disk in order to complete a commit. If you must wait for a disk commit (typically enforced by fsync), the maximum number of true commits any one client can do is limited by the rotation speed of the hard drive; typically an average of around 100/second for a standard 7200RPM disk with PostgreSQL. There is no way whatsoever to "tune things and speed things up" here; that's how fast the disk spins, that's how fast you get a physical commit, period.

      In order to accelerate this right now one needs to purchase a disk controller with a good battery-backed cache and pray it always works. If it doesn't, your database might be corrupted. With async commit, you can adjust the rate of true disk commits to something your disks can keep up with (say 50/second) just with this software feature, while still allowing a logical commit rate much higher than that, and at no point is database corruption possible (from this cause anyway). This gives people who want to use PostgreSQL in things like shared hosting environments an option that allows heavy writes even from a single client while keeping a reasonable data integrity policy--only server crashes should ever lose you that brief period since your last true commit. That's a fair trade for some applications (think web message boards for example) and lets PostgreSQL be more competitive against MySQL based solutions in those areas.
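      As a rough sketch of how that looks in 8.3 (the table and values here are made up; the setting can also be changed per transaction):

      -- Trade durability of the last fraction of a second for commit speed.
      SET synchronous_commit TO off;

      BEGIN;
      INSERT INTO page_views (url, viewed_at) VALUES ('/story/123', now());
      COMMIT;  -- returns before the WAL record hits disk; the WAL writer flushes it shortly afterward

      Unlike running with fsync off, a crash can only lose those most recent transactions; it cannot corrupt the database.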
    • Re: (Score:3, Funny)

      by plopez ( 54068 )
      Why would you want to drop the durability part of ACID? Why would you risk losing data for speed?

      So you can write sloppy, inefficient code and do sloppy DBA work and get away with it?

      Just an idea.
  • long live postgres (Score:4, Insightful)

    by squoozer ( 730327 ) on Monday February 04, 2008 @05:37PM (#22298132)
    As far as databases go I don't think it gets much better than Postgres, and I reckon over the years I've tried most popular databases. What I really don't understand though is why Postgres doesn't own more of the database market. Sure it was a bit slower than MySQL a few years ago, but the benefits that you reaped for that slightly slower speed far outweighed the cost. The difference now is, I would say, much smaller and less significant.

    I can only assume that MySQL keeps its large market share because it has commercial backing and therefore good support. I'm sure there are plenty of products that don't require that level of support though.
    • Re: (Score:3, Interesting)

      by costing ( 748101 )

      I can only assume that MySQL keeps its large market share because it has commercial backing and therefore good support.

      No, it's because people are used to LAMP, and tons of easy-to-install apps only have MySQL support. But there is hope; I see more and more PHP apps allowing you to choose PostgreSQL instead. I think this is the turning point: once they reach the critical mass needed to turn the developers' heads it will become THE open source database. And for good reason: it beats MySQL in every way you can imagine, including the obvious features and the not-so-obvious performance. Well, maybe except for two queries on a 10-row table

      • Re: (Score:3, Insightful)

        by Lennie ( 16154 )
        The problem with the whole LAMP acronym is that we have a lot more to choose from. What do you think of LLPP (Linux, lighttpd, Perl, PostgreSQL), or something like that?
        • Re: (Score:3, Funny)

          by David_W ( 35680 )

          What do you think of LLPP (Linux, lighttpd, Perl, PostgreSQL), or something like that?

          My personal favorite is FAPP (FreeBSD, Apache, PostgreSQL, Perl).

          ...

          What's so funny?

    • What I really don't understand though is why Postgres doesn't own more of the database market.

      It's the market leader advantage combined with the network effect. MySQL had it, and Postgres has been playing catch-up since. The obverse case of the "long tail" they talk about.
    • by Ed Avis ( 5917 )

      What I really don't understand though is why Postgres doesn't own more of the database market.
      Worse is better [jwz.org].
    • Nah. I don't use it because I still can't do a cross-database join. That, and MySQL has features like SQL_CALC_FOUND_ROWS and query caching which make it EXTREMELY useful.
    • I have used PostgreSQL as my primary db since 2000 (version 6.5!) and I have watched it for a while.

      PostgreSQL had a number of problems in the past which made it hard to work with including:
      1) No ALTER TABLE DROP COLUMN support and other things needed for prototyping (fixed in 7.2 iirc)
      2) Issues with dependency tracking in text SQL dumps (fixed in 8.0), meaning that sometimes one had to hack dumps to get them to restore properly.
      3) Maintenance tasks required exclusive locks on tables (corrected sometime in
    • Re: (Score:2, Insightful)

      Replication. None of the replication options for Postgres are particularly pleasant, especially when compared to the support that's built into MySQL.
    • by mortonda ( 5175 )

      What I really don't understand though is why Postgres doesn't own more of the database market.

      This may sound silly, but for me the biggest hangup has been getting a database started - the default install IIRC uses unix users to authenticate into their own databases, whereas MySQL has its own internal user database - MySQL I can set up pretty quickly, but with PostgreSQL I always have to hunt for the right config file to set up a user that I can connect with over the network. I know, it would probably get easier if I used it more.

      But that brings me to the second problem - While the documentation fo

      • by greg1104 ( 461138 ) <gsmith@gregsmith.com> on Monday February 04, 2008 @09:23PM (#22301248) Homepage
        This is one of those things that's painful only until you've crossed a certain threshold of information, then it never bothers you again. It's better now but still harder to put the pieces together than it should be.

        Start with the documentation on creating a cluster: http://www.postgresql.org/docs/current/static/creating-cluster.html [postgresql.org] In 8.3 the authentication default you mentioned has been removed, for the reasons you described. So it now runs unsecured for local users by default and you have to handle that yourself, which was deemed an improvement since it reduces the frustration of getting started.

        That page suggests some options you can pass to initdb to adjust the default security level. Now you want to look at the initdb docs: http://www.postgresql.org/docs/current/static/app-initdb.html [postgresql.org] See that if you use -W/--pwprompt (same thing) you can assign a password to the database superuser at cluster creation time. If you do that, you can then change the default authentication scheme to password-based (-A md5 passed to initdb will do that), you'll be secure, and you'll have one user you can log in as (postgres) to create more.

        To see what other authentication methods are available and to learn what your options are look at http://www.postgresql.org/docs/current/static/client-authentication.html [postgresql.org] The one you really need to dive into is pg_hba.conf, which is the magic text file to edit here. A new one of those will be put in the base directory of your new database cluster. Open that file up, look at the documentation, and you'll need to add a line for network support like one of those in the examples. Probably something like

        host postgres all 192.168.12.0/24 md5

        (allow anybody on the 192.168.12 subnet to access the postgres database with a password)

        That should get you past the ugly initial hurdles. The next document you may need is how to add more users: http://www.postgresql.org/docs/current/static/sql-createrole.html [postgresql.org]

        Again, look at the examples first and then backtrack to the parameters; it will make more sense that way. After that you'll want to create more databases with createdb: http://www.postgresql.org/docs/current/static/app-createdb.html [postgresql.org]
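        For example, a minimal sketch from psql while connected as the postgres superuser (the role, password, and database names are just placeholders):

        CREATE ROLE appuser LOGIN PASSWORD 'secret';
        CREATE DATABASE appdb OWNER appuser;

        The createuser and createdb command-line tools are just wrappers around these two statements.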

        And then you should be able to find your way around from there using the psql command line tool.

        Note that once you get past accepting connections over the network, you could use a tool like pgAdmin III to handle the rest of this work using a slicker interface. There's even a copy of it bundled with the Windows installer you can use on such a client to administer a remote server running a real OS. It's of course possible to install pgAdmin manually on other platforms as well; see http://www.pgadmin.org/ [pgadmin.org] for other versions (note that binary packages for platforms like RPM don't show up in every release, you have to go back to v1.8.0 to get the last full set of packages).
        • Re: (Score:3, Interesting)

          "This is one of those things that's painful only until you've crossed a certain threshold of information, then it never bothers you again."

          By then, for the most part, it's too late.

          There are *a lot* of people who got started with databases on their own, or had it dumped on them as just another task. Then they go open source just because it's easier to start with (no provisioning, not a dozen calls to a provider, just an "apt-get install" and you are on your way). Then you look at Postgres; it has a reputation as a serious,
  • by jd ( 1658 ) <imipak@ y a hoo.com> on Monday February 04, 2008 @05:40PM (#22298180) Homepage Journal
    PostgreSQL 8.3 is nicely timed. I've been looking forward to trying it in a setting which wouldn't allow the use of betas. Now I've got the on-topic stuff out of the way, onto my main point.

    There are so many Open Source databases (MySQL, MaxDB the last GPL version, Firebird, PostgreSQL, Ingres Community Edition, hsqldb and H2) that it is hard to know which ones implement what, which ones are useful for what, or which ones are optimal for what. Simple benchmarks (a) rarely are, and (b) usually want to promote one specific product over another. There are standardized tests, but they cost a small fortune and are run by a relatively closed group who probably don't have a fraction of the range of real-world database experience, so they cannot possibly be trusted to define a standard or measure that could be used to compare databases.

    We could really do with some serious side-by-side evaluations of these database engines, or at least decide what such evaluations would need to be to be actually useful. (Hey, maybe CmdrTaco can add a comparison section, get it sponsored by Which? or some other consumer guide, and have some of us run evaluations. It'd have to be done well to not be flamebait, which I think might rule me out, but if it could be done, it would be hellishly useful.)

  • Upgrade Procedure (Score:3, Interesting)

    by AltImage ( 626465 ) on Monday February 04, 2008 @06:53PM (#22299326) Homepage
    "A dump/restore using pg_dump is required for those wishing to migrate data from any previous release"

    Postgres has a habit of making you do this to upgrade and it really sucks. I understand the reasons behind it, but that does not reduce the amount of suck, especially for a large database.
    • by ashridah ( 72567 )
      Most of the admins I know that deal with large PostgreSQL databases use slony-I to do their migrations.

      Of course, if you're tight on hardware/disk resources, you're probably in trouble no matter what, but slony can get the migration/sync downtime down to a minimum of "shutdown, change setting, restart", if done correctly.
      Don't forget that PostgreSQL can easily be run side-by-side with another version of PostgreSQL on the same box, so long as they aren't sharing a data tree (duh) or network ports. This mi
      • Judging from history, it will be two years or more before slony is compatible with PostgreSQL 8.3. And the intrusive schema changes required of slony are unacceptable to most every DBA I've asked about it.
        • by ashridah ( 72567 )
          Intrusive? Eh?

          Last time I set up slony-I myself, it required a _replication schema. That's hardly intrusive, it's a *separate schema*.

          The main difficulty is that it then tends to attach itself via a number of REFERENCES. Of course, since we're not actually discussing long-term usage, but short-term usage for purposes of migration, this is hardly a major problem. You add in slony's replication details, trigger replication, wait until replication has succeeded, promote the slave and shut down the master, test
        • Slony is already compatible with 8.3; the current maintainer (Chris Browne) is very on top of things. There are still some rumblings about problems with the dropping of some implicit casts to text in 8.3, but they have been squashing those as reported for months now and they may already be completely gone. Since anybody with a database big enough to need Slony to handle version upgrades is surely going to test heavily, issues there should easily be caught during that testing.

          Intrusive schema changes
  • Postgres Books? (Score:2, Interesting)

    Does anyone know of a good, semi-recently written book on PostgreSQL? Everything I find is from at least 3 years ago. Is it that PostgreSQL hasn't changed much, barring this release, in the past few years?
  • Just recently, I discovered that Ferret had synchronization problems when I deployed my http://cookingspace.com/ [cookingspace.com] site using nginx and a mongrel cluster - a little nuisance to work around. I did some fast experimenting with indexing and search using MySQL and PostgreSQL, and I made a note to retry PostgreSQL when version 8.3 was released.

    When a deployment platform has inherent weaknesses (like Rails!), it is important to be able to shove off as much processing as possible to more industrial strength tools
  • by G3ckoG33k ( 647276 ) on Tuesday February 05, 2008 @06:32AM (#22304780)
    Hi, I read that "MySQL does not uses several CPUs to execute single query - only multiple connections may benefit from several CPUs.". That was written January 6 2004 by Peter Zaitsev, then a full-time developer at MySQL AB, www.mysql.com. I found the quote at http://lists.mysql.com/benchmarks/45 [mysql.com]

    Does anyone know if PostgreSQL supports a dual or quad CPU when it comes to executing a single query, or if MySQL now supports it?

    The reason I ask is that I have a database with tens of millions of records and even 'simple' queries take a long time. Would it be beneficial to buy an 8-core machine, i.e. dual quad, over a single quad CPU?

    Thanks for any tips or links!
    • Re: (Score:3, Informative)

      by Just Some Guy ( 3352 )

      It would seem not to [postgresql.org]. Yeah, I wish it had that, too. The other posters who keep telling you to get faster IO miss the idea of having extra CPUs handling locking, cache management, etc. so that even single running queries are faster.

  • Just one issue (Score:3, Interesting)

    by bytesex ( 112972 ) on Tuesday February 05, 2008 @10:12AM (#22306008) Homepage
    I'm a great fan of postgres but I ran into an irritating limitation recently; I replicate a database over a large number of very small nodes using slony. I really don't care about the integrity of the slaves - they're read-only to their clients, and should I suspect they're corrupt I just reboot (they live in memory and the OS lives on a 1 GB read-only flash drive). But postgres insists on having a WAL directory (pg_xlog) with chunks of 16MB in it, and that's big if you live in 128MB of ramdisk, and you can't turn it off. I mean, from my reasoning the WAL isn't really used unless you do recovery; the versions of the data are in the db itself (otherwise we wouldn't need vacuum, now would we?). So why can't I just configure postgres to not use WAL? And then if the db is corrupt we just die. No, say the guys on IRC, you just have to recompile it with its hard-coded value of 16MB set to something lower. Yeah right. I'm not interested in hacks - I want a versatile RDBMS.
