Forgot your password?

typodupeerror
Bug Media The Almighty Buck

How Would You Handle a $1,000,000 Coding Error? 878

Posted by timothy
from the wasn't-me dept.
theodp writes "The Chicago Tribune's efforts to upgrade its computer system over the weekend turned into a fiasco when the system crashed, halting all printing operations and leaving about half of the Trib's subscribers without papers. The software contained 'a coding error,' according to a spokesman who estimated the cost to resolve the problem at 'under $1 million.' Any advice for the poor schmuck who's going to get the blame?"
This discussion has been archived. No new comments can be posted.

How Would You Handle a $1,000,000 Coding Error?

Comments Filter:
  • by Anonymous Coward on Monday July 19, 2004 @11:39PM (#9744852)
    As for the QA department...
  • by Fubar420 (701126) on Monday July 19, 2004 @11:41PM (#9744878)
    Well, ok so that might not fly, but hey, it works when its true if you work for a modestly forgiving employer...

    Now if the cause was insufficient testing, well then QA has to answer for it.

    And if there's no QA, well that's managements fault...

    Now if it all comes down to dumb circumstances, it's poor planning on the papers fault for not testing themselves ;-)

    That said, fess up, worse comes to worse, you now have national infamy, and any fame is good fame, right??
  • Testing? (Score:5, Insightful)

    by buff_pilot (221119) on Monday July 19, 2004 @11:42PM (#9744891) Homepage
    Where was the pre-install testing?

    A good test should have identified some errors, especially if it blew up IMMEDIATELY.
  • by gik (256327) on Monday July 19, 2004 @11:45PM (#9744920) Homepage
    People, it's called QA.

    Toss his newspaper subscription and egg his car. Other than that, leave the poor geek alone.

    How many people here have fucked LILO into the ground the night before a java assignment on a laptop with no floppy? anyone?

    yeah. i thought as much.

  • by Anonymous Coward on Monday July 19, 2004 @11:45PM (#9744929)
    Management frequently makes mistakes which cost much more. The difference is that their mistakes are not as easily identified or attributed to a single person.

    The culprit should just admit it. Shit happens, it's unavoidable even if you take all precautions. Don't make the same mistake again, though.
  • by Pulse_Instance (698417) on Monday July 19, 2004 @11:46PM (#9744940)
    In my experience being honest about your mistakes and having the willingness to learn from them always pays off.
  • by David Frankenstein (21337) on Monday July 19, 2004 @11:50PM (#9744977)
    With any large roll out, if only one person is at fault for a fiasco like this, then the project mas mismanaged. They should have had a plan in place to backout the change.
  • Deployment? (Score:5, Insightful)

    by BiggerIsBetter (682164) on Monday July 19, 2004 @11:52PM (#9744999)
    Where was the phased or parallel deployment?

    You don't just change a system like in a weekend. There WILL be problems, so you have to have ways of dealing with it. Maybe that means flicking the switch back to the old system if it fails, or maybe it means running with degraded capacity a while, but whatever it is, it's dead-in-the-water is not your Plan B.
  • UAT/QA anyone? (Score:5, Insightful)

    by bwy (726112) on Monday July 19, 2004 @11:55PM (#9745026)
    One of the benefits of working for a big company is a QA/UAT department. You have an entire department of people lined up just to test your shit. And, usually this type of job makes a person very anal. They log defects for just about everything.

    The person writing the code can unit test to his or her best ability, but it is really the job of someone else to put it through the wringer testing thousands of simulated real-world scenerios. Sure, a coder could do this testing. But a QA guy or gal is doing really well if he makes 3/4 the salary of the guy who wrote the code- so a divison of labor only makes sense.

    Not to mention the person writing the code makes the worst tester in the world. You only test it the way you THOUGHT people would use it. So, while a coder is perhaps the one who created the original problem, the real fault is in whoever let this slip through to production. Assuming, of course, that it wasn't some kind of time-bomb easter egg that would have been impossible to test. Although, good QA testers should alter their system date/time when testing date sensitive routines.
  • planning? (Score:5, Insightful)

    by twitter (104583) on Monday July 19, 2004 @11:56PM (#9745027) Homepage Journal
    A good test should have identified some errors, especially if it blew up IMMEDIATELY.

    Good planning would have had an abort procedure, so the show would go on. Everything changed should be undone if it did not work. They could figure it out after the paper was printed.

    Errors are inevitable. Good planning and implementation keep you from falling on your face even when you publish seven days a week. It's not the coder's fault.

  • by tftp (111690) on Monday July 19, 2004 @11:57PM (#9745041) Homepage
    I don't think any specific programmer will be blamed for that, and I don't think the phrase "coding error" really reflects what happened. It's more likely just a popular explanation like "the computer crashed".

    Noone [in their right mind] orders a brand new paper publishing system from a single consultant. The software probably was priced in several million dollars. Somewhere between the components something broke. For example, the file format that the publisher produced was rev. 2.1, but the software at the presses side was only aware of rev. 1.7 and below... If the coder only tested his code with the "other" piece of latest revision, he would never see any problem; and it is not his guilt that in real life the real customer uses some obsolete stuff that isn't compatible...

    This kind of problem is clearly of administrative nature, of a system design and of checking which pieces work with which other pieces. Clearly, blame should be assigned to non-existent QA procedures, insufficient unit testing and [obviously] inadequate integration of components. The coder is nowhere here, it's all system design and QA stuff, realm of managers.

  • Re:Testing? (Score:5, Insightful)

    by ryen (684684) on Monday July 19, 2004 @11:58PM (#9745058)
    I agree.
    Blame the project manager (hopefully their was one) that led testing the services thoroughly before deployment. Individual coders shouldn't be held to any legal liability.
    Any legal action should be directed towards the'outside provider' (as noted in the article).
  • by AngstAndGuitar (732149) on Tuesday July 20, 2004 @12:13AM (#9745192)
    Everyone makes mistakes, good teams try to find those before deployment.

    Where was the testing?
    Who decided that there would be no testing?
    Who decided that they would simply deploy the thing with no plan B?
    Obviously the PHB, you just need to point that out to the VP and you can have the PHBs seat.
  • In its 158 years, the Tribune failed to publish only at the time the Great Chicago Fire was destroying much of the city.

    So the paper can deliver every day for 158 yrs using mechanical printing presses ~ except where natural disasters occur ....

    The printing problems at the Chicago Tribune were related to efforts to upgrade computer equipment used to produce the newspaper, Malone said. The Tribune acquired customized software for the upgrade from an outside provider, and it contained a "coding error," he said.

    but as soon as computers are involved their printing press has morphed into a computer system. I wonder what provisions to *test* the upgrade before use where made?

    fail to recognise newspaper as computer system?

    it would be easy to blame the developers and company and there should be some recognition of responsibility for technical accuracy. but what about the newspaper. they have made a fundamental mistake in not recognising that printing press + computer = computer and let their newspaper system fail at the mercy of coding mistake.

    It seems while the paper can handle *mechanial* failure (158 yrs, 1 non delivery) it has yet to grasp *software* failure.

  • Where I work... (Score:4, Insightful)

    by bmo (77928) on Tuesday July 20, 2004 @12:17AM (#9745231)
    It doesn't make any difference if it's a broken punch or a whole set of dies cracked down the middle ($4000 for a 6 inch section, over 60 inches you do the math)...

    "If you say 'oops', it's OK."

    Did he say Oops?

    Seriously though...shit happens. That's why you don't bill employees directly for the mistakes they do. Suck it up, learn, and move on.

    --
    BMO
  • by youngec (239360) on Tuesday July 20, 2004 @12:20AM (#9745258)
    The old system was working! But, they didn't follow the rule, and look what happened!
  • Re:McDonald's (Score:2, Insightful)

    by Pieroxy (222434) on Tuesday July 20, 2004 @12:39AM (#9745393) Homepage
    And if you screw up a burger, it only costs the company about $0.17

    We are not living in the same world then. If you screw it up bad enough for someone to get injured or - god forbid - die by it, the figures will probably be 10 million times as big as what you are mentionning.
  • by Cabaal (163264) on Tuesday July 20, 2004 @12:43AM (#9745423)
    In the aerospace industry we deal with very expensive space equipment. As a result there are procedures that must be followed so nothing terrible happens. You can see where this story is going...
    Imagine a satellite, nearing completion, bolted down , and ready for final inspection. Joe Blow forgets to write in the change log that he took the un-bolted the satellite from the base. Workers come in the next day, do some work after checking the logs, and... the satellite tips over. OOPS... a billion dollars well spent. That is 1,000,000,000.
    "Uhm, boss, the good news is we finshed the satellite yesterday and... I don't know how to say this, but our last two years of work... well, I sort of... well, I tipped it over and it's destroyed.... "

    Or how about the contracts guy who forgets one Zero on a contract. Instead of ten million, the contracts reads one million. Of course everyone misses the zero... except the people PAYING. Contracts are signed and oops... "we want to start a new contract, we sort of forgot to add a zero." To which they reply, "Fuck off, you signed it..." and prompty save the company 9 million dollars.
  • by Tumbleweed (3706) * on Tuesday July 20, 2004 @12:44AM (#9745426)
    Mmm, yeah, perl, obviously the best language to debug, especially if it's someone else's code. *shudder*

    I think I'd rather debug someone else's assembly language than someone else's perl.
  • Re:Testing? (Score:2, Insightful)

    by trekkie314 (466907) on Tuesday July 20, 2004 @12:45AM (#9745441)
    It worked fine for laying out the paper, it just garbled it when transmitting to the printers. My question is, if they realized at press time that the electronic transmission wasn't working, why didn't they just use a disk?
  • Testing is Boring (Score:5, Insightful)

    by PingPongBoy (303994) on Tuesday July 20, 2004 @12:48AM (#9745455)
    Software testing is boring boring boring. You have to try things out again and again after each change. Modules that haven't changed gain confidence in the face of changes and might not be tested, but omitting tests can end up being the Achilles heel. There can be an overwhelming desire when a project nears completion to just get things done and over with. After all the hard problems may well be solved and it's all down to seemingly inconsequential details.

    These days programmers have a Sword of Damocles hanging over them. Once they finish a major piece of code they may have a hard time finding new work. The economy has not lived up to forecasts of more jobs. Outsourcing has reduced computer opportunities. Management of many companies do not see new uses for computers. Off-the-shelf programs abound for almost every aspect of computerized work.

    Stress may distract software engineers enough that someone will make a major mistake.
  • Re:UAT/QA anyone? (Score:1, Insightful)

    by Anonymous Coward on Tuesday July 20, 2004 @12:50AM (#9745463)
    Well maybe when they start paying people the SAME or BETTER to test than code, software quality will actually go up rather than remaining at the current embarassing level it's been at the last 20 years.
  • by Anonymous Coward on Tuesday July 20, 2004 @12:52AM (#9745477)
    It is not QA's job to assure that the code is good. QA's job is to identify the quality of the code, and to predict how it will behave when it goes into deployment. The best QA team is one that will predict where the support calls will come from.

    Deploying good code is a bussiness descision, not QA's decision.
  • by Anonymous Coward on Tuesday July 20, 2004 @12:55AM (#9745489)
    Just what we need - more eunuchs programmers.
  • by Bill, Shooter of Bul (629286) on Tuesday July 20, 2004 @12:59AM (#9745512) Journal
    I'm willing to bet there will be an opening for IT manager soon.
  • by raehl (609729) * <raehl311NO@SPAMyahoo.com> on Tuesday July 20, 2004 @01:17AM (#9745591) Homepage
    If I ever was on the specifying end of software (instead of the coding end, where I do these things reflexively anyway), I would demand the following of the team I hired:

    And that is why you're on the coding end instead of the decision making end - you'd have a compact, bug-free, featureless product that hit the market three years too late that nobody could afford to buy anyway.
  • by eric76 (679787) on Tuesday July 20, 2004 @01:19AM (#9745603)
    The coding error, if it existed, was minor.

    The serious error was in switching to a new system with such clearly inadequate testing.
  • I'd fire the CTO. (Score:3, Insightful)

    by solios (53048) on Tuesday July 20, 2004 @01:25AM (#9745640) Homepage
    Or the guy who signed off on rolling the thing out without extremely thorough testing beforehand. The IT / Software end of companies is a lot like animals in this respect- cut the head off and you prevent the body from doing anything grossly stupid.

    Start with the CTO and work your way down. If it's a software problem, why wasn't it discovered sooner? Who was in charge of QA? Who was in charge of making sure QA did their jobs? Who said YES WE CAN DO IT!, lying out their ass?

    The fun thing about capitalism is greed and/or the desire for profit leads to systems like this being built by the lowest bidder. :P You get what you pay for! :|
  • $1mil is nothing (Score:4, Insightful)

    by lkaos (187507) <.sw.yeknomedoc. .ta. .ynohtna.> on Tuesday July 20, 2004 @01:38AM (#9745701) Homepage Journal
    It's never an individual's fault. It's a breakdown in the QA/FVT/review structure. Is it the person who coded it's fault? Is it the team that reviewed the code? Is it the author of the FVT tests? Is it the person in charge of QA?

    What's that you say, this is all the same person? No wonder you had the bug to begin with...
  • Re:Deployment? (Score:5, Insightful)

    by mec (14700) <mec@shout.net> on Tuesday July 20, 2004 @01:48AM (#9745737) Journal
    Where was the phased or parallel deployment?

    Probably in the hands of someone who decided:

    (1) Cost of catastrophe: $1,000,000.
    (2) Chance of catastrophe: 5%
    (3) Cost of setting up parallel system, including hardware, software licenses, system administration: $250,000.

    If (1) times (2) is less than (3), then it's actually better not to spend the money on (3).

    Of course you can argue with the actual numbers in (1) (2) (3). (1) is the Tribune's own estimate. (2) is estimable by looking at the history of past projects, I'm just guessing 5%. And I just pulled (3) out of the air.

    That said, I bet they do have degraded capacity, and that they used it to print half their papers on Monday and all their papers on Tuesday.
  • by cheekyboy (598084) on Tuesday July 20, 2004 @02:14AM (#9745838) Homepage Journal
    If they had a clue, they would grow 10000 acres of canabis which;

    A) grows 10000x faster than trees
    B) makes 10x more pulp per acre
    C) uses 100x less water.
    D) stick it to the govt.

    But would they ever do that? NOOOO coz there are no patents in the process to expoit and oh the trouble of the govt wackos like bush n old guys being so anti-canabis (to protect their buddies profits)

    I guess they wouldnt want 100s of pot heads heading up to the 100000s acres of weed to take a few home, but what is so wrong with that OTH?

  • by Anonymous Coward on Tuesday July 20, 2004 @02:18AM (#9745860)
    I've been, in the past, inside the offices of the Chicago Tribune. From the people I talked to, it seemed the perception within IT in the paper itself is that corporate IT (Tribune Co.) has been somewhat more focused on standardization and cost-cutting at the expense of the reliability of the systems.

    Note that I am not against standarization and cost-cutting per se, but in their situation, there are a lot more tradeoffs than in a traditional manufacturing environment. Putting out a newspaper is difficult as it is (ah, the war stories...), but you would think management would understand that people will notice if the paper doesn't go out in the morning. Advertisers must have flipped!

    The only way it would have been worse would have been to happen on a Thursday. You don't really think all of those hefty Sunday papers are printed Saturday night, do you? :)

  • by weresquirrel (774788) on Tuesday July 20, 2004 @02:29AM (#9745936) Homepage
    Wouldn't it be grand if we had an honest enough industry that you could tell the truth about how previous projects went, consistently? Every time I've screwed up, I've learned something, either about myself as an engineer or about project dynamics, and I really hope I apply it. I'd rather hire and be hired by other people who understand that we are both aiming for consistent success and learning from failure, and who are honest enough to own it. I had to smile at this. Stuff like this happens all the time. But I'm kind of surprised that the bug was incorrectly pri'd and ignored through how many triage meetings? I don't know many test or dev pms who would blow off a complete review of all current bugs regardless of status or assignment...and certainly as responsibility rises, there are more people willing to crawl that list, ad naseum.
  • by veg_all (22581) on Tuesday July 20, 2004 @02:35AM (#9745966)
    See, I always thought that same thing! Then I started working in a large corporation. Now I understand what being brutally honest in a place like, say, Bosnia must have been like.

    I got out. If you're smart and just keep your lips shut, maybe you will too.
  • by veg_all (22581) on Tuesday July 20, 2004 @02:42AM (#9745994)
    If they had a clue, they would grow 10000 acres of canabis which;

    A) grows 10000x faster than trees
    B) makes 10x more pulp per acre
    C) uses 100x less water.
    D) stick it to the govt.


    I think you forgot your "...profit" clause, except here it would say
    D) Use a bunch of arguments of dubious value to misdirect attention from the fact that what you really want is to get stoned
    ...Profit!
  • by Anonymous Coward on Tuesday July 20, 2004 @02:46AM (#9746012)
    I've been, in the past, inside the offices of the Chicago Tribune. From the people I talked to, it seemed the perception within IT in the paper itself is that corporate IT (Tribune Co.) has been somewhat more focused on standardization and cost-cutting at the expense of the reliability of the systems.
    Absolutely -- Corporate IT has been focused on cost-cutting, because their customers (including Chicago Tribune) have forced them to slash their budgets to the bone, in "give backs" throughout the year (Corporate IT is funded by budget line items in each customer's budget).

    The key issue here is that Chicago Tribune has a very limited IT group, really it's just a "systems" group. All of the other traditional IT functions have been outsourced to Corporate.

    That was fine before the Times-Mirror merger, because Chicago Tribune was the top dog, and could bully all the services they wanted out of Corporate IT. But now the LA Times has that spot.

    The good news is, the FUBAR situation on Monday morning was strictly a "systems" issue within Chicago Tribune IT, and not just a "systems" issue, but one where "systems" could direct all the blame to their vendor, which might help them keep their jobs a little longer...

  • Grab all the eMails where someone in management told you to cut a corner, or replied that they didn't want to spend too nuch time designing, or authorized fewer QA hours then should have been done, and print it all out, with headers, and forward it all to another account.

    When they come after you, present it as if it you were trying to do it right, but somebody wouldn't let you.
    If they fire you, sue.

    Unless:
    a) you work for one of the few companies that actually supports a real team atmosphere, or

    b) Everything was done by the book, and you still screwed up.

    When someone in an industrial field is forced to work 16 hour a day, 7 day a week, and has a mistake the company suffers the ramaifications, not the worker(or the workers faimly).
  • by Soko (17987) on Tuesday July 20, 2004 @03:07AM (#9746115) Homepage
    I'm giving up moderation on this story to post this, so listen the fuck up.

    I work in newspapers, and have for the past 7 years. The blame for this fiasco should be pinned directly on the project manager. Not the coders, not the people trying to get the thing running, but the project manager. Right in the middle of his fucking forehead.

    I've torn the guts out of many newpaper networks upgrading or improving them, but never have I ever put anyone in the position of "If the new system doesn't work, we're fucked." I've always made ab-so-fucking-loutely certain there was a fall back position where the paper would hit the press. I actually had this conversation before:

    <Management weenie> What happens if this new server fails?
    <me> I haven't touched the old server. If the new one hiccups one whit, we fire up the old box and produce product.
    <Management weenie> I don't like that - we've spent a million bucks on the new gear. Delays make me look bad.
    <me> Well, if you're willing to man the phones when the advertisers call demanding re-prints of thier ads because of human error somewhere, I have no problem with it.
    <Management weenie> You're an asshole. I could have you fired.
    <me> In this instance, I'm paid to be an asshole. You can't fire me for doing my job.
    <Management weenie> Heh. OK, we'll go with your plan.

    Not planning some way to get the paper on the press is dereliction of duty, and deserves your professional head to be lopped off.

    Is there _no_ professionalism anymore? Fuck, I should be paid more. Morons like that burn me - when you blow up a critical system with no backup, it's not just your livelyhood, but for everyone who depends on that system functioning as needed - it's thier livelyhood as well. Fucking morons.

    Soko
  • by eelke_klein (676038) on Tuesday July 20, 2004 @03:37AM (#9746244)

    This shows why truly redundant systems should be build using a mixture of different hardware and software developed by independend teams. This would reduce the risk of all devices being hit by the same problem.

  • by Flower (31351) on Tuesday July 20, 2004 @04:42AM (#9746459) Homepage
    It isn't that dumb of an idea though definitely desperate. If he can get the pages into the proper format (I think TIFF. Don't quote me on it.) he can feed the files into the CTP (Camera/Creation To Plate) system. Without plates you don't get a paper out. EOF.
  • by discord5 (798235) on Tuesday July 20, 2004 @04:58AM (#9746502)
    In my experience, it probably was not totally the QA department's... or the coder's... fault either. It was probably shitty managers paying too little attention to the need to allocate sufficient time for QA and realistic testing environments.

    Ah, yes, but now you are a step higher on the corporate ladder, and while in conversation with colleages the finger of blame always points up, in conversation with the boss however the finger always points down that ladder. Management is never to blame for bugs.

  • by smellystudent (663516) on Tuesday July 20, 2004 @05:35AM (#9746608)
    Absofuckinglutely.

    I was doing a much smaller upgrade this weekend - rebuilding a single server. Before I did anything, I removed the drive, imaged it, and placed it in a very safe place far away from coffee spills and clumsy feet.

    If anything went wrong during the rebuild and I'd been unable to bring the new system up by Monday morning, I'd simply slip the old drive back in and continue from where we were on Friday afternoon.
  • by Tony Hoyle (11698) <tmh@nodomain.org> on Tuesday July 20, 2004 @05:53AM (#9746647) Homepage
    printf and strlen cannot produce buffer overruns... and anyway how are you going to tell the length of a string without strlen?

    Maybe you meant strcpy and sprintf (use strncpy and snprintf instead!).

    Code safety isn't about the use of individual functions, or even languages (I've managed to DoS a Java app by making it allocate strings forever in a loop... the code was written such that the GC never cleaned up). It's about good practice, often learned through bitter experience (OTOH I can't think of a safe use for gets()...)
  • by greenrd (47933) on Tuesday July 20, 2004 @05:57AM (#9746658) Homepage
    So, are you actually going to rebut those arguments, or are you just going to sneer sneeringly?

  • Backout plan? (Score:3, Insightful)

    by Anonymous Coward on Tuesday July 20, 2004 @06:37AM (#9746802)
    You never begin a software upgrade without backout plans. At multiple checkpoints during the upgrade, backout plans are fully documented and if the upgrade has gone well, you move onto the next checkpoint.
    Of course, none of that matters if you haven't placed a "production-like" loan on the system before attempting to upgrade production. It is about risk management. You can't remove all risks in an upgrade, but you should be able to manage them assuming the risks are all documented and provided to stakeholders. If you've told the stakeholders all the risks clearly and in writing and pointed out that there was no test system, no DR system or no good backout plan that wouldn't impact half the customers **AND** they still decided to go forward, oh well. Their decision.
  • by dekeji (784080) on Tuesday July 20, 2004 @07:25AM (#9747001)
    The problem isn't the coding error. Such errors will inevitably happen. They also happened when we designed mechanical printing presses, which might break down unexpectedly because of built-in design errors. The problem here is poor management.

    In the past, creating single points of failure was hard: you had lots of men working on lots of printing presses. You couldn't do something as stupid as replacing them all in a single night--it just wasn't physically possible. Computers have just given greedy management the freedom to make more serious mistakes in a shorter amount of time. In this case, the mistake was upgrading a whole infrastructure at once and believing, naively, that that would necessarily go smoothly.
  • by tbone1 (309237) on Tuesday July 20, 2004 @07:36AM (#9747052) Homepage
    You insufferable ass -- you just slashdotted Illinois.

    He should suffer! Everyone knows that slashdotting Illinois is the job of the highway department.

  • by Rich0 (548339) on Tuesday July 20, 2004 @07:38AM (#9747061) Homepage
    This is simply the result of blind cost-cutting.

    "We need to reduce spending in non-core areas!" IT usually ends up being defined as non-core (unless you're an IT company).

    Suddenly management questions you if you want to buy so much as a network hub (el-cheapo consumer grade at that - not for infrastructure). You have to justify any expenditure, and so the guys on the bottom just stop asking since it is such a pain.

    I'm sure anybody on that failed project could have identified steps that would have yielded a fallback. They could have built a new server, and then switched it out with the old server and kept the old one ready to go in an emergency for a couple of weeks. But that would require a $2000 server requisition - or maybe $3000 since the corporate standard was picked by some idiot on the vendor's kickback list.

    For the guy on the bottom, they look bad for asking for money, and chances are that the fix would have worked fine with no failsafes at all - the last 15 upgrades probably did. He has to ask for money each time, and will have nothing to show for it.

    On the other hand, every person on that project was probably thinking the same thing. Sure, spending $2k is a good business decision, but upper management wouldn't recognize that, so let's just not ask. We won't point out how much we're saving on server hardware by not having backups - we'll just let our overall expenses speak for themselves and not call attention to our negligence. And then we'll get promoted year after year and if something goes wrong we just all look dumb and nobody understands computers anyway so management will just figure that these costs come up any time you use one.

    And you know what? This approach usually works in the end.

    The real responsible party is the one which made cost-cutting-at-any-cost the corporate line. Oh, sure, the corporate policies usually have exception clauses, but what bottom-rung employee is going to bother running a request 12 links of the chain of command just to spend an extra $1000 on hardware? The opportunity to use it would pass before it ever got approved.

    The problem is the question-everything approach of corporate fiduciary management. Sure, there is waste out there, but it doesn't take many botched migrations to drarf what you save by pinching pennies...
  • Re:You forgot... (Score:3, Insightful)

    by Rich0 (548339) on Tuesday July 20, 2004 @07:47AM (#9747102) Homepage
    Of course, if non-THC hemp were widespread in use, then people could grow the THC-variety in the open without notice...
  • Re:You forgot... (Score:4, Insightful)

    by jovetoo (629494) on Tuesday July 20, 2004 @07:49AM (#9747117) Journal

    My experiences tell me cannabis is a much more desirable drug than alcohol, both from the users and society's point of view.

    Use both drugs with some sense and nothing bad will happen. Overdo alcohol and it will make you loud and often aggressive. Overdo cannabis and you will fall asleep (which can be loud but seldon aggressive). Neither are very suitable for driving. (Although I prefer people who smoked over people who drank: they drive more relaxed.)

    It is when talking addiction that the large difference arises. Alcohol is a hard drug, you get physically addicted, cannabis is not. Alcohol demolishes you while it degrades you. Cannabis use over large timeperiods is claimed to deteriorate memory. (So, don't drink to forget, smoke! ;) If you smoke the cannabis (instead of eating it) you get the same risks as with tabacco use.

    Here in Belgium, cannabis is more or less legal now (we are allowed to carry upto 3.3 grams on the street and use it in private places and such). It is a good thing, because we did that anyway (I live about 40 kilometer from the closest cannabis shop in the Netherlands where I can buy as much as I like legally).

    There were no sudden changes in behaviour. No millions extra addicts, no stepping stones, nothing. The people who are inclined to (ab)use drugs usually do not care about legality.

  • by SamuraiiProgrammer (150280) on Tuesday July 20, 2004 @07:51AM (#9747125)
    You are very right.

    Spending the money to parallel the new system until its clear it works looks expensive until something like this happens -- then it looks cheap.

    I also notice you have a PLAN for what to do if things go wrong. That is also very smart. When things break and everyone panics, it's good to be able to just pick up a procedure worked out in calmer moments and go with it.
  • Re:McDonald's (Score:2, Insightful)

    by Anonymous Coward on Tuesday July 20, 2004 @08:16AM (#9747251)
    In the same way that smoking kills people.

    The difference between dying from obesity, and dying from food poisoning is that the first is your OWN decision
  • Re:You forgot... (Score:3, Insightful)

    by Sepper (524857) on Tuesday July 20, 2004 @09:13AM (#9747711) Journal
    It worked in Belgium but I doubt it would work in North America...

    The biggest problem of drugs is their connection to organized crime. They have little or no moral and are only interested in porfit and making yuo addicted to the thing... If we accepted the fact that people DO take drugs on a daily basis and that we should help them, things would change. But because of the NIMBY problem people just want the problem to 'go away'...

    Someone proposed to open centers (Forgot the English term) where you could come get clean stereilized needles and medical care to take you drug... The argument being that if you don't help those in trouble, they will STILL take the drug with old (infected) needles...

    It got a wave of protestation because poeple didn't want this around... In the mean time infections continu to spread and the problem doesn't solve itself...

    Of course, the best way would be to legalize every drug, to restric possetion to REALLY small doses and to restric the sales to the Gourvenement... If you don't follow thoses rules, you get fined, and not 5 years prison like in the US...

    Of course, i'm just dreaming in technicolor(c) here...
  • Re:Just one (Score:4, Insightful)

    by SillyNickName4me (760022) <dotslash@bartsplace.net> on Tuesday July 20, 2004 @09:58AM (#9748264) Homepage
    > If I were Wes Watkins, I'd devote the rest of my life looking after the well-being of those orphans.

    You know.. being involved in such an accident changes you for life, its not like most people who get involved in this will ever be able to put it aside and forget about it.

    Adding social pressure to that is not going to solve much at all, not for the victims either.

    No matter how terrible the results, accidents happen, and we'll haev to live with that. Yes, we need to deal with the consequences, but an attitude that results in more people paying for the rest of their life as a result from accidents is not going to accomplish that, it is only going to generate more 'guilty' people who are too much stuck in solving their guilt issue and can't contribute to societuy as a whole as a result.
  • by bitrot42 (523887) <bitrot42@hotmail.com> on Tuesday July 20, 2004 @10:03AM (#9748343)
    Perhaps they suffered from the "while we're changing things..." mentality.

    Newspapers are 24/7 operations, usually with a mishmash of various vintages of systems all intricately tied together, so it is *very* difficult to schedule and perform upgrades. It would be quite tempting while setting up new servers to update to the latest server software, which requires new desktop software, etc., etc.

    The problem with doing this, of course, is that if something breaks, you have precious little idea where to look for the problem. It sounds like everything was tested, and the 'folks who know' were long gone by the time trouble began.

    It's not really surprising that problems cropped up - I've been involved in newspaper software/IT before, and that's par for the course with these systems. What does surprise me, however, is their apparent inability to deal with the situation, either by rolling back to a previous system, using a series of workarounds, etc.

    At the papers I've dealt with, the attitude of "the show must go on" extends well into the server room. There are thousands of critical functions that can go awry, not just with the publishing system, but with presses, satellite news feeds, etc., yet somehow the paper ALWAYS goes out. (I guess, even in this case, it did get done to some degree.) The level of determination and cleverness this elicits from people is an amazing sight.

    It sounds like the Trib has lost some of that sense of "whatever it takes," which is a shame.

    So I'd blame it on inadequate investment in staffing and backup/alternate systems (it was standard practice to literally have "two of everything" for exactly this sort of situation), and lack of access to knowledgable support from the vendor (it is CCI *Europe*, after all.)

    I feel for those involved; I really do. It's easy to watch from a distance and say "they should have known! they should have planned ahead!" But the reality is that everyone who runs those systems has their fingers crossed every minute of every day, hoping they're ready when the shit hits the redundant cooling units for the computer room...

  • by PingPongBoy (303994) on Tuesday July 20, 2004 @10:34AM (#9748739)
    It doesn't make sense to be making a request for every item. A policy should be in place.

  • by Ergoat (798735) on Tuesday July 20, 2004 @10:56PM (#9756121)
    One guy alone does not make a mistake like that. I'm a big believer in: If it's possible to make a mistake (particularly one of this scale) then preventive procedures and quality assurance are not up to scratch. This was a team effort, not a 1 man screw-up. If one guy loses his job or suffers as a consequence of this event, then I urge you all to action. Action similar scale to our support of Kevin Mitnick. How would you like to write code and (say) forget a comma, have it pass cleanly through all peer reviews, unit testing, system testing etc etc, only to find that it caused a $1m problem and your career and livelihood was now on the line ? One of the biggest reasons you are constantly subjected to security audits and process audits and finance audits is not because you have been a naughty boy and someone from up on high wants to catch you out, it's to find holes in processes and procedures that might at some point in the future cause a problem (like this one) (I'm tipping that the auditors don't get their arse kicked over this.) If one guy loses his job, you'd all better stand up and make with the noise, otherwise no one will come to your rescue when you need it.

Your computer account is overdrawn. Please reauthorize.

Working...