Why Phones Lost

Tuesday June 16, 2009

This morning I was reading Antonio Rodriguez's "Path Dependence And Smartphones" post, where he muses about different perspectives on the "smartphone" market; in particular the European / American divide outlined by Tomi Ahonen in "A Tale Of Two Smartphones". Antonio tends to think a few years ahead of his time, so I'm always interested in his take on trends like this. It seems that he's very cautiously optimistic that the "user customized mobile computer" thing is an important trend, but he also notes that Mr. Ahonen believes that the "[smartphone] operating system and any applications had ZERO bearing on the decision [of which phone to buy]. Not for mass market consumers"; and maybe we're looking at this the wrong way.

As I started thinking about my own reactions to this, I realized: I've heard this tune before. Remember when pundits used to talk about "convergence" between television and computers? Since the advent of the computer, futurists have been predicting the dawn of a strange new device: part computer, part television, part telephone, part vacuum cleaner. What would it look like?

Well, a few months ago, I feel like Paul Graham answered that question pretty definitively. For years, we've wondered what you would get if you mixed computers and televisions. In Mr. Graham's words: "We now know the answer: computers."

As a child of the digital age — I've been using computers with keyboards, mice, color displays, and networking almost as long as I've been able to read — I always found this conclusion somewhat obvious. A few of the early computers that I had the opportunity to use, an Atari 800 and an Amiga 1000, both used televisions as monitors, so I have always thought of a television as an output device — you could plug it into a VCR, a computer, or a cable box, but fundamentally it was just a bag of pixels.

I remember the exact moment that it dawned on me that computers were going to take over from TV: I was 14 years old, playing Myst for the first time, and monkeying with the configuration of system extensions that were loaded on my computer in order to squeeze the last few ounces of performance so that the video clips in the game would play smoothly. I remember thinking, "This is just a problem with RAM and CPU. In a few years computers will have so much of both that you'll be able to play full screen video without even turning off any extensions."

I, uh, had a pretty limited idea of how optimization worked at the time (the video was still jerky even after I turned off all my extensions), but I am frequently reminded of this insight when I am watching YouTube movies on my LCD "television". That television, by the way, is just a monitor for a computer that runs Ubuntu so I can watch Hulu and YouTube. I think maybe I have cable bundled with my internet service, because it's cheaper that way, but I've never plugged it in to anything.

I didn't realize how powerful articulating this particular idea is until recently though, because I didn't realize just how much money is spent protecting obsolete infrastructure from the relentless onslaught of microprocessor technology. Phone companies (which, increasingly, are combination cable/phone/internet companies) are stuck between a rock and a hard place. As Internet service providers, they are facilitators of the transition, and make a huge amount of money selling network services to people to make their computers more useful. But, as cable companies, they want people to think that television is some special, extra expensive thing that needs to be delivered over a different cable. As phone companies, both wired and wireless, they want people to think that voice and SMS data are special, extra expensive things that need to be delivered via special, magical wireless signals that can't be reduced to the simple and banal "internet". At the same time, especially as wired phone companies, they want the cost savings that come from doing all of their networking as plain old IP, with no actual pesky phone circuits to worry about. Except they still want to sell you the service as if the phone were a different thing from your "internet" connection. (Whenever I see an ad for Comcast Digital Voice, I can't help but think, "Do you think that's air you're breathing?".)

There's still a lot of speculation in each of these industries that some new, hybridized technology is going to create a special and unique relationship with the consumer. But that's one thing Mr. Ahonen got right: the consumer doesn't care about your "operating system". They don't care about your "applications". They just care what they can do with their technology, and they care how much it costs to do so. The thing is, computers do more, and cost less, than any other specialized, dedicated technology. If your industry is fighting computers in the hopes of holding on to some residual value, you are going to lose. Here's a simple formula:

Computer + X = Computer

Consider a few specific examples: the convergence of computers with television has resulted in three general categories of technology: YouTube (and other Flash video sites, such as Hulu), TiVo (and other DVRs), and digital cable boxes with on-demand technology. YouTube is a program you run on a computer to watch videos. A TiVo is a computer (running Linux) that is running a program to let you watch and record videos. And those cable boxes are computers (running some crappy cut-down embedded OS) that let you watch videos on the cable company's terms. Whether or not your customers care about choice, all these things are computers because it's fundamentally cheaper and easier for the vendors to produce these things out of commodity PC components rather than specialized "media" electronics.

But Mr. Graham neatly outlined that trend already, so let's move on to other industries. What happens when you add a computer to an accounting ledger? You get a computer program (like BusinessMind, or QuickBooks) which lets you do accounting on your computer. Computers and books? The Kindle, which is a hand-held computer that lets you read books. If you look a bit deeper, you'll find that the Kindle is actually a computer program¹ that can run places other than its dedicated device. ~~Only crafty marketing folks prevent it from being more widely accessible; say, on your desktop or "television".~~ Update, December 2019: the "kindle" actually runs on all those places now, or [anywhere you can access a web browser](https://read.amazon.com/).

Let's get to the point of this whole schpiel: phones. Phones are already computers, pure and simple. They are just small computers with microphones and speakers, and soon, cameras and screens. You can look at the exciting developments in the world of phones and see that this is so. What are the hottest phones of the last few years? The iPhone, which is a small Macintosh computer, and the G1, which is a small Linux PC. Microsoft would have you believe that their small Windows PCs are equally relevant, even if they are clearly an also-ran in this category. (Disclosure: I actually have a Windows Mobile phone, and I'm fairly happy with it, but I'll be glad when I can finally ditch it for Android.) None of these "phones" does anything interesting in the area of phone-ness. They don't have particularly awesome voice quality or particularly awesome reception or even particularly awesome voicemail, although the iPhone certainly raised the bar. They're just better computers than the previous generation of "phones"; computers that can run a wider variety of programs.

However, phones are still computers with weird restrictions, restrictions that are purely a function of the "path dependence" that Antonio mentions, which dragged them out of the muck and the mire of the telecom industry. SMS is my favorite example of this: 10¢ to send a 140 character message. How much does a tweet cost on twitter? How much does an instant message cost on AIM, or Google Talk, or any IRC network you please? If you were billed at SMS rates to read this post, it would have cost you $10; the cost of a decent paperback. I know I'm wordy, but I'm not that wordy. If you were charged at SMS rates for a day's worth of casual web browsing, images and all, you'd probably have to take out a mortgage just to pay for it. Phone companies have been able to sustain the myth that SMS data is somehow special and deserves to be treated as sacred and precious, fully 1000 times more expensive than the regular bytes you get off the internet, even at the obscene prices they charge for usage-based data plans.
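To make that arithmetic concrete, here is a rough sketch. The price and message size are the ones quoted above; the 14,000-character post length is an assumption (it is simply what the $10 figure implies), and the per-megabyte number assumes one byte per character.

```python
import math

SMS_PRICE_CENTS = 10      # from the text: 10 cents per message
SMS_PAYLOAD_CHARS = 140   # from the text: 140 characters per message


def sms_cost_cents(characters):
    """Cost, in cents, of sending `characters` as full-price messages."""
    messages = math.ceil(characters / SMS_PAYLOAD_CHARS)
    return messages * SMS_PRICE_CENTS


# Assumption: a post of about 14,000 characters, which is what $10 implies.
post_cost = sms_cost_cents(14_000)

# One mebibyte of single-byte characters billed at the same rate.
megabyte_cost = sms_cost_cents(1024 * 1024)

print("post: $%.2f" % (post_cost / 100))
print("1 MiB: $%.2f" % (megabyte_cost / 100))
```

Under those assumptions a single megabyte of "text message" data comes out to several hundred dollars.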

SMS is particularly egregious, but voice isn't that much different. Phone companies charge such ridiculous rates for "voice" data that Skype built an entire profitable business around giving people the same service for free, and only making money by piggybacking on the phone companies' greed and charging you when sending voice messages over phone networks rather than the internet. I can't imagine casting the wasteful overhead of legacy phone networks in any sharper relief.

So, we're not there yet, but the market pressure is tremendous to treat data as data, regardless of whether it's voice, or SMS, or IM, or "internet" (in other words: everything else, including voice and SMS and IM messages which are sent via different mechanisms). Until the advent of the recent crop of smartphones, it was difficult and expensive to get an unlimited data plan. Now, unlimited data plans are the norm, except for "tethering" - using your phone as a proxy for your laptop. The phone companies are still desperate to convince you that you should pay $60 per month for the privilege of having a USB dongle that you can plug into your laptop rather than just using the mobile IP endpoint (which, by the way, probably already has a USB port) that's already in your pocket.

The "mass market" user might not care about operating systems or APIs, but they do understand that a bill with seventeen different break-out metered sections is a bald-faced attempt to rip them off, and a flat-rate or easy to understand pay-as-you-go plan with one number on it is better.

To the extent that phones are not yet interchangeable, unrestricted mobile IP endpoints, it is due to the high barrier to entry to telecom providers, lack of regulation of misleading pricing schemes, and the symbiotic relationship between government and the telecom industry. However, if one wireless carrier moves to provide simpler billing with more features, the others are forced to follow suit - even more so than cable companies and land-line providers, who can hold their customers hostage via development deals with local governments. So, this progression is happening, albeit slowly. For example, when AT&T introduced its iPhone plans, many of the other metered PDA and Blackberry plans, both on AT&T and other providers, began receding from their marketing materials.

Fifteen years ago ... ugh, I feel old. Let's say ... ten years ago, my computer was barely powerful enough to dedicate all of its processing power to playing one low-resolution movie that took up maybe half the screen. I was still paying for internet over a phone line with a cap on the number of hours I could use it. Today, I have a real-time two-way video connection to anywhere in the world, 24-7, for a single flat rate. I own a device that fits in the palm of my hand which contains days' worth of continuous music, a library of dozens of books, and connects to the internet.

So, back to that "mass market consumer". Maybe they don't care about my Python console or IRC chat or SSH access applications, but most "mass market" people do listen to music and read books. And they're going to care about those features being on their phones, and remaining cheap enough that they can use those features without worrying that they'll go broke if they feel like changing out their playlist. Also - nobody is really a "mass market" consumer, anyway. You might not be technical, but maybe you're a golfer, or a swimmer, or a finance nerd. You want to be able to check the weather on your mobile, or update your latest personal best lap time, or get updates when stocks hit certain price thresholds. Nobody cares what APIs these apps use, or even whether you call them "apps", but everybody has one extra thing they'd like their mobile to do.

The increasingly ubiquitous, user-customizable, network connected, commodity pocket computer is exactly the technology that is going to deliver that. It's going to have to become commoditized, which means it's going to be standardized and secured, which in turn means it's not going to be locked up in carrier notions of what's a "text message" and what's a "voice call": notions that exist only to allow for precise price segregation of every different type of data.

In the future, almost every device will be a computer, albeit with specialized peripherals to assist with performing tasks. If we're lucky, they will be networked together in standard ways to allow us to control all of them in a consistent and convenient way.

This progression towards computers is good for all of us. Trust the computer. The computer is your friend.

Paid link. See disclosures. ↩

Who Wants To Know?

Thursday June 04, 2009

Alternate Titles:

I Have No Log File, Yet I Must Scream

or

So You Logged It, Now What?

or

This is why I don't want to have LOG_DEBUG in Twisted

Sometimes, when writing a program, you feel compelled to make the program emit some output which is peripheral to its operation. The question is - who wants to know about that information?

Maybe you're debugging the program, and you insert a simple 'print' statement to get some information about it. Maybe your program is a network server, and you are recording the fact that a message was received and processed. Maybe you're maintaining an old library routine, and you want it to emit a message that points to a newer, better version of that routine which is now preferred. Finally, regardless of what kind of program you're writing, maybe it has produced an error that a user or administrator will need to deal with, and you would like to show it to them.

This activity is referred to in several different contexts depending on how the messages are delivered, but it is most commonly known as "logging". It is critical to the operation of many, many different kinds of programs. Unfortunately, it is one of the most poorly-understood and poorly-implemented areas of software in general. Software is a veritable cornucopia of poorly-understood and poorly-implemented ideas, so that's really saying something. You can see some of the more hilarious and visible examples of developers getting this wrong in the "Pop-Up Potpourri" series on the Daily WTF.

It might seem odd that I lump together funny dialog boxes with "logging". A dialog box is a little square on your screen; a log message is some text in a file somewhere. But they are very much the same thing, and they fail in very much the same way. Log files just do it less visibly.

The point that I hope to communicate here is that for every producer of information, there is a consumer. When most programmers need to produce a "log message", however, they are thinking only of getting the information out of their program in some format, any format; not how that information is going to be used later.

When I say "most programmers", I most definitely include myself. I'm probably guiltier than most. One of the reasons I'm writing about this in the first place is to work out some better approaches to the problem.

Consider this output from the "tomboy" desktop sticky-notes program on Ubuntu Hardy. If I start it from the command-line, I see this:

[DEBUG]: NoteManager created with note path "/home/glyph/.tomboy".
[INFO]: Initializing Mono.Addins
[DEBUG]: AddinManager.OnAddinLoaded: Tomboy.Tomboy
[DEBUG]:            Name: Tomboy.Tomboy,0.10
[DEBUG]:     Description:
[DEBUG]:     Namespace: Tomboy
[DEBUG]:         Enabled: True
[DEBUG]:            File: /usr/lib/tomboy/Tomboy.exe
[DEBUG]: Updating note XML to newest format...

It goes on for several hundred more lines just at startup, and continues to produce messages as the program runs. These messages are diligently classified into categories: DEBUG and INFO. I'm sure they're useful to someone. But why am I seeing them? I just wanted to start a program to put some sticky notes on my desktop, and none of this information is useful to that task.

I have to imagine that pretty much all of these messages are useful only to Tomboy's developers. But, worse than the fact that I see them is the fact that if something really interesting happened — I discovered a critical bug, let's say — all of that log output which is being splatted onto my screen is going nowhere. It is a book written on water. (Well, a book written on video memory, which is pretty much the same thing.) Meanwhile, thanks to the bug-reporting facilities in Ubuntu, I'm sure that I could opt to give the Tomboy developers a huge ton of mostly useless information, like the contents of my registers at the time that it crashed.

Consider not just the placement of the messages (on my screen, where I certainly don't care about them) but their formatting. Who is that elaborate right-justification of labels in the "DEBUG" output for, anyway? It isn't for me; I don't want to see these messages in the first place. I doubt it helps the developers, either; rather than just grepping for '[DEBUG]: File', now they need to put in a regular expression to collapse whitespace, or count the number of spaces that the justification happens to put in. Presumably if this output is useful at all, it is useful in a search.
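A quick illustration of the searching problem, using three of the lines quoted above. The pattern is just one way a developer might work around the padding; it isn't anything Tomboy provides.

```python
import re

# Three lines from the Tomboy output quoted above.
LOG_LINES = [
    '[DEBUG]:            Name: Tomboy.Tomboy,0.10',
    '[DEBUG]:     Namespace: Tomboy',
    '[DEBUG]:            File: /usr/lib/tomboy/Tomboy.exe',
]

# The obvious fixed-string search finds nothing, because of the padding.
naive_hits = [line for line in LOG_LINES if '[DEBUG]: File' in line]

# Tolerating arbitrary whitespace after the level tag finds the line.
pattern = re.compile(r'^\[DEBUG\]:\s+File:\s*(?P<path>.*)$')
regex_hits = [m.group('path') for m in map(pattern.match, LOG_LINES) if m]

print(naive_hits)
print(regex_hits)
```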

Text Formatting and the Inevitable Descent into Log-Level Hell

The right-aligned pretty-printing is a beautiful illustration of a very common anti-pattern in logging: trying to convey structured information by messing with a textual format. A developer wanting to write a message indicating that there is a problem with the program, left with the extremely narrow confines of a logging API which just takes a string, will often do something like this:

log("*****")
log("THIS SHOULD NEVER HAPPEN! HELP!!!")
log("*****")

Of course, this frantic wording doesn't help the output go anywhere but silently into a log file where it will be ignored. But, perhaps if this is some server software, an administrator will notice this message and set up an alert that makes their BlackBerry buzz when those particular words show up in the log file, so they can ssh in and look for problems.

Then the developer gets chastised by his manager for his un-informative error message, and updates it to be something clearer:

log("Serious Error: phase inducers have been depolarized. Contact engineering immediately.")

Of course this breaks the administrators' alerts, so after much discussion between programmers and admins, log levels are added so that admins only get alerts when something "really bad" happens, where "really bad" is an agreed upon flag:

log2(SERIOUS_ERROR, "phase inducers have been depolarized. Contact engineering immediately.")

Okay. Now we've got a log level so admins can tell when their pagers should go off. Except, different developers have different ideas about what "serious" means.

log2(SERIOUS_ERROR, "OMG I lost my cat Mittens. Where is my cat?")

Clearly this is an abuse of the new "severity" flag that was added, but the cat-engineering team thinks that loss of a cat is pretty serious, so we add a new thing, a log "system".

log3(SYSTEM_CATS, SERIOUS_ERROR, "OMG I lost my cat Mittens. Where is my cat?")

Most logging systems stop in this general vicinity, but we still haven't solved the problem, which is that the log message has no structure and you can't tell what's going on without groveling around in a bunch of text files with regular expressions or manually reading each message. Which cat was lost? Which phase inducer was depolarized? How do we get from a log message or alert to this information? The 'log levels' solution to this problem is clearly untenable:

logRidiculous(SYSTEM_CATS, ALERT_IF_YOU_LIKE_CATS, O_RLY, YA_RLY, SERIOUS_BUSINESS, BUT_NOT_TOO_SERIOUS, CAT_LEVEL("Mittens"), "OMG I lost my cat Mittens. Where is my cat?")

More importantly, if you're writing a library, you have a bunch of other problems. This diagnostic information needs to be logged somewhere, but what if this library is being used on a user's desktop machine? Some of these messages are relevant to them as well. How do you tell the user who is using a GUI that a cat has been lost? How do you show them the picture of Mittens so they will recognize her if they see her?

Everyone agrees that log messages need some "small amount" of information associated with them, but very few people can agree on what that information should be. Even at the simplest layer, the idea of a "level", there are lots of open questions. Is the "debug" level for a programmer trying to debug something on their test rig, or is it for administrators trying to debug something in production? Should there be a difference between those two things? How serious does a problem have to be before warranting a "critical" classification?

Once you're using logging code written by more than one programmer, or worse yet, more than one team, you're going to be facing this problem.
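For contrast, here is a minimal sketch of what it looks like to log data rather than text. Everything in it is hypothetical: `make_event`, the field names, and the inducer number are invented for illustration and don't come from any real logging API.

```python
import json


def make_event(system, kind, **fields):
    """Build a log event as plain data instead of a pre-formatted string."""
    event = {"system": system, "kind": kind}
    event.update(fields)
    return event


events = [
    make_event("cats", "cat_lost", cat="Mittens"),
    make_event("engineering", "inducer_depolarized", inducer=3),
]

# One JSON object per line is still plain text, so grep keeps working.
lines = [json.dumps(event, sort_keys=True) for event in events]

# "Which cat was lost?" becomes a field lookup, not a regular expression.
parsed = [json.loads(line) for line in lines]
lost_cats = [event["cat"] for event in parsed if event["kind"] == "cat_lost"]

print(lines[0])
print(lost_cats)
```

The level and system flags from the examples above turn into ordinary fields, and adding another field doesn't require another argument to every call.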

The Particular Problem of Libraries

This is, of course, my main interest, since this is where the rubber meets the road for Twisted. Libraries need to communicate to several different audiences:

We need to tell developers using the library about the correct way to use the library at runtime.
We need to tell administrators of systems using the library about the status of the library and tasks they may need to perform to keep it functioning well. (Clear your caches, restart the server, install a security update...) We also need to provide administrators with information they can mine for statistics about how the library is performing; how many requests handled, where its resources are going, etc.
We need to notify users of applications using the library about things that the library is doing which may be relevant to them. (A new message has arrived, a new printer is available... obviously this depends heavily on what the library does.)

Libraries, especially event-driven engines such as Twisted, libevent, and glib, have a particularly difficult time because they have to deal with all of these audiences simultaneously. However, I think that any application or server which needs to do some kind of logging or user notification needs a subset of these features, so if any logging system could solve this problem, it could solve pretty much all logging problems.

Type of Information by Type of Audience

Developers, Developers, Developers

Many languages don't have a solution to developer communication at all. Python has one — the warnings module — but it is in many ways inadequate.

The warnings module doesn't easily let you selectively see which libraries you want to see warnings for. If I'm developing an application A using libraries M, N, and O, which themselves have dependencies on X, Y, and Z respectively, I don't want to see warnings that M caused in X or that O caused in Z; those are problems for the maintainers of M and O.

I am maintaining only A, so I want to see warnings caused by my application in M, N, and O. I can try to filter specifically by module, but unfortunately the only way of determining which library caused which issue is by directly examining stack depth, which is unreliable at best and misleading at worst. Even if I could filter very accurately, it's hard to get a stand-alone report of warnings and deal with them as they're supposed to be. Warnings show up to end-users as well, and to administrators looking at applications in production. It's worth putting up with that to have at least some solution for communication with developers, but it would certainly be better if it didn't happen.
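To show what "examining stack depth" means in practice, here is a small sketch using the standard `warnings` module. The function names are made up; the point is that the library author has to hard-code how many frames up the "real" culprit lives.

```python
import warnings


def old_routine():
    # stacklevel=2 blames whoever called old_routine(). That is only right
    # if the library author counted the intervening frames correctly.
    warnings.warn("old_routine() is deprecated; use new_routine()",
                  DeprecationWarning, stacklevel=2)


def application_code():
    old_routine()


# Capture warnings instead of printing them, so they can be inspected.
with warnings.catch_warnings(record=True) as caught:
    warnings.simplefilter("always")
    application_code()

warning = caught[0]
print(warning.category.__name__, warning.filename, warning.lineno)
```

If `old_routine` later grows a helper function between itself and the `warn` call, the same `stacklevel` silently starts blaming the wrong code.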

Finally, it's easy to generate a huge amount of warning noise (and, especially as of Python 2.6, many libraries do). With that much noise and no reporting functionality it's hard to find the warnings you care about.

A better solution for communicating with developers would be one which:

allowed developers to declare somewhere what code they are working on and what code they are just using
recorded relevant warnings to a log file which was optimized, perhaps with an associated tool, for locating and removing the sources of the warnings
allowed end-users to easily communicate their warning data to developers without inundating them with irrelevant noise while using the application

Administrators

To communicate with administrators there is a huge variety of options, but many of them depend on a lot of ad-hoc hackery by the admins themselves, which means they are inconsistent, and therefore there are few reusable technologies or standardized APIs available.

Right now the gold standard for talking to admins seems to be just writing strings into a text file and hoping they have some facility to read it.

A better solution for communicating with administrators would be one which:

preserved structured data in an analysis-friendly format, rather than formatting it in human readable messages. (For most UNIX admins, I imagine some kind of structured text would be best, so "grep" would still work but more advanced tools could also be brought to bear. I'm not sure what the tools in the Windows world look like. The "Event Viewer" looks like maybe it's a step in the right direction, but its UI is incredibly primitive.)
provided easily-accessible hooks for dispatching different types of events to ad-hoc code that wires into existing notification systems, without significantly altering the behavior of the system doing the logging if those hooks turn out to be broken (admin-written code tends to be a bit flaky)
included an enumerated list of events which administrators could inspect before they happened to run across them in log files
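A rough sketch of the first two requirements might look something like this. `EventLog`, its methods, and the event fields are all hypothetical names of my own; a list stands in for the log file.

```python
import json


class EventLog:
    """Hypothetical event log: records structured events, fans out to hooks."""

    def __init__(self):
        self.lines = []          # stands in for the log file
        self.hooks = []          # admin-supplied callables
        self.hook_failures = 0

    def add_hook(self, hook):
        self.hooks.append(hook)

    def emit(self, kind, **fields):
        event = dict(fields, kind=kind)
        # Record first, so a broken hook can never lose the event.
        self.lines.append(json.dumps(event, sort_keys=True))
        for hook in self.hooks:
            try:
                hook(event)
            except Exception:
                # A flaky hook is counted, but must not disturb the caller.
                self.hook_failures += 1


def flaky_pager(event):
    raise RuntimeError("pager gateway unreachable")


alerts = []
log = EventLog()
log.add_hook(flaky_pager)
log.add_hook(lambda event: alerts.append(event["kind"]))
log.emit("cache_full", used_mb=512)

print(log.lines, alerts, log.hook_failures)
```

The broken pager hook fails, the second hook still runs, and the event is on record either way.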

Although it's a crappy format in many ways, the Common Log Format for HTTP might serve as a good example. Unfortunately it's too purpose-specific to extend to do more than what it already does, but lots of tools have been written to produce lots of interesting data from even that very simple standard.
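As an illustration of why even that simple standard is useful, a single regular expression is enough to turn a Common Log Format line back into named fields. The sample line is the usual example from the Apache documentation; the pattern is my own sketch and doesn't handle every quoting edge case.

```python
import re

# host ident authuser [date] "request" status bytes
CLF = re.compile(
    r'(?P<host>\S+) (?P<ident>\S+) (?P<user>\S+) '
    r'\[(?P<time>[^\]]+)\] "(?P<request>[^"]*)" '
    r'(?P<status>\d{3}) (?P<size>\d+|-)'
)

line = ('127.0.0.1 - frank [10/Oct/2000:13:55:36 -0700] '
        '"GET /apache_pb.gif HTTP/1.0" 200 2326')

# groupdict() returns every named group as a string.
fields = CLF.match(line).groupdict()

print(fields)
```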

Desktop Users

There are two popular cases for communicating with end-users. One case is that you're actually running a program on their desktop and you want to tell them something. Another is that you've got some code running in a web application which wants to tell them something "out of band".

On the desktop, there are fairly standard "notification" APIs for popping up little bubbles. On the web, there are emerging conventions for these notifications, like a bar that descends from the top of the page to mimic the Firefox 'do you want to remember this password?' UI element. A good example of this is Stack Overflow's notification banner.

Unfortunately both of these have a problem with scale and with timing. If your application suddenly encounters a large number of errors, it will flood the user's screen. If the user isn't present when a notification occurs (or navigates away), the bubbles or banners may disappear.

A better way to talk to end-users would be:

for desktop applications, a mini-email interface, which records notifications in a scrolling list so that users can inspect notifications that occur while they're away.
for web applications, a standard API so that multiple applications on a single site (or even, potentially, on different sites) can drop notifications into a queue which can be displayed appropriately. (Since websites tend to have strong preferences to control their own design, an actual standard widget might not be possible, but it would be nice.)
for both of these, a standard protocol which would enable notifications to be easily streamed to different computers or mobile devices without needing to reconfigure
a specific classification of messages at the API level, saying, "I want to tell the end-user about this". Messages about crashes, etc, should be displayed as an option to send information to the developer. In the context of a web application this can be done automatically and silently; in a desktop app there would need to be some channel set up for sending that information.

Let's not forget that administrators are users too. Everything that happens in a server's notification APIs should be able to be trivially filtered and redirected to an administrator's desktop machine (or their phone) so they can immediately notice when something has gone wrong.
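Here is a toy version of the "mini-email" idea, addressing both the scale and the timing problems. `NotificationInbox` is a hypothetical name, and collapsing repeated messages into a count is just one possible policy for dealing with floods.

```python
from collections import deque


class NotificationInbox:
    """Hypothetical inbox that keeps notifications until the user reads them."""

    def __init__(self, limit=100):
        # Oldest entries fall off the front once the limit is reached.
        self.items = deque(maxlen=limit)

    def notify(self, text):
        # Collapse a run of identical messages into one entry with a count,
        # so a burst of errors doesn't flood the list.
        if self.items and self.items[-1][0] == text:
            self.items[-1][1] += 1
        else:
            self.items.append([text, 1])

    def unread(self):
        return [(text, count) for text, count in self.items]


inbox = NotificationInbox(limit=3)
for _ in range(500):
    inbox.notify("Connection failed")
inbox.notify("A new printer is available")

print(inbox.unread())
```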

Appendix A: Optimization and Dynamic Instrumentation

This doesn't fit into the "figure out what you need to say and who you need to say it to" theme of the rest of the requirements here, but it is nevertheless important. If you make heavy use of a logging system, especially one where you have lots of messages that are logged "just in case" and rarely displayed, you will quickly discover that it's consuming a lot of resources.

The work to calculate what goes into a log message shouldn't be done unless the message is actually going to be emitted. Similarly, it should be easy to dynamically add and remove log events from particular methods or particular objects without modifying their code. The best way to do this is to keep the logging entirely separate and add it at the level of methods and functions, rather than ad-hoc in the middle of application code. Sometimes, of course, you want to emit log messages at a very particular point, but in general it should be easy to add instrumentation to a method without modifying its code, to avoid cluttering up application logic with lots of "just in case" debug messages.
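A minimal sketch of both ideas, assuming a hypothetical `Instrumentation` object that owns the on/off switches: the wrapper is attached from outside the function, and the message is only formatted when that function's probe is enabled.

```python
import functools


class Instrumentation:
    """Hypothetical switchboard for per-function log events."""

    def __init__(self):
        self.enabled = set()   # names of functions currently being logged
        self.records = []      # stands in for the log destination

    def instrument(self, func):
        @functools.wraps(func)
        def wrapper(*args, **kwargs):
            result = func(*args, **kwargs)
            if func.__name__ in self.enabled:
                # The message is only built when someone is listening.
                self.records.append(
                    "%s%r -> %r" % (func.__name__, args, result))
            return result
        return wrapper


def add(a, b):
    return a + b               # no logging calls in the application logic


probes = Instrumentation()
add = probes.instrument(add)   # attached from outside, e.g. at startup

add(1, 2)                      # probe disabled: nothing formatted or recorded
probes.enabled.add("add")
add(3, 4)                      # probe enabled at runtime

print(probes.records)
```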

What Can Do This?

I'm not aware of any logging system that can already do these things. We'd have to write a new one. This essay was largely composed due to my desire to understand what I thought a "good" logging system would do. It might be too ambitious.

There are a few fundamental units which are missing from most logging systems. While lots of logging systems have various ways to indicate subtle and nuanced levels of urgency, few have a way to indicate who a message might be relevant to. Logging systems also generally don't have a good way to associate structured information along with a log message.

Twisted's logging mechanism does have a free-form dictionary associated with each message, but that's not much help unless you impose your own structure on it, which means you have to build all this infrastructure anyway. It is, at least, possible to treat the Twisted system as a kernel which this could be based upon.

In order to produce an enumeration of events before the events are actually logged, this API will require pre-declaration of log messages. This might be too much of a burden, since sometimes you want to just stick a log message into the middle of some code so that you can see if it happens. So, in practice, it's more likely that pre-declaration will need to be optional and you'll need to be able to associate ad-hoc data and have it still be persisted along with your message.

There will need to be some way for communicating both the structured data of an event and the human-readable text associated with that event — preferably in a way which can be internationalized.
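One possible shape for this, sketched with invented names (`EventType`, `render`, and the French template are purely illustrative): declared events register themselves so they can be enumerated ahead of time, ad-hoc events keep their data even without a declaration, and the human-readable text is rendered separately so that it can be swapped for a translation.

```python
class EventType:
    """Hypothetical pre-declared event: a name plus a message template."""

    registry = {}   # every declared event, inspectable before any is logged

    def __init__(self, name, template):
        self.name = name
        self.template = template
        EventType.registry[name] = self

    def __call__(self, **fields):
        return {"event": self.name, "fields": fields}


def render(event, templates=None):
    """Produce the human-readable text; `templates` may hold translations."""
    declared = EventType.registry.get(event["event"])
    if declared is None:
        # Ad-hoc event: no template, but the data is still preserved.
        return "%s %r" % (event["event"], event["fields"])
    template = (templates or {}).get(event["event"], declared.template)
    return template.format(**event["fields"])


CAT_LOST = EventType("cat_lost", "Lost cat: {cat}")

declared_event = CAT_LOST(cat="Mittens")
ad_hoc_event = {"event": "just_checking", "fields": {"x": 1}}

print(render(declared_event))
print(render(declared_event, {"cat_lost": "Chat perdu : {cat}"}))
print(render(ad_hoc_event))
```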

There's also a bunch of UI, web, and protocol standardization work that would need to be done. Luckily, that's independent of the actual log machinery; if it already existed it would be a trivial matter to hook it up. In the meantime, something that did all of this but just used existing facilities, like the desktop notification spec, status icons and email, would still be immensely useful.

Sponsor Twisted!

Saturday May 23, 2009

I've kicked off our 2009 sponsorship drive with a post on the Twisted Matrix Labs blog.

Please share and enjoy.

Making easy_install work with Combinator

Thursday April 23, 2009

This is posted mainly for my own benefit, so that I won't have to re-remember the command-line options to make this work for the nth time in a row, but some of you may enjoy it:

easy_install --prefix ~/.local/ --site-dirs \
    ~/.local/lib/python2.5/site-packages \
    your_package_here

Now that I've gone through and understood why that's necessary, I think I might be able to fix it in Combinator one day...

Notification Disappointment in Ubuntu Jaunty

Saturday April 11, 2009

I have recently been working on some applications that make use of the features provided by notification-daemon. I installed Jaunty to check out how the much-vaunted new notifications framework works and how it would affect these applications. I was aware that it sacrificed some features, but I wasn't worried. I am generally a fan of the GNOME philosophy of dropping functionality and configurability when it doesn't really serve the user. I also appreciated the new, more minimal and sleek-looking graphic design of the notifications.

I feel like I need to preface my reactions with an apology. I really like Ubuntu. In many ways Jaunty looks really slick, and I'm generally enthusiastic to upgrade.

But this notifications stuff? Wow. What a disaster.

First, a little background. I recently started working on two different applications which made heavy use of the notification API. While I originally thought I'd need to implement some of the notification features that I wanted on my own, I was pleasantly surprised to discover that notification-daemon provided almost all of them.

I want to present the user with a time-critical notification, one which doesn't grab the focus and interrupt their work. I want to show the user how much time they have to respond, and provide a few different options for responding to certain notifications. In one application, I'm implementing a sort of dead-man's switch, where failing to respond within the time limit also counts as a negative response. I also want to emphasize certain notifications, and provide hyperlinks to their web-based origin so the user can jump straight into the application at the appropriate point if a notification is interesting.

Some of the notifications I want to generate are notifications of events, and some are indications of a change in the status of my application; so sometimes I want to point at a particular status icon and sometimes I want to just drop the notification into a queue, hopefully a global queue which would intermix with other notifications.

I was assuming that I'd need to implement some of these features myself, but to my pleasant surprise notification-daemon handled every single use-case, more or less exactly how I envisioned it working. I was thrilled. As part of searching around for "notification" stuff, I learned that Jaunty will have some newer, even cooler notifications stuff, and I was really excited. Unfortunately, Notify-OSD, the new Ubuntu-specific notification daemon, drops nearly all of these features. So, I'm back to square one.

Here are some of the specific things which bothered me about it.

Applications which emit a notification that prompts for an action — something which they only would have done if they explicitly wanted to avoid grabbing focus, since popping up a dialog box is easy enough — will have a modal dialog box pop up and grab the user's focus while they're working.
Timeouts are no longer honored. It so happens that in my application I have an operation which takes 2 minutes to time out; I would really like my notification to stay on-screen for this entire time. I can still do this with notify-osd, but in order to do so I have to watch for the "closed" event and constantly create new notifications. A smoothly animating timer was a much nicer interface than a sequence of bubbles saying "In 30 seconds I will time out. In 25 seconds I will time out. In 20 seconds I will time out." Yes, I realize that the Ubuntu desktop team will say that I should do something different in this situation, but they're not designing my application and I don't like the options they've suggested. This would be much less of a problem if the provided notification timeout weren't so distressingly fast. As a user, by the time I've realized that a notification has popped up and taken the time to focus my eyes on it, it disappears halfway through my reading its message. I am a very fast reader, and I'm pretty sure that there are a lot of people who are never really going to notice any notify-osd bubbles; they're so fleeting, they're just visual noise.
Markup is now silently ignored. I can't emphasize portions of a notification with a larger font, or provide a hyperlink to the origin of the notification. Similarly, since actions have been broken, I can't provide an action to jump to what caused the notification. Notifications have thus become disconnected UI elements which tell the user something while providing absolutely no tools to deal with it or respond to it. For example, when Pidgin tells me that a user has signed on, I can no longer interact with the notification to say "yes, I would like to talk to that person". I have to switch windows to the buddy list, locate the person who just signed on, and click on their name, rather than just clicking on the notification itself. Or, I have to click on the tiny notification-daemon status icon that's a hard target to hit with my mouse, rather than a big friendly button.
Notifications are now displayed in the upper-right-hand corner of the screen (where important chrome, like close boxes, search boxes, toolbars, and menus frequently reside) rather than in the lower-right, where less important application features are traditionally located. Granted, this is a damned-if-you-do-damned-if-you-don't situation, because some applications put important action buttons in the lower right, but I have been learning where to position my windows to avoid that area of the screen for years, and now I have to learn new habits.
Notifications can no longer be positioned on screen, relative to either widgets in windows or status icons. This removes the ability for an application to use notifications to draw attention to a particular area of the screen. Instead, users must make some connection between the notification bubble and the status icon themselves, perhaps by identifying some common graphical element. If you're already using the "icon" area of the notification to display a picture (such as a person's portrait) it's a bit cramped to also show a copy of the status icon, especially given that the icon will now be squished for you so it's hard to get a pixel-accurate rendering of the status icon anyway.
Some of these problems are nominally addressed by the new "indicator applet" facility in Jaunty, but...
1. The indicator applet and libindicate library appear to be almost completely undocumented; the "reference manual" looks like it was an auto-generated stub. The automatically generated API documentation isn't hosted online anywhere.
2. The Python bindings for libindicate are similarly undocumented, and they aren't packaged anywhere, not even in a PPA. They also use autoconf, rather than distutils, for installation, so their build process doesn't produce a usable extension module and thus they resist installation anywhere but in /usr.
3. I tried to read what passes for documentation — the patch to Pidgin's libnotify plugin that switches it to use libindicate for some things. I tried it out because I wanted to see if maybe the indicator applet could address some of my concerns, but my misuse of the API caused the indicator applet to instantly segfault.
4. From what I can tell, you can't just provide an icon and some text, you need to actually create a .desktop file, which means that packaging applications which want to use the indicator applet automatically gets two additional layers of complexity: first, you need to create a .desktop entry, and second, you need to figure out a way to have your application include it during installation.
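
To make the cost of the timeout workaround described earlier in this list concrete, here is a minimal sketch of the bubble sequence it produces. It is not notify-osd or libnotify code: countdown_messages and bubble_seconds are names made up for illustration, and the five-second bubble lifetime is an assumption rather than a documented value.

```python
def countdown_messages(total_seconds, bubble_seconds):
    """Return the text of each bubble needed to cover total_seconds.

    Hypothetical helper: a new bubble is created each time the previous
    one closes, which is assumed to happen every bubble_seconds.
    """
    messages = []
    remaining = total_seconds
    while remaining > 0:
        messages.append("In %d seconds I will time out." % remaining)
        remaining -= bubble_seconds
    return messages


# The two-minute operation from the text, with an assumed 5-second bubble.
bubbles = countdown_messages(120, 5)
print(len(bubbles))   # how many separate bubbles the user would see
print(bubbles[-3:])   # the tail end of the sequence
```

Under those assumptions a single two-minute timer turns into two dozen separate bubbles, which is the interface the smoothly animating timer used to replace.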

It's interesting to read the version history for Growl, the OS X notification tool which so clearly provided the visual inspiration for Notify-OSD, and notice that many of the features now being removed (application level positioning, close buttons on notifications) are features which were added to later versions of Growl.

The "notification design guidelines" provide some very vague suggestions for ad-hoc mechanisms to work around these regressions. These suggestions are unhelpful. I want an API that I can call, not a picture of a window I need to re-create myself. If the desktop team wants to change the look of my application in the future, I don't want them to submit a giant pile of patches against it.

Not only are the suggestions not a library, they don't include sample code, either. How do I actually create an alert box which doesn't take focus, doesn't include any window manager controls, but stays above other windows? Describing it this way is an invitation for applications to be inconsistent. My interpretation of this specification will inevitably be different from that of other application authors. For example, the specification doesn't say anything about compositing, but the window in the screenshot clearly appears to be alpha blended with the background. It also doesn't say anything about WM controls, but some pictures have minimize/close buttons and some don't. Some applications will have alpha blending for their notifications, some won't. And since there's no library here, there's no reasonable way to consistently control the behavior of many applications.

With these features in libnotify, we have a single uniform queue for users' attention: a single point of control which might be adjusted, bugfixed, and tweaked across the desktop as a whole, without writing tons of patches for individual applications. Notify-OSD even takes advantage of that point of control. But the suggestion for many of these features is to take control out of a single easily-managed client/server protocol and push it into a bunch of ad-hoc application-specific widgetry.

To add insult to injury, in the one place where I do get hard examples of how to do things, the notification development guidelines page, the Python code samples have yet to be written. This is a minor nit, as I can definitely figure out what's going on from the C# code, but it's symptomatic of the same systemic problem with the Linux desktop ecosystem that brought us this half-baked replacement for notification-daemon. Jamie Zawinski identified this problem as the "Cascade of Attention-Deficit Teenagers", but Canonical demonstrates that you can create this same problem with a medium-size company that employs highly competent, adult engineers.

Notification-daemon isn't perfect. It could clearly stand to be improved, especially in the face of notification spam. So please, improve it! Or, at worst, if upstream is not cooperative, fork it. I'm pretty sure that the solution to the rate limiting problem is not "then... what?". Notify-OSD cuts the gordian knot of notify-spam by only letting you see one thing at a time, but that has its own problems.

Now, to be fair, all these regressions haven't really cost me any work. I want my applications to be cross-platform, so I was going to have to implement most of this functionality for Windows anyway, using animating borderless windows. Now I'm just going to be using my own notification widgetry on Ubuntu as well, rather than elegantly integrating with the platform and providing all of my notification interaction through a familiar UI. But I'm sad that the superior notification infrastructure on Linux in general and Ubuntu specifically is no longer something that makes my application easier to write first.

So, beyond this one little screed, I'm really not going to complain too much. I'll implement some of my own ideas for notification, try to come up with some way to be friendly to Notify-OSD in the meantime, and I'll still eventually upgrade all of my computers to Jaunty and enjoy the other eye-candy and performance improvements. This is not too terrible of a price to pay, and I do keep it in perspective. I also understand that I'm not the guy who has to make the hard decisions for what goes into Ubuntu or GNOME or whatever. I understand that sometimes, in order to make an omelette, you have to kill a few people.

But, if a decision maker for Ubuntu were to care about my entirely irrelevant opinion, as both an application developer and heavy user of notification systems, I would say this to them:

You guys have done some great work on Notify-OSD. It's a worthy prototype. In many ways it is better than notification-daemon: it looks nicer, it makes notifications between applications more consistent. I can appreciate the uncompromising vision you have for cleaning up the sometimes confusing pile of notifications that users see.

You should package Notify-OSD in Jaunty, so that people start using it. Start updating applications to honor the capabilities that it provides. But please, don't make it the default in the first release where it's included, and yes, include a preference for the period of transition. Write some libraries to support the other use-cases which Notify-OSD right now ignores. Document and stabilize the indicator applet. Package the Python bindings, please. Make it not crash when applications abuse it.

Regardless of this new notification system's unpolished state, I'm sure many users will update and start experimenting with Notify-OSD, much as many ran Compiz for years before it became the default window manager. Most users can keep using the regular notification bubbles until Karmic, though. When Karmic comes along, you'll have had the time you need to finish the documentation and provide application developers better alternatives to a good notification API before yanking the carpet out from under them.

If this advice is ignored, as I'm almost sure it will be, it won't bother me - notification is hardly the most important API that an OS provides. The thing that I really hope someone will take away from this is the general theme that platforms should evolve experimental features slowly, and you should always have a well-documented, better alternative ready before you remove something. Notify-OSD removes a half-dozen features and informally, halfheartedly gestures at some ways you can make your window pop up to address what some of those features used to do. That's fine, as a scrappy new competitor to notification-daemon, but not as a core part of a major platform.

My real hope is not specifically that Notify-OSD will actually be pulled. Of course I'd be happy if it were, but Jaunty is going to be released in just a few days. Again, I feel like I need to qualify these statements: if I'd really wanted to impact decisions like this I should really be regularly using beta releases. Not to mention the fact that if I'd actually finished these hypothetical applications I'm thinking about, I'm sure my voice would carry more weight.

My real hope is that you, gentle reader, will take this message, and the next time you are contemplating boiling your favorite ocean, you'll stop and reflect. Break the changes you are planning down into individual, incremental improvements, rather than one sweeping, break-everything lateral movement. Make radical improvements, but make them behind the stable facade of a system which is only lifted when the radical improvement is clearly both radical and an improvement.