Sunday, May 24, 2009

Thinking about tagging

Tagging is a way to label articles and other information objects (such as events) for retrieval and clustering. A good community news web site will support searching articles, for example, and will find all articles that contain a word (or phrase). At their simplest, tags allow an article to be found even if it doesn't contain the search term.

Many times, articles will talk around a subject--the elephant in the living room. When we talk about cutting services and increasing public sector revenue, it is understood that we are talking about "budget," "city finances," and "Bud Conway," the city finance manager, even though those words might not be in the article (let's pretend). The extra words become tags so that the article can be retrieved for a searcher who would likely care.

In searching, you have precision and recall. Precision says that all search results will match the query--there will be no irrelevant articles. Recall says that all articles that match the query, even at the fuzzy edges, will be in the search results, along with a percentage of irrelevant ones. For most applications when we are searching a database of community articles, we want high recall, and tags help with that.

Tags also cluster information. The author of an article can tag it with "News" and readers can browse the News category and find the article, along with other news stories. If the author tags the article with two tags--"News" and "Schools," say--then the article appears in both categories (high recall in the browsing scenario).

Then, there is the notion of readers tagging the stories, the way Flickr lets viewers tag photos. It's a little different with photos, obviously, because the photos can't be searched without captions or tags. With articles, tagging not only potentially helps searching, but it's also a way of interacting with the article--it's a measure of popularity, or interest in an article. Both articles were viewed 1000 times, but this one had a lot more tags proposed. But do we really want readers deciding that something is "News?"

Ed Chi and Todd Mytkowicz of Xerox PARC wrote about aspects of tags recently. They explored the basic mystery of why uncontrolled reader tagging generally works. Users can tag articles with any combination of letters and numbers. Anarchy. Chaos. But, by most measures, the tag system achieves its goals. "Social tagging...[is] attempting to solve a mapping problem," they write. Users are collectively creating a map that will enable them and others like them to navigate the territory efficiently in the future.

One reason I think the potential for anarchy isn't realized is that griefers--vandals--like their mischief to be visible. If I tag a gossip story as "News" and it now appears on the Front Page, it's like spray-painting a mustache on a billboard--everyone can see how clever I am. But if I tag an article as "asdf," it is easily ignored.

Col Needham, the creator of the Internet Movie Database (imdb), has written that a few fairly basic ad hoc tweaks to the search interface greatly improved the searchability of the very large database. Finding a movie like "20,000 Leagues Under the Sea" would be difficult without synonyms like "Twenty Thousand," "20 Thousand," "20000," and so on.

Mindful of Needham's experience, I think a community news web site will need both an anarchic reader tag system alongside a more controlled tag vocabulary used by authors and editors. We'll want to suggest tags for use by authors that perhaps come pre-synonymized. One selection by an author from a drop-down list might add 3 tags. For the reader: "Propose a tag. Type it here. We'll let you know."

By the way, Chi and Mytkowicz found that entropy makes tag systems work less efficiently as they grow. Tags become less descriptive over time and "tags are becoming less meaningful in regards to providing salient navigability."

"Even with a tagging system, the navigability of the document set is becoming more challenging over time. One way for users to respond to this evolutionary pressure is to increase the number of tags they use to specify a document."

Chi and Mytkowicz find that, as the number of articles grows, users are increasing the number of tags they apply, and searchers are using more search terms. They report that Yahoo!'s average query length was 1.2 words in 1998, 2.5 words in 2004, and 3.3 words in May 2006.

Friday, May 22, 2009

Forbes.com offers a working example

Wednesday, Forbes.com posted a near-perfect example of the difference between local news and remote, national, mega-news. They posted Matt Woolsey's Home of the Week, a $6.7 million estate on 20 acres, apparently in Santa Clara County. It's not clear where the home of the week is because the headline says Los Altos, but the article says Los Gatos. I live in Los Gatos, and the photo looks like Los Altos to me.


I pointed out the error in an e-mail to Forbes yesterday, they haven't fixed it, so I think they're fair game. (I have had my suggested corrections implemented by several national news sites; Forbes is just lame.) The trouble is, Mr. Woolsey's text describes Los Gatos--he mentions the zip code, the proximity to the redwoods and the ocean, and the Spanish translation of the name. The url is http://www.forbes.com/2009/05/20/los-gatos-home-lifestyle-real-estate-home-of-the-week.html--the name Los Gatos is embedded in the url. How embarrassing if the home is really somewhere else, which I suspect it is, since I wrote a book on Los Gatos architecture and the home doesn't look familiar.

From Wikipedia, we learn that Mr. Woolsey is 28 and was born in San Francisco. He writes for Forbes (limiting himself to real estate, lifestyle, labor, baseball, transportation, and small business) but he has also appeared on CNN, CNBC, NPR, Fox News and others.

Mr. Woolsey is on the Steve Lopez track--journalists who get so big they can't respond to an e-mail and who don't stay in one place long enough to understand what they're talking about. At least Lopez still only covers Los Angeles. Woolsey is trying to cover six desks nationally.

I'm whining about this because I believe the future needs journalists, but that they must be embedded in their beat. This shoddy "Home of the Week" nonsense only works in national printed media. On the web, his story about California real estate that does such a great job filling space between Breitling ads in Forbes in Manhattan instantly reaches a guy who wrote a book about architecture in Los Gatos and he's busted.

The web keeps journalists--anyone who tries to tell the rest of us how it is--honest. I've written many wrong things in thousands of Los Gatos stories, but I corrected them as soon as I was told they were wrong. When the facts were in dispute, reader's comments presented alternative viewpoints. The articles are part of the archival record, and because they were corrected and responded to, the archives are more valuable. I'm guessing Matt Woolsey and Forbes don't think that way with respect to left-coast "Home of the Week" fluff.

UPDATE: It took my wife, Peggy, five minutes to determine that the house is in Los Gatos, after all. It's on the hillside west of Highway 17, just south of Wood Rd. She found it on realtor Michael Nevis' web page. This means that the only correction Forbes needs to make is to the headline. Oh, and the "gated community" comment has to go unless two or three homes sharing a gate qualifies as a gated community.

FURTHER UPDATE: 9/22/09 It's been 3 months. The article is still online and the headline still says Los Altos.

Wednesday, May 20, 2009

The Wisdom of Crowds In Reverse

An article in this morning's Wall Street Journal caught my eye. Although it is not the point of the article, I was intrigued by the results of a study led by social influence researcher Robert Cialdini.

When hotels left a message asking guests to please reuse their towels to help the hotel save energy, 84% of the guests refused. When the message was, instead, "Partner with us to help the environment," almost twice as many guests, 31%, reused their towels.

When the message was changed to "Almost 75% of guests reuse towels," 44% of guests responded by reusing their towels. And when the message became even more local and specific--"75% of the guests who stayed in this room reuse towels"--49% of guests began reusing their towels.

Local and specific. "Dine out more often" or "Denny's: Real Breakfast 24/7"--these messages are easy to tune out. But to a local audience: "New desserts at the Cup & Saucer," or "Cup & Saucer Sponsors Little League" may sound corny, but unless you're just passing through, they are important stories that you want to know about. "Considering retirement, Smith may close Cup & Saucer" is a grabber.

I suspect these local messages are sticky for the same reason we want to know what guests who stayed in our hotel room did with their towels. Local people, specific places where we go or could go to spend our money--these are a degree of magnitude more important to us than some international chain's ads. National advertisers are so used to the automatic multiplier of media like television that they don't seem to realize the impact that the web will have on community, localcast, advertising.

Tuesday, May 19, 2009

It's a new, over-wired world

The Onion News Network reports that police were able to figure out what caused a fire at a fictional crowded college party because there were 43,000 photos taken and twittered at the party. My wife points out that this is not just funny, it's a glimpse of the future:


Police Slog Through 40,000 Insipid Party Pics To Find Cause Of Dorm Fire

Update 5/20: The morning news featured the preliminary murder trial of Johannes Mehserle, in which the witnesses were called to testify about their camera and cell phone videos of the shooting. The gunshot could be heard on some of the video and the victim's relatives reportedly reacted each time the sound was played from another viewpoint.

Monday, May 18, 2009

Dean Singleton: Dawn Breaks Over Marblehead

Dean Singleton, the news aggregator who bought up dozens of once-great newspapers in the fin-de-siecle of their existence, wrote a memo to his editors on May 8. He just figured out that the web is important, too, but he has decided that giving away news articles online devalues his dead-tree newspapers. This may sound sarcastic and hyperbolic, but it is no more than the truth.
...we continue to do an injustice to our print subscribers and create perceptions that our content has no value by putting all of our print content online for free. Not only does this erode our print circulation, it devalues the core of our business - the great local journalism we (and only we) produce on a daily basis.

Judging by the San Jose Mercury News, it is a stretch to call it "great local journalism" these days. Singleton says it's time to end the free ride, time to tell
our online audience (who don’t buy the print edition), that if you want access to all online content, you are going to have to register, and/or pay. If a non-subscriber wants the newspaper content in its entirety online, they will be directed to some sort of registration or pay vehicle (and if they are a print subscriber, they will have full access at no charge). To be clear, the brand value proposition to the consumer is that the newspaper is a product, whether in print or online, which must be paid for.

Singleton clearly thinks, in the Internet age, that the best way to disseminate news or communicate with an audience is to print the message on paper and place it on doorsteps. Wow.

To reach a younger audience, he proposes a new kind of regional news web site that will include some user-generated news and be "actively managed" to present breaking news. The key will be to differentiate from the existing newspaper.com kind of site and not just present printed newspaper content. He calls the new vision "news.com."

The most interesting proposal is a local site:
We will build a new local utility site (Local.com), which is an ecosystem of local information, resources, user content, shopping guides, and marketplaces. This site will be focused on a younger audience as well as other targeted audiences based on demographics which are attractive to our current and potential advertisers. We have the advantage of being the trusted source of for news and information in our communities and have a large base of traffic to feed into Local.com.

Local.com will leverage existing newspaper content and existing traffic, and we will add new content (such as Entertainment/Lifestyle) to target a younger audience. Central to this local site will be an aggregation of city or community sites (in the YourHub model) and marketplaces.

Local.com will be the ultimate site for people to find stuff, do stuff, and get stuff done in their local market.

This sounds okay, except that we've had the YourHub model off the San Jose Mercury News website for two years now, and it stinks. Go to the mothership web site, click on "Your City" and find the name of your city in the list. Then upload your photos and comment in the forums. Go on. What are you waiting for? You don't find this compelling in the slightest? Well, what's wrong with you?

Thanks to fellow blogger and former newspaperman Gary Scott for posting the entire memo.

Tuesday, May 12, 2009

Closer to the Source

I just realized why I tend to stiffen when I hear someone who has been a professional journalist for decades talk about the web as a media platform. It's odd, because I have a great respect for journalists and their profession. I seem to find myself quoting Clay Shirky at every turn these days, but I agree with him. He says we don't need newspapers any longer, but we have an urgent need for journalism.

I was raised on Time magazine and the CBS Evening News more than the daily newspaper. Walter Cronkite told us what happened, and Time followed up a few days later with a much more thorough report. We've been subscribing to the Wall Street Journal for many years now, and I read it nearly every morning, 24 hours late (we don't visit the mailbox before breakfast). I read the Los Angeles Times online edition often. But I get 100% of my news from Google News these days. I scan those headlines 10 times a day.

Through Google News, I am directed to the New York Times, the Washington Post, the BBC, sometimes CNN and others. That's how I read my news--I don't know how others do it these days.

Have you ever noticed that the news is comprehensive and authoritative only to the degree that you don't know what they're talking about? I lived near Santa Monica Airport as a kid, and we saw a lot of light plane crashes. The TV news usually got some details wrong. I worked for the actor John Carradine's son, Chris, and when his father died, every news story seemed to think he had a different number of surviving children.

So, back to getting the news. When Mr. Ghanem was blown up in Beirut, I read the New York Times account and was glad that they got every detail, every nuance, exactly right. But during the time I covered the news in little Los Gatos, I watched the other media screw up details large and small. The Bay City News Service, which supplies many outlets in the area, is virtually unknown by locals. They have a source at the county fire department, because they put every call on the wire quickly. But they had no one on the scene--they were talking to a desk jockey who wasn't on the scene, either.

One story was an overturned truck just off Highway 17, in which the driver died. Every media source in the area was telling you that this accident was slowing traffic on Highway 17 because on the map it looked close. In fact, the accident occurred at a construction site 400' above the highway. I was there. Drivers had no idea it had happened; they could not glimpse so much as a Highway Patrol vehicle. Traffic was slow because it was a summer Saturday and people were heading to the beach.

This is a long-winded way to say that I like my news from people close to the source. If it's Beirut, the New York Times is close enough. But if I want to know about a crash at Santa Monica airport, now that I live 400 miles away, I'll look for a local Santa Monica news site before I'll visit the Los Angeles Times. If I can find an Ocean Park or a Mar Vista news site or blog, that will tell me more because they are adjacent to the airport, and Santa Monica is a big city, comparatively.

So, if there was a journalist who was covering the heck out of Ocean Park and Mar Vista, that would be ideal. I enjoy Steve Lopez' columns in the LA Times, but he's too big and far-ranging for a plane crash (and now he's an author and a movie character, too). When some journalists talk about the future of the news, they seem to see themselves as Steve Lopez one day. That's what bugs me.

I think the future of news will bring us closer to the source. There is no reason to pull back and let gods-gift-to-writing explain it from media central. The location-independent web lets us zoom in and find out from someone who was right there. Sometimes that might be too close--the person might not write well, or might assume more local knowledge than we have. That's when we zoom out a level--read about Ocean Park in the LA Times, for example. But we will want the capability of zooming in really, really close.

Re-reading my explanation of why a report might be too close, I realize why I respect journalists. I said we might zoom out if the on-the-scene report was poorly written or didn't know its audience. What I'm saying is that I want more journalism so that "even" local news will be reported well.

Sunday, May 10, 2009

Community Garage Sale

Our town holds a Community Garage Sale every May, coordinated by the town's excellent Community Services Director, Regina Falkner, and the volunteer Community Services Commission. When I was first appointed to that commission, I assumed that the event was some sort of communal swap meet on town property. I have since learned that I'm not alone in that initial assumption, but it is wrong nonetheless.

What it is is a single day when everyone is encouraged to have a garage sale and to register their intentions with the town. The town gives sellers a kit of literature--a nice letter, selling tips, and a piece on valuing things for the IRS--and it prints a few thousand maps to the garage sales for buyers. The maps are printed on newsprint and include a table of addresses with an attempt at classification, such as "toys," "baby things," and so forth. There are usually 50 or 60 sales, so the map gets crowded.

The town has a nice website by CivicPlus, and I can't remember if they post a PDF there. But the town pays for an ad in the local weekly dead-tree newspaper to advertise the event. Costs for the town's involvement is about $6,000. We learned this because this year the event has been canceled, "for now." [Note, the 'canceled' link is to a San Jose Mercury News story, which means the link will break at some point in the future.]

Town staff--folks like energetic Volunteer Coordinator Monica Renn--enter garage sale registration information into a spreadsheet, presumably, to produce the table listing. I'm not sure how the map is done, but the town has an impressive geographic information system (GIS) and has cadastral maps of every parcel in town available online. I would bet money that the $6,000 budget for the event is simply what they pay to print the newsprint maps and what they pay for the newspaper advertisement.



I have belabored this point because it is typical of the last-century thinking that is in the process of changing. First, the town thinks it needs to be involved "coordinating" something that will coordinate itself. Second, although the town went out of its way to avoid any mention of or support for the online-only Los Gatos Observer, the annual garage sale ad represents a "gimme" to the weekly newspaper and the printer. They won't be happy at this loss of revenue--look for an editorial opposing some other town "event" in the near future.

But is this the most efficient way to do this? Of course not, and--short of legally restricting each household's sale items--it is probably the least efficient mechanism possible. We settle on one, arbitrary day, for the convenience of the organizers. The town tries to hand out the printed maps--stacks are left at the library and town offices--but most buyers just follow the (illegal) signs on telephone poles. Of course, there is no way to advertise specific items, so vague hints like "toys" are all a buyer gets.

There are garage sales here every weekend, just like everywhere else. The town's kit of information for sellers is a drop in the ocean of information available on the web. (Google returns 25 million links for "garage sale tips.")For buyers, following the signs works on any weekend.

The web is already coordinating garage sales, thank you. It's called eBay, craigslist, and Freecycle. Clearly, it's not about spreading your junk on your driveway, it's about trading things you don't need or want anymore for a little money. Most of us have things we'd rather give away than throw away, even if we're not "save the landfill" environmentalists.



But there is a downside to existing web services. You can't sell your lawn mower on eBay, because you don't want to have to ship it somewhere. Craigslist is great, but you have to interact with strangers. I haven't used Freecycle yet--I think it's a cool concept, but Freecycle is about free, non-profit stuff, and some things are worth more than $0.

A good community news site would let neighbors post sales as events with a link to a Google map. I think members of a community should have the ability to post articles, like this blog post, with photographs. So, you should be able to associate an article about your sale that features some of the items you expect to sell. The community news site would be able to produce a map of all garage sale locations for a given date.

This works like the "self-organizing groups" that Clay Shirky describes in Here Comes Everybody, currently on my nightstand. He explains that the organizational costs of some things of value are just not worth it--Hello? The town needs to save its $6,000?--but that a service can support self-organization. He cites Flickr, where people can share photos and tag them with labels like "Mermaid Parade." And just like that, you and I can find all photographs of the Mermaid Parade, without Flickr or the dozens of photographers doing any "coordinating." It's pretty obvious that's how used stuff should find a good home.

Thursday, May 7, 2009

New Kindle has newspapers in mind


Amazon announced the new, larger, Kindle DX yesterday, and every mainstream media report included the fact that newspapers like the New York Times were part of the New York announcement. Detailed reports like the Huffington Post's revealed that the Times will be selling the gizmo for cheap if you sign up for a long-term subscription to the online NYT.

It all makes sense, but I have to admit to a pang of annoyance that old, dead-tree media gets insider treatment, when information you can only get online--the Huffington Post, say, or this humble blog--didn't get invited to the big press conference. But then I read the Amazon Kindle DX page.

This newest Kindle, which will be out June or later of this year, will read PDF files without translation, and Amazon specifically mentions that this feature allows you to read your "neighborhood newsletter." Why, yes, of course it does.

So the future of newspapers--neighborhood-specific localcasting--is included in Amazon's thinking after all. Amazon also announced a "WhisperNet" service that will let you--that is, everyone--push a PDF to your Kindle. Presumably (but never presume), this will allow small local news publishers to charge for pushing PDFs to your Kindle just like the New York Times.

Tuesday, May 5, 2009

San Diego News Network

The San Diego News Network is a promising experiment. Instead of a number of web sites focused on San Diego's various communities, the SDNN aggregates stories from 25 "media partners." The About Us page lists a lot of paid staff.

This short posting can't do the news site justice, but it occurs to me that "San Diego News" shouldn't have a Food and Drink Section Editor unless we're focused on local cuisine and restaurants. My first reaction was--I'll go to a food site for recipes and world-class restaurants.

But then I found editor Maria Hunt's manifesto and she does talk about area restaurants and recipes for seasonal produce. She also plans to reveal local markets and review local bars. Community News is about people, and Hunt seems to get it:

But the most important part of our Food + Drink section is you, the reader. So we want to know what you’re cooking at home, where you’re eating out and what you’re craving. Got a question on nutrition or dining? Ask us. We’ll post reader food photos and recipes and ideas on how to feed a family on a budget. If this whets your appetite, then I hope you’ll come be a part of our delicious discussion. So what do you feel like eating?

I'm also pleased that the editorial roster at SDNN includes an East County Editor and a North Inland Editor. If I lived in the East County, I think I'd be hitting that section and ignoring most of the others.

Defining 'Citizen Journalism'

Dan Gillmor produced a definition for 'Citizen Journalism', and Jay Rosen echoed it, along with providing a comprehensive overview of the term. He includes Steve Outing's 2005 article about 11 Layers of Citizen Journalism. If you ask me, the focus on "Citizen Journalism" is misguided.

Who can do journalism besides journalists? Well, I guess mere citizens could give it a try. Outing described layer 2, for example, as "recruit citizen add-on contributions for stories written by professional journalists."

Clay Shirky writes for a blog called Many 2 Many--I think that name is the right approach. When I worked on e-mail systems in 1990, we had to explain the idea of store-and-forward message communication. When we developed one of the more successful groupware packages, Collabra Share, we were very aware that while e-mail allowed one-to-one and one-to-many communication, groupware was many-to-many.

If everyone reads and many of the readers also write, a large archive quickly grows that captures community knowledge and zeitgeist. Some of it is quite ephemeral, some raw and unprocessed. Many-to-many effectively means quantity rather than quality--this devalues individual pieces of information somewhat. All things being equal, would we choose a few really well-produced thought pieces, or would we choose immediate, comprehensive news?

Comprehensive is important. Great coverage of the city council, but nothing about the school board doesn't work. When it comes to community news, you really don't want to leave anything out. But the old news media--metro newspapers, radio, and television--cover small communities sporadically.

Sharing the responsibility for reporting on your community binds you to your neighbors. That's why 'citizen' works for me--good citizens are naturally journalists in the sense that they are willing to report to the community when they observe things of interest. In this sense, we've had "citizen journalists" all along writing Letters to the Editor. But there are many, mostly professional journalists, who use the term to mean "amateur journalists."

"Certain writers, of whom I am one, do not live, think or write on the range of the moment."--Ayn Rand.

"If a writer wrote merely for his time, I would have to break my pen and throw it away."--Victor Hugo.

Citizen journalists are not writing like Rand and Hugo when they observe and report for their community. (They may or may not ever write like Rand or Hugo.) The question, in my mind, is how to define what it is professional journalists do.

Sunday, May 3, 2009

Clay Shirky: Thinking the Unthinkable

Clay Shirky currently has 923 comments on his blog post (3/13/09). I read it as a manifesto, as important for community news sites as Berners-Lee's Semantic Web article. (The unthinkable, if you can't visit Shirky's site right now, is the thought that newspapers don't work anymore, now that we have the web.)

Benkoil: Rebuilding Media

Should community news sites should limit participation (commenting, etc.) to paid subscribers? In Rebuilding Media, Dorian Benkoil quotes Steve Outing: "free readers won’t have their voices heard," but paid subscribers can interact.

Outing then comments on the post to say that he hasn't formed a strong opinion whether charging for the right to interact is a good thing. He references Card's "Ender's Game," so I'll reference Heinlein's Methuselah's Children and the planet where a group mind makes living easy.

Benkoil is an award-winning journalist and editor and Outing writes a column for Editor & Publisher magazine. Outing quotes [Maureen Dowd quoting] Google CEO Eric Schmidt: "Incumbents very seldom invent the future," but I think a lot of smart old-school journalists are doing a good job of figuring out what comes next.

Saturday, May 2, 2009

Replacing newspapers

I'm starting this blog because I'm pretty sure I know what comes after newspapers. I'll put my ideas out there and hope to hear from people who disagree, since I certainly don't know everything.

I put my home town--Los Gatos, California--on the web in early 2006 and spent three years being a journalist--media id, police scanner, the works--so I feel that I understand what it means to cover the news. I also wrote and administered the underlying content management system, which is my real strength--I've been creating software for nearly 30 years.

I'm reading The News Rules of Marketing and PR by David Meerman Scott, but it's a refresher for me. The new rules are that intrusive, interrupt-driven, one-way advertising is dead, and that no one needs "the media" to help them reach an audience.

The web has meant the formation of thousands of virtual communities--bass fishermen around the world can now converse. But, for some reason, most actual, real communities--Wappingers Falls, New York, to pick an example--haven't used the web to full advantage. I'll pick on Wappingers Falls to make my point. The nearly 5,000 residents of the village work and shop in a wider ambit that runs from Fishkill to Poughkeepsie and as far east as Hopewell Junction. But Wappingers is a community, with various services and a vibrant downtown. Google Wappingers and you get:

Town Government
Wikipedia entry

The Wikipedia entry is first. The Poughkeepsie Journal covers Wappingers, and I'm sure it does a fine job (a search for Wappingers says "Did you mean Wappinger's?" No.). If a gasoline truck overturns or a man kills-his-family-before-turning-the-gun-on-himself in Wappingers, the Journal will be on top of it.

Fifty years ago, I'll bet Wappingers had a local newspaper. If you saw an ad for Friendly's Ice Cream in the local paper, you knew it wasn't the Friendly's in Beacon or Fishkill--it was the one in Wappingers, out on Route 9. If the Kiwanis club had an event planned for the day before Easter, you learned about it from the paper--maybe even the front page. You got to know the editor and his opinions. You could predict what his editorial might say each week, and you knew which folks would write a letter to the editor to disagree. The guy who owned the garage in town might write a Car Talk column (and might pay the paper to run it), and it was okay if his topic was the importance of a tune-up and he just happened to be running a tune-up special.

The local library kept every issue of the local paper. If you wanted to research an obituary from a few years back, you could. And if you did something noteworthy--got married, or enlisted in the army, your home town paper would print your picture. If you had a visitor staying with you and you needed to find out what time Catholic services would be held; the paper had that, too.

The web holds great promise to provide a lot of that kind of neighborly communication. We can certainly present round-the-clock news with the web, and we can maintain an event calendar. The problem is scope--what artificial intelligence researchers call "the world problem." When you say "Show me all weddings to be held this Saturday," the computer asks, conceptually, "What do you mean...in the world?" That's a lot of weddings. Usually, you want to set more reasonable boundaries--this county, or that village. "How big is your world?" And there's the problem with traditional news sites: the economics of newspapers encourage them to aggregate into big, metropolitan papers. They put that regional paper online and...well, that's a lot of weddings.

Put another way, a list of all 487 weddings to be held in the county this Saturday doesn't make me feel connected with my village. But--and this may or may not be a revelation to you--a picture of the eight young people tying the knot this weekend within walking distance of my house builds a sense of community. Now I've got something to talk about waiting in line at the post office--doesn't the Willis boy look too young to get married? Wasn't the Samuels kid the Valedictorian at the high school last year? Some call this "hyperlocal," but that term is already trite.

When you search the Poughkeepsie Journal for Wappingers, you land on localsearch.poughkeepsiejournal.com/. There are plenty of "hometown" and "smalltown" and other sobriquets on the web trying to create a single site that can pretend to be focused on your home town. Merchant Circle. Comedian: Where're you from? (without waiting) Oh? Me, too.

Newspapers and web software companies want one operation that reaches hundreds of thousands. Write once, read many. That's nice for them, but it doesn't put the web to work for individual communities. That's one of the "new rules" that David Scott writes about--the media companies don't get to decide how we do it.