Jan 22

A great many words have been written on the subject of e-mail spam. Effort has been poured into all kinds of technological measures against it. In my view, many of these efforts have been a waste of time, because they have failed to address the fundamental problem of spam.

To explain my thinking, I’ll start with some basic statements:

  1. Your attention is a valuable resource. If you doubt this, you need only look at the amount of money spent on advertising in an attempt to acquire your attention.

  2. Therefore, your inbox is a valuable resource. Many people, perhaps most people, now check e-mail multiple times a day. In fact, according to some surveys college students spend more time on the Internet than watching TV. They check their e-mail inbox more than they look at ad breaks.

  3. SMTP e-mail allows anyone to send mail. There’s no centralized registration required in SMTP; there’s no control over the growth of the SMTP e-mail network. While some servers restrict which SMTP clients may connect to them, there’s essentially no control over who sends mail, as it’s always possible to open a new web e-mail account, buy a new ISP dial-up account, or whatever.

  4. SMTP e-mail is free for the sender. Sure, many people pay for their Internet access; but once you have an Internet connection, sending e-mail basically doesn’t cost you anything—it has marginal cost.

Now, let me re-cast those four statements:

We have unrestricted access for anyone in the world to use arbitrary amounts of a valuable resource.

Can you think of any case where there has been a system like that, and it has worked? I can’t. The canonical example is the tragedy of the commons, but there are plenty of others, including the Cambridge ‘Green Bike’ scheme and the overfishing of cod.

In order to avoid a “tragedy of the commons” situation, we need to alter the situation so that one of the statements above is no longer true. Let’s go through them again and consider our options.

Continue reading »

Aug 26

We finally got to watching A.i.: Artificial Intelligence. We’re probably the last people alive who haven’t seen it, so I trust you will allow me the indulgence of a few spoilers in the course of my criticism.

Let’s start with the big issue: the movie has the most egregious deus ex machina ending I have seen in years of movie-watching. It’s so hideous that it could be used as the canonical example when educating future generations of movie makers in what not to do. Apparently the ending was part of the Kubrick script for the movie, but Spielberg gave it that final saccharine twist. I’d like to think that Kubrick would have seen sense and removed the whole thing, like he did the original pie fight ending to Dr Strangelove.

A.i. is supposedly some kind of tribute or homage to Kubrick…but of course, the problem is it’s hard to pick two directors whose styles are as dissimilar as Kubrick and Spielberg, unless you start talking about (say) Errol Morris and John Waters.

Visually, there’s really nothing Kubrick to see. The fight with the bike gangs is a frenetic MTV cut-up, rather than a sequence of smooth menacing tracking shots. Even when David finds rows of boxed Davids, and Spielberg finally tries to use a Kubrick-style tracking shot for effect, he keeps the camera too high and the result is merely tedious. In fact, it brought to mind the groundbreaking camera work of Ed Wood, as lovingly recreated by Tim Burton.

Perhaps the worst thing, though, is that Spielberg just can’t seem to avoid the temptation to try and make every single story into a kid-friendly movie. Thus a male robot prostitute suddenly takes David to visit the cartoon head of Albert Einstein, voiced by Robin Williams, which we’re told is conveniently situated in the middle of the biggest red light district on earth. No, that’s not the noise of Stanley Kubrick spinning in his grave, it’s just the whirling pulleys as my suspended disbelief comes crashing to the ground.

In the original script, the mother’s an alcoholic, and the robot kid inadvertently feeds her problem when he keeps making her Bloody Marys just the way she likes them, in a futile attempt to get her to love him. Yeah, that would have worked. What doesn’t work is making mom a nice mug of coffee. Not even if you whirl the coffee containers around in an inexplicable fashion in the middle of the shot. But problem drinking is an Adult Situation, so we can’t have that in a Spielberg movie.

Yes, it’s a fairy tale, but I’m old enough to remember that fairy tales used to have wicked witches and evil monsters in. C’mon, Mr Spielberg, I know you can do better.

Dec 08

A research proposal.

Introduction

There are a great many inexact and colloquial units of measurement used in everyday life by native English speakers. Often they are used in situations where their meaning can only be metaphorical—“a handful of people turned up”, “I have a truckload of work to do next week”, “I was stuck in traffic for an eternity”.

This research project would aim to answer a number of fundamental questions about such inexact measurements:

  1. Are the relative sizes of the measurements universally understood between native English speakers? For example, does everyone have roughly the same idea of how much “a hell of a lot” is, compared to (say) “an obscene amount”?

  2. Do they have a well-ordered ranking? That is, if unit A is generally deemed larger than unit B, and unit B is larger than unit C, is it guaranteed that unit A is generally felt to be larger than unit C?

  3. Is there a definite deep semantic content to inexact quantities, such that they can be understood without conscious reasoning? Is this what gives them their literary expressive power?

Methodology

The project would be divided into four phases, outlined below.

PHASE ONE

The initial phase would consist of collecting an extensive list of inexact or colloquial quantitative measures. The principal criterion for inclusion on the list would be that the measurement in question is ill-defined; hence the list could include “miles”, in the sense of “the bus stop is bloody miles away”.

This initial research phase would probably take several months.

PHASE TWO

Next, the units would be categorized according to their dimensionality. Unlike in exact unit systems, colloquial units can have multiple values for their dimension. For example, a “handful” can be either a dimensionless quantity (“a handful of people showed up”), a unit of currency (“a handful of spare change in my pocket”), a unit of volume (“a handful of rice to throw at the bride and groom”), or a unit of weight (“a handful of potato chips”).

This phase would also be quite time-consuming; it would take a bunch of months. During this time, software development for phase three would begin.

PHASE THREE

A group of randomly chosen test subjects, all native English speakers, would be asked to answer computer-administered questions, presented via the web in the following form:

Unit A is less than


is greater than

Unit B

The central choice would be made using either a drop-down box or a radio button.

The subjects would thus be presented with single pairs of units in isolation, chosen from the list, and asked to indicate their immediate unthinking ranking of the pair of items. Only inexact units with compatible dimensionality would be chosen for comparison.

This method is superior to the obvious approach of simply asking subjects to rank units in a list for two reasons. Firstly, it allows us to investigate the second question above (whether the units are well-ordered), without assuming an answer. Secondly, it allows the subjects to record their instinctive rather than reasoned response, and makes the questioning process less of an intellectual process.

To put it another way: Most people would have trouble ranking a set of ten approximate units they were given in a list. However, almost anyone can say which of “a heck of a lot” and “a shitload” is the largest unit, without having to think very hard at all. Remember, the intent here is to record the deep semantics of the units, the meaning we are not conscious of.

PHASE FOUR

Statistical analysis would be performed on the collected data, and appropriate conclusions drawn.

Followup Work

There is clearly potential for further research along these lines, once the initial work has been done. The effect of modifying adjectives on the rankings of inexact measures could be evaluated, or the adjectives themselves possibly ranked in power—if “forever” is shorter than “an eternity”, is “fucking forever” still shorter than “a bleeding eternity”?

Adjectives and modifying phrases could also be measured, using similar methodology. This would allow us to answer questions such as: is ”complete and utter fuckwit of the first order” more or less insulting than “24 karat solid gold fuckwit”?

Budget

Funding for this project could be modest. It wouldn’t need screaming fast computer equipment or cavernous laboratory space. Most of the budget for the early phases would be spent on aids to dissociative cogitation, probably consumed in a public house or similar venue.

Value Of The Research

Beyond intellectual curiosity, this research has the potential to yield a great steaming crapload of valuable data. Thesaurus compilers could include numerical indications of the size of inexact quantitative measures, allowing people to choose (say) a volumetric metaphor of appropriate capacity when describing how much they drank the previous night. Dictionaries and textbooks could carry tables of inexact units ranked in order, to assist non-native speakers in learning the “deep meaning” quantity indicated by each phrase. The information might also be trouser-wettingly valuable to researchers in natural language comprehension in the fields of artificial intelligence and computational linguistics.