Track your comments!
[x]


When you register, comments on your articles and replies to your comments appear here. Register Now!

Sign in to your account
[x]

Not a Scientific Blogging member yet?

Register Now for a Free Scientificblogging.com Account

  • Customize your profile with pictures, banner, a blogroll and more.
  • Leave comments on articles, add other members to your friend lists, chat with people on the site.
  • Write blog posts that can be seen by hundreds of thousands of readers.

It's free and it only takes a minute!

Already a Scientific Blogging member?

Sign In Now

Fake Banner
By Dan Gillick | April 14th 2009 12:54 PM | 5 comments | Print | E-mail | Track Comments
.

More Random Walk articles

All

About Dan Gillick

I'm a computer science graduate student at UC Berkeley. I'm not sure where this walk will take me, but that's part of the point of walks.

www.dgillick.com... Full Bio

Over the past few weeks, more spam has been slipping through to my Gmail inbox. Consider: The oddly named Jule Cuc enthusiastically offers college Dip1omas [sic]: “At your own pace! No examination! No study! No classes!” Kieth l. Black is cautiously optimistic with his pitch: “Please your spouse more often than not,” while Marianne Daniels is mysteriously quaint: “Uplift your darling couch adventures”. Ironically, Bishop appears to be selling Viagra, though his email contains an esoteric reference to the diary of Samuel Pepys, in which he suggests that “meer sauce, or a pickle” are reasonable antidotes for souring Venison. A certain Elden Tyrimoh is miffed that I “again misplaced his number”.

What is all this nonsense? Who is sending all this email and can it really be profitable? Are things getting better or worse? What is Conficker, the so-called “Paris Hilton of Botnets”? And what exactly is Samuel Pepys doing in an ad for Canadian pharmaceuticals?

I’m going to try to answer all these questions. But it’s too much writing for one night so this will be the first part of a spam mini-series. Samuel Pepys (pronounced like “peeps”, I’m told, will make repeated appearances, I hope).

The story starts with botnets. What is a botnet? It’s a network of bots. What is a bot? It’s a computer, connected at least periodically to the Internet, that has been infected with the latest incarnation of computer virus. What makes a network of bots such an important innovation? Each infected computer broadcasts its availability so that a centralized server can distribute tasks—like sending email spam—all over the world.
Finally, "botnet" searches outnumber searches for "Samuel Pepys"

A botnet is a hugely parallel supercomputer. If they so desired, botnet operators could use this processing power to, say, simulate exquisite climate models, or defeat Deep Blue (which defeated Gary Kasparov) in chess. But the botnet’s most valuable attribute is its distributed bandwidth. This is how to send a billion emails in a day or coordinate a denial of service attack to take down a website—“the digital equivalent of filling a fishtank with a firehose”.

Take the Storm botnet as an example. Some say it controls over a million computers. Some security experts suspect that Storm is run by the Russian Business Network, a group of renegade Russian (surprise!) computer scientists also blamed for the denial of service attacks that crashed Georgian President Mikhail Saakashvili’s website along with the National Bank of Georgia just before the 2008 South Ossetia War. Storm is also responsible for a considerable fraction of the world’s email spam.

How much money is in spam? IBM Security Systems expert Joshua Corman is widely quoted as claiming    that spam sent by Storm is generating “millions and millions of dollars every day”. And, does Storm send its own spam or does it rent its bandwidth to specialized spammers—a disconcerting possibility, suggesting a maturing underground economy.

A team of researchers in Berkeley and San Diego set out to investigate. First they figured out how Storm works. It’s complicated but here’s the gist: infected computers are classified as either workers or proxies based in part on whether they have a firewall installed. Workers do the actual spamming, receiving instructions from proxies that coordinate the jobs. Proxies take orders from a master server.

The researchers set up eight proxies, simply by installing the software (the didn’t have to go looking; it was in their email). This way, workers contacted them, requesting tasks. But just as the instructions were shipping out, they modified the text of the spam messages, replacing the link with their own—a mock pharmaceutical website complete with Viagra, “Viagra professional”, a shopping cart, etc. (though they never collected any money or personal information about their customers).

An imitation pharmacy website. Why get Viagra when you can go Professional?
In this way, they tracked half a billion spam messages—a Viagra campaign and a botnet expansion campaign with fake greeting cards (complete with dancing banana)—over 26 days. Here is the big result: 350 million emails resulted in 28 purchases. Average purchase size was $100. Estimating they tracked 1.5% of the full network and assuming continuous active spamming, this gives about $3.5 million in Viagra-inspired revenue per year. Much less than millions in a day, but not too shabby either.

Is this good or bad? Well, it’s annoying. If nobody ever bought anything as a result of email spam, it would stop. And 28 in 350 million—1 in 12.5 million—is awfully close to nobody. It’s good in the sense that at $3.5 million per year, it’s unlikely that Storm is running an affiliate program—renting their services to third party spammers—since profit margins are so slim.

 red flags mark pharmacy purchases; yellow flags mark new infections
The bad news is two-fold. First, the greeting card expansion campaign was considerably more successful. One in ten people that visited the greeting card page downloaded and installed the software (no purchase necessary). If people are gullible enough to install Storm themselves, it’s very hard to imagine how to slow down the botnets. Second, so much mystery remains. Perhaps this particular spam campaign was just a side-project, an experiment much less fruitful than the botnet’s ordinary activities.

Further analysis shows national differences. While the U.S. received over 10 times more spam than Japan (the 2nd most spammed country), it is the least efficient of targets: the U.S. response rate was lowest, while India, Pakistan, and Bulgaria showed the highest response rates. The researchers believe these differences are due more to spam-filtering technology and anti-spam awareness than to varied interest in Viagra.

Ok. A few statistics to close out this installment. Spam doubled in 2007 to 120 billion messages sent daily. The rate seems to have doubled again in 2008. Microsoft recently estimated that over 97% of all email sent is spam, and that nearly 1% of all computers are infected by some botnet. Bill Gates, it is said, receives around 10000 spam emails every day.

Comments

Fascinating article neat stats!

The $3.5 million estimate is just revenue as well; throw in operating costs & it looks even punier.

Tangentially, spam's namesake seems to be doing quite well in these times: http://www.cbsnews.com/video/watch/?id=4661999n%3fsource=search_video

Dan Gillick's picture
I forgot to add a citation for the spam research!

Spamalytics: An Empirical Analysis of Spam Marketing Conversion


C. Kanich, C. Kreibich, K. Levchenko, B. Enright, G. Voelker, V. Paxson, and S. Savage.
15th ACM Conference on Computer and Communications Security (CCS), 27-31 October 2008, Alexandria, VA.

logicman's picture
Dan:  thanks for the interesting article.  I look forward to the next installment. I don't think that spam has gone far over the 80% mark.  There are committed people who devote much of their free time to having spam-promoted and fraudulent sites taken down.

The botnet software is in fact resold as variants by bot kings to underlings.  It has been reported that the software has a backdoor by means of which the kings rip off the underlings from time to time.  The big money is made from the multiple 'brands' marketed, and the scams.  There is more money to be made from credit-card cramming and theft than from sales.  Currently, in my opinion, the biggest money is being made from scareware and from increasingly genuine-looking emails purporting to be from financial institutions.

Your readers  may find Spamwiki worth a read, and may also be interested in McAfee's S.P.A.M. experiment.

dorigo's picture
Hello Dan,

I avidly read your article.

Working with computer networks and distributed computing for high-energy physics research, I know how powerful this technology has become. The CMS experiment of which I am part at the Large Hadron Collider relies on grid- a large network of computers - to perform the huge amount of computing necessary to understand the physics of our complex apparata. However, spam appears different, in that it uses bandwidth far more than other resources -especially CPU. Or am I getting the wrong message from your piece ?

If spam relies on bandwidth, and if it makes up for 97% of all mails, I wonder if one could devise strategies to hit the cancer by shooting at the bulk. Imagine traffic between pairs of addresses not pre-confirmed both ways (through active user input) being taken down 59 minutes every hour. We would still get messages from non-confirmed parties, at a slower cadence; but we would kill 98% of the spam. Is this feasible ?

Cheers,
T.

Dan Gillick's picture
Hi Tommaso- you've got it right. I think the botnet operators don't know how to use the processing power they control to make money, otherwise they would... perhaps an LHC - RBN (Russian Business Network) collaboration? And it works out well because if they used lots of our CPU cycles, people would start to notice--part of the genius of the botnet is that most people don't know it's running on their machine and they don't care.

As for stopping spam, well that's another can of worms. There have been a variety of clever strategies proposed--what you're suggesting sounds similar--but typically these proposals require universal cooperation and that's not really the Internet's style. Instead, spam filters have gotten pretty good, thanks in part to mountains of training data.

Dan

Add a comment

The content of this field is kept private and will not be shown publicly.
  • Allowed HTML tags: <sup> <sub> <a> <em> <strong> <center> <cite> <code> <TH><ul> <ol> <li> <dl> <dt> <dd> <img> <br> <p> <blockquote> <strike> <object> <param> <embed> <del> <pre> <b> <i> <table> <tbody> <div> <tr> <td> <h1> <h2> <h3> <h4> <h5> <h6> <hr> <iframe>
  • Lines and paragraphs break automatically.
  • Web page addresses and e-mail addresses turn into links automatically.
CAPTCHA
If you register, you will never be bothered to prove you are human again. And you get a real editor toolbar to use instead of this HTML thing that wards off spam bots.