For the data lover - Yahoo Mail Visualization Project
Just got word about this.
With the roll out of the new Yahoo! Mail last May, Yahoo! recently launched more advanced anti-phishing defenses and enhanced spam protection. In a given month, Yahoo! blocks nearly 550 billion spam messages from hitting your inboxes (that’s approximately 1,800 emails for every Yahoo! Mail user).
This has been made possible by its open source cloud framework, Hadoop, which keeps mailboxes safe by filtering out spam and re-route emails for the 300 million mail users across the globe. As a quick example, the anti-spam protection aggregates anonymous data from the billions of emails sent and received each day and helps reduce spam reports by 65%. Yahoo!’s technology analyzes all this anonymous data (with the help of Hadoop) and identifies spam patterns so they can then use algorithms to predict future email patterns that will differentiate “good” and “bad” senders.
You can view the visualization at http://visualize.yahoo.com/ to discover how the “brain” behind Yahoo! processes the amount of data to protect users from phishing and spamming. On the left hand side, you will also see a tab called “Trending Keywords”. Clicking on this tab will lead you to a stream of keywords “Good” and “Spam”. You can also click on the chart to find out some interesting facts and stats about Yahoo!’s email crunching technology.
· There are 76, 584, 989 Yahoo! Mail users in Asia representing 23% of all network activity
· The Yahoo! Mail Network is blocking 24,300 spam mails per second in Asia
· For each good email that Yahoo! Mail Network delivers, four spam mails are blocked
· There are 1,000,000,000,000,000,000,000 variations of the word Viagra
Hope you will enjoy this Yahoo! Mail visualization project as much as I am now.
1, Nov 2011 • via Janette Toral • report abuse
