Slashdot Log In
MS Research Automates Search Engine Spam Hunt
Posted by
Zonk
on Thu Jul 13, 2006 02:35 PM
from the i'm-all-for-less-junk dept.
from the i'm-all-for-less-junk dept.
Barbie Dollar writes "Researchers at Microsoft are working on an ambitious new project to hunt down and neutralize large-scale search engine spammers. The project, called Strider Search Defender, automates the discovery of search spammers through non-content analysis. The project integrates technology from two previous Microsoft Research prototypes (Strider HoneyMonkey and Strider URL Tracer) and promises a new approach to removing junk results from search engine queries."
Related Stories
[+]
Microsoft 'URL Tracer' Hunts Typosquatters 124 comments
TonioSop writes "Microsoft Research has released a new tool to help pinpoint large-scale typosquatters that are known to be gaming pay-per-click domain parking services. The lightweight prototype, called Strider URL Tracer, builds on the work within Microsoft's Cybersecurity and Systems Management group to keep tabs on a sophisticated typosquatting scheme that uses multilayer URL redirection to make money from Google's AdSense for domains program. "
[+]
Microsofts "Honeymonkey" Project 320 comments
g0bshiTe writes "Ever hear the saying, 'given enough time a room full of monkeys could type out Shakespeare'? Well Microsoft seems to be taking this saying to heart, and taking a cue from the Honeynet project, they have created what they have dubbed 'honeymonkeys.' Security Focus has an article which describes this honeymonkey network, which is little more than a network of virtual Windows XP boxes in various patch states. These boxes are setup to crawl the seedier side of the web in search of vulnerabilities not bieng reported, and are being actively exploited in an attempt to further secure their product. Sounds like a decent idea from the Redmond crew to me."
Offsite: Washington Post Coverage
This discussion has been archived.
No new comments can be posted.
MS Research Automates Search Engine Spam Hunt
|
Log In/Create an Account
| Top
| 68 comments
| Search Discussion
The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
This just in..... (Score:5, Funny)
More at 11.
Still seems reactive (Score:3, Insightful)
It's not an either-or situation (Score:5, Insightful)
These are not mutually exclusive goals. If you take away any incentive for spamalizing content (meaning, not only does it not boost your search placement, it penalizes you), then much of the pressure to run botnets and crack servers goes away.
Ultimately helps AdWords and Google...maybe (Score:1)
A side effect is better search results, which would increase use of Google again. Where is MSN Search in all of this...I don't know. But fewer of those crap sites, the better.
Not before time. (Score:2, Interesting)
(Last Journal: Tuesday December 19 2006, @05:12PM)
At the same time, I'm all for search engines blacklisting people who game the system, parked domains, crap aggredator pages, etc. It's all about building a better mousetrap.
But I thought.. (Score:4, Funny)
Cover-up (Score:2)
Go Microsoft! (Score:5, Insightful)
(http://www.insidebet.com/)
Hopefully, Microsoft's approach will give some effect and push other operators to work harder on preventing the web spam.
Amusingly, you're most likely getting affected only if you're searching for penis pumps, pornographic content and gambling.
Human Powered? (Score:4, Interesting)
(http://www.pembo13.com/)
if it works (Score:1, Flamebait)
Of course, with their track record of Neat Ideas vs. Actual Products, (WinFS, etc.) I'm not holding my breath.
I am, however, wishing them luck.
Non-content based comment spam prevention (Score:4, Informative)
(http://www.ioerror.us/ | Last Journal: Sunday May 22 2005, @06:28AM)
Good. (Score:3, Interesting)
I remember the first time I saw google - I was blown away: "Wow. These results are exactly the web pages I was looking for!" But that's no longer the case when you search in google. They've really fallen behind in being able to separate out (or, as they say, "search for") the pages I want from the junk.
I hope google will win this war, but maybe microsoft chucking some money at the problem will help light a fire under google to get this fixed before someone else does it better. If searching at google no longer brings me relevant results better than any other source, I'm gonna start looking for somewhere else to search. Just like I did when I switched to google from yahoo back in the twentieth century.
we can only hope (Score:2)
yay for MS research!
Experimental... (Score:1)
How about filtering domain kiters (Score:1)
(http://spion.ws/)
Good stuff, but not a big deal in Release 1 (Score:1)
(http://www.monash.com/blogs.html)
But it does nothing to address the vast majority of the pages that contaminate search engine results. I'm referring to automatically generated pages that look like good pages and hence rank well in search engines, but really have little except links and perhaps some public domain info. E.g., there could be one each for every resort hotel in Mexico. The search engine result turns up a summary that makes it look like there are "reviews" there. But either the reviews section is empty, or else they reproduce something that's available on dozens of other sites as well. In one case, apparently, a single such site had 4 billion "different" pages. I'm not making that number up.
More sophisticated kinds of link-network analysis will be needed before those bite the dust.
hmm... (Score:2)
Re:Will they share? (Score:1)
(Last Journal: Friday August 31, @07:08PM)
Re:Will they share? (Score:4, Insightful)
Second, what business rationale is there to give away a competitive advantage (after spending millions to get it) in the very competitive search market, where, by the way, Microsoft is not the market leader?
Why should they? You shouldn't want them to. (Score:3, Insightful)
Their purpose is to make their own search engine more effective for users, thus generating more traffic for them. A nice side effect would be that Yahoo and Google, etc., would feel more pressure to integrate similar technologies into their own engines. As usual, competition produces the best results.
Re:Will they share? (Score:2)
(Last Journal: Sunday July 01, @08:03AM)
Probably about the same odds as Google sending Yahoo and MSN detailed specs of thier search algorithums or the 2008 Republican presidential candidate going out and campaigning for the Democratic candidate or the US shipping Iran a fully functional atomic weapon production facility
Re:Will they share? (Score:2)
(Last Journal: Thursday October 24 2002, @10:23AM)
Long Live Competition!
This is how these markets are supposed to work. Let the smartest/best company with the best product find success and enjoy the fiscal rewards.
If MSN can out-do Google, I'd move my search traffic there in a heartbeat. Of course Google won't let that happen, WE THE -CONSUMER- WINS! This isn't communism, no reason that a company should have to give their competition their work if they put the effort into solving a problem/finding a solution.
Re:Will they share? (Score:1)
Re:and shut down? (Score:1, Insightful)
For reference:
(a) What does shutting down Windows boxes have to do with searching for search-engine spam?
(b) How does search-engine spam "find" you?
Could it possibly be that you saw the word "spam", and your brain shut off while you wrote a nonsensical post that might just have made sense in the context of an article about email-spam zombie computers, but is totally irrelevant in the context of search-engine spam?
Re:Strider Hiryu (Score:2)
(http://www.hwacha.net/)
Not so, I'm afraid -- he will never leave Eurasia alive.