Slashdot is powered by your submissions, so send in your scoop

 



Forgot your password?
typodupeerror
×
Advertising Botnet Facebook Youtube

How Much Internet Traffic Is Fake? Turns Out, a Lot of It, Actually. (nymag.com) 130

Long-time Slashdot reader AmiMoJo shared this article from New York magazine: In late November, the Justice Department unsealed indictments against eight people accused of fleecing advertisers of $36 million in two of the largest digital ad-fraud operations ever uncovered... Hucksters infected 1.7 million computers with malware that remotely directed traffic to "spoofed" websites.... [B]ots "faked clicks, mouse movements, and social network login information to masquerade as engaged human consumers." Some were sent to browse the internet to gather tracking cookies from other websites, just as a human visitor would have done through regular behavior. Fake people with fake cookies and fake social-media accounts, fake-moving their fake cursors, fake-clicking on fake websites -- the fraudsters had essentially created a simulacrum of the internet, where the only real things were the ads.

How much of the internet is fake? Studies generally suggest that, year after year, less than 60 percent of web traffic is human; some years, according to some researchers, a healthy majority of it is bot. For a period of time in 2013, the Times reported this year, a full half of YouTube traffic was "bots masquerading as people," a portion so high that employees feared an inflection point after which YouTube's systems for detecting fraudulent traffic would begin to regard bot traffic as real and human traffic as fake. They called this hypothetical event "the Inversion...."

[N]ot even Facebook, the world's greatest data-gathering organization, seems able to produce genuine figures. In October, small advertisers filed suit against the social-media giant, accusing it of covering up, for a year, its significant overstatements of the time users spent watching videos on the platform (by 60 to 80âpercent, Facebook says; by 150 to 900 percent, the plaintiffs say). According to an exhaustive list at MarketingLand, over the past two years Facebook has admitted to misreporting the reach of posts on Facebook Pages (in two different ways), the rate at which viewers complete ad videos, the average time spent reading its "Instant Articles," the amount of referral traffic from Facebook to external websites, the number of views that videos received via Facebook's mobile site, and the number of video views in Instant Articles.

On Twitter the author also shared a Twitter thread by the Washington Post's director of advertising technology, who shares his own complaints about the ecosystem of online advertising. "The problem isn't just that the internet is full of fakery and bullshit and bad numbers and malfunctioning metrics and bullshitters and fraudsters. The problem is that all the fake shit is layered on top of other fake shit and it just COMPOUNDS itself... Like you get fake users, who get autoplay videos which no one is really watching....

"That's not even counting the entire ad campaigns that are fake where the product is just a bullshit excuse to collect data on you."
This discussion has been archived. No new comments can be posted.

How Much Internet Traffic Is Fake? Turns Out, a Lot of It, Actually.

Comments Filter:
  • Meh (Score:2, Insightful)

    by Anonymous Coward

    Pond scum feeding on pond scum. I'm having a hard time drumming up concern.

  • by Anonymous Coward

    Pretty much any webmaster that views their logs regularly can back up this claim.

    One of my websites is a property listings portal in a small foreign county and I have more bot traffic on there than all my other websites combined for some reason. I have like 3 million submitted fake data info from bots in the last couple years.

    • by ls671 ( 1122017 )

      use mod_security -> detect bad request -> is IP on blacklists and in a country you don't care about -> automatically tarpit IP

      you can manually tarpit IP from other bad requests as well while reviewing mod_security logs, a little scripting makes this process quite fast.

      Saves a lot of bandwidth and saves on log space.

      • The ISP's have many reasons to _host_, if not actively foster false traffic. They are paid on the basis of traffic, and more traffic enhances _their_ market value. It's much like the USPS approach to junk mail, and ISP approaches to spam. The recipients _loathe_ the false traffic. The businesses carrying it try to strike a balance between maximum profit of selling bandwidth, advertising the size of their service, payment for websites, overwhelmed customers abandoning them for services with better filters, a

        • The ISP's have many reasons to _host_, if not actively foster false traffic. They are paid on the basis of traffic, and more traffic enhances _their_ market value.

          Inefficiency or fraud, it doesn't matter. It's not like extra money is paid to an incinerator. Someone is making money. In this case, the biggest beneficiary is Google. Their stock would crash if fake traffic were completely eliminated. Google not only doesn't have an incentive to eliminate fake traffic. They have a huge (i.e., stock bonuses for executives that make the decisions) motivation to at least preserve and potentially enlarge the fake traffic.

  • by Anonymous Coward

    No one likes this idea, but this is the answer

    Everyone is given a ipv6 block. All your traffic from them comes from this block and can be tracked if needed. You can randomly use any IP from this block as you wish (the block follows you around, is setup after you authenticate. The tech doesn't exist yet, but is simple to do, probably a simple USB key and password system for users) Businesses are treated the same as individuals, their servers also use the business block.

    An independent authority knows who the

  • Fake or satire? (Score:3, Interesting)

    by Col. Bloodnok ( 825749 ) on Saturday December 29, 2018 @11:15PM (#57877748)

    Who can tell the difference?

    https://twitter.com/TitaniaMcG... [twitter.com]

    Titania McGrath can.

    • Re:Fake or satire? (Score:4, Informative)

      by AmiMoJo ( 196126 ) on Sunday December 30, 2018 @06:52AM (#57878674) Homepage Journal

      Stuff like that is just a bit of silly fun, the problem is more to do with stuff like the fake outrage industry that posts endless videos about people being "triggered" on YouTube. Parody is great, convincing large numbers of people that there is a culture war going on that is in fact entirely manufactured clickbait for your YouTube channel just creates an army of dangerously delusional useful idiots.

  • If it's labeled AI.

    • by msauve ( 701917 )
      "If it's labeled AI."

      I can call you Betty
      And Betty, when you call me, you can call me Al...
  • XKCD (Score:5, Funny)

    by darkain ( 749283 ) on Saturday December 29, 2018 @11:42PM (#57877842) Homepage

    We're getting closer and closer to that XKCD dream life: https://xkcd.com/810/ [xkcd.com]

    • by tepples ( 727027 ) <tepples.gmail@com> on Sunday December 30, 2018 @01:01AM (#57878036) Homepage Journal

      The moderation mechanism described in xkcd #810 already resembles that in use on various forums and Q&A sites, such as Slashdot and Stack Overflow.

      1. Each newly registered user sees a page of what Stack Overflow calls "review audits". This resembles Slashdot metamoderation: does what the new user sees as constructive align with what established users see as constructive?
      2. Anyone who gets most of the review audits correct has posts placed in "awaiting moderation" state. Only established users can see such a post until at least one established user upvotes the post.
      3. Once a user is firmly in positive reputation/karma, the user's posts skip the "awaiting moderation" state.

      Yet this hasn't led to any artificial intelligence breakthroughs on the part of the spam industry. Instead, I've noticed that spammers on forums.nesdev.com appear to be humans in low-exchange-rate countries. They search for an old post, reword it, start a discussion, and days later edit the post to include off-topic commercial links. A user who isn't paying close attention is unlikely to see this karma whoring for what it is.

  • Facebook (Score:5, Interesting)

    by Dan East ( 318230 ) on Saturday December 29, 2018 @11:45PM (#57877844) Journal

    I'm not sure why the part about Facebook inflating ad display numbers is included here. That was not because of bot activity. The majority of FB traffic is consumed through their mobile apps (95% of it), and you can be sure that is not bot type activity. Facebook has gone to great lengths to prevent scraping of their website, and it is extremely unlikely that the scraping of the site would involve scrolling through a newsfeed so that an ad became visible, began autoplaying and was streamed to a bot.

    Facebook misrepresented the amount of time a user sat watching an ad before they scrolled on past it, plain and simple. More than likely they were counting things like a small portion of the video still being visible on the screen as being "watched". For the difference, even by Facebook's own admission, to be off by 60-80% shows this was a misrepresentation of what it meant for a user to watch a video ad on a very large scale (including in the app on mobile platforms).

    • by msauve ( 701917 )
      "The majority of FB traffic is consumed through their mobile apps (95% of it), and you can be sure that is not bot type activity."

      Depends on what you consider a "bot." I consider most bookface users to be unsentiant bots.
    • The majority of FB traffic is consumed through their mobile apps (95% of it), and you can be sure that is not bot type activity.

      I would argue that the other way - because so MCUH of Facebook traffic is from mobile apps, that is where most of the bot activity is likely to be from.

      All it takes is an Android user logged into Facebook and some background app can have plenty of likes and other things they don't even see happening...

    • by tsa ( 15680 )

      Facebook is also not the world’s greatest data gatherer. Not by a long shot. Google is much much bigger.

    • > The majority of FB traffic is consumed through their mobile apps (95% of it), and you can be sure that is not bot type activity.

      Whatever makes you think that a fraudulent data stream from a botnet or adware could not forge data from a mobile app?

    • by nasch ( 598556 )

      The majority of FB traffic is consumed through their mobile apps (95% of it), and you can be sure that is not bot type activity.

      How can we be sure of that? Automated app interaction is certainly well supported on Android (for testing purposes) and I assume iOS as well. I don't know if it would be worthwhile, but it is certainly feasible.

      it is extremely unlikely that the scraping of the site would involve scrolling through a newsfeed so that an ad became visible, began autoplaying and was streamed to a bot.

      If the purpose of the bot is to trigger ad views, why would that not happen?

  • Sorry? (Score:4, Interesting)

    by markdavis ( 642305 ) on Sunday December 30, 2018 @12:16AM (#57877928)

    It really is quite difficult for me to feel all that sorry about advertisers and ad sellers being upset that their precious data is wrong/overstated/contaminated. The "ad wars" [on users' eyes, ears, cpu, screen space, bandwidth, patience] are so insane now, the anarchist in me is almost happy about it.

    Ooops, another site that wants to shame/annoy/warn/block me because of my ad filters protecting my privacy/sanity/bandwidth/battery/security. Hmmm....

    • There's still a bit of a problem: Decisions are being made based on this stuff.

      The data is contaminated, and then we don't know where it goes or who's using which information to make decisions, but someone is using it to make some decisions or else no one would be paying for it. So what happens when companies market their products, or politicians make policy decisions, based on that contaminated information?

      And don't think they won't. They're all chasing the opinions of a few demographic groups, and the

      • Re: (Score:1, Insightful)

        by Anonymous Coward

        Don't care. Nary a tear shed.
        Fucking parasites.
        Eventually they'll catch on, but for now, One of life's little pleasures is to make it tough as possible for those bottomfeeders to make a buck.

      • So what happens when companies market their products, or politicians make policy decisions, based on that contaminated information?

        The companies who make decisions based on bad data will be more likely to fail. As for politicians.....can they really be worse than now?

        • > The companies who make decisions based on bad data will be more likely to fail

          The companies providing the data, selling it, have very little reason to care. They tend to be small, fraudulent operations that _do not care_ what happens as long as they get the first or even second check. Simply because a client fails is not a compelling reason for the next fraudulent company not to avoid the business.

          I'm afraid that the "fraud will be forced out of the market by competition" optimism is one of the notable

          • I'm afraid that the "fraud will be forced out of the market by competition" optimism is one of the notable errors of Libertarian politics.

            Who said that? I said that companies who fall for fraud will go out of business. Learn to read.

        • They'll be more likely to fail, all else being equal. Unfortunately, we don't live in a world where things are fair and the best company making the best products win out in the end. Entrenched players will win out, and they'll force bad products on us.
          • They'll force bad products on us? What I shame. I don't know how I'll endure the burden. Bad products oh no.
    • by AmiMoJo ( 196126 )

      The advertisers are probably quite happy about this. It means their crap is overwhelming the real stuff.

      It's got the point where everything, even the real stuff, seems fake because the fake imitations are getting better. In fact there is a feedback loop, where real people see the fake people being effective and adopt their methods.

      Slashdot is not immune. Sometimes half the posts on a story seem fake, like they were written by bots or copy/paste shills. They probably aren't though, they are mostly real peopl

  • by AHuxley ( 892839 ) on Sunday December 30, 2018 @12:21AM (#57877940) Journal
    Ads need the data on users.
    The brand that takes out the ads pays for it all.

    With more advanced browsers getting the ad to display, work, track is getting more complex.

    What can the ad brands do?
    Make the browsers and OS more ad friendly.
    Make users have to view ads in some nations?
    Make the browser show an ad?
  • by Dracos ( 107777 ) on Sunday December 30, 2018 @12:44AM (#57877992)

    Advertising is black magic and always has been. The internet has shown ad peddlers that they don't actually know all that much about advertising, but now they can measure every aspect of their ignorance, hubris, and ineffectiveness... and it freaks them out.

    • by AmiMoJo ( 196126 )

      They know very well what works, it's just that the more effective the technique the more push-back they get for using it.

      At the extreme end you have stuff which is simply illegal, then you have stuff which is banned on TV and the more popular web sites. Stuff that browsers block by default (pop-ups and Javascript abuse). Paying influencers to mislead their viewers.

      All the time they are looking for new ways to get away with what they know works, and other people are looking for ways to stop them.

    • by kurkosdr ( 2378710 ) on Sunday December 30, 2018 @09:34AM (#57878964)
      Then there is the issue of "personalized ads" which ARE relevant but don't generate a purchase. For example, Google serving you lawnmower ads after you 've just bought the only lawnmower you will need for the next 10 years, or ads for holiday packages in your hometown you are never going to buy because your parents have a spare room to share, or ads by resellers you are never going to buy from because you buy from Amazon. But advertisers pay for those ads because users click on them accidentally or for kicks. Personalized ads are scams all the way down.
      • by AHuxley ( 892839 ) on Sunday December 30, 2018 @09:46AM (#57878998) Journal
        The dog the digital assistants microphone is listen for and then the dog food ads start.
      • by lenski ( 96498 )

        This.

        1) Lots of ads for computers and laptops after buying my first one in five years, and won't be buying another until this one is also no longer supported.

        2) Tons of ads for cars, despite my wife and me buying two cars in the last year, knowing that we may not be buying another (we're both over 60 and our cars last for >= 15 years...)

        3) I am somewhat badly addicted to inexpensive single-board computers, SSDs and D-I-Y electronic supplies and purchase these things regularly. I see surprisingly few ads

      • by Deaddy ( 1090107 )
        I always thought that this was just some dumb usage of statistic tools, like sorting people by the correlation of the items they already bought with the items you try to sell.
        Clearly, most people who bought a lawnmower have also bought a lawnmower, so you should show them ads of your lawnmower.
  • Brave Browser (Score:1, Informative)

    by Anonymous Coward

    Please look into the Brave browser which blocks advertisements and trackers by default, it is also implementing a system whereby people get paid to opt-in to watching advertisements. Also, the content creator will get a larger percentage of the advertising revenue as it is not siphoned off by middlemen. You can also tip people directly through the browser.

  • So this must be the fake news about all of the fakery that data mines and advertisers splash around like pigs in poop. Oh, horrors! liars and parasites are everywhere.

  • This is good (Score:5, Insightful)

    by Solandri ( 704621 ) on Sunday December 30, 2018 @02:02AM (#57878188)
    I've always maintained that the way to beat the panopticon companies isn't with ad blockers and privacy legislation. It's to dilute the value of the data they collect by inserting so much fake data that they can no longer sufficiently distinguish real people from the bots.

    There's an apocryphal story that after the end of the Cold War, a bunch of the CIA and KGB got together for drinks. The CIA spooks lamented that theirs had been the harder job. The Soviet Union was such a closed society and had so many restrictions on travel that it was virtually impossible for the CIA to get a spy in there, whereas all the KGB had to do was drive to a town next to a military base and mingle with staff from the base eating lunch there. The KGB spooks disagreed and claimed that theirs had been the harder job. The U.S. produced so much information that it was virtually impossible for them to separate out fact from fiction. If the National Enquirer ran a story about the military working on a, or some conspiracy theorist reported the military was controlling their brain waves with weather balloons, they had to devote resources to figure out if the stories were real or fake.
    • To get rid of the panopticon, we need to find some other source of revenue. At the moment there is no way to guarantee that if you pay for services you will no longer be tracked, so we are in a sort of latch-up.

  • Shitty metrics are better than no metrics at all. That's why online advertising will continue eat TV and newspaper ad revenues.

    • Not if you have to pay dearly for those shitty metrics in the form of less eyeballs (potential customers) or higher advertiser's cut from sales. There is no proof personalized ads (based on those metrics) even work, let alone that they are more efficient than the "spray wide" advertisements that TV and newspapers offer.
    • BTW you need to distinguish between "spray wide" online ads (similar yo TV ads) and personalized ads. Not every online ads has metrics behind it, abd not every online ad pays per-click.
      • by melted ( 227442 )

        I worked in ads at Google. _Every_ ad served by Google has metrics on it. It might not be clicks (brand advertisers often don't care about that, they just want to saturate the page), but there are metrics. I.e. whether it was in the viewport of the browser, etc. TV and newspaper ads don't have any metrics at all.

        • TV ads have metrics too (provided by Nielsen). In that regard, Google's saturate/spray wide ads are not that different from TV ads. My point was about "personalized" ads, and whether the cost of "personalized" metrics is worth the cost (it's not).
  • by Anonymous Coward

    We created machines to get rid of the most mechanical and tedious activities. I will gladly leave it to robots to look and click at ads on my behalf. Let's also teach robots to do the "I'm not a robot" routine, so we humans become free to do meaningful stuff instead.

  • If a few people are able to game the advertisers and make a cool million, I am fine with that. The marketing corporations (and even large corporations in general) always look for ways to fuck over the little guy. If the little guy gets a little revenge, I have to give them 3 cheers.
  • > Studies generally suggest that, year after year, less than 60 percent of web traffic is human; some years,
    > according to some researchers, a healthy majority of it is bot.

    I frequently visit Web pages that are bot-generated. Among them are price quotes for stocks, daily weather data (rainfall, temperature, wind speed, etc), and what checks have cleared my checking account. These are not human-generated.

    The problem is with bots that visit Web sites. Even there, not all bots are bad. After all, wit

"Experience has proved that some people indeed know everything." -- Russell Baker

Working...