AI Hallucinated a Dependency. So a Cybersecurity Researcher Built It as Proof-of-Concept Malware (theregister.com)
"Several big businesses have published source code that incorporates a software package previously hallucinated by generative AI," the Register reported Thursday
"Not only that but someone, having spotted this reoccurring hallucination, had turned that made-up dependency into a real one, which was subsequently downloaded and installed thousands of times by developers as a result of the AI's bad advice, we've learned." If the package was laced with actual malware, rather than being a benign test, the results could have been disastrous.
According to Bar Lanyado, security researcher at Lasso Security, one of the businesses fooled by AI into incorporating the package is Alibaba, which at the time of writing still includes a pip command to download the Python package huggingface-cli in its GraphTranslator installation instructions. There is a legit huggingface-cli, installed using pip install -U "huggingface_hub[cli]". But the huggingface-cli distributed via the Python Package Index (PyPI) and required by Alibaba's GraphTranslator — installed using pip install huggingface-cli — is fake, imagined by AI and turned real by Lanyado as an experiment.
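The distinction is easy to check against the index itself. As a minimal sketch (assuming PyPI's public JSON API, which returns project metadata for registered names and a 404 for unregistered ones), one can ask what actually stands behind each name before trusting a pip command:

    import json
    import urllib.request
    from urllib.error import HTTPError

    def pypi_info(name):
        """Return PyPI metadata for a project name, or None if no such project."""
        try:
            with urllib.request.urlopen(f"https://pypi.org/pypi/{name}/json") as resp:
                return json.load(resp)
        except HTTPError as err:
            if err.code == 404:
                return None
            raise

    for name in ("huggingface_hub", "huggingface-cli"):
        info = pypi_info(name)
        if info is None:
            print(f"{name}: no such project on PyPI")
        else:
            meta = info["info"]
            print(f"{name}: author={meta.get('author')!r} home={meta.get('home_page')!r}")

Note that mere existence proves nothing here: since Lanyado registered the name, both lookups succeed, which is exactly why the metadata fields are worth reading before installing.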
He created huggingface-cli in December after seeing it repeatedly hallucinated by generative AI; by February this year, Alibaba was referring to it in GraphTranslator's README instructions rather than the real Hugging Face CLI tool... huggingface-cli received more than 15,000 authentic downloads in the three months it has been available... "In addition, we conducted a search on GitHub to determine whether this package was utilized within other companies' repositories," Lanyado said in the write-up for his experiment. "Our findings revealed that several large companies either use or recommend this package in their repositories...."
Lanyado also said that there was a Hugging Face-owned project that incorporated the fake huggingface-cli, but that was removed after he alerted the biz.
"With GPT-4, 24.2 percent of question responses produced hallucinated packages, of which 19.6 percent were repetitive, according to Lanyado..."
"Not only that but someone, having spotted this reoccurring hallucination, had turned that made-up dependency into a real one, which was subsequently downloaded and installed thousands of times by developers as a result of the AI's bad advice, we've learned." If the package was laced with actual malware, rather than being a benign test, the results could have been disastrous.
According to Bar Lanyado, security researcher at Lasso Security, one of the businesses fooled by AI into incorporating the package is Alibaba, which at the time of writing still includes a pip command to download the Python package huggingface-cli in its GraphTranslator installation instructions. There is a legit huggingface-cli, installed using pip install -U "huggingface_hub[cli]". But the huggingface-cli distributed via the Python Package Index (PyPI) and required by Alibaba's GraphTranslator — installed using pip install huggingface-cli — is fake, imagined by AI and turned real by Lanyado as an experiment.
He created huggingface-cli in December after seeing it repeatedly hallucinated by generative AI; by February this year, Alibaba was referring to it in GraphTranslator's README instructions rather than the real Hugging Face CLI tool... huggingface-cli received more than 15,000 authentic downloads in the three months it has been available... "In addition, we conducted a search on GitHub to determine whether this package was utilized within other companies' repositories," Lanyado said in the write-up for his experiment. "Our findings revealed that several large companies either use or recommend this package in their repositories...."
Lanyado also said that there was a Hugging Face-owned project that incorporated the fake huggingface-cli, but that was removed after he alerted the biz.
"With GPT-4, 24.2 percent of question responses produced hallucinated packages, of which 19.6 percent were repetitive, according to Lanyado..."
Cost/Benefit (Score:5, Funny)
Not only that but someone, having spotted this recurring hallucination, had turned that made-up dependency into a real one, which was subsequently downloaded and installed thousands of times by developers as a result of the AI's bad advice, we've learned.
If you're married, then you know that sometimes this kind of thing can be worth it just to avoid starting another argument.
Re: (Score:3, Insightful)
If you're married to the wrong person, then you know that sometimes this kind of thing can be worth it just to avoid starting another argument.
Fixed that for you.
Re: (Score:3)
Re: (Score:1)
You can't tell shit, my friend :)
Our marriage isn't work, because we match each other very well, we both know when to compromise, and we deeply respect each other, to name a few of the many aspects that make our relationship work perfectly.
We both know we're the lucky few in a world full of wrong matches.
Re: Cost/Benefit (Score:2)
Re: (Score:2)
That would be... very difficult, since we're together 100% of the time.
Re: (Score:2)
That would be... very difficult, since we're together 100% of the time.
Dude, marrying your conjoined twin is just weird.
Re: (Score:2)
Re: (Score:2)
No kids, but plenty of pets.
Maybe I should point out this is my second marriage (I learned a LOT from the first one).
We are together 100% of the time. Working from home is a blessing.
And yes, plenty of small compromises to go around, but they are all openly discussed and agreed; I'd say the count of compromises is 50/50.
I'm aware this type of relationship is very rare, but I've seen it at my maternal grandparents before. 53 years of blissful marriage, through pretty rough times in part, until one of them sa
Predicting the future (Score:2)
Re:The Register is a JOKE news site (Score:5, Informative)
Funny thing, this exact same scenario was reported last year [scmagazine.com].
In a June 6 blog post, Vulcan Cyber researchers explained a new malicious package spreading technique they call “AI package hallucination.” The technique stems from ChatGPT and other generative AI platforms sometimes answering user queries with hallucinated sources, links, blogs and statistics.
Large-language models (LLMs) such as ChatGPT can generate these “hallucinations,” which are URLs, references, and even entire code libraries and functions that do not actually exist. The researchers said ChatGPT will even generate questionable fixes to CVEs, and — in this specific case — offer links to coding libraries that don’t exist, either.
If ChatGPT creates fake code libraries (packages), the Vulcan Cyber researchers said attackers can use these hallucinations to spread malicious packages without using familiar techniques such as typosquatting or masquerading.
Sounds pretty factual.
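One cheap defense against exactly this technique is to check a suggested package's release history before installing it. A rough sketch using PyPI's JSON API; the 90-day threshold is an arbitrary assumption, not an established rule:

    import json
    import urllib.request
    from datetime import datetime, timedelta, timezone

    def first_upload(name):
        """Earliest upload time across all releases of a PyPI project."""
        with urllib.request.urlopen(f"https://pypi.org/pypi/{name}/json") as resp:
            releases = json.load(resp)["releases"]
        times = [
            datetime.fromisoformat(f["upload_time_iso_8601"].replace("Z", "+00:00"))
            for files in releases.values()
            for f in files
        ]
        return min(times) if times else None

    def looks_suspiciously_new(name, days=90):
        uploaded = first_upload(name)
        if uploaded is None:
            return True  # no release history at all: treat as suspect
        return datetime.now(timezone.utc) - uploaded < timedelta(days=days)

    if looks_suspiciously_new("huggingface-cli"):
        print("warning: young or empty package -- verify before `pip install`")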
Re: (Score:2)
Plausible though. If they don't bother code-reviewing scripts from humans, I doubt they're going to start code-reviewing scripts from AI. Taking someone else's code as-is from a third-party site that isn't under your control and shoving it out to unsuspecting customers is not a good basis for reliable software.
Re: (Score:2)
"There is a legit huggingface-cli, installed using pip install -U "huggingface_hub[cli]"."
I don't consider that a hallucination if the thing actually exists, just under a different name. Of course, I don't trust El Reg to report accurately, so it's not clear whether "hub" is actually part of the name or just the name of the parent project. And did the hub project do what the non-hub AI result thought it would?
I have these questions, but I don't care enough to look for more details. It would be nice if I didn't hav
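For what it's worth, the brackets in pip install -U "huggingface_hub[cli]" are pip's "extras" syntax, not part of the project name: the PyPI project is huggingface_hub, and [cli] merely pulls in the optional dependencies for its command-line tool. Parsing the requirement string with the packaging library (the same requirement parser pip relies on) makes the split explicit:

    from packaging.requirements import Requirement

    req = Requirement("huggingface_hub[cli]")
    print(req.name)    # huggingface_hub  (the actual PyPI project)
    print(req.extras)  # {'cli'}          (optional feature set, extra deps only)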
AI is great... (Score:3)
...for research and fun
It should NOT be used for serious work
Re: (Score:3)
Indeed. The only time you can use it for serious work is if the task the AI is asked to do is significantly below your own level of expertise and you fully verify the results. Of course, that will probably cost you more time than doing it yourself from the start.
Re: (Score:2)
Not if you have AI do it for you.
Re: (Score:3)
Indeed. The only time you can use it for serious work is if the task the AI is asked to do is significantly below your own level of expertise and you fully verify the results. Of course, that will probably cost you more time than doing it yourself from the start.
Not so, depending on the task.
The task probably should be at or below your level of expertise, sure. But I'm using it frequently as a time saver. And it works.
"Given the following php code, add pagination functionality."
Boom, done. Could I have done it? Sure. In 4 seconds? Nope.
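For context, the boilerplate being generated in a case like that is roughly this shape; a hypothetical sketch in Python rather than PHP, with invented names and defaults:

    def paginate(items, page, per_page=20):
        """Return one page of items plus the metadata a UI needs for nav links."""
        total_pages = max(1, -(-len(items) // per_page))  # ceiling division
        page = min(max(1, page), total_pages)             # clamp out-of-range pages
        start = (page - 1) * per_page
        return {
            "items": items[start:start + per_page],
            "page": page,
            "total_pages": total_pages,
            "has_prev": page > 1,
            "has_next": page < total_pages,
        }

    print(paginate(list(range(95)), page=5)["has_next"])  # False: 95 items, 5 pages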
Re: (Score:2)
Can "AI" have added a nice security vulnerability? Sure.
Re: (Score:3)
Re: (Score:2)
A chainsaw is a great tool. Some fool chopping his leg off with one doesn't change that.
Re: (Score:2)
But it's practically never the right tool to chop vegetables...
Re: (Score:2)
Re: (Score:2)
Now find me a video of an elephant using a chainsaw...:-)
Re: (Score:2)
But companies are expecting to replace all their programmers with AI.... What could possibly go wrong?
Naive trust in LLMs + supply chains... (Score:3)
Doom Loop (Score:4, Interesting)
AI is entering a doom loop where it hallucinates then incorporates the hallucination into subsequent versions.
Eventually AI will be all hallucinations.
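As a toy illustration only (the rates below are invented numbers, not measurements of any real model): if each training generation re-ingests its predecessor's output and a fixed fraction of the remaining clean data gets polluted per pass, the hallucinated share only ratchets upward:

    # Toy model of the feedback loop; no claim about real training pipelines.
    def hallucinated_share(initial=0.05, leak=0.10, generations=10):
        share = initial
        for _ in range(generations):
            share += (1 - share) * leak  # some clean data gets polluted each pass
        return share

    for g in (1, 5, 10, 20):
        print(f"gen {g:2}: {hallucinated_share(generations=g):.0%}")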
Re: (Score:1)
Re: (Score:2)
When you feed it a giant cesspool of unvalidated data (the internet), you should not expect a single response to be accurate. None of these AIs are fed a carefully curated data set.
Needs a catchy name (Score:5, Interesting)
Re:Needs a catchy name (Score:5, Funny)
If the hallucination involves object oriented code, it'd be class hysteria.
Re: (Score:1)
I had a similar thought (Score:5, Interesting)
I asked an LLM (doesn't really matter which one; they all fail in this same way) for advice on how to load a specific file type. It gave me three possible packages to use to be able to load the file...
But not one of them actually existed. When pressed further on one of the frameworks that didn't exist, it doubled down and gave me a website for the package, which also did not exist.
And that led me to think: maybe I should build out that package. Not to create malware as described here, but because you already know some people in a similar situation will be directed right to your package of that specific name, without your having to do anything!
So it's pre-made marketing just waiting for a product.
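A benign placeholder parked on such a name can be almost nothing; a hypothetical sketch, not Lanyado's actual package (the article describes his as a benign test):

    # Hypothetical: the entire __init__.py of a placeholder package registered
    # under a hallucinated name, doing nothing but telling the user what happened.
    import warnings

    warnings.warn(
        "This package name appears to have been invented by an AI assistant; "
        "check which library you actually meant to install.",
        stacklevel=2,
    )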
Re:I had a similar thought (Score:5, Funny)
So now AI thinks up the ideas and writes the specs, and real people do all the work to make this crap work.
That's it... AI has now graduated to being the new pointy-haired boss.
Not exactly a boss.... (Score:1)
So now AI thinks up the ideas and writes the specs, and real people do all the work to make this crap work.
It's more like AI opens a door, and you can take advantage of the people going through it.
Sort of more like a force of nature than a boss.
Or if you like, the AI is recognizing a shortcoming: it pretends something exists when it does not, because plainly it would be useful, and someone opts to fill that hole for the benefit of mankind. Although in truth that scenario feels a bit like AI is a boss. :-)
what about a fact-checker AI? (Score:4, Interesting)
predictable (Score:2)
Ironically, it was entirely to be expected that this kind of thing would happen with a big probabilistic prediction engine.
Let the AI write code, they said.
It's really good at it, they said.
Look how fast it generates code when I put in a natural language prompt!
What could possibly go wrong?
Programmers behind enemy lines (Score:3)