ao link
Affino
Search Teiss
My Account
Remember Login
My Account
Remember Login

Generative AI, software development and security

Linked InXFacebook

Dr Andrew Bolster at the Synopsys Software Integrity Group examines the emergence of Generative AI and asks whether this technology is friend or foe

 

Since the release of ChatGPT, the technology industry has been scrambling to establish and operationalise the practical implications of these human-level conversational interfaces. 

 

Now, almost every major organisation is connecting their internal or product documentation to a large language model (LLM) to enable rapid question-answering. Some are starting to wade into the use of generative AI systems to aid in the design and creation of new technical solutions, be it in marketing content, web application code or chip design. 

 

But the hype has had its sharp edges as well; the word ‘hallucination’ is never far from the lips of anyone discussing chatbots, and the assumptions that people have around human-like language being equivalent to common sense have been seriously challenged. Potential users of LLM derived systems would be wise to take an optimistic but pragmatic approach. 

 

The release of the first major public LLM set off successive waves of amazement, intrigue and often fear on the part of a public unprepared for the surprisingly human behaviour of this chatbot. It appeared to communicate with intentionality, with consideration, and with a distinctively natural human voice. Over successive chat-enquiries, it seemed able to remember its own answers to previous questions, enabling users to build up coherent and seemingly complex conversations, and to attempt to answer surprisingly deep questions. 

 

Yet, these systems should be treated as one would treat a child savant; it might know all the right words in the right order but may not really have the experience or critical thinking to evaluate its own view of the world. The outputs of these systems have not earned our institutional trust, and care must be taken in leveraging these systems without significant oversight.

 

Emergence of Generative AI

A later revelation was that as successive versions of these LLMs were released (ChatGPT by OpenAI, Bard by Google, Claude by Anthropic, and the open-source LLaMA model by Meta), use expanded from playful simulation of natural language conversations with a virtual oracle, to constructing poetry from abstract documentation, laying out personal statements and cover letters for job seekers, designing presentations for speakers, and summarising articles for journalists (and sometimes writing them…).

 

As ‘prompt engineering’ became one of the most searched for terms of the year, these models started to be used to generate any kind of textual, visual, or audio content imaginable. 

 

This generative AI capability quickly gained the attention of both security researchers and cyber-criminals. For many forms of online fraud and cyber-crime, the barrier to entry is the cost of having a human attempt to convince another human to shuffle some asset around, whether it’s convincing an IT helpdesk to reset your password, convincing a befuddled computer user to pay for tech support via untraceable gift vouchers, that your CFO really needs you to pay this unapproved invoice right now, or enticing an optimistic investor that a foreign prince really will pay you back. 

 

Now, such interactions can be automated on an unheard-of scale, at a predictable price. 

 

LLMs in software development: friend, foe or frustration?

Of particular interest to the cyber-security industry was the emergence of GitHub Copilot; a LLM-based tool that was able to analyse code, and propose additions or corrections based on prompts, either directly from a chat-like interface, or as a form of advanced code-completion.

 

Security leaders began to reflect on the potential risks of this utility; not just in the surface level estimation of ‘Will these LLMs always generate secure code?’, but on the deeper implications for the software development industry. 

 

If the risk to software integrity was just the risk introduced by LLM generated code having some occasional, hallucinated, bad security practices, the resolution could have been simple enough; lock it down. And indeed, it has been observed that many organisations took this approach as they were developing their LLM response policies. 

 

However, in the complex world of software engineering, developers pulling code from the internet to fix a hair-raisingly-frustrating problem is a recognised industry strength. And here the introduction of these LLMs into any part of the software supply chain implies significant downstream risks. (This fact is recognised repeatedly in the UK NCSC’s Guidelines on Secure AI System Development released 27th November 2023).

 

It gets worse. We have already seen sites like Reddit, StackOverflow, Wikimedia and more have taken steps to block content that appears to be generated by such LLMs. But the guidelines for assessing that appearance are extremely subjective, down to things like speed of response rather than quality or correctness of content.

 

Google and others have effectively thrown their hands in the air by saying that they will watermark generated content, implying that they have not identified a suitably generic method for detecting LLM-generated content, so they have to police it on the supply side.

 

These actions imply that the internet may already be infected with LLM-derived content, which, returning to the software development space, now means that an ‘intern with access to the internet’ is just as dangerous as an un-trusted LLM. 

 

How to approach security

Taken together, LLMs muddy the software development water in many ways. Security leaders have little choice but to expand their DevSecOps positions to encompass any ML or AI derived features and content and must be proactive in identifying where that risk lies in their own SDLCs. 

 

It’s no longer enough to run a scan against your codebase and declare that you are safe. Increasingly, security leaders are pushing for a wider, more holistic view of security that includes these upstream AI/ML risks.

 

Furthermore, Application Security Posture Management must include validating the operation, deployment, and even training data origin, as a first-class part of the software development lifecycle, rather than just an experimental afterthought. 

 

Ultimately, organisations will have to take a fundamentally different approach to application and service security. Rather than just building or deploying time scanning to attest that an application is secure, organisations will have to regularly and repeatedly validate that the behaviour of their third party service integrations (and therefore, their own services) is still as expected.

 


 

Dr Andrew Bolster is senior research and development manager at the Synopsys Software Integrity Group

 

Main image courtesy of iStockPhoto.com

Linked InXFacebook
Affino

Winston House, 3rd Floor, Units 306-309, 2-4 Dollis Park, London, N3 1HF

23-29 Hendon Lane, London, N3 1RT

020 8349 4363

© 2025, Lyonsdown Limited. teiss® is a registered trademark of Lyonsdown Ltd. VAT registration number: 830519543