ao link
Affino
Search Teiss
My Account
Remember Login
My Account
Remember Login

How large language models are reshaping cyber-threat intelligenc

As cyber-threat intelligence teams struggle to keep pace with an ever-growing volume of information, large language models are emerging as a powerful support tool. Used well, they can help organisations move faster, see patterns earlier and communicate risk more effectively. Used poorly, they risk reinforcing bias, creating false confidence and obscuring real threats.

 

The challenge is no longer finding intelligence but extracting insight quickly enough for it to influence real security decisions. Large language models (LLMs) are increasingly being used to address this gap, helping organisations spot emerging threats, understand attacker behaviour and produce usable intelligence at scale. Their growing role, however, also raises important questions about reliability, bias and how success should be measured.

 

One of the clearest benefits of LLMs in cyber-threat intelligence is their ability to surface early signs of new attack activity. Traditional intelligence processes tend to rely on known indicators of compromise and established threat actor profiles, which makes them inherently reactive. LLMs work differently.

 

By analysing large volumes of unstructured text, they can identify recurring themes, unfamiliar tools or subtle shifts in attacker behaviour across multiple sources. This makes it easier to spot weak signals that may indicate a new campaign or technique before it becomes widely recognised.

 

Industry research highlights how LLMs can cluster related reporting and highlight meaningful differences between incidents, allowing analysts to focus on what has changed rather than what has been repeated.

 

From raw reporting to structured insight

 

LLMs are also reshaping how organisations interpret and structure threat information. A significant proportion of intelligence work involves translating narrative descriptions into something more systematic: identifying tactics, techniques and procedures, linking incidents to known frameworks and understanding how different campaigns relate to one another.

 

This process is time-consuming and often inconsistent across teams. Research suggests that LLMs can support this work by extracting key entities, normalising terminology and assisting with the mapping of unstructured reports to established models such as MITRE ATT&CK, provided their outputs are reviewed by experienced analysts.

 

Used in this way, LLMs act as an efficiency layer rather than a replacement for human judgement.

Reporting is another area where the impact of LLMs is becoming more visible. Cyber-threat intelligence needs to be communicated to very different audiences, from security operations teams to senior executives.

 

LLMs can help generate tailored summaries, technical briefings and high-level risk narratives from the same underlying intelligence, reducing the time analysts spend on drafting and reformatting.

 

Case studies from organisations experimenting with generative AI show that this approach can significantly cut reporting time while maintaining clarity and consistency, as long as outputs are subject to proper review.

 

Limits, risks and measuring real value

 

Despite these advantages, LLMs have clear limitations that prevent them from operating autonomously in intelligence workflows. One of the most serious is hallucination, where models produce information that appears credible but is not supported by evidence.

 

In a cyber-security context, this can lead to false assumptions about vulnerabilities, threat actors or attack methods, potentially diverting attention and resources away from genuine risks. This behaviour is a well-documented characteristic of large language models and remains an ongoing challenge for their use in high-stakes domains.

 

Bias is another structural issue. Because LLMs are trained on large bodies of publicly available text, they tend to reflect the biases present in that data. In practice, this means threats affecting English-speaking regions, large organisations and high-profile sectors are more visible than those targeting smaller markets or less reported industries. Over time, this can skew threat prioritisation and create blind spots, particularly for organisations with a global footprint.

 

Language coverage further complicates the picture. Cyber-threat activity is global, and early reporting on new attacks often appears in local languages or regional forums. However, research shows that LLM performance drops in languages other than English, particularly when dealing with technical nuance. This increases the risk that early warning signals are missed simply because they originate in languages the model handles less effectively.

 

There are also security considerations tied to the use of LLMs themselves. When models ingest untrusted external content, they may be exposed to manipulation techniques such as prompt injection, which can influence outputs in unintended ways. For organisations integrating LLMs into security workflows, this raises important questions about data integrity, confidentiality and operational.

 

Given these constraints, organisations need meaningful ways to assess whether LLMs are actually improving cyber-threat intelligence outcomes. Measuring success purely in terms of speed or automation misses the point.

 

More useful indicators include how quickly emerging threats are identified after publication, how accurately intelligence is structured and how often AI-assisted outputs lead to concrete actions such as new detections, patching decisions or threat-hunting activity. Industry commentary increasingly stresses that intelligence metrics should be tied to business and security outcomes, rather than internal activity levels.

 

Trust and governance are equally important. Tracking how often AI-generated outputs require correction, monitoring language and source coverage and measuring the level of human review needed all help organisations understand whether LLMs are genuinely reducing analyst workload or simply shifting it elsewhere. These considerations align closely with emerging AI risk management frameworks, which emphasise transparency, oversight and continuous evaluation in high-risk use cases such as cyber-security.

 

In practice, the most effective use of large language models in cyber-threat intelligence treats them as support tools rather than decision-makers. But, when grounded in reliable sources, reviewed by experienced analysts and evaluated using outcome-focused metrics, LLMs can strengthen an organisation’s ability to anticipate and respond to cyber-threats. Without those safeguards, however, they risk adding speed without insight, and confidence without certainty.

 


Please take 30 seconds to register

Register Now

 

Already have an account? Sign in

Remember Login
Affino

Winston House, 3rd Floor, Units 306-309, 2-4 Dollis Park, London, N3 1HF

23-29 Hendon Lane, London, N3 1RT

020 8349 4363

© 2025, Lyonsdown Limited. teiss® is a registered trademark of Lyonsdown Ltd. VAT registration number: 830519543