Most "AI for cybersecurity" content is vendor marketing dressed up as advice. This isn't that. After spending real working hours with AI tools across detection engineering, security automation, and adversarial AI/ML research, here's what actually moves the needle — and what's hype.

The honest take is that AI hasn't changed cybersecurity as much as the marketing implies. But for individual practitioners, the right combination of tools can compress 30-40% of the busywork in a typical day, and a smaller percentage of it makes you genuinely better at the analytical work. That's worth attention.

How to think about AI tools in security

AI tools fall into three useful categories:

  1. General-purpose LLMs you use as a thinking partner. These are the highest-leverage and least vendor-locked.
  2. AI-augmented specialty tools — SIEMs, IDE assistants, code review platforms — where AI is built into a workflow you already do.
  3. Custom agent frameworks where you compose AI into your own workflows. The frontier; high effort, high payoff.

Most practitioners overweight category 2 (because vendors push it hard) and underweight category 1 (because it doesn't have a sales rep).

Category 1: General-purpose LLMs

This is where the time savings actually live for individual security practitioners.

Claude (Anthropic) — the current best general-purpose model for security work. Strengths: long context windows for log analysis, careful technical writing, willingness to engage with security context without aggressive refusals on legitimate work. Use it for: log analysis, detection rule writing, incident report drafting, threat model walkthroughs, code review for security issues, explaining unfamiliar techniques.

ChatGPT (OpenAI) — strong general assistant, large model selection. Use it for: quick syntax lookups, scripting tasks, brainstorming. Code Interpreter is useful for one-off log parsing and data analysis tasks without setting up a Python environment. Custom GPTs let you build specialized assistants (Splunk SPL helper, Sigma rule writer) without coding.

Local models — Llama 3, Mistral, Qwen, DeepSeek and others, run via Ollama, LM Studio, or llama.cpp. The use case: handling sensitive log data or PII without sending it to external APIs. Performance is now genuinely strong for analysis tasks. Worth setting up if you handle data you can't send to cloud LLMs.

Practical workflows:

The honest limitation: LLMs hallucinate technical specifics. CVE numbers, RFC details, exact tool flag syntax, specific version behaviors. Use them as thinking partners, not as sources of truth. Verify anything load-bearing.

Category 2: AI in security tools

Microsoft Security Copilot — integrated across Defender, Sentinel, Entra. Strong if you're a Microsoft shop. Worth the cost in enterprise environments where Sentinel is your SIEM. Limited value outside the Microsoft ecosystem.

Splunk AI Assistant / AI in SOAR — generates SPL queries from natural language, suggests playbook actions. Genuinely useful for SPL beginners; less impressive for advanced users who can write SPL faster than they can prompt for it.

Burp Suite AI features (Pro) — AI-assisted scanning, vulnerability description generation, attack payload suggestions. Modest improvements to existing workflows. Not transformative.

Snyk and Veracode AI augmentation — AI-prioritized vulnerability findings, AI-generated remediation suggestions. Reduces noise in already-noisy outputs. Useful for AppSec teams drowning in scanner results.

GitHub Copilot for security — increasingly useful for writing security-relevant code (validators, parsers, integration scripts). Watch carefully — Copilot will happily write subtly insecure code unless you review it.

SOC AI assistants from vendors (CrowdStrike Charlotte AI, SentinelOne Purple AI, etc.) — alert triage acceleration, natural language threat hunting. Useful at SOC scale; overhead isn't worth it for small teams.

Category 3: Custom agent frameworks

This is the frontier and where individual practitioners can build genuinely novel capability if they're willing to write code.

LangChain / LangGraph — Python frameworks for composing LLMs with tools, memory, and workflows. Steep learning curve, powerful results. Use cases: automated threat intel correlation, custom detection engineering pipelines, intelligent incident triage.

CrewAI — agent orchestration framework. Build multi-agent workflows where specialized agents collaborate on tasks. Useful for security research workflows where you want parallelism with role specialization.

Custom MCP servers — Model Context Protocol lets you build tool integrations that any compatible LLM can use. Building an MCP server that exposes your SIEM, your ticketing system, or your threat intel feeds means any LLM you use becomes immediately useful inside your environment.

Practical custom agent use cases:

Building these requires real engineering time but pays back fast at the team level.

What's overhyped

"AI-powered" everything in vendor sales materials. If a security vendor added "AI" to a feature name in the last 18 months and the underlying capability didn't change, they added a marketing layer. Look at what the feature does, not what it's branded as.

Agentic offensive security tools that promise to "automatically pentest your environment." Most of these are wrappers around existing scanners with an LLM commenting on output. Useful for documentation, not for finding novel vulnerabilities.

AI threat intel feeds. Many are just RSS aggregators with LLM-generated summaries. Sometimes worse than reading the source.

AI-generated detection rules at scale. A model can generate 1,000 Sigma rules in an hour. Quality varies dramatically. Curated rule sets from teams that maintain them remain more valuable.

What's actually undervalued

Using LLMs as a writing assistant for security work. Incident reports, executive summaries, vendor security questionnaires, internal documentation. Hours of saved work per week.

Custom GPTs / Projects for specific repetitive tasks. A custom GPT pre-loaded with your environment's context, your detection naming conventions, your report templates. Five minutes to set up, hours of savings ongoing.

LLMs as a translation layer between security and non-security stakeholders. Explaining a CVE's business impact, drafting communications to executives, writing customer-facing security advisories. The tone-matching is genuinely useful.

The security risks of using AI tools

A few things to keep in mind as you adopt AI tools in your workflow:

The starter stack

If you're new to integrating AI into your security workflow:

  1. A general-purpose LLM you trust. Claude, ChatGPT, or a local Llama setup. Use it daily.
  2. One IDE-integrated assistant. Copilot or Cursor. Use it for security scripting and automation work.
  3. One vendor AI feature you actually use. Pick the one your existing toolchain offers, learn what it's actually good at.
  4. Optional: a custom GPT or Claude Project loaded with your environment's context for repetitive tasks.

That's a baseline that produces real time savings within a week.

The bigger picture

AI tools won't replace security analysts in the next five years. They'll change the work — more synthesis, less rote pattern-matching — and they'll widen the gap between practitioners who learn to use them well and those who don't.

For people earlier in their careers, the leverage is even greater. AI tools can compress the learning curve on unfamiliar topics dramatically. Pair AI augmentation with structured study via Coursera's Google Cybersecurity Certificate or IBM Cybersecurity Analyst, and you have a foundation-building workflow that previous generations of practitioners would have killed for. See Using AI to Study for Certifications for the practical playbook.

What to read next

← Back to all reviews