Blog IT Operations

The Gaps in Microsoft 365 Defender That Let Targeted Phishing Through

By Ravi Srinivasan July 14, 2025 8 min read

Microsoft 365 Defender interface screenshot showing a targeted phishing email that bypassed standard email filters

Microsoft 365 Defender is not a bad email security product. If your organization is running Exchange Online without Defender for Office 365 Plan 1 or Plan 2, you should fix that before reading further — the baseline protection matters and EOP handles enormous volumes of commodity threats effectively. Microsoft's published detection rates for known malware in Exchange Online are genuinely high, and the investment in ATP infrastructure shows.

This article is for organizations that have Defender configured and want an honest assessment of where its detection stops — specifically for targeted phishing attacks. If your threat model is predominantly volume spam and mass credential phishing, Defender's coverage is adequate. If your threat model includes targeted spear phishing, BEC, and vendor email compromise, there are specific gaps worth understanding before you assume the stack is comprehensive.

What Defender for Office 365 Does Well

Before the gaps: the strengths, specifically, because understanding the capability boundary requires knowing where it actually sits.

Exchange Online Protection (EOP) is the base layer — anti-spam, anti-malware, connection filtering, and basic spoofing protection included with every M365 subscription. Connection filtering blocks known-bad IPs. The malware filter catches known malicious attachments by hash and behavioral signature. For commodity threats, EOP's coverage is substantial.

Safe Attachments (Plan 1+) detonates attachments in a virtual environment before delivery. This catches polymorphic malware that evades hash-based detection — the attachment runs in an isolated environment and is held if malicious behavior is observed. Detonation adds latency (typically 1–3 minutes for most attachments; Microsoft publishes average values), which most organizations accept given the protection value.

Safe Links (Plan 1+) rewrites URLs in email and wraps them in a time-of-click proxy check. This is designed to catch "delayed detonation" links — URLs that point to clean pages at delivery time and activate malicious behavior later. The proxy check fires at click time, not just delivery time. This is a meaningful capability that catches a real attack class.

Anti-spoofing and DMARC enforcement in Defender enforces your DMARC policy and adds its own spoofing detection layer on top. Composite authentication (compauth) scores each inbound message. These controls reduce direct domain spoofing risk materially.

Attack Simulation Training (Plan 2) runs internal phishing simulations for security awareness measurement. Valuable for training program calibration, though this is a different layer from real-time threat detection.

Gap 1: Novel Domain Infrastructure

Defender's reputation system scores sending infrastructure against Microsoft's proprietary threat intelligence and third-party reputation feeds. This works against established threat actors with persistent infrastructure. It does not work against a campaign that uses freshly registered domains with no reputation history.

Registering a domain for $12, configuring SPF/DKIM/DMARC correctly, and sending from a clean IP address with no prior abuse history produces an email that Defender's reputation checks score as neutral or unknown — not blocked, not flagged, delivered to inbox. The registration-to-attack window for targeted campaigns is often under 72 hours, shorter than the time it takes threat intelligence feeds to observe and score new infrastructure. This is not a Defender failure; it's a fundamental limitation of reputation-based detection against zero-history infrastructure.

The practical implication: a 3,500-employee professional services firm in the Midwest that has never received email from acme-billing-dept.com before — because it was registered three days ago — will have that email land in inbox with Defender scores showing no known threats. The detection gap is not in Defender's rules; it's in the absence of historical signal.

Gap 2: Text-Only Social Engineering

Safe Attachments and Safe Links operate on what's inside or linked from the email. A plain-text BEC message with no attachment and no URL — just a request for a wire transfer, a W-2 data pull, or an employee payroll change — has nothing for either control to inspect. Defender's anti-spam classifier will evaluate the message, but its classification signals are oriented toward spam characteristics (mass-sending patterns, keyword frequency, header anomalies) rather than semantic social engineering patterns.

A well-crafted BEC message reads like legitimate internal business correspondence. It uses appropriate vocabulary, references plausible business context, and makes a request that, while unusual, falls within the range of what executives occasionally ask employees to do. A spam classifier trained on volume mail characteristics will not reliably flag this message as malicious, because it doesn't share the statistical properties of spam — it was written specifically to avoid them.

This is the core gap: Defender's inspection pipeline doesn't include a component that reads the semantic intent of email body language and evaluates whether the combination of claimed identity, request type, and urgency framing suggests social engineering. That's not a criticism of Defender's design; it was architected for a different primary threat model.

Gap 3: Safe Links and Time-Delay Bypass

Safe Links provides time-of-click protection, which is better than delivery-time-only URL checking. But sophisticated campaigns have adapted. Two bypass patterns are well-documented:

Multi-redirect chains through legitimate services: The URL in the email points to a legitimate service (Bit.ly, Google Redirects, a compromised legitimate website) that only resolves to the malicious destination at click time, with the final redirect conditioned on user-agent, geographic location, or click timing. Safe Links evaluates the URL as seen in the email — if the first hop is a legitimate service, the subsequent redirect chain may not be fully traversed during the Safe Links check.

CAPTCHA gating: The malicious landing page presents a CAPTCHA before revealing phishing content. Automated detonation systems typically cannot solve CAPTCHAs, so the Safe Links check sees a CAPTCHA page, not the phishing form behind it. This technique is documented in multiple threat intelligence reports and specifically noted in Proofpoint TAP research as a growing evasion category.

We're not saying Safe Links is ineffective — it catches a real and large category of URL-based phishing. We're saying there are documented evasion techniques specifically designed to bypass it, and mid-market security teams should understand what those look like rather than treating Safe Links as comprehensive URL protection.

Gap 4: Impersonation Detection Scope

Defender for Office 365 Plan 2 includes user impersonation protection — you can specify up to 350 users (typically executives and high-value targets) whose identities Defender actively monitors for impersonation attempts. When an inbound email's display name or address resembles a protected user, it's flagged.

This is useful for protecting known high-value targets, but it has scope limitations. First, the 350-user cap means mid-market organizations with 1,000+ employees can only cover their most critical accounts. Finance team members below executive level, who are common BEC targets, may not be in the protected list. Second, vendor impersonation — an attacker impersonating a trusted external vendor rather than an internal employee — is outside the scope of the user impersonation policy, which focuses on internal identity protection.

Domain impersonation protection in Plan 2 covers a defined list of trusted domains. Lookalike domains for vendors not on your trusted list don't trigger this protection. Given that vendor email compromise has become the dominant BEC vector, this scope limitation matters operationally.

How Layered Detection Addresses the Gaps

The standard recommendation for closing these gaps is to add a supplemental inspection layer on top of Defender rather than replacing it. Defender handles the known-threat, high-volume baseline; the supplemental layer handles targeted social engineering that escapes the known-threat stack.

In practice, a supplemental layer integrating via the Microsoft Graph API (Mail.Read application permissions, OAuth 2.0 delegated access) can evaluate messages that Defender passed without flags and apply semantic analysis to identify social engineering patterns that don't generate Defender signals. The two systems are complementary rather than competitive — Defender's fast-path signature and reputation checks reduce the volume of messages requiring deeper analysis; semantic inspection handles the residual high-risk set.

The relevant NIST framework reference here is SP 800-53 SI-8 (Spam Protection), which explicitly recommends deploying spam protection mechanisms at multiple points (network entry, workstation, server) and notes that no single mechanism provides comprehensive coverage. The same defense-in-depth principle applies to phishing detection — no single product, including Defender, is designed to be the entire answer.

For more on how semantic analysis closes the social engineering gap, see our piece on LLM-based analysis versus signature detection. For specifics on the targeted threat types Defender's architecture doesn't cover comprehensively, the spear phishing threat page covers the detection approach in detail.