Products
How we help
- - - Industries
    - Banking
    - Insurance
    - Legal
    - Technology
    - Others
  - - Traditional security
      awareness is not enough.
    - It’s time to adapt.
    - Get in Touch
About
Resources
- - - Insights
    - Whitepapers
    - The latest in cybersecurity behavioral research by our in-house Science and Research team.
    - eBooks
    - Awareness, behavior, and culture-focused knowledge and how-tos.
    - Blog
    - Industry news, updates, and guidance for security professionals.
    - Podcasts
    - Insights on all things human cyber risk from leading industry voices.
  - - Science
    - SebDB
    - The world’s most comprehensive security behaviors database.
    - Research library
    - An archive of research and studies on behavioral cybersecurity by leading academics.
    - Free cybersecurity tools
    - Selection of human risk management tools.
    - Behave Hub
  - - Community
    - Our events
    - Cybersecurity conferences, expos, conventions, and trade shows around the globe.
    - Webinars
    - Join our live webinars, or watch the recordings on demand.
    - SebDB community
    - A community for professionals focused on changing security behaviors.
Plans
- - - Search for:
Login
Request demo

Select Page

Research library

CatBERT: Context-aware tiny BERT for detecting targeted social engineering emails

Targeted phishing emails are a major cyber threat on the Internet today and are insufficiently addressed by current defences. In this paper, we leverage industrial-scale datasets from Sophos cloud email security service, which defends tens of millions of customer mailboxes, to propose a novel Transformer-based architecture for detecting targeted phishing emails. Using real-world targeted phishing data as well as millions of benign customer emails for training and evaluation, we show that our proposed CatBERT (Context-Aware Tiny Bert) model achieves a 87% detection rate at a false positive rate of 1%, as compared to DistilBERT [20], LSTM (Long Short-Term Memory) [13], and logistic regression baselines which achieve 83%, 79%, and 54% detection rates respectively. Our model leverages both natural language and email header inputs, is more computationally efficient than competing transformer approaches, and we show that it is less prone to adversarial attacks which deliberately replace keywords with typos or synonyms.

Back

Read full article

Use cases

Industries

Traditional security
awareness is not enough.

Insights

Science

Community

CatBERT: Context-aware tiny BERT for detecting targeted social engineering emails

You May Also Like

The Behavior Grid: 35 ways behavior can change

Employee behavior: the psychological gateway for cyberattacks

Products

Other

How we help

About

Resources

Accreditations

Use cases

Industries

Traditional security awareness is not enough.

Insights

Science

Community

CatBERT: Context-aware tiny BERT for detecting targeted social engineering emails

You May Also Like

The Impact of Workload on Phishing Susceptibility: An Experiment

The Behavior Grid: 35 ways behavior can change

Employee behavior: the psychological gateway for cyberattacks

Traditional security
awareness is not enough.