MilterAgent

A high-performance email filtering server implementing the Milter protocol, designed to protect against phishing emails from spoofed legitimate companies and organizations.

Overview

MilterAgent is a comprehensive email security solution that implements the Milter protocol for real-time email filtering. Built with Rust and Tokio for maximum performance, it provides advanced phishing detection capabilities while maintaining low latency and high throughput.

Features

Advanced Phishing Detection: Comprehensive filters for detecting phishing emails impersonating:
- Banks and financial institutions (MUFG, Mizuho, SMBC, etc.)
- Delivery and logistics companies (Japan Post, Yamato, Sagawa, DHL, etc.)
- Transportation companies (JR, airlines, private railways, etc.)
- E-commerce platforms and online services
Spamhaus API Integration: Automated spam sender reporting:
- Automatic detection and reporting of phishing email source IPs
- Integration with Spamhaus API for threat intelligence sharing
- Configurable API authentication and endpoints
- IP whitelist support with CIDR notation for trusted networks
Advanced Regex Support: Powered by fancy-regex with support for:
- Negative lookahead ((?!...)) and positive lookahead ((?=...))
- Negative lookbehind ((?<!...)) and positive lookbehind ((?<=...))
- Complex pattern matching for sophisticated email filtering
Unicode Security: Comprehensive invisible character filtering and normalization:
- Removal of invisible characters, BiDi controls, and combining diacritics
- Protection against Unicode-based spoofing attacks in all languages
- NFKC normalization for consistent text processing
Milter Protocol Support: Full implementation of the Milter protocol for seamless integration
Real-time Processing: Asynchronous Rust implementation for handling thousands of concurrent connections
High-Speed Parallel Filtering: Multi-threaded filter evaluation with parallel execution for maximum performance
Flexible Configuration: Modular configuration system with include support
Signal Handling: Graceful shutdown and live configuration reload via SIGHUP/SIGTERM
Cross-platform: Supports both Unix and Windows environments
Comprehensive Logging: Configurable log levels with JST timestamp support

Architecture

Asynchronous Runtime: Built with Tokio for handling multiple concurrent connections
Modular Design: Separate modules for parsing, filtering, and protocol handling
Memory Safe: Written in Rust for guaranteed memory safety and performance
Parallel Filter Engine: Configurable rule-based filtering system with regex support and multi-threaded parallel processing for high throughput

Installation

Prerequisites

Rust 1.80 or later (with Rust 2024 edition support)
Cargo package manager
Compatible mail server (Postfix, Sendmail, etc.)

Building from Source

cd /usr/src/
git clone https://github.com/disco-v8/MilterAgent.git
mv MilterAgent "MilterAgent-$(date +%Y%m%d)"
cd /usr/src/"MilterAgent-$(date +%Y%m%d)"

cargo build --release

/bin/cp -af ./target/release/milter_agent /usr/sbin/
rsync -av ./logrotate.d/milter_agent /etc/logrotate.d/
rsync -av ./systemd/milter_agent.service /usr/lib/systemd/system/
rsync -av ./etc/ /etc/

Configuration

The server uses a modular configuration system:

Main Configuration File

Create MilterAgent.conf:

Listen 127.0.0.1:8898
Client_timeout 30
Log_file /var/log/milteragent.log
Log_level info

include MilterAgent.d

Filter Configuration

Place filter files in the MilterAgent.d/ directory:

filter_bank.conf - Banking and financial services filters
filter_transport.conf - Transportation and logistics filters
Additional filter files as needed

If you want to start with WARN (adding only a header) instead of REJECT, please perform the following batch replacement.

chmod 700 /etc/MilterAgent.d/reject2warn.sh
cd /etc/MilterAgent.d/
./reject2warn.sh

Filter Syntax

filter[filter_name] = 
    "decode_from:(?i)(Pattern):AND",
    "decode_from:!@(.*\.)?legitimate-domain\.com:REJECT"

Configuration Options

Listen: Server bind address and port
- Format: IP:PORT or just PORT
- Example: 127.0.0.1:8898 or 8898
Client_timeout: Client inactivity timeout in seconds
Log_file: Path to log file (optional, defaults to stdout)
Log_level: Logging verbosity (info, trace, debug)
Spamhaus_report: Enable Spamhaus API reporting (yes/no, default: no)
Spamhaus_api_token: Spamhaus API authentication token (optional)
Spamhaus_api_url: Spamhaus API endpoint URL (optional)
Spamhaus_safe_address: IP addresses or CIDR networks to exclude from Spamhaus reporting (optional, can be specified multiple times)
include: Include additional configuration directories

Usage

Starting the Server

systemctl daemon-reload
systemctl --no-pager status milter_agent

systemctl enable milter_agent
systemctl --no-pager status milter_agent

systemctl start milter_agent
systemctl --no-pager status milter_agent

Mail Server Integration

Postfix Integration

Add to /etc/postfix/main.cf:

smtpd_milters = inet:127.0.0.1:8898
milter_default_action = accept
milter_protocol = 6

Restart Postfix:

sudo systemctl restart postfix

Sendmail Integration

Add to /etc/mail/sendmail.mc:

INPUT_MAIL_FILTER(`milteragent', `S=inet:8898@127.0.0.1')

Signal Handling

SIGHUP: Reload configuration files
SIGTERM: Graceful shutdown
Ctrl+C (Windows): Graceful shutdown

# Reload configuration
systemctl reload milter_agent
systemctl --no-pager status milter_agent

# Graceful shutdown
systemctl stop milter_agent
systemctl --no-pager status milter_agent

Logging

The server supports configurable log levels:

info (0): Basic operational messages
trace (2): Detailed tracing information
debug (8): Comprehensive debugging output

Logs include JST timestamps and are output to either a file or stdout based on configuration.

Filter Examples

Banking Filter

filter[phish_mufg] = 
    "decode_from:(?i)(三菱UFJ|MUFG):AND",
    "decode_from:!@(.*\.)?mufg\.jp:REJECT"

Transportation Filter

filter[phish_jreast] = 
    "decode_from:(?i)(JR東日本|East Japan Railway):AND",
    "decode_from:!(@(.*\.)?jreast\.co\.jp|@(.*\.)?jre-vts\.com):REJECT"

Advanced Negative Lookahead Filter

filter[phish_monex_html] = 
    "body:(?i)monex.*(?!.*\.?(monex\.co\.jp|monex\.com|on-compass\.com)\b):REJECT"

This filter detects emails containing "monex" but not from legitimate Monex domains, preventing phishing attempts that impersonate the financial service while allowing legitimate communications.

Caveat: Negative Lookahead Recursion Depth in `fancy-regex`

When using fancy-regex for semantic filtering, be aware that negative lookaheads (?!...) containing more than 3–4 | branches may exceed internal recursion depth.

This can result in partial matches or incorrect evaluation, especially when filtering URLs with multiple domain exclusions.

Recommended workaround:

Split complex lookaheads into multiple filter rules
Avoid deeply nested alternations inside (?!...)
Use simpler patterns with fewer branches per lookahead

Example:

# Avoid this (may fail):
decode_html:https?://(?!.*\.(a\.com|b\.com|c\.com|d\.com|e\.com)\b).+:REJECT

# Prefer this (stable evaluation):
decode_html:https?://(?!.*\.(a\.com|b\.com|c\.com)\b).+:AND
decode_html:https?://(?!.*\.(d\.com|e\.com)\b).+:REJECT

Contributing

Fork the repository
Create a feature branch (git checkout -b feature/amazing-feature)
Commit your changes (git commit -m 'Add amazing feature')
Push to the branch (git push origin feature/amazing-feature)
Open a Pull Request

License

This project is licensed under the MIT License - see the LICENSE file for details.

Notice:

This project uses third-party crates, some of which are licensed under Apache-2.0.
In particular, the mail-parser crate is licensed under Apache-2.0. Please review its license if you redistribute or modify this software.

Security Notice

This software is designed to help protect against phishing emails but should be used as part of a comprehensive email security strategy. Regular updates to filter rules are recommended to address new phishing campaigns.

Support

For issues, questions, or contributions, please use the GitHub issue tracker.

Multi-part Email with Attachments

[2025/07/23 15:31:20] このメールはマルチパートです
[2025/07/23 15:31:20] [mail-parser] テキストパート数: 1
[2025/07/23 15:31:20] [mail-parser] 非テキストパート数: 1
[2025/07/23 15:31:20] 本文(1): Email body content
[2025/07/23 15:31:20] 非テキストパート(1): content_type="application/pdf", encoding=Base64, filename=document.pdf, size=1024 bytes

Architecture

Module Structure

main.rs: Server startup, configuration management, signal handling
client.rs: Per-client Milter protocol handling
milter.rs: Milter command decoding and response generation
milter_command.rs: Milter protocol command definitions
parse.rs: MIME email parsing and output formatting
init.rs: Configuration file management
logging.rs: JST timestamp logging macros

Milter Protocol Flow

OPTNEG: Protocol negotiation
CONNECT: Client connection information
HELO/EHLO: SMTP greeting
DATA: Macro information
HEADER: Email headers (multiple)
BODY: Email body content (multiple chunks)
BODYEOB: End of body - triggers email parsing and output

Dependencies

tokio: Asynchronous runtime
mail-parser: MIME email parsing
fancy-regex: Advanced regex engine with lookahead/lookbehind support
chrono: Date and time handling
chrono-tz: Timezone support
reqwest: HTTP client for API integration
serde: Serialization framework

Development

Running in Development Mode

cargo run

Testing with Sample Email

You can test the server by sending emails through a configured Postfix instance or using telnet to send raw SMTP commands.

Debug Features

NUL byte visualization: \0 bytes are displayed as <NUL>
Hex dump output for unknown commands
Detailed protocol command logging
Error handling with descriptive messages

Changelog

v0.3.3（2025-10-02）Loopback Address Connection Optimization

Optimized loopback address connection handling to avoid unnecessary processing
Loopback connections (127.0.0.1, ::1) are now silently dropped without logging or spawning client threads
Enhanced invisible character filtering with comprehensive Unicode coverage
Removed language-specific restrictions for universal protection against invisible character attacks
Consolidated filtering logic between parse.rs and filter.rs for consistency
Added RemoteIP_Target configuration option for granular remote IP filtering control

v0.3.2（2025-09-18）Japanese text sanitization and From header spoofing countermeasures.

Removal of invisible and bidirectional control characters in Japanese text, and mitigation of address spoofing via From name manipulation.

v0.3.1（2025-08-20）Mitigation of recursion depth limitations

Modified processing to avoid recursion depth issues in fancy-regex.

v0.3.0 - Semantic Filtering Reinforced (2025-08-17)

Added Unicode normalization (NFKC) and invisible character stripping to Subject/From headers
Improved spoofing resistance against obfuscated Unicode and bidirectional control characters
Enhanced semantic matching accuracy and operational reproducibility

v0.2.1 (2025-08-13)

Refactored filter rule parsing logic for more robust and flexible configuration
Improved handling of colons (:) in filter rules, especially for URLs—no escaping required

v0.2.0 (2025-08-08)

Major Update: Rust 2024 Edition Support
- Upgraded from Rust 2021 to Rust 2024 edition
- Improved performance and latest language features
- Enhanced standard library integration
Advanced Regex Engine Upgrade
- Migrated from standard regex to fancy-regex 0.16
- Added support for negative lookahead ((?!...)) and positive lookahead ((?=...))
- Added support for negative lookbehind ((?<!...)) and positive lookbehind ((?<=...))
- Enables sophisticated phishing detection patterns
Spamhaus API Integration
- Automatic detection and reporting of phishing email source IPs
- Configurable Spamhaus API authentication and endpoints
- Threat intelligence sharing for spam-flagged emails
- Added Spamhaus_report, Spamhaus_api_token, and Spamhaus_api_url configuration options
- Added Spamhaus_safe_address for IP whitelist configuration
- Supports CIDR notation for network range exclusions
Filter Engine Improvements
- Fixed fancy-regex recursion depth issue with complex negative lookaheads
- Split complex OR patterns to avoid recursion limits (>3-4 branches)
- Improved semantic accuracy for w3.org and domain exclusion filters
- Enhanced filter reliability and pattern matching stability
Dependency Optimization
- Removed lazy_static dependency (unused)
- Replaced once_cell with standard library std::sync::OnceLock
- Reduced external dependencies and improved compilation time
- Updated to latest compatible versions of all dependencies
Code Quality Improvements
- Comprehensive code documentation and comments
- Improved error handling in regex matching
- Enhanced code readability and maintainability
- Applied consistent formatting with cargo fmt
- Resolved all cargo clippy warnings
Configuration System Enhancements
- Better error messages for invalid filter configurations
- Improved regex compilation error reporting
- Enhanced configuration file parsing robustness

v0.1.0

Initial release
Basic Milter protocol implementation
MIME email parsing and output
Configuration file support
Signal handling
JST timestamp logging

Name		Name	Last commit message	Last commit date
Latest commit History 189 Commits
.github		.github
etc		etc
logrotate.d		logrotate.d
src		src
systemd		systemd
.gitattributes		.gitattributes
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
CONTRIBUTING.md		CONTRIBUTING.md
Cargo.lock		Cargo.lock
Cargo.toml		Cargo.toml
LICENSE		LICENSE
MilterAgent.conf.sample		MilterAgent.conf.sample
README.ja-jp.utf-8.md		README.ja-jp.utf-8.md
README.md		README.md

Folders and files

Latest commit

History

Repository files navigation

MilterAgent

Overview

Features

Architecture

Installation

Prerequisites

Building from Source

Configuration

Main Configuration File

Filter Configuration

Filter Syntax

Configuration Options

Usage

Starting the Server

Mail Server Integration

Postfix Integration

Sendmail Integration

Signal Handling

Logging

Filter Examples

Banking Filter

Transportation Filter

Advanced Negative Lookahead Filter

Caveat: Negative Lookahead Recursion Depth in fancy-regex

Contributing

License

Security Notice

Support

Multi-part Email with Attachments

Architecture

Module Structure

Milter Protocol Flow

Dependencies

Development

Running in Development Mode

Testing with Sample Email

Debug Features

Changelog

v0.3.3（2025-10-02）Loopback Address Connection Optimization

v0.3.2（2025-09-18）Japanese text sanitization and From header spoofing countermeasures.

v0.3.1（2025-08-20）Mitigation of recursion depth limitations

v0.3.0 - Semantic Filtering Reinforced (2025-08-17)

v0.2.1 (2025-08-13)

v0.2.0 (2025-08-08)

v0.1.0

About

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Caveat: Negative Lookahead Recursion Depth in `fancy-regex`

Packages