FailSafe is an open-source, modular framework designed to automate the verification of textual claims. It employs a multi-stage pipeline that integrates Large Language Models (LLMs) with retrieval-augmented generation (RAG) techniques to decompose complex arguments, retrieve high-quality evidence, and evaluate factuality with traceability.
FailSafe is not just a wrapper around an LLM; it is an implementation of recent research in cognitive architectures and automated reasoning. It addresses key challenges such as Sycophancy (LLMs agreeing with users) and Snowball Hallucinations (early errors compounding through a generation).
Read our full Technical Whitepaper for a deep dive into the research methodology, including Multi-Agent Debate, H-Neuron suppression, and FActScore atomic verification.
The system operates on a four-stage pipeline designed to mitigate hallucinations and ensure comprehensive coverage of the input text.
graph TD
Input[User Input] --> Layer0[Layer 0: Screening]
Layer0 -->|Pass| Layer1[Layer 1: Decomposition]
Layer0 -->|Fail| EarlyExit[Early Warning System]
Layer1 --> SAG[Structured Argumentation Graph]
SAG --> Claims{Extract Claims}
Claims -->|Check-worthy?| Layer2[Layer 2: Hybrid Retrieval]
Layer2 --> Cache{Knowledge Base}
Cache -->|Hit| Layer3
Cache -->|Miss| Search[Query Generation & Web Search]
Search --> Evidence
Evidence --> Layer3[Layer 3: Verification]
Layer3 --> Consistency[Cross-Examining Evidence]
Consistency --> Verdict[Status: Supported/Refuted]
Verdict --> Layer4[Layer 4: Synthesis]
Layer4 --> FinalReport[Final Report & References]
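To make the diagram concrete, here is a minimal, self-contained sketch of how the four layers chain together. Every function below is an illustrative stub written for this README, not FailSafe's actual API.

```python
# Conceptual sketch of the four-stage pipeline. All functions are illustrative
# stubs, not FailSafe's real components.

def screen(text: str) -> bool:
    """Layer 0: lightweight screening (metadata / stylometry heuristics)."""
    return len(text.strip()) > 0

def decompose(text: str) -> list[str]:
    """Layer 1: split input into atomic, check-worthy claims (stubbed)."""
    return [s.strip() for s in text.split(".") if s.strip()]

def retrieve(claim: str) -> list[str]:
    """Layer 2: fetch evidence from the vector cache or web search (stubbed)."""
    return [f"evidence for: {claim}"]

def verify(claim: str, evidence: list[str]) -> str:
    """Layer 3: cross-examine evidence and assign a status (stubbed)."""
    return "Supported" if evidence else "Refuted"

def run_pipeline(text: str) -> dict:
    """Layer 4 would synthesize these verdicts into a final report."""
    if not screen(text):
        return {"status": "rejected_by_screening"}
    return {claim: verify(claim, retrieve(claim)) for claim in decompose(text)}

print(run_pipeline("The Eiffel Tower is in Paris. It was completed in 1889."))
```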
Before expensive processing, the system filters content using lightweight models:
- Metadata Analysis: Evaluates source credibility (if URL provided).
- Stylometry: Detects sensationalist patterns using NLP heuristics to flag potential misinformation early.
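As an illustration of how cheap this screening can be, here is a hedged sketch of a sensationalism heuristic; the cue list and threshold are invented for demonstration and are not FailSafe's actual stylometry rules.

```python
import re

# Illustrative stylometry heuristic; the cue list and threshold are
# invented for demonstration, not FailSafe's actual screening rules.
SENSATIONAL_CUES = ["shocking", "you won't believe", "miracle", "exposed", "cure all"]

def looks_sensationalist(text: str) -> bool:
    lowered = text.lower()
    cue_hits = sum(cue in lowered for cue in SENSATIONAL_CUES)
    exclamations = text.count("!")
    all_caps_words = len(re.findall(r"\b[A-Z]{4,}\b", text))
    return (cue_hits + exclamations + all_caps_words) >= 3

print(looks_sensationalist("SHOCKING miracle cure doctors don't want you to see!!!"))  # True
```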
Instead of treating text as a blob, FailSafe converts it into a Structured Argumentation Graph (SAG).
- Components:
  - `factcheck.core.Decompose`: Resolves coreferences and breaks complex sentences into atomic, verifiable claims, discarding subjective opinions.
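The snippet below illustrates the input/output contract of this step conceptually. It does not call `factcheck.core.Decompose`; the example text and resulting claims are hypothetical.

```python
# Conceptual illustration of Layer 1's input/output contract.
# This does not call factcheck.core.Decompose; it only shows the kind of
# transformation the decomposition step performs.
raw_text = (
    "Marie Curie won two Nobel Prizes, and she was, in my opinion, "
    "the greatest scientist ever."
)

# After coreference resolution ("she" -> "Marie Curie") and opinion filtering,
# the decomposer would yield atomic, verifiable claims such as:
atomic_claims = ["Marie Curie won two Nobel Prizes."]
# The subjective clause ("the greatest scientist ever") is discarded
# because it is an opinion, not a checkable claim.
for claim in atomic_claims:
    print(claim)
```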
The system uses a hybrid approach to ground its verifications:
- Vector Database (ChromaDB): Stores previously verified facts for low-latency retrieval (Caching).
- Deep Search (Serper/Google): Fetches real-time information for novel claims.
- Consensus Mechanism: Multiple evidence sources are analyzed. A claim is only marked as "Supported" if a majority of high-trust sources agree.
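The sketch below shows how a cache-then-search retrieval flow and a majority-vote consensus rule could fit together. The data structures, trust flags, and threshold are assumptions made for illustration, not FailSafe's internal API.

```python
# Illustrative sketch of cache-then-search retrieval and majority-vote consensus.
from dataclasses import dataclass

@dataclass
class Evidence:
    source: str
    trusted: bool
    supports_claim: bool

def retrieve(claim: str, cache: dict, web_search) -> list[Evidence]:
    if claim in cache:                 # Layer 2a: cache hit (ChromaDB in FailSafe)
        return cache[claim]
    evidence = web_search(claim)       # Layer 2b: real-time search for novel claims
    cache[claim] = evidence            # store for low-latency reuse
    return evidence

def consensus_verdict(evidence: list[Evidence]) -> str:
    trusted = [e for e in evidence if e.trusted]
    if not trusted:
        return "Insufficient Evidence"
    agree = sum(e.supports_claim for e in trusted)
    return "Supported" if agree > len(trusted) / 2 else "Refuted"

def fake_search(claim: str) -> list[Evidence]:
    # Toy stand-in for the web-search backend.
    return [
        Evidence("who.int", trusted=True, supports_claim=False),
        Evidence("example-blog.com", trusted=False, supports_claim=True),
        Evidence("cancerresearchuk.org", trusted=True, supports_claim=False),
    ]

ev = retrieve("Lemon cures all cancers", cache={}, web_search=fake_search)
print(consensus_verdict(ev))  # Refuted
```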
- Orchestration: Python 3.10+, Celery, Redis (for asynchronous task management); see the sketch after this list.
- LLM Integration: Abstracted layer supporting Google Gemini, OpenAI GPT-4, and Anthropic Claude.
- Storage: ChromaDB (Vector Store), SQLite (Metadata).
- Frontend: Flask-based interface for interaction and visualization.
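As a rough sketch of how asynchronous orchestration with Celery and Redis might look, consider the following; the task name, broker URLs, and module layout are assumptions, and the repository's own task definitions are authoritative.

```python
# tasks_example.py -- a hedged sketch of Celery-based orchestration.
# Task name, broker/backend URLs, and module layout are assumptions for
# illustration; see the repository's own tasks module for the real setup.
from celery import Celery

app = Celery(
    "failsafe_example",
    broker="redis://localhost:6379/0",
    backend="redis://localhost:6379/1",
)

@app.task
def check_text_async(text: str) -> dict:
    # In the real system this would invoke the FailSafe pipeline,
    # e.g. FactCheck.from_config().check_text_with_progress(text).
    return {"input": text, "status": "queued_for_verification"}

# Caller side: enqueue the job and poll for the result.
# result = check_text_async.delay("The Eiffel Tower is in Rome.")
# print(result.get(timeout=120))
```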
The system is fully containerized for consistency across environments.
# 1. Clone the repository
git clone https://github.com/Amin7410/FailSafe-AI-Powered-Fact-Checking-System.git
cd FailSafe-AI-Powered-Fact-Checking-System
# 2. Configure Environment variables
cp .env.example .env
# Required: GOOGLE_API_KEY, SERPER_API_KEY
# 3. Build and Start Services
docker compose up -d --build

Alternatively, run the services manually:
- Dependencies: `pip install -r requirements.txt`
- Infrastructure: Ensure a local Redis instance is running (`redis-server`).
- Worker: `celery -A tasks worker --loglevel=INFO`
- Application: `python webapp.py`
Configuration is managed via `config.yaml`. Key parameters include:
- `pipeline.retriever`: Switch between `hybrid` (Search + Vector) and `serper` (Search only).
- `llm.default_model`: Specifies the reasoning engine (e.g., `gemini-2.0-flash`).
- `vectordb.embedding_model`: Model used for semantic search (default: `intfloat/e5-base-v2`).
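For orientation, here is a minimal sketch of how those keys might be laid out in `config.yaml`; the nesting shown is an assumption, so consult the file shipped with the repository for the authoritative structure.

```yaml
# Illustrative layout only; the nesting is assumed, not copied from the repo.
pipeline:
  retriever: hybrid            # or: serper
llm:
  default_model: gemini-2.0-flash
vectordb:
  embedding_model: intfloat/e5-base-v2
```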
FailSafe can be easily integrated into your own Python projects:
from factcheck import FactCheck
# Initialize the system with default configuration
fc = FactCheck.from_config()
# Run the pipeline
result = fc.check_text_with_progress("The Eiffel Tower is in Rome.")
print(f"Verdict: {result['verdict']}")
print(f"Summary: {result['summary']['message']}")See FailSafe in action with these real-world examples:
- Deep Scientific Analysis (Complex):
  - Content: A forensic scientific report analyzing a technical text about the role of Large Language Models (LLMs) in biomedicine.
  - Goal: Demonstrates the system's ability to verify complex scientific claims against peer-reviewed literature.
  - Result: 89% (Highly Factual). Correctly validates claims about AI advancements while identifying nuances in "minimal effort" repurposing.
- Visual Fact-Check Report (General):
  - Content: A visual fact-check report for the claim "Drinking lemon can cure all kinds of cancers."
  - Goal: Demonstrates the system's ability to debunk dangerous health misinformation using authoritative medical sources (e.g., National Academies, Cancer Research UK).
  - Result: 0% (Mostly False). Unequivocally refutes the existence of a "universal cure" and identifies the underlying logical fallacies.
- Visual Fact-Check Report (General):
  - Content: A multi-claim "fake news" scenario featuring future events (e.g., "OpenAI has officially released GPT-6", "AI granted human rights"), pseudoscience ("Alkaline Diet cures cancer"), and misattributed quotes (Albert Einstein on technology).
  - Goal: Demonstrates the system's capacity to handle multiple disparate claims within a single narrative and to identify logical impossibilities and anachronisms.
  - Result: 0% (Misinformation/Fiction). Classified as a complete fabrication.
Note: These files are actual outputs generated by the FailSafe engine. Download and open the HTML files in your browser to interact with the full analytics.
We welcome contributions! Please see our CONTRIBUTING.md for guidelines.
This project was inspired by and built upon the architectural framework of Loki. We acknowledge their pioneering work in automated fact-verification and their contribution to the open-source community.
This project is licensed under the MIT License - see the LICENSE file for details.