Florida Man or Fiction

A true/false game where players guess whether a headline is a real Florida Man story or AI-generated fiction.

Introduction

This project combines web scraping, multi-agent AI behavior, and a Next.js frontend into a playable Florida Man or Fiction game.

Core loop:

Frontend presents a headline card.
Backend serves real and fake headlines.
Player guesses real vs fake.
Admin/agent workflows keep the headline pool fresh.

Current Status Snapshot

Phase 1 and Phase 2 are complete.
Phase 3 is complete (3.1 through 3.6).
Phase 3.3 scraping improvements are implemented (multi-source adapters, retries, metrics, dedupe).
Phase 3.4 generation hardening is implemented (OpenAI-primary with deterministic fallback, quality filters).
Phase 3.5 admin control plane is implemented (job queueing, polling, status lifecycle).
Phase 3.6 context augmentation is implemented end-to-end:
- recent real-headline context injection into generation prompts
- deterministic context ranking/filtering/windowing with source diversity
- provenance JSON included in generator output
- parsed result_provenance returned in admin job status API
- result_audit_id linked in admin job status API
- provenance/audit persistence in DB (generation_audits)
- provenance shown in admin UI status panel
CI split is stable.
Offline tests run automatically.
External/OpenAI paths remain isolated to manual/scheduled integration workflow.
Phase 4 polish/production work is now the active stage.

Tech Stack

Frontend

Next.js 16 (App Router, TypeScript)
Tailwind CSS
Jest

Backend

Python 3.13+
FastAPI
SQLAlchemy
Alembic
SQLite for dev
PostgreSQL for production-ready RAG

Agents

AutoGen 0.4+
OpenAI GPT-4o-mini
Requests + BeautifulSoup for scraping

Project Structure

Notes:

This README intentionally keeps both current implemented layout and planned/continuity paths.
Some continuity entries are intentionally listed even if not currently present to support roadmap tracking across sessions.

Current Implemented Layout (Source Of Truth)

flo-flo/
├── .github/
│   └── workflows/
│       ├── python-tests.ci.yml
│       ├── frontend-tests.ci.yml
│       └── integration-tests.manual.yml
├── backend/
│   ├── pyproject.toml
│   ├── app/
│   │   ├── main.py
│   │   ├── config.py
│   │   ├── db/
│   │   │   ├── database.py
│   │   │   └── repositories/
│   │   │       ├── generation_audit_repository.py
│   │   │       ├── headline_repository.py
│   │   │       └── token_usage_repository.py
│   │   ├── models/
│   │   │   ├── generation_audit.py
│   │   │   ├── headline.py
│   │   │   └── token_usage.py
│   │   ├── routers/
│   │   │   ├── game.py
│   │   │   └── admin.py
│   │   └── services/
│   │       └── headline_service.py
│   ├── migrations/
│   │   ├── env.py
│   │   └── versions/
│   │       ├── 18a6bfb4fa39_initial_schema.py
│   │       └── 9f2b4a7d1c0e_add_generation_audits.py
│   ├── tests/
│   │   ├── conftest.py
│   │   ├── test_db/
│   │   ├── test_routers/
│   │   └── test_services/
│   ├── alembic.ini
│   ├── requirements.txt
│   └── seed_data.py
├── agents/
│   ├── pyproject.toml
│   ├── pytest.ini
│   ├── src/
│   │   └── agents/
│   │       ├── __init__.py
│   │       ├── config.py
│   │       ├── scraper_agent.py
│   │       ├── generator_agent.py
│   │       ├── orchestrator.py
│   │       └── tools/
│   │           ├── __init__.py
│   │           ├── scraper.py
│   │           ├── database.py
│   │           └── generator_quality.py
│   ├── tools/  # compatibility namespace retained
│   └── tests/
│       ├── test_scraper_agent.py
│       ├── test_generator_agent.py
│       └── test_tools/
│           ├── test_tool_scraper.py
│           ├── test_tool_database.py
│           └── test_tool_generator_quality.py
├── frontend/
│   ├── src/
│   │   ├── app/
│   │   │   ├── page.tsx
│   │   │   └── admin/
│   │   │       └── page.tsx
│   │   ├── components/
│   │   ├── lib/
│   │   │   └── api.ts
│   │   └── types/
│   │       └── index.ts
│   ├── __tests__/
│   │   ├── app/
│   │   │   └── admin.page.test.tsx
│   │   └── lib/
│   │       └── api.test.ts
│   └── package.json
├── scripts/
│   └── canary_admin_job.sh
├── tests/
│   ├── test_api_integration.py
│   └── test_e2e_headline_flow.py
├── env.py
├── makefile
├── .gitignore
└── README.md

Planned/Continuity Paths (Intentionally Retained)

agents/
├── config.py                    # planned compatibility shim
├── scraper_agent.py             # planned compatibility shim
├── generator_agent.py           # planned compatibility shim
├── orchestrator.py              # planned compatibility shim
└── tools/
    ├── scraper.py               # planned compatibility shim
    └── database.py              # planned compatibility shim

frontend/__tests__/components/
├── Game.test.tsx                # planned
└── GameCard.test.tsx            # planned

frontend/__tests__/lib/
└── api.test.ts                  # implemented/planned expansion

Development Roadmap

Phase 1: Foundation ✅

Project structure + Next.js install
Backend scaffold (FastAPI + SQLite)
Database models (headlines table)
Seed data with test headlines
Frontend game UI (working end-to-end)

Phase 2: AI Agents ✅

AutoGen 0.4+ agent setup
Scraper agent (collect real headlines)
Generator agent (create fake headlines)
Database integration tools
Orchestrator for agent coordination

Phase 3: Agent Enhancement (Complete)

Goal: robust offline-first behavior, explicit external/openai test gates, stronger quality controls.

3.1 Fix Agent Execution

Debug why agents terminate without running tools
Add baseline tool-level tests for scraper/database paths
Verify database writes from agent tools (offline)
Add stronger tool schemas for AutoGen
Add richer generator assertions for real OpenAI path

3.2 Testing Strategy (Implemented)

Offline tests default (not external and not openai)
External scraping tests marked @pytest.mark.external
OpenAI integration tests marked @pytest.mark.openai
Manual/scheduled integration workflow with secret guard

3.3 Improve Scraping (Core Implemented)

Add additional real news source adapters beyond current conservative setup
Retry/backoff and timeout strategy
Stronger validation and dedupe metrics (scrape_with_metrics)

3.4 Enhance Generation (Core Implemented)

Connect generation path to OpenAI outputs (with deterministic fallback)
Baseline quality checks (length, phrase plausibility, duplicate filtering)

3.5 Admin Interface (Implemented)

Add endpoints to trigger scrape/generate jobs from API
Add frontend admin page to run jobs and show status/logs
Add admin job status endpoint and polling contract

3.6 Context Augmentation (Implemented)

Inject small recent real-headline context set into generation prompt
Include provenance metadata in generator output summary
Parse and expose result_provenance in admin job status API payload
Render provenance details in admin UI status panel (read-only)
Persist provenance/audit history in DB (migration + repository/service)
Expand context strategy beyond small recent set (ranking/filtering/windowing)

Phase 4: Polish & Production (Current)

Phase 5: Advanced Features (Future)

Getting Started

Prerequisites

Node.js 20+
Python 3.13+

Installation (Recommended, Repository Root)

git clone https://github.com/humanauction/flo-flo.git
cd flo-flo

python3.13 -m venv .venv
source .venv/bin/activate
python -m pip install --upgrade pip

# Install both packages editable
python -m pip install -e backend -e agents

# Test tooling
python -m pip install pytest pytest-asyncio pytest-cov

# Optional agents dev tooling (Ruff)
python -m pip install -e './agents[dev]'

Environment

Create backend/.env:

DATABASE_URL=sqlite:///./floridaman.db
OPENAI_API_KEY=your_key_here

Optional agent knobs (if used in your local flow):

OPENAI_MODEL=gpt-4o-mini
MAX_HEADLINES_PER_SCRAPE=10
TARGET_URL=https://floridaman.com/

Create frontend/.env.local:

NEXT_PUBLIC_API_URL=http://localhost:8000

Database + App Boot

cd backend
python -m alembic upgrade head
python seed_data.py
uvicorn app.main:app --reload --port 8000

# in another terminal
cd frontend
npm install
npm run dev

Run Agents

# from repo root, with editable installs active
python -m agents.orchestrator

Migration-First Workflow

Use Alembic as the only schema change path.

cd backend
python -m alembic revision --autogenerate -m "describe schema change"
python -m alembic upgrade head
python seed_data.py

Do not use runtime Base.metadata.create_all() for schema management.

API Endpoints

Game

GET /api/game/headline
POST /api/game/guess

Admin

GET /api/admin/stats
POST /api/admin/headline (manual insert)
POST /api/admin/scrape (queues scrape job, optional count 1-50, default 10)
POST /api/admin/generate (queues generate job, optional count 1-50, default 10)
GET /api/admin/jobs/{job_id} (returns queued/running/completed/failed state with result_summary, parsed result_provenance, and result_audit_id when available)

Testing

Quick Local Commands

# backend offline
cd backend
python -m pytest -m "not external and not openai"

# agents offline
cd ../agents
python -m pytest -m "not external and not openai"

# focused scraper/generator tool tests
python -m pytest -q tests/test_tools/test_tool_scraper.py
python -m pytest -q tests/test_tools/test_tool_generator_quality.py

# provenance-focused checks
python -m pytest -q agents/tests/test_generator_agent.py -k "provenance or openai_provider"
python -m pytest -q backend/tests/test_routers/test_admin.py -k "provenance or dedupe"

# agents lint
python -m ruff check agents/src/agents agents/tests

# frontend admin provenance panel test
cd frontend
npm test -- --verbose __tests__/app/admin.page.test.tsx

Root Integration Scaffolds

tests/test_api_integration.py
tests/test_e2e_headline_flow.py

CI Workflows

Python Tests (Offline)

File: .github/workflows/python-tests.ci.yml
Trigger: backend/agents push or pull request
Runs offline-only backend and agents suites

Frontend Tests

File: .github/workflows/frontend-tests.ci.yml
Trigger: frontend push or pull request
Runs npm test with coverage

Integration Tests (Manual)

File: .github/workflows/integration-tests.manual.yml
Trigger: manual + weekly schedule
Suites: external, openai, all
OpenAI path runs only when OPENAI_API_KEY is present

Contributing

Learning project, open to iteration. Keep changes small, tested, and roadmap-aligned.

License

MIT

Status: 🚧 Phase 4 Last Updated: April 17, 2026 Next Milestone: Phase 4.1 accounts/stats baseline plus UX loading/error polish

Name		Name	Last commit message	Last commit date
Latest commit History 98 Commits
.github/workflows		.github/workflows
agents		agents
backend		backend
frontend		frontend
scripts		scripts
tests		tests
.gitignore		.gitignore
README.md		README.md
makefile		makefile
ragAdr.md		ragAdr.md

Folders and files

Latest commit

History

Repository files navigation

Florida Man or Fiction

Introduction

Current Status Snapshot

Tech Stack

Frontend

Backend

Agents

Project Structure

Current Implemented Layout (Source Of Truth)

Planned/Continuity Paths (Intentionally Retained)

Development Roadmap

Phase 1: Foundation ✅

Phase 2: AI Agents ✅

Phase 3: Agent Enhancement (Complete)

3.1 Fix Agent Execution

3.2 Testing Strategy (Implemented)

3.3 Improve Scraping (Core Implemented)

3.4 Enhance Generation (Core Implemented)

3.5 Admin Interface (Implemented)

3.6 Context Augmentation (Implemented)

Phase 4: Polish & Production (Current)

Phase 5: Advanced Features (Future)

Getting Started

Prerequisites

Installation (Recommended, Repository Root)

Environment

Database + App Boot

Run Agents

Migration-First Workflow

API Endpoints

Game

Admin

Testing

Quick Local Commands

Root Integration Scaffolds

CI Workflows

Python Tests (Offline)

Frontend Tests

Integration Tests (Manual)

Contributing

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages