Jobhunter

A safe, low-cost Telegram job-search assistant. It collects jobs from approved sources, ranks them against your profile, sends a short digest, and lets you improve the search by replying in plain English. It can also research sales leads from an ICP, but stores candidates only after you approve them.

Architecture and detailed contracts live in ARCHITECTURE.md. Contributor instructions are in AGENTS.md, and open work is tracked in tasks.md.

The Problem It Solves

Good roles are spread across many job boards, ATS pages, RSS feeds, and email alerts.
Looking at one or two sources misses too much; checking many sources manually takes too much time.
Basic keyword filters are too shallow for specific searches, for example "Product Manager building AI tools" vs "Product Marketing Manager at an AI company".
Saved searches drift: wrong language, wrong seniority, wrong timezone, unrelated role family, duplicate postings.
Feedback is usually wasted. When you reject a job, normal job boards do not learn your exact reason and adjust the next search.

How It Works

You give Jobhunter a detailed profile: target titles, preferred work, exclusions, languages, location/timezone, salary, and examples of what "good" looks like.
It indexes jobs from many approved sources in your category: public job boards, RSS/API feeds, ATS pages, and IMAP email alerts.
It deduplicates repeated postings across sources.
It ranks jobs in two passes: fast local rules first, then an optional LLM pass that checks fit against your full profile description.
It sends only the top matches to Telegram, with buttons for Irrelevant, Remind me tomorrow, Give me cover note, and Applied.
Your feedback changes future results. Button clicks and plain-English comments become training signals for source selection and scoring.
You can say things like skip jobs requiring German, deprioritize Product Marketing Manager, or prioritize product builder roles using Claude/Codex.
The agent can propose updated sources or scoring rules from that feedback; you approve changes before they are saved.
The same OpenClaw setup can research leads from input/icp.local.md: it searches public sources, shows candidates, saves only approved leads, and drafts copy-paste pitches without sending anything.
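
The deduplication step above can be pictured with a small sketch. This is not Jobhunter's actual logic: the field names (`url`, `company`, `title`), the `utm_` tracking-parameter rule, and the helper names are all assumptions chosen for illustration.

```python
import re
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit

# Assumption: query parameters with these prefixes are tracking noise.
TRACKING_PREFIXES = ("utm_",)


def canonical_url(url: str) -> str:
    """Lowercase scheme and host, drop tracking params, fragment, and trailing slash."""
    parts = urlsplit(url.strip())
    query = [
        (key, value)
        for key, value in parse_qsl(parts.query)
        if not key.lower().startswith(TRACKING_PREFIXES)
    ]
    return urlunsplit(
        (parts.scheme.lower(), parts.netloc.lower(), parts.path.rstrip("/"), urlencode(query), "")
    )


def fingerprint(job: dict) -> tuple:
    """Hypothetical content key: normalized (company, title)."""
    def norm(text: str) -> str:
        return re.sub(r"[^a-z0-9]+", " ", text.lower()).strip()

    return norm(job["company"]), norm(job["title"])


def dedupe(jobs: list) -> list:
    """Keep the first posting seen for each canonical URL or (company, title) pair."""
    seen_urls, seen_keys, unique = set(), set(), []
    for job in jobs:
        url, key = canonical_url(job["url"]), fingerprint(job)
        if url in seen_urls or key in seen_keys:
            continue
        seen_urls.add(url)
        seen_keys.add(key)
        unique.append(job)
    return unique
```

Matching on (company, title) alone would merge two genuinely different openings with the same title at one company, so a real implementation would probably add location or posting date to the key.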

What It Does

Collects jobs on demand from public RSS, JSON APIs, ATS boards, and IMAP email alerts (LinkedIn / Wellfound / Djinni / company alerts) with no logged-in scraping.
Ranks jobs in two layers: fast local rules, then an optional bounded LLM relevance pass that reads your free-form profile and skips obvious mismatches such as wrong role family, required language, or seniority.
Sends a Telegram digest of new jobs with per-job buttons: Irrelevant, Remind me tomorrow, Give me cover note, Applied.
Refines itself through chat: type /agent <request> or any normal free-form message and Codex, via OpenClaw and your subscription, investigates, answers, and proposes bounded changes you approve per action.
Runs scheduled OpenClaw maintenance: source collection every 4 hours, daily rescore, and monthly source review.
Researches lead candidates from your ICP through public sources and stores only approved candidates.
Generates tailored cover notes via OpenAI (paid, budget-gated).
Never applies to jobs, messages recruiters, sends email, logs into LinkedIn, or mounts browser cookies.
Every change the agent proposes is approval-gated, auto-archived, and one-tap reversible via /revert <id>.
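
A minimal sketch of the two-layer ranking described above, under stated assumptions: the keyword rules, the `l1_cutoff` parameter, the job fields, and the `l2_check` callable (standing in for the LLM pass) are hypothetical. Only the cap of 30 L2 checks mirrors the documented `JOBHUNTER_L2_MAX_JOBS` default.

```python
def l1_score(job: dict, include: list, exclude: list) -> int:
    """Cheap local rules: any exclude hit sinks the job, each include hit adds 1.

    Plain substring matching is used here, so short terms can over-match.
    """
    text = f"{job['title']} {job['description']}".lower()
    if any(term in text for term in exclude):
        return -100
    return sum(1 for term in include if term in text)


def rank(jobs, include, exclude, l2_check=None, l1_cutoff=1, l2_max_jobs=30, cache=None):
    """Order jobs by L1 score, then let an optional L2 check veto the top candidates."""
    cache = {} if cache is None else cache
    scored = [(l1_score(job, include, exclude), job) for job in jobs]
    survivors = sorted(
        (pair for pair in scored if pair[0] >= l1_cutoff),
        key=lambda pair: pair[0],
        reverse=True,
    )
    kept = []
    for position, (_, job) in enumerate(survivors):
        # Only the first l2_max_jobs survivors are sent to the (possibly paid) L2 pass.
        if l2_check is not None and position < l2_max_jobs:
            if job["id"] not in cache:  # cached per job, so reruns cost nothing
                cache[job["id"]] = l2_check(job)
            if not cache[job["id"]]:
                continue
        kept.append(job)
    return kept
```

In this sketch, jobs beyond the cap keep their L1-only ranking rather than being dropped; whether Jobhunter does the same is not stated here.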

Quick Start

You need: a Telegram bot token, your Telegram chat ID, and a Codex CLI subscription (ChatGPT Pro or equivalent). Optionally, an OpenAI API key for cover notes and the L2 relevance pass.

The runtime is real OpenClaw plus the local jobhunter-service. A small local OpenClaw tool plugin exposes Jobhunter actions in OpenClaw trajectories; there is no duplicate Codex-native MCP registration. Use ./bin/openclaw for normal operation. ./bin/jobhunter remains only as a temporary deprecated wrapper.

# 1. Configure secrets
cp .env.example .env
# edit .env and set TELEGRAM_BOT_TOKEN + TELEGRAM_ALLOWED_CHAT_ID
# (optional) set OPENAI_API_KEY for cover notes + better L2 relevance

# 2. Start both containers, then authorize Codex (one-time device login)
./bin/openclaw start
./bin/openclaw onboard

# 3. Check service and gateway health
./bin/openclaw status

OpenClaw owns the Telegram connection. After onboarding, send Get more jobs or /jobs to the Telegram bot. For leads, send /leads or a request such as find me 5 founders who raised Series A this week building AI products.

Required environment variables

Variable	Required	Notes
`TELEGRAM_BOT_TOKEN`	yes	From @BotFather
`TELEGRAM_ALLOWED_CHAT_ID`	yes	Your numeric chat ID; restricts the bot to your chat only
`OPENAI_API_KEY`	optional	Enables cover-note generation and the OpenAI-backed L2 relevance pass. Without it, L2 falls back to a coarser local heuristic and cover notes use a generic template.
`FIRECRAWL_API_KEY`	optional	Lets OpenClaw and `jobhunter-service` handle JS-heavy or Cloudflare-blocked community pages such as DOU.
`EXA_API_KEY`	optional	Lets OpenClaw search for new source candidates through Exa.

All other env vars (model, budget caps, agent quotas, IMAP credentials) have sensible defaults; see .env.example if you want to tune them.

Source fetches use JOBHUNTER_ROBOTS_TXT_RESPECT=ignore by default because the bot does not crawl, index, or recursively follow links: it fetches one configured public/API/RSS/ATS URL per source on a human-triggered run, usually only a few requests per source per day. Politeness is handled by the per-host rate limiter, 30s timeout, 8MB response cap, SSRF guard, and source approval flow. If you specifically want robots.txt enforcement, set JOBHUNTER_ROBOTS_TXT_RESPECT=trust or strict.
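
To make two of the fetch safeguards above concrete, here is a hedged sketch of an SSRF guard and a response-size cap using only the Python standard library. The function names are hypothetical and this is not the project's implementation; the 8 MB and 30 s constants simply echo the limits quoted above.

```python
import ipaddress
import socket
from urllib.parse import urlsplit

MAX_BYTES = 8 * 1024 * 1024  # 8 MB response cap, as quoted above
TIMEOUT_S = 30               # would be passed to the HTTP client's timeout argument


def is_public_address(ip_text: str) -> bool:
    """True only for addresses outside private, loopback, link-local, and similar ranges."""
    ip = ipaddress.ip_address(ip_text.split("%")[0])  # drop any IPv6 scope id
    return not (
        ip.is_private or ip.is_loopback or ip.is_link_local
        or ip.is_multicast or ip.is_reserved or ip.is_unspecified
    )


def check_url(url: str) -> None:
    """Raise ValueError if the URL is not http(s) or resolves to a non-public address."""
    parts = urlsplit(url)
    if parts.scheme not in ("http", "https"):
        raise ValueError(f"unsupported scheme: {parts.scheme!r}")
    if not parts.hostname:
        raise ValueError("missing host")
    port = parts.port or (443 if parts.scheme == "https" else 80)
    for info in socket.getaddrinfo(parts.hostname, port, proto=socket.IPPROTO_TCP):
        if not is_public_address(info[4][0]):
            raise ValueError(f"blocked address: {info[4][0]}")


def read_capped(stream, limit: int = MAX_BYTES) -> bytes:
    """Read at most `limit` bytes; raise instead of silently truncating a larger body."""
    data = stream.read(limit + 1)
    if len(data) > limit:
        raise ValueError("response exceeds size cap")
    return data
```

A check-then-fetch guard like this one does not pin the resolved IP, so it leaves a DNS-rebinding window; a production guard would connect to the address it validated.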

How To Use It

From Telegram (everything you need day-to-day)

After ./bin/openclaw start and ./bin/openclaw onboard:

Set your profile — type a normal message such as please replace my about-me profile with: ..., or use /agent <same request>. Example:

please replace my about-me profile with: Product manager / product builder.
Goal: build product prototypes
or implement features in existing products with Claude Code, Codex, or related
AI tooling. Strengths: discovery, product analytics, fast prototyping. Avoid:
internships, junior-only roles.

Get your first digest — type Get more jobs or /jobs. OpenClaw calls jobhunter_get_more_jobs; if the source queue is stale, it runs collection first, then renders fresh jobs.
React to each card — tap Irrelevant / Remind me tomorrow / Give me cover note / Applied.
Teach the system in plain English — type skip jobs that mention German required or prioritize Product Builder roles building with Claude or Codex. The agent proposes a directive change; you approve; the next Get more jobs reflects it.
Refine sources and scoring when needed — tap Update sources or Tune scoring. The agent proposes changes; you approve per-candidate or per-rule.
Research leads when needed — write your ICP in input/icp.local.md, then type /leads to see saved leads or ask for new research. Example: find me 5 founders who raised Series A this week building AI products. The agent shows candidates first; it saves them only after you approve.

Operator commands

./bin/openclaw onboard    # one-time Dockerized OpenClaw onboarding, no host daemon install
./bin/openclaw start      # start jobhunter-service + openclaw-gateway
./bin/openclaw stop       # stop both containers
./bin/openclaw restart    # rebuild and recreate both containers
./bin/openclaw config     # print the container-relative plugin + skill config snippet
./bin/openclaw cron-install # register/refresh the scheduled OpenClaw jobs; may ask for OpenClaw scope approval
./bin/openclaw cron-list  # list registered OpenClaw cron jobs
./bin/openclaw status     # check service and gateway health
./bin/openclaw logs       # follow both logs; use "logs gateway" or "logs service"
./bin/openclaw shell      # shell into jobhunter-service; use "shell gateway" for OpenClaw

The gateway exposes Jobhunter through the local jobhunter-tools OpenClaw plugin, which calls jobhunter-service over the Compose network. See MIGRATION_DISCOVERY.md, MIGRATION_NOTES.md, and skills/jobhunter/README.md.

Telegram commands cheatsheet

Command	Use
`/agent <text>`	Ask the agent to investigate and propose bounded actions
any normal text	Same agent path; good for feedback, questions, and profile/source/scoring requests
`/history`	Last 10 agent-applied actions
`/revert <id>`	Restore the archived file for a reversible agent action
`/applied`, `/snoozed`, `/irrelevant`	Retrieve recent jobs by status (since cards leave the chat after each action)
`/jobs`, `/sources`, `/tune`, `/usage`	Slash equivalents of the four reply-keyboard buttons
`/refresh`	Force a background source refresh without waiting for the stale gate
`/leads`	Show saved lead candidates; for new research, ask in plain English

Agent Examples

The table below shows what /agent requests look like in practice. Each request routes through OpenClaw + Codex, returns an answer, and may include one or more proposed_actions that you approve one at a time in chat.

You type	Bot returns	Approve to apply
`skip jobs whose description is primarily in German`	"I'll add a directive to skip German-language jobs."	`directive_edit` writes a timestamped line under `# Directives`
`prioritize Product Builder roles that build with Claude or Codex; deprioritize generic PM`	"Got it — adding a priority directive that L2 will apply per-job."	`directive_edit`. Next `Get more jobs` reflects it via L2's `priority: high` tag
`please remove the directive about language`	"I'll drop that directive."	`directive_edit` with a removal patch
`/agent please add this and figure out how to fetch it: https://jobs.dou.ua/vacancies/?category=Product%20Manager`	"I fetched the page, found the RSS at /feeds/?category=Product+Manager, and validated it returns 30 entries."	`sources_proposal` (add the RSS) + `directive_edit` (mark it priority)
`/agent you missed https://weworkremotely.com/remote-jobs/webpt-principal-product-manager from 2 days ago. why?`	Root-cause analysis with two repair options	`sources_proposal` (change source type) or `human_followup` (file a task)
`which sources produced jobs I applied to in the last 30 days?`	"RemoteOK: 4, We Work Remotely: 2, Arbeitnow: 1."	None; `data_answer` shown inline
`jobs I applied to yesterday`	List with timestamps	None; just data
`/agent suggest 3 niche aggregators I'm missing`	Three candidates with rationale	`sources_proposal` with 3 entries; HEAD-probed before approval prompt
`refine my about-me profile for clarity without changing intent`	"Tightened wording. Diff: ..."	`profile_edit` replaces `# About me`; `# Directives` untouched
`/agent stop suggesting individual company career pages — focus on aggregators`	"Captured. Future discovery runs will steer toward aggregators."	`directive_edit`
`/agent backup my config and profile`	Path to the new tar.gz	`backup_export` already executed
`/agent show me what rule fires most often`	Top-firing rules with counts	None; just analysis

Every write action is gated behind [Apply 1] [Apply 2] [Apply all] [Reject all] buttons. Applied actions are logged with the file diff archived; /revert <id> restores any reversible file change byte-for-byte.
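
The archive-then-revert idea above can be sketched in a few lines. The paths, the archive naming scheme, and the function names below are assumptions for illustration, not the bot's actual layout.

```python
import shutil
from pathlib import Path


def apply_edit(target: Path, new_text: str, archive_dir: Path, action_id: int) -> Path:
    """Copy the current file into the archive, then write the approved change."""
    archive_dir.mkdir(parents=True, exist_ok=True)
    archived = archive_dir / f"{action_id}-{target.name}"  # hypothetical naming scheme
    shutil.copy2(target, archived)  # byte-level copy of the pre-change file
    target.write_text(new_text, encoding="utf-8")
    return archived


def revert(target: Path, archived: Path) -> None:
    """Restore the archived bytes over the target file."""
    if not archived.exists():
        raise FileNotFoundError("no reversible archive")
    shutil.copy2(archived, target)
```

Because the archive is a copy of the original bytes, not a re-rendered version, the restore is exact even for files with unusual line endings.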

Cost Controls

Item	Default cap
L1 per-job scoring	Free (no LLM)
L2 relevance pass	OpenAI at indexing/collection time for new candidates above the L1 cutoff, ≤ `JOBHUNTER_L2_MAX_JOBS=30` per source run, cached per job (~$0.003/run typical with current caps)
Cover notes	OpenAI, `gpt-4o-mini`, daily/monthly budget gate, 10/day, one-tap override on overage
Agent requests	Codex subscription (no per-call cost), 20/day, 10s cooldown, capped per-request (5 turns / 20 SQL / 10 file reads / 5 fetches / 180s)
Bulk write actions	Approval tap PLUS typed `CONFIRM <id>` reply within 60s
Collection	At most 1 run per 10 minutes when triggered by `Get more jobs`

Check current spend any time via Usage, /usage, or ./bin/openclaw status.
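
As an illustration of how a daily quota and a cooldown can combine, here is a small sketch. The class name, the UTC day boundary, and the order of the checks are assumptions; only the defaults (20 per day, 10 s cooldown) and the quota message come from this README.

```python
from dataclasses import dataclass


@dataclass
class AgentLimiter:
    per_day: int = 20
    cooldown_s: float = 10.0
    _day: int = -1
    _count: int = 0
    _last: float = float("-inf")

    def allow(self, now: float) -> tuple:
        """Return (allowed, reason) for a request at `now` (seconds since the epoch)."""
        day = int(now // 86400)  # assumption: the quota resets at UTC midnight
        if day != self._day:
            self._day, self._count = day, 0
        if now - self._last < self.cooldown_s:
            return False, "cooldown"
        if self._count >= self.per_day:
            return False, "Daily agent quota reached"
        self._count += 1
        self._last = now
        return True, "ok"
```

Passing the clock in as an argument keeps the limiter easy to test; a caller would normally supply `time.time()`.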

Email Alerts (Optional)

Use email alerts for sites that don't expose a public RSS / API but do offer email notifications (LinkedIn, Wellfound, Djinni, company alerts, Google Alerts). The bot reads only the configured IMAP folder and never sends email.

# .env
EMAIL_IMAP_HOST=imap.gmail.com
EMAIL_IMAP_USERNAME=you@example.com
EMAIL_IMAP_PASSWORD=<Gmail app password>
EMAIL_IMAP_FOLDER=job-alerts

Then add a source row per sender (you can do this from Telegram via /agent please add a source for djinni.co alerts at FROM no-reply@djinni.co):

{
  "id": "linkedin-alerts",
  "name": "LinkedIn Email Alerts",
  "type": "imap",
  "url": "imap://job-alerts",
  "status": "active",
  "priority": "high",
  "created_by": "user",
  "query": "FROM \"jobs-noreply@linkedin.com\""
}

The IMAP collector tracks per-source UID high-water marks, so old messages are not reprocessed forever. If a digest email contains several jobs, the agent can propose an email_parser_proposal template so future emails from that sender produce distinct job rows instead of one generic row per link.
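
A rough sketch of the UID high-water-mark technique mentioned above, assuming the source row's `query` is appended to a UID range in the IMAP search criteria. The helper names are hypothetical, and how Jobhunter persists the mark is not shown.

```python
def build_search(query: str, last_uid: int) -> str:
    """Combine a UID range starting just above the high-water mark with the source query."""
    return f"UID {last_uid + 1}:* {query}"


def new_uids(search_result: bytes, last_uid: int) -> list:
    """Parse a space-separated UID list and keep only UIDs above the mark.

    The filter matters: in IMAP, `n:*` always matches the newest message,
    even when its UID is below n, so an idle mailbox still returns one UID.
    """
    uids = sorted(int(token) for token in search_result.split())
    return [uid for uid in uids if uid > last_uid]


def advance(last_uid: int, processed: list) -> int:
    """New high-water mark after processing; unchanged if nothing was processed."""
    return max([last_uid, *processed])
```

With the standard library's `imaplib`, the criteria string would typically be passed to `IMAP4.uid("SEARCH", ...)` after selecting the configured folder, and the first element of the returned data fed to `new_uids`.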

Troubleshooting

Symptom	Check
No Telegram messages	Verify bot token + chat ID; `./bin/openclaw logs gateway`
`Get more jobs` says wait	Collection rate limit kicked in; try after the shown wait
Digest is empty	`/agent please tune scoring to be more permissive`, or add sources
Bad jobs reach the digest	Type the pattern to skip; approve the resulting `directive_edit`
Good jobs are hidden	Ask `why was [URL] not in my last digest?`, then teach the fix in plain English
Same job repeats	Snoozed-due jobs are allowed to reappear once; otherwise check `digest_log`
Cover note denied	Use `Usage` to see budget; approve the one-time override only if intended
Agent proposal never appears	`./bin/openclaw logs gateway`, then verify trajectories contain `jobhunter_*` tool calls
`Daily agent quota reached`	Default 20/day; raise `JOBHUNTER_RATE_LIMIT_AGENT_PER_DAY` in `.env`
Codex login expired	Run `./bin/openclaw onboard` and follow OpenClaw/Codex auth prompts if needed
`/revert` says "no reversible archive"	Some action kinds (data_answer, human_followup, bulk_update_jobs, rescore_jobs) don't archive a file; not reversible by `/revert` today

Safety Notes

Do not mount browser profiles, cookies, your home directory, SSH keys, or /var/run/docker.sock.
Do not enable email sending.
Do not give the bot recruiter messaging credentials.
Keep OpenAI keys scoped to a dedicated low-budget project.
Apply manually outside the bot; then tap Applied.
Treat /agent approvals like config changes: read each proposed action's summary, apply only what you want, and use /history + /revert <id> if a reversible file edit goes sideways.

Reference

The rest of this file is reference material. You won't need most of it for normal use — Telegram and ./bin/openclaw cover the daily workflow.

Configuration Files

You usually don't need to touch these. Telegram covers everything once the bot is running. The files exist mostly to bootstrap the very first run and to store the agent's audit trail.

File	What it is	Day-to-day editing
`input/profile.local.md`	Your profile (`# About me` + `# Directives`). Required to bootstrap before you can talk to Telegram.	Use normal agent chat after bootstrap. Direct file edits also work.
`input/cv.local.md`	Optional CV text, used only for cover notes.	Edit the file; no Telegram command for CV today.
`input/icp.local.md`	Optional ICP for lead research and pitch drafting.	Edit the file before asking for leads.
`config/sources.json`	Source registry. Ships with sensible defaults (Remotive, RemoteOK, Arbeitnow, WeWorkRemotely, optional IMAP).	Tap `Update sources` in Telegram or `/agent please add ...`. Direct edits also work.
`config/scoring.json`	Active scoring ruleset. Ships with a baseline.	Tap `Tune scoring` in Telegram. Direct edits work but the agent's shadow-test path is safer.
`config/jobhunter.json`	Budgets and runtime config.	Edit only if the defaults don't fit.
`.env`	Secrets and runtime paths.	Edit before first start; rare changes after.

For the very first run, copy the example files:

cp .env.example .env
cp input/profile.example.md input/profile.local.md   # optional; init copies the example if missing
cp input/cv.example.md input/cv.local.md             # optional, only for cover notes
cp input/icp.example.md input/icp.local.md           # optional, only for lead research

The bot auto-copies input/profile.example.md, input/cv.example.md, and input/icp.example.md into local files if you skip the copy. If you have a legacy config/profile.local.json, the bot folds it into # About me on first init and backs the JSON up.

Source Lifecycle and Priority

Each source row has a status (active / test / disabled) and a priority (high / medium / low). High-priority sources fetch first per Get more jobs click. Agent-discovered sources land as status: test, created_by: agent and auto-promote to active when you mark a job from them as Applied or request a cover note.

Robots.txt handling is opt-in. The default ignore policy is intentional for this narrow bot: it fetches documented public feeds or a single configured page, not a search-engine-scale crawl. trust respects robots.txt for unknown/non-low-risk sources while allowing user and low-risk sources; strict checks robots.txt for every public source.
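
The status and priority rules above might look like this in code. It is a sketch under assumptions: that `test` sources are fetched alongside `active` ones, and that feedback arrives as the hypothetical signal names `applied` and `cover_note`.

```python
PRIORITY_ORDER = {"high": 0, "medium": 1, "low": 2}


def fetch_order(sources: list) -> list:
    """Skip disabled sources and fetch high-priority ones first.

    sorted() is stable, so sources with equal priority keep their registry order.
    """
    enabled = [s for s in sources if s["status"] in ("active", "test")]
    return sorted(enabled, key=lambda s: PRIORITY_ORDER[s["priority"]])


def promote_on_signal(source: dict, signal: str) -> dict:
    """Return a promoted copy when an agent-discovered test source earns a positive signal."""
    earned = signal in ("applied", "cover_note")
    if source["status"] == "test" and source.get("created_by") == "agent" and earned:
        return {**source, "status": "active"}
    return source
```

Returning a copy instead of mutating the row keeps the promotion easy to show as a diff before it is written back to the registry.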

Python Commands (Development / Debugging)

For day-to-day use, prefer Telegram + ./bin/openclaw. The Python CLI is for development and smoke testing.

python3 -m jobhunter init           # init schema, sources, local profile files
python3 -m jobhunter collect        # fetch from enabled sources, L1-score, and index capped L2 relevance
python3 -m jobhunter digest         # print current ranked digest rows as JSON
python3 -m jobhunter leads          # print current lead digest rows as JSON
python3 -m jobhunter run-once       # init + collect once
python3 -m jobhunter service        # local HTTP service used by OpenClaw plugin tools
python3 -m jobhunter usage          # local OpenAI usage summary

There's no CLI for /agent itself — the agent surface is Telegram-only by design (every action is approval-gated and the chat is the audit log).

Raw Docker Commands

The launcher above is recommended. Raw Compose is useful for debugging:

docker compose --profile openclaw up -d jobhunter-service openclaw-gateway
docker compose logs -f jobhunter-service
docker compose --profile openclaw logs -f openclaw-gateway
docker compose --profile openclaw config --quiet

Data and Backups

Path	Contents	Gitignored?
`data/jobs.sqlite`	Jobs, leads, scores, verdicts, feedback, digests, drafts, usage, agent_actions audit	yes
`data/backup/`	Archives produced by `/agent backup ...` and your manual SQLite snapshots	yes
`data/email_samples/`	Recent IMAP alert bodies that the agent can inspect to propose better email parsers	yes
`config/sources.json`, `config/scoring.json`, `config/jobhunter.json`	Source registry, scoring rules, runtime config	committed
`config/scoring.v<n>.json`	Auto-archived previous scoring versions (used by `/revert`)	committed
`input/profile.local.md`, `input/cv.local.md`, `input/icp.local.md`	Your private profile, optional CV, and optional ICP	yes
`input/profile.<ts>.md.bak`	Auto-archived previous profile versions	yes
Docker volume `openclaw_home`	OpenClaw state, sessions, trajectories, and per-agent Codex config	docker-managed
`~/.codex`	Codex auth mounted read-only into the gateway	outside repo

Manual SQLite backup:

mkdir -p data/backup
sqlite3 data/jobs.sqlite ".backup 'data/backup/jobs-$(date +%Y%m%d-%H%M%S).sqlite'"

Or via the agent: /agent backup my config and profile.
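
If the `sqlite3` command-line tool is not installed, the same online backup can be taken from Python's standard library. This is a sketch, not a shipped command; the function name is hypothetical and the file naming only mirrors the shell example above.

```python
import sqlite3
from datetime import datetime
from pathlib import Path


def backup_sqlite(db_path: Path, backup_dir: Path) -> Path:
    """Copy a live SQLite database into a timestamped file using the backup API."""
    if not db_path.exists():
        # sqlite3.connect would otherwise create an empty database silently.
        raise FileNotFoundError(db_path)
    backup_dir.mkdir(parents=True, exist_ok=True)
    stamp = datetime.now().strftime("%Y%m%d-%H%M%S")
    dest = backup_dir / f"jobs-{stamp}.sqlite"
    src = sqlite3.connect(db_path)
    dst = sqlite3.connect(dest)
    try:
        src.backup(dst)  # consistent snapshot even if the service is writing
    finally:
        dst.close()
        src.close()
    return dest
```

For example, `backup_sqlite(Path("data/jobs.sqlite"), Path("data/backup"))` would write a snapshot next to the agent-produced archives.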

Mental Model and Architecture

For how the two containers talk to each other, the bounded action set, the L1/L2 split, the OpenClaw tool surface, and the audit-and-revert chain, see ARCHITECTURE.md. The README intentionally does not duplicate that material.
