Add Agents.md evaluation skill, refine service agent guidance, and resolve merge conflicts#1377
Add Agents.md evaluation skill, refine service agent guidance, and resolve merge conflicts#1377saratpoluri wants to merge 7 commits into
Conversation
There was a problem hiding this comment.
Pull request overview
This PR standardizes and streamlines service-level Agents.md guides across multiple SceneScape microservices, adds a new rubric + efficacy-testing skill for evaluating Agents.md quality, and updates Copilot instructions to reference the new evaluation skill.
Changes:
- Added “Verification Gate (Standardized)” sections with command paths and pass criteria to multiple
Agents.mdguides. - Rewrote several service
Agents.mdfiles to be more concise and KPI/constraint-driven. - Introduced a new
.github/skills/agent_evaluation/skill (rubric + efficacy test procedure) and linked it from.github/copilot-instructions.md.
Reviewed changes
Copilot reviewed 10 out of 10 changed files in this pull request and generated 5 comments.
Show a summary per file
| File | Description |
|---|---|
| tracker/Agents.md | Adds a standardized verification gate table for tracker changes. |
| tools/tracker/evaluation/Agents.md | Adds a standardized verification gate table for evaluation tooling. |
| mapping/Agents.md | Rewrites mapping agent guide into a concise format and adds verification gate commands. |
| manager/Agents.md | Rewrites manager agent guide into a concise format and adds verification gate commands. |
| controller/Agents.md | Rewrites controller agent guide into a concise format and adds verification gate commands. |
| cluster_analytics/Agents.md | Rewrites cluster analytics agent guide into a concise format and adds verification gate commands. |
| autocalibration/Agents.md | Rewrites autocalibration agent guide into a concise format and adds verification gate commands. |
| .github/skills/agent_evaluation/agents-md-evaluation.md | Adds a scoring rubric and required JSON output format for evaluating Agents.md. |
| .github/skills/agent_evaluation/SKILL.md | Adds the evaluation skill entry and an efficacy-testing procedure. |
| .github/copilot-instructions.md | References the new agent evaluation skill and adds an on-demand loading trigger. |
Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com>
Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com>
|
@copilot address code review comments |
Addressed in commit |
|
@copilot resolve the merge conflicts in this pull request. Understand the intent of the changes we made in this PR. We evaluated the Agents.md instructions against the rubric we created, scored it against that and refined it. |
Resolved the merge conflicts by merging |
📝 Description
This PR standardizes and improves service-level
Agents.mdguidance, adds anagent_evaluationskill for scoring and efficacy testing ofAgents.md, updates Copilot instruction routing, and resolves merge conflicts withmainwhile preserving the PR’s intent.What was changed:
Agents.mdfiles to be more concise and KPI/constraint-driven, including standardized verification guidance..github/skills/agent_evaluation/with:SKILL.mdfor evaluation flow and efficacy procedureagents-md-evaluation.mdfor rubric-based scoring.github/copilot-instructions.mdto:SKILL.md.github/skills/.../SKILL.mdpath references consistentlymainand resolved conflicts in affectedAgents.mdand Copilot instruction files to keep the refined rubric-driven guidance intact.Why:
Agents.md.How this adds value:
Agents.mdimpact.mainwhile retaining all intended guidance refinements.✨ Type of Change
Select the type of change your PR introduces:
🧪 Testing Scenarios
Describe how the changes were tested and how reviewers can test them too:
✅ Checklist
Before submitting the PR, ensure the following: