Building deployment ready apps w/o writing code
Problem Statement
- We address the challenge of consistently delivering quality code with Agentic solutions
- Foundational language models, no matter how advanced, have a chance of hallucinating output
- This results in reduced quality of delivered software in the best case, and hard to track bugs and vulnerabilities at the worst
- Considering the consumption models for AI services, this also has growing financial implications
- This affects AI users across all backgrounds and skill levels
- Technical users and non-technical folks are both at risk of receiving incorrect outputs from AI and may not realize it until it becomes costly to correct
- Considering the business needs for increased agility with smaller staff counts, this urgency is compounded by the non-deterministic nature of Generative and Agentic AI technology
- Less staff altogether means less specialists, bandwidth, and resilience to the unknown(under those circumstances), in exchange for increase workloads per staff member
- Without a series of corrective rules, guidelines, and quality control assets, this becomes something similar to scope creep
By structuring how IBM Bob operates through skill specification, agents and OpenAPI description (the NYC Benefits Screening YML File), the solution ensures AI becomes a dependable engineering teammate rather than an unpredictable tool, helping teams do more with less. While also making critical services more accessible to real people who need them most.
- Structured AI Agent Skills
Giving Bob step-by-step recipe, Skills specifications as a guide.
- Hallucination Reduction
Structured rules and guideline, creating consistency.
- Guided Solution Engineering
Bob is methodically walked through a guide instead of being told “build me something”
- Works for All Skill Levels
Technical and non- technical can rely on the outputs, due to quality no matter the level of the user.
- Cost Control
Reducing hallucinations and bad outputs, reduces financial cost of fixing mistakes.
- Real- World Application
Applied to NYC Benefits Screening, making Government benefit information accessible via chats without needed a computer.
- Agentic coding via Spec Driven Development
Working primarily through skill specs, agent personas and OpenAPI/ YML files rather than writing large code, makes it accessible for smaller teams.
- Optional architecture diagram
Similar to Design Language, diagrams serve as a common communication medium of expectations for people and AI alike
- We used about 3 task sessions to produce our solution
- Total reported Bob token cost came in at under 10 tokens, less than 25% of the individual contributor budget and just above 5% of the team budget
- Compared to teams that exhausted their budgets before submission, this makes a compelling statement for efficiency
- Using a combination of agent definitions, agent skills, API documentation(OpenAPI), and generated API client code...
- We were able to create a properly documented, containerized instance of a CrewAI application
- We were also able to extend the base CrewAI application with a custom tool connected to a 3rd party web service
- Bob reports and screenshot of majority token consumption may be found at bob-report/ in the project root directory
watsonx.ai / watsonx Orchestrate Integration
- WatsonX is the LLM provider for our CrewAI Agentic application
- The CrewAI application is an example of what can be generated with our solution
- Using the app require getting credentials from the NYC Benefits Screening API, which takes a few hours to a few days for approval
- This also applies to receiving test credentials
- CrewAI example app
- Recent
dockerinstallation - NYC Benefits Screening API Credentials
- Infrastructure Architect Agent
- Bob IDE
- Bob Shell
- Other LLM harnesses capable or reading Agentskills specifications
- Recent
- If you don't have an app on hand, start a planning session by asking to design a multi-container solution
- Make a copy of the
mystack.csvspreadsheet found at.bob/skills/design-container-solution/references/and fill it out with details for each container you want to use - Submit the new CSV file to the agent
- Confirm the generated Mermaid chart and docker-compose files are to your liking
- Spec Driven Workflow with CrewAI screenshots
- Video walkthrough of generating container solution via prompting
.bob: Bob skills atskillsand notes atnotesbob-reports: location of reports, Bobalytics, and screenshotsny-benefits-screener: location of CrewAI appny-benefits-screener/dockerfile: dockerfile for NY-Benefits-Screenerny-benefits-screener/README.md: README and start up instructions for NY-Benefits-ScreenerAGENTS.md: Sub agent details forinfrastructure-architectsubagent -ny-benefits-oas: API Client forny-benefits-screenerny-benefits-oas/README.md: README and start up instructions for NY-Benefits-OAS
- Jason Scoon|Team Lead: Coding, documentation, and project planning
- Tarcia Pillows| Team Member: reasearch, documentation, project planning, support across all areas of project
- Salman Abiola Suleiman| Team Member: reasearch, project planning, support across all areas of project
- Vincent Jared| Team Member: reasearch, project planning, support across all areas of project
• MIT
- CrewAI
- OpenAPI
- OpenAPI-generator
- agentskills.io
- mermaid.ai
- Contact / Support (optional) • Team Lead may be reached at jasonc.scoon@gmail.com