Releases · webtech-network/autograder

27 May 02:09

ArthurCRodrigues

0.4.0

5c645e9

0.4.0 Latest

New Features

Remote Sandbox Manager API: The Sandbox Manager can now run as a standalone FastAPI service, decoupling sandbox orchestration from the main Autograder API. A new remote mode enables the main service to communicate with the Sandbox Manager over REST, allowing independent scaling, resource isolation, and shared sandbox pools across multiple API instances. Includes Docker/Compose configuration, health checks, and timeout handling.
AI Algorithm Verification Tests: A new category of static analysis tests uses LLMs to verify that student code faithfully implements a specific algorithm (e.g., Quick Sort, Binary Search, Dijkstra). Three test functions are available: ai_sorting_algorithm, ai_search_algorithm, and ai_graph_algorithm. Each one validates the algorithmic logic and complexity characteristics, and checks that no built-in library shortcuts are used. Requires an active LLM provider configured via OPENAI_API_KEY.
total_score GitHub Action Output: The final numeric score (0–100) is now exposed as an action output (total_score) via GITHUB_OUTPUT, allowing downstream workflow steps to consume the grading result.
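
The total_score output described above relies on the standard GitHub Actions mechanism of appending name=value lines to the file named by the GITHUB_OUTPUT environment variable. The following is a minimal sketch of that mechanism, not the project's actual code; the helper name write_action_output is hypothetical.

```python
import os


def write_action_output(name: str, value: str) -> bool:
    """Append ``name=value`` to the file that GitHub Actions exposes via GITHUB_OUTPUT.

    Hypothetical helper for illustration. Returns False when GITHUB_OUTPUT is
    unset, for example when the grader runs outside a workflow.
    """
    output_path = os.environ.get("GITHUB_OUTPUT")
    if not output_path:
        return False
    # Outputs are appended so that earlier outputs of the same step survive.
    with open(output_path, "a", encoding="utf-8") as fh:
        fh.write(f"{name}={value}\n")
    return True
```

A later workflow step can then read the value with an expression such as steps.<step-id>.outputs.total_score, where <step-id> is whatever id the workflow assigns to the grading step.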

Bug Fixes

PreFlight crash with no language: Fixed AttributeError when using the webdev template preset without specifying submission-language. Returns an empty LanguageSetupConfig instead of crashing on None.value.
Submission persistence after creation: Fixed SubmissionRepository.create() not flushing/refreshing the entity, which caused auto-generated IDs to be missing and triggered IntegrityError in downstream submission results.
Test suite schema drift: Migrated test fixtures from the old language string field to the current languages list field in GradingConfiguration, resolving widespread test failures.
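
The PreFlight fix above amounts to guarding against a missing language before reading its .value. A rough sketch of that pattern follows; the Language enum, the single field on LanguageSetupConfig, and the function name resolve_setup_config are assumptions made for illustration and are not taken from the codebase.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional


class Language(Enum):
    """Hypothetical enum standing in for the project's language type."""
    PYTHON = "python"
    JAVA = "java"


@dataclass
class LanguageSetupConfig:
    """Hypothetical stand-in; the real class likely has more fields."""
    language: Optional[str] = None


def resolve_setup_config(submission_language: Optional[Language]) -> LanguageSetupConfig:
    # Without this guard, submission_language.value raises AttributeError
    # when no language was specified (for example with the webdev preset).
    if submission_language is None:
        return LanguageSetupConfig()
    return LanguageSetupConfig(language=submission_language.value)
```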

What's Changed

fix: handle None submission_language in PreFlightStep by @ArthurCRodrigues in #325
Add total_score output and GITHUB_OUTPUT write by @matheusmra in #326
docs: refactor GitHub Action documentation by @ArthurCRodrigues in #327
Refresh submission on add; update tests by @matheusmra in #329
Add Sandbox Manager API and remote mode by @matheusmra in #330
feat(static_analysis): add AI algorithm tests (#332) by @ArthurCRodrigues in #333

Full Changelog: 0.3.1...0.4.0

Contributors

matheusmra and ArthurCRodrigues

16 May 18:24

ArthurCRodrigues

0.3.1

9325cf9

0.3.1

Marketplace

What's Changed

fix: prevent event-loop blocking in deliberate execution sandbox lifecycle by @ArthurCRodrigues in #317
Rename CRITERIA_ASSETS_* to EXTERNAL_ASSETS_* by @matheusmra in #320
fix(security): prevent shell command injection in sandbox file ops (#313) by @ArthurCRodrigues in #321
fix: secure command construction in SandboxContainer by @ArthurCRodrigues in #322
feat: enable integration tests on CI PRs by @ArthurCRodrigues in #323

Full Changelog: 0.3.0...0.3.1

Contributors

matheusmra and ArthurCRodrigues

07 May 00:17

ArthurCRodrigues

0.3.0

91035f0

0.3.0

New Features

AST-based Structural Analysis: Introduced a new StructuralAnalysisStep in the grading pipeline using ast-grep to parse submission files once and pass structural roots downstream for static checks.
Forbidden Keyword Structural Test: Added forbidden_keyword test support backed by AST queries for reliable construct detection.
Multiple Template Support: Pipeline/template loading now supports multiple templates (list or comma-separated), allowing mixed capabilities in a single grading config.
Template Split for Static vs Execution Tests: Static code analysis tests were moved to a dedicated static_analysis template, while execution-oriented tests remain in execution templates (e.g., input_output).
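
Accepting templates either as a list or as a comma-separated string, as described above, usually comes down to a small normalization step. The sketch below shows one way to do it; the function name is hypothetical, and lowercasing and de-duplication are assumptions rather than documented behavior.

```python
from typing import Iterable, List, Union


def normalize_template_names(raw: Union[str, Iterable[str]]) -> List[str]:
    """Turn 'a, b' or ['a', 'b'] into a clean, ordered list of template names."""
    items = raw.split(",") if isinstance(raw, str) else list(raw)
    names: List[str] = []
    for item in items:
        name = item.strip().lower()  # assumption: names are case-insensitive
        if name and name not in names:  # skip blanks and duplicates
            names.append(name)
    return names
```

With this sketch, "input_output, static_analysis" and ["input_output", "static_analysis"] both resolve to the same two-element list.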

Refactors

Grader Architecture Cleanup: Extracted SubmissionGrader from GraderService and reorganized grader services for clearer separation of responsibilities and maintainability.

Bug Fixes

Multi-template Regression Fixes: Fixed submission_language collision in grading execution, improved template-name normalization/validation, and resolved stale template state in criteria tree service.
Static Analysis Reliability: Hardened structural-analysis availability handling and fixed regex escaping edge cases in forbidden-import checks.
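
The regex escaping issue mentioned above is the kind of bug that appears when a module name such as os.path is interpolated into a pattern unescaped, so that the dot matches any character. The sketch below illustrates the general remedy with re.escape; the function name and the pattern itself are assumptions, it targets Python import syntax only, and it is not the project's actual check.

```python
import re


def build_forbidden_import_pattern(module: str) -> "re.Pattern[str]":
    """Build a pattern that matches 'import <module>' or 'from <module> ...' lines."""
    # re.escape makes characters like '.' literal instead of regex wildcards.
    escaped = re.escape(module)
    return re.compile(
        rf"^\s*(?:import|from)\s+{escaped}(?:\.|\s|$)",
        re.MULTILINE,
    )
```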

What's Changed

Abstract syntax tree + forbidden keyword test by @jaoppb in #300
refactor: extract SubmissionGrader from GraderService by @jaoppb in #304
[Feature] Extract static analysis tests and add multi-template support (#305) by @ArthurCRodrigues in #306

Full Changelog: 0.2.0...0.3.0

Contributors

jaoppb and ArthurCRodrigues

23 Apr 10:59

ArthurCRodrigues

0.2.0

79f80ac

0.2.0

New Features

External Execution Mode: Added a new external execution mode to the GitHub Action, allowing grading configurations to be managed centrally in Autograder Cloud instead of requiring config files in student repositories. The action fetches the grading config by ID from the cloud API, runs the pipeline locally, and exports results back for submission tracking and analytics. Existing repo mode remains the default and is fully backward-compatible.
Cloud API Contract: Introduced two new protected endpoints: GET /api/v1/configs/id/{config_id} for fetching grading configurations by internal ID, and POST /api/v1/submissions/external-results for ingesting externally computed grading results. Results are immediately visible through existing submission retrieval endpoints.
M2M Bearer Token Authentication: Protected external mode endpoints with machine-to-machine bearer token auth using AUTOGRADER_INTEGRATION_TOKEN. Validation uses hmac.compare_digest to prevent timing attacks. Existing public endpoints are unaffected.
Cloud Client and Exporter: Added cloud_client.py for HTTP communication with the cloud API (config fetch + result submission) and cloud_exporter.py for mode-specific result export. The Exporter abstract class now supports export_with_context for richer pipeline-aware exports.
Structured Logging Middleware: Added HTTP middleware for X-Request-ID propagation, per-request ContextVar logging, and a JSON formatter with service, env, method, path, status, and duration_ms fields. Configurable via LOG_LEVEL, SERVICE_NAME, and APP_ENV environment variables.
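
The bearer token validation described above can be sketched with the standard library alone. hmac.compare_digest and the AUTOGRADER_INTEGRATION_TOKEN variable come from the release notes; the function name is_authorized and the header parsing are assumptions for illustration.

```python
import hmac
import os
from typing import Optional


def is_authorized(authorization_header: Optional[str],
                  expected_token: Optional[str] = None) -> bool:
    """Check an 'Authorization: Bearer <token>' header value in constant time."""
    if expected_token is None:
        expected_token = os.environ.get("AUTOGRADER_INTEGRATION_TOKEN", "")
    # Reject outright when no token is configured or no header was sent.
    if not expected_token or not authorization_header:
        return False
    scheme, _, presented = authorization_header.partition(" ")
    if scheme.lower() != "bearer":
        return False
    # compare_digest avoids leaking how many leading characters matched.
    return hmac.compare_digest(
        presented.strip().encode("utf-8"),
        expected_token.encode("utf-8"),
    )
```

A plain == comparison can return as soon as the first differing character is found, which is what makes timing attacks possible; compare_digest is designed so that its running time does not depend on where the inputs differ.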

Bug Fixes

Grading Step model_config: Fixed model_config being set to forbid, which caused errors during the grading step.
Deliberate Execution Error Handling: Restored the except block that was accidentally removed during the asset injection refactor. Without it, unhandled exceptions propagated as raw 500 errors instead of structured SYSTEM_ERROR responses. The number of error results now matches the number of requested test cases.
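
The error-handling behavior above, one structured error result per requested test case, can be sketched as follows. Only the SYSTEM_ERROR status comes from the release notes; CaseResult, run_cases, and the "OK" status are hypothetical names used for illustration.

```python
from dataclasses import dataclass
from typing import Callable, List, Sequence


@dataclass
class CaseResult:
    """Hypothetical result record for a single test case."""
    status: str
    detail: str = ""


def run_cases(inputs: Sequence[str],
              execute: Callable[[str], str]) -> List[CaseResult]:
    """Run every input; on failure, return one SYSTEM_ERROR result per input."""
    try:
        return [CaseResult("OK", execute(item)) for item in inputs]
    except Exception as exc:
        # Keep the result count aligned with the number of requested cases,
        # so callers never receive fewer results than they asked for.
        return [CaseResult("SYSTEM_ERROR", str(exc)) for _ in inputs]
```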

What's Changed

fix: fixing grading step model_config by @matheusmra in #294
feat: external mode web API contract, config fetch, and external result ingestion by @matheusmra in #295
feat: add external mode for GitHub Action by @matheusmra in #297
fix: restore error handling in deliberate execution service by @ArthurCRodrigues in #298
feat: add request correlation and structured logging middleware by @ArthurCRodrigues in #299

Full Changelog: 0.1.3...0.2.0

Contributors

matheusmra and ArthurCRodrigues

14 Apr 23:32

ArthurCRodrigues

0.1.3

acc0957

0.1.3

Marketplace

New Features

S3-Compatible Asset Injection: Integrated support for grader-owned static assets (datasets, fixtures) stored in S3 buckets. These files are injected into the sandbox using a secure Base64-encoded process, ensuring full compatibility with high-isolation gVisor environments.
File Artifact Validation: Introduced the expect_file_artifact test to the Input/Output template. This allows the autograder to extract and validate files generated by student programs during execution, supporting exact, partial, and regex-based matching.
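
The three matching styles mentioned for expect_file_artifact (exact, partial, and regex) can be illustrated with a small comparison helper. This is a sketch under assumptions: the function name, the mode strings, and the treatment of file content as text are hypothetical and may differ from the real test's parameters.

```python
import re


def artifact_matches(content: str, expected: str, mode: str = "exact") -> bool:
    """Compare extracted file content against an expectation."""
    if mode == "exact":
        return content == expected
    if mode == "partial":
        # The expected text only needs to appear somewhere in the file.
        return expected in content
    if mode == "regex":
        # 'expected' is treated as a regular expression searched anywhere.
        return re.search(expected, content) is not None
    raise ValueError(f"unknown match mode: {mode!r}")
```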

What's Changed

fix: use logger.exception instead of logger.error in except blocks by @xyf25 in #271
chore: remove dead parsers package and AutograderResponse dataclass by @xyf25 in #272
docs: add MkDocs portal and core concept guides by @ArthurCRodrigues in #274
docs: make getting started a real grading workflow by @ArthurCRodrigues in #275
Destroy grading sandboxes at pipeline cleanup by @ArthurCRodrigues in #279
(feat) new I/O test : Expect File Output by @ArthurCRodrigues in #290
fix: removing aibatch false success on pipeline by @matheusmra in #291
feat: implement static asset placement for sandbox executions by @jaoppb in #292

Full Changelog: 1.1.2...0.1.3

Contributors

jaoppb, matheusmra, and 2 other contributors

12 Apr 02:14

ArthurCRodrigues

0.1.2

acc0957

0.1.2

Marketplace

What's Changed

refactor: decouple UpstashDriver from env loading (Item 15) by @koladefaj in #262
Add test weight and propagate to service by @matheusmra in #264
feat: Deterministic container naming for traceability #265 by @LuisHenriqueDC301 in #266
enable feedback step by @matheusmra in #269
refactor: replace print statements with logging in LanguagePool by @xyf25 in #270

New Contributors

@koladefaj made their first contribution in #262
@xyf25 made their first contribution in #270

Full Changelog: 0.1.1...1.1.2

Contributors

matheusmra, koladefaj, and 2 other contributors

07 Apr 10:49

ArthurCRodrigues

0.1.1

aa8bb54

0.1.1

Marketplace

New Features

i18n support
AI Execution Refactor

What's Changed

226 td standardize language prep i18n by @matheusmra in #258
233 td formalize exporter plugin interface by @matheusmra in #259
refactoring AI batch by @matheusmra in #260

Full Changelog: 0.1.0...0.1.1

Contributors

matheusmra

31 Mar 16:18

ArthurCRodrigues

0.1.0

70b4778

0.1.0

Marketplace

Unofficial Stable Release

What's Changed

fix: Penalty logic by @matheusmra in #250
234 td separate pipelineexecution summary logic by @matheusmra in #253
235 td standardize step error handling by @matheusmra in #255
225 td split web dev template monolith by @matheusmra in #256
220 td decouple sandbox lifecycle from preflight by @matheusmra in #257

Full Changelog: 0.0.9...0.1.0

Contributors

matheusmra

25 Mar 11:23

ArthurCRodrigues

0.0.9

5cd125e

0.0.9

Marketplace

What's Changed

[TD][Template ABC] Standardize template contract and validation (#231) by @ArthurCRodrigues in #247
[TD][PipelineExecution] Add typed accessors and step adoption (#224) by @ArthurCRodrigues in #246
[TD][GraderService] Make grading flow stateless (#221) by @ArthurCRodrigues in #244
[TD][Templates] Unify template registration in service layer (#223) by @ArthurCRodrigues in #245
Fix: Github Module #241 by @Sl3nc in #248

Full Changelog: 0.0.8...0.0.9

Contributors

ArthurCRodrigues and Sl3nc

20 Mar 15:43

ArthurCRodrigues

0.0.8

ee67b02

0.0.8

Marketplace

What's Changed

222 td resolve language before running tests by @matheusmra in #240
229 td replace print calls with logging by @LuisHenriqueDC301 in #242
216 td fix inverted reporter mode assignment by @matheusmra in #243

New Contributors

@LuisHenriqueDC301 made their first contribution in #242

Full Changelog: 0.0.7...0.0.8

Contributors

matheusmra and LuisHenriqueDC301
