refactor: overhaul unit tests and achieve 100% coverage on core modules by ArthurCRodrigues · Pull Request #347 · webtech-network/autograder

ArthurCRodrigues · 2026-05-28T11:26:05Z

Context

The Autograder project required a significant overhaul of its unit and integration testing suite to ensure high-fidelity verification of core domain logic, recursive data structures, and failure recovery mechanisms. Previous tests relied too heavily on superficial mocking, which bypassed critical algorithmic paths such as score balancing and state transition validations.

Solution

This PR refactors and expands the testing suite to achieve near 100% coverage on core modules while enforcing deep, meaningful assertions:

Refactored SubmissionGrader Tests: Replaced heavy MagicMock usage in tests/unit/services/grader/test_submission_grader.py with real SubjectNode and TestNode objects. This allows for actual verification of the recursive weight-balancing algorithm and score calculation logic.
CriteriaTree Coverage: Added tests/unit/models/test_criteria_tree.py to achieve 100% coverage on recursive tree traversal methods (get_all_tests).
SandboxService Robustness: Added tests/unit/services/test_sandbox_service.py with comprehensive coverage for success paths and critical failure recovery (e.g., ensuring sandboxes are released back to the pool if workdir preparation fails).
PipelineExecution State Machine: Enhanced tests/unit/pipeline/test_pipeline_execution_accessors.py to cover all state transition branches and strict data requirement validations.
SandboxStep Isolation: Verified 100% coverage for SandboxStep, including branches for skipping execution when no sandbox is required by templates.

Further clarifications

Tests were verified with pytest --cov=autograder --cov-branch to ensure maximal path coverage.
No changes were made to the core application logic; this PR focuses purely on test infrastructure and reliability.

Related issues

Addresses gaps identified in testing coverage audit.

Checklist

I linked the related issue(s) and explained the motivation.
I kept this PR focused and scoped to a single concern.
I added or updated tests for changed behavior.
I ran the relevant tests locally.
I updated documentation when needed (No documentation changes required).

…s and services

Copilot

Copilot encountered an error and was unable to review this pull request. You can try again by re-requesting a review.

…failure

- Add missing docstrings to core models and services. - Fix critical pylint errors (missing arguments, not-callable false positives). - Move imports to top level where possible. - Address broad exception catching with selective pylint-disable. - Fix signatures of abstract methods for consistency. - Refactor sandbox and pre-flight services for better adherence to standards. - Improve logging with lazy formatting. - Resolve various minor warnings (unused imports, reimports, etc.).

…pdating AI test mocks

ArthurCRodrigues

ok

refactor: overhaul unit tests and achieve 100% coverage on core model…

73e737d

…s and services

Copilot AI review requested due to automatic review settings May 28, 2026 11:26

Copilot started reviewing on behalf of ArthurCRodrigues May 28, 2026 11:26 View session

Copilot AI reviewed May 28, 2026

ArthurCRodrigues added 6 commits May 28, 2026 08:34

fix: replace pytest-mock with standard unittest.mock.patch to fix CI …

f5342f8

…failure

fix: address pylint failures in new test files

3dc467c

fix: set PYTHONPATH in pylint workflow to resolve import errors

f2ce54e

Merge branch 'main' into test-harnessing

1ee85a2

fix: resolve test failures by reverting circular import changes and u…

305de7a

…pdating AI test mocks

ArthurCRodrigues commented May 30, 2026

View reviewed changes

ArthurCRodrigues merged commit d7ae6ba into main May 30, 2026
2 checks passed

ArthurCRodrigues deleted the test-harnessing branch May 30, 2026 13:37

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

refactor: overhaul unit tests and achieve 100% coverage on core modules#347

refactor: overhaul unit tests and achieve 100% coverage on core modules#347
ArthurCRodrigues merged 7 commits into
mainfrom
test-harnessing

ArthurCRodrigues commented May 28, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

ArthurCRodrigues left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

ArthurCRodrigues commented May 28, 2026

Context

Solution

Further clarifications

Related issues

Checklist

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Uh oh!

ArthurCRodrigues left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants