feature: New bazel test for config and dictionaries by frankslin · Pull Request #9 · frankslin/OpenCC

frankslin · 2026-01-03T01:37:55Z

Summary

add a //data/config:config_dict_validation_test to test dictionaries and configs against a testcases.json file
switch all CLI/Python/Node tests to consume testcases.json as the single source of truth; drop .in/.ans dependencies and adjust Bazel/CMake wiring
streamline dictionary build outputs (no standalone TWPhrases{IT,Name,Other}.ocd2) and align DictionaryTest with the actual generated dict set
add maintenance helpers (refresh_assets.sh cleanup and fix, rapidjson dep/path for CLI test) and keep wasm assets in sync via testcases.json

Testing

bazel test //data/dictionary:dictionary_test
bazel test //test:command_line_converter_test
bazel test //python/tests:test_opencc
node/test.js (sync/async/promise) using updated testcases.json

- Switch all tests (C++ CLI, Python, Node) to consume `testcases.json` and drop `.in`/`.ans` dependencies; keep filegroup for the JSON. - Prune TWPhrases sub-dictionary artifacts and align DictionaryTest to current generated dict set. - Add rapidjson dep/path for CLI test, refresh_assets script fixes, and keep Bazel Python toolchain note.

Copilot

Pull request overview

This PR modernizes the test infrastructure by consolidating all test cases into a single testcases.json file, replacing the legacy .in/.ans file pairs. The change improves maintainability by establishing a single source of truth for test data across CLI, Python, and Node.js test suites.

Key Changes:

Unified test data format using testcases.json with structured test cases containing input, expected outputs per configuration, and unique IDs
Updated all test runners (C++, Python, Node.js) to consume JSON-based test cases instead of file-based inputs
Streamlined dictionary build to exclude intermediate phrase components from standalone binary outputs

Reviewed changes

Copilot reviewed 37 out of 37 changed files in this pull request and generated 4 comments.

Show a summary per file

File	Description
test/testcases/testcases.json	New consolidated JSON file containing all test cases with inputs and expected outputs per config
test/testcases/.in, test/testcases/.ans	Removed legacy test input/answer files (migrated to JSON)
test/testcases/BUILD.bazel	Updated to reference testcases.json instead of globbing .in/.ans files
test/CommandLineConvertTest.cpp	Refactored to parse testcases.json and dynamically generate test cases
test/BUILD.bazel	Added rapidjson dependency for JSON parsing
test/CMakeLists.txt	Added rapidjson include path for test compilation
python/tests/test_opencc.py	Migrated from glob-based file reading to JSON-based test iteration
node/test.js	Refactored to read testcases.json and iterate over cases instead of hardcoded config list
data/dictionary/DictionaryTest.cpp	Updated dictionary list to exclude removed TWPhrases component files
data/dictionary/BUILD.bazel	Added PHRASE_PARTS exclusion to prevent standalone .ocd2 generation for merge components
data/config/ConfigDictValidationTest.cpp	New end-to-end validation test for configs against testcases.json
data/config/BUILD.bazel	Added cc_test target for config validation

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

test/CommandLineConvertTest.cpp

node/test.js

data/config/ConfigDictValidationTest.cpp

- Rename and guard streams in CommandLineConvertTest; ensure input file opens and normalize CRLF. - Fix node test promise handling to propagate errors correctly. - Mark ConfigDictValidationTest as Bazel-only to skip CMake builds.

## Summary - add a `//data/config:config_dict_validation_test` to test dictionaries and configs against a `testcases.json` file - switch all CLI/Python/Node tests to consume `testcases.json` as the single source of truth; drop `.in/.ans` dependencies and adjust Bazel/CMake wiring - streamline dictionary build outputs (no standalone `TWPhrases{IT,Name,Other}.ocd2`) and align DictionaryTest with the actual generated dict set - add maintenance helpers (refresh_assets.sh cleanup and fix, rapidjson dep/path for CLI test) and keep wasm assets in sync via `testcases.json` ## Testing - bazel test //data/dictionary:dictionary_test - bazel test //test:command_line_converter_test - bazel test //python/tests:test_opencc - node/test.js (sync/async/promise) using updated testcases.json ---- * feature: add a new ConfigDictValidationTest.cpp to be executed in bazel * Changeover to JSON-based testcases and clean dictionary outputs - Switch all tests (C++ CLI, Python, Node) to consume `testcases.json` and drop `.in`/`.ans` dependencies; keep filegroup for the JSON. - Prune TWPhrases sub-dictionary artifacts and align DictionaryTest to current generated dict set. - Add rapidjson dep/path for CLI test, refresh_assets script fixes, and keep Bazel Python toolchain note. * Normalize CommandLineConvertTest for CRLF comparisons on Windows * Address review feedback for tests and Bazel-only validation - Rename and guard streams in CommandLineConvertTest; ensure input file opens and normalize CRLF. - Fix node test promise handling to propagate errors correctly. - Mark ConfigDictValidationTest as Bazel-only to skip CMake builds.

frankslin added 3 commits January 2, 2026 14:41

feature: add a new ConfigDictValidationTest.cpp to be executed in bazel

9c2c965

Normalize CommandLineConvertTest for CRLF comparisons on Windows

0248fd7

frankslin requested a review from Copilot January 3, 2026 01:44

Copilot started reviewing on behalf of frankslin January 3, 2026 01:45 View session

Copilot AI reviewed Jan 3, 2026

View reviewed changes

test/CommandLineConvertTest.cpp Outdated Show resolved Hide resolved

test/CommandLineConvertTest.cpp Show resolved Hide resolved

node/test.js Outdated Show resolved Hide resolved

data/config/ConfigDictValidationTest.cpp Show resolved Hide resolved

frankslin mentioned this pull request Jan 3, 2026

改进 OpenCC 测试结构：使用单一输入一次测试多个 config，以降低测试编写与 review 成本 BYVoid/OpenCC#1003

Closed

frankslin merged commit 36c7cbb into master Jan 3, 2026
23 checks passed

frankslin deleted the feature/new-bazel-test-for-config-and-dictionaries branch January 3, 2026 03:06

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feature: New bazel test for config and dictionaries#9

feature: New bazel test for config and dictionaries#9
frankslin merged 4 commits intomasterfrom
feature/new-bazel-test-for-config-and-dictionaries

frankslin commented Jan 3, 2026 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

frankslin commented Jan 3, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Testing

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

frankslin commented Jan 3, 2026 •

edited

Loading