This repository was archived by the owner on Jun 30, 2025. It is now read-only.

@mart-r commented Jun 3, 2025

Stuff to sync (i.e. PRs merged after the v2 rewrite started)

Stuff that's missing completely from v2

  • Preprocessing for UMLS and/or Snomed
  • Usage monitoring
  • Backwards compatibility workflow
  • Multiprocessing

mart-r added 30 commits June 3, 2025 14:18
@mart-r mart-r merged commit d446f07 into main Jun 6, 2025
4 checks passed
alhendrickson pushed a commit to CogStack/cogstack-nlp that referenced this pull request Jun 24, 2025
* CU-86999tnz7: Update usage of constants for train/test splitting

* CU-86999tnz7: Add alternative category names

* CU-86999tnz7: Fix hashes due to config changes

* CU-86999tnz7: Allow (optionally) training addons (e.g. MetaCAT) during supervised training

* CU-86999tnz7: Add optional change description when saving model

* CU-86999tnz7: Add a test for description upon save

* CU-86999tnz7: Fix typing during tests

* CU-86999tnz7: Fix issues with extra labels, add relevant tests

* CU-86999tnz7: Ported DeID improvements

* CU-86999tnz7: Add missing resource files

* CU-86999tnz7: Add preprocessors for UMLS and Snomed

* CU-86999tnz7: Add usage monitoring

* CU-86999tnz7: Use promise of a hash for usage monitoring

* CU-86999tnz7: Allow 15 minutes for tests within main workflow

* CU-86999tnz7: Add README for release scripts

* CU-86999tnz7: Allow conversion of beta namespaces to proper ones during deserialisation

* CU-86999tnz7: Allow clearing unpacked data when saving model pack

* CU-86999tnz7: Make sure model pack path refers to existing file/folder

* CU-86999tnz7: Add base backwards compatibility stuff.

That is, added things to create a fake model and a script to create + check model, as well as run a simple test on vocab.

* CU-86999tnz7: Run model regression during workflow

* CU-86999tnz7: Fix vocab data path during regression vocab test

* CU-86999tnz7: Fix typo in script

* CU-86999tnz7: Add hash to custom names unless explicitly disabled

* CU-86999tnz7: Add backwards compatibility script

* CU-86999tnz7: Run backwards compatibility as part of workflow

* CU-86999tnz7: Avoid runtime warnings due to config namespaces

* CU-86999tnz7: Add initial multiprocessing option

* CU-86999tnz7: Add minor tests for batching

* CU-86999tnz7: Allow text index to be a string. Add docstring to multiprocessing method

* CU-86999tnz7: Allow batching on a per-character basis

* CU-86999tnz7: Add a few tests for per-character batching

* CU-86999tnz7: Fix issue with resulting text indices for multiprocessing

* CU-86999tnz7: Add minor multiprocessing test

* CU-86999tnz7: Allow an extra 5 minutes for workflow tests
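
The multiprocessing commits above mention batching "on a per-character basis" and text indices that may be strings. As a rough illustration of that idea only, here is a minimal sketch. The function name `batch_by_chars` and its parameters are hypothetical and are not taken from this PR or from the library's API; the sketch assumes the input is an iterable of `(index, text)` pairs.

```python
from typing import Iterable, Iterator, List, Tuple, Union

# Hypothetical alias: an index may be an int or a string.
TextIndex = Union[int, str]
IndexedText = Tuple[TextIndex, str]


def batch_by_chars(
    texts: Iterable[IndexedText], max_chars: int
) -> Iterator[List[IndexedText]]:
    """Group (index, text) pairs into batches by total character count.

    Illustrative helper, not the library's implementation. A batch is
    closed as soon as adding the next text would push its total length
    past max_chars. A single text longer than max_chars is not split;
    it ends up in a batch of its own.
    """
    batch: List[IndexedText] = []
    size = 0
    for idx, text in texts:
        # Close the current batch if this text would overflow it.
        if batch and size + len(text) > max_chars:
            yield batch
            batch, size = [], 0
        batch.append((idx, text))
        size += len(text)
    # Emit whatever is left over.
    if batch:
        yield batch


docs = [("a", "xxxx"), ("b", "yyy"), ("c", "zzzzzzzz"), ("d", "w")]
batches = list(batch_by_chars(docs, max_chars=8))
```

Batching by characters rather than by document count keeps the work per batch roughly even when document lengths vary widely. Keeping each index next to its text lets results be matched back to their inputs after the batches are processed in separate workers.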