Improved `Features` types by TobyBoyne · Pull Request #725 · experimental-design/bofire

TobyBoyne · 2026-02-15T16:10:47Z

Motivation

With the latest release, ty==0.0.17, the type checker now reports an error for attribute access on a union where some members lack the attribute (as mentioned in #719). This means that code like the example below now raises a typing error:

inputs = Inputs(...)
for feature in inputs.get(CategoricalInput):
    print(feature.categories)
# >> ty check
#  Error: Attribute `categories` is not defined on `DiscreteInput`, `ContinuousInput` in union `DiscreteInput | CategoricalInput | ContinuousInput`

A short term fix is just to pin the version of ty. This PR presents two different possible approaches to fix this in the long term. Let me know what you think of them @jduerholt, and whether you would like me to go ahead with either/both of them :)

1. TypeGuards

We can use TypeGuard to filter down the possible features contained within a Features object. For example, this PR currently includes one for checking if all the features in an Inputs object are continuous. This can be used as below (also see the change in benchmarks.py for an example).

if Inputs.is_continuous(inputs):
    for feature in inputs.get():
        print(feature.lower_bound) 
# no ty check errors!

2. Overloads

We can also overload the get method on Features to specify the types of features in the containers. This approach should be a bit more seamless, since you won't need to call an extra function compared to approach (1). I've currently added an example implementing this for Outputs, with an example in naming_conventions.py.

for feature in inputs.get(CategoricalInput):
    # thanks to the overload, we can now infer the type of `inputs.get(CategoricalInput)` is `Inputs[CategoricalInput]`
    print(feature.categories)
# again, no ty check errors!
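As a rough illustration of how such an overload might look, here is a self-contained sketch on a toy generic container. The `Features` class and the feature classes here are simplified stand-ins, and the signatures are assumptions, not the ones in this PR.

```python
from typing import Generic, Iterator, Type, TypeVar, overload


class Feature:  # stand-in base class
    def __init__(self, key: str) -> None:
        self.key = key


class ContinuousInput(Feature):
    def __init__(self, key: str, lower_bound: float, upper_bound: float) -> None:
        super().__init__(key)
        self.lower_bound = lower_bound
        self.upper_bound = upper_bound


class CategoricalInput(Feature):
    def __init__(self, key: str, categories: list[str]) -> None:
        super().__init__(key)
        self.categories = categories


FeatureT = TypeVar("FeatureT", bound=Feature)
IncludesT = TypeVar("IncludesT", bound=Feature)


class Features(Generic[FeatureT]):
    def __init__(self, features: list[FeatureT]) -> None:
        self.features = list(features)

    def __iter__(self) -> Iterator[FeatureT]:
        return iter(self.features)

    # Overload 1: a class is given, so the result is narrowed to that class.
    @overload
    def get(self, includes: Type[IncludesT]) -> "Features[IncludesT]": ...

    # Overload 2: nothing is given, so the element type is unchanged.
    @overload
    def get(self, includes: None = None) -> "Features[FeatureT]": ...

    def get(self, includes=None):
        if includes is None:
            return Features(self.features)
        return Features([f for f in self.features if isinstance(f, includes)])


inputs = Features(
    [ContinuousInput("x1", 0.0, 1.0), CategoricalInput("c1", ["a", "b"])]
)
# The checker sees Features[CategoricalInput], so `.categories` is accepted.
categories = [f.categories for f in inputs.get(CategoricalInput)]
```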

Have you read the Contributing Guidelines on pull requests?

Yes

Have you updated `CHANGELOG.md`?

Not yet

Test Plan

The number ty errors that are currently raised on the CI should go down.

jduerholt

This looks super interesting, I personally like the overload approach, despite not really understanding it :D

@bertiqwerty what do you think? I personally think that implementing the overload would be quite nice.

jduerholt · 2026-02-16T10:02:10Z

bofire/data_models/domain/features.py

        )
        return clean_exp
+
+    @overload


Why do we need the overload two times?

Unfortunately that's part of the specification for overloads (link). I think the idea is that the actual implementation of get should have no type hints, and all of the type hints go into the different overloads.

The second overload is a fallback for when the argument types don't match the first overload. That happens whenever excludes is provided: you can't express Intersection[GetIncludesT, ~GetExcludesT] in Python's type system (yet), so the fallback has to be more generic!
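A small sketch of that two-overload pattern, written as a free function to keep it short. The function name `get_features` and its signature are hypothetical; only the idea (a precise overload plus a generic fallback once `excludes` is involved) comes from the discussion above.

```python
from typing import Optional, Sequence, Type, TypeVar, overload


class Feature: ...


class ContinuousInput(Feature): ...


class DiscreteInput(Feature): ...


class CategoricalInput(Feature): ...


IncludesT = TypeVar("IncludesT", bound=Feature)


# Precise overload: only `includes` is given, so the result can be narrowed.
@overload
def get_features(
    features: Sequence[Feature],
    includes: Type[IncludesT],
    excludes: None = None,
) -> Sequence[IncludesT]: ...


# Fallback overload: once `excludes` is used, "IncludesT but not ExcludesT"
# cannot be written down, so the result stays at the base type.
@overload
def get_features(
    features: Sequence[Feature],
    includes: Type[Feature] = Feature,
    excludes: Optional[Type[Feature]] = None,
) -> Sequence[Feature]: ...


# The implementation itself carries no annotations.
def get_features(features, includes=Feature, excludes=None):
    return [
        f
        for f in features
        if isinstance(f, includes)
        and (excludes is None or not isinstance(f, excludes))
    ]


feats = [ContinuousInput(), DiscreteInput(), CategoricalInput()]
only_continuous = get_features(feats, ContinuousInput)  # first overload
no_categorical = get_features(feats, Feature, excludes=CategoricalInput)  # fallback
```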

TobyBoyne · 2026-02-16T10:20:05Z

If you wanted to take this even further, you can also use Annotated to provide type hints for the keys themselves. For example, you can have continuous_input_key: Annotated[str, ContinuousInput]. This still behaves exactly like a string, but the extra context provided by Annotated lets you use it in discriminating overloads. For example:

from typing import Annotated, overload, reveal_type  # reveal_type needs Python 3.11+

class Feature: ...

class ContinuousFeature(Feature): ...

class CategoricalFeature(Feature): ...

# Example registry so the snippet runs; the contents are placeholders.
features: dict[str, Feature] = {"x1": ContinuousFeature(), "x2": CategoricalFeature()}

def get_continuous_key() -> Annotated[str, ContinuousFeature]:
    # Dummy function to get a key with the correct annotation.
    # One could imagine Inputs.get_keys(includes=ContinuousInput) returns a similarly
    # annotated feature key
    return "x1"

@overload
def get_feature_from_key(key: Annotated[str, ContinuousFeature]) -> ContinuousFeature:
    ...

@overload
def get_feature_from_key(key: Annotated[str, CategoricalFeature]) -> CategoricalFeature:
    ...

def get_feature_from_key(key: str) -> Feature:
    return features[key]


key = get_continuous_key()
feature = get_feature_from_key(key)
reveal_type(feature)  # intended: ContinuousFeature

What do you think? It may be a bit overkill, but really we should only need to touch the features.py file. All of the annotations will (hopefully) nicely propagate out from calling methods on Inputs and Outputs.

jduerholt · 2026-02-16T11:06:30Z

I like it, and you are right that we are only using it for the feature containers. It will definitely add some boilerplate code, but it will make type hints etc. much nicer. @bertiqwerty: should Toby go ahead with this? What do you think?

TobyBoyne · 2026-02-17T10:40:30Z

I was digging into this a bit further, and it turns out the Annotated approach doesn't actually work - the type checkers see the two Annotated arguments as simply str, so it will always match the first one. Oh well! The proper approach would be to use something like NewType, but this may come with its own overhead. I will keep playing around with this!
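A small runtime illustration of why the metadata does not help here: by default, `Annotated[str, X]` resolves to plain `str`, and the extra argument is only visible to tools that explicitly ask for it. This shows runtime introspection with the standard `typing` helpers, not the behaviour of any particular type checker; the function below is a hypothetical stand-in.

```python
from typing import Annotated, get_args, get_type_hints


class ContinuousFeature: ...


def get_feature_from_key(key: Annotated[str, ContinuousFeature]) -> None: ...


# Default view: the Annotated wrapper is stripped, leaving only `str`.
plain_hints = get_type_hints(get_feature_from_key)

# Opt-in view: the metadata is kept and can be inspected.
full_hints = get_type_hints(get_feature_from_key, include_extras=True)
base_type, metadata = get_args(full_hints["key"])
```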

Edit: The latest commit shows an example of how custom key types might work. Unfortunately, I don't think it's really viable. If you're curious why I say that, it's because in my opinion two key things are missing from the Python type system:

Intersection types: we really want get_by_key(ContinuousFeatureKey) to have return type Intersection[InputT, ContinuousInput], but intersection types don't yet exist in Python.
Union overloads: we have (key: ContinuousFeatureKey) -> ContinuousInput and (key: DiscreteFeatureKey) -> DiscreteInput, and a fallback (key: str) -> AnyInput. This means that any key will always match this last case, and so will always return AnyInput. We would like the fallback to be (key: Intersection[str, Not[ContinuousFeatureKey], Not[DiscreteFeatureKey], ...]) -> AnyInput, but again this isn't supported (and probably isn't very sound from a type hinting perspective).

My suggestion going forward would be to write code like Inputs.get(ContinuousInput).get_by_key(featkey), which means function calling will be a bit more verbose but the type hinting is more explicit. This can all be done with the original overload approach in this PR, and I will remove the NewType stuff.
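A sketch of what that chained call could look like on a toy container: `get` narrows the element type, and `get_by_key` then simply returns that element type. The classes are simplified stand-ins and the method bodies are assumptions, not BoFire's implementation.

```python
from typing import Generic, Type, TypeVar


class Feature:  # stand-in base class
    def __init__(self, key: str) -> None:
        self.key = key


class ContinuousInput(Feature):
    def __init__(self, key: str, lower_bound: float, upper_bound: float) -> None:
        super().__init__(key)
        self.lower_bound = lower_bound
        self.upper_bound = upper_bound


class CategoricalInput(Feature):
    def __init__(self, key: str, categories: list[str]) -> None:
        super().__init__(key)
        self.categories = categories


FeatureT = TypeVar("FeatureT", bound=Feature)
IncludesT = TypeVar("IncludesT", bound=Feature)


class Inputs(Generic[FeatureT]):
    def __init__(self, features: list[FeatureT]) -> None:
        self.features = list(features)

    def get(self, includes: Type[IncludesT]) -> "Inputs[IncludesT]":
        # Narrow the container to features of the requested class.
        return Inputs([f for f in self.features if isinstance(f, includes)])

    def get_by_key(self, key: str) -> FeatureT:
        # The return type follows the container's element type.
        for feature in self.features:
            if feature.key == key:
                return feature
        raise KeyError(key)


inputs = Inputs(
    [ContinuousInput("x1", 0.0, 1.0), CategoricalInput("c1", ["a", "b"])]
)
# More verbose than inputs.get_by_key("x1"), but the result is typed as
# ContinuousInput, so `.lower_bound` is accepted by the checker.
x1 = inputs.get(ContinuousInput).get_by_key("x1")
```

A side effect of this style is that asking for a key of the wrong kind fails at runtime with a `KeyError`, because the narrowed container no longer holds that feature.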

jduerholt · 2026-02-19T11:30:45Z

Hi @TobyBoyne, thanks for your efforts here. Until this is ready, I will pin the ty version to see if we get degradation with respect to this one.

jduerholt · 2026-02-19T11:31:01Z

Hi @TobyBoyne, thanks for your efforts here. Until this is ready, I will pin the ty version to see if we get degradation with respect to the old one.

TobyBoyne · 2026-02-20T14:33:37Z

bofire/utils/torch_tools.py

    """
    constraints = []
+    inputs = domain.inputs.get([ContinuousInput, DiscreteInput])
    for c in domain.constraints.get(constraint):


ty infers the type of c to be Constraint, whereas pyright correctly infers it to be LinearEqualityConstraint | LinearInequalityConstraint. Adding an extra .get([LinearEqualityConstraint, LinearInequalityConstraint]) here reveals @Todo with ty.

TobyBoyne added 4 commits February 15, 2026 15:25

Better generic vars in features

7abff70

Example of using typeguard

4837e1c

Move TypeGuard to top of function

6113fc6

get overloads and example for output

0c9880f

jduerholt reviewed Feb 16, 2026

Add similar typing to Constraints

2460712

TobyBoyne added 5 commits February 17, 2026 13:42

Possible suggestion for using NewType with feature keys (see `torch_t…

b0ae8fd

…ools`)

Merge branch 'main' into typing/feature-key-types

4779c15

Revert NewType change

87ad757

Clean up get signatures in Features and Inputs

df132a7

Switch from list to sequence for covariance

c344d91

jduerholt mentioned this pull request Feb 19, 2026

Pin Ty Version #734

Merged

TobyBoyne added 3 commits February 20, 2026 13:54

Change calls on Features: down to 143 diagnostics

a268633

Clean up deterministic categorical and some constraints

0fe4211

A bit more cleanup

0a06aa8

TobyBoyne wants to merge 13 commits into experimental-design:main from TobyBoyne:typing/feature-key-types
