Pull request overview
This PR tightens the test suite’s warning hygiene by turning warnings into errors and enabling stricter pytest behavior, then adjusting code/tests to eliminate or suppress the resulting warnings (including a few third‑party deprecations).
Changes:
- Enable stricter pytest configuration/marker handling and convert warnings to errors via `pytest.ini` (`filterwarnings = error` plus targeted ignores).
- Update tests to avoid/contain deprecation warnings that occur at collection time (module import / parametrization).
- Adjust runtime code paths to avoid warning-triggering tensor conversions and pandas FutureWarnings.
Reviewed changes
Copilot reviewed 12 out of 12 changed files in this pull request and generated 4 comments.
| File | Description |
|---|---|
| `pytest.ini` | Enables strict pytest modes and turns warnings into errors with targeted ignores. |
| `tests/conftest.py` | Adds a temporary global warning ignore and fixes/skips warning-triggering test helpers. |
| `tests/test_deprecations.py` | Suppresses deprecations that fire at collection time and in specific tests under `-W error` behavior. |
| `tests/test_surrogate.py` | Adjusts test input size to avoid warning-triggering behavior. |
| `tests/hypothesis_strategies/alternative_creation/test_acquisition.py` | Deduplicates acquisition function strings used for parametrization. |
| `tests/constraints/test_constraints_continuous.py` | Fixes a duplicated parametrization id. |
| `tests/constraints/test_cardinality_constraint_continuous.py` | Ensures the expected warning is always captured under warning-as-error mode. |
| `baybe/constraints/discrete.py` | Suppresses pandas FutureWarning during dummy assignments for dependency constraints. |
| `baybe/surrogates/random_forest.py` | Avoids converting tensors requiring grad to numpy directly by detaching first. |
| `baybe/surrogates/ngboost.py` | Detaches and converts torch tensors before passing to the NGBoost prediction API. |
| `baybe/surrogates/linear.py` | Detaches and converts torch tensors before passing to the sklearn prediction API. |
| `baybe/utils/clustering_algorithms/third_party/kmedoids.py` | Attempts to replace deprecated/removed sklearn utility usage in the k-medoids++ init. |
```diff
 rand_vals = random_state_.random_sample(n_local_trials) * current_pot
-candidate_ids = np.searchsorted(stable_cumsum(closest_dist_sq), rand_vals)
+candidate_ids = np.searchsorted(
+    np.cumulative_sum(closest_dist_sq), rand_vals
```
`np.cumulative_sum` only exists as of NumPy 2.0, so this will raise AttributeError at runtime on older NumPy versions. If the goal is to remove the scikit-learn `stable_cumsum` import without bumping the NumPy requirement, replace this with `np.cumsum` (available in all NumPy versions) or inline a numerically stable cumulative sum equivalent to `stable_cumsum` to preserve behavior.
```diff
-    np.cumulative_sum(closest_dist_sq), rand_vals
+    np.cumsum(closest_dist_sq), rand_vals
```
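If preserving the stability check matters, the helper can also be inlined. The following is a minimal 1-D sketch of what sklearn's deprecated `stable_cumsum` provides; the function name and default tolerances here are illustrative, not BayBE's actual code:

```python
import warnings

import numpy as np


def stable_cumsum_1d(arr, rtol=1e-05, atol=1e-08):
    """Cumulative sum in float64 with a sanity check against the direct sum.

    Accumulates in float64 and warns if the last cumulative value drifts
    away from ``np.sum`` beyond the given tolerances.
    """
    out = np.cumsum(arr, dtype=np.float64)
    expected = np.sum(arr, dtype=np.float64)
    if out.size and not np.isclose(out[-1], expected, rtol=rtol, atol=atol):
        warnings.warn(
            "cumsum was found to be unstable: its last element does not "
            "correspond to sum",
            RuntimeWarning,
        )
    return out
```

Such a helper would slot into the k-medoids++ line as `np.searchsorted(stable_cumsum_1d(closest_dist_sq), rand_vals)` without any NumPy version constraint.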
```python
    if hasattr(cl, "abbreviation")
]
fullnames = [cl.__name__ for cl in get_subclasses(AcquisitionFunction)]
combined = set(abbreviations + fullnames)
```
Using a set for `pytest.mark.parametrize` makes test collection order non-deterministic (hash-randomized), which can lead to flaky ordering/IDs across runs and platforms. Consider keeping the de-duplication but switching to a deterministic sequence (e.g., `sorted(set(...))`) before passing it to parametrize.
```diff
-combined = set(abbreviations + fullnames)
+combined = sorted(set(abbreviations + fullnames))
```
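To illustrate with hypothetical acquisition-function names (not the actual strings from the test): a `set` deduplicates but iterates in hash order, which can differ between interpreter runs, whereas sorting restores a stable sequence for the parametrize IDs:

```python
# Hypothetical names; duplicates arise when an abbreviation equals a class name.
names = ["qEI", "UCB", "qEI", "qLogEI"]

# set(names) deduplicates, but its iteration order depends on string hashing,
# which is randomized per interpreter run (PYTHONHASHSEED).
combined = sorted(set(names))
# Sorting yields the same sequence on every run: ['UCB', 'qEI', 'qLogEI']
```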
```python
with warnings.catch_warnings():
    # Suppress pandas FutureWarning about setting incompatible dtype.
    # TODO: Needs proper fix for pandas 3.0 compatibility.
    warnings.simplefilter("ignore", FutureWarning)
    for k, _ in enumerate(self.parameters):
        # .loc assignments are not supported by mypy + pandas-stubs yet
        # See https://github.com/pandas-dev/pandas-stubs/issues/572
        censored_data.loc[  # type: ignore[call-overload]
            ~self.conditions[k].evaluate(data[self.parameters[k]]),
            self.affected_parameters[k],
        ] = Dummy()
```
The `warnings.simplefilter("ignore", FutureWarning)` inside `get_invalid` suppresses all FutureWarnings raised in this block, which can mask unrelated warnings and make it harder to spot new pandas deprecations. Prefer narrowing the filter to the specific pandas warning message (or module) you're targeting, or proactively adjust the dataframe/column dtypes before assignment to avoid emitting the warning in the first place.
```diff
-with warnings.catch_warnings():
-    # Suppress pandas FutureWarning about setting incompatible dtype.
-    # TODO: Needs proper fix for pandas 3.0 compatibility.
-    warnings.simplefilter("ignore", FutureWarning)
-    for k, _ in enumerate(self.parameters):
-        # .loc assignments are not supported by mypy + pandas-stubs yet
-        # See https://github.com/pandas-dev/pandas-stubs/issues/572
-        censored_data.loc[  # type: ignore[call-overload]
-            ~self.conditions[k].evaluate(data[self.parameters[k]]),
-            self.affected_parameters[k],
-        ] = Dummy()
+# Cast affected columns to object before assigning Dummy() values to avoid
+# pandas FutureWarning about setting incompatible dtypes.
+all_affected_params = list(
+    dict.fromkeys(
+        affected_param
+        for affected_params in self.affected_parameters
+        for affected_param in affected_params
+    )
+)
+if all_affected_params:
+    censored_data[all_affected_params] = censored_data[
+        all_affected_params
+    ].astype(object)
+for k, _ in enumerate(self.parameters):
+    # .loc assignments are not supported by mypy + pandas-stubs yet
+    # See https://github.com/pandas-dev/pandas-stubs/issues/572
+    censored_data.loc[  # type: ignore[call-overload]
+        ~self.conditions[k].evaluate(data[self.parameters[k]]),
+        self.affected_parameters[k],
+    ] = Dummy()
```
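The message-based narrowing mentioned above can be sketched as follows; the message text is an assumption based on the usual pandas wording, so verify it against the warning actually emitted:

```python
import warnings


def assign_with_narrow_filter():
    """Demonstrate ignoring only one specific FutureWarning message."""
    with warnings.catch_warnings():
        warnings.simplefilter("error")  # keep all other warnings loud
        # The message argument is a regex matched against the start of the
        # warning text, so only the incompatible-dtype message is silenced.
        warnings.filterwarnings(
            "ignore",
            message="Setting an item of incompatible dtype",
            category=FutureWarning,
        )
        # Suppressed: matches the narrowed filter.
        warnings.warn(
            "Setting an item of incompatible dtype is deprecated", FutureWarning
        )
        try:
            # Still escalates: an unrelated FutureWarning is not masked.
            warnings.warn("some unrelated pandas change", FutureWarning)
        except FutureWarning:
            return "unrelated warning surfaced"
        return "unrelated warning masked"
```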
```ini
ignore:.*add(ed|ing) jitter.*:linear_operator.utils.warnings.NumericalWarning
ignore:Optimization failed.*:RuntimeWarning:botorch.optim.optimize
; Note: File format instead of module format needed due to BoTorch reraising using `warn_explicit` without `module` argument
ignore:`scipy_minimize` terminated with status .*:botorch.exceptions.warnings.OptimizationWarning:.*botorch/fit
```
The filterwarnings rule on this line has trailing whitespace after the `.*botorch/fit` regex. Depending on how the config is parsed, this can accidentally become part of the regex and prevent the ignore from matching. Please remove the trailing spaces to make the rule unambiguous.
Suggested change (identical except for the removed trailing whitespace):

```ini
ignore:`scipy_minimize` terminated with status .*:botorch.exceptions.warnings.OptimizationWarning:.*botorch/fit
```
A proper fix is needed for pandas 3.0, but this is outside the scope of the PR. For now, since we're living with the warning anyway, a simple ignore to enable the pytest `-W error` mode suffices.
Original error message: `sklearn.exceptions.DataConversionWarning: A column-vector y was passed when a 1d array was expected. Please change the shape of y to (n_samples, ), for example using ravel().`

Deprecation message from sklearn: "`sklearn.utils.extmath.stable_cumsum` is deprecated in version 1.8 and will be removed in 1.10. Use `np.cumulative_sum` with the desired dtype directly instead."

Original error message: `botorch.exceptions.warnings.BadInitialCandidatesWarning: All acquisition values for raw samples points are the same for at least one batch. Choosing initial conditions at random.`
The reason why this used to work before is the previously missing `n_rows` argument: with only a single data point, the models fell back to the simple `MeanPredictionSurrogate`, bypassing the optional import error.
Say goodbye to all pytest warnings, once and for all. This is achieved by turning the warnings into errors using the `-W error` mode, requiring us to fix the underlying issues, which is generally good practice. The PR essentially finalizes the work from #766. In addition, it enables the `--strict` mode.
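Put together, the configuration described in this PR might look roughly like the following `pytest.ini` (a minimal sketch, not the PR's actual file; the ignore pattern is illustrative):

```ini
[pytest]
addopts = --strict-markers
filterwarnings =
    ; Escalate every warning to an error by default ...
    error
    ; ... except for explicitly allow-listed ones, matched as
    ; action:message-regex:category[:module[:lineno]]
    ignore:Setting an item of incompatible dtype.*:FutureWarning
```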