Implement `expect_disjoint()` #2239

stibu81 · 2025-09-19T14:39:33Z

This introduces two negated expectations as suggested in #1851 with the following functionality:

expect_not_contains(x, y) tests that x contains none of the elements of y (i.e. y is disjoint from x).
expect_not_in(x, y) tests that no element of x is in y (i.e. x is disjoint from y).

While the non-negated expectations (expect_contains() and expect_in()) actually do different things, these two are equivalent. It might still make sense to have both.
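To illustrate the equivalence, here is a minimal base-R sketch. The helper names (`not_contains`, `not_in`, `is_in`, `is_contains`) are hypothetical stand-ins and are not the functions added in this PR; they only model the checks described above.

```r
# Hypothetical helpers, base R only; not the functions added in the PR.
not_contains <- function(x, y) !any(y %in% x)  # x contains no element of y
not_in       <- function(x, y) !any(x %in% y)  # no element of x is in y

x <- c("a", "b", "c")
y <- c("c", "d")
z <- c("d", "e")

# Both negated checks agree, whichever vector comes first.
res_overlap  <- c(not_contains(x, y), not_in(x, y), not_in(y, x))
res_disjoint <- c(not_contains(x, z), not_in(x, z), not_in(z, x))

# By contrast, the non-negated subset checks depend on argument order.
is_in       <- function(x, y) all(x %in% y)  # every element of x is in y
is_contains <- function(x, y) all(y %in% x)  # x contains every element of y
res_subset  <- c(is_in("a", x), is_contains("a", x))
```

Under this reading, the two negated checks both reduce to "x and y share no value", which is symmetric in its arguments.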

During implementation I realised that one might have different expectations from these names. For example, one might expect that expect_not_in(x, y) checks that:

none of the elements of x are in y (which is what I implemented)
x is not a subset of y

Both of them could also meaningfully be understood as inversions of the other two expectations. Would the second variant also be of interest?

Let me know if anything should be improved.

hadley · 2025-10-06T22:15:23Z

@stibu81 my inclination would be to define the tests like this:

expect_not_contains(x, y) tests that x contains no element of y (i.e. y is not a subset of x).
expect_not_in(x, y) tests that no element of x is in y (i.e. x is not a subset of y).

(i.e. just replacing "every" with "no", and adding "not" before subset). Does that make sense to you or have I confused myself? (As I do whenever I look at these functions)

lionel- · 2025-10-07T06:45:23Z

How about a single expect_disjoint() function?

stibu81 · 2025-10-07T08:11:11Z

@hadley It is very confusing, yes. I think the confusion has to do with what I briefly mentioned at the end of my original post: there are two reasonable ways to think about these functions: in terms of single elements or in terms of sets. In the case of expect_in(), these two ways are equivalent, but with expect_not_in() they are not.

For expect_in(x, y), they can be formulated as follows:

elements: check that every element of x is in y
set: check that the entire set x is in y

These two statements say exactly the same thing (x is a subset of y), but for expect_not_in(x, y) they differ:

elements: check that no element of x is in y (x is disjoint from y).
set: check that the set x is not in y (x is not a subset of y).

As an example, expect_not_in(c(3, 4), 1:3) would fail in the first case because 3 is in 1:3, but it would succeed in the second case, because c(3, 4) is not a subset of 1:3. Only this second case is the exact inverse of expect_in() in that it succeeds precisely when the other fails. But (at least to me), the element-wise check seems more natural and more useful.
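The example above can be checked directly in base R. This is only a sketch of the two readings using `%in%`, not the implementation from the PR; the variable names are chosen for illustration.

```r
x <- c(3, 4)
y <- 1:3

# Element reading: succeeds only if x and y share no value.
elementwise_ok <- !any(x %in% y)

# Set reading: succeeds whenever x is not a subset of y.
setwise_ok <- !all(x %in% y)

# expect_in(x, y) corresponds to the subset check itself.
subset_ok <- all(x %in% y)
```

Here `elementwise_ok` is `FALSE` (3 is shared) while `setwise_ok` is `TRUE` (4 is not in `y`), and only `setwise_ok` is the logical negation of `subset_ok`.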

Your description of the tests mixes those two distinct ways of understanding the functions, so I think it is not correct.

The name suggested by @lionel- is much clearer and cannot be misunderstood, so I prefer that one. From the name, it is less obvious that this is a kind of inverse to expect_in() and expect_contains(), which someone might still try to find. But as I have said above, my implementation is also not the exact inverse of expect_in(), so it might actually be better to not imply that it is.

What is your preferred way forward? Should I replace the two functions by expect_disjoint()? And do you still see any justification to implement the "set-variants" of the functions (which would be difficult to name properly, I think)?

DavisVaughan · 2025-10-10T14:00:22Z

When I think of these, I do think the "elements" based approach mentioned above is what I'd expect them to do. I think pictures are useful here:

What falls out from these pictures is that not-contains and not-in would use the same implementation when defined this way. I do think their error messages would probably be a little different:

# Not contains
`actual` contains some of the values in `unexpected`

# Not in
Some values of `actual` are in `unexpected`

And of course you'd provide the arguments in different orders, expect_not_contains(haystack, needles) vs expect_not_in(needles, haystack).

But I do really like what @lionel- suggested here. I think expect_disjoint():

Is a very clear name. In particular I prefer to have a positive assertion over a not assertion.
Has no ambiguity about whether the vectors must be partially or fully separated (I think it implies fully disjoint with no overlap at all)
Is nice because it's a single function, capturing how the implementations are the same between the two of them.

I don't think the argument order actually matters all that much. Reporting something like this feels like it would be good enough for all use cases

{act$lab} (`actual`) and {exp$lab} (`expected`) are not disjoint.
* Present in both: `values(intersect(act$val, exp$val))`
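A rough base-R sketch of such a check and message follows. The function `check_disjoint()` and its label arguments are hypothetical; testthat's labelling and its `values()` helper are replaced here by plain arguments and `toString()`, and the shared values are computed with `intersect()`.

```r
# Hypothetical helper, base R only; not the testthat implementation.
check_disjoint <- function(actual, expected,
                           act_lab = "actual", exp_lab = "expected") {
  common <- intersect(actual, expected)
  if (length(common) == 0) {
    return(NULL)  # disjoint: nothing to report
  }
  c(
    sprintf("%s and %s are not disjoint.", act_lab, exp_lab),
    sprintf("* Present in both: %s", toString(common))
  )
}

msg_overlap  <- check_disjoint(c("a", "b", "c"), c("c", "d"))
msg_disjoint <- check_disjoint(c("a", "b"), c("c", "d"))
```

Because `intersect()` gives the same set of values regardless of argument order, the message reads the same way for both the "contains" and the "in" use cases.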

stibu81 · 2025-10-10T14:06:50Z

@DavisVaughan I think we agree then. This would replace the two functions that I implemented with a single one that does exactly the same, but has a clearer name and produces slightly different output. But it would not be the exact inverse of expect_in() or expect_contains().

And I would not implement the set-variants of expect_not_in() and expect_not_contains().

Is it ok for me to go ahead or should I wait on a comment by @hadley?

DavisVaughan · 2025-10-10T14:11:19Z

I think you can go ahead!

hadley · 2025-10-10T14:41:50Z

Plan sounds good to me!

stibu81 · 2025-10-10T15:41:12Z

Second attempt, now with expect_disjoint(). I tried to keep the documentation and the failure message in the spirit of the other expectations in "setequal-group".

DavisVaughan

@hadley looks good to me and the implementation matches the spirit of expect_in() - I'll let you be the final approver and merge-er

DavisVaughan · 2025-10-10T16:34:40Z

R/expect-setequal.R

+    )
+    msg_act <- c(
+      sprintf("Actual: %s", values(act$val)),
+      sprintf("Expected: none of %s", values(exp$val)),


Suggested change

-      sprintf("Expected: none of %s", values(exp$val)),
+      sprintf("Expected: None of %s", values(exp$val)),

I think I like having this capitalized more

DavisVaughan · 2025-10-10T16:37:05Z

tests/testthat/test-expect-setequal.R

+
+  expect_snapshot_failure(expect_disjoint(x1, x2))
+  expect_snapshot_failure(expect_disjoint(x1, x3))
+})


Might be useful to have a test for expect_disjoint(c("a", NA), NA) to test that missing values are matched exactly?
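For context, base R's matching functions treat NA as equal to NA, unlike `==`. Assuming the expectation is built on `match()`-style set operations (an assumption about the implementation, not something shown in this thread), the suggested call would be expected to fail because NA counts as a shared value. A small sketch:

```r
x <- c("a", NA)
y <- NA

# %in% is built on match(), which treats NA as matching NA.
matched <- x %in% y

# `==` propagates NA instead of reporting a match.
compared <- x == y

# TRUE means the two vectors are not disjoint under match() semantics.
overlaps <- any(matched)
```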

stibu81 · 2025-10-10

Thanks for the review. I made the requested changes. In doing so, I might have stumbled onto something else: expect_failure() does not succeed for expect_disjoint() and some other functions in this file, e.g.:

expect_failure(expect_in(3, 5))
## Error: Expected zero successes.
## Actually succeeded 1 times

I think the reason is that, in some functions, the call to fail() is not wrapped in return(), so the later pass() is also executed. expect_snapshot_failure() seems to be ok with this, but expect_failure() is not.

I could fix those missing return()s, but I'm not sure that it is good to mix this into this PR that is about something else.
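The control-flow issue described here can be modelled with a toy example. The functions `fail_stub()` and `pass_stub()` below are hypothetical counters and are not testthat's `fail()` and `pass()`; the sketch only shows why omitting `return()` lets both outcomes be recorded.

```r
# Toy stand-ins that record an outcome; NOT testthat's fail() and pass().
outcomes <- character()
fail_stub <- function() {
  outcomes <<- c(outcomes, "failure")
  invisible(FALSE)
}
pass_stub <- function() {
  outcomes <<- c(outcomes, "success")
  invisible(TRUE)
}

# Without return(): execution continues after the failure is recorded.
check_no_return <- function(ok) {
  if (!ok) fail_stub()
  pass_stub()
}

# With return(): the function exits right after recording the failure.
check_with_return <- function(ok) {
  if (!ok) return(fail_stub())
  pass_stub()
}

outcomes <- character()
check_no_return(FALSE)
no_return_outcomes <- outcomes

outcomes <- character()
check_with_return(FALSE)
with_return_outcomes <- outcomes
```

In this model the first variant records a failure followed by a success, which mirrors the "Actually succeeded 1 times" report, while the second records only the failure.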

hadley · 2025-10-10

@stibu81 that's because the expectation style has changed since you started working on this PR 😬 I've updated your expectation to the new style and expect_failure(expect_in(3, 5)) now correctly passes.

stibu81 · 2025-10-10

Seems I picked a bad moment for this... 😆 Thanks for fixing it.

stibu81 added 3 commits September 18, 2025 22:19

add expect_not_contains() (r-lib#1851)

25959a4

add expect_not_in() (r-lib#1851)

0d18c57

update news

86ef654

replace expect_not_in() and expect_not_contains() by expect_disjoint()

eb9eca6

DavisVaughan approved these changes Oct 10, 2025


DavisVaughan changed the title ~~expect_not_contains() and expect_not_in()~~ Implement expect_disjoint() Oct 10, 2025

stibu81 and others added 4 commits October 10, 2025 20:41

resolve review comments from @DavisVaughan

100e4e0

Merged origin/main into stibu81-main

3d32674

Update expectation style

2f5a8a4

More style tweaks

e73be51

hadley merged commit ae5dda6 into r-lib:main Oct 10, 2025
14 of 15 checks passed

