Fix problem with completed observations blocking scheduling new ones #65

jnation3406 · 2025-07-17T00:18:03Z

This came from a problem Nikolaus reported last month: https://lcogt.slack.com/archives/C0ANPF49F/p1749233652771689.

The problem was that completed observations that finished early, such that their end time was still in the future past the current scheduling cutoff, would block other requests from getting scheduled on that resource until their scheduled end time, rather than allowing other requests to get scheduled there since it finished early. I believe this change should fix that behavior setting the internal observation end time for completed "running" observations to be the scheduling cutoff time for that run. We need to still have the completed observations in that list since that is used for other things like preventing a request that just completed from getting rescheduled that same run it completed in.

jchate6 · 2025-07-17T00:30:02Z

adaptive_scheduler/observations.py

+                    elif observation['state'] == 'COMPLETED':
+                        # If the observation is already completed, set its end time so we don't block
+                        # scheduling other observations using that end time as a reason
+                        observation['end'] = starts_before


Just to clarify, starts_before is always in the future when this is being run, correct?

In this case, starts_before is the estimated scheduling run completion time, always in the future usually ~1-5 min or so (based on how long a run takes). ends_after is when this current scheduling run began, probably 0-1 min in the past.

jchate6

makes sense. Thanks for updating this.

markBowman · 2025-07-17T17:05:13Z

If I'm reading it right, calling _get_running_observations() is modifying the schedule. This is an 'unexpected consequence' or 'hidden side effect' anti-pattern; a method called get shouldn't modify the underlying data. Maybe you could change the method name or feel free to ignore me because you know the scheduler infinitely better than I do.

markBowman

Approved with one question.

jnation3406 · 2025-07-17T17:09:53Z

If I'm reading it right, calling _get_running_observations() is modifying the schedule. This is an 'unexpected consequence' or 'hidden side effect' anti-pattern; a method called get shouldn't modify the underlying data. Maybe you could change the method name or feel free to ignore me because you know the scheduler infinitely better than I do.

_get_running_observations is getting the set of observations that are currently "running" based on their start/end times overlapping the current scheduling runs runtime. Modifying the observations pulled down from the API isn't modifying the actual observations in the Observation Portal (its not pushed back up) - its just a local modification because downstream we use the end time to decide when we can schedule on that resource this run, and I want to prevent us from using the end time when the observation is already complete. I understand it's a bit convoluted though - I was just trying to make the least changes possible to the scheduler to fix it so I have less risk of breaking other things.

markBowman · 2025-07-17T18:42:17Z

If I'm reading it right, calling _get_running_observations() is modifying the schedule. This is an 'unexpected consequence' or 'hidden side effect' anti-pattern; a method called get shouldn't modify the underlying data. Maybe you could change the method name or feel free to ignore me because you know the scheduler infinitely better than I do.

_get_running_observations is getting the set of observations that are currently "running" based on their start/end times overlapping the current scheduling runs runtime. Modifying the observations pulled down from the API isn't modifying the actual observations in the Observation Portal (its not pushed back up) - its just a local modification because downstream we use the end time to decide when we can schedule on that resource this run, and I want to prevent us from using the end time when the observation is already complete. I understand it's a bit convoluted though - I was just trying to make the least changes possible to the scheduler to fix it so I have less risk of breaking other things.

Yeah, I get that it's not changing the observation portal and I agree that minimum changes is good.
I approved the PR as it stands. - Future-you is probably less prone to confusion than future-me.

Fix problem with completed observations blocking scheduling new ones

7842cc4

jnation3406 requested review from jchate6 and markBowman July 17, 2025 00:18

jchate6 reviewed Jul 17, 2025

View reviewed changes

jchate6 approved these changes Jul 17, 2025

View reviewed changes

markBowman approved these changes Jul 17, 2025

View reviewed changes

jnation3406 merged commit 4cd235c into main Jul 22, 2025
11 checks passed

jnation3406 deleted the fix/adjust_scheduling_cutoff branch July 22, 2025 22:53

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix problem with completed observations blocking scheduling new ones #65

Fix problem with completed observations blocking scheduling new ones #65

Uh oh!

jnation3406 commented Jul 17, 2025

Uh oh!

jchate6 Jul 17, 2025

Uh oh!

jnation3406 Jul 17, 2025

Uh oh!

jchate6 left a comment

Uh oh!

markBowman commented Jul 17, 2025

Uh oh!

markBowman left a comment

Uh oh!

jnation3406 commented Jul 17, 2025

Uh oh!

markBowman commented Jul 17, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Fix problem with completed observations blocking scheduling new ones #65

Fix problem with completed observations blocking scheduling new ones #65

Uh oh!

Conversation

jnation3406 commented Jul 17, 2025

Uh oh!

jchate6 Jul 17, 2025

Choose a reason for hiding this comment

Uh oh!

jnation3406 Jul 17, 2025

Choose a reason for hiding this comment

Uh oh!

jchate6 left a comment

Choose a reason for hiding this comment

Uh oh!

markBowman commented Jul 17, 2025

Uh oh!

markBowman left a comment

Choose a reason for hiding this comment

Uh oh!

jnation3406 commented Jul 17, 2025

Uh oh!

markBowman commented Jul 17, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants