Fix Windows daemon restart and self-update flow by svarlamov · Pull Request #1076 · git-ai-project/git-ai

svarlamov · 2026-04-14T04:42:42Z

Summary

separate daemon shutdown into stop, restart, and restart-after-update actions
restart the daemon after max-uptime exits and let the Windows installer own restart timing after self-updates
fix the Windows installer reserved $PID loop bug and add a regression test

Testing

cargo test --package git-ai --test windows_install_script -- --nocapture
cargo fmt -- --check

Manual validation

reproduced the pre-fix Windows max-uptime failure with auto-updates and version checks explicitly enabled
reproduced the pre-fix Windows installer failure caused by the reserved PowerShell $PID loop variable
verified the fixed daemon now restarts on max uptime and takes the cached-update shutdown path on Windows

devin-ai-integration

✅ Devin Review: No Issues Found

Devin Review analyzed this PR and found no potential bugs to report.

View in Devin Review to see 4 additional findings.

devin-ai-integration

Devin Review found 1 new potential issue.

View 7 additional findings in Devin Review.

devin-ai-integration · 2026-04-14T06:36:34Z

        self.shutdown_condvar.notify_all();
    }


🟡 Explicit Shutdown control request does not reset shutdown_action, causing unintended restart

When the update check loop or uptime check calls request_restart() or request_restart_after_update(), they store a non-Stop value in shutdown_action before calling request_shutdown(). However, an explicit ControlRequest::Shutdown (e.g. from git-ai bg shutdown) at src/daemon.rs:7389 only calls request_shutdown(), which never resets shutdown_action back to Stop. If the timing aligns (update/uptime check sets a restart intent, then user sends an explicit shutdown before the daemon finishes draining), the daemon will still attempt to restart in src/commands/daemon.rs:199-200 despite the explicit stop request. The request_shutdown method at src/daemon.rs:3760 should reset shutdown_action to DaemonExitAction::Stop so that an explicit shutdown always overrides a previously-set restart intent.

(Refers to lines 3760-3773)

Prompt for agents

The problem is in ActorDaemonCoordinator::request_shutdown (src/daemon.rs around line 3760). This method sets shutting_down to true and notifies waiters, but it does NOT reset shutdown_action to DaemonExitAction::Stop. The request_restart and request_restart_after_update methods store a non-Stop value in shutdown_action BEFORE calling request_shutdown. An explicit ControlRequest::Shutdown (processed at src/daemon.rs:7389) calls request_shutdown() which preserves the restart intent. To fix: request_shutdown should reset shutdown_action to Stop, ensuring that any explicit shutdown overrides a previously set restart intent. The request_restart and request_restart_after_update methods already store their desired action before calling request_shutdown, so the ordering needs to be: request_restart stores the action, then request_shutdown would normally reset it — this won't work. Alternative approach: Instead of modifying request_shutdown, add a separate request_stop method that resets shutdown_action to Stop and then calls request_shutdown, and use it from the ControlRequest::Shutdown handler at line 7389. Or, reverse the order in request_restart/request_restart_after_update to call request_shutdown first and then store the action (using a compare-and-swap to avoid overwriting a Stop that was set by an explicit shutdown). The cleanest approach is probably to make the Shutdown control handler explicitly store DaemonExitAction::Stop before calling request_shutdown.

Was this helpful? React with 👍 or 👎 to provide feedback.

Patched this by adding an explicit request_stop() path and using it for ControlRequest::Shutdown, so a user-initiated stop now overrides any previously queued restart or restart-after-update intent. I also reran the daemon restart/update lifecycle tests locally after the change.

devin-ai-integration · 2026-04-21T16:04:56Z

🚩 Health check calls plain request_shutdown(), preserving a possibly stale RestartAfterUpdate action

The daemon_socket_health_check_loop at src/daemon.rs:7994 calls coordinator.request_shutdown() (the plain version) rather than coordinator.request_stop(). If the update check loop at src/daemon.rs:8033 had previously stored RestartAfterUpdate before the health check detected socket failure, the health check's spawn_self_restart() at src/daemon.rs:7983 would start a detached daemon AND handle_run at src/commands/daemon.rs:191-207 would also attempt a self-update+restart. The daemon lock file prevents two instances from running, and is_shutting_down() checks at src/daemon.rs:7956 and src/daemon.rs:7964 make the race window extremely narrow. Still, using request_stop() here (as the control listener does at src/daemon.rs:7607) would be more explicit about the health-check's intent.

Was this helpful? React with 👍 or 👎 to provide feedback.

devin-ai-integration · 2026-04-21T16:05:10Z

🚩 Bare request_shutdown() from error conditions preserves prior restart action

Internal error conditions in the trace ingest worker (lines 4604, 4626, 4644, 4797) and Windows pipe workers (lines 7254, 7269, 7447, 7460) call request_shutdown() rather than request_stop(). Since request_shutdown() does not modify shutdown_action, if the update check loop had previously set RestartAfterUpdate or Restart, these error-triggered shutdowns will preserve that action. This means the daemon could restart after a buffer overflow or duplicate sequence error. The window for this race is very narrow (update check sets action, then error occurs before shutdown completes), and a fresh daemon process wouldn't inherit the corrupt state. Still, if there's a concern about restart-cycling after persistent errors, the error call sites could be changed to request_stop() to force a clean stop.

Was this helpful? React with 👍 or 👎 to provide feedback.

CLAassistant · 2026-05-05T07:28:47Z

All committers have signed the CLA.

devin-ai-integration Bot reviewed Apr 14, 2026

View reviewed changes

svarlamov force-pushed the codex/windows-daemon-restart-upgrade-fix branch 5 times, most recently from d4e99ce to 19515ab Compare April 21, 2026 15:28

devin-ai-integration Bot reviewed Apr 21, 2026

View reviewed changes

svarlamov added 5 commits May 5, 2026 04:45

Fix Windows daemon restart and self-update flow

577ae1a

Fix daemon restart behavior in tests and coverage

107ef67

Honor explicit daemon shutdown over restart intent

f5b7c94

Add regression test for explicit daemon stop override

eef5de0

Recover Windows daemon after detached update failures

d6e61e2

svarlamov force-pushed the codex/windows-daemon-restart-upgrade-fix branch from 19515ab to d6e61e2 Compare May 5, 2026 04:59

Fix Windows path assertion in bash provenance test

31f33dc

svarlamov force-pushed the codex/windows-daemon-restart-upgrade-fix branch from a4a405f to 31f33dc Compare May 5, 2026 07:29

svarlamov merged commit 9d0ad9a into main May 5, 2026
29 checks passed

svarlamov deleted the codex/windows-daemon-restart-upgrade-fix branch May 5, 2026 15:23

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix Windows daemon restart and self-update flow#1076

Fix Windows daemon restart and self-update flow#1076
svarlamov merged 6 commits into
mainfrom
codex/windows-daemon-restart-upgrade-fix

svarlamov commented Apr 14, 2026 •

edited by devin-ai-integration Bot

Loading

Uh oh!

devin-ai-integration Bot left a comment

Uh oh!

devin-ai-integration Bot left a comment

Uh oh!

devin-ai-integration Bot Apr 14, 2026

Uh oh!

svarlamov Apr 14, 2026

Uh oh!

devin-ai-integration Bot Apr 21, 2026 •

edited

Loading

Uh oh!

devin-ai-integration Bot Apr 21, 2026 •

edited

Loading

Uh oh!

CLAassistant commented May 5, 2026 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

svarlamov commented Apr 14, 2026 • edited by devin-ai-integration Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Testing

Manual validation

Uh oh!

devin-ai-integration Bot left a comment

Choose a reason for hiding this comment

✅ Devin Review: No Issues Found

Uh oh!

devin-ai-integration Bot left a comment

Choose a reason for hiding this comment

Uh oh!

devin-ai-integration Bot Apr 14, 2026

Choose a reason for hiding this comment

Uh oh!

svarlamov Apr 14, 2026

Choose a reason for hiding this comment

Uh oh!

devin-ai-integration Bot Apr 21, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

devin-ai-integration Bot Apr 21, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

CLAassistant commented May 5, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

svarlamov commented Apr 14, 2026 •

edited by devin-ai-integration Bot

Loading

devin-ai-integration Bot Apr 21, 2026 •

edited

Loading

devin-ai-integration Bot Apr 21, 2026 •

edited

Loading

CLAassistant commented May 5, 2026 •

edited

Loading