Always take th->interrupt_lock in ubf_clear

luke-gruber · luke-gru · commit d72a0fed6019 · 2026-03-11T09:24:18.000-04:00
Patch 0837263 fixed a race condition on ubfs, but that fix is only valid if we can assume that, once a call to `ubf_clear` has returned, the ubf function cannot be in the middle of running. This patch removes an optimization in `ubf_clear` that violates that assumption. In short, `ubf_clear` needs to take `th->interrupt_lock` unconditionally, both to avoid deadlocks and to make it possible to reason about when ubfs can run.

This should fix CI errors like https://ci.rvm.jp/results/trunk-jemalloc@ruby-sp2-noble-docker/6242153.

The error was in test_timeout.rb, which deadlocked during VM shutdown:

```ruby
r = Ractor.new do
  begin
    Timeout.timeout(0.1) { sleep }
  rescue Timeout::Error
    :ok
  end
end.value
assert_equal :ok, r
```

The deadlock happened during `rb_ractor_terminate_interrupt_main_thread` with 2 ractors:

1) r1 t1: UBF called with t2->interrupt_lock held (ubf = ubf_waiting)
2) r2 t2: ubf cleared from previous thread_sched_wait_events_call (but no lock taken, because of the optimization)
3) r2 t2: thread_sched_wait_events: acquire thread_sched_lock(t2) (caller calling native_sleep() in a loop)
4) r2 t2: ubf_set: try to acquire t2->interrupt_lock [block]
5) r1 t1: try to acquire thread_sched_lock(t2) [block, deadlock]

t2 needs to block on t2->interrupt_lock in step 2 until the ubf has completed. Only then can it register a new ubf in the next `native_sleep` iteration.
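The guarantee described above can be illustrated with a small standalone pthread model. This is only a sketch, not Ruby's code: `toy_thread`, `toy_ubf`, `interrupter`, `clear_if_set`, and `clear_always` are hypothetical names, and the model assumes a ubf that unregisters itself (sets `func` to NULL) before it has finished its work, which is the situation in which the unlocked check would skip the lock. The `func` field is made atomic here only so that the unlocked peek is well-defined in the model.

```c
#include <assert.h>
#include <pthread.h>
#include <sched.h>
#include <stdatomic.h>
#include <stdbool.h>
#include <stddef.h>

typedef void unblock_fn(void *);

/* Toy stand-in for the rb_thread_t fields touched by ubf_set/ubf_clear. */
struct toy_thread {
    pthread_mutex_t interrupt_lock;
    _Atomic(unblock_fn *) func;
    void *arg;
    atomic_bool ubf_entered, release_ubf, ubf_done;
};

static void spin_until(atomic_bool *flag)
{
    while (!atomic_load(flag)) sched_yield();
}

/* Assumed ubf shape: unregisters itself first, then keeps working. */
static void toy_ubf(void *p)
{
    struct toy_thread *th = p;
    atomic_store(&th->func, NULL);
    atomic_store(&th->ubf_entered, true);
    spin_until(&th->release_ubf);        /* "work" still in progress */
    atomic_store(&th->ubf_done, true);
}

/* Interrupting side: runs the ubf while holding interrupt_lock. */
static void *interrupter(void *p)
{
    struct toy_thread *th = p;
    pthread_mutex_lock(&th->interrupt_lock);
    unblock_fn *f = atomic_load(&th->func);
    if (f) f(th->arg);
    pthread_mutex_unlock(&th->interrupt_lock);
    return NULL;
}

/* Shape of the removed optimization: skip the lock when nothing is set. */
static void clear_if_set(struct toy_thread *th)
{
    if (atomic_load(&th->func)) {
        pthread_mutex_lock(&th->interrupt_lock);
        atomic_store(&th->func, NULL);
        th->arg = NULL;
        pthread_mutex_unlock(&th->interrupt_lock);
    }
}

/* Shape of the patched version: always take the lock. */
static void clear_always(struct toy_thread *th)
{
    pthread_mutex_lock(&th->interrupt_lock);
    atomic_store(&th->func, NULL);
    th->arg = NULL;
    pthread_mutex_unlock(&th->interrupt_lock);
}

/* Returns whether the ubf had finished by the time the clear returned. */
static bool ubf_done_when_clear_returns(bool always_lock)
{
    struct toy_thread th;
    pthread_t tid;
    pthread_mutex_init(&th.interrupt_lock, NULL);
    atomic_init(&th.func, toy_ubf);
    th.arg = &th;
    atomic_init(&th.ubf_entered, false);
    /* In the always-lock run the ubf may finish on its own; the clear waits. */
    atomic_init(&th.release_ubf, always_lock);
    atomic_init(&th.ubf_done, false);

    int rc = pthread_create(&tid, NULL, interrupter, &th);
    assert(rc == 0);
    spin_until(&th.ubf_entered);   /* ubf is running under interrupt_lock */

    if (always_lock) clear_always(&th); else clear_if_set(&th);
    bool done = atomic_load(&th.ubf_done);

    atomic_store(&th.release_ubf, true);
    pthread_join(tid, NULL);
    pthread_mutex_destroy(&th.interrupt_lock);
    return done;
}
```

In this model, `ubf_done_when_clear_returns(false)` reports that the conditional clear returned while the ubf was still running, whereas `ubf_done_when_clear_returns(true)` reports that the unconditional clear returned only after the ubf had completed. The second behavior is the property the earlier race fix relies on.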
diff --git a/thread_pthread.c b/thread_pthread.c
@@ -1055,14 +1055,12 @@ ubf_set(rb_thread_t *th, rb_unblock_function_t *func, void *arg)
 static void
 ubf_clear(rb_thread_t *th)
 {
-    if (th->unblock.func) {
-        rb_native_mutex_lock(&th->interrupt_lock);
-        {
-            th->unblock.func = NULL;
-            th->unblock.arg  = NULL;
-        }
-        rb_native_mutex_unlock(&th->interrupt_lock);
+    rb_native_mutex_lock(&th->interrupt_lock);
+    {
+        th->unblock.func = NULL;
+        th->unblock.arg  = NULL;
     }
+    rb_native_mutex_unlock(&th->interrupt_lock);
 }
 
 static void
