Amortizing tokio's global queue acquisitions
On the tokio multi-thread scheduler’s worst-case benchmark, pulling tasks from the inject queue in batches rather than one at a time reduces latency by 92%, because each acquisition of the global queue is amortized over a whole batch. The change reuses a batch-pop helper already present in the idle path. Each batch is capped at 32 tasks so that local work is not buried behind tasks just moved over from the remote queue.
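
To make the amortization concrete, here is a minimal sketch of a capped batch pop. It is not tokio's actual code: `InjectQueue`, `pop_batch`, and `MAX_BATCH` are hypothetical names, and a `Mutex<VecDeque<T>>` stands in for the real inject queue. The only details taken from the description above are that tasks are moved in batches and that a batch is capped at 32.

```rust
use std::collections::VecDeque;
use std::sync::Mutex;

/// Hypothetical cap on tasks moved per acquisition (32, as in the text).
const MAX_BATCH: usize = 32;

/// Stand-in for the shared inject queue: a mutex-guarded FIFO.
/// (Assumption for illustration; not tokio's real data structure.)
struct InjectQueue<T> {
    inner: Mutex<VecDeque<T>>,
}

impl<T> InjectQueue<T> {
    fn new() -> Self {
        InjectQueue { inner: Mutex::new(VecDeque::new()) }
    }

    /// Enqueue one task at the back of the shared queue.
    fn push(&self, task: T) {
        self.inner.lock().unwrap().push_back(task);
    }

    /// Take up to `MAX_BATCH` tasks under a single lock acquisition.
    /// The first task is returned to run immediately; the remainder are
    /// appended to the worker's `local` queue, behind any existing work.
    fn pop_batch(&self, local: &mut VecDeque<T>) -> Option<T> {
        let mut queue = self.inner.lock().unwrap();
        let n = queue.len().min(MAX_BATCH);
        let mut batch = queue.drain(..n);
        let first = batch.next()?; // empty queue: nothing to move
        local.extend(batch);
        Some(first)
    }
}
```

With 40 queued tasks, the first call to `pop_batch` returns one task and moves 31 more to the local queue in a single lock acquisition; the second call drains the remaining 8. Without the cap, one call could place an unbounded number of formerly remote tasks in the local queue.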