The in-progress rollback of a node configuration change appears to have put enough pressure on autoscaling to increase delays for cold boots and scale-outs. Once we deleted many of the crashing pods, queues dropped rapidly.
L40S and CPU instances are properly pulling images again. All nodes are functioning as expected.