Engineered a distributed task orchestration engine solving the double-execution problem for time-sensitive background jobs. Implemented a two-layer synchronization model: Redis-based distributed leader election for single-scheduler guarantee, and PostgreSQL FOR UPDATE SKIP LOCKED for concurrent multi-worker queue draining without contention. Built a self-healing Janitor service for zombie task recovery from crashed nodes and an exponential backoff algorithm to prevent thundering herd scenarios.