And I am wondering, why use an ESP32 if you don't need the WiFi? (And, please, no WiFi in a toy!)
Currently we connect to a WiFi network to reach the Deno edge server. Some popular toys already do the same, for example Yoto and Toniebox.