I had no idea WebVoyager only spanned 15 websites lol... the 452 figure you have still seems a little low though - do you have plans to expand it? It seems like you'd want as many sites as possible to improve the real-world accuracy of agents due to the long tail nature of website traffic
We definitely plan to expand it. I want to get to ~10,000 for a reasonable benchmark.
15 blew my mind -- it's too easy to overfit that dataset