Using machine learning to balance arena

Plebzgosplat-skullcrusher · December 15, 2020, 3:29pm

Free ideas for blizz to try.

Blizzard is a business which invests significantly, whether it appears so or not, in beta testing of this game. They have devs, testers, designers, etc who are all trying to create a pleasant experience and retain, if not grow, the playerbase. Having a poor game experience while over-investing in development resources, who use stale analytics techniques to balance the game in the modern era, are both bad for business.

From a business perspective, there is valuable business data being created daily by arena matches at all levels. This data could be analyzed the old way, humans or data scientists looking at simple rating charts (% of rogues over 2200 for example) and turning knobs based on high level inputs, or it could be actually put into modern machine learning models to remove the human aspect, and their bias, from the analysis.

For example to balance 3v3s blizzard could build a predictive/regressive model that would take the classes as inputs as well as the individual performance of each player each match (number of kicks landed, cc chain duration, damage mitigated by defensive cds, etc), and outputs a prediction of which team will win. They have ample data to train this model against, and if they are able to produce a model that’s even 90% accurate (which is very low by today’s AI standards), they would actually have a tool to use when they decide ‘hey let’s nerf this by 50%’ or something.

In that case, they would simulate the nerf, run that class change through the model 100k times (or some large number), and determine if they have overtuned or undertuned with their proposed change.

This may seem complicated and very foreign, but this is truly not revolutionary, and blizzard is one of the businesses that truly has access to all the data they could ever want to perform tuning with, and if they are not using machine learning to tune their game in 2020, they are squandering a great opportunity.

signed,

a computer scientist.

Akeu-kelthuzad · December 15, 2020, 3:47pm

“Simulate the nerf” is the actual hard part of doing this. You would need to simulate actual 3v3 combat, not just throw classes as labels into some simple model.
100k/year+ for an ML researcher to build a complex model of a type that took openAI/Deepmind years to perfect and relied on saved replay data (and in Deepmind’s case Blizzard explicitly making a research binary for faster training). Add in a few more team members plus the physical cost of training a complex model, and it’s definitely not free.
Human players who play things like the beta naturally explore the changed regions of the feature space. You can’t take a pre-trained model, throw it a new bone, and expect it to use that bone optimally. This would mean that such a model could only ever be provably useful for predicting number tuning balance changes as opposed to reworks. Unless transfer learning makes some large leaps you’re going to have to retrain the model from a baseline each time there’s a significant class/item change (like at least every patch).

Acerola-darkspear · December 15, 2020, 3:54pm

Dont blizz cooperate with google before? for the star craft AI or some thing, sorry i dont really remanber what it is.

Angryjuice-frostmourne · December 15, 2020, 4:02pm

Cool idea, would be nice to see them try to quantify and list all the elements that go into arena. Kinda fun thought experiment.

I guess ‘positive’ traits in the arena would be things like:

CC landed * difficulty of CC to land (super hard in itself)
Defensive CD’s used during opponents offensive CD’s
CC landed on target with CD’s up
Enemy casts made to fail due to line of sight or interrupted by one of your abilities
Impactful buffs purged
Dispels ranked by value of what is being dispelled
HPS
DPS
Damage landed while target stunned or while enemy healer is CC’d

I mean I’m pretty sure you could eventually make a comprehensive ruleset for your AI to follow, it’s not unthinkable.

Akeu-kelthuzad · December 15, 2020, 4:04pm

Yeah Deepmind was acquired by google some time ago. AlphaStar (Deepmind’s Starcraft AI) was developed with some help from Blizzard in making a version of the game efficient to train on. The AlphaStar team ran into trouble where they needed their trained models to behave “like humans”, but you can see the AI doing wacky things even on SC2 ladder games.

That being said, deepmind is focused on solving more important problems than WoW arena balance. The SCII was just a proof of concept for them. Here’s their summary of the project if you want to read more about it: https://deepmind.com/blog/article/alphastar-mastering-real-time-strategy-game-starcraft-ii