Alpha GO

A computer program called AlphaGo has beaten Lee Sedol, a master player In the ancient Chinese board game of Go in a landmark moment for AI.

In Go two player take turns in placing black or white stones on a 19-by-19 grid. To win you need to surround your opponent's pieces. It is a more complex challenge for a computer than chess - there are more possible positions in Go than atoms in the universe - and players rely on human instinct as much as strategy. Most experts had considered defeating a Go champion beyond the present capacity of artificial intelligence.

When IBM's Deep Blue supercomputer beat world chess champion Gerry Kasparov in 1907, the victory was not due to human-like intelligence: it won by the brute force of its processing power, searching through millions of moves in seconds.However, the complexity of Go requires a completely different approach.Processing power is not enough to beat a Go champion of the calibre of Lee Sedol.

AlphaGo, developed by a company called DeepMind, was designed to adopt an intuitive, human-like style of gameplay. It followed a technique called "deep learning", which imitates the workings of the human brain to allow a program to develop by teaching itself through trial and error. Rather than programming rules and strategies, DeepMind designer taught AlphaGo to play the game by feeding it data based on typical Go moves. Then AlphaGo played against itself, millions of times, learning from its mistakes and getting better each time. Deep learning changes the relationship humans have with their technological creations, It results in Al that displays surprising and unpredictable behaviour Lee Sedol described being stunned by one unconventional move from AlphaGo which no human would ever have made. In traditional computer programming the aim is to anticipate all possible eventualities and a surprise means that something needs to be fixed. With deep learning, unpredictability and surprises may indicate that the program has found a better way of doing something than its human creators had ever imagined. In the coming years, AlphaGo will continue to learn and to improve, surprising and teaching humans new moves and strategies.

Other Al programs, also based on deep leaming and unpredictability, could be designed to overtake human abilities in highly complex tasks: diagnosing illness, managing power grids, protecting against cyber threat. All of this has enormous implications for the future development of Al and our ability to control it.

Why s GO described as the 'Holy Grail' (Santo Graal) of computer games?

Last updated