Mastering the game of Stratego with model-free multiagent reinforcement learning.


Journal

Science (New York, N.Y.)
ISSN: 1095-9203
Titre abrégé: Science
Pays: United States
ID NLM: 0404511

Informations de publication

Date de publication:
02 12 2022
Historique:
entrez: 1 12 2022
pubmed: 2 12 2022
medline: 6 12 2022
Statut: ppublish

Résumé

We introduce DeepNash, an autonomous agent that plays the imperfect information game Stratego at a human expert level. Stratego is one of the few iconic board games that artificial intelligence (AI) has not yet mastered. It is a game characterized by a twin challenge: It requires long-term strategic thinking as in chess, but it also requires dealing with imperfect information as in poker. The technique underpinning DeepNash uses a game-theoretic, model-free deep reinforcement learning method, without search, that learns to master Stratego through self-play from scratch. DeepNash beat existing state-of-the-art AI methods in Stratego and achieved a year-to-date (2022) and all-time top-three ranking on the Gravon games platform, competing with human expert players.

Identifiants

pubmed: 36454847
doi: 10.1126/science.add4679
doi:

Types de publication

Journal Article Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Pagination

990-996

Auteurs

Julien Perolat (J)

DeepMind Technologies Ltd., London, UK.

Bart De Vylder (B)

DeepMind Technologies Ltd., London, UK.

Daniel Hennes (D)

DeepMind Technologies Ltd., London, UK.

Eugene Tarassov (E)

DeepMind Technologies Ltd., London, UK.

Florian Strub (F)

DeepMind Technologies Ltd., London, UK.

Vincent de Boer (V)

DeepMind Technologies Ltd., London, UK.

Paul Muller (P)

DeepMind Technologies Ltd., London, UK.

Jerome T Connor (JT)

DeepMind Technologies Ltd., London, UK.

Neil Burch (N)

DeepMind Technologies Ltd., London, UK.

Thomas Anthony (T)

DeepMind Technologies Ltd., London, UK.

Stephen McAleer (S)

DeepMind Technologies Ltd., London, UK.

Romuald Elie (R)

DeepMind Technologies Ltd., London, UK.

Sarah H Cen (SH)

DeepMind Technologies Ltd., London, UK.

Zhe Wang (Z)

DeepMind Technologies Ltd., London, UK.

Audrunas Gruslys (A)

DeepMind Technologies Ltd., London, UK.

Aleksandra Malysheva (A)

DeepMind Technologies Ltd., London, UK.

Mina Khan (M)

DeepMind Technologies Ltd., London, UK.

Sherjil Ozair (S)

DeepMind Technologies Ltd., London, UK.

Finbarr Timbers (F)

DeepMind Technologies Ltd., London, UK.

Toby Pohlen (T)

DeepMind Technologies Ltd., London, UK.

Tom Eccles (T)

DeepMind Technologies Ltd., London, UK.

Mark Rowland (M)

DeepMind Technologies Ltd., London, UK.

Marc Lanctot (M)

DeepMind Technologies Ltd., London, UK.

Jean-Baptiste Lespiau (JB)

DeepMind Technologies Ltd., London, UK.

Bilal Piot (B)

DeepMind Technologies Ltd., London, UK.

Shayegan Omidshafiei (S)

DeepMind Technologies Ltd., London, UK.

Edward Lockhart (E)

DeepMind Technologies Ltd., London, UK.

Laurent Sifre (L)

DeepMind Technologies Ltd., London, UK.

Nathalie Beauguerlange (N)

DeepMind Technologies Ltd., London, UK.

Remi Munos (R)

DeepMind Technologies Ltd., London, UK.

David Silver (D)

DeepMind Technologies Ltd., London, UK.

Satinder Singh (S)

DeepMind Technologies Ltd., London, UK.

Demis Hassabis (D)

DeepMind Technologies Ltd., London, UK.

Karl Tuyls (K)

DeepMind Technologies Ltd., London, UK.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH