Jugador de póquer de computadora

Un jugador de póquer por computadora es un programa de computadora diseñado para jugar al póquer contra oponentes humanos u otros oponentes de computadora. Se le conoce comúnmente como pokerbot o simplemente bot .

En Internet

Estos bots o programas de computadora se utilizan a menudo en situaciones de póquer en línea como oponentes legítimos para jugadores humanos o como una forma de hacer trampa . Si el uso de un bot constituye trampa o no, normalmente lo define la sala de póquer que alberga los juegos de póquer reales. La mayoría (si no todas) las salas de juegos prohíben el uso de bots, aunque el nivel de aplicación de los operadores del sitio varía considerablemente.

Bots de jugador

El tema de los robots de los jugadores y la asistencia informática, mientras se juega al póquer en línea, es muy controvertido. La opinión de los jugadores es bastante variada cuando se trata de decidir qué tipos de software de computadora entran en la categoría de ventaja injusta. Uno de los factores principales para definir un bot es si el programa de computadora puede interactuar con el cliente de póquer (en otras palabras, jugar solo) sin la ayuda de su operador humano. Se dice que los programas de computadora con esta capacidad tienen o son un reproductor automático y están universalmente definidos para estar en la categoría de bots independientemente de lo bien que jueguen al póquer.

La cuestión de la ventaja injusta tiene mucho que ver con los tipos de información e inteligencia artificial disponibles para el programa de computadora . Además, los bots pueden jugar durante muchas horas seguidas sin debilidades humanas como la fatiga y pueden soportar las variaciones naturales del juego sin verse influenciados por las emociones humanas (o " inclinación "). Por otro lado, los bots tienen algunas desventajas significativas; por ejemplo, es muy difícil para un bot leer con precisión un farol o adaptarse a la estrategia de los oponentes como lo hacen los humanos.

Aplicación de la casa

Si bien los términos y condiciones de los sitios de póquer generalmente prohíben el uso de bots, el nivel de cumplimiento depende del operador del sitio. Algunos buscarán y prohibirán a los usuarios de bots mediante el uso de una variedad de herramientas de software. El cliente de póquer puede programarse para detectar bots, aunque esto es controvertido por derecho propio, ya que podría considerarse equivalente a incrustar software espía en el software del cliente. ^{[ cita requerida ]} Otro método es utilizar CAPTCHA a intervalos aleatorios durante el juego.

Bots de la casa

El tema de los bots domésticos es aún más controvertido debido al conflicto de intereses que potencialmente plantea. Según la definición más estricta, un bot de la casa es un jugador automatizado operado por la propia sala de póquer en línea, aunque algunos definirían ejemplos más indirectos (por ejemplo, un jugador que opera bots con el conocimiento y consentimiento del operador) como "bots de la casa" como bien. Este tipo de bots sería el equivalente a cómplices de ladrillo y mortero .

En un casino de ladrillo y cemento, un jugador de la casa no subvierte la imparcialidad del juego que se ofrece siempre que la casa sea honesta. En un entorno en línea, lo mismo también es cierto. Por definición, una sala de póquer en línea honesta que opta por operar bots de la casa garantizaría que los bots de la casa no tuvieran acceso a ninguna información que no esté disponible para ningún otro jugador en la mano (lo mismo se aplicaría a cualquier cómplice humano ). El problema es que en un entorno en línea, la casa no tiene forma de demostrar que sus bots no están recibiendo información confidencial del servidor de tarjetas. Esto se ve agravado por la facilidad con la que se puede compartir información clandestinamente en un entorno digital. Es esencialmente imposible, incluso para la casa, demostrar que no controlan a algunos jugadores; probablemente la única forma real que se podría hacer sería divulgar la información personal confidencial de cada jugador y eso, obviamente, no se puede hacer debido a consideraciones de privacidad .

Inteligencia artificial

El póquer es un juego de información imperfecta (porque algunas cartas en juego están ocultas), lo que dificulta que cualquiera (incluida una computadora) pueda deducir el resultado final de la mano. Debido a esta falta de información, los programadores de la computadora tienen que implementar sistemas basados en el teorema de Bayes , el equilibrio de Nash , la simulación de Monte Carlo o las redes neuronales , todas las cuales son técnicas imperfectas.

Las IA como PokerSnowie y Claudico se han creado permitiendo que la computadora determine la mejor estrategia posible al permitirle jugar sola una enorme cantidad de veces. Este parece ser el enfoque actual de la IA del póquer, en lugar de intentar hacer una computadora que juegue como un humano. Esto da como resultado un tamaño de apuesta extraño y una estrategia muy diferente a la que los humanos están acostumbrados a ver.

Se están desarrollando métodos para al menos aproximarse a la estrategia de póquer perfecta desde la perspectiva de la teoría del juego en el juego mano a mano (dos jugadores), y se están creando cada vez más buenos sistemas para el juego multijugador. La estrategia perfecta tiene múltiples significados en este contexto. Desde el punto de vista óptimo de la teoría del juego, una estrategia perfecta es aquella que no puede esperar perder frente a la estrategia de ningún otro jugador; sin embargo, la estrategia óptima puede variar en presencia de jugadores subóptimos que tienen debilidades que pueden explotarse. En este caso, una estrategia perfecta sería aquella que modele de manera correcta o cercana esas debilidades y las aproveche para obtener ganancias, como las explicadas anteriormente.

La IA logró un rendimiento sobrehumano en el póquer durante la década de 2010, con la siguiente línea de tiempo. En 2015, las computadoras resolvieron el heads-up limit hold'em a través de Cepheus . Alrededor de 2018, Libratus demostró una habilidad sobrehumana en el heads-up no limit hold'em. En 2019, Pluribus demostró una habilidad sobrehumana en el hold'em sin límite para seis jugadores, la variedad única de póquer más comúnmente jugada en el mundo. ^[1]

Grupos de investigación

Grupo de investigación de póquer informático (Universidad de Alberta, Canadá)

Una gran parte de la investigación sobre los jugadores de póquer por computadora se está realizando en la Universidad de Alberta por el Grupo de Investigación de Póquer por Computadora, dirigido por el Dr. Michael Bowling. El grupo desarrolló los agentes Poki , PsOpti , Hyperborean y Polaris . Poki ha obtenido la licencia para el juego de entretenimiento STACKED con el jugador de póquer canadiense Daniel Negreanu . PsOpti estaba disponible bajo el nombre "SparBot" en el programa de entrenamiento de póquer "Poker Academy". La serie de programas hiperbóreos ha competido en la Competencia Anual de Póquer por Computadora, y más recientemente se llevó tres medallas de oro de seis eventos en la competencia de 2012. La misma línea de investigación también produjo Polaris , que jugó contra profesionales humanos en 2007 y 2008, y se convirtió en el primer programa de póquer de computadora en ganar una competencia de póquer significativa.

In January 2015, an article in Science^[2] by Michael Bowling, Neil Burch, Michael Johanson, and Oskari Tammelin claimed that their poker bot Cepheus had "essentially weakly solved" the game of heads-up limit Texas hold 'em.^[3]^[4]^[5]

School of Computer Science from Carnegie Mellon University

T. Sandholm and A. Gilpin from Carnegie Mellon University have started poker AI research in 2004 beginning with unbeatable agent for 3-card game called Rhode-Island Hold 'em. Next step was GS1 which outperformed the best commercially available poker bots. Since 2006 poker agents from this group have participated in annual computer competitions. "At some point we will have a program better than the best human players" – claims Sandholm. His bot, Claudico, faced off against four human opponents in 2015. In 2017 the program's latest software, Libratus, faced off against four professional poker players. By the end of the experiment the four human players had lost a combined $1.8 million of simulated money.^[6]

The University of Auckland Game AI Group

A team from the University of Auckland consists of a small number of scientists who employ case-based reasoning to create and enhance Texas Hold’em poker agents. The group applies different AI techniques to a number of games including participation in the commercial projects Small Worlds and Civilization (video game).

Neo Poker Laboratory

Neo Poker Lab is an established science team focused on the research of poker artificial intelligence. For several years it has developed and applied state-of-the-art algorithms and procedures like regret minimization and gradient search equilibrium approximation, decision trees, recursive search methods as well as expert algorithms to solve a variety of problems related to the game of poker.

Concursos históricos

ICCM 2004 PokerBot competition

One of the earliest no-limit poker bot competitions was organized in 2004 by International Conference on Cognitive Modelling.^[7] The tournament hosted five bots from various universities from around the world. The winner was Ace Gruber, from University of Toronto.^[8]

ACM competitions

The Association for Computing Machinery (ACM) has hosted competitions where the competitors submit a piece of software capable of playing poker on their specific platform. The event hosts conduct the contests by operating the software and reporting the results.^{[citation needed]}

The 2005 World Series of Poker Robots

In the summer 2005, the online poker room Golden Palace hosted a promotional tournament in Las Vegas, at the old Binions, with a $100k giveaway prize. It was billed as the 2005 World Series of Poker Robots. The tournament was bots only with no entry fee. The bot developers were computer scientists from six nationalities who traveled at their own expense. The host platform was Poker Academy. The event also featured a demonstration headsup event with Phil Laak.

University of Alberta's Man V Machine experiments

In the summer 2007, the University of Alberta hosted a highly specialized headsup tournament between humans and their Polaris bot, at the AAAI conference in Vancouver, BC, Canada. The host platform was written by the University of Alberta. There was a $50k maximum giveaway purse with special rules to motivate the humans to play well. The humans paid no entry fee. The unique tournament featured four duplicate style sessions of 500 hands each. The humans won by a narrow margin.

In the summer of 2008, the University of Alberta and the poker coaching website Stoxpoker ran a second tournament during the World Series of Poker in Las Vegas. The tournament had six duplicate sessions of 500 hands each, and the human players were Heads-Up Limit specialists. Polaris won the tournament with 3 wins, 2 losses and a draw. The results of the tournament, including the hand histories from the matches, are available on the competition website.

The 2015 Brains vs AI competition by Rivers Casino, CMU and Microsoft

From April–May 2015, Carnegie Mellon University Sandholm's latest bot, Claudico, faced off against four human opponents, in a series of no-limit Texas Hold'em matches.^[9]^[10] Finally, after playing 80,000 hands, humans were up by a combined total of $732,713. But even though humans technically won, scientists considered the win as statistically insignificant (rather, a statistical tie) when that $732,713 is compared to the total betting amount of $170,000,000 ($170 million). However, some have determined this claim to be disingenuous.^[11] Statistically insignificant here means that the programmers of Claudico can not say with 95% confidence (a 95% confidence interval) that humans are better than the computer program. However, it is a statistically significant win on a 90% confidence interval. This means that the human players are somewhere between a 10 to 1 and 20 to 1 favorite.^[12]

The way the tournament was structured was in two sets of two players each. In each of the two sets, the players got the opposite cards. Meaning if the computer has As9c (Ace of Spades & Nine of Clubs) and the human has Jh8d on one computer, the other of the two players in the set will have As9c up against the computer's Jh8d. However, even with the human players winning more than the computer—not all of the players were positive in their head to head match ups.

The totals for each of the players winnings were as follows:

Douglas Polk: +$213,671
Dong Kim: +$70,491
Bjorn Li: +$529,033
Jason Les: -$80,482^[13]

The Annual Computer Poker Competition

Since 2006, the Annual Computer Poker Competition has run a series of competitions for poker programs. Since 2010, three types of poker were played: Heads-Up Limit Texas Hold'em, Heads-Up No-Limit Texas Hold'em, and 3-player Limit Texas Hold'em. Within each event, two winners are named: the agent that wins the most matches (Bankroll Instant Run-off), and the agent that wins the most money (Total Bankroll). These winners are often not the same agent, as Bankroll Instant Run-off rewards robust players, and Total Bankroll rewards players that are good at exploiting the other agents' mistakes. The competition is motivated by scientific research, and there is an emphasis on ensuring that all of the results are statistically significant by running millions of hands of poker. The 2012 competition had the same formats with more than 70 million hands played to eliminate luck factor.

Some researchers developed web application where people could play and assess quality of the AI. So as of December 2012 the following top groups and individual researchers’ agents could be found:

Hyperborean (9 gold, 5 silver and 3 bronze)
Bluffbot (1 gold, 3 silver and 2 bronze medals)
Sartre (1 gold, 5 silver and 3 bronze medals)
Neo Poker Bot (1 gold, 5 bronze medals)

Results

2010 ^[14]
Heads-up Limit Texas Hold'em
Total Bankroll	Bankroll Instant Run-off
1. PULPO (Marv Andersen, UK) 2. Hyperborean-TBR (University of Alberta, Canada) 3. Sartre (University of Auckland, New Zealand)	1. Rockhopper (David Lin, USA) 2. GGValuta (Mihai Ciucu, Romania) 3. Hyperborean-IRO (University of Alberta, Canada)
Heads Up No Limit Texas Hold'em
Total Bankroll	Bankroll Instant Run-off
1. Tartanian4-TBR (Carnegie Mellon University, USA) 2. PokerBotSLO (Universities of Maribor & Ljubljana, Slovenia) 3. HyperboreanNL-TBR (University of Alberta, Canada)	1. HyperboreanNL-IRO (University of Alberta, Canada) 2. SartreNL (University of Auckland, New Zealand) 3. Tartanian4-IRO (Carnegie Mellon University, USA)
3-max Limit Texas Hold'em
Total Bankroll	Bankroll Instant Run-off
1. Hyperborean3P-TBR (University of Alberta, Canada) 2. LittleRock (Rod Byrnes, Australia) 3. Bender (Technical University Darmstadt, German)	1. Hyperborean3P-IRO (University of Alberta, Canada) 2. dcu3pl-IRO (Dublin City University, Ireland) 3. LittleRock (Rod Byrnes, Australia)
2011 ^[15] ^[16]
Heads-up Limit Texas Hold'em
Total Bankroll	Bankroll Instant Run-off
1. Calamari (Marv Andersen, UK) 2. Sartre (University of Auckland, New Zealand) 3. Hyperborean-2011-2p-limit-tbr (University of Alberta, Canada)	1. Hyperborean-2011-2p-limit-iro (University of Alberta, Canada) 2. Slumbot (Eric Jackson, USA) 3. Calamari (Marv Andersen, UK)
Heads Up No Limit Texas Hold'em
Total Bankroll	Bankroll Instant Run-off
1. Lucky7 (Mikrospin d.o.o., Slovenia) 2. SartreNL (University of Auckland, New Zealand) 3. Hyperborean-2011-2p-nolimit-tbr (University of Alberta, Canada)	1. Hyperborean-2011-2p-nolimit-iro (University of Alberta, Canada) 2. SartreNL (University of Auckland, New Zealand) 3. Hugh (USA & Canada)
3-max Limit Texas Hold'em
Total Bankroll	Bankroll Instant Run-off
1. Sartre3p (University of Auckland, New Zealand) 2. Hyperborean-2011-3p-limit-tbr (University of Alberta, Canada) 3. AAIMontybot (Charles University in Prague, Czech Republic) 3. LittleRock (Rod Byrnes, Australia)	1. Hyperborean-2011-3p-limit-iro (University of Alberta, Canada) 2. Sartre3p (University of Auckland, New Zealand) 3. LittleRock (Rod Byrnes, Australia)
2012
Heads-up Limit Texas Hold'em
Total Bankroll	Bankroll Instant Run-off
1. Slumbot (Eric Jackson, USA) 2. Little Rock (Rod Byrnes, Australia) 2. Zbot (Ilkka Rajala, Finland)	1. Slumbot (Eric Jackson, USA) 2. Hyperborean (University of Alberta, Canada) 3. Zbot (Ilkka Rajala, Finland)
Heads Up No Limit Texas Hold'em
Total Bankroll	Bankroll Instant Run-off
1. Little Rock (Rod Byrnes, Australia) 2. Hyperborean (University of Alberta, Canada) 3. Tartanian5 (Carnegie Mellon University, USA)	1. Hyperborean (University of Alberta, Canada) 2. Tartanian5 (Carnegie Mellon University, USA) 3. Neo Poker Bot (Alexander Lee, Spain)
3-max Limit Texas Hold'em
Total Bankroll	Bankroll Instant Run-off
1. Hyperborean (University of Alberta, Canada) 2. Little Rock (Rod Byrnes, Australia) 3. Neo Poker Bot (Alexander Lee, Spain) 3. Sartre (University of Auckland, New Zealand)	1. Hyperborean (University of Alberta, Canada) 2. Little Rock (Rod Byrnes, Australia) 3. Neo Poker Bot (Alexander Lee, Spain) 3. Sartre (University of Auckland, New Zealand)
2013
Heads-up Limit Texas Hold'em
Total Bankroll	Bankroll Instant Run-off
1. Marv (Marv Anderson, UK) 2. Feste (François Pays, France) 2. Hyperborean (University of Alberta, Canada)	1. Neo Poker Bot (Alexander Lee, Spain) 2. Hyperborean (University of Alberta, Canada) 3. Zbot (Ilkka Rajala, Finland) 3. Marv (Marv Anderson, UK)
Heads Up No Limit Texas Hold'em
Total Bankroll	Bankroll Instant Run-off
1. Slumbot NL (Eric Jackson, USA) 2. Hyperborean (University of Alberta, Canada) 3. Tartanian6 (Carnegie Mellon University, USA)	1. Hyperborean (University of Alberta, Canada) 2. Slumbot NL (Eric Jackson, USA) 3. Tartanian6 (Carnegie Mellon University, USA) 3. Nyx (Charles University, Czech Republic)
3-max Limit Texas Hold'em
Total Bankroll	Bankroll Instant Run-off
1. Hyperborean (University of Alberta, Canada) 2. Little Rock (Rod Byrnes, Australia) 3. Neo Poker Bot (Alexander Lee, Spain)	1. Hyperborean (University of Alberta, Canada) 2. Little Rock (Rod Byrnes, Australia) 3. Neo Poker Bot (Alexander Lee, Spain)
2014
Heads-up Limit Texas Hold'em
Total Bankroll	Bankroll Instant Run-off
1. Escabeche (Marv Andersen, UK) 2. SmooCT (University College London, UK) 3. Hyperborean (University of Alberta, Canada) 3. Feste (Francois Pays, France)
Heads Up No Limit Texas Hold'em
Total Bankroll	Bankroll Instant Run-off
1. Tartanian7 (Carnegie Mellon University, USA) 2. Nyx (Charles University, Czech Republic) 2. Prelude (Unfold Poker, USA) 2. Slumbot (Eric Jackson, USA)	1. Tartanian7 (Carnegie Mellon University, USA) 2. Prelude (Unfold Poker, USA) 2. Hyperborean (University of Alberta, Canada) 2. Slumbot (Eric Jackson, USA)
3-max Limit Texas Hold'em
Total Bankroll	Bankroll Instant Run-off
1. Hyperborean (University of Alberta, Canada) 2. SmooCT (University College London, UK) 3. KEmpfer (Technische Universität Darmstadt, Germany)	1. Hyperborean (University of Alberta, Canada) 2. SmooCT (University College London, UK) 3. KEmpfer (Technische Universität Darmstadt, Germany)

Ver también

Polaris (poker bot)
Cepheus (poker bot)

Referencias

^ Brown, Noam; Sandholm, Tuomas (30 August 2019). "Superhuman AI for multiplayer poker". Science. 365 (6456): 885–890. doi:10.1126/science.aay2400.
^ Bowling, Michael; Burch, Neil; Johanson, Michael; Tammelin, Oskari (Jan 2015). "Heads-up limit hold'em poker is solved". Science. 347 (6218): 145–9. CiteSeerX 10.1.1.697.72. doi:10.1126/science.1259433. PMID 25574016.
^ Philip Ball (2015-01-08). "Game Theorists Crack Poker". Nature. Nature. doi:10.1038/nature.2015.16683. Retrieved 2015-01-13.
^ Robert Lee Hotz (2015-01-08). "Computer Conquers Texas Hold 'Em, Researchers Say". Wall Street Journal.
^ Bob McDonald (2015-01-10). "Poker Computer Takes the Pot [audio interview]". Quirks & Quarks (Podcast).
^ Joshua Brustein (31 January 2017). "Inside the 20-Year Quest to Build Computers That Play Poker". Bloomberg. Retrieved 2 February 2017.
^ "Iccm 2004".
^ https://www.era.lib.ed.ac.uk/bitstream/1842/2392/2/Carter%20RG%20thesis%2007.pdf
^ Marilyn Malara (April 25, 2015). "Brains vs. AI: Computer faces poker pros in no-limit Texas Hold'em". UPI. Retrieved April 26, 2015.
^ "Rivers Casino's Brains vs AI". Archived from the original on 2016-02-03. Retrieved 2015-10-23.
^ "Brains Vs. AI | Carnegie Mellon School of Computer Science". www.cs.cmu.edu. Retrieved 2016-02-10.
^ "Brains Vs. AI | Carnegie Mellon School of Computer Science". www.cs.cmu.edu. Retrieved 2016-02-10.
^ "Brains vs Artificial Intelligence". www.riverscasino.com. Archived from the original on 2016-02-03. Retrieved 2016-02-10.
^ http://poker.cs.ualberta.ca/news_2010.html
^ http://poker.cs.ualberta.ca/news.html
^ https://www.thebestpokersitesonline.com/poker/poker-bots/

enlaces externos

The Annual Computer Poker Competition
Programming Poker AI Article by the programmer of the AI for the World Series of Poker Game
Caroline Hsu. "Can "pokerbots" beat humans?". USnews.com. Archived from the original on 27 March 2009.
MSNBC Article - 2004-Sep
Science News: The Ultimate Poker Face
NYTimes.com: Poker Bots Invade Online Gambling
CMU deals a winning hand for Texas Hold 'em Article about Carnegie Mellon University poker AI research group

[1] Brown, Noam; Sandholm, Tuomas (30 August 2019). "Superhuman AI for multiplayer poker". Science. 365 (6456): 885–890. doi:10.1126/science.aay2400.

[2] Bowling, Michael; Burch, Neil; Johanson, Michael; Tammelin, Oskari (Jan 2015). "Heads-up limit hold'em poker is solved". Science. 347 (6218): 145–9. CiteSeerX 10.1.1.697.72. doi:10.1126/science.1259433. PMID 25574016.

[3] Philip Ball (2015-01-08). "Game Theorists Crack Poker". Nature. Nature. doi:10.1038/nature.2015.16683. Retrieved 2015-01-13.

[4] Robert Lee Hotz (2015-01-08). "Computer Conquers Texas Hold 'Em, Researchers Say". Wall Street Journal.

[5] Bob McDonald (2015-01-10). "Poker Computer Takes the Pot [audio interview]". Quirks & Quarks (Podcast).

[6] Joshua Brustein (31 January 2017). "Inside the 20-Year Quest to Build Computers That Play Poker". Bloomberg. Retrieved 2 February 2017.

[7] "Iccm 2004".

[8] ttps://www.era.lib.ed.ac.uk/bitstream/1842/2392/2/Carter%20RG%20thesis%2007.pdf

[9] Marilyn Malara (April 25, 2015). "Brains vs. AI: Computer faces poker pros in no-limit Texas Hold'em". UPI. Retrieved April 26, 2015.

[10] "Rivers Casino's Brains vs AI". Archived from the original on 2016-02-03. Retrieved 2015-10-23.

[11] "Brains Vs. AI | Carnegie Mellon School of Computer Science". www.cs.cmu.edu. Retrieved 2016-02-10.

[12] "Brains Vs. AI | Carnegie Mellon School of Computer Science". www.cs.cmu.edu. Retrieved 2016-02-10.

[13] "Brains vs Artificial Intelligence". www.riverscasino.com. Archived from the original on 2016-02-03. Retrieved 2016-02-10.

[14] ttp://poker.cs.ualberta.ca/news_2010.html

[15] ttp://poker.cs.ualberta.ca/news.html

[16] ttps://www.thebestpokersitesonline.com/poker/poker-bots/

[1]