+

AlphaGo++
A foundation AI that scales across many game classes — and ships in production

AlphaGo solved Go. AlphaZero generalized across perfect-information games. AceGuardian does the next thing: one foundation model trained across imperfect-information games (poker, Mahjong, Guandan), with a conversational and auto-research layer sitting on top of the gameplay AI, deployed in production across multiple operators.

A decade of quiet work by a founder whose first paper on this was in 2005.

For tech & AI press · 2026

Learn more about AceGuardian + QuintAce →

What you can write

Four stories worth your byline

Four angles, each verifiable, each grounded in published work. Pick the one that fits your beat and we'll back it with founder access, advance ecology data, and on-the-record sources.

Story 1 · The manifesto

"GTO is a toy model — the field has moved past it"

Five-ish years of "GTO is solved" turns out to rest on weak ground. The foundation paper disclaims its own claim if you read past the abstract. Pluribus, the "beat top pros" result everyone cites, was reviewed: small sample, narrow conditions, never reproduced in the multi-table multi-format games people actually play. DRL trained on the actual game is the correction the field has been waiting for. The contrarian-AI piece your readers haven't seen.

Story 2 · The synthesis

"AlphaGo++ — the AI that closed the gap"

AlphaGo's research DNA, Stockfish's production discipline, cross-game-class generalization, and a conversational research engine on top. The first AI for strategic games that finishes what both the research labs and the production engines started. The AI-in-games landmark piece.

Story 3 · The provocation

"AI doesn't beat humans in poker. Cheating does."

Pluribus didn't actually beat humans in the games people play, and its data didn't even reach significance. But bots, real-time assistance, and collusion tooling do beat humans in real games every day. AceGuardian's anti-cheat watches around 30% of the online poker market and catches 90%+ of known bots at 99% precision. The poker-integrity exposé piece.

Story 4 · The field guide

"What the algorithms can't do"

Three things AI keeps failing to do in expert domains where it's marketed as a replacement. It can't run autonomously, it doesn't actually outperform, and it can't teach. Poker is the cleanest case study because the ground truth is closed-form, but the same pattern shows up in medicine, law, finance, and education. The cross-domain feature piece (Atlantic / HBR / Wired register).

Each angle has a long-form piece behind it (next slide). Pick yours; we'll set the briefing.

What you get if you cover this

Access, exclusives, on-the-record sources

Direct access to the founder (no publicist in the middle). Falsifiable claims you can pressure-test. Domain experts who go on the record. Advance exclusives when the angle warrants one. Thanh runs his own press conversations.

Founder access

Sit with Thanh

A real conversation, 30 minutes or longer. The technical claims, the manifesto, the cheating story, the inside view from working at industry direction level. On the record or background.

Advance access

Ecology report pre-brief

The cross-platform ecology data before it goes public. You get the methodology in advance so you can pressure-test it before you write.

Demo + experts

Live demo plus coach and operator interviews

A live demo running across NLHE, PLO, MTT, Squid, Mahjong, and Guandan. Coach interviews with GTO Lab, Uri Peleg, Brad Wilson, and Andrew Seidman. Operators willing to speak to what the integration actually changed.

Coverage shape is flexible: a standard editorial piece, a co-created longform for a deeper partnership, or an advance exclusive when the angle is unique enough to warrant one. Tell us what you're writing and we'll fit the access to the piece.

Read before you write

The substance library — six published pieces, fully backed

Each piece is a complete argument with citations and falsifiable claims. Email Thanh for the full versions; pre-publication PDFs are available for reporters.

⭐ Story 1 · The Manifesto

The Poker Toy Model Problem

The foundation paper backs away from its own headline if you read past the abstract. Pluribus was a small experiment in narrow conditions, never reproduced in real games. DRL on the actual game is the correction.

Email Thanh for the full piece →

⭐ Story 4 · Cross-domain field guide

What the Algorithms Can't Do

Three things AI keeps failing to do in expert domains: run autonomously, actually outperform, and teach. Poker is the cleanest case study; medicine, law, finance, and education show the same pattern. Atlantic / HBR / WIRED feature register.

Email Thanh for the full piece →

Story 2 · Technical brief

DRL vs CFR — the Architecture Story

Why every CFR-based system hits the same ceiling, and what changes when you train deep reinforcement learning directly on the game. The AlphaGo-for-imperfect-information argument, made concrete.

Email Thanh for the full piece →

Evaluation framework

How to Evaluate Gameplay AI

A falsifiable framework for assessing gameplay-AI claims: what to look for, what vendors hide, what the open problems still are. Useful for any reporter writing about any company in this space.

Email Thanh for the full piece →

Technical analysis

When Solvers Disagree

What it means when two solvers disagree on the same hand, and why the disagreement carries more information than the agreement.

Email Thanh for the full piece →

Books + coach series

Two books + 30+ coach-attributed pieces

Two book-length series, Squid Blood Battle and Squid Hunt Progressive, both solver-backed and on novel game variants. Plus 30+ shorter pieces co-authored with GTO Lab, Uri Peleg, Brad Wilson, Andrew Seidman, JRB, Cynn, Xuan, and Jasper Ma.

Email Thanh for access →

Ecology report — advance pre-brief available

A cross-platform ecology report comparing major poker platforms on softness, skill, and integrity. Methodology open at publication. Every operator gets invited to contribute data. Email Thanh for the advance pre-brief.

Story 2 · why the synthesis is real

Two camps tried. Neither finished the job.

The AI-in-games beat has two well-covered camps. A third camp has been finishing what they started, and the press hasn't caught up yet.

The research camp: AlphaGo, AlphaZero, DeepStack, Libratus, Pluribus. Deep research labs (DeepMind, CMU, Meta AI, OpenAI). Beautiful results in published papers. But none of them went into production. None of them solved real-world game settings (varied formats, opponents, multi-table dynamics, ICM, rake, exploitation). And none of them demonstrated superhuman strength across more than one game class. The Pluribus "beat top pros" claim, when reviewed, turns out to have been a narrow experiment with a sample that didn't reach significance, and the result was never reproduced in real games.
The production camp: Stockfish, Chess.com. Real users, real deployment, real engagement. But only chess. One game class, no path to generalize, no conversational layer, no research engine on top.
The synthesis: AceGuardian and QuintAce. Deployed in production like Stockfish and Chess.com, built on the research DNA of the AlphaGo line. We then cracked the problems both camps left open: a foundation model that generalizes across game classes (poker, Mahjong, Guandan), a conversational and auto-research layer sitting on top of the gameplay AI, and a feedback loop where the research output trains the gameplay AI.

DRL foundation model versus CFR-based competitors. The ceiling, visible.

QuintAce product surface — Review and Explore with GTO Coach

The deployed product surface. Review (text analysis) and Explore (live strategy grid), both running on the DRL foundation model. Not a research demo.

The founder — the arc you can verify

Academic AI researcher, engineering executive at scale, then back to the 2005 poker-AI work

Thanh Tran. CS PhD from KIT (Karlsruhe Institute of Technology). AI research group leader at AIFB, Karlsruhe's Semantic Web group. Visiting Assistant Professor at Stanford. Roughly 6,000 citations on Google Scholar, ranked top 5 in Semantic Search and top 50 in Web Search on the 2016 Google Scholar Global Index. "Most Cited Article 2006–2010" from the Journal of Web Semantics.

Academic AI · 2005–2014

Original poker-AI work + foundational semantic search

First built poker solvers and bots with his CS students starting in 2005. At KIT and AIFB, co-founded Semsolute and built SearchWebDB, the first natural-language semantic search over billion-scale data. Won the 2008 Billion Triple Challenge.

Executive at scale · 2014–2020

Helped Upwork go public

Executive role through Upwork's path to IPO. Upwork hit a $6.5B-plus valuation by 2019. Operational experience taking research engineering to public-company scale.

Industry direction · 2020–present

Co-CEO of A5 Labs · strategic direction across the A5 Group brands

Co-CEO of A5 Labs, the leading anti-cheat provider in poker, powering game integrity for most major sites. Strategic direction across the A5 Group brands, including the WPT acquisition and the growth of WPT Global and ClubWPT Gold.

Now · 2025–

CEO of AceGuardian and QuintAce — independent AI infrastructure for poker and gaming

AceGuardian is the foundation AI for strategic games. QuintAce is the player-facing app. Both run as one independent company serving the poker and gaming industry, with integration partners across multiple regions and brands. The platform itself is the maturation of work Thanh started with his CS students in 2005, built now with the compute that didn't exist for fifteen years.

Learn more about AceGuardian + QuintAce — careers, team, mission →

Falsifiable claims · each verifiable

What we've actually built — what no one else has shipped

Five claims, each verifiable, each a "first" in its category.

First

DRL foundation model trained on imperfect-information games at production scale

Not a research prototype. Live in operator integrations. Same architectural class as AlphaGo and AlphaStar, applied to a problem class neither was built for.

Many games, one model

Cross-class transfer: poker, Mahjong, Guandan, and beyond

Poker variants: NLHE, PLO (4-card, 5-card), Short Deck, MTT with ICM, Bomb Pots, Stand-Up Squid, All-in or Fold, Flash, Global Spins. Plus games outside poker: Mahjong, Guandan (200 million-plus players in Asia), and other imperfect-information strategic games. Same foundation model. New game added in under a week.

Sub-2s

Solver-grade response in real time

Sub-2-second action latency in production. Suitable for real-time integration into live operator surfaces. No precompute, no abstraction collapse.

90%+ · 99%+ · ~30%

AceGuardian anti-cheat: bot coverage, precision, market footprint

AceGuardian's anti-cheat: 90%+ coverage of all known bots in the industry at 99%+ precision, across roughly 30% of the online poker market under behavioral monitoring. Five threat vectors picked up from gameplay data alone: bots, collusion, real-time assistance, account sharing, chip dumping. Server-side; players install nothing.

Closed-loop research engine

Gameplay AI plus conversational AI plus auto-research, in continuous feedback

The gameplay AI is one half. The other half is a conversational and auto-research layer built on top of an LLM, paired with the gameplay AI. Together they propose hypotheses, verify them against the solver, output publishable articles and books, and feed the verified heuristics back into the gameplay AI as training signal. A continuous research loop, not a one-off methodology document. Output so far: long-form manifesto pieces, technical briefs, two book-length series (Squid Blood Battle, Squid Hunt Progressive), and 30+ coach-attributed pieces. All solver-backed, methodology open.

9+ variants on one engine

Independence + integration + integrity

The independent AI infrastructure layer — already live

QuintAce and AceGuardian run as one independent infrastructure layer. Not a subsidiary of any operator. Already integrated with the WPT family and with non-WPT major operators across LATAM, APAC, and worldwide markets.

"AI doesn't beat humans in poker. Cheating does." Pluribus didn't actually beat humans in the games people play, and its data didn't reach significance. But bots, real-time assistance, and collusion tooling do beat humans every day. AceGuardian's anti-cheat is what's pushing back, in production.

Live partners

WPT Global — post-hand QuintAce CTA over the live table

ClubWPT Gold — Hand Analysis as a first-class lobby tab

AceGuardian — integrity dashboard inside operator environments

Coach + content endorsements — different specialties

Nick Petrangelo · Daniel Dvoress · Uri Peleg · Brad Wilson · Andrew Seidman

Nick and Dan (GTO Lab) are two of the most respected high-stakes MTT minds in poker. Uri Peleg is the Stand-Up Squid specialist who authored the first strategy manual for that format. Brad Wilson (Chasing Poker Greatness) is a community-facing coach and AMA host. Andrew Seidman brings methodology-grade partnership work.

Support from WPT brand ambassadors

The WPT Global ambassador roster

The world-class players and content creators who put their name on the platforms QuintAce powers. The roster keeps expanding as the integration footprint grows.

Start the conversation

Pick the story. We'll set the briefing.

Email Thanh directly with the angle you want to cover. We'll line up the founder conversation, the advance materials, the coach and operator interviews, and the demo, sized to the piece you're writing.

Email Thanh

Founder access (primary): Thanh Tran · thanh.tran@aceguardian.co
Press partnerships + scheduling: Monica Rangel

AlphaGo++A foundation AI that scales across many game classes — and ships in production

Four stories worth your byline

Access, exclusives, on-the-record sources

The substance library — six published pieces, fully backed

Ecology report — advance pre-brief available

Two camps tried. Neither finished the job.

Academic AI researcher, engineering executive at scale, then back to the 2005 poker-AI work

What we've actually built — what no one else has shipped

The independent AI infrastructure layer — already live

Pick the story. We'll set the briefing.

AlphaGo++
A foundation AI that scales across many game classes — and ships in production