News
From Walking to Thinking: Feedback, Memory, and Causal Reasoning for Embodied AGI, Symposium on Humanoid Robotics & Sovereign AI for Future Living — Keynote speaker alongside robotics pioneers Oussama Khatib (Director, Stanford Robotics Center) and Hiroshi Ishiguro (Director, Intelligent Robotics Lab, Osaka University), February 2026
Multi-LLM Agent Collaborative Intelligence: The Path to AGI — First edition published by SocraSynth, March 2024; acquired and published by ACM Books, December 2025
Two Paradigm Bridges to AGI — Presented to Stanford PhD students, November 2025
Pioneering Data-Centric AI (2007–2012)
Between NeurIPS 2007 and 2012, while serving as Director of Google Research (Beijing), our team built the scalable infrastructure and large-scale datasets that would become foundational to modern data-centric AI — years before the term was coined. We produced one of the first web-scale annotated image datasets (30,000+ real web images with multimodal signals), sponsored Fei-Fei Li's ImageNet project at Google, and published a series of parallel machine learning algorithms on MapReduce that enabled training at unprecedented scale. This body of work was consolidated in the Springer book Foundations of Large-Scale Multimedia Information Management and Retrieval (2011), whose Chapter 2 explicitly formulated a data-driven + model-based hybrid architecture (DMD), asking "Can more data help a model?" — a decade before "data-centric AI" became a recognized paradigm.
Research
My research focuses on building the theoretical and practical foundations for safe, reliable AGI systems.
Developing System-2 on LLMs for AGI
Enabling multiple LLM agents to collaborate through structured debate, perspective synthesis, and consensus-building. Includes SocraSynth, CRIT, EVINCE, SagaLLM, and the UCCT theoretical foundation.
SagaLLM: Transaction Guarantees for Multi-Agent Planning
Bringing database-style transactional guarantees (atomicity, consistency, recovery via compensating actions) to multi-agent LLM planning. Focuses on robust context management, validation, and failure-safe orchestration.
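The compensating-action mechanism behind this line of work can be illustrated with a minimal saga-pattern sketch: each step of a plan pairs an action with a compensation that undoes it, and a failure rolls back completed steps in reverse order. The `SagaStep`/`Saga` names and the booking example below are illustrative assumptions, not SagaLLM's actual API.

```python
from dataclasses import dataclass, field
from typing import Callable, List

@dataclass
class SagaStep:
    """One step of a plan: an action plus the compensation that undoes it."""
    name: str
    action: Callable[[], None]
    compensate: Callable[[], None]

@dataclass
class Saga:
    steps: List[SagaStep]
    completed: List[SagaStep] = field(default_factory=list)

    def run(self) -> bool:
        """Execute steps in order; on any failure, run the compensations
        of the already-completed steps in reverse order and report failure."""
        for step in self.steps:
            try:
                step.action()
                self.completed.append(step)
            except Exception:
                for done in reversed(self.completed):
                    done.compensate()  # undo in reverse order
                return False
        return True
```

In a travel-planning saga, for example, a failed hotel booking would trigger the compensation (cancellation) of an already-booked flight, leaving no partially committed plan behind.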
AI Safety & Alignment
Checks-and-balances frameworks for ethical AI, including RAudit for real-time verification and multi-branch governance architectures.
UCCT: Unified Cognitive Consciousness Theory
A theoretical foundation for how language models convert pretrained capacity into goal-directed behavior through semantic anchoring and threshold effects. Formalizes anchoring strength and connects in-context learning, retrieval, and fine-tuning under a unified mechanism.
UAudit: Enhancing Reasoning Capability of LLMs
Auditing and strengthening LLM reasoning through blind verification protocols, structured probes, and consistency checks — enabling third-party evaluation of black-box model reasoning.
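One ingredient of such black-box auditing, a consistency check over paraphrased probes, can be sketched as follows: query the model with several rephrasings of the same question and flag disagreement among the answers. The `ask`/`audit` names and the 0.8 agreement threshold are illustrative assumptions, not the UAudit protocol itself.

```python
from collections import Counter

def consistency_score(answers):
    """Fraction of responses that agree with the most common answer."""
    if not answers:
        return 0.0
    counts = Counter(answers)
    return counts.most_common(1)[0][1] / len(answers)

def audit(ask, probes, threshold=0.8):
    """Blind audit: query the black-box callable `ask` with paraphrased
    probes and flag inconsistency without seeing any reasoning trace."""
    answers = [ask(p) for p in probes]
    score = consistency_score(answers)
    return {"score": score, "consistent": score >= threshold}
```

Because only input/output behavior is inspected, a check like this can be run by a third party with no access to model weights or chain-of-thought.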
Transactional Swarm Orchestration (TSO)
Enabling robots to discover causal relationships through physical intervention, with transactional guarantees and epistemic regret minimization.
Recent Publications
View full publication list on Google Scholar →
Working Papers & Preprints
- arXiv 2026 — Right for the Wrong Reasons: Epistemic Regret Minimization for Causal Rung Collapse in LLMs. TL;DR: Identifies the causal origin of "right for the wrong reasons" answers (rung collapse and aleatoric entrenchment) and shows that epistemic-regret-minimization belief revision, grounded in a three-layer theory, improves LLM recovery.
- arXiv 2026 — CausalT5K: An Extensive Benchmark for Conducting Causal Reasoning Research. TL;DR: A large-scale benchmark (5,000+ samples) for evaluating causal reasoning in LLMs, covering intervention queries, counterfactual reasoning, and causal graph discovery across multiple domains.
- arXiv 2026 — RAudit: A Blind Auditing Protocol for Large Language Model Reasoning. TL;DR: A protocol for verifying LLM reasoning correctness without access to the reasoning trace, enabling third-party auditing of black-box models through structured probes and consistency checks.
- arXiv 2025 — UCCT: The Unified Cognitive Consciousness Theory for Language Models: Anchoring Semantics, Thresholds of Activation, and Emergent Reasoning. TL;DR: A unified theory explaining how LLMs turn pretrained capacity into goal-directed behavior via semantic anchoring. Formalizes anchoring strength S = ρd − dr − log k, predicting threshold-like performance flips and generalizing ICL, retrieval, and fine-tuning as anchoring variants.
- arXiv 2024 — EVINCE: Optimizing Adversarial LLM Dialogues via Conditional Statistics and Information Theory. TL;DR: Uses information-theoretic metrics to optimize multi-agent debates, measuring when additional dialogue rounds yield diminishing returns.
- KDD 2026 — REALM-Bench: A Real-World Planning Benchmark for LLMs and Multi-Agent Systems. TL;DR: Benchmark featuring real-world planning tasks (travel, scheduling, logistics) that exposes the gap between LLM reasoning capabilities and practical deployment.
- VLDB 2025 — SagaLLM: Context Management, Validation, and Transaction Guarantees for Multi-Agent LLM Planning. TL;DR: Brings database-style ACID guarantees to multi-agent LLM systems — ensuring plans are atomic, consistent, and recoverable through compensating transactions.
- ICML 2025 — A Checks-and-Balances Framework for Ethical AI Alignment. TL;DR: A three-branch governance architecture (Executive, Legislative, Judicial) for AI systems that prevents any single component from taking unilateral harmful actions.
- NeurIPS AI Safety 2024 — A Three-Branch Checks-and-Balances Framework for Context-Aware Ethical Alignment of Large Language Models. TL;DR: Early version of the checks-and-balances framework, demonstrating how separation of powers prevents single points of failure in AI alignment.
- IEEE MIPR 2024 — Behavioral Emotion Analysis Model for Large Language Models. TL;DR: A framework for analyzing and modeling emotional behaviors in LLM responses, enabling more nuanced human-AI interaction.
- IEEE CCWC 2023 (100+ citations) — Prompting Large Language Models With the Socratic Method. TL;DR: Introduces SocraSynth — using Socratic questioning to elicit deeper reasoning from LLMs through structured multi-turn dialogue and adversarial probing.
- IEEE CSCI 2023 (100+ citations) — Examining GPT-4's Capabilities and Enhancement with SocraSynth (CRIT). TL;DR: Systematic evaluation of GPT-4's reasoning capabilities and introduction of CRIT — a critique-based method that improves accuracy through iterative refinement and self-correction.
Books
Foundations of Large-Scale Multimedia Information Management and Retrieval
The Journey of Mind
Teaching (Stanford)
-
Spring 2026
CS486 — Advanced Large Language Models Research Seminar
-
Winter 2026
CS372 — Artificial General Intelligence for Reasoning, Planning, and Decision Making
-
Spring 2025
CS372 — Artificial Intelligence for Reasoning, Planning, and Decision Making
-
2023–2024
CS372 — Artificial Intelligence for Precision Medicine and Psychiatric Disorders
-
2019–2022
CS372 — Artificial Intelligence for Disease Diagnosis and Information Recommendations
Background
Education
- Ph.D., Electrical Engineering, Stanford University
- M.S., Computer Science, Stanford University
- M.S., IEOR, University of California, Berkeley
Industry Experience
- Director of Research, Google, 2006–2012
- President, HTC Healthcare, 2012–2021
Selected Honors
- XPRIZE Tricorder, $1M Award for AI Medical Diagnosis, 2017
- ACM Fellow and IEEE Fellow, for contributions in scalable machine learning and healthcare
Previous Academic
- Professor (tenured), UC Santa Barbara, 1999–2006