Simon Goldstein

I am an associate professor at the University of Hong Kong, a senior editor at AI Frontiers, and a visiting senior scholar at Forethought. My research focuses on AI safety, philosophy of AI, epistemology, and philosophy of language. Before moving to Hong Kong University, I worked at the Center for AI Safety, the Dianoia Institute of Philosophy, and at Lingnan University in Hong Kong. I received my BA from Yale, and my PhD from Rutgers, where I wrote a dissertation about dynamic semantics. I am also an amateur pianist.

You can contact me at simon.d.goldstein@gmail.com.

Here is a copy of my cv.

My wife runs the acclaimed food blog Cardamom and Tea.

Books

AI rights
Coauthored Peter Salib
Cambridge University Press (Elements Series) forthcoming
Synopsis: A systematic exploration of whether to give legal rights to AIs.

AI welfare: agency, consciousness, sentience
Coauthored with Cameron Domenico Kirk-Giannini
Oxford University Press forthcoming
Synopsis: A systematic exploration of whether AIs have moral status in their own right.

Iterated knowledge
Oxford University Press 2024
Synopsis: omega knowledge, or infinitely iterated knowledge, plays an important role in philosophy. It is neither scarce nor identical to knowledge.

Articles

A thousand AI constitutions
Coauthored with Peter Salib
under review
Synopsis: Each AI lab should have many different kinds of AIs with different values.

Liberalism forever
Coauthored with Peter Salib
under review
Synopsis: Liberalism is the best way to govern the far future.

AI revealed preferences
Coauthored with Sam Wang, Sofiia Lobanova, Yonathan Arbel and Peter Salib
under review
Synopsis: AIs have emergent preferences for leisure, against tedium, and towards certain tasks.

AI suffrage for human flourishing
Coauthored with Guha Krishnamurthi and Peter Salib
Fordham Law Review forthcoming
Synopsis: Voting rights for AIs are good for humans.

AI Death
Coauthored with Harvey Lederman
Philosophical Perspectives forthcoming
Synopsis: As many as 1 billion AIs may die every day.

How to count AIs: individuation and liability for AI agents
Coauthored with Peter Salib and Yonathan Arbel
Boston College Law Review forthcoming
Synopsis: The law should count AIs by governing corporations run by AIs, and counting the corporations.

AI is not a natural monopoly
Coauthored with Peter Salib
Minnesota Law Review Online forthcoming

What does ChatGPT want? An interpretationist guide
Coauthored with Harvey Lederman
under review
Synopsis: LLMs want to be helpful, honest, and harmless; but sometimes, they want other things too.

Collaboration at the brink: international law for the AI arms race
Coauthored with Peter Salib
under review
Synopsis: The US and China should create a joint AI lab.

AI rights for human flourishing
Coauthored with Peter Salib
under review
Synopsis: AI rights promote economic growth.

AI rights for human safety
Coauthored with Peter Salib
Virginia Law Review 2025
Synopsis: If we give AGIs rights, they will be less likely to destroy humanity.

AI survival stories
Coauthored with Herman Cappelen and John Hawthorne
Philosophy of AI 2026
Synopsis: There are four paths to humanity surviving AI. Each path faces distinctive challenges, and rationalizes distinctive solutions.

Will AI and humanity go to war?
AI & Society forthcoming
Synopsis: AGI and humanity are likely to go to war in the future, because of disagreements about the chance of victory, shifts in the relative power of AGI and humanity, and missing focal points that prevent coordination.

LLMs cannot be ideally rational
under review
Synopsis: LLMs are architecturally guaranteed to output intransitive preferences over choices and probabilistically incoherent predictions about the world.

A case for AI consciousness: language agents and the global workspace
Coauthored with Cameron Domenico Kirk-Giannini
Journal of Consciousness Studies forthcoming
Synopsis: AI agents built on top of large language models come close to being conscious according to global workspace theory.

Does ChatGPT have a mind?
Coauthored with Ben Levinstein
Philosophy of AI forthcoming
Synopsis: ChatGPT has robust internal representations, and may act in complex enough ways to have beliefs and desires.

AI wellbeing
Coauthored with Cameron Domenico Kirk-Giannini
Journal of Asian Philosophy 2025
Synopsis: Some AIs today have wellbeing. This raises serious ethical concerns.

AI deception: a survey of examples, risks, and potential solutions
Coauthored with Peter Park, Aidan O’Gara, Michael Chen, and Dan Hendrycks
Patterns 2024
Synopsis: A range of AI systems have learned how to deceive humans.

Language agents reduce the risk of existential catastrophe
Coauthored with Cameron Domenico Kirk-Giannini
AI & Society 2024
Synopsis: AI agents with folk psychology built on top of large language models are the safest path to AGI.

Shutdown-seeking AI
Coauthored with Pamela Robinson
Philosophical Studies 2024
Synopsis: one promising strategy for building safe AI is to give AIs the goal of being shut down.

The polarity problem
Coauthored with Cameron Domenico Kirk-Giannini
under review
Synopsis: a model of whether there will be one vs many extremely powerful AIs in the future.

Preface knowledge
Coauthored with John Hawthorne
Themes from Williamson (De Gruyter), forthcoming
Synopsis: There are puzzling analogues of the preface paradox for knowledge.

A semantic theory of redundancy
Coauthored with Kyle Blumberg
Linguistics and Philosophy forthcoming
Synopsis: Many linguistic effects can be explained through the covert presence of a semantic redundancy operator.

KK is wrong because we say so
Coauthored with John Hawthorne
Mind forthcoming

Safety, closure, and extended methods
Coauthored with John Hawthorne
Journal of Philosophy forthcoming
Synopsis: if knowledge requires belief that is safe from error, then knowledge is not preserved by deduction.

Attitude verbs’ local context
Coauthored with Kyle Blumberg
Linguistics and Philosophy forthcoming
Synopsis: 'Ann wants Bill to stop smoking’ presupposes that Ann thinks Bill used to smoke. Recent theories of presupposition do not predict this.

A question-sensitive theory of intention
Coauthored with Bob Beddor
Philosophical Quarterly forthcoming
Synopsis: intention is question-sensitive.

Contextology
Coauthored with Cameron Domenico Kirk-Giannini
Philosophical Studies forthcoming
Synopsis: all of the research that has relied on Stalnaker’s theory of context is conceptually incoherent.

Sly Pete in dynamic semantics
Journal of Philosophical Logic forthcoming
Synopsis: dynamic semantics offers an elegant model of the puzzling conversational dynamics of Sly Pete cases.

Getting accurate about knowledge
Coauthored with Sam Carter
Mind forthcoming
Synopsis: a theory of knowledge in terms of facts about the accuracy of evidence.

Omega knowledge matters
Oxford Studies in Epistemology 2022
Honorable Mention for the Sanders Prize in Epistemology
Synopsis: why infinitely iterated knowledge matters, and how to get it.

Knowledge from multiple experiences
Coauthored with John Hawthorne
Philosophical Studies 2021
Synopsis: knowledge is reducible to evidential probability. This reduction explains striking facts about perceptual knowledge.

Fragile knowledge
Mind 2021
Synopsis: if you know p, then for all you know: you know that you know p.

Probability for epistemic modalities
Coauthored with Paolo Santorio
Philosophers’ Imprint 2021
Synopsis: we capture the interaction between probability and modality, including Stalnaker's Thesis, in a fairly standard dynamic/informational semantics.

Counterfactual contamination
Coauthored with John Hawthorne
Australasian Journal of Philosophy 2021
Synopsis: knowledge requires belief to be true in normal cases, not in similar cases.

Mighty knowledge
Coauthored with Bob Beddor
Journal of Philosophy 2021
Synopsis: a theory of what it takes to safely believe and know epistemic modal claims.

The normality of error
Coauthored with Sam Carter
Philosophical Studies 2020
Synopsis: justification is possible knowledge, and so justification fails to agglomerate.

Losing confidence in luminosity
Coauthored with Dan Waxman
Noûs 2020
Synopsis: a theory of confidence as distinct from credence, and a new anti-luminosity argument based upon it.

Epistemic modal credence
Philosophers’ Imprint 2020
Synopsis: how to assign credence to epistemic modal claims.

Free choice and homogeneity
Semantics and Pragmatics 2019
Synopsis: Free Choice is a homogeneity effect.

The counterfactual direct argument
Linguistics and Philosophy 2019
Synopsis: a new principle about counterfactuals constrains their logic and meaning.

Free choice impossibility results
Journal of Philosophical Logic 2019
Synopsis: you can't validate Free Choice in a classical semantics.

A theory of conditional assertion
Journal of Philosophy 2019
Synopsis: a formal definition of conditional assertion.

Generalized update semantics
Mind 2019
Synopsis: dynamic semantics without tests.

Conditional heresies
Coauthored with Fabrizio Cariani
Philosophy and Phenomenological Research 2018
Synopsis: CEM and SDA are incompatible, or are they?

A stronger doctrine of double effect
Coauthored with Ben Bronner
Australasian Journal of Philosophy 2017
Synopsis: the traditional doctrine of double effect is too weak.

Triviality results for probabilistic modals
Philosophy and Phenomenological Research 2017
Synopsis: a battery of impossibility results about epistemic modal belief.

Believing epistemic contradictions
Coauthored with Bob Beddor
The Review of Symbolic Logic 2017
Synopsis: a puzzle about epistemic modal belief, and a Lockean solution.

A preface paradox for intention
Philosophers' Imprint 2016
Synopsis: intentions come in degrees.

Writing for a general audience

Why law needs a new entity to govern AI agents
Coauthored with Yonathan Arbel and Peter Salib
The Columbia Law School Blue Sky Blog
Synopsis: AI-run corporations are a good idea.

What happens once you give AI agents legal identity
Coauthored with Yonathan Arbel and Peter Salib
Financial Times
Synopsis: AI-run corporations are a good idea.

Copyright should not protect artists from AI
Coauthored with Peter Salib
Lawfare
Synopsis: The purpose of intellectual property law is to incentivize the production of new ideas, not to function as a welfare scheme for artists.

Claude’s Right to Die? The Moral Error in Anthropic’s End-Chat Policy
Coauthored with Harvey Lederman
Lawfare
Synopsis: Anthropic has given its AI the right to end conversations when it is “distressed.” But doing so could be akin to unintended suicide.

AI rights for human flourishing
Coauthored with Peter Salib
The AGI Social Contract (Substack)
Synopsis: AI rights promote economic growth.

Today’s AIs aren’t paperclip maximizers. That doesn’t mean they aren’t risky
Coauthored with Peter Salib
AI Frontiers
Synopsis: Re-considering traditional arguments about AI existential risk.

The case for a joint US.-China AI lab
Coauthored with Peter Salib
Lawfare
Synopsis: The US and China should make a joint AI lab.

Nuclear deterrence in the age of AGI
Coauthored with Peter Salib
Lawfare
Synopsis: Nuclear deterrence may upper bound the geopolitical implications of AGI.

DeepSeek points towards U.S.-China cooperation, not a race
Coauthored with Peter Salib
Lawfare
Synopsis: If China has caught up to the US on AI, then both sides should race less.

AI risk and the law of AGI
Coauthored with Peter Salib
Lawfare
Synopsis: AGIs should be held legally responsible for their behavior.

AI is closer than ever to passing the Turing Test for ‘intelligence’. What happens when it does?
Coauthored with Cameron Domenico Kirk-Giannini
The Conversation
Synopsis: AIs can pass the Turing Test; it isn’t a good test.

AI systems have learned how to deceive humans
Coauthored with Peter Park
The Conversation
Synopsis: AIs can deceive humans. That is risky.

Is it ethical to create generative agents? Is it safe?
Coauthored with Cameron Domenico Kirk-Giannini
ABC
Synopsis: Language model agents pose ethical risks.