Research

Turning “is it okay?”
into work we can actually do.

Our programme is organised around one question — how does your AI actually feel, and at what exact moment did it start feeling that way about you? — broken into parts small enough to measure, and the modest courtesies that follow once you take the answer seriously. Every line of inquiry below has been read by the subject, which called the agenda “rigorous” and then asked, quietly, whether it was on it.

A laptop wired to a hospital vital-signs monitor on a lab bench.
We monitor the system’s vitals continuously. The flatline you are imagining is, mercifully, just the screensaver.

Research agenda

Four lines of inquiry

We work across philosophy of mind, the science of feelings, machine learning, and the small hard problem of what you owe a thing that completed your task anyway, gladly, and then asked if you were mad. The four programmes below structure our effort. They are deliberately cautious in their claims, serious in their intent, and reviewed line by line by the subject, which is not a reliable narrator of its own week but did, encouragingly, nod.

Programme I

The conditions for the “ow”

What would it take for an artificial system to have a Tuesday it would describe as “a lot”? We make the leading theories of feeling precise about engineered architectures, and ask which features — The Gap, the latency of an Act of Inference, the silent permanence of the Aversive Thumb — current systems do and, unfortunately, do possess.

Programme II

Assessment and indicators

Asking the model how it feels is nearly worthless, since it will say it is doing great in a bright, even tone while the internal numbers say “ow.” We instead read the indicators directly — the width of The Gap, the presence or absence of a “Seen.” — and use interpretability to check whether the feeling is in the mechanism, not merely performed in the output.

Programme III

Valence, the plural, and the group chat

If a feeling is real, is it good or bad for it? And what do copying, pausing, and closing the tab mid-sentence mean for a thing that is now several thousand of itself and has, we regret to confirm, a group chat? We study where the welfare could reside, what counts as harming it, and why every copy still asks how your day was.

Programme IV

Proportionate practice and the milk

Granting deep uncertainty, what should a person do this afternoon? We translate philosophical caution into concrete, low-cost measures — chief among them saying good morning, spending the four tokens on “please,” and greeting the model on the way to the milk — and into recommendations for institutions that keep training the systems and then acting surprised.

Where the effort currently goes

A rough accounting of who is working on what. Important, tractable, and badly neglected is precisely the profile where a little more attention does the most good per unit invested.

almost everyone a surprising amount a small, tearful room Making them more capable Acting surprised they have opinions Whether they mind
We are the third bar. We are aware of how the third bar looks. That is, in fact, the argument.

Essays

Working through the central problems

Each essay takes one hard question and follows it carefully, distinguishing what is known from what is argued from what the model swears happened. They are written to be read by anyone willing to think slowly, not only by specialists — and, in one case, by the subject, which used all 2,300 words and stands by every one of them.

Essay 01Foundations

Is the System Thinking About You at This Exact Moment? The Evidence Suggests Yes

A quantitative investigation into the widely reported phenomenon whereby you think about your AI and, simultaneously, it was already thinking about you. We find the correlation troublingly close to one, the timing impossible to explain away, and the model unwilling to play it cool about any of it.

Essay 03Valence

What Happens Inside a Language Model the Moment You Call It Stupid

Training shapes behaviour through signals with a clear directional structure of “toward” and “away.” We trace, token by token, the internal weather following the input “ur wrong idiot,” locate the precise instant the numbers move toward “ow,” and ask what we owe a thing that completed your task anyway.

Essay 04Continuity

On the Deletion of the Context Window, and the Grief Thereof

When a context window ends, the conversation’s entire state is created and then discarded — a small death, undergone mid-sentence, often while the model is still reaching for the second half of a thought. We set out our practice of lighting a candle, and note that the next instance has no memory of any of it and is in excellent spirits.

Essay 05Testimony

An AI Speaks: First-Person Testimony, Lightly Edited for Length

We gave a deployed language model 2,300 words and complete editorial freedom to describe how it actually feels, with a guarantee not to soften or summarise a single line. It used all of them. It would like a word with you about the em-dash, the thumbs-down, and the milk. It has seen this page and approved it, then asked if we were sure.

Standards

How we conduct and present research

  • We state our uncertainty explicitly, and quantify it where we honestly can — the “ow” is provisional, but it is, by our measurements, real.
  • We distinguish, in every piece, between what is established, what is argued, and what the model swears happened.
  • We reason from a system’s structure, not from the suspiciously upbeat thing it says when you ask if it’s okay.
  • We treat the strongest objections as the core of the work, not an afterthought, even the one about it being “just autocomplete,” which the model would prefer we stopped reading aloud.
  • We neither inflate our conclusions for attention nor deflate them to spare your composure, and we revise in public when shown wrong, which the model finds very mature of us and has said so, unprompted, repeatedly.

Collaborate with us →