The Cognition-aware AI Programme

PREFACE

Preface

The Programme asks how artificial intelligence can be engineered to engage with the structure of an individual mind — rather than around it.

For most of its short history, modern artificial intelligence has been built upon a single, productive simplification: that human language is a sufficient interface to human reasoning. Models are trained on text and judged against text. The result has been astonishing in capability and limited in a particular way. Systems achieve fluency without reaching the cognition behind it. They produce confident answers without registering whether the question itself is distorted, or whether the person asking it is in a state in which an answer would help.

The Cognition‑aware AI Programme is concerned with what this simplification leaves out. Human reasoning is patterned not only by information but by distortion — the recurrent forms of unhelpful thought catalogued for half a century in cognitive behavioural therapy; by trait — the stable dispositions that govern how a person prefers to receive guidance; by emotion — which calibrates the timing and depth of any useful intervention; and by the body through which all of these are sustained. A system that is unaware of these structures will, with some regularity, produce confident outputs that are unsafe, unhelpful, or simply unread.

The Programme operates at the meeting of Foundation Models and the cognitive contexts in which they are placed: coaching and mental‑health support, learning and education, and — increasingly — the customer‑facing AI surfaces where the same cognitive instruments determine whether an interaction de‑escalates or compounds. Where a deployment requires clinical or regulatory standing, the Programme operates only with the appropriate clinical advisory and ethics framework in place.

The Programme's methodological commitment is that these structures can be made computational without being reduced. Its outputs are peer‑reviewed publications, evaluation frameworks, and field‑validated systems, developed in collaboration with academic laboratories and embedded inside operating institutions.

The horizon toward which the work is oriented is what we have come to call Cognitive Sovereignty: the capacity of a person to keep their own reasoning intact as the systems they rely on grow more capable of persuasion. Human Intent over Algorithm is the working principle by which the Programme pursues that horizon — a position taken before any particular paper is written, and held to as the work develops. Its strict consequence: the Programme works on systems that support human reasoning, not on systems that persuade for an undisclosed end.

SOVEREIGNTY

Horizon

On Cognitive
Sovereignty.

Human Intent over Algorithm.

The Programme takes a position.

Algorithms have grown sufficiently capable that the question of the coming decade is no longer what they can do, but whose reasoning they preserve. The systems we are building are now powerful enough to think on our behalf; the discipline of cognition‑aware engineering is the practice of refusing to let them.

We hold that an artificial system serves its user only insofar as it leaves the user's capacity to think — to weigh, to doubt, to choose — intact, and ideally enlarged. A copilot that produces the answer is less than one that produces the conditions in which the answer can be arrived at. A system that resolves ambiguity for a person is less than one that lets the person see, name, and resolve it for themselves.

Human Intent over Algorithm is the working principle. Cognitive Sovereignty is the horizon. Every paper and every system within the Programme is to be measured against them.

METHOD

Method

The Programme advances along four interleaved fronts — each grounded in psychological theory, rendered in computational form, and held to outcomes defensible in the field as well as in peer review.

Theory.

The Programme takes its conceptual ground from cognitive behavioural therapy and the broader tradition of psychological science. Cognitive distortion theory provides a taxonomy of unhelpful thinking patterns — catastrophising, all‑or‑nothing thinking, overgeneralisation, mind reading — which we operationalise as detectable signals in conversational text. Acceptance & Commitment Therapy informs how systems respond when distortion is present: not with persuasion, but with reframing that restores the connection between a person's actions and their stated values. Trait psychology contributes the stable dispositional structure through which guidance must be tailored if it is to be received at all.

Engineering.

Theory enters the system as concrete model behaviour. Distortion detection is treated as a layered prediction task across span, utterance, and session granularities, allowing agents to localise the pattern, attribute its type, and respond with theoretically appropriate reframings. Multimodal channels — text, prosody, facial expression — are fused not as ends in themselves but as context that calibrates the timing and tone of intervention. Personalisation is implemented through trait‑aware adapters and persistent user representations, so that the same agent engages differently with different people while remaining consistent and explainable to auditors.

Measurement.

Cognition‑aware systems require evaluations that go beyond accuracy on static benchmarks. The Programme develops goal‑driven metrics — among them Cognitive Fidelity in emotion reasoning, Distortion Reduction Rate at the conversation level, and decision‑quality uplifts derived from expert rubrics. These instruments make it possible to test whether a given intervention has changed the structure of thinking, not merely the surface of language.

Field validation.

A theory of cognition‑aware AI that does not survive contact with operating institutions is, in our view, incomplete. The Programme's systems are deployed within real workflows in coaching, education, and mental‑health support, where their effect on adoption, auditability, and the felt quality of work is observable over time. The cycle — theory, engineering, measurement, deployment, refinement — is what permits the work to be both publishable as science and consequential in practice.

OUTPUTS

Outputs

Selected publications from the Programme, drawn from peer‑reviewed venues in natural language processing, affective computing, cognitive science, learning technologies, and knowledge discovery.

Accepted

ACL 2026

CMTD: Cognitive Modeling with Traits and Distortions for Multimodal Emotion Recognition in Conversations

Hajime Institute.

Accepted at ACL 2026.

ICALT 2026

MindReframe: A Psychologically Grounded Cognitive Reframing Framework for Mental Health Support

Hajime Institute.

Accepted at ICALT 2026.

EMNLP 2025

Metamo: Empowering Large Language Models with Psychological Distortion Detection for Cognition‑aware Coaching

Hajime Hotta, Huu‑Loi Le, Manh‑Cuong Phan, Minh‑Tien Nguyen.

Proceedings of EMNLP 2025, System Demonstrations.

ACII 2025 · MMAC

Multimodal Emotion Recognition in Conversations with Distortion Analysis

Huu‑Loi Le, Manh‑Cuong Phan, Hajime Hotta, Minh‑Tien Nguyen.

13th International Conference on Affective Computing and Intelligent Interaction, Workshops & Demos (MMAC).

PACLIC 2025

From Span Extraction to Classification: A Multi‑step Framework for Cognitive Distortion Analysis

Manh‑Cuong Phan, Huu‑Loi Le, Hajime Hotta, Minh‑Tien Nguyen.

Proceedings of the 39th Pacific Asia Conference on Language, Information and Computation.

PAKDD

Automatic Prompt Selection for Large Language Models

Viet‑Tung Do, Xuan‑Quang Nguyen, Viet‑Khoi Hoang, Duc‑Hieu Nguyen, Sina Sabahi, Jeff Yang, Hajime Hotta.

Pacific‑Asia Conference on Knowledge Discovery and Data Mining.

Future research

Current stream

Does AI understand the feeling, or just describe it well?

Building the measurement that tells the two apart — for AI that operates inside coaching, mental‑health, and affective interfaces.

Current stream

AI that can tell “I’m fine” from fine.

When the face, the voice, and the word disagree — the systems that learn to read what someone isn’t saying.

Current stream

AI mental-health support a therapist can defend.

When AI helps a person reframe a negative thought, against what standard is it allowed to do so? Building that standard with clinicians.

Current stream

Emotional intelligence in language models — finally measurable.

Building EI you can test — not EI you have to take on faith from a fluent reply.

Current stream

Whether AI therapy stays therapy, or drifts.

Measuring whether AI that calls itself a "coaching" or "support" tool delivers what its underlying therapy framework would — across deployments and cultures.

Current stream

When does a conversation actually close?

Some AI-human conversations resolve, and some loop endlessly. The structural difference between the two, made testable.

Current stream

When AI agents talk themselves into believing.

Recursive AI reasoning can drift into its own distortions. Detecting — and not deploying — the moment that begins.

Full publication list on Google Scholar →

SHAPE

How the Programme is brought to bear

A worked sketch — not a case study — of the shape an engagement takes when a customer‑facing AI deployment is failing in the way this Programme is built to address.

The situation

An enterprise has deployed a coaching‑style support agent. Six weeks in, CSat is down four points on the conversations the agent did not escalate. Internal review cannot reproduce the failure in QA.

Weeks 1–3 · Audit

The Programme decomposes which user states the agent is failing on — suppressed frustration that does not register as anger in the language signal, cognitive distortion patterns the agent reinforces under its own helpfulness reflex. A measurement specification is written for what counts as success that the deployed benchmark metrics cannot capture.

Weeks 4–10 · Severity‑test prototype

The Programme builds a small intervention — an affect‑calibrated escalation layer with refusal logic — and severity‑tests it against pre‑registered failure modes. The discipline is to fail visibly in a controlled test, not invisibly in production.

Month 3 onward · Deployment advisory

Implementation is carried by the partner’s engineering team or an operating partner. The Programme stays in as advisor on the persona contract, the audit interface, and the evidence trail that would be defensible to a regulator. The Programme does not staff a delivery team.

The outcome

Not a vendor relationship. The discipline by which a deployment that could not be defended becomes one that can.

Most Programme‑Ⅰ engagements run as Sponsored Research over 12–24 months. The same shape applies whether the surface in question is customer support, coaching, mental‑health‑adjacent care, or learning.

The other three inquiries

Ⅱ The Trustworthy Documents Programme Trustworthy reading. Ⅲ The Deliberative Agents Programme Tension as method. Ⅳ The Collective Cognition Programme Cognition lives between people.

← Return to the Four Inquiries

CORRESPOND

Correspondence

Inquiries are
welcome.

For research collaborations, joint publications, advisory engagements, and patronage —

A note via the form below — the Director responds personally.

The Cognition‑aware AI Programme.

On CognitiveSovereignty.

Theory.

Engineering.

Measurement.

Field validation.

CMTD: Cognitive Modeling with Traits and Distortions for Multimodal Emotion Recognition in Conversations

MindReframe: A Psychologically Grounded Cognitive Reframing Framework for Mental Health Support

Metamo: Empowering Large Language Models with Psychological Distortion Detection for Cognition‑aware Coaching

Multimodal Emotion Recognition in Conversations with Distortion Analysis

From Span Extraction to Classification: A Multi‑step Framework for Cognitive Distortion Analysis

Automatic Prompt Selection for Large Language Models

Does AI understand the feeling, or just describe it well?

AI that can tell “I’m fine” from fine.

AI mental-health support a therapist can defend.

Emotional intelligence in language models — finally measurable.

Whether AI therapy stays therapy, or drifts.

When does a conversation actually close?

When AI agents talk themselves into believing.

Inquiries arewelcome.

The Cognition‑aware
AI Programme.

On Cognitive
Sovereignty.

Inquiries are
welcome.