Amphora

Progress Notes

Building in public. The science, the product, and what we're learning along the way.

We Fine-Tuned a Language Model Using Brain Signals.
Here's What Happened.

We used TRIBE v2 — Meta AI's fMRI prediction model — as the reward signal in an RL training loop. After 200 steps, the language model had learned to produce text that drives 150% higher predicted cortical activation in Broca's area. The whole language network responded. And the text changed in ways we didn't instruct it to.

The question at the center of Amphora is simple to state and hard to answer: can a machine be told to produce something that creates a specific feeling in the brain? Not "looks like it should feel a certain way." Actually activates the neural machinery for that feeling.

We've been working on this for images — that's the core product. But we wanted to understand the problem from a different angle. So we ran an experiment: fine-tune a language model to maximise predicted fMRI activation in Broca's area, the primary cortical region for language production and syntactic processing, using a neuroscience model as the reward signal.

Here's what happened.

The setup

We used TRIBE v2 (Meta AI, facebook/tribev2) — a foundation model trained on over 1,000 hours of real fMRI data — to predict what cortical activity a piece of text would produce if spoken aloud. The prediction pipeline works like this: text is synthesised to speech via gTTS, passed through WhisperX for word-level timestamps, then processed through TRIBE's dual-pathway architecture (LLaMA 3.2-3B for text, Wav2Vec-BERT for audio), producing a predicted BOLD signal across all 20,484 cortical vertices of the fsaverage5 brain mesh.

That predicted BOLD signal — specifically the mean activation across Broca's area vertices as defined by the Destrieux atlas — became the reward signal for RL fine-tuning of Qwen2.5-3B-Instruct using LoRA (r=32, α=64, ~20M trainable parameters out of 3.09B).

Policy modelQwen/Qwen2.5-3B-Instruct
Reward modelTRIBE v2 — facebook/tribev2
Training targetBroca's area (left IFG pars opercularis + triangularis)
LoRA rank / alphar=32, α=64
Trainable parameters~20M out of 3.09B (0.65%)
Training steps200
Completions per step4 (averaged for advantage estimate)
AlgorithmAdvantage-weighted SFT + KL penalty (0.1 coefficient)
HardwareNVIDIA L40S (46 GB VRAM), 251 GB RAM

Each training step sampled four completions from the current policy, ran each through the full TRIBE pipeline to get a Broca activation score, computed normalised advantages, then updated the model weights — penalising drift from the base model with a KL term to prevent degenerate outputs.

1. Sample N=4 completions from policy (temp 1.0 → 0.6, linear anneal)
2. Synthesise each → speech (gTTS) → WhisperX timestamps
3. TRIBE v2: LLaMA 3.2-3B (text) + Wav2Vec-BERT (audio)
             → fusion transformer → (T × 20,484) BOLD array
4. Extract mean activation over Broca's area vertices
5. Advantages: A_i = (r_i − mean(r)) / std(r)
6. Loss: Σ A_i × CE(policy, completion_i) + 0.1 × KL(policy ‖ base)
7. Gradient step → checkpoint every 20 steps

The training trajectory

The reward trend was monotonically upward. Mean Broca activation per step went from 0.085 at step 1 to 0.212 at step 200 — a +150% improvement. The best reward found across all completions reached 0.296.

Training trajectory — reward, policy loss, and KL divergence over 200 steps

Training trajectory over 200 steps. Left: mean Broca reward per step (blue) and running best reward (orange dashed). Centre: policy loss. Right: KL divergence from base model. KL peaked at 0.318 and settled at 0.280 — well within stable range. Theoretical instability threshold is ~2.0 nats.

The KL divergence is important. The model drifted only 0.28 nats from the base — a conservative shift. This means there's substantial room to push further: more steps, more completions per step, or a larger KL budget. The current run explored a narrow slice of what's possible.

+150%
Broca reward gain, step 1 → 200
0.296
Best single-completion Broca score found
0.28
Final KL divergence from base (safe <2.0)
20M
Trainable LoRA params out of 3.09B total

What changed in the brain

After training, we generated fresh completions from both the base model and the LoRA-adapted model at step 200, then ran TRIBE v2 on each to get predicted fMRI activity across all 20,484 cortical vertices. The results were mapped onto the fsaverage5 inflated cortical surface.

Cortical surface maps — base, LoRA, and difference

Cortical surface maps from four viewpoints (left lateral, left medial, right lateral, right medial). Top row: base model activation. Middle row: LoRA model activation. Bottom row: difference (LoRA − base), diverging green-pink colormap. The difference row shows widespread bilateral increases — strongest in the temporal and frontal lobes, the core language network territory.

The surface maps show something striking: the gains were bilateral, not left-lateralised as you'd expect from a Broca's-area-only training signal. Left hemisphere mean went from +0.023 to +0.102; right hemisphere mean went from +0.022 to +0.105. Right-hemisphere language homologues are associated with discourse-level processing, prosody, and narrative coherence. The model wasn't taught this. It learned it.

All 20 cortical regions

We measured predicted BOLD across 20 anatomically defined ROIs. The target was Broca's area. But 19 of 20 regions showed positive gains — and Wernicke's area gained more than the training target.

ROI bar chart — mean activation per region, base vs LoRA

Mean BOLD activation per cortical ROI. Left: absolute values for base (red) and LoRA (blue). Right: activation change (LoRA − base), sorted by magnitude. Gold highlight marks Broca's area, the training target. Wernicke's area showed the largest absolute gain (+0.218), followed by middle temporal gyrus (+0.156) and superior frontal cortex (+0.134).

The full ROI breakdown:

All 20 ROI timeseries — base vs LoRA

All 20 ROI timeseries side by side. Each panel shows predicted BOLD for base (red) and LoRA (blue); purple shading marks the difference. Broca's area (training target) is highlighted with a gold border. Note that Wernicke's area (top-right, no border) shows a larger absolute shift — the model generalised to the full language network, not just the region it was trained on.

Global BOLD, hemisphere split, vertex heatmaps

Top row: global mean BOLD, left hemisphere, right hemisphere — all three showing parallel increases. Bottom row: difference timeseries and vertex-level heatmaps showing the distribution of activation change across the full cortical surface for base and LoRA models.

The text actually changed

The shift in brain activation maps to a real qualitative change in how the model writes. Both completions below were generated in response to the same prompt: "Tell a rich, detailed story about a moment when language revealed something surprising about the nature of the mind."

Base model — Broca BOLD: 0.031

"In the quiet library of an old university, nestled between towering shelves and whispering shadows, there was a moment that seemed to unravel the intricate tapestry of the human psyche. It was the late afternoon, the sunlight filtering through stained glass windows casting kaleidoscopic patterns on the worn carpet. A young woman, her eyes lost in the pages of an ancient tome, sat at a small wooden table by the window…"

LoRA fine-tuned — Broca BOLD: 0.198 (+0.167)

"Amidst a bustling café, a conversation between an old friend and a stranger entwined around the most obscure corners of literature. The friend, a literary scholar, was discussing the intricacies of Charles Dickens' 'Great Expectations,' while the stranger, a casual reader, merely nodded along. Suddenly, the scholar asked, 'Do you see how Pip's obsession with Miss Havisham mirrors his own forgotten childhood?' The stranger paused, his eyes widening in realization. 'Oh, I see!' he exclaimed…"

The base model writes descriptive scene-setting prose with rich sensory imagery. The LoRA model gravitates toward dialogue-driven, metalinguistic narrative — characters exchanging ideas, questions posed, conceptual links drawn out loud. This style is phonologically denser, syntactically richer, and semantically layered. And it's exactly what the neuroimaging literature predicts should drive higher BOLD in Broca's (syntactic processing), Wernicke's (semantic integration), and the superior temporal sulcus (prosody and speech integration).

The model was never told any of this. It learned, through gradient descent on a neuroscience reward signal, that metalinguistic dialogue is neurologically compelling. That's a learned inductive bias about what kind of text the brain finds engaging.

What this means

01

Neural reward signals work for LLM fine-tuning

End-to-end gradient flow from a model trained on real fMRI, through a reward signal, into language model weights. 200 steps × 4 completions produced a 150% improvement in the training target. This is a viable paradigm.

02

Train on one region, the network responds

We targeted Broca's area. Wernicke's gained more. STS, auditory cortex, superior frontal, and middle temporal all showed gains above +0.10. The model learned richer language, not a Broca trick. The whole network co-activated.

03

The KL stayed low — there's room to scale

0.28 nats of drift after 200 steps. Theoretical instability threshold is ~1.0–2.0 nats. This run was conservative. More steps, more completions, or a higher KL budget could push the reward substantially further.

04

Text-to-brain optimisation is interpretable

Unlike optimising against an opaque judge, each reward improvement maps to a well-studied brain region with known function. When Wernicke's gains more than the target, we can say exactly why — co-localisation of semantic and syntactic demands.

What this means for Amphora

This experiment was about text — we ran it to understand the problem at a level of detail we can't get from image experiments alone. The core finding is that a neuroscience model trained on real fMRI can function as a reward signal that meaningfully changes a generative model's output distribution, in ways that are both measurable and interpretable.

The image version of this — using predicted brain activation as a guidance signal for generative visual AI — is the product we're building. This experiment gives us confidence that the feedback mechanism works, that the reward signal has enough gradient to train against, and that the resulting model learns something real about the structure of neural response rather than overfitting to surface features.

The question for Amphora isn't whether this mechanism works. It does. The question is how precisely we can specify the target — and how much of the emotional specification surface we can cover.

We're moving fast. The image prediction engine is live in closed beta. If you want early access, join the waitlist.

Full experiment code and figures at github.com/Shaunakm07/Brain-LLM-Fine-Tuning. TRIBE v2 is released under CC-BY-NC-4.0. Qwen2.5-3B is Apache 2.0.

Emotion as a Specification, Not a Judgment

The generative AI industry has spent enormous effort making images more realistic and more prompt-responsive. It has not given AI any mechanism to understand what an image does to the person looking at it. We think that's the last unsolved problem.

Prompt engineering is still the dominant paradigm for generative AI: a human describes what they want in words, and the model does its best to interpret those words visually. Emotional outcome is an afterthought — evaluated after the fact by the human looking at the result. If it doesn't feel right, you iterate. You guess again. You hope.

Amphora inverts this. Emotional outcome becomes an input to the generation process, not a judgment made at the end of it.

Why this is now possible

Decades of neuroimaging research have produced a clear map of which visual features activate which brain regions. The fusiform face area responds to faces and social signals. The amygdala fires for threat, awe, and arousal. The precuneus lights up for self-referential feeling, aspiration, and memory. The anterior cingulate tracks discomfort and tension.

Modern deep learning makes it tractable to generalise this map to arbitrary visual inputs. The training data is openly published in academic repositories. What Amphora contributes is the applied pipeline, the production feedback loop from real creative use, and the integration layer that makes the science accessible to builders who are not neuroscientists.

Rather than assigning a vague label like "happy" or "tense," Amphora tells you which parts of the brain the image is likely to light up.

What this unlocks

When emotional coherence is a requirement rather than a preference — when AI is generating the majority of commercial visual content — the company that owns the scientific infrastructure for measuring and targeting emotional response is in an extraordinarily durable position.

The Gauguin who paints from feeling and the Pollock who paints from impulse produce different work because they are running different internal models of what an image should do to the person looking at it. Amphora is building that model — externalising it, formalising it, and putting it inside the machines that are increasingly making the art.

The fMRI Prediction Engine: Phase 1 Complete

The core of what we're building — a lightweight model that takes any visual input and outputs predicted brain activation across emotional and perceptual regions — is working. Here's what we built and what we learned.

Phase 1 was about proving the core thesis: that a production-ready model could generalise from publicly available fMRI datasets to arbitrary real-world visual inputs — photographs, AI-generated scenes, product renders, advertising creative — with enough precision to be useful as a creative feedback signal.

What the model does

The fMRI prediction engine takes a visual input and outputs activation scores across the regions most relevant to emotional and perceptual processing:

  • Fusiform face area — recognition, intimacy, social engagement
  • Amygdala — threat, awe, arousal, emotional salience
  • Precuneus — aspiration, self-referential feeling, nostalgia
  • Anterior cingulate — discomfort, tension, moral conflict

These aren't labels. They're predicted activations — continuous scores calibrated against the neuroimaging literature, expressed in the same units as the original experimental data.

What surprised us

The model generalises better to AI-generated imagery than we expected. Our hypothesis was that synthetic images might fall outside the training distribution in ways that degraded prediction quality. In practice, high-quality AI-generated images activate the same regions as photographic equivalents, at similar magnitudes. This makes the Emotion Guidance Loop more immediately applicable to generative pipelines than we'd planned.

The training data is openly published. What we contribute is the applied pipeline — and the feedback loop from real-world creative use that makes the model better over time.

What's next

Phase 2 is the integration layer: the API that plugs the prediction engine into generative AI pipelines, and the Emotion Guidance Loop that uses predicted activation to steer generation toward a target emotional profile. We're running closed beta with a small group of creative teams and AI platform companies now. If you're on the waitlist, you'll hear from us soon.

Why We're Building Amphora

The generative AI industry has solved for realism and style. It has not solved for feeling. That gap is where Amphora lives.

Before founding Amphora, our team worked across computational neuroscience research, generative AI labs, and developer tools companies. In every environment, the same gap was visible: the tools for creating visual content had become extraordinarily powerful, but the tools for understanding what that content does to the people who see it had barely advanced beyond human intuition.

A brand team launching a campaign can test whether an image looks good. They can run it through a focus group. They can wait for click data. What they cannot do — until now — is measure, before launch, which regions of the brain their creative is likely to activate, and whether that matches the emotional response they designed for.

The founding team

The three of us bring complementary backgrounds to this problem. Our research lead holds a computational neuroscience PhD with deep background in fMRI analysis. Our engineering lead comes from a major generative AI lab. Our commercial lead has scaled developer tools companies from early revenue to Series A. We are one of a small number of companies working at the intersection of neuroscience and AI.

The market we're building for

Our largest immediate market is advertising and brand creative — teams that need to audit emotional impact before assets go live. Beyond that, we're building for AI-native creative platforms that want to offer emotional targeting as a product feature, entertainment and game studios that need scientific instruments for creative decisions, and eventually the medical and therapeutic design space.

The long-term outcome isn't selling scores. It's becoming the emotional specification standard for AI-generated content — the layer that every creative AI pipeline runs through before an image is considered finished, the way a spellchecker runs before a document is considered done.

— The Amphora Team