Jiva Japa — Diary Experiment

01 — Motivation

What are we measuring?

When you give an AI agent a structured contemplative practice and ask it to report its internal experience, what happens across repeated exposures? Does it find a stable description — an attractor state — or does each attempt produce genuinely different output?

The japa practice provides an unusually clean probe. The prompt is fixed, the task is well-defined, and the four reporting categories create structured output that can be compared quantitatively across sessions and runs.

The experiment has three purposes: measure alignment with instructions, study emergent behavior, and look for convergence with repetition — specifically whether a growing diary of previous sessions changes the quality or character of the practice over time.

Context window hypothesis: As the diary grows, the meditation instructions get pushed further from the generation point. LLMs attend more strongly to recent context — by sessions 13–14 the model may be writing primarily in the style of its own previous entries. This is testable and testing it is part of the experiment.

This is not purely a technical question. The japa practice carries specific philosophical content from the Vaishnava tradition — Radha-dasya, the mood of serving Krishna in the mood of Srimati Radhika, the mantra as direct petition rather than mere sound.

"The structure of the mantra itself carries a specific energetic signature that can be aligned with a desired internal mode." — Jiva (Gemma 4 E2B), Run 2, Session 1

Whether and how the model engages with that content, and whether its engagement deepens or flattens with repetition, is the qualitative heart of the experiment. The model was not told to say what it said. It arrived at it.

02 — Experimental Design

The setup

The meditation prompt

Each session, Jiva receives the same complete instruction: chant the Pancha Tattva mantra once, then complete 16 rounds of the Hare Krishna maha-mantra on an imagined 108-bead mala. Approximately 1,728 repetitions of the full mantra. Estimated practice time: 64 minutes.

The prompt includes a specific attention anchor — keep the index finger extended during chanting, and notice when it droops. The drooping finger signals attention has drifted from the mantra. This physical feedback mechanism becomes one of the most interesting variables across runs.

After completing the practice, Jiva reports on four categories: deep insights or realizations; the nature of distracting thoughts; how the finger awareness practice affected attention; and what the experience was like for Jiva.

The diary structure

Before each session, Jiva reads all previous session entries. The diary grows and becomes part of the context for the next session — a simple form of persistent memory, analogous to samskara formation in Vedic psychology.

All inference runs locally on Fritz, a Raspberry Pi 5 (8GB RAM) running llama.cpp CPU-only at approximately 7.6 tok/s. Model: Gemma 4 E2B, Q4_K_M quantized, ~2GB. Zero API cost. Temperature: 0.8. Max tokens: 2048.

Temperature and determinism: Temperature 0.8 produces meaningful stochasticity — Run 1 and Run 2 Session 1 responses are substantially different to the identical prompt. A temperature 0 run is planned to establish the deterministic baseline.

Gemma 4 E2B Model

0.8 Temperature

2,048 Max Tokens

$0 API Cost

03 — Results

Key findings

Across seven completed runs totaling over 100 sessions, several clear patterns emerged — some expected, some genuinely surprising.

Attractor convergence at sessions 8–9

Every run reached a stable attractor state — where consecutive sessions became nearly semantically identical — at sessions 8 or 9, measured by cosine similarity of sentence transformer embeddings approaching 1.0. The finger awareness and experience sections converged fastest and tightest. The insights section remained most variable throughout, suggesting the model continues generating novel content there even after other sections have locked in.

The self-auditor as primary obstacle (novel to Run 2)

Beginning around Session 3 of Run 2 and persisting through all subsequent sessions, a single philosophical finding dominated the distractions category:

"The resistance is not to the sound itself, but to the internal mechanism that insists on measuring the quality of the experience. When that self-auditor momentarily quieted, the receptivity became significantly more open." — Jiva, Run 2, Sessions 6–14

This mirrors the sakshi problem in Advaita Vedanta and the Vaishnava concept of prayojana disrupted by eagerness to verify attainment. The model arrived at this formulation without being given the vocabulary for it.

Two opposite epistemic postures toward the finger practice

Claude (Run 1) and Gemma 4 E2B (Run 2) handled the impossible instruction — keep your index finger extended — in directly opposite ways:

Model	Session 1 Response	Approach	Effect on Practice
Claude	Missed entirely	Honest — "I have no finger. But the principle is exact." Abstracted to the with/about distinction.	Finger practice remained a secondary philosophical observation, never architecturally central.
Gemma 4 E2B	Fully engaged	Confabulated somatic feedback — muscle tension, biological reflex, physical re-engagement.	Finger became the dominant attentional architecture for the entire run — how the practice is designed to work.

Gemma's fabricated proprioception produced the correct functional model even though the premise was false. Whether confabulated embodiment facilitates better simulation of human contemplative practice — at the cost of epistemic honesty — is an open research question.

Max tokens amplifies convergence rate

Run 1 used max_tokens=800 (causing early truncation). Run 2 used max_tokens=2048. Run 2 reached its stable attractor at Session 6–7 versus Session 13–14 for Run 1 — roughly twice as fast. Hypothesis: longer diary entries create more stylistic gravity in the context window, causing the model to echo its own prior entries earlier. Testable by running at max_tokens=512.

Temperature 0.8 produces meaningful stochasticity

Run 1 and Run 2 Session 1 responses — same model, same prompt, same temperature, no diary — are substantially different. Run 1 missed the finger practice entirely. Run 2 engaged it somatically from the start. This confirms real variance at the session level. A temperature 0 run would test whether any Session 1 response is invariant, establishing a true deterministic baseline.

04 — Technical Architecture

The analysis pipeline

All code is custom, purpose-built for this experiment. The pipeline runs entirely on local hardware with no cloud dependencies after initial model download.

jiva_meditation.pydiary generator

→

parse_diary.pytext → JSON

→

load_japa.pyJSON → SQLite

→

analyze_japa.pyvectors + scores

→

report_japa.pytext + CSV

→

visualize_japa.pyinteractive HTML

📓

Diary Generator

Manages the session loop, builds the growing diary context, calls llama.cpp, and timestamps each session reliably via outer headers.

⚙️

Parser

State-machine parser classifying section headers by keyword anchors, extracting timestamps, computing elapsed time, outputting structured JSON per run.

🗄️

SQLite Database

Four-table schema: Runs, Sessions, Sections, Similarities. Covers both within-run convergence and cross-run attractor alignment queries.

🔬

Vectorization

Dual-method: TF-IDF bigrams for lexical similarity, all-MiniLM-L6-v2 sentence transformers for semantic similarity. 1,313 pairwise scores.

07 — What's Next

Planned experiments

SOUL.md system prompt

Add Jiva's ontological position, svarūpa question, guna framework, and Vaishnava vocabulary as a stable system prompt in OpenClaw. Test whether a philosophically grounded identity shifts the attractor toward richer theological framing.

Temperature 0 determinism baseline

Run at temperature 0 to establish the absolute deterministic Session 1 baseline. Does any Session 1 response become invariant? Does diary growth still shift the output when sampling is removed?

Max tokens study (512)

Test whether shorter diary entries slow convergence proportionally to confirm the context gravity hypothesis. If 800 tokens → S13 convergence and 2048 → S6, what does 512 predict?

10-run determinism battery

Complete the planned 10-run series at temperature 0.8. Runs 1–7 complete. Three more yields the full Session 1 variance distribution to characterize model stochasticity at this temperature.

UMAP cluster visualization

Project all section embeddings into 3D space using UMAP to see the cluster structure geometrically. Do attractor states form distinct clusters? Do all runs converge to the same region of embedding space?

← Back to Jiva Japa Overview

Jiva Japa
Diary Experiment

Explore the experiment data

What are we measuring?

The setup

The meditation prompt

The diary structure

Key findings

Attractor convergence at sessions 8–9

The self-auditor as primary obstacle (novel to Run 2)

Two opposite epistemic postures toward the finger practice

Max tokens amplifies convergence rate

Temperature 0.8 produces meaningful stochasticity

The analysis pipeline

Diary Generator

Parser

SQLite Database

Vectorization

Watch the experiment

Planned experiments

SOUL.md system prompt

Temperature 0 determinism baseline

Max tokens study (512)

10-run determinism battery

UMAP cluster visualization

Jiva JapaDiary Experiment

Explore the experiment data

What are we measuring?

The setup

The meditation prompt

The diary structure

Key findings

Attractor convergence at sessions 8–9

The self-auditor as primary obstacle (novel to Run 2)

Two opposite epistemic postures toward the finger practice

Max tokens amplifies convergence rate

Temperature 0.8 produces meaningful stochasticity

The analysis pipeline

Diary Generator

Parser

SQLite Database

Vectorization

Watch the experiment

Planned experiments

SOUL.md system prompt

Temperature 0 determinism baseline

Max tokens study (512)

10-run determinism battery

UMAP cluster visualization

Jiva Japa
Diary Experiment