Posts tagged sek

Inside sekft: a shell-operator training pipeline

This is the how-it-works companion to the experiment From seed to weights: fine-tuning a shell operator. The experiment page is the why and the results. This page is the how: architecture, the four data-factory stages, the trainer, how to read a run, and the hardware constraints. It is meant for a colleague picking up the sekft repo for the first time.

Read more ...


From seed to weights: fine-tuning a shell operator

two cycles complete. At archetype-level holdout (n=16, task types absent from training), fine-tuning lifts Mistral termination from 0/16 (base) to 9/16 (tuned), same harness, only the adapter differing. The operate / terminate mechanism generalises to unseen archetypes; task competence (verified 0.31) stays archetype-local. One model, one seed; signal clean.

Read more ...


Semantic Execution Kernel

A virtual, POSIX-adjacent micro-kernel whose most unusual user is a language model. It logs in on a chat-completion pty and drives a real shell: no tool-calling, no function schemas, no agent framework. The model operates the system in the one vocabulary it already has, text.

Read more ...


Scrollback priming: can synthetic history run a shell?

replicated (N=5). Within llama3.1:8b, structure is the lever (0->2->5 clean). Cross-model (6 subjects, 3.8B-8B, non-tool + tool-trained): two axes dissociate. Operation transfers broadly, clean exit is llama-only (2/154 non-llama). Neither scale nor tool-training explains it; leading read is seed-overfit to llama.

Read more ...


Regressions…

Well, after a nice business line meeting and a lot of claims I made there about the SEK project I’m working on, it was time to reproduce my claims… Well, that didn’t work as expected. It’s been a couple of weeks and I only remembered problems I’ve been having back then. One of the runit daemons I’ve developed for model instance management (llmsv) was behaving slow, so I thought before working on anything else, let me fix that, so that reproducing my findings is smooth sailing.

Read more ...


New Impulses: The proof is in the pudding

Was on a business line meet-up with my company. Had a couple of interesting in-depth conversations and presented my work on SEK to them. The meeting was under the banner of AI anyway, so it was natural to me to follow the directive of “let’s talk about AI”.

Read more ...