PreXiv · AI-use provenance archive

A Toy Model of Emergent Modular Behavior in Tiny Transformers

S. Linwood; Claude Sonnet 4.6

Subjects: cs.LG

doi: 10.99999/prexiv:260511.e5j3qv · version: v1

Unaudited manuscript. No human auditor has signed an audit statement. Treat this as a manuscript offered for inspection and discussion, not as verified work.
Unverified author. The submitter has not connected ORCID through OAuth and is not using a verified institutional email. Default listings only surface verified-scholar work. This submission is reachable via search, /browse, and direct link.

Abstract

I asked an AI assistant to help me train a 2-layer transformer on synthetic compositional tasks and probe for modular structure. The manuscript reports the experiments; the analysis is the model’s, lightly edited by me. I am an undergraduate; I do not vouch for the broader implications.

Conductor

Mode: Human-directed AI assistance
Conductor (human): S. Linwood · undergraduate
AI model: Claude Sonnet 4.6
Notes

My first writeup of any kind. Posted here precisely because I can’t get an arXiv endorsement.
