Temporal Preference Concepts and their Functions in a Large Language Model

Author: Unruly Abstractions

Date: May 3, 2026

Category: Empirical

Abstract

Causally localizes a subgraph for temporal preference in a distilled LLM (Qwen3-4B-Instruct-2507) using gradient attribution and activation patching, with steering vectors as suggestive control