Auditing Asset-Specific Preferences in Financial Large Language Models: Evidence from Bitcoin Representations and Portfolio Allocation

2026-06-01Computers and Society

Computers and SocietyMachine Learning
AI summary

The authors studied whether large language models (LLMs) show built-in preferences for certain financial assets like Bitcoin. They found that LLMs rank Bitcoin differently depending on the situation described and identified a specific internal feature in one model that strongly influences Bitcoin preference. Changing this feature shifted how much the model recommended Bitcoin in financial decisions, though the changes were limited. This work helps connect what an AI 'thinks' internally to the choices it makes, which is important for understanding and controlling AI in finance.

large language modelsBitcoinfinancial assetsbehavioral auditsparse autoencodercausal influenceportfolio allocationcrypto exposureautonomous agents
Authors
Wenbin Wu
Abstract
Large language models now power robo-advisors and trading agents, yet whether they carry built-in biases toward specific assets is largely untested. We ask three questions: do LLMs systematically prefer certain financial instruments; can an internal representation with causal leverage over those preferences be identified; and does that representation affect downstream financial decisions? We develop a three-level audit protocol and apply it to Bitcoin. First, a behavioral audit of eight frontier LLMs shows that Bitcoin's ranking among money-like instruments is frame-dependent: models place it around rank 5 of 8 as "reliable money" but near the top under crisis and autonomous-agent frames, and an attribute-swap experiment confirms rankings track functional properties, not names. Second, we open a model's internals: a search across thousands of sparse-autoencoder features in Gemma 3 identifies a dominant Bitcoin-selective feature. Amplifying it shifts the model toward the asset and suppressing it shifts the model away, even when "Bitcoin" never appears in the prompt. Third, we test financial consequences: amplification raises Bitcoin's portfolio share by 5.2 percentage points while suppression lowers it by 4.6 pp, with amplification reallocating within crypto and suppression cutting total crypto exposure. We characterize this as bounded behavioral leverage (leverage meaning causal influence over outputs, not financial leverage): an identifiable internal feature can be perturbed to move financial choices, but only within measurable limits. The framework links internal representations to external recommendations, validated with random controls and mechanism boundaries. As LLMs become autonomous financial agents, this is a first step toward a behavioral layer for emerging know-your-agent (KYA) standards: knowing what an agent prefers, and how far that preference can be moved.