Abstracting Cross-Domain Action Sequences into Interpretable Workflows

2026-06-12Artificial Intelligence

Artificial IntelligenceComputation and LanguageMachine Learning
AI summary

The authors present WorkflowView, a new method that uses large language models to turn detailed digital activity logs into easy-to-understand high-level tasks. This helps make sense of noisy, complicated data from different software and user actions. They tested their approach on tasks like figuring out what someone did in a browser, predicting if students will stop online courses, and analyzing privacy-safe AI use in Microsoft Word. Their results show that WorkflowView works well across these different areas and may help improve how digital behavior is understood and used. They also talk about challenges like keeping data private and running the system efficiently.

sequential interaction logslarge language modelsworkflow abstractionzero-shot learningfew-shot learningMOOC dropout predictionprivacy-preserving analysiscomputational efficiencyuser activity clusteringdigital behavior analysis
Authors
Gaurav Verma, Scott Counts
Abstract
Sequential or time-stamped interaction logs provide objective records of digital application usage, yet their granularity and noise often obscure meaningful insights into people's work. Such insights are essential for improving digital products in ways grounded in real-world user interactions. Prior research has applied deep learning models to cluster user actions into high-level activities, but these approaches are highly sensitive to noise and struggle to generalize across applications. To address this limitation, we introduce WorkflowView, a framework that uses large language models (LLMs) to abstract low-level action sequences into high-level activities. We establish the effectiveness and generality of our approach across three distinct, challenging sequential tasks and diverse domains: (a) zero-shot task description reconstruction from browser logs (achieving high semantic similarity, $μ_{sim} = 0.91$), (b) few-shot student dropout prediction using MOOC interaction logs (reaching weighted $F_1 = 0.90$ with only five few-shot examples), and (c) anonymized, privacy-preserving analysis of AI tool integration within document workflows in Microsoft Word. Our work demonstrates that LLM-based abstraction is a robust and efficient path forward for transforming low-level behavioral data into high-level, interpretable, and actionable insights. We also discuss practical considerations for deploying LLM-based inferences within logging infrastructures, including computational efficiency and user privacy.