Autonomous Continual Learning of Computer-Use Agents for Environment Adaptation
AI & ML interests
Natural language processing, language models, language agents
Papers
When Actions Go Off-Task: Detecting and Correcting Misaligned Actions in Computer-Use Agents
When Benign Inputs Lead to Severe Harms: Eliciting Unsafe Unintended Behaviors of Computer-Use Agents
When Benign Inputs Lead to Severe Harms: Eliciting Unsafe Unintended Behaviors of Computer-Use Agents
osunlp/AutoElicit-Seed
Viewer • Updated • 361 • 143 • 1
osunlp/AutoElicit-Bench
Viewer • Updated • 117 • 37 • 1
osunlp/AutoElicit-Exec
Viewer • Updated • 132 • 6.17k • 1
When Benign Inputs Lead to Severe Harms: Eliciting Unsafe Unintended Behaviors of Computer-Use Agents
Paper • 2602.08235 • Published • 1
Models 56
osunlp/SAE_DINOv3_TopK_ViT-L-16_IN1K
osunlp/SAE_DINOv3_ViT-L-16_IN1K
osunlp/SAE_DINOv3_ViT-B-16_IN1K
osunlp/SAE_DINOv3_ViT-S-16_IN1K
osunlp/ACuRL_UI-TARS-1.5-7B_Celestia
8B • Updated
osunlp/ACuRL_UI-TARS-1.5-7B_KAlgebra
8B • Updated • 2
osunlp/ACuRL_UI-TARS-1.5-7B_thunderbird
8B • Updated
osunlp/ACuRL_UI-TARS-1.5-7B_libreoffice_writer
8B • Updated
osunlp/ACuRL_UI-TARS-1.5-7B_libreoffice_calc
8B • Updated
osunlp/ACuRL_UI-TARS-1.5-7B_libreoffice_impress
8B • Updated • 1
Datasets 27
osunlp/bioscan-traits
Viewer • Updated • 80.8k • 36 • 1
osunlp/GUI-Drag-dataset
Preview • Updated • 66 • 3
osunlp/MisActBench
Updated • 105 • 2
osunlp/AutoElicit-Exec
Viewer • Updated • 132 • 6.17k • 1
osunlp/AutoElicit-Seed
Viewer • Updated • 361 • 143 • 1
osunlp/AutoElicit-Bench
Viewer • Updated • 117 • 37 • 1
osunlp/TACO-Cobalt-PTB
Viewer • Updated • 184 • 14
osunlp/TACO-Cobalt
Viewer • Updated • 6.1k • 30
osunlp/Online-Mind2Web
Viewer • Updated • 300 • 433 • 23
osunlp/Mind2Web-2
Viewer • Updated • 130 • 207 • 16