NiceWebRL: a Python library for human subject experiments with reinforcement learning environments

Agentic AI
Published: arXiv: 2508.15693v1
Authors

Wilka Carvalho Vikram Goddla Ishaan Sinha Hoon Shin Kunal Jha

Abstract

We present NiceWebRL, a research tool that enables researchers to use machine reinforcement learning (RL) environments for online human subject experiments. NiceWebRL is a Python library that allows any Jax-based environment to be transformed into an online interface, supporting both single-agent and multi-agent environments. As such, NiceWebRL enables AI researchers to compare their algorithms to human performance, cognitive scientists to test ML algorithms as theories for human cognition, and multi-agent researchers to develop algorithms for human-AI collaboration. We showcase NiceWebRL with 3 case studies that demonstrate its potential to help develop Human-like AI, Human-compatible AI, and Human-assistive AI. In the first case study (Human-like AI), NiceWebRL enables the development of a novel RL model of cognition. Here, NiceWebRL facilitates testing this model against human participants in both a grid world and Craftax, a 2D Minecraft domain. In our second case study (Human-compatible AI), NiceWebRL enables the development of a novel multi-agent RL algorithm that can generalize to human partners in the Overcooked domain. Finally, in our third case study (Human-assistive AI), we show how NiceWebRL can allow researchers to study how an LLM can assist humans on complex tasks in XLand-Minigrid, an environment with millions of hierarchical tasks. The library is available at https://github.com/KempnerInstitute/nicewebrl.

Paper Summary

Problem
The main problem addressed by this paper is the need for a research tool that enables researchers to compare artificial intelligence (AI) agents with human performance in various environments. This is particularly important for developing AI systems that are human-like, compatible with humans, and assistive to humans.
Key Innovation
The innovation presented in this paper is NiceWebRL, a Python library that transforms Jax-based environments into online interfaces for human subject experiments. This library allows researchers to use machine reinforcement learning (RL) environments for online human subject experiments, supporting both single-agent and multi-agent environments.
Practical Impact
NiceWebRL has the potential to impact various fields, including AI research, cognitive science, and multi-agent research. It enables researchers to: * Compare AI algorithms with human performance * Test ML algorithms as theories for human cognition * Develop algorithms for human-AI collaboration * Study how LLMs can assist humans on complex tasks The library is available on GitHub, and the authors provide several functional example folders using NiceWebRL across three scenarios: Human-like AI, Human-compatible AI, and Human-assistive AI.
Analogy / Intuitive Explanation
Imagine a virtual playground where humans and AI agents can interact and learn from each other. NiceWebRL is like a meta-environment that enables the creation of this playground, allowing researchers to design and test AI systems that can work collaboratively with humans. Just as children learn and develop skills in a playground, AI agents can learn and improve their performance through interactions with humans in this virtual environment.
Paper Information
Categories:
cs.AI
Published Date:

arXiv ID:

2508.15693v1

Quick Actions