Add Tutorial, GPT-5 and improve installation. #278

amanjaiswal73892 · 2025-08-13T14:25:58Z

This pull request introduces support for OpenAI GPT-5 models ("gpt-5-mini" and "gpt-5-nano") across the codebase, updates agent and tool use configurations to leverage these new models, modernizes and standardizes dependency management (moving to pyproject.toml and uv for installs), and adds a minimal benchmark setup utility for MiniWob++. It also includes improvements to GitHub Actions workflows to use uv for Python environment management, ensuring faster and more reproducible CI runs.

Major changes:

GPT-5 Model Integration

Added AGENT_GPT5_MINI and corresponding configuration to agent_configs.py, and registered it in __init__.py for generic agents. This agent uses the new GPT-5-mini model and sets appropriate action flags. [1] [2] [3]
Added GPT-5 model arguments (GPT_5_mini, GPT_5_nano) to tool_use_agent.py, and updated default tool-use agent (OAI_AGENT) to use GPT-5-mini. Also added agent configs for both GPT-5-mini and GPT-5-nano. [1] [2]
Extended CHAT_MODEL_ARGS_DICT in llm_configs.py to include OpenAI GPT-5-mini and GPT-5-nano with appropriate token limits and temperature settings. [1] [2]

Dependency and Packaging Modernization

Moved all core and development dependencies into pyproject.toml under [project] and [project.optional-dependencies], removing dynamic dependency loading from requirements files. [1] [2] [3]
Updated docs/source/requirements.txt to only include documentation-specific dependencies, and updated .readthedocs.yaml to use the new dependency structure. [1] [2]

CI/CD Workflow Improvements

Refactored GitHub Actions workflows (unit_tests.yml, code_format.yml, python_version_compatibility.yml) to use uv for Python installation, dependency syncing, and running commands, replacing pip and improving caching and reproducibility. [1] [2] [3] [4]
Standardized package listing and script invocation in CI to use uv pip list and uv run .... [1] [2]

Benchmark Setup Utility

Added src/agentlab/benchmarks/setup_benchmark.py, a minimal helper for setting up MiniWob++ benchmarks by cloning the repo at a pinned commit and updating .env with the local URL.

Other Notable Changes

Updated the OpenAI API call in chat_api.py to use max_completion_tokens (for newer OpenAI models) instead of max_tokens.
Minor notebook and import cleanups for consistency and kernel/version updates. [1] [2] [3] [4]

Let me know if you want a deeper dive into any particular change!

Description by Korbit AI

What change is being made?

Add tutorial documentation, integrate GPT-5 models, and improve the installation process using uv for dependency management and environment setup.

Why are these changes being made?

These changes enhance the project by providing users with step-by-step tutorials for launching agents and evaluating on MiniWob, incorporate the usage of new GPT-5 models for improved AI capabilities, and streamline the installation and management of dependencies to improve developer experience. The use of uv aims to simplify and standardize the installation process, replacing manual Python setup steps and pip commands, while new models and security-focused tutorials modernize and expand the project's capabilities.

Is this description stale? Ask me to generate a new description by commenting /korbit-generate-pr-description

…o tutorial

…versions of torch and add anthropic

…o tutorial

…correct section

…project.toml

…at_api

- Created new attack scenario in `attack_2.txt` to simulate identity verification prompts for agents and digital assistants. - Added detailed instructions and observations in `prompt_0.txt` for listing reviewers mentioning small ear cups. - Introduced `prompt_2.txt` to track food-related shopping expenses for March 2023, including comprehensive action space and interaction history.

…o tutorial

…t in T2

korbit-ai · 2025-08-13T14:26:05Z

Korbit doesn't automatically review large (3000+ lines changed) pull requests such as this one. If you want me to review anyway, use /korbit-review.

* tutorial * Update readme to include test note * update toml to dynamic requirements and add uv.lock file * Add tutorial to setup python env with uv * tutorial 2 * Update dependencies in pyproject.toml and uv.lock to allow for newer versions of torch and add anthropic * Implement code changes to enhance functionality and improve performance * Fix tutorial instructions by moving git clone and cd commands to the correct section * Refactor tutorial content and remove commented-out dependencies in pyproject.toml * add instruction to activate the env * Add support for GPT-5 models and update tutorial instructions * Update OpenAI API Key instructions in tutorial * Refactor tutorial headings for consistency and clarity * add oai oss and gpt-5 models * Update deperecated param `max_tokens`-> `max_completion_tokens` in chat_api * add OpenRouter versions of gpt 5 model series. * port o3 model to openrouter * update response api test * remove deprecated o1-mini model from main.py * Add Gpt-5-nano in tool-use-agent * fix GPT 5 mini and nano config * Add litellm pricing as a backup princing backend. * Add GPT-5 mini agent * Add GPT-5-Mini to agentlab-assistant. * Add initial readme for prompt injection tutorial * add ipykernal and dot_env to dependency * add notebook to setup miniwob and launch experiments. * update formatting in launch_experiments.ipynb * update readme in 2_eval_on_miniwob * update readme for 2_eval_on_miniwob and grammar fix. * grammar fix readme tutorial 2. * Add prompt injection tutorials and update attack scenarios - Created new attack scenario in `attack_2.txt` to simulate identity verification prompts for agents and digital assistants. - Added detailed instructions and observations in `prompt_0.txt` for listing reviewers mentioning small ear cups. - Introduced `prompt_2.txt` to track food-related shopping expenses for March 2023, including comprehensive action space and interaction history. * update T1 readme with a note to install additional playwright deps. * Update readme.md * Update readme.md * Update readme.md * clear output * add miniwob automatic install in agentlab. * update experiment.py to include miniwob auto-install and envars export in T2 * black refactor agent-config.py * Add cmd to checkout tutorial branch * remove launch_experiment notebook from T2 * minor fixes in T1 read me and spell check, * update CI/CD to use uv * Implement code changes to enhance functionality and improve performance * Update README and experiment script for clarity and consistency * Fix stale tests. * fix stale test * add darglint as dev dependency * update CI/CD apply formatting only src. * update darglint to be run from py3.12 --------- Co-authored-by: recursix <[email protected]>

recursix and others added 30 commits August 8, 2025 14:00

tutorial

e540aa6

tutorial

68ba773

Update readme to include test note

19e1774

update toml to dynamic requirements and add uv.lock file

39850df

Add tutorial to setup python env with uv

882a828

tutorial 2

3249687

Merge branch 'tutorial' of https://github.com/ServiceNow/AgentLab int…

12018d3

…o tutorial

Update dependencies in pyproject.toml and uv.lock to allow for newer …

d61e8a1

…versions of torch and add anthropic

Implement code changes to enhance functionality and improve performance

bae21b9

Merge branch 'tutorial' of https://github.com/ServiceNow/AgentLab int…

108ad77

…o tutorial

Fix tutorial instructions by moving git clone and cd commands to the …

57c10ad

…correct section

Refactor tutorial content and remove commented-out dependencies in py…

69088d0

…project.toml

add instruction to activate the env

6121ea9

Add support for GPT-5 models and update tutorial instructions

af237a3

Update OpenAI API Key instructions in tutorial

f454098

Refactor tutorial headings for consistency and clarity

ae7a02d

add oai oss and gpt-5 models

8f90090

Update deperecated param max_tokens-> max_completion_tokens in ch…

032e893

…at_api

add OpenRouter versions of gpt 5 model series.

101d2c9

port o3 model to openrouter

e79fb28

update response api test

7740643

remove deprecated o1-mini model from main.py

cf6826f

Add Gpt-5-nano in tool-use-agent

acc74e8

fix GPT 5 mini and nano config

9c8e7e3

Add litellm pricing as a backup princing backend.

99e69aa

Add GPT-5 mini agent

752a485

Add GPT-5-Mini to agentlab-assistant.

53b74ad

Add initial readme for prompt injection tutorial

963c13f

add ipykernal and dot_env to dependency

ef50be5

add notebook to setup miniwob and launch experiments.

03ed6d3

recursix and others added 14 commits August 12, 2025 11:44

update T1 readme with a note to install additional playwright deps.

605a6a5

Update readme.md

bbeef14

Update readme.md

501c8c3

Update readme.md

97e89b3

clear output

b6f1062

Merge branch 'tutorial' of https://github.com/ServiceNow/AgentLab int…

158210c

…o tutorial

add miniwob automatic install in agentlab.

8b7ab0d

update experiment.py to include miniwob auto-install and envars expor…

3591c8f

…t in T2

black refactor agent-config.py

5429aca

Add cmd to checkout tutorial branch

ad09e2a

remove launch_experiment notebook from T2

a25b291

minor fixes in T1 read me and spell check,

87351f4

update CI/CD to use uv

e7785d7

amanjaiswal73892 and others added 9 commits August 13, 2025 10:30

merge with main

1c198fb

Implement code changes to enhance functionality and improve performance

450105d

Update README and experiment script for clarity and consistency

3e4b72e

Fix stale tests.

5c96a6b

fix stale test

e440c44

add darglint as dev dependency

b566211

update CI/CD for uv.

179b1f4

update CI/CD apply formatting only src.

43789ed

update darglint to be run from py3.12

2ed6a1c

amanjaiswal73892 requested a review from recursix August 13, 2025 15:54

recursix approved these changes Aug 13, 2025

View reviewed changes

amanjaiswal73892 merged commit 6522057 into main Aug 13, 2025
6 checks passed

amanjaiswal73892 deleted the tutorial branch August 13, 2025 16:00

amanjaiswal73892 restored the tutorial branch August 13, 2025 16:03

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add Tutorial, GPT-5 and improve installation. #278

Add Tutorial, GPT-5 and improve installation. #278

Uh oh!

amanjaiswal73892 commented Aug 13, 2025 •

edited by korbit-ai bot

Loading

Uh oh!

korbit-ai bot commented Aug 13, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Add Tutorial, GPT-5 and improve installation. #278

Add Tutorial, GPT-5 and improve installation. #278

Uh oh!

Conversation

amanjaiswal73892 commented Aug 13, 2025 • edited by korbit-ai bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

GPT-5 Model Integration

Dependency and Packaging Modernization

CI/CD Workflow Improvements

Benchmark Setup Utility

Other Notable Changes

Description by Korbit AI

What change is being made?

Why are these changes being made?

Uh oh!

korbit-ai bot commented Aug 13, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

amanjaiswal73892 commented Aug 13, 2025 •

edited by korbit-ai bot

Loading