How can I use Agent Lightning for fine-tuning an agent’s system prompt using OpenAI models (gpt-4o)? Can VERL be used for this? #351

@rutvik-jaiswal-deeplearning

Description

I am trying to fine-tune an existing agent using Agent Lightning, specifically its system prompt (agent behavior).
My requirements:

  • I must use OpenAI models only, such as gpt-4o base.
  • I want to use VERL to optimize or update the agent’s prompt/behavior.
  • I prefer a minimal, validated, single-file example (or as simple as possible).

My Questions:

  1. Does Agent Lightning support fine-tuning or behavioral optimization using VERL when the underlying LLM is an OpenAI model?
  2. Is VERL compatible with Agent Lightning for updating prompts or performing reward-based optimization on an OpenAI-powered agent?
  3. Has VERL been used for prompt optimization with an LLM, and can you share a validated minimal example demonstrating how to integrate VERL + Agent Lightning + OpenAI (gpt-4o) for fine-tuning an agent’s system prompt?

Labels: apo, question
