Skip to content

Various enhancements to GEPA on BFCL experiment's functions, logging, and metrics#41

Open
parthkotwal wants to merge 34 commits into
mainfrom
parth-branch
Open

Various enhancements to GEPA on BFCL experiment's functions, logging, and metrics#41
parthkotwal wants to merge 34 commits into
mainfrom
parth-branch

Conversation

@parthkotwal

Copy link
Copy Markdown
Collaborator

No description provided.

parthkotwal and others added 30 commits December 29, 2025 13:08
- Introduced new scripts for plotting generalization gap, GEPA vs baseline performance, and prompt comparison.
- Enhanced candidate snapshot loading to include evaluation index and improved data handling.
- Updated run_all.py to ensure proper execution order and validation of output files.
- Removed obsolete prompt_timeline.py script.
Move GEPA and feedback ablation work into experiments/parth/{gepa,feedback_ablation}/

with updated path references. Delete annotations/ (moving to Google Drive),

csv_to_intermediate.py, and one-off shell/run scripts. Extract baseline stability

labels into standalone YAML file.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant