Deploy here: https://get.runpod.io/diffusion-pipe-template
Click the Terminal button to open a command prompt.
Type the following command and press Enter:
bash interactive_start_training.sh
The script will guide you through:
- Model Selection - Choose from Flux, SDXL, or Wan models
- API Keys - Enter required tokens (Hugging Face for Flux, Gemini for video captioning)
- Dataset Options - Select image captioning, video captioning, or both
- Configuration Review - Review training parameters before starting
The script will automatically:
- Download the selected model
- Generate captions for your media (if selected)
- Start LoRA training with optimized settings
Once training is complete, your trained LoRA files will be saved in:
training_outputs
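To check what a run produced, a small helper along these lines may be useful. It is only a sketch: the function name list_outputs is hypothetical, and the .safetensors extension is an assumption about how the LoRA weights are stored, so adjust the pattern if your files look different.

```shell
# Hypothetical helper: list LoRA weight files under the output folder.
# Assumption: weights use the .safetensors extension.
list_outputs() {
  local dir="${1:-training_outputs}"
  if [ ! -d "$dir" ]; then
    echo "no output directory at $dir" >&2
    return 1
  fi
  # Search recursively, since outputs may sit in per-run subfolders.
  find "$dir" -type f -name '*.safetensors' | sort
}

# Example: list_outputs training_outputs
```

Run it from the directory that contains training_outputs, or pass a different path as the first argument.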
Before running the script, place your training data in:
- Images: image_dataset_here/ folder
- Videos: video_dataset_here/ folder
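Before launching the script, it can help to confirm that the dataset folders actually contain media. The following is a minimal sketch, not part of the template: count_files is a hypothetical helper, the extension lists are assumptions, and the folder paths are assumed to be relative to the directory that holds interactive_start_training.sh.

```shell
# Hypothetical helper: count files in a folder by extension (case-insensitive).
# $1 = directory, remaining arguments = extensions without the dot.
count_files() {
  local dir="$1"; shift
  local total=0 ext n
  for ext in "$@"; do
    # A missing folder simply counts as zero files.
    n=$(find "$dir" -maxdepth 1 -type f -iname "*.${ext}" 2>/dev/null | wc -l)
    total=$((total + n))
  done
  echo "$total"
}

# Assumed extension lists; extend them to match your data.
echo "images: $(count_files image_dataset_here jpg jpeg png webp)"
echo "videos: $(count_files video_dataset_here mp4 mov)"
```

A count of 0 for a folder you meant to use usually means the files were uploaded somewhere else or use an extension not listed here.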
- First Run: Allow extra time for model downloads (can be several GB)
- API Keys: Have your Hugging Face token ready for Flux, Gemini API key for video captioning
- Monitor Progress: The script shows progress indicators for downloads and captioning
- Review Captions: You'll be prompted to manually review generated captions before training starts
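When reviewing captions, a quick way to spot media that was skipped is to look for files without a caption next to them. This sketch assumes captions are stored as .txt files sharing the media file's base name (for example cat.jpg and cat.txt); that layout is an assumption, and missing_captions is a hypothetical helper, not something the script provides.

```shell
# Hypothetical helper: print media files that have no sibling .txt caption.
# Assumption: captions are sidecar files, e.g. cat.jpg -> cat.txt.
missing_captions() {
  local dir="$1" f
  for f in "$dir"/*; do
    [ -f "$f" ] || continue          # skip subfolders and empty globs
    case "${f##*.}" in
      txt) continue ;;               # caption files are not media
    esac
    [ -f "${f%.*}.txt" ] || echo "$f"
  done
}

# Example: missing_captions image_dataset_here
```

Empty output means every media file in the folder has a matching caption file.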
The interactive script provides clear instructions and error messages to guide you through each step. Simply follow the on-screen prompts!