Unified CLI (`opentryon`)

Once OpenTryOn is installed (pip install -e . or pip install opentryon), every adapter in the repo is available through a single opentryon command with three levels of control: service → model → parameters.

opentryon <service> --model <model> [params...]

service: what kind of task -- vton, generate, edit, understand, video-generate, bg-remove
model: which adapter/provider to use for that service, e.g. --model flux-vto
parameters: model-specific flags (image inputs, prompts, sampling knobs, etc.)

Services and Models

Service	What it does	Models
`vton`	Virtual try-on: compose a garment onto a person image	`flux-vto`, `nova-canvas`, `kling-ai`, `segmind`
`generate`	Text-to-image generation	`nano-banana`, `nano-banana-pro`, `nano-banana-2`, `flux2-pro`, `flux2-flex`, `flux2-turbo` (local), `gpt-image`, `luma-image`
`edit`	Image editing (image + instruction → image)	`nano-banana`, `nano-banana-pro`, `nano-banana-2`, `flux2-pro`, `flux2-flex`, `flux2-turbo` (local), `gpt-image`
`understand`	Image/video understanding	`kimi-k2.6`, `kimi-k2.7-code`, `kimi-vl` (local), `llava-next` (local)
`video-generate`	Text/image-to-video generation	`veo`, `sora`, `luma-video`
`bg-remove`	Background removal	`ben2` (local)

Models marked "local" run on your own GPU and require pip install opentryon[local]; everything else calls a cloud API and needs the corresponding API key set in your environment (see Configuration).

Discovering Flags

Every level of the CLI is self-documenting:

opentryon --help                              # list services
opentryon understand --help                   # list models for a service
opentryon understand --model kimi-k2.6 --help # list that model's parameters

Examples

# Virtual try-on
opentryon vton --model flux-vto \
  --person-image model.png --garment-image garment.png

# Text-to-image
opentryon generate --model nano-banana-pro \
  --prompt "A fashion model wearing elegant evening wear" --resolution 4K

# Image editing
opentryon edit --model gpt-image \
  --images person.jpg --prompt "Change the jacket to black leather"

# Image/video understanding (Kimi K2.6, general-purpose -- not fashion-only)
opentryon understand --model kimi-k2.6 \
  --image garment.jpg --prompt "Describe this outfit."
opentryon understand --model kimi-k2.6 \
  --video runway_clip.mp4 --prompt "Summarize the styling shown."

# Coding-focused multimodal understanding
opentryon understand --model kimi-k2.7-code \
  --image ui_mockup.png --prompt "Write the HTML/CSS for this design."

# Open-weight local understanding (no API key, needs a GPU)
opentryon understand --model kimi-vl --image garment.jpg

# Text-to-video
opentryon video-generate --model veo \
  --prompt "A model walking a runway in slow motion" --duration 6

# Background removal
opentryon bg-remove --model ben2 --image product.jpg --refine

Every command accepts -o/--output-dir (default: outputs/) and --dry-run (print the resolved adapter call without invoking the API/GPU):

opentryon vton --model flux-vto \
  --person-image model.png --garment-image garment.png --dry-run

Local (GPU-only) Models

Local models (flux2-turbo, kimi-vl, llava-next, ben2) need the local extra:

pip install opentryon[local]

Running a local model without it prints an install hint instead of a raw stack trace:

✗ 'Kimi-VL (open-weight, local)' requires local ML dependencies that aren't installed.
  Install them with: pip install opentryon[local]

Services and Models​

Discovering Flags​

Examples​

Local (GPU-only) Models​

See Also​

Services and Models

Discovering Flags

Examples

Local (GPU-only) Models

See Also