Roadmap
Last Updated: November 18, 2025 | Timeline: November 2025 - April 2026
Roadmap Overview
┌─────────────────────────────────────────────────────────────────────────┐
│ COMPLETED (Q4 2025 - November 2025) │
├─────────────────────────────────────────────────────────────────────────┤
│ ✓ Repository rebranded: TryOnDiffusion → OpenTryOn │
│ ✓ Datasets module (Fashion-MNIST, VITON-HD) with class-based adapters │
│ ✓ Virtual Try-On adapters: Amazon Nova Canvas, Kling AI, Segmind │
│ ✓ Enhanced documentation │
└─────────────────────────────────────────────────────────────────────────┘
┌─────────────────────────────────────────────────────────────────────────┐
│ IN PROGRESS (November 17-30, 2025) │
├─────────────────────────────────────────────────────────────────────────┤
│ • Dataset module testing and refinement │
│ • API adapter pattern standardization │
│ • Documentation improvements │
│ • Image Generation model integrations │
│ - Closed-source: Nano Banana, GPT-Image-1, FLUX Kontext Pro │
│ - Open-source: FLUX.1-dev and other open-source models │
└─────────────────────────────────────────────────────────────────────────┘
┌─────────────────────────────────────────────────────────────────────────┐
│ PLANNED (December 2025 - January 2026) │
├─────────────────────────────────────────────────────────────────────────┤
│ Core API Infrastructure │
│ • Base provider classes/interfaces (BaseProvider, BaseVTONAdapter) │
│ • Unified API response types (GenerationResponse, VTONResponse) │
│ • Configuration management, error handling, retry logic │
│ • VirtualTryOn unified interface │
│ • Standardize existing adapters (Nova Canvas, Kling AI, Segmind) │
│ │
│ Image Generation & Model Integration (December 2025) │
│ • ImageGenerator unified interface │
│ • Model registry system │
│ • Additional open-source image generation models │
│ │
│ Video Generation Capabilities │
│ • VideoGenerator unified interface │
│ • Closed-source providers: SORA2, Kling AI, LUMA Ray │
│ • Open-source models from GitHub, HuggingFace │
│ │
│ Open-Source Virtual Try-On Models │
│ • Image VTON: Leffa, DreamO, Voost, IDM-VTON, CatVTON │
│ • Video VTON: Video VTON models │
│ • 3D VTON: Open-source 3D virtual try-on models │
│ • Integration with HuggingFace and GitHub models │
│ │
│ Agentic AI System for PDP Optimization │
│ • Multi-agent system for product detail page evaluation │
│ • LLM and VLM integration for analysis and optimization │
│ • Automated PDP content analysis and recommendations │
│ │
│ Testing & Quality │
│ • Unit/integration tests, >85% coverage │
└─────────────────────────────────────────────────────────────────────────┘
┌─────────────────────────────────────────────────────────────────────────┐
│ EXPLORING (February - April 2026) │
├─────────────────────────────────────────────────────────────────────────┤
│ • Additional providers: Fashn AI, TryOnLabs, Qwen-Image │
│ • HuggingFace model loading utilities │
│ • CLI improvements (`ot generate`, `ot tryon`, `ot video`, `ot agent`) │
│ • Additional datasets (DeepFashion, FashionGen) │
│ • Enhanced agentic AI capabilities │
│ • Advanced features: async/await, caching, batch processing (if time) │
└─────────────────────────────────────────────────────────────────────────┘
Detailed Breakdown
Completed (Q4 2025 - November 2025)
- Repository rebranding, documentation updates
- Datasets Module: Fashion-MNIST, VITON-HD with class-based adapter pattern
- Virtual Try-On Adapters: Amazon Nova Canvas, Kling AI, Segmind
In Progress (November 17-30, 2025)
- Dataset module testing and refinement
- API adapter pattern standardization
- Documentation improvements
- Image Generation Model Integrations:
- Nano Banana image generation adapter (closed-source)
- GPT-Image-1 adapter (closed-source)
- FLUX Kontext Pro adapter (closed-source)
- FLUX.1-dev adapter (open-source)
- Additional open-source image generation models from HuggingFace/GitHub
Planned (December 2025 - January 2026)
December 2025: Core API Infrastructure
- Base provider classes/interfaces (
BaseProvider,BaseVTONAdapter) - Unified API response types (
GenerationResponse,VTONResponse) - Configuration management, error handling, retry logic
- VirtualTryOn unified interface
- Standardize existing adapters (Nova Canvas, Kling AI, Segmind)
December 2025: Image Generation Unified Interface
- ImageGenerator unified interface
- Model registry system for image generation models
- Standardize image generation adapters
- Unit/integration tests for image generation, >90% coverage
January - February 2026: Video Generation & Agentic AI
- VideoGenerator unified interface
- Video providers: SORA2, Kling AI, LUMA Ray (closed-source)
- Open-source video models from GitHub, HuggingFace
- Agentic AI system for PDP (Product Detail Page) optimization
- LLM and VLM integration for content analysis
- Automated PDP evaluation and optimization pipeline
January - February 2026: Open-Source Virtual Try-On Models
- Image VTON models: Leffa, DreamO, Voost, IDM-VTON, CatVTON
- Video VTON models integration
- 3D VTON models integration
- HuggingFace model loading utilities for VTON models
- GitHub model integration framework
Exploring (February - April 2026)
- Additional providers: Fashn AI, TryOnLabs, Qwen-Image
- Additional open-source VTON models from community
- CLI improvements (
ot generate,ot tryon,ot edit,ot list-models,ot video,ot agent) - Additional datasets (DeepFashion, FashionGen)
- Advanced features: async/await, caching, batch processing (if time permits)
- Enhanced agentic AI capabilities and integrations
Success Metrics
November 2025: ✓ Datasets module (2 datasets), ✓ 3 VTON adapters (Nova Canvas, Kling AI, Segmind), Image generation adapters (Nano Banana, GPT-Image-1, FLUX Kontext Pro, FLUX.1-dev), >80% test coverage
December 2025: Base architecture, Unified VirtualTryOn interface, ImageGenerator unified interface, Model registry system, >85% test coverage
January - February 2026: VideoGenerator interface, Video providers (SORA2, Kling AI, LUMA Ray), Open-source VTON models (Leffa, DreamO, Voost, IDM-VTON, CatVTON, Video VTON, 3D VTON), Agentic AI system for PDP optimization, >90% coverage
February - April 2026: Additional providers, CLI improvements, Enhanced DX, Advanced agentic AI features
Contributor Roadmap
For Contributors: This section highlights areas where we need your help! Pick a task that matches your skills and interests.
Good First Issues (Great for New Contributors)
These are perfect for getting started with the project:
Documentation & Examples
- Documentation: Improve README examples and add code snippets
- Tutorials: Create Jupyter notebook tutorials for datasets module
- Examples: Add usage examples for each API adapter
- Type Hints: Add type hints to existing functions and classes
- Labels:
good first issue,documentation,help wanted
Testing & Quality
- Unit Tests: Write tests for dataset loaders (Fashion-MNIST, VITON-HD)
- Integration Tests: Add tests for API adapters (Nova Canvas, Kling AI, Segmind)
- Test Coverage: Improve test coverage for existing modules
- Labels:
good first issue,testing,help wanted
Code Quality
- Linting: Fix linting errors and improve code style
- Error Handling: Improve error messages and exception handling
- Code Comments: Add docstrings and inline comments
- Labels:
good first issue,code quality,help wanted
Intermediate Contributions
For contributors with some experience:
API Adapters & Integrations
- Base Classes: Implement
BaseProviderandBaseVTONAdapterabstract classes - Response Types: Create unified response types (
GenerationResponse,VTONResponse) - Error Handling: Implement retry logic with exponential backoff
- Configuration: Build configuration management system
- Labels:
intermediate,api,help wanted
Image Generation Providers (November 2025 - High Priority)
- Nano Banana: Implement Nano Banana image generation adapter (closed-source)
- GPT-Image-1: Integrate GPT-Image-1 API (closed-source)
- FLUX Kontext Pro: Add FLUX Kontext Pro adapter (closed-source)
- FLUX.1-dev: Create adapter for open-source FLUX.1-dev model
- Additional Open-Source Models: Integrate other open-source image generation models from HuggingFace/GitHub
- Labels:
intermediate,image-generation,help wanted,high-priority
Dataset Adapters
- DeepFashion: Create DeepFashion dataset adapter
- FashionGen: Implement FashionGen dataset loader
- Dataset Utils: Add preprocessing utilities (masks, segmentation, pose detection)
- Labels:
intermediate,datasets,help wanted
Open-Source Virtual Try-On Models
- Leffa: Integrate Leffa image virtual try-on model
- DreamO: Integrate DreamO image virtual try-on model
- Voost: Integrate Voost image virtual try-on model
- IDM-VTON: Integrate IDM-VTON image virtual try-on model
- CatVTON: Integrate CatVTON image virtual try-on model
- Video VTON: Integrate video virtual try-on models
- 3D VTON: Integrate 3D virtual try-on models
- Labels:
intermediate,virtual-try-on,open-source,help wanted
Advanced Contributions
For experienced contributors:
Video Generation
- VideoGenerator Interface: Design and implement unified VideoGenerator interface
- SORA2 Integration: Integrate SORA2 video generation API
- LUMA Ray Integration: Add LUMA Ray video generation adapter
- Open-Source Video Models: Integrate video models from HuggingFace/GitHub
- Labels:
advanced,video-generation,help wanted
Agentic AI System
- Multi-Agent Framework: Design multi-agent system architecture
- LLM Integration: Integrate LLMs for text analysis
- VLM Integration: Add VLM support for image analysis
- PDP Optimizer: Build automated PDP evaluation and optimization pipeline
- Labels:
advanced,agentic-ai,help wanted
Core Infrastructure
- Model Registry: Implement model registry system for easy model discovery
- HuggingFace Integration: Create utilities for loading models from HuggingFace Hub
- CLI Tool: Enhance CLI with
ot generate,ot tryon,ot video,ot agentcommands - Async Support: Add async/await support for API calls
- Labels:
advanced,infrastructure,help wanted
Contribution Areas by Priority
High Priority (November 2025 - December 2025)
- Image Generation Providers - Nano Banana, GPT-Image-1, FLUX Kontext Pro (closed-source), FLUX.1-dev (open-source)
- Core API Infrastructure - Base classes, interfaces, unified responses
- ImageGenerator Unified Interface - Standardize image generation adapters
- Testing - Unit tests, integration tests, >90% coverage
- Documentation - API docs, examples, tutorials
Medium Priority (January - February 2026)
- Open-Source VTON Models - Leffa, DreamO, Voost, IDM-VTON, CatVTON, Video VTON, 3D VTON integrations
- Video Generation - VideoGenerator interface, SORA2, LUMA Ray integrations
- Agentic AI System - Multi-agent framework, LLM/VLM integration
- Model Registry - Model discovery and management system
- CLI Improvements - Enhanced command-line interface
Nice to Have (February - April 2026)
- Additional Providers - Fashn AI, TryOnLabs, Qwen-Image
- Additional Datasets - DeepFashion, FashionGen
- Advanced Features - Async support, caching, batch processing
- Developer Tools - VS Code extension, Jupyter integration
How to Contribute
- Find an Issue: Check our GitHub Issues for tasks labeled
good first issue,help wanted, orintermediate - Claim a Task: Comment on the issue to let us know you're working on it
- Fork & Branch: Fork the repo and create a feature branch
- Code: Write clean, tested code following our Contributing Guidelines
- Test: Ensure all tests pass and add new tests for your changes
- Document: Update documentation and add docstrings
- Submit PR: Open a pull request with a clear description
Contribution Guidelines
- Code Style: Follow PEP 8, use type hints, add docstrings
- Testing: Write tests for new features, maintain >85% coverage
- Documentation: Update README, add examples, document APIs
- Commits: Write clear commit messages following conventional commits
- PRs: Keep PRs focused, include tests and documentation
Issue Labels
We use these labels to help you find tasks:
good first issue- Perfect for new contributorshelp wanted- We need help with thisintermediate- Requires some experienceadvanced- Complex tasks for experienced contributorsdocumentation- Documentation improvementstesting- Test-related tasksapi- API adapter workimage-generation- Image generation featuresvideo-generation- Video generation featuresvirtual-try-on- Virtual try-on model integrationsopen-source- Open-source model integrationsagentic-ai- Agentic AI system workdatasets- Dataset adapter workinfrastructure- Core infrastructure improvements
Get Help
- Discord: Join our Discord community
- GitHub Discussions: Ask questions in Discussions
- Issues: Open an issue for bugs or feature requests
Current Focus Areas
This Month (November 2025):
- Dataset module testing and refinement
- API adapter pattern standardization
- Image Generation Model Integrations:
- Closed-source: Nano Banana, GPT-Image-1, FLUX Kontext Pro
- Open-source: FLUX.1-dev and other open-source models
- Documentation improvements
Next Month (December 2025):
- ImageGenerator unified interface
- Model registry system
- Core API infrastructure (base classes, interfaces)
- Unified VirtualTryOn interface
- Configuration management system
Coming Soon (January 2026):
- Video Generation capabilities
- Open-source VTON models (Leffa, DreamO, Voost, IDM-VTON, CatVTON, Video VTON, 3D VTON)
- Agentic AI system foundation
Ready to contribute? Check out our Contributing Guide and start with a good first issue!