// POSTED: Apr 14, 2026

AI Developer Needed to Build Custom Video Generation Model (Text-to-Video / Image-to-Video)

Job Description: We are seeking an experienced AI/ML Developer to build a custom video generation AI system capable of generating high-quality videos from text prompts and/or images. The ideal candidate should have strong experience in Generative AI, diffusion models, and video synthesis pipelines. This project involves building, fine-tuning, or integrating an AI model capable of producing short-form videos for marketing, social media, and commercial use. Project Scope: Develop or fine-tune a Text-to-Video or Image-to-Video AI model Work with models such as Stable Video Diffusion or other open-source video diffusion frameworks Implement prompt optimization and output quality enhancement Add support for background music integration Add subtitle generation (optional) Add voiceover syncing (optional) Optimize model performance for efficient rendering Deploy the solution via web application or API Handle cloud deployment (AWS, GCP, or Azure preferred) Required Skills: Strong Python programming skills Experience with PyTorch or TensorFlow Hands-on experience with diffusion models and generative AI Knowledge of video processing tools such as FFmpeg or OpenCV Experience with Hugging Face and Transformers GPU training and CUDA optimization API development (FastAPI preferred) Cloud infrastructure and deployment experience Deliverables: Fully functional video generation pipeline Clean, well-documented source code Deployment documentation Performance optimization report Post-delivery support for bug fixes and improvements Project Timeline: Estimated 4–8 weeks depending on scope and complexity.

APPLY NOW

AI Developer Needed to Build Custom Video Generation Model (Text-to-Video / Image-to-Video)

More Remote Jobs