Nvidia Driver Bug & PC Reboots
Nvidia driver 576.02 has been reported to cause incorrect GPU cooling behavior. A clean reinstall or an update to hotfix 576.15 can fix this. Hard PC reboots during generation might also be related to recent drivers; consider rolling back to 566.36. Also check your Power Supply Unit (PSU), as it is another possible cause of reboots under load.
Links:
- https://nvidia.custhelp.com/app/answers/detail/a_id/5650/~/geforce-hotfix-display-driver-version-576.15
- https://www.reddit.com/r/nvidia/comments/1k0iop7/game_ready_studio_driver_57602_faqdiscussion/
- https://github.com/lllyasviel/stable-diffusion-webui-forge
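To see which of the driver versions above applies to a machine, a minimal Python sketch along these lines may help. It assumes `nvidia-smi` is on the PATH and reads the version through its `--query-gpu=driver_version` query. The helper names (`parse_version`, `advise`, `installed_driver_version`) and the advice strings are hypothetical, written for this note, and only restate the suggestions above.

```python
import subprocess

# Versions named in the notes above; their roles are taken from that text.
AFFECTED = (576, 2)    # 576.02, reported cooling issue
HOTFIX = (576, 15)     # 576.15 hotfix


def parse_version(text):
    """Turn a driver string such as '576.02' into a comparable tuple (576, 2)."""
    parts = text.strip().split(".")
    return int(parts[0]), int(parts[1])


def advise(version_text):
    """Return a short suggestion based only on the versions mentioned in the notes."""
    version = parse_version(version_text)
    if version == AFFECTED:
        return "affected: clean reinstall or update to 576.15"
    if version >= HOTFIX:
        return "hotfix or newer: if hard reboots persist, try 566.36 and check the PSU"
    return "not the affected release"


def installed_driver_version():
    """Ask nvidia-smi for the driver version; return None if the tool is unavailable."""
    try:
        out = subprocess.run(
            ["nvidia-smi", "--query-gpu=driver_version", "--format=csv,noheader"],
            capture_output=True, text=True, check=True,
        ).stdout
    except (OSError, subprocess.CalledProcessError):
        return None
    lines = out.strip().splitlines()
    # With several GPUs there is one line per GPU; the driver is shared, so take the first.
    return lines[0].strip() if lines else None


if __name__ == "__main__":
    found = installed_driver_version()
    print(advise(found) if found else "nvidia-smi not found")
```

The version comparison is kept separate from the `nvidia-smi` call so it can be checked on a machine without an Nvidia GPU.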
Model Quirks: HiDream & Beards
Models like HiDream can show biases similar to Flux, often generating bearded men even when not prompted. A negative prompt such as "beard" might help, but note that this may only work on the full HiDream model, not the development versions. Experimentation is key.
Video Generation Updates & Issues
SkyReels V2 workflows for ComfyUI are available; they require specific supporting models (CLIP Vision, Text Encoders, VAE). Some users experience black or blurry outputs with WAN video models in ComfyUI; check the sampler settings (e.g., Euler, UniPC) and make sure the correct VAE is used.
Links:
- https://openart.ai/workflows/alswa80/skyreelsv2-comfyui/3bu3Uuysa5IdUolqVtLM
- https://huggingface.co/Skywork/SkyReels-V2-I2V-1.3B-540P/tree/main
- https://github.com/kijai/ComfyUI-WanVideoWrapper/
Character Consistency & Detail Loss
Maintaining character consistency across different images and videos remains a challenge, especially with generators such as DALL-E 3 or GPT-4o. In Sora, prompting for closer camera angles can help retain facial details that would otherwise be lost in wide shots. For image-to-video, face details often get lost; try starting with a close-up and zooming out.
Advanced Techniques & Troubleshooting
For ultra-detailed scenes (like 'Where's Waldo'), explore progressive outpainting or tiled diffusion methods. Automate prompt variations using dynamic prompt extensions. Ensure ControlNet models match your base model (e.g., use Illustrious-specific ControlNets with Illustrious models). When using pre-processed ControlNet inputs (like pose skeletons), disable the preprocessor so the input is not processed a second time.
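To illustrate what automated prompt variation means in practice, here is a small standalone Python sketch that expands `{a|b|c}` groups into every combination. The brace-and-pipe syntax is assumed to mirror the variant syntax used by dynamic prompt extensions; the `expand` function is hypothetical, is not the extension's own code, and does not handle nested groups.

```python
import itertools
import re

# Matches one {a|b|c} group with no nested braces inside it.
VARIANT = re.compile(r"\{([^{}]*)\}")


def expand(template):
    """Return every prompt produced by choosing one option from each {a|b} group."""
    parts = VARIANT.split(template)
    # re.split with one capture group alternates: literal, options, literal, ...
    choices = [
        part.split("|") if index % 2 else [part]
        for index, part in enumerate(parts)
    ]
    return ["".join(combo) for combo in itertools.product(*choices)]


if __name__ == "__main__":
    for prompt in expand("portrait of a {knight|wizard}, {day|night} lighting"):
        print(prompt)
```

Each expanded string can then be queued as a separate generation, which makes it easy to compare how a single changed term affects the result.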
Links: