feat(realesrgan): add video upsampler integration #344
Draft
gtong-nv wants to merge 6 commits into
Signed-off-by: Gangzheng Tong <gtong@nvidia.com>
Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com> Signed-off-by: Gangzheng Tong <tonggangzheng@gmail.com>
d832ae7 to b794ddb
Add Real-ESRGAN Upsampler Integration
Summary
Adds a new `flashdreams-realesrgan` workspace integration for Real-ESRGAN image and video upsampling. The integration provides a reusable Python API, an OpenCV-based `realesrgan-upsample` CLI, a FlashDreams video postprocessor config, focused CPU tests, and a checkpoint-backed GPU smoke script.

This also adds a generic `flashdreams.infra.postprocess` video chunk/postprocessor interface so Real-ESRGAN can plug into the same postprocessing shape as the FlashVSR work.

Changes

- `integrations/realesrgan` with RRDBNet/SRVGG architecture definitions compatible with public Real-ESRGAN checkpoints.
- `RealESRGANUpsampler` for frame-local RGB tensor upsampling, OpenCV BGR/BGRA/gray image handling, optional tiling, fp16 CUDA inference, and optional `torch.compile`.
- `realesrgan-upsample` CLI for image/video files with `--compile`, `--compile-mode`, and steady FPS profiling via `--profile-warmup-frames`.
- `RealESRGANPostProcessorConfig` for FlashDreams RGB video chunks.
- `integrations/realesrgan/scripts/gpu_smoke.py` for real checkpoint/CUDA validation.

Notes
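The idea behind steady FPS profiling with `--profile-warmup-frames` can be illustrated with a minimal sketch. This is not the code in this PR: the function name `steady_fps` and its parameters are hypothetical, chosen only for illustration. It assumes the goal is to exclude the first few frames (where compilation and caching costs usually land) from the throughput figure.

```python
import time
from typing import Callable, Iterable, Optional, TypeVar

Frame = TypeVar("Frame")


def steady_fps(
    frames: Iterable[Frame],
    process: Callable[[Frame], object],
    warmup_frames: int = 2,
    clock: Callable[[], float] = time.perf_counter,
) -> Optional[float]:
    """Return frames per second measured after the warm-up frames.

    Hypothetical helper, not part of the PR. Every frame is processed,
    but only frames with index >= warmup_frames count toward the result.
    Returns None when no frames remain after warm-up.
    """
    timed_frames = 0
    timed_seconds = 0.0
    for index, frame in enumerate(frames):
        start = clock()
        process(frame)
        elapsed = clock() - start
        # Skip warm-up frames so one-time startup cost does not skew the rate.
        if index >= warmup_frames:
            timed_frames += 1
            timed_seconds += elapsed
    if timed_frames == 0 or timed_seconds <= 0.0:
        return None
    return timed_frames / timed_seconds


if __name__ == "__main__":
    # Stand-in workload; a real run would call the upsampler on each frame.
    fps = steady_fps(range(10), lambda frame: sum(range(10_000)), warmup_frames=2)
    print(f"steady FPS: {fps}")
```

When the per-frame work runs on CUDA, kernels launch asynchronously, so a measurement like this is only meaningful if `process` waits for the GPU (for example with `torch.cuda.synchronize()`) before returning.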