TY - GEN
T1 - RVS-CUDA
T2 - 2025 Picture Coding Symposium, PCS 2025
AU - Maria, Enzo Di
AU - Pejman, Hossein
AU - Vazquez, Carlos
AU - Coulombe, Steephane
AU - Coti, Camille
N1 - Publisher Copyright:
© 2025 IEEE.
PY - 2025
Y1 - 2025
N2 - Intermediate view synthesis is a fundamental operation in applications that aim to create compelling, interactive visual experiences from captured scenes, such as virtually visiting museums or touristic sites. To create such experiences, the synthesis must be performed in real-time at a high frame rate. Reference View Synthesizer (RVS) is an open-source solution enabling the synthesis of any view located between cameras positioned in an environment through interpolation techniques. However, the speed of RVS tools in the current version 4.0 does not meet the real-time requirement necessary for practical applications. Even the OpenGL-optimized version of RVS, exploiting the processing power of GPUs, is too slow. Additionally, it only supports raw YUV input files, whereas in real-world scenarios, input views are typically provided as compressed video streams. To address these limitations, we introduce a CUDA-based implementation of RVS, referred to as RVS-CUDA, which is optimized for real-world applications by offering significantly improved speed through GPU acceleration and support for compressed input views. Experimental results show that, using four 4K input views (cameras), the proposed RVS-CUDA achieves real-time performance exceeding 30 frames per second (fps) for both raw YUV and compressed input formats (about 1000 × and 14 × faster than the original RVS and OpenGL RVS versions, respectively). Furthermore, the output quality of RVS-CUDA is comparable to that of the initial RVS implementation. The source code of the proposed method is publicly available at: https://github.com/EnzoN7/RVS-CUDA.
AB - Intermediate view synthesis is a fundamental operation in applications that aim to create compelling, interactive visual experiences from captured scenes, such as virtually visiting museums or touristic sites. To create such experiences, the synthesis must be performed in real-time at a high frame rate. Reference View Synthesizer (RVS) is an open-source solution enabling the synthesis of any view located between cameras positioned in an environment through interpolation techniques. However, the speed of RVS tools in the current version 4.0 does not meet the real-time requirement necessary for practical applications. Even the OpenGL-optimized version of RVS, exploiting the processing power of GPUs, is too slow. Additionally, it only supports raw YUV input files, whereas in real-world scenarios, input views are typically provided as compressed video streams. To address these limitations, we introduce a CUDA-based implementation of RVS, referred to as RVS-CUDA, which is optimized for real-world applications by offering significantly improved speed through GPU acceleration and support for compressed input views. Experimental results show that, using four 4K input views (cameras), the proposed RVS-CUDA achieves real-time performance exceeding 30 frames per second (fps) for both raw YUV and compressed input formats (about 1000 × and 14 × faster than the original RVS and OpenGL RVS versions, respectively). Furthermore, the output quality of RVS-CUDA is comparable to that of the initial RVS implementation. The source code of the proposed method is publicly available at: https://github.com/EnzoN7/RVS-CUDA.
KW - CUDA
KW - DIBR
KW - Immersive Video
KW - RVS
KW - Real Time
KW - View Synthesis
UR - https://www.scopus.com/pages/publications/105035835894
U2 - 10.1109/PCS65673.2025.11417654
DO - 10.1109/PCS65673.2025.11417654
M3 - Contribution to conference proceedings
AN - SCOPUS:105035835894
T3 - 2025 Picture Coding Symposium, PCS 2025
BT - 2025 Picture Coding Symposium, PCS 2025
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 8 December 2025 through 11 December 2025
ER -