Ordinality of Visible-Thermal Image Intensities for Intrinsic Image Decomposition

Computer Vision & MultiModal AI
Published: arXiv: 2509.10388v1
Authors

Zeqing Leo Yuan Mani Ramanagopal Aswin C. Sankaranarayanan Srinivasa G. Narasimhan

Abstract

Decomposing an image into its intrinsic photometric factors--shading and reflectance--is a long-standing challenge due to the lack of extensive ground-truth data for real-world scenes. Recent methods rely on synthetic data or sparse annotations for limited indoor and even fewer outdoor scenes. We introduce a novel training-free approach for intrinsic image decomposition using only a pair of visible and thermal images. We leverage the principle that light not reflected from an opaque surface is absorbed and detected as heat by a thermal camera. This allows us to relate the ordinalities between visible and thermal image intensities to the ordinalities of shading and reflectance, which can densely self-supervise an optimizing neural network to recover shading and reflectance. We perform quantitative evaluations with known reflectance and shading under natural and artificial lighting, and qualitative experiments across diverse outdoor scenes. The results demonstrate superior performance over recent learning-based models and point toward a scalable path to curating real-world ordinal supervision, previously infeasible via manual labeling.

Paper Summary

Problem
Intrinsic image decomposition (IID) is a long-standing problem in computer graphics and computer vision. It aims to separate the diffuse albedo and shading from a photograph, which is useful for various applications such as recoloring, relighting, and compositing. However, acquiring ground truth data for real-world scenes remains a major bottleneck, often requiring specialized procedures and equipment.
Key Innovation
This research introduces a novel training-free approach for intrinsic image decomposition using only a pair of visible and thermal images. The authors leverage the principle that light not reflected from an opaque surface is absorbed and detected as heat by a thermal camera, allowing them to relate the ordinalities between visible and thermal image intensities to the ordinalities of shading and reflectance.
Practical Impact
This research has significant practical implications. By using a single thermal image, the authors can regularize the albedo-shading ambiguity, which is a major challenge in IID. This approach can be applied to a wide range of scenarios, including outdoor scenes with strong shading variations or rich albedo textures. The results demonstrate superior performance over recent learning-based models and point toward a scalable path to curating real-world ordinal supervision, previously infeasible via manual labeling.
Analogy / Intuitive Explanation
Think of it like this: Imagine you're in a room with a bright light shining on a black object. The object will appear dark in a visible image, but it will appear bright in a thermal image because the light is absorbed and converted into heat. This is the fundamental principle behind the authors' approach. By leveraging the ordinality of visible and thermal image intensities, they can recover the shading and reflectance components of an image without any training.
Paper Information
Categories:
cs.CV
Published Date:

arXiv ID:

2509.10388v1

Quick Actions