Literature DB >> 35385560

Thermal remote sensing over heterogeneous urban and suburban landscapes using sensor-driven super-resolution.

Hiroki Mizuochi¹, Koki Iwao¹, Satoru Yamamoto¹.

Abstract

Thermal remote sensing is an important tool for monitoring regional climate and environment, including urban heat islands. However, it suffers from a relatively lower spatial resolution compared to optical remote sensing. To improve the spatial resolution, various "data-driven" image processing techniques (pan-sharpening, kernel-driven methods, and machine learning) have been developed in the previous decades. Such empirical super-resolution methods create visually appealing thermal images; however, they may sacrifice radiometric consistency because they are not necessarily sensitive to specific sensor features. In this paper, we evaluated a "sensor-driven" super-resolution approach that explicitly considers the sensor blurring process, to ensure radiometric consistency with the original thermal image during high-resolution thermal image retrieval. The sensor-driven algorithm was applied to a cloud-free Moderate Resolution Imaging Spectroradiometer (MODIS) scene of heterogeneous urban and suburban landscape that included built-up areas, low mountains with a forest, a lake, croplands, and river channels. Validation against the reference high-resolution thermal image obtained by the Advanced Spaceborne Thermal Emission and Reflection Radiometer (ASTER) shows that the sensor-driven algorithm can downscale the MODIS image to 250-m resolution, while maintaining a high statistical consistency with the original MODIS and ASTER images. Part of our algorithm, such as radiometric offset correction based on the Mahalanobis distance, may be integrated with other existing approaches in the future.

Entities: Chemical

Mesh：

Year: 2022 PMID： 35385560 PMCID： PMC8986004 DOI： 10.1371/journal.pone.0266541

Source DB: PubMed Journal: PLoS One ISSN： 1932-6203 Impact factor: 3.240

Introduction

Measurement of terrestrial thermal emissions allows us to estimate the land surface temperature and the emissivity of surface materials. Thermal remote sensing takes advantage of such features to effectively monitor volcanic disasters [1], wildfires [2], crop fields [3], mineral composition [4], regional climate [5] and urban heat islands [6, 7]. In comparison to observation using in situ photographs [8] or unmanned aerial vehicles [9], satellite-based observation has advantages in spatial coverage, frequency, and regularity. One of the major issues in thermal remote sensing is the coarse spatial resolution of the thermal images [4]. In comparison to optical sensors that observe solar reflection from the Earth’s surface, thermal sensors that observe thermal emissions from the surface collect electromagnetic waves with lower signal strength, resulting in lower spatial resolution. For example, the spatial resolution of the optical data provided by the Moderate Resolution Imaging Spectroradiometer (MODIS) is 250 m or 500 m, whereas that of thermal data is 1 km. A similar situation arises for other moderate resolution sensors: the resolution of optical data provided by the Advanced Spaceborne Thermal Emission and Reflection Radiometer (ASTER) is 15 m or 30 m, but the thermal data resolution is 90 m. The other aspect of degraded spatial resolution is the general trade-off between spatial and temporal resolution, and the spatial and spectral resolution in a single sensor. Due to technical limitations (especially in data downlinks), frequent and/or spectrally fine-resolution observation sacrifices spatial details, and vice versa. Missing spatial detail in thermal images is particularly critical when monitoring heterogeneous landscapes, such as urban and suburban areas. To enhance the spatial resolution of thermal images, a wide variety of techniques, called “disaggregation,” “downscaling,” and “super-resolution,” have been developed in recent decades [10]. These can be roughly divided into multi-sensor-based and single-sensor-based approaches. The multi-sensor-based approach, also called spatiotemporal image fusion [11], mainly focuses on mitigating the trade-off between spatial and temporal resolution. In this approach, thermal images with high spatial (but low temporal) resolution are estimated from simultaneously (or quasi-simultaneously) acquired thermal images with low spatial (but high temporal) resolution, based on an empirical relationship between them. Various algorithms, such as the spatial and temporal adaptive reflectance fusion model [12] and similar or improved models (e.g., [13-16]), are used to describe the relationship. These algorithms are powerful tools for environmental monitoring with high spatiotemporal resolution, and are widely applied with match-up pairs between MODIS and ASTER [17], MODIS and Landsat [15], and polar orbiting satellites and geostationary satellites [14]. However, given the nature of spatiotemporal image fusion, the success or failure of this approach depends on the selection of the matched pairs used to describe the relationship. In contrast, the single-sensor-based approach relies on a relationship between the thermal image and images in other spectral domains (usually optical) acquired by the same instrument, to enhance the spatial resolution of thermal images. This approach can be applied to a single sensor that observes thermal and another spectral domain simultaneously from the same platform, even in the absence of a counterpart satellite platform that offers a sufficient chance of simultaneous overpasses of the region of interest, which is rarely realized for satellites with irregular orbits, and for deep space exploration. Pan-sharpening methods via intensity-hue-saturation transformation or principal component analysis have been used traditionally [18, 19], and kernel-driven methods [20-22] and machine-learning approaches (e.g., [23-25]) have become popular recently. These efforts have created visually appealing thermal images that have higher spatial resolution than the original ones. However, such “data-driven” approaches do not necessarily take physical processes into account, including sensor-specific features, and radiometric consistency. In contrast, there are a few “sensor-driven” approaches that explicitly consider sensor features, and target radiometric consistency in the super-resolution results. Hughes and Ramsey [4] introduced a sensor-driven super-resolution approach originally developed by Tonooka [26], which creates both quantitatively accurate and qualitatively acceptable results for their exploration of Mars using the Thermal Emission Imaging System (THEMIS) onboard the Mars Odyssey [27]. This simple approach uses the Mahalanobis distance to estimate each high-resolution pixel value from neighboring, spectrally similar low-resolution pixels. Beneficial characteristics of this approach include consideration of the point spread function (PSF) for the sensor of interest, and radiometric correction weighted by the Mahalanobis distance after the tentative super-resolution retrieval. Such an approach that gives attention to sensor physics also seems to be in line with the recent trend of physically informed machine learning [28], and worth revisiting to achieve single-sensor-based super-resolution rather than using the empirical, data-driven approaches [18-25]. However, sufficient validation and evaluation of the super-resolution results obtained using the sensor-driven algorithm over a heterogeneous Earth surface have not been conducted. In addition, as the original algorithm was proposed more than 10 years ago [26], there seems to be room for refinement. Although it was implemented with ASTER data over urban and suburban areas, quantitative accuracy assessments using independent validation data have not been provided yet. Therefore, this work aims to investigate the potential applicability of the sensor-driven approach over a heterogeneous landscape, and to improve its primitive algorithm. A complex terrain including urban, suburban, forest, lake, and river areas was selected as the study site for this purpose. Similar to previous thermal super-resolution research (e.g., [17, 22]), Terra/MODIS was used as the sensor of interest. The relatively wide swath of MODIS is suited to covering large areas and capturing various land cover types in comparison with other moderate-resolution instruments (e.g., Landsat) that are also often used for super-resolution algorithm development. The other advantage of Terra/MODIS is the existence of a counterpart higher-resolution sensor (ASTER), which can be used for validation data. Because they are onboard the same satellite platform and make simultaneous observations, comparison between them can minimize differences in atmospheric and/or surface conditions [29]. Because both MODIS and ASTER data are freely available, readers can easily reproduce our results. The radiometric calibration uncertainty (sensor requirement) for MODIS thermal bands for surface temperature measurement (i.e., bands 31 and 32) is ± 0.5% in radiance [30]. That for ASTER is ± 1 K or better in brightness temperature, for the range of 270–340 K (i.e., ~ ± 0.3%) [31]. In-flight validation of the thermal bands of MODIS and ASTER has also been reported by Hook et al. [32]. This work provides the first quantitative accuracy assessment of sensor-driven super-resolution with MODIS, using independent validation data (ASTER).

Materials and methods

We first describe the original algorithm developed by Tonooka in 2005 [26] in the “Original algorithm” section, and then describe our proposed refinement in “Proposed refinement” section. Descriptions of the study site and data are provided in the “Study site and data processing” section.

Original algorithm

The original algorithm for the sensor-driven approach was proposed by Tonooka [26]. It is a single-sensor-based approach, and thus makes full use of high-resolution optical information to achieve super-resolution with the low-resolution thermal pixels. It relies on “the empirical fact that, if two nearby surfaces are covered by a similar material under a similar situation, their radiance spectra will be similar in the wide wavelength region” [26]. Therefore, application of the algorithm is not limited to correlation of thermal and optical images. As long as the abovementioned assumption is reasonable, the algorithm is applicable (and was actually applied), even between visible and near infrared bands and shortwave infrared bands. For a general description, let us denote a pixel value of a higher-resolution image in band k (= 1, 2, …, n) as fhigh,k, and that of the counterpart lower-resolution image in band k’ (k’ = 1, 2, …, m) as glow,k’. By an appropriate inter-band coregistration and reasonable sensor design, we assume that one lower-resolution pixel corresponds to an integer number of higher-resolution pixels. The overall super-resolution procedure is as follows: Step 1) Search homogeneous pixels within each lower-resolution scale. Step 2) Degrade fhigh,k images to the same resolution of glow,k’ images considering the PSF (denoted as flow,k hereafter). Step 3) Make a typical spectral pattern (i.e., correspondence between flow,k and glow,k’) by clustering the homogeneous pixels within the entire study region. Step 4) Calculate the Mahalanobis distance (dnei) between fhigh,k at the pixel of interest and flow,k at neighboring homogeneous pixels within a moving window. Step 5) Calculate the Mahalanobis distance (dlib) between fhigh,k at the pixel of interest and the typical spectral pattern extracted in step 3. Step 6) Compare all Mahalanobis distances calculated in steps 4 and 5, and assign glow,k’ at the minimum dnei or dlib as the super-resolved pixel value (ghigh,k’). Step 7) Repeat steps 4–6 for all high-resolution pixels. Step 8) Add an offset so that degraded ghigh,k’ can be consistent with the original glow,k’ for each low-resolution pixel (i.e., perform radiometric correction). The offset value is determined for each high-resolution pixel from the Mahalanobis distance and PSF. Fig 1 summarizes the super-resolution steps in the form of a flowchart. The image pairs for (A) high-resolution bands and (B) low-resolution bands are input into the process. The high-resolution images are degraded to the same resolution as the low-resolution images in step 2. For each high-resolution pixel location, the B data is positioned by referring to the relationship between the A and B spectral information, either at a neighboring homogeneous pixel (step 4) or in a typical spectral pattern (step 5). By repeating this process (step 6) for all high-resolution pixel locations (step 7), images having B-band information but having A-band spatial resolution are created (i.e., super-resolution). The final result is output after post-processing (step 8). A more detailed explanation of each step is provided in the following section.

Fig 1

Flowchart for super-resolution process.

Flowchart for super-resolution process.

The star symbols are where our refinement from the original algorithm [26] was implemented. As an example, the super-resolution process for converting MODIS 500-m resolution images (band 3–7) to 250-m resolution images is shown. Theoretically, this process can be applied to any two datasets that have different spatial resolutions, as long as they have some statistical relationship. In the case of MODIS, there are terrestrial bands with three different spatial resolutions (i.e., 250 m for bands 1 and 2, 500 m for bands 3–7, and 1 km for thermal bands), leading to arbitrariness in combining these bands to obtain super-resolution. In the case of the original algorithm [26], band 3–7 (500-m resolution) were first super-resolved to a resolution of 250 m by referring to the highest-resolution bands (bands 1 and 2), and then the thermal bands (1-km resolution) were super-resolved to a resolution of 250 m by referring to bands 1 and 2, and previously obtained super-resolution bands (3–7). The input-output process for this “two-times super-resolution” method is shown in Fig 2. In the flowchart (Fig 1), original bands 1 and 2 (250-m resolution) correspond to fhigh,k, degraded bands 1 and 2 (500-m resolution) correspond to flow,k, which are indicated by the two red arrows input to the first super-resolution step in Fig 2. The original bands 3–7 (500-m resolution) correspond to glow,k’, which is shown as the blue arrow input to the first super-resolution step. The super-resolved bands 3–7 (250-m resolution) are further input to the second super-resolution step with the original bands 1–2 (fhigh,k in the second super-resolution step), as well as both degraded bands (1-km resolution; flow,k) and the original thermal bands (glow,k’). The final output is the thermal images (bands 31, 32) with a resolution of 250 m.

Fig 2

Original super-resolution algorithm proposed by Tonooka in 2005 [26].

Note that band 5 of Terra/MODIS suffers from stripe noise [33], and we decided not to use it for further processing. For the second super-resolution process, the Mahalanobis distance from the highest-resolution bands and the previously super-resolved bands (bands 3–7 in our case) were calculated separately. The total Mahalanobis distance is evaluated by where d1 is the Mahalanobis distance (either dnei or dlib) for bands 1 and 2 in our case, d2 is that for bands 3–7, n1 (= 2) and n2 (= 4) are the corresponding number of bands, and w is a weighting factor, which was assumed to be 0.7 [26].

Proposed refinement

To make the algorithm more straightforward and to create better radiometrically corrected results, in this paper, we propose two modifications regarding (1) the order of multiple super-resolution retrievals and (2) regularization of the offset adjustment. For each super-resolution process, refinement (1) concerns input-output correspondence and degraded image input, whereas refinement (2) concerns post-processing (both are indicated by a star symbol in the flowchart in Fig 1). For the first modification, the second-highest resolution images are first super-resolved to the highest resolution, which are used in the second super-resolution process in the original algorithm. In this case, the first super-resolution process relies only on the two highest-resolution bands (bands 1 and 2), which is likely to cause substantial uncertainty in the first super-resolution retrieval. The uncertainty probably propagates to the second super-resolution retrieval, making it difficult to perform reliable analysis with the super-resolution results. In addition, regarding this procedure, the original algorithm evaluates the Mahalanobis distance from the highest-resolution bands and super-resolves the second-highest resolution bands separately (Eq 1). This seems to make the algorithm complex and requires the somewhat arbitrary hyperparameter w. To avoid this complexity, we applied the procedure in the inverse direction: first, thermal bands were super-resolved to 500 m with the help of bands 1–7, the result of which was further super-resolved to 250 m with the help of bands 1 and 2 (Fig 3). The MODIS bands 1 and 2 were degraded to 500 m and 1 km, and bands 3–7 were degraded to 1 km in the first super-resolution retrieval. In other words, bands 1–7 (500-m resolution) were fhigh,k, bands 1–7 (1-km resolution) were flow,k, and bands 31, 32 were glow,k’, which were all input to the first super-resolution step. These were used together for calculation of the Mahalanobis distance, and thus Eq 1 and the arbitrary parameter w were no longer needed. The procedure enables the first super-resolution retrieval to make full use of all the optical bands, which may also improve the second super-resolution retrieval and yield a more reliable final result.

Fig 3

Proposed super-resolution algorithm.

For this modification, a detailed description of each step of the algorithm is provided below. Step 1) Within each low-resolution pixel, the standard deviation of fhigh,k is calculated. Homogeneous pixels are flagged when the standard deviation within a low-resolution pixel exceeds the standard deviation over the entire study area for all bands k. In the first super-resolution process, k ranges from band 1 to 7 with 500-m resolution (i.e., n = 7), whereas in the second super-resolution process, k ranges from band 1 to 2 with 250-m resolution (i.e., n = 2). Step 2) Within each low-resolution pixel, fhigh,k is degraded using the PSF as a weighting function to describe signal blurring in low-resolution sensor observation: where i and j denote high-resolution pixel locations within a low-resolution pixel. The mathematical expression for the PSF is provided in the “Study site and data processing” section. Step 3) For all homogeneous pixels, K-means++ clustering is conducted first with flow,k, and then with glow,k’. The number of clusters is set to nine based on the land cover types of the study site (see “study site and data processing” section). The clusters for flow,k and glow,k’ compose hierarchical trees. For each node of the tree, samples of flow,k and glow,k’ are averaged and stored as a database, creating typical spectral patterns over the entire study region. Step 4) Homogeneous pixels are picked up within ±10 low-resolution pixels (i.e., a moving window) from the high-resolution pixel of interest. The Mahalanobis distance is calculated by where fhigh = (fhigh,1, fhigh,2, …, fhigh,n) is a vector with pixel values at the pixel of interest, flow = (flow,1, flow,2, …, flow,n) is a vector of homogeneous pixels, and Vf is a variance-covariance matrix of flow for all the homogeneous pixels of the study site. The homogeneous pixel with minimum dnei (i.e., the spectrum most similar to the pixel of interest) is a candidate for ghigh,k’. Step 5) Similarly, the Mahalanobis distance for the typical spectral pattern is calculated by where flib is a column vector with the average pixel values (band k = 1, 2, …, n) from each cluster. The minimum dlib is also a candidate to estimate ghigh,k’. Step 6) The minimum dnei and the minimum dlib are compared. When dnei ≤ dlib, glow,k’ at the homogeneous pixel with the minimum dnei is placed into the high-resolution pixel as ghigh,k’. When dnei > dlib, the algorithm searches for the spectrum in the g domain at the node where dlib was a minimum: where glow = (glow,1, glow,2, …, glow,m) is a vector with pixel values at the pixel of interest, glib is a column vector with average pixel values from each g cluster at the node with minimum dlib, and Vg is a variance-covariance matrix of glow for all the homogeneous pixels in the study site. The average glow,k’ at the node of minimum dlib,g is placed as ghigh,k’. Step 7) Steps 4–6 are repeated for all high-resolution pixels to create a high-resolution g image with ghigh,k’. At the same time, the Mahalanobis distance corresponding to the adopted ghigh,k’ is stored for each pixel as a “distance map.” Step 8) The retrieved ghigh,k’ should be radiometrically consistent with glow,k’ when degraded again within a low-resolution pixel. To this end, an offset value is added to ghigh,k’. Instead of adding an offset uniformly over the low-resolution pixels, full use is made of the Mahalanobis distance, to allow additional offset corrections to be made for less reliable pixels of ghigh,k’ (i.e., pixels with less spectral similarity). The offset to meet this concept is where d is the Mahalanobis distance from the distance map, g’high,k’ is the corrected result, and αk′ is a modification scale defined by Our second modification of the original algorithm regards the offset correction. The abovementioned offset correction with consideration of the Mahalanobis distance as a weighting function is theoretically reasonable; however, a very large Mahalanobis distance among a few pixels may result in overcorrection and implausible pixel values. To mitigate overcorrection while still employing the concept of weighting by the Mahalanobis distance, we introduced a regularization term into the distance map: where dnorm is a normalized distance that makes the summation over the entire study region equal to 1, dreg is the regularized distance, and λ is a tunable positive real number applied over the entire study region. A large λ makes the correction uniform within a low-resolution pixel, whereas a small λ makes it diverse (λ→0 is equivalent to the original algorithm). We compared the results from (1) the original algorithm, (2) the inverse-direction super-resolution algorithm without distance regularization (i.e., the first modification), and (3) the inverse-direction super-resolution algorithm with distance regularization (i.e., the first and the second modification). For simplicity, hereafter we call them Algorithm 1, Algorithm 2, and Algorithm 3, respectively. This comparison will clarify how our algorithm refinement improves the super-resolution results. To summarize, the benefit of the sensor-driven algorithm over other existing approaches is explicit consideration of the PSF, and radiometric correction weighted by the Mahalanobis distance. The sensor-driven algorithm (with our improvement) may be useful for thermal super-resolution research in the context of physical consistency.

Study site and data processing

The study site is centered around Tsukuba City, Ibaraki, Japan (36.049N-36.459N, 139.856E-140.353E; Fig 4). The region includes urban and suburban areas of Tsukuba and several neighboring cities; Mount Tsukuba, which is covered by a mixed needleleaf and broadleaf forest; and a part of Lake Kasumigaura, the second-largest inland waterbody in Japan. Rice paddy fields and croplands are distributed along several narrow river channels. According to the land cover map provided by the Japan Aerospace Exploration Agency (JAXA) [34], there are also a few grassland areas. The spatial resolution of the land cover map is 250 m. The overall accuracy and kappa coefficient have been reported as 78.0% and 0.745, respectively [34].

Fig 4

Reference satellite data and land cover map for the study site.

Reference satellite data and land cover map for the study site.

(Left) False color image taken by ASTER (2001/09/24), (center) that taken by MODIS, and (right) JAXA land cover map degraded to 250-m resolution. All images have a UTM 54 projection with a WGS84 ellipsoid. Land cover category abbreviations: DBF, deciduous broadleaf forest; DNF, deciduous needleleaf forest; EBF, evergreen broadleaf forest; ENF, evergreen needleleaf forest. We searched for a cloud-free scene acquired by MODIS and ASTER simultaneously, and the scene on 24 September 2001 was selected for use. Level 1B calibrated radiances (MOD02QKM for bands 1 and 2, MOD02HKM for bands 3–7, and MOD021KM for bands 31 and 32) were downloaded via the Level-1 and Atmosphere Archive and Distribution System from the Land Processes Distributed Active Archive Center website [35]. To treat images with equally spaced meter scales, all images were projected onto the UTM 54 projection with a WGS84 ellipsoid by nearest neighbor resampling. For simplicity, super-resolution processing was conducted with images in the radiance scale (W/m2/str/μm), including thermal bands. If necessary, thermal radiance can be translated into brightness temperature Tb (K) by Planck’s law: where h = 6.626 × 10−34 J s is the Planck constant, c = 2.988 × 108 m/s is the speed of light, k = 1.380×10−23 J/K is the Boltzmann constant, l is the wavelength (m), and B is the radiance (W/m2/str/m) at the wavelength. For reference, ASTER Level-3A radiance data on the same day were downloaded via the METI AIST satellite Data Archive System website [36]. The data were also projected on the UTM 54 projection with a WGS84 ellipsoid. The band correspondence between ASTER and MODIS is summarized in Table 1.

Table 1

Characteristics of MODIS bands and correspondence with reference ASTER bands.

MODIS band	Description	MODIS wavelength (nm)	MODIS original spatial resolution (m)	ASTER band	ASTER wavelength (nm)	ASTER original spatial resolution (m)
1	Red	620–670	250	2	630–690	15
2	Near infrared	841–876	250	3	760–860	15
3	Blue	459–479	500	1	520–600	15
4	Green	545–565	500	1	520–600	15
6	Short-wave infrared	1628–1652	500	4	1600–1700	30
7	Short-wave Infrared	2105–2155	500	5	2145–2185	30
31	Thermal	10,780–11,280	1000	14	10,950–11,650	90
32	Thermal	11,770–12,270	1000	14	10,950–11,650	90

In the research performed by Tonooka [26], image coregistration between bands was not implemented because the author assumed that the accuracy of the inter-telescope registration of the data used (ASTER) was sufficient for the algorithm. However, data-driven coregistration is desirable before integrating multiple images (e.g., [12]). Therefore, we implemented image coregistration using the phase-only correlation (POC) approach [37]. Specifically, reference images (i.e., ASTER) were coregistered via POC between bands first. Then each MODIS band was coregistered by comparing it with the corresponding ASTER band (Table 1) via POC. This ensured the elimination of uncertainty caused by inconsistent MODIS inter-band registration during the super-resolution process, and inter-sensor registration between MODIS and ASTER during validation. The MODIS PSF to simulate spatial degradation of the higher-resolution signal can be modeled by the convolution of a triangular function along the across-track direction and a Gaussian function [38, 39]. The former represents the detector response [40], and the latter represents optical blurring [38]. The PSF was considered to be a weighting function of spatial degradation within a square low-resolution pixel, which includes ν × ν high-resolution pixels (ν is the number of pixels along a column or row). The triangular function can be expressed as follows, by considering geometric transformation of the coordinates within a low-resolution pixel: where i and j are the high-resolution pixel coordinates in a low-resolution pixel; u(i,j) and v(i,j) are those for the cross-track and along-track directions, taking the center of the image as the origin; and a is the inclination of the along-track direction measured on the i-j coordinates, which was set to 5.357 by checking geolocation information in the MODIS data (MOD03 [35]). The Gaussian function was where σ determines the standard deviation of the Gaussian function, which was set to 0.2 by referring to [38, 39]. Then, the total PSF was Examples of each PSF are shown in Fig 5.

Fig 5

Simulated Point Spread Function (PSF) for 100×100 pixels (i.e., ν = 100).

Simulated Point Spread Function (PSF) for 100×100 pixels (i.e., ν = 100).

(left) Triangular weighting function for sensor PSF, (center) Gaussian weighting function for optical PSF, and (right) combined PSF for a Moderate Resolution Imaging Spectroradiometer (MODIS) observation. Via the super-resolution algorithm, 250-m MODIS thermal images (bands 31 and 32) were retrieved, which were validated by the corresponding ASTER band 14. To this end, the ASTER image was degraded to 250-m resolution using the MODIS PSF. The correlation coefficient (CC) and root mean squared error (RMSE) between the MODIS and ASTER images were calculated for the three types of algorithms (the original algorithm, and the proposed algorithm with and without distance regularization) to investigate the effect of our refinement. The Relative Dimensionless Global Error (ERGAS) index and peak signal-to-noise ratio (PSNR) [41] were also checked to analyze the accuracy of spectral and spatial reconstruction, respectively. Since the quantization of the thermal reference data (ASTER) is a 12-bit process [42], the maximum value is 4095 (equivalent to a radiance of 21.39 W/m2/str/m), which was used for calculating the PSNR. Spatial patterns (i.e., images) and the basic statistics for the radiance were also checked among the reference, retrieved, and original data.

Results

The regularization parameter λ was determined as 0.002 based on tuning repeated twice, to give the first super-resolution process the best performance (i.e., the least RMSE and the best CC; Fig 6). On this basis, the validation with ASTER images for Algorithms 1, 2, and 3 is shown in Table 2. Because the relative spectral responses are different even between the corresponding bands in MODIS and ASTER images, it is natural that there is a systematic bias in radiance. Apart from the inevitable bias, the best performance is achieved with our proposed Algorithm 3, which produced the highest CC and PSNR, and the lowest RMSE and ERGAS.

Fig 6

Tuning of the regularization parameter λ in Algorithm 3.

(left column) Root mean squared error (RMSE) and correlation coefficient (CC) for a wide range (from 0 to 104) and (right column) RMSE and CC for a narrow range (from 10−4 to 2.0×10−2). Both tunings were performed with the first super-resolution image (i.e., retrieval of 500-m thermal images), and the common λ was used for the second super-resolution process. Each x-axis is log-scale, whereas each y-axis is scaled by the RMSE or CC for λ = 0. The dashed line marks λ = 0.002.

Table 2

Correlation Coefficient (CC), Root Mean Squared Error (RMSE), and Peak Signal-to-Noise Ratio (PSNR) for each band [31, 32], and Relative Dimensionless Global Error (ERGAS) between the results from the three types of super-resolution algorithms and ASTER radiance.

CC and PSNR: larger is better; RMSE and ERGAS: smaller is better.

Accuracy criteria	Algorithm 1	Algorithm 2	Algorithm 3
CC for band 31	0.366	0.533	0.644
RMSE / relative RMSE (%) for band 31	1.545 / 16.79%	1.468 / 15.95%	1.447 / 15.72%
PSNR for band 31 (dB)	22.77	23.28	23.40
CC for band 32	0.351	0.514	0.626
RMSE / relative RMSE (%) for band 32	0.882 / 9.584%	0.772 / 8.384%	0.739 / 8.030%
PSNR for band 32 (dB)	27.57	28.86	29.24
ERGAS	3.418	3.186	3.121

Tuning of the regularization parameter λ in Algorithm 3.

Correlation Coefficient (CC), Root Mean Squared Error (RMSE), and Peak Signal-to-Noise Ratio (PSNR) for each band [31, 32], and Relative Dimensionless Global Error (ERGAS) between the results from the three types of super-resolution algorithms and ASTER radiance.

CC and PSNR: larger is better; RMSE and ERGAS: smaller is better. More importantly, our proposed Algorithm 3 shows the best statistical consistency with the original MODIS thermal data, and as a result, also with the ASTER data (as clearly seen in Table 3). The original Algorithm 1 creates both physically impossible negative radiance and implausibly high radiance. Only the average values were acceptable because of the offset correction. Our proposed algorithm without regularization (Algorithm 2) shows better results than Algorithm 1, without any physically impossible values. However, with the appropriate regularization (Algorithm 3), the statistical consistency with the original MODIS and ASTER images increased further, not only for the average values, but also for the minimum and maximum values. Interestingly, the standard deviation of the retrieved result with Algorithm 3 is more consistent with that of the reference data (ASTER) than that of the original MODIS data.

Table 3

Basic statistics over the entire study region between the results from the three super-resolution algorithms, original MODIS image (1-km resolution), and ASTER image.

Radiance statistics (W/m²/str/μm)	Algorithm 1	Algorithm 2	Algorithm 3	Original MODIS	ASTER (band 14)
Band 31 minimum	-29.83	4.281	7.959	9.691	8.084
Band 32 minimum	-27.98	4.685	7.694	9.100	8.084
Band 31 maximum	27.44	15.04	11.88	11.23	11.14
Band 32 maximum	24.83	14.06	10.98	10.41	11.14
Band 31 average	10.61	10.62	10.62	10.62	9.202
Band 32 average	9.876	9.881	9.881	9.881	9.202
Band 31 standard deviation	0.674	0.450	0.371	0.290	0.352
Band 32 standard deviation	0.588	0.391	0.320	0.248	0.352

Fig 7 shows the stepwise enhancement of the spatial resolution in MODIS thermal images by Algorithm 3. Although the image contains some noisy patterns that are probably errors from our algorithm, textual details are certainly retrieved, especially around the boundaries of major land features such as Lake Kasumigaura and Mount Tsukuba. Compared with the land cover types (Fig 4), urban and built-up regions tend to show a brightness temperature higher than that of neighboring areas. Low brightness temperatures over water surfaces and forest areas are reasonable given the abundant evapotranspiration and aerodynamic features. Focusing on the forest area, the northeast part is hotter than the other area, which is probably due to the difference in altitude.

Fig 7

Improvement in spatial resolution of MODIS thermal infrared bands by the proposed super-resolution Algorithm 3.

Improvement in spatial resolution of MODIS thermal infrared bands by the proposed super-resolution Algorithm 3.

(left column) Original 1000-m resolution images, (center column) result from the first super-resolution process, and (right column) result from the second super-resolution process for (top row) MODIS band 31 and (bottom row) MODIS band 32. The radiance value was converted into brightness temperature by Eq 10. An almost similar spatial pattern can even be retrieved using Algorithms 1 and 2 (Fig 8). However, careful comparison shows that Algorithm 2 tends to generate slightly more noisy patterns and lower contrast than Algorithm 3, and that Algorithm 1 generates a few pixels having a negative brightness temperature (shown by small red points), which also confirms the results in Table 3.

Fig 8

Comparison of maps obtained by the three algorithms.

The upper panel containing 8 images displays MODIS band 31 results, and the lower one displays MODIS band 32 results. For each panel, from the far left column: reference ASTER images, MODIS retrieved by Algorithm 2, Algorithm 3, and Algorithm 1, respectively. The top row shows the 500-m resolution retrieval (first super-resolution images for Algorithms 2 and 3), and the bottom row shows the 250-m resolution retrieval. Algorithm 1 (original algorithm) cannot retrieve a 500-m resolution map. The red pixels (encircled by red circles for visibility) in the maps retrieved by Algorithm 1 indicate negative brightness temperatures.

Comparison of maps obtained by the three algorithms.

Discussion

We revisited the sensor-driven approach for thermal image super-resolution and investigated its applicability to a complex landscape with urban and suburban regions. The sensor-driven algorithm [26] with our modification refined the statistical consistency of the retrieved MODIS images (250-m resolution) with the original MODIS images and with the reference ASTER images (Tables 2 and 3). Refinement of the algorithm structure (from Algorithm 1 to 2) improved the accuracy of the super-resolution process (Table 2): in Algorithm 1, the first super-resolution process relies only on bands 1 and 2, which is likely to cause substantial uncertainty in the first super-resolution retrieval. The uncertainty probably propagates to the second super-resolution retrieval, resulting in enhanced uncertainty in the whole super-resolution process. In addition, the somewhat arbitrary hyperparameter w is likely to make the original algorithm too complex to obtain best-tuned results. Algorithm 2 was likely to address such issues, and was further improved by the introduction of a regularization term for the Mahalanobis distance (i.e., Algorithm 3). The standard deviation in the retrieved MODIS images using our algorithm (Algorithm 3) is more consistent with the ASTER images than with the original MODIS image with 1-km resolution. This suggests that contrasting features (i.e., spatial details) are captured by the super-resolution process, which are missed in the original MODIS image having a coarser resolution. Retrieved thermal images well captured specific features of different land covers. In particular, urban areas (Fig 4) tend to show high brightness temperatures, suggesting an urban heat island phenomenon [43]. Statistics for each type of land cover showed that the mean thermal radiance (or brightness temperature) in urban regions is the highest among the categories (Table 4), quantitatively confirming the urban heat island phenomenon. The thermal radiance and super-resolution accuracy were similar between paddy and crop categories distributed around suburban regions. This is understandable because water in rice paddy fields should be drained in preparation for harvest in this season (September), creating similar thermal properties to crop fields before and after harvest. For further discussion of the thermal structure in an urban and suburban area, the land surface temperature [44] rather than the radiance or brightness temperature may be more suitable, although it requires additional work to estimate the thermal emissivity precisely over heterogeneous artificial materials [45, 46]. The highest accuracy (PSNR and ERGAS) was observed for the urban category, suggesting that this algorithm is suitable for obtaining super-resolution in urban landscapes. Poor accuracy was obtained in the water and forest categories. For the water category, the weak correspondence between the thermal properties and optical spectra probably makes it difficult to conduct an accurate super-resolution process. The reason for the poor accuracy for the forest categories can be largely attributed to the temperature differences at different altitudes around Mount Tsukuba. Adding altitude data (i.e., digital elevation model) when calculating the Mahalanobis distance and clustering homogeneous pixels may improve the results for such mountainous regions.

Table 4

Statistics for super-resolved thermal data and accuracy indices for Algorithm 3 for each land-cover type.

Categories	B31 radiance / T_b	B32 radiance / T_b	B31 PSNR	B32 PSNR	ERGAS	N
All	10.62±0.37 / 306.9±2.5	9.881±0.320 / 307.1±2.5	23.40	29.23	3.121	32400
Water	10.16±0.41 / 303.8±2.8	9.453±0.373 / 303.7±3.0	23.03	28.28	3.501	552
Urban	10.91±0.26 / 308.9±1.8	10.13±0.23 / 309.0±1.8	24.04	31.01	2.712	2285
Paddy	10.74±0.29 / 307.7±1.9	9.985±0.253 / 307.9±2.0	23.70	30.11	2.921	9027
Crop	10.73±0.26 / 307.7±1.7	9.978±0.227 / 307.9±1.8	23.55	29.83	2.990	9854
Grassland	10.58±0.34 / 306.6±2.3	9.847±0.298 / 306.9±2.3	23.98	30.13	2.872	589
Forest	10.35±0.38 / 305.1±2.6	9.662±0.324 / 305.4±2.6	22.86	27.88	3.527	9914
Barren	10.51±0.44 / 306.2±3.1	9.803±0.381 / 306.5±3.0	23.58	28.84	3.105	179

Statistics for super-resolved thermal data and accuracy indices for Algorithm 3 for each land-cover type.

The land-cover type was determined using previously reported data [34], while unclassified and snow/ice categories were excluded. Note that the four forest categories (see Fig 4) were integrated into the one category. Radiance (W/m2/str/μm) with standard deviation, Tb (brightness temperature; K) with standard deviation, PSNR, ERGAS, and N (number of sample pixels) are listed. Improvement in the accuracy indices by our algorithm refinement was statistically confirmed for all land cover categories (Table 5). The Wilcoxon signed-rank test over the categories (n = 7) indicated that Algorithm 3 showed a smaller ERGAS and a greater PSNR than Algorithm 2 with statistical significance (p < 0.01), and Algorithm 2 showed a smaller ERGAS and greater PSNR than Algorithm 1 with statistical significance (p < 0.01). Therefore, for all categories, Algorithm 2 was better than Algorithm 1, and Algorithm 3 was better than both Algorithms 1 and 2.

Table 5

Comparison of accuracy indices (ERGAS: Smaller is better; PSNR: Greater is better) for different algorithms and land-cover types.

Accuracy indices	ERGAS			PSNR31			PSNR32
Algorithm	1	2	3	1	2	3	1	2	3
Water	3.831	3.571	3.501	22.51	22.93	23.03	26.71	27.88	28.28
Urban	2.930	2.788	2.712	23.56	23.87	24.04	29.47	30.44	31.01
Paddy	3.137	2.986	2.921	23.27	23.57	23.70	28.74	29.66	30.11
Crop	3.094	3.040	2.990	23.34	23.45	23.55	29.18	29.52	29.83
Grassland	3.195	3.042	2.872	23.29	23.62	23.98	28.34	29.10	30.13
Forest	4.055	3.599	3.527	21.94	22.73	22.86	25.86	27.56	27.88
Barren	4.632	3.186	3.105	20.80	23.49	23.58	23.61	28.19	28.84

Artificial coloring and metal composition of the surface would also influence the estimation of radiance or brightness temperature using our super-resolution algorithm, especially when creating the typical spectral pattern, which relies on the spectral link between the optical and thermal domains. In practice, the impact of the uncertainty in the typical spectral pattern on the super-resolution is limited because most of the thermal radiance ghigh,k’ is retrieved from neighboring homogeneous pixels, rather than the typical spectral pattern extracted by clustering (Fig 9G and 9H). Therefore, as long as a spectrally similar target with 500-m spatial homogeneity in the second super-resolution image can be obtained around the pixel of interest, the resulting image is likely to be reliable. This condition may sometimes be too strict for heterogeneous landscapes including urban and suburban areas, and thus retrieval has uncertainty in such regions. In fact, the Mahalanobis distance for the retrieval is small (i.e., high reliability in the retrieval) over the relatively homogeneous forest region and water body apart from the lake shore (Fig 9D and 9F), but not in the urban and suburban areas, and the boundary the land covers (e.g., the shore of Kasumigaura Lake) with less homogeneity (Fig 9A–9C). However, even in such cases, the offset correction with regularization at least ensures statistical consistency in the final retrieved value g’high,k’. Super-resolution with higher-resolution data (e.g., ASTER) may further mitigate uncertainty arising from such spatial heterogeneity.

Fig 9

Maps describing characteristics of super-resolution retrieval.

Maps describing characteristics of super-resolution retrieval.

From the top row, pixel homogeneity, Mahalanobis distance, and data sources (the typical spectral pattern or neighboring pixel used in the retrieval process) are displayed. The left and center columns show the maps for the first and the second super-resolution retrievals by our proposed Algorithm 3, respectively. The right column shows retrieval by the original Algorithm 1. There is still room to improve our algorithm regarding the visualizability of the retrieved image. Textural details, such as narrow river channels and mixed landscapes of crop, forest, urban, and suburban areas (Fig 4) are hidden behind the noisy patterns generated by the algorithm (Fig 7). Due to the inherent feature of the twofold super-resolution retrieval, the noise generated in the first super-resolution image should inevitably affect the second super-resolution image. In fact, a distributed spatial pattern of the large Mahalanobis distance (i.e., less reliable retrieval) is observed in the distance map in the second super-resolution image (Fig 9E), which can be attributed to the noise generated by the first super-resolution retrieval. Data-driven approaches, including traditional pan-sharpening [18, 19], kernel-driven methods [20-22], and machine learning [23-25], may have an advantage from the viewpoint of visualizability. Therefore, comparison of the sensor-driven approach with such data-driven approaches and/or their integrated use will be important future work. Especially, the offset correction, which is an important part of our algorithm, may be added to other data-driven approaches to improve the statistical consistency, while keeping each algorithm straightforward.

Conclusion

To improve the spatial resolution of thermal satellite images, we revisited a sensor-driven super-resolution algorithm and investigated its applicability to a complex landscape with urban and suburban regions. The algorithm explicitly considers the sensor blurring effect using a point spread function, and ensures radiometric consistency with the original thermal image during high-resolution thermal image retrieval, both of which are not generally taken into consideration in existing approaches such as machine learning and kernel-driven methods. We also introduced modification to the original sensor-driven algorithm to enhance the statistical consistency of the super-resolution results, including making the algorithm structure more straightforward, and introducing regularization term when calculating the Mahalanobis distance. The original sensor-driven algorithm (Algorithm 1) and two refined versions (Algorithms 2 and 3) were applied to a cloud-free MODIS scene to enhance the thermal (1 km) resolution to the optical (250 m) resolution, and were validated against the corresponding high-resolution thermal image (ASTER). The validation result showed that the refined sensor-driven algorithm can downscale the MODIS image to 250-m resolution, while maintaining a high statistical consistency with the original MODIS and ASTER images. Part of our algorithm, such as radiometric offset correction based on the Mahalanobis distance, may be integrated with other existing approaches in the future. 26 Jan 2022 PONE-D-21-32571 Thermal remote sensing over heterogeneous urban and suburban landscapes using sensor-driven super-resolution PLOS ONE Dear Hiroki, Thank you for submitting your manuscript titled, ‘Thermal remote sensing over heterogeneous urban and suburban landscapes using sensor-driven super-resolution (Ref. No.: PONE-D-21-32571) to PLOS ONE. The manuscript has been reviewed. After careful consideration, we find that the manuscript has some merit with some interesting results, but it needs a major revision to address reviewers’ comments and fully meet PLOS ONE publication criteria. You can find reviewers’ comments at the bottom of this letter. We invite you to submit a revised version of the manuscript. The changes required in the manuscript are very significant and require you to respond fully. We will send your revised manuscript for further external review. Therefore, we strongly recommend addressing concerns raised in full. ============================== Be sure to address: 1. Restructure your introduction, particularly organizing the problem statement 2. Quality and extent of results. 3. Revise the Discussion section for clarity. ============================== Please submit your revised manuscript by Feb 25, 2022. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file. Please include the following items when submitting your revised manuscript:

A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'. A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'. An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'. If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter. If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols. We look forward to receiving your revised manuscript. Kind regards, Dr. Kunwar K. Singh Academic Editor PLOS ONE Journal Requirements: When submitting your revision, we need you to address these additional requirements. 1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf 2. We note that you have stated that you will provide repository information for your data at acceptance. Should your manuscript be accepted for publication, we will hold it until you provide the relevant accession numbers or DOIs necessary to access your data. If you wish to make changes to your Data Availability statement, please describe these changes in your cover letter and we will update your Data Availability statement to reflect the information you provide. [Note: HTML markup is below. Please do not edit.] Reviewers' comments: Reviewer's Responses to Questions Comments to the Author 1. Is the manuscript technically sound, and do the data support the conclusions? The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented. Reviewer #1: Partly Reviewer #2: Yes ********** 2. Has the statistical analysis been performed appropriately and rigorously? Reviewer #1: No Reviewer #2: I Don't Know ********** 3. Have the authors made all data underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified. Reviewer #1: Yes Reviewer #2: Yes ********** 4. Is the manuscript presented in an intelligible fashion and written in standard English? PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here. Reviewer #1: Yes Reviewer #2: Yes ********** 5. Review Comments to the Author Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters) Reviewer #1: Dear authors: I have found your research article quite interesting. I have carefully read your proposal. I have some questions and explanations I would like you to clarify for me. Your research tries to recover and deepen a method to improve the resolution of thermal images from a sensor point of view (sensor-driven), as opposed to other approaches (data-driven). The preceding methodology and your proposals for improvement are well defined. Except in the figures shown: The order of the defined steps is not clear. In figure 1, there is a red arrow linking both steps of superresolution. Please clarify its meaning. In figure 2, MODIS B1,2(1Km) and (500 m) refer to one super-resolution step when they are images with different resolutions. The reader may be confused about the use of both images with different resolutions at the same super-resolution step. In line 103, (step 3) it is not clear what the meaning of "instant spectral library" is. I would thank the authors for clarifying this concept. In line 145, the authors refer to a super-resolution step using different bands simultaneously. The authors do not make clear how these many bands are handled. Clarification about how the bands fuse with the thermal image will be useful. My major concern in this work, in the Results section, is about the statistics the authors use to determine the quality of the super-resolution transformation. RMSE is not a good index of image quality. It has been proved that very different images can have the same RMSE value. The same is true for the correlation coefficient (CC). There are other more appropriate image quality indices such as the ERGAS or the spectral mapping angle (SAM) for spectral reconstruction. If you want to analyze the spatial reconstruction, Peak Signal to Noise Ratio (PSNR) will measure the quality of this process. Please refer to those indices or justify the use of RMSE and CC properly. I will be grateful if the authors could clarify these comments. I encourage them because I consider their work an important contribution to thermal image resolution enhancement literature. Reviewer #2: The article presents a review of a methodology to obtain surface temperature values based on spatially resampled MODIS sensor data. In its presentation, the main objective of the article is to demonstrate the application of the method for urban and suburban areas, however the results and discussions presented do not respond to this expectation, which is the main point of the improvements that the article needs. Below are some excerpts that indicate adjustments and additions to improve the understanding of the article. In line 44, I suggest improving the presentation of the characteristics of the thermal images, perhaps presenting a table or in the text, detailing the resolutions of each sensor In line 75, the authors seek to show the potential application of the algorithm, however the work does not show other results obtained with the use of the first algorithm that justify changes in it. I suggest that they present other works that analyzed the application of the 1 algorithm or made comparisons with other sensors In line 87, In the final paragraph of the Introduction, I suggest adding the objectives and results expected by the work. In line 103, in which he presents the routine for image treatment, I suggest improving the flowchart presented, using a number with stage indications, as this way the reader understands which steps the objective of the work changes and facilitates the replication of the work In line 135, I suggest rescuing the reason for the proposed algorithm modification In line 144, Here again rescue the routine flowchart of the algorithm indicating where changes occurred and to present a new proposal In line 327, I recommend that the authors seek to contextualize the discussion of the results obtained by showing the temperature results according to the land uses present in the analyzed section. It is important and necessary to confirm statistically whether there were differences or changes in temperatures by observing the behavior of surface temperature in vegetated environments and urban spaces ********** 6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #1: No Reviewer #2: No [NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.] While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step. 16 Feb 2022 Reviewer #1: Dear authors: I have found your research article quite interesting. I have carefully read your proposal. I have some questions and explanations I would like you to clarify for me. Your research tries to recover and deepen a method to improve the resolution of thermal images from a sensor point of view (sensor-driven), as opposed to other approaches (data-driven). The preceding methodology and your proposals for improvement are well defined. Except in the figures shown: The order of the defined steps is not clear. In figure 1, there is a red arrow linking both steps of superresolution. Please clarify its meaning. In figure 2, MODIS B1,2(1Km) and (500 m) refer to one super-resolution step when they are images with different resolutions. The reader may be confused about the use of both images with different resolutions at the same super-resolution step. Thank you for your encouraging and constructive comments. Based on this and the comment from reviewer 2, we have added a new figure (Fig. 1) and explanation (Lines 124-130) to clarify the flowchart and corresponding steps. We also added a step-by-step explanation in Fig. 2 (Lines 131-149) and Fig. 3 (Line 178). For each super-resolution process, high-resolution images (fhigh,k), degraded high-resolution images (flow,k), and low-resolution images (glow,k’) are needed, and then super-resolution images (ghigh,k’) are output. For example, MODIS B1,2 (1 km) is flow,k, and B1,2 (500 m) is fhigh,k in Fig. 3 (original Fig. 2). In line 103, (step 3) it is not clear what the meaning of "instant spectral library" is. I would thank the authors for clarifying this concept. We decided not to use this ambiguous term, but explicitly explain the concept (Line 112) as follows: “Step 3) Make a typical spectral pattern (i.e., correspondence between flow,k and glow,k’) by clustering the homogeneous pixels within the entire study region.” In line 145, the authors refer to a super-resolution step using different bands simultaneously. The authors do not make clear how these many bands are handled. Clarification about how the bands fuse with the thermal image will be useful. Please see the abovementioned comments regarding the original Figs. 1 and 2, and the new Fig. 1. The super-resolution algorithm searches for the correspondence between flow,k and glow,k’ from a neighboring pixel or the typical spectral pattern, and simultaneously positions ghigh,k’ for all bands k’. My major concern in this work, in the Results section, is about the statistics the authors use to determine the quality of the super-resolution transformation. RMSE is not a good index of image quality. It has been proved that very different images can have the same RMSE value. The same is true for the correlation coefficient (CC). There are other more appropriate image quality indices such as the ERGAS or the spectral mapping angle (SAM) for spectral reconstruction. If you want to analyze the spatial reconstruction, Peak Signal to Noise Ratio (PSNR) will measure the quality of this process. Please refer to those indices or justify the use of RMSE and CC properly. Based on this suggestion, we have added ERGAS and PSNR for accuracy assessment (Line 304; Table 2) and added new references [41, 42]. These indices also indicate a performance improvement with our algorithm, as was the case for the RMSE and CC. We did not use the SAM because we used the same ASTER reference data (band 14) for both MODIS bands (bands 31 and 32), and calculation of spectral similarity is not applicable in this case. Reviewer #2: The article presents a review of a methodology to obtain surface temperature values based on spatially resampled MODIS sensor data. In its presentation, the main objective of the article is to demonstrate the application of the method for urban and suburban areas, however the results and discussions presented do not respond to this expectation, which is the main point of the improvements that the article needs. Thank you for reviewing our manuscript. To demonstrate the results from the viewpoint of urban and suburban landscapes, we have added a new table to show the statistics related to the results (super-resolution thermal values and accuracy indices) for each land-use and land-cover type, including urban and six other categories (Table 4). This analysis, for example, quantitatively confirmed the heat island phenomenon in the urban region (Line 378), and revealed that the best super-resolution performance was observed in the urban region (Line 385). We have also added further findings of this analysis in the Discussion section (Line 378-397). In line 44, I suggest improving the presentation of the characteristics of the thermal images, perhaps presenting a table or in the text, detailing the resolutions of each sensor We have added a description of the degradation of the spatial resolution of thermal data with examples from MODIS and ASTER, in the Introduction (Line 36-40). In line 75, the authors seek to show the potential application of the algorithm, however the work does not show other results obtained with the use of the first algorithm that justify changes in it. I suggest that they present other works that analyzed the application of the 1 algorithm or made comparisons with other sensors Indeed, the applications on Mars [4] using THEMIS [27] (Line 69) and in urban and suburban areas [26] using ASTER (Line 77) were only the existing reports that we knew of, and neither provided sufficient quantitative accuracy assessment by comparison with independent validation data (Line 77). Therefore, this paper represents the first quantitative report of the performance of algorithm 1. We consider that this is a point of novelty of our research, and added an appropriate statement in the Introduction (Line 92). In line 87, In the final paragraph of the Introduction, I suggest adding the objectives and results expected by the work. We had already stated the research objective in the first sentence of the final paragraph (Line 80): “this work aims to investigate the potential applicability of the sensor-driven approach over a heterogeneous landscape, and to improve its primitive algorithm”. The expected result has been newly added (Line 92) based on this comment, relating to the abovementioned response to the original line 75. In line 103, in which he presents the routine for image treatment, I suggest improving the flowchart presented, using a number with stage indications, as this way the reader understands which steps the objective of the work changes and facilitates the replication of the work Based on this and the comment from reviewer 1, we have added a new figure to clarify the super-resolution process (Fig. 1), indicating the improved points (star symbols). A brief explanation of the overall process has also been added (Lines 124-130). In line 135, I suggest rescuing the reason for the proposed algorithm modification We have briefly stated the purpose of the refinement (Line 164): “To make the algorithm more straightforward and to create better radiometrically corrected results,” In line 144, Here again rescue the routine flowchart of the algorithm indicating where changes occurred and to present a new proposal We have added a description of how the refinements corresponded to each step (Line 166) by revisiting the flowchart as follows: “For each super-resolution process, refinement (1) concerns input-output correspondence and degraded image input, whereas refinement (2) concerns post-processing (both are indicated by a star symbol in the flowchart in Fig. 1).” In line 327, I recommend that the authors seek to contextualize the discussion of the results obtained by showing the temperature results according to the land uses present in the analyzed section. It is important and necessary to confirm statistically whether there were differences or changes in temperatures by observing the behavior of surface temperature in vegetated environments and urban spaces Please see the response to the general comment. We have added a statistical analysis regarding each land cover category in addition to a discussion (Lines 378-397). Submitted filename: R2103554-ResponseToReviewers-4_fix.docx Click here for additional data file. 4 Mar 2022

PONE-D-21-32571R1

Thermal remote sensing over heterogeneous urban and suburban landscapes using sensor-driven super-resolution

PLOS ONE Dear Hiroki, Thank you for submitting a revised manuscript titled, ‘Thermal remote sensing over heterogeneous urban and suburban landscapes using sensor-driven super-resolution’ to PLOS ONE. Reviewers’ have provided their feedback. The quality of manuscript has improved substantially, but it will require a minor revision to address reviewers’ comments and fully meet PLOS ONE publication criteria. You can find reviewers’ comments at the bottom of this letter. We invite you to submit a revised version of the manuscript. Be sure to address: 1. Improve the statistical exploration of the results, including apply non-parametric Wilcox text 2. Emphasize gains from the modification of the algorithm. We would appreciate receiving your revised manuscript by Apr 18 2022 11:59PM. When you are ready to submit your revision, log on to https://pone.editorialmanager.com/ and select the 'Submissions Needing Revision' folder to locate your manuscript file. Please include the following items when submitting your revised manuscript:

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter. A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'. A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'. An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'. If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols. We look forward to receiving your revised manuscript. Kind regards, Kunwar K. Singh Academic Editor PLOS ONE Journal Requirements: Please review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the rebuttal letter that accompanies your revised manuscript. If you need to cite a retracted article, indicate the article’s retracted status in the References list and also include a citation and full reference for the retraction notice. [Note: HTML markup is below. Please do not edit.] Reviewers' comments: Reviewer's Responses to Questions Comments to the Author 1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation. Reviewer #1: All comments have been addressed Reviewer #2: All comments have been addressed ********** 2. Is the manuscript technically sound, and do the data support the conclusions? The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented. Reviewer #1: Yes Reviewer #2: Yes ********** 3. Has the statistical analysis been performed appropriately and rigorously? Reviewer #1: Yes Reviewer #2: No ********** 4. Have the authors made all data underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified. Reviewer #1: Yes Reviewer #2: Yes ********** 5. Is the manuscript presented in an intelligible fashion and written in standard English? PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here. Reviewer #1: Yes Reviewer #2: Yes ********** 6. Review Comments to the Author Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters) Reviewer #1: I would like to thank the authors for their work in clarify my comments and suggestions about their original work. Now, in the new version, I have found this research paper clearer and the quality of the results are well established with the new statistical indices. I am glad to propose your work for publishing. Reviewer #2: The authors met the requests made, presenting more details about how the workflow occurs for the first algorithm and presenting clarifications regarding the modifications and improvements proposed by the work that occurred in it. It is still necessary to improve the statistical exploration of the results obtained, demonstrating the real improvement that the modification of the algorithm brings. I suggest application of the non-parametric Wilcox test was applied, with 95% confidence, to determine if there was a significant difference between the radiance and brightness temperature occurring in different land covers and land uses for Algorithm1, 2 and 3. This further analysis will enrich the results presented and effectively show the gains with the modification of the algorithm ********** 7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #1: Yes: Javier Raimundo Reviewer #2: No [NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.] While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

16 Mar 2022 Reviewer #1: I would like to thank the authors for their work in clarify my comments and suggestions about their original work. Now, in the new version, I have found this research paper clearer and the quality of the results are well established with the new statistical indices. I am glad to propose your work for publishing. We are glad to hear your very positive comment, and thank you again for reviewing our paper. Reviewer #2: The authors met the requests made, presenting more details about how the workflow occurs for the first algorithm and presenting clarifications regarding the modifications and improvements proposed by the work that occurred in it. It is still necessary to improve the statistical exploration of the results obtained, demonstrating the real improvement that the modification of the algorithm brings. I suggest application of the non-parametric Wilcox test was applied, with 95% confidence, to determine if there was a significant difference between the radiance and brightness temperature occurring in different land covers and land uses for Algorithm1, 2 and 3. This further analysis will enrich the results presented and effectively show the gains with the modification of the algorithm Thank you for your constructive comments. We understand that this suggestion refers to the statistical significance regarding improvement of the accuracy indices (PSNR and EAGAS) among the three algorithms, and we agree that the recommended analysis would be helpful for clarifying the gains due to modification of the algorithm. Therefore, we compared the accuracy indices for each category for the different algorithms (Table 5), and performed a Wilcoxon signed-rank test using classified samples (i.e., n = 7) as suggested. The results show that Algorithm 3 had a smaller ERGAS and a greater PSNR than Algorithm 2 with statistical significance (p < 0.01), and that Algorithm 2 had a smaller ERGAS and a greater PSNR than Algorithm 1 with statistical significance (p < 0.01). Therefore, for all categories, Algorithm 2 was better than Algorithm 1, and Algorithm 3 was better than both Algorithms 1 and 2 (Line 392). We added this description to the Discussion section. Submitted filename: R2103554-ResponseToReviewers-5.docx Click here for additional data file. 23 Mar 2022 Thermal remote sensing over heterogeneous urban and suburban landscapes using sensor-driven super-resolution PONE-D-21-32571R2 Dear Hiroki, We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements. Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication. An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org. If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org. Kind regards, Kunwar K. Singh Academic Editor PLOS ONE Additional Editor Comments (optional): Reviewers' comments: 28 Mar 2022 PONE-D-21-32571R2 Thermal remote sensing over heterogeneous urban and suburban landscapes using sensor-driven super-resolution Dear Dr. Mizuochi: I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department. If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org. If we can help with anything else, please email us at plosone@plos.org. Thank you for submitting your work to PLOS ONE and supporting open access. Kind regards, PLOS ONE Editorial Office Staff on behalf of Dr. Kunwar K. Singh Academic Editor PLOS ONE

7 in total

1. Morphology and composition of the surface of Mars: Mars Odyssey THEMIS results.

Authors: Philip R Christensen; Joshua L Bandfield; James F Bell; Noel Gorelick; Victoria E Hamilton; Anton Ivanov; Bruce M Jakosky; Hugh H Kieffer; Melissa D Lane; Michael C Malin; Timothy McConnochie; Alfred S McEwen; Harry Y McSween; Greg L Mehall; Jeffery E Moersch; Kenneth H Nealson; James W Rice; Mark I Richardson; Steven W Ruff; Michael D Smith; Timothy N Titus; Michael B Wyatt
Journal: Science Date: 2003-06-05 Impact factor: 47.728

7. Super Resolution Infrared Thermal Imaging Using Pansharpening Algorithms: Quantitative Assessment and Application to UAV Thermal Imaging.

Authors: Javier Raimundo; Serafin Lopez-Cuervo Medina; Juan F Prieto; Julian Aguirre de Mata
Journal: Sensors (Basel) Date: 2021-02-10 Impact factor: 3.576

7 in total

Thermal remote sensing over heterogeneous urban and suburban landscapes using sensor-driven super-resolution.

Introduction

Materials and methods

Original algorithm

Flowchart for super-resolution process.

Proposed refinement

Study site and data processing

Reference satellite data and land cover map for the study site.

Simulated Point Spread Function (PSF) for 100×100 pixels (i.e., ν = 100).

Results

Tuning of the regularization parameter λ in Algorithm 3.

Correlation Coefficient (CC), Root Mean Squared Error (RMSE), and Peak Signal-to-Noise Ratio (PSNR) for each band [31, 32], and Relative Dimensionless Global Error (ERGAS) between the results from the three types of super-resolution algorithms and ASTER radiance.

Improvement in spatial resolution of MODIS thermal infrared bands by the proposed super-resolution Algorithm 3.

Comparison of maps obtained by the three algorithms.

Discussion

Statistics for super-resolved thermal data and accuracy indices for Algorithm 3 for each land-cover type.

Maps describing characteristics of super-resolution retrieval.

Conclusion

1. Morphology and composition of the surface of Mars: Mars Odyssey THEMIS results.

2. Fusion of MODIS and landsat-8 surface temperature images: a new approach.

3. Quantification of Local Warming Trend: A Remote Sensing-Based Approach.

4. Physically informed artificial neural networks for atomistic modeling of materials.

5. Examining seasonal effect of urban heat island in a coastal city.

6. Spatio-temporal evolution of urban thermal environment and its driving factors: Case study of Nanjing, China.

7. Super Resolution Infrared Thermal Imaging Using Pansharpening Algorithms: Quantitative Assessment and Application to UAV Thermal Imaging.