Literature DB >> 32670429

Assessment of NO₂ observations during DISCOVER-AQ and KORUS-AQ field campaigns.

Sungyeon Choi^1,2, Lok N Lamsal^1,3, Melanie Follette-Cook^1,4, Joanna Joiner¹, Nickolay A Krotkov¹, William H Swartz⁵, Kenneth E Pickering^1,6, Christopher P Loughner⁷, Wyat Appel⁸, Gabriele Pfister⁹, Pablo E Saide¹⁰, Ronald C Cohen¹¹, Andrew J Weinheimer⁹, Jay R Herman^1,12.

Abstract

NASA's Deriving Information on Surface Conditions from Column and Vertically Resolved Observations Relevant to Air Quality (DISCOVER-AQ, conducted in 2011-2014) campaign in the United States and the joint NASA and National Institute of Environmental Research (NIER) Korea-United States Air Quality Study (KORUS-AQ, conducted in 2016) in South Korea were two field study programs that provided comprehensive, integrated datasets of airborne and surface observations of atmospheric constituents, including nitrogen dioxide (NO2), with the goal of improving the interpretation of spaceborne remote sensing data. Various types of NO2 measurements were made, including in situ concentrations and column amounts of NO2 using ground- and aircraft-based instruments, while NO2 column amounts were being derived from the Ozone Monitoring Instrument (OMI) on the Aura satellite. This study takes advantage of these unique datasets by first evaluating in situ data taken from two different instruments on the same aircraft platform, comparing coincidently sampled profile-integrated columns from aircraft spirals with remotely sensed column observations from ground-based Pandora spectrometers, intercomparing column observations from the ground (Pandora), aircraft (in situ vertical spirals), and space (OMI), and evaluating NO2 simulations from coarse Global Modeling Initiative (GMI) and high-resolution regional models. We then use these data to interpret observed discrepancies due to differences in sampling and deficiencies in the data reduction process. Finally, we assess satellite retrieval sensitivity to observed and modeled a priori NO2 profiles. Contemporaneous measurements from two aircraft instruments that likely sample similar air masses generally agree very well but are also found to differ in integrated columns by up to 31.9 %. These show even larger differences with Pandora, reaching up to 53.9 %, potentially due to a combination of strong gradients in NO2 fields that could be missed by aircraft spirals and errors in the Pandora retrievals. OMI NO2 values are about a factor of 2 lower in these highly polluted environments due in part to inaccurate retrieval assumptions (e.g., a priori profiles) but mostly to OMI's large footprint (> 312 km2).

Entities: Chemical Disease Gene Species

Year: 2020 PMID： 32670429 PMCID： PMC7362396 DOI： 10.5194/amt-13-2523-2020

Source DB: PubMed Journal: Atmos Meas Tech ISSN： 1867-1381 Impact factor: 4.176

Introduction

Nitrogen dioxide (NO2) plays an important role in the troposphere by altering ozone production and OH radical concentration (Murray et al., 2012, 2014). It is one of the six United States Environmental Protection Agency (EPA) criteria pollutants because of its adverse health effects on humans (WHO, 2013). Major sources of nitrogen oxides (NO=NO + NO2) in the troposphere include combustion, soil, and lightning. As a trace gas with a relatively short lifetime, NO2 is usually confined to a local scale with respect to its source and therefore exhibits strong spatial and temporal variations, leading to difficulties in comparing NO2 observations by methods with different atmospheric sampling. Due to its distinct absorption features at ultraviolet–visible (UV–Vis) wavelengths, atmospheric NO2 is observable from ground- and space-based remote sensing instruments. In particular, space-based measurements of tropospheric column NO2 have been widely used to study spatial and temporal patterns (e.g., Beirle et al., 2003; Richter et al., 2005; Boersma et al., 2008; Lu and Streets, 2012; Wang et al., 2012; Hilboll et al., 2013; Russell et al., 2010, 2012; Duncan et al., 2013; Lin et al., 2015) as well as long-term trends (e.g., van der A et al., 2008; Lamsal et al., 2015; Krotkov et al., 2016), and to infer NO sources (e.g., Jaeglé et al., 2005; van der A et al., 2008; Bucsela et al., 2010; de Wildt et al., 2012; Lin, 2012; Ghude et al., 2010; Ghude et al., 2013a; Mebust and Cohen, 2013; Pickering et al., 2016) and top-down NO emissions (e.g., Martin et al., 2003; Konovalov et al., 2006; Zhao and Wang, 2009; Lin et al., 2010; Lamsal et al., 2011; Ghude et al., 2013b; Vinken et al., 2014; Schreier et al., 2015; Cooper et al., 2017; Miyazaki et al., 2017; Liu et al., 2018). These observations have also been often used to assess chemical mechanisms (e.g., Martin et al., 2002; van Noije et al., 2006; Lamsal et al., 2008; Kim et al., 2009; Herron-Thorpe et al., 2010; Huijnen et al., 2010) and to infer the lifetime of NO (e.g., Schaub et al., 2007; Lamsal et al., 2010; Beirle et al., 2011) in chemical transport models (CTMs). Surface NO2 concentrations (Lamsal et al., 2008, 2014; Novotny et al., 2011; Bechle et al., 2013) and NO deposition flux (Nowlan et al., 2014; Geddes and Martin, 2017) can also be estimated using satellite NO2 observations. As the accuracy of any application of satellite data largely depends on the data quality, validation of satellite NO2 observations is necessary. A number of validation studies of space-based tropospheric NO2 columns have been conducted using independent NO2 observations from airborne in situ mixing ratio measurements (e.g., Boersma et al., 2008; Bucsela et al., 2008; Hains et al., 2010; Lamsal et al., 2014), ground-based total column (e.g., Pandora instrument; Herman et al., 2009) and tropospheric (MAX-DOAS instrument; e.g., Vlemmix et al., 2010; Irie et al., 2012) column measurements, and airborne high-resolution differential optical absorption spectroscopy (DOAS) measurements (Lamsal et al., 2017; Nowlan et al., 2018). Most validation studies utilizing in situ and ground-based observations have reported that satellite measurements tend to underestimate tropospheric NO2 columns, especially over highly polluted areas (e.g., Hains et al., 2010). Intrinsic limits of space-based measurements, however, pose a challenge in comparisons between satellite, in situ, and ground-based measurements due to differences in representativeness. As stated above, NO2 usually exhibits very sharp spatial gradients (tens of meters to kilometers). In contrast, the spatial resolution of satellite measurements is too coarse (tens of kilometers) to capture the fine spatial features of tropospheric NO2 abundance. Therefore, it is important to recognize and account for the spatial variability while comparing satellite data with ground-based and in situ observations. While the intrinsic resolution of satellite observations cannot be altered, there are ways to improve the derived satellite data products. The fidelity of the retrieved NO2 product is dependent on the assumptions (e.g., NO2 vertical profile shape, surface reflectivity) made in the retrieval algorithm. Some of the input parameters are available at much coarser resolution than the spatial resolution of OMI, introducing spatially (e.g., rural-to-urban) varying retrieval biases. Several studies show that the use of high-resolution NO2 profiles results in significant improvements in retrievals (e.g., Russell et al., 2012; Lin et al., 2014; Lamsal et al., 2014; McLinden et al., 2014; Laughner et al., 2016, 2019; Goldberg et al., 2017). Deficiencies in model distributions of NO2 may be identified and improved through rigorous evaluation with independent data, such as the suite of data collected during the Deriving Information on Surface Conditions from Column and Vertically Resolved Observations Relevant to Air Quality (DISCOVER-AQ) campaign deployments. In this paper, we use comprehensive, integrated datasets of NO2 gathered from surface, aircraft, and space instruments during NASA DISCOVER-AQ and the NASA and National Institute of Environmental Research (NIER) Korea–United States Air Quality Study (KORUS-AQ) together with NO2 model simulations to address questions regarding retrieval accuracy. We describe the datasets in Sect. 2.1 and the models in Sect. 2.2. As an example, we focus on the NASA Standard NO2 Product from OMI onboard the Aura satellite and conduct retrieval studies using the algorithm as discussed in Sect. 2.3, but the approaches discussed here could be applied to similar products as well. Results are presented in Sect. 3.

Observations and chemical transport models

NO2 observations during DISCOVER-AQ and KORUS-AQ field campaigns

DISCOVER-AQ (https://www-air.larc.nasa.gov/missions/discover-aq/, last access: 5 September 2019) and KORUS-AQ (https://www-air.larc.nasa.gov/missions/korus-aq/, last access: 5 September 2019) were field study programs that provided comprehensive, integrated datasets of airborne and surface observations relevant to the diagnosis of surface air quality conditions from space. DISCOVER-AQ was a part of the NASA Earth Venture program and conducted four field deployments in Maryland (MD), California (CA), Texas (TX), and Colorado (CO) that covered different seasons and pollution regimes. KORUS-AQ was an international cooperation field study program conducted in the Republic of Korea (South Korea), sponsored by NASA and the South Korean government through the NIER. Table 1 summarizes the campaign locations and periods for the two field campaigns.

Table 1.

Campaign locations and time periods.

Campaign	Location	Time period	Flight days
DISCOVER-AQ	Baltimore, Maryland	June-July 2011	14
DISCOVER-AQ	San Joaquin Valley, California	January-February 2013	11
DISCOVER-AQ	Houston, Texas	September 2013	10
DISCOVER-AQ	Denver-Ft. Collins, Colorado	July-August 2014	15
KORUS-AQ	Republic of Korea (South Korea)	May-June 2016	22

The primary objectives of DISCOVER-AQ and KORUS-AQ included (1) exploring the relationship between air quality at the surface and the tropospheric columns that can be derived from satellite orbit, (2) examining the diurnal variation of these relationships, and (3) characterizing the scales of variability relevant to the model simulation and remote observation of air quality. To accomplish these objectives, an observing strategy was designed to carry out systematic and concurrent in situ and remote sensing observations from a network of ground sites and research aircraft. The payloads on research aircraft consisted of several in situ instruments that differed minimally between campaigns. Ground-based trace gas observations included in situ surface and remote sensing Pandora measurements (Herman et al., 2009). Figure 1 illustrates a conceptual view of the instruments and their sampling methods with their areal coverage for NO2 observations. While the aircraft (P-3B for DISCOVER-AQ and DC-8 for KORUS-AQ) make spirals (P-3B) or ascents and descents (DC-8) over the site, the onboard National Center for Atmospheric Research (NCAR) and thermal dissociation laser-induced florescence (TD-LIF) instruments measure in situ NO2 profiles. The aircraft usually visit each site two to four times a day to observe the diurnal variations of the NO2 profiles. The P-3B aircraft made spirals of ~ 4 km diameter, whereas the DC-8 ascents and descents covered 10–20 km. Consequently, the distance between the ground and aircraft locations was 0–5 km during the DISCOVER-AQ and 10–20 km during the KORUS-AQ campaign. Pandora and NO2 ground monitor instruments are typically located at ground stations close to the aircraft profiles. Throughout the day, Pandora reports the total column NO2 from direct-sun measurements, and the ground monitor reports the in situ surface NO2 mixing ratio. Finally, OMI retrievals report a tropospheric column NO2 once a day in the afternoon; the OMI pixel has a much larger ground footprint compared with the in situ and Pandora measurements. Table 2 lists the sites with ground-based NO2 monitors used in this analysis, along with the type of instrument employed at each site and the numbers of aircraft profiles and Pandora measurements available from each site near the time of OMI overpass. Detailed data descriptions follow in this section.

Figure 1.

Conceptual illustration of NO2 observations during the DISCOVER-AQ and KORUS-AQ field campaigns. The instruments used include ground-based monitors measuring in situ NO2 volume mixing ratios, Pandora making direct-sun measurements to retrieve the total column NO2, airborne instruments measuring in situ NO2 profiles, and the Ozone Monitoring Instrument (OMI) aboard the Aura spacecraft reporting total column and tropospheric column NO2.

Table 2.

Summary of ground supersites during DISCOVER-AQ and KORUS-AQ campaigns with ground-based NO2 measurements. The symbol N represents the sample size for aircraft and Pandora (in parentheses if different from that of aircraft profiles) measurements that are collocated with OMI observations. Surface NO2 monitors include NO analyzers with molybdenum converters (MCs), NO analyzers with photolytic converters (PCs), cavity-attenuated phase shift (CAPS) spectrometers, and cavity ring-down spectrometers (CRDSs).

Campaign	Site	Latitude, longitude	Elevation (m)	N	Ground monitor type
MD	Padonia	39.46° N, 76.63° W	120	6(4)	PC
	Fairhill	39.7° N, 76.86° W	109	3	PC
	Edgewood	39.4° N, 76.3° W	9	6(5)	PC
	Essex	39.31° N, 76.47° W	13	3(2)	MC
	Chesapeake[1]	39.16°N, 76.34° W[2]	-	3(0)	-
CA	Bakersfield	35.33° N, 119.0°W	117	5(3)	MC
	Porterville	36.03° N, 119.06° W	141	5	CAPS
	Hanford	36.32° N, 119.64° W	80	7(6)	MC
	Fresno	36.79° N, 119.77° W	97	8	MC
TX	Galveston	29.25° N, 94.86° W	0	7	PC
	Manvel Croix	29.52° N, 95.39° W	18	6	CRDS
	Deer Park	29.67° N, 95.13° W	6	4	MC
	Moody Tower	29.72° N, 95.34° W	64	4(2)	PC
	Channelview	29.80° N, 95.13° W	6	4	MC
	Conroe	30.35° N. 95.43° W	67	3	MC
CO	Fort Collins	40.59° N, 105.14° W	1577	3(2)	MC
	Platteville	40.18° N, 104.73° W	1522.5	5(4)	MC
	NREL-Golden	39.74° N, 105.18° W	1846	4(2)	CAPS
	Bao Tower	40.04° N, 105.01° W	1590	4	CRDS
	Denver La Casa	39.78° N, 105.01° W	1602	5(4)	MC
	Chatfield Park	39.53° N, 105.07° W	1675	5	MC
Korea	Olympic Park	37.52° N, 127.124°E	26	4(3)	PC
	Taehwa	37.31° N, 127.311°E	160	7(2)	CAPS

Only aircraft spirals were performed over this site.

The coordinate is approximate.

Vertical distribution of NO2 by aircraft

In situ NO2 volume mixing ratios (VMRs) were measured from the NASA P-3B (DISCOVER-AQ) and DC-8 (KORUS-AQ) aircraft. The number of flights varied between campaigns, ranging from 10 for Texas to 22 for Korea. Flights took place during a range of conditions, e.g., pollution episodes, clean days, weekdays, and weekends. Measurements usually commenced in the morning and continued throughout the day with multiple sorties on a given day. During each sortie, the aircraft made vertical spirals over surface sites, sampling NO2 between ~ 300 m and 5 km from the Earth’s surface. In Maryland, spirals were also made over the Chesapeake Bay area, which did not have any ground monitors. Airborne measurements were carried out using two different instruments and measurement techniques. The four-channel chemiluminescence instrument from the National Center for Atmospheric Research (NCAR) measured NO2 by the photolysis of NO2 and subsequent chemiluminescence detection of NO2 following the oxidation of the photolysis product NO with ozone (Ridley and Grahek, 1990). This instrument has an NO2 measurement uncertainty of 10 % and a 1 s, 2σ detection limit of 50 parts per trillion by volume (pptv). We hereafter refer to these NO2 measurements as “NCAR”. The thermal dissociation laser-induced florescence (TD-LIF) method used by the University of Berkeley detects NO2 directly and other nitrogen species (e.g., total peroxynitrates, alkyl nitrates, HNO3) following the thermal dissociation of all oxides of nitrogen (NO) to NO2 (Thornton et al., 2000). The laser-induced fluorescence method is highly sensitive for measuring NO2, with a detection limit of 30 pptv. The measurement uncertainty is 5 %. This instrument has a lower NO2 sampling frequency than the NCAR instrument due to its alternating measurement cycle for different species. We refer to these NO2 measurements as TD-LIF. Here we use 1 s merged data provided in the campaign data archives and focus on early afternoon measurements made within 1.5 h of the OMI overpass time (13:45 approximately). This time window of ±1.5 h is selected to maximize the number of samples while reducing effects from the diurnal variation of NO2. Figure 2 shows the mean NO2 profile for each of the DISCOVER-AQ and KORUS-AQ campaigns. Measurements show considerable spatiotemporal variation as well as some indication of a well-developed mixing layer, with the maximum mixing ratio near the ground. The mixing layer heights vary by region and season. For example, in the MD campaign conducted in summer, the mixing layer stretches up to 800 hPa (2 km). In contrast, the mean profiles from the CA campaign conducted in winter show a shallow mixing layer extending only up to 950 hPa (~ 700 m). Near-surface NO2 mixing ratios also vary by campaign location and possibly by season, with the highest near-surface NO2 in CA. In South Korea, the mean near-surface NO2 mixing ratio is not as high as in CA, but a very high (~ 5 ppbv) NO2 mixing ratio stretches up to 850 hPa, resulting in the greatest NO2 column. While the NCAR and TD-LIF mean profiles generally agree with each other in the MD, CA, and CO campaigns, they exhibit larger differences in TX and South Korea. Figure 2 also shows the nature of the variability in observed and simulated NO2 vertical profiles over the campaign domains. The observed differences between the model and observations arise primarily from a mismatch in both spatial and temporal sampling. The use of more restrictive collocation (spatial and temporal) applied for comparing different datasets in Sect. 3.1 and examining the air mass factor (AMF) effect in Sect. 2.3.2 would have resulted in different vertical distributions.

Figure 2.

Mean early afternoon NO2 profiles, both observed and modeled, for the DISCOVER-AQ and KORUS-AQ campaigns. Colored lines represent the average for airborne in situ profiles from NCAR (blue) and TD-LIF (green) instruments compared with simulated profiles from the GMI global model (orange) and the CMAQ (DISCOVER-AQ) or WRF-Chem (KORUS-AQ) regional models (red). The standard deviations of airborne profiles are indicated as shaded areas for NCAR (lavender) and TD-LIF (green) instruments. The blue–gray color represents the overlap of the two.

In situ surface NO2 measurements

To extend the altitude range of the vertical profiles discussed in Sect. 2.1.1, we merge in situ aircraft profile measurements with coincident in situ surface NO2 measurements sampled over the duration of spirals (~ 20 min) by linearly interpolating the NO2 mixing ratios between the surface and the lowest aircraft altitudes. These new merged profiles contain a greater portion of the tropospheric NO2 column. During both the DISCOVER-AQ and KORUS-AQ campaigns, in situ surface NO2 monitors were deployed at several ground sites (Table 2). Measurements were carried out using one of four different types of NO2 monitors, including a chemiluminescence NO monitor equipped with either a molybdenum or photolytic converter, a cavity-attenuated phase shift (CAPS) spectrometer, and a cavity ring-down spectrometer (CRDS). The molybdenum converter analyzer measures NO2 indirectly by the thermal conversion of NO2 to NO using molybdenum and the detection of NO by chemiluminescence that results from the reaction of NO with ozone. Since the reduction process could convert not only NO2 but also other reactive nitrogen species, this instrument could overestimate NO2 concentrations (Dunlea et al., 2007; Steinbacher et al., 2007; Lamsal et al., 2008; Dickerson et al., 2019). The magnitude of interference depends on the relative concentrations of NO2, nitric acid, alkyl nitrates, and peroxy-acetyl nitrate, which vary spatially, diurnally, and seasonally and are difficult to quantify. Considering their use in the sections below (Sects. 2.3.2 and 3), we conducted a sensitivity study examining how 0 %–50 % biases in molybdenum converter measurements could impact tropospheric columns derived from merged (aircraft + surface) profiles. We found that the errors are usually rather small at < 6 % for various sites. Therefore, no attempt is made here to correct for the interference in these measurements, although we identify those sites in Table 2 and Fig. 6.

Figure 6.

Comparison of NO2 tropospheric columns derived from NCAR, TD-LIF, and Pandora instruments. Different colors represent the campaign location, and the symbols represent the type of surface monitor (open circle: photolytic converter, plus: molybdenum converter, triangle: CAPS, square: CRDS).

The operating principle of a photolytic converter analyzer is also gas-phase chemiluminescence, but the use of a photolytic converter to reduce NO2 to NO makes it more specific to NO2. As a result, this instrument provides nearly interference-free NO2 measurements, with the exception of nitrous acid (HONO; Ryerson et al., 2000). Measurement uncertainties for 1 h averages are expected to be ~ 10 % (Fehsenfeld et al., 1990). The CAPS instrument detects NO2 by measuring absorption around 450 nm. Baseline measurements spanning minutes to hours with a source of NO2-free air are needed to determine NO2 amounts. In contrast to the chemiluminescence–molybdenum converter techniques, CAPS directly detects NO2. Its specificity for NO2 is affected by potential interference from species like glyoxal, water vapor, and ozone that absorb light within the band pass of the instrument. The detection limit is < 0.1 ppb for a 10 s measurement. NO2 measurements from CAPS and chemiluminescence NO monitors with a molybdenum converter are reported to agree to within 2 % (Kebabian et al., 2008). A CRDS is a sensitive and compact detector that measures multiple nitrogen species including NO2. It employs a laser diode at 405 nm for the direct detection of NO2. Interferences arising from absorption by other trace gases, such as ozone and water vapor, are expected to be small. The measurement precision is 20 ppt at a 1 s time resolution and the accuracy is better than 5 %, which is primarily limited by the NO2 absorption cross section used in the data reduction process. The total reactive nitrogen (NO) measured by the CRDS and chemiluminescence NO monitor with a molybdenum converter is found to agree to within 12 % (Wild et al., 2014).

Pandora total column NO2

In addition to in situ measurements, each campaign hosted ground-based networks of Pandora instruments. Pandora is a small, commercially available sun-viewing spectrometer optimized for the detection of trace gases, including NO2. It measures direct solar spectra in the 280–525 nm spectral range with 0.6 nm resolution. A detailed description of the instrument’s design, operation, and retrieval method can be found in Herman et al. (2009, 2018). The NO2 retrieval algorithm includes (1) a direct-sun spectral fitting method similar to traditional differential optical absorption spectroscopy (DOAS) (Platt, 1994) using one measurement (or an average of several measurements) as a reference spectrum to derive relative NO2 slant column densities (SCDs), (2) the application of the Modified Langley Extrapolation (MLE) to derive total NO2 SCDs, and (3) the conversion of total NO2 SCDs to vertical column densities (VCDs) using the direct-sun air mass factor (AMF) as follows: The spectral fitting is performed over the 400–440 nm window; it fits NO2 cross sections at 254.5 K (Vandaele et al., 1998), ozone (Brion et al., 1993), and a fourth-order smoothing polynomial, and it applies a wavelength shift and a constant offset. In clear-sky conditions, this instrument provides total NO2 VCD with a precision of 2.7×1014 and an absolute accuracy of 1.3×1015 molec cm−2 (Herman et al., 2018). Potential sources of error in NO2 retrievals include the calibration of raw data, the chosen reference spectrum, and the use of a fixed temperature for the NO2 cross section. Pandora NO2 data have been compared with data from direct-sun multifunction DOAS (MFDOAS) and Fourier transform ultraviolet spectrometry (UVFTS) (Herman et al., 2009) and have been found to agree within 12 %. These data are regularly used to validate satellite NO2 retrievals (e.g., Lamsal et al., 2014; Tzortziou et al., 2015, 2018; Ialongo et al., 2016). Here, we use clear-sky quality-controlled (root mean square (rms) < 0.05 and errors < 0.05 DU) 80 s total column NO2 data averaged over the duration of each aircraft spiral. We infer tropospheric column NO2 by subtracting the OMI stratospheric column from the Pandora total column to compare with tropospheric NO2 from in situ and OMI observations.

NO2 simulations

GMI simulation

The Global Modeling Initiative (GMI) three-dimensional chemical transport model (CTM) simulates the troposphere and stratosphere (Strahan et al., 2013) with a stratosphere–troposphere chemical mechanism (Duncan et al., 2007) updated with the latest chemical rate coefficients (Burkholder et al., 2015) and time-dependent natural and anthropogenic emissions (Strode et al., 2015). Aerosol fields are computed online with the Goddard Chemistry Aerosol Radiation and Transport (GOCART) model (Chin et al., 2014, and references therein). Tropospheric processes such as NO production by lightning, scavenging, and wet and dry deposition are also represented in the model. The GMI simulations used in this work were constrained with meteorology from the Modern-Era Retrospective Analysis for Research and Applications version 2 (MERRA-2) meteorolog ical fields (Gelaro et al., 2017) at 72 vertical levels from the surface to 0.01 hPa, with a resolution ranging from 150 m in the boundary layer to ~ 1 km in the upper troposphere and lower stratosphere, and at a horizontal spatial resolution of 1.25° longitude ×1.0° latitude. GMI simulations have been evaluated in the troposphere and stratosphere. Strode et al. (2015) showed good agreement with tropospheric O3 and NO trends in the US in a 1990–2013 hindcast simulation. Strahan et al. (2016) demonstrated realistic seasonal and interannual variability of Arctic composition using comparisons to Aura MLS O3 and N2O. The simulation of NO2 in both the troposphere (Lamsal et al., 2014) and stratosphere (Spinei et al., 2014; Marchenko et al., 2015) has been shown to be in good agreement with independent measurements. We sample the model profile at the times and locations of airborne measurements. Figure 2 compares GMI NO2 profiles with collocated aircraft measurements during the DISCOVER-AQ and KORUS-AQ field campaigns. The GMI simulation generally captures the vertical distribution of NO2 in the free troposphere, is somewhat lower in the middle and upper parts of the mixing layer, and exhibits sharper gradients between the boundary layer and the surface. Due to the coarse spatial resolution of the GMI model, the surface pressure of the GMI profiles differs from the measurements, especially over complex terrain in CA, CO, and Korea.

NO2 simulations using regional models

For each DISCOVER-AQ and KORUS-AQ deployment, a high-resolution model simulation was conducted. We use NO2 profiles from those simulations to examine their effect on retrievals in Sect. 2.3.2 and to downscale OMI NO2 retrievals in Sect. 2.3.3. Below we provide a brief description of each simulation. Information about model options for these simulations can be found in Table A1 in the Appendix. For most of the campaigns, the near-surface NO2 concentration and the model profile shapes agree in general with the NCAR and TD-LIF profiles. In TX, however, the CMAQ simulation shows lower mixing ratios than observations throughout the mixing layer (Fig. 2).

Table A1.

Model options for each simulation.

Note that all model options listed are for the domain used for the analysis.

	MD	CA	TX	CO	Korea
Dates	5/24/2011–8/1/2011	1/10/2013–2/28/2013	8/18/2013–10/1/2013	7/9/2014–08/20/2014	5/1/2016–5/31/2016
WRF model options
Version	3.3	3.8	3.6.1	3.8.1	3.6.1
Model top	lOOhPa	50hPa	50hPa	50hPa	50hPa
Spatial resolution	4 km	4 km	4 km	4 km	4 km
Vertical levels	34	35	45	37	52
Radiation	LW: RRTM	LW: RRTMG	LW: RRTM	LW: RRTMG	LW: RRTM
	SW: Goddard	SW: RRTMG	SW: Goddard	SW: RRTMG	SW: Goddard
Land surface model	Noah Land Surface Model	Pleim-Xiu	Pleim-Xiu	Unified Noah Land	Unified Noah Land
	(Tewari et al., 2004)	(Pleim and Xiu, 2003)	(Pleim and Xiu, 2003)	Surface Model	Surface Model
Boundary layer	YSU (Hong et al., 2006)	ACM2 (Pleim, 2007)	ACM2 (Pleim, 2007)	YSU (Hong et al., 2006)	MJY scheme
Meteor, init. and	12 km NAM	12 km NAM	12 km NAM	NCAR ECMWF	0.25 degree GFS
bound, cond.
CMAQ model options					WRF-Chem
Version	5.0	5.2	5.0.2	5.2 beta	3.6.1 (modified)
Coupled?	No	Yes	No	No	Yes
Chemical mechanism	Carbon Bond (CB05)	Carbon Bond (CB06, e51)	Carbon Bond (CB05)	Carbon Bond	Reduced hydrocarbon
	(Yarwood et al., 2005)		(Yarwood et al., 2005)	(CB06, r3)	(Pfisteretal.,2014)
Aerosol	AE5	AERO6	AE5	AERO6	MOSAIC 4 bin
Chem. init. and	12 km CMAQ v5.0	12kmCMAQv5.2	MOZART	RAQMS	24 km MACC for
bound, cond.	simulation	simulation	(outer domain)	(outer domain)	chemistry
Emissions	Described in	4 km 2013 emissions, emissions	2012 TCEQ anthropogenic	Described in report	Described in
	Loughner et al. (2014)	based on the 2011	emissions Biogenic Emission		Goldberg et al. (2019)[1]
		NEI with year-specific updates	Inventory System (BEIS)		and Saide et al. (2019)
		to EGU point sources	calculated within
		(CEMs data),	CMAQ
		fires and mobile
		(MOBILE6)

LW: longwave, SW: shortwave, RRTM: Rapid Radiative Transfer Model, RRTMG: Rapid Radiative Transfer Model for General Circulation Models, AE5: aerosols with aqueous extensions version 5, MOZART: Model for OZone and Related chemical Tracers, RAQMS: Real Time Air Quality Monitoring System, MACC: Monitoring Atmospheric Composition and Climate.

https://www.colorado.gov/airquality/tech_doc_repository.aspx?action=open&file=FRAPPE-NCAR_Final_Report_July2017.pdf (last access: 5 September 2019).

MD. The Weather Research and Forecasting (WRF) model was run (Loughner et al., 2014) from 24 May through 1 August 2011 at horizontal resolutions of 36, 12, 4, and 1.33 km with 45 vertical levels from the surface to 100 hPa with 16 levels within the lowest 2 km. Meteorological initial and boundary conditions were taken from the 12 km North American Mesoscale (NAM) model. Output from the 4 and 1.33 km WRF simulations were fed into the Community Multiscale Air Quality (CMAQ; Byun and Schere, 2005). Chemical initial and boundary conditions for the 4 km CMAQ run came from a 12 km CMAQ simulation covering the continental US, which was performed for the GEO-CAPE Regional Observing System Simulation Experiment (OSSE). The creation of the emissions used within the CMAQ simulation is described in Loughner et al. (2014) and Anderson et al. (2014). CMAQ was run with reduced mobile emissions by 50 % and an increase in the photolysis frequency of organic nitrate species based on Anderson et al. (2014). CA. The coupled WRF–CMAQ modeling system (Wong et al., 2012) was run from 1 January through 28 February 2013 (2013 DISCOVER-AQ California campaign period) at horizontal resolutions of 4 and 2 km, with 35 vertical levels from the surface to 50 hPa and an average height of the middle of the lowest layer of 20 m. WRF version 3.8 and CMAQ version 5.2.1 were used in a coupled format, allowing for frequent communication between the meteorological and chemical transport models and indirect effects from aerosol loading on the meteorological calculations in WRF. Meteorological initial and boundary conditions were taken from the 12 km NAM reanalysis product from NOAA statistical and mathematical symbols. Observation nudging above the planetary boundary layer (PBL) using four-dimensional data assimilation (FDDA) was applied in WRF. Chemical initial and boundary conditions for the 4 km CMAQ simulation came from a 12 km CMAQ simulation covering the continental US, while initial and boundary conditions for the 2 km simulation were obtained from the 4 km WRF–CMAQ simulation. Emissions are based on the 2011 US National Emissions Inventory (NEI) with year-specific updates to point and mobile sources, while biogenic emissions were calculated inline in CMAQ using the Biogenic Emissions Inventory System (BEIS). TX. To simulate the DISCOVER-AQ Texas campaign, a WRF model simulation was performed from 18 August through 1 October 2013, covering the entire field deployment in September 2013. The model was run at 36, 12, 4, and 1.33 km horizontal resolutions with 45 levels from the surface to 50 hPa. Meteorological initial and boundary conditions were taken from the 12 km North American Mesoscale (NAM) model. Output from the 4 and 1.33 km simulations were used to run the CMAQ model. Chemical and initial boundary conditions for the outer domain were taken from the Model for Ozone and Related chemical Tracers (MOZART) chemical transport model (CTM). Detailed information about these simulations and the emissions used can be found at http://aqrp.ceer.utexas.edu/projectinfoFY14_15/14-004/14-004FinalReport.pdf (last access: 5 September 2019). CO. For the Colorado deployment, WRF was run from 9 July through 20 August 2014 at spatial resolutions of 12 km (covering the western US) and 4 km (covering Colorado). The model top was set at 50 hPa, with 37 levels in the vertical. Analysis fields from the European Centre for Medium-Range Weather Forecasts (ECMWF) were used for meteorological initial and boundary conditions. Chemical initial and boundary conditions for the outer domain were taken from Real Time Air Quality Monitoring System (RAQMS) model output. Further information about this simulation can be found at https://www.colorado.gov/airquality/tech_doc_repository.aspx?action=open&file=FRAPPE-NCAR_Final_Report_July2017.pdf (last access: 5 September 2019). Korea. Air quality forecasts were performed using the Weather Research and Forecasting model (Skamarock et al., 2008) coupled to the Chemistry (WRF-Chem) (Grell et al., 2005) model to support KORUS-AQ flight planning and post-campaign analysis. The modeling domains consist of a regional domain of 20 km resolution covering major sources of transboundary pollutants affecting the Korean Peninsula: anthropogenic pollution from eastern China, dust from inner China and Mongolia, and wildfires from Siberia (Saide et al., 2014). A 4 km resolution domain was nested and covered the Korean Peninsula and surroundings, which encompassed the region where the DC-8 flights were planned and better resolved local sources. Anthropogenic emissions were developed by Konkuk University for KORUS-AQ forecasting and are described in Goldberg et al. (2019).

OMI NO2 observations

The Ozone Monitoring Instrument (OMI) aboard the NASA Aura satellite provides measurements of solar backscatter that are used to retrieve total, stratospheric, and tropospheric NO2 columns with a native ground resolution varying from 13km × 24km near nadir to 40km × 250km at swath edges (Levelt et al., 2006, 2018). The Aura satellite was launched on 15 July 2004 into a sun-synchronous polar orbit with a local Equator crossing time of 13:45 in the ascending node. OMI is one of the most stable UV–Vis satellite instruments providing a long-term high-resolution data record with low degradation (Dobber et al., 2008; DeLand and Marchenko, 2013; Schenkeveld et al., 2017). In the middle of 2007, an anomaly began to appear in OMI radiances in certain rows affecting all Level 2 products (Schenkeveld et al., 2017). This “row anomaly” can be easily identified, and the affected rows are discarded. We use OMI pixels with a cloud radiance fraction less than 50 % and quality flags indicating good data.

Standard OMI NO2 Product

Here we use the Standard OMI NO2 Product (OMNO2) version 3.1, with updates from version 3.0 (Krotkov et al., 2017). The NO2 retrieval algorithm uses the differential optical absorption spectroscopy (DOAS) technique. The retrieval method includes (1) the determination of NO2 slant column density (SCD) using a DOAS spectral fit of the NO2 cross section from measured reflectance spectra over the 402–465 nm range; (2) the calculation of an air mass factor (AMF) that is required to convert SCD into vertical column density (VCD); and (3) a scheme to separate stratospheric and tropospheric VCDs. The AMF calculation is performed by combining NO2 measurement sensitivity (scattering weights) from the TOMS RADiative transfer model (TOMRAD; Dave, 1964) with the a priori relative vertical distribution (profile shape) of NO2 taken from the GMI CTM. Computation of scattering weights requires information on viewing and solar geometries, terrain and cloud reflectivities, terrain and cloud pressures, and cloud cover (radiative cloud fraction). The version used here represents a significant advance over previous versions (Bucsela et al., 2006, 2013; Celarier et al., 2008; Lamsal et al., 2014). It includes an improved DOAS algorithm for retrieving slant column densities (SCDs) as discussed in Marchenko et al. (2015). The key features of the algorithm include more accurate wavelength registration between Earth radiance and solar irradiance spectra, iterative accounting of the rotational Raman scattering effect, and sequential SCD retrieval of NO2 and interfering species (water vapor and glyoxal). Solar irradiance reference spectra are monthly average data derived from OMI measurements instead of an OMI composite solar spectrum used in prior versions. Cloud pressure and cloud fraction are taken from an updated version of the OMCLDO2 cloud product that includes updated lookup tables and O2–O2 SCD retrieved with a temperature correction (Veefkind et al., 2016). A priori NO2 profiles are as discussed in Lamsal et al. (2015) and Krotkov et al. (2017) and use 1° latitude 1.25° longitude GMI model-based monthly a priori NO2 profiles with year-specific emissions. This retrieval version also uses more accurate information on terrain pressure that is calculated from high-resolution digital elevation model (DEM) data at 3 km resolution and GMI terrain pressure.

Recalculation of OMI NO2 AMF using alternative NO2 profiles

NO2 vertical profiles, especially in the troposphere, vary strongly in both space and time. The simulated NO2 profiles from a global CTM (GMI) employed in the operational NO2 retrieval, while offering a good option at a global scale, may not sufficiently capture the distribution of NO2 at OMI’s ground resolution. Using precalculated scattering weights (Sw) made available in the OMNO2 product and alternative information on vertical NO2 profile shape (Xa), the OMI NO2 AMF can be readily recalculated (Lamsal et al., 2014): where the integral from the surface to the tropopause yields the tropospheric AMF (AMFtrop). Scattering weights vary with viewing and solar geometry, cloud–aerosol conditions, and surface reflectivity, but they are assumed to be independent of the vertical distribution of NO2. The typical vertical distribution of scattering weights is characterized by lower values in the troposphere due to reduced sensitivity owing to Rayleigh scattering and higher values (corresponding to a nearly geometric AMF) in the stratosphere. The AMF is therefore highly sensitive to NO2 profile shape in the lower troposphere. Here, we investigate how a priori NO2 profiles affect OMI tropospheric AMF and consequently the retrieval of OMI tropospheric NO2 VCD. For this, we combine the measured profile (from the surface to ~ 5 km) with coincidently sampled simulated NO2 from GMI (5 km to the tropopause) to create a complete tropospheric NO2 profile. We choose the GMI simulation over the high-resolution model simulations because we found that the GMI generally better performed in the free troposphere compared to the regional models. We then interpolate the pressure-tagged NO2 observations (aircraft NCAR NO2 + surface) onto the pressure grid of the OMI NO2 scattering weight. The tropospheric AMFs obtained using individual measured profiles (AMFobs) are compared with the AMFs in the OMI Standard Product (AMFSP), which are calculated using the GMI yearly varying monthly climatology (Fig. 3a). AMFSP is generally higher than AMFobs by 34 % on average, with the largest difference (61.6 %) for TX and the smallest difference (16.6 %) for Korea; this means that the OMI SP VCDs, based on the AMFSP, are correspondingly smaller on average than the those based on measured profiles. The correlation ranges from fair (r = 0.41, N = 21) for MD and TX to excellent (r ≥ 0.92, N = 36) for CA and Korea, with the overall correlation coefficient of 0.53.

Figure 3.

Comparison of AMFs calculated using observed NO2 profiles (AMFobs) with tropospheric AMFs in the OMI Standard Product (AMFSP, a), and those calculated using NO2 profiles from high-resolution model simulations (AMFHR, b). Panel (c) compares tropospheric AMFs using daily versus campaign-average profiles (AMFobs-m). The symbols are color-coded by campaign location.

To explore how NO2 profiles from high-resolution model simulations could affect OMI NO2 retrievals, we calculate tropospheric AMFs using simulated monthly NO2 profiles (AMFHR). Since the OMI ground pixel size is much larger than the model grid boxes, we derive an average profile of all model grid boxes located within one OMI pixel and use it to calculate AMFHR. Figure 3b compares AMFobs with AMFHR; it suggests improved agreement compared to AMFSP (Fig. 3a), especially for CA, CO, and Korea, although with no significant improvement in the correlation. We also considered how using AMFs based on monthly mean profiles, such as the OMI SP, impacts retrieved NO2. To assess this, we calculated AMFs using both daily (AMFobs) and campaign-average measured NO2 profiles (AMFobs-m). Figure 3c shows that AMFobs and AMFobs-m agree to within 5.3 % and exhibit excellent correlation (r > 0.8). That is, the use of a mean profile does not make a significant difference compared to the individual daily profiles, implying that the average profile generally captures the local vertical distribution fairly well. Somewhat larger scatter in TX may be related to stronger land–sea breeze dynamics that could affect the vertical distribution of NO2 in both the boundary layer and free troposphere. Our results here differ from previous studies that reported improved agreement of OMI NO2 retrievals using simulated daily NO2 profiles with independent observations (Valin et al., 2013; Laughner et al., 2019), although Laughner et al. (2019) also suggested poorer performance with daily profiles in the southeast US than in other regions.

Downscaled OMI NO2 data

The NO2 value associated with an OMI ground pixel is averaged over a large area. This spatial smoothing leads to a loss of information on sub-pixel variation, which could be considerable for NO2, especially over urban source regions. Therefore, it is important to recognize and address this limitation while assessing, interpreting, and using satellite NO2 data. Here we use high-resolution NO2 model simulations for sub-pixel variation. We apply the method described by Kim et al. (2016, 2018) to downscale OMI NO2 retrievals, which are then compared with aircraft and Pandora data. This method applies high-resolution model-derived spatial-weighting kernels to individual OMI pixels and calculates sub-pixel variability within the pixel. The major assumption is that the model captures the spatial distribution of emission sources and NO2 transport patterns well. The method ensures that the quantity (total number of molecules) of the satellite data over the pixel is numerically preserved, while adding higher-resolution spatial information to the derived tropospheric NO2 columns. Figure 4 illustrates the downscaling of tropospheric NO2 for an OMI pixel using the high-resolution CMAQ simulation over Essex, Maryland. The tropospheric NO2 column observed by OMI (5.9 × 1015 molec cm−2) is 25.7 % higher than the average of the CMAQ NO2 columns over the pixel. The spatial-weighting kernels suggest more than an order of magnitude difference in NO2 within this single OMI pixel. Applying the kernels to the original OMI pixel value results in a range of sub-pixel NO2 column values from 1.9×1015 over a clean background to 3.2×1016 molec cm−2 over a pol luted hot spot.

Figure 4.

An illustration of downscaled OMI NO2 for an OMI pixel over Essex, MD, from orbit 37024 on 1 July 2011. Shown are the original OMI tropospheric NO2 VCD (a), coincidently sampled CMAQ NO2 VCD at a spatial resolution of 4×4 km2 (b), the spatial-weighting kernel (c), and downscaled OMI tropospheric NO2 VCD (d). These pixels coincide with an airborne in situ NO2 profile sampled during the DISCOVER-AQ Maryland campaign, and the flight route is marked with a black line. The location of the NO2 surface monitor and Pandora instrument is marked with a red dot.

Figure 5 demonstrates how the downscaled OMI NO2 data using high-resolution NO2 output from a CMAQ simulation compare with the original OMI NO2 data from the standard product. Both OMI SP and CMAQ show enhanced NO2 columns at major urban areas, but their magnitudes differ, with OMI showing lower values. As described above, OMI’s field of view covers a large area, sampling the NO2 field over the entire pixel, while the actual NO2 distribution (better resolved by the CMAQ simulation) is defined by local source strengths, chemistry, and wind patterns that can occur at much finer spatial scales. By employing the relative ratios inside an OMI pixel rather than the overall magnitude of simulated columns, the downscaling technique yields a more detailed structure, enhancing NO2 over sources and dampening it elsewhere by more than a factor of 2.

Figure 5.

Tropospheric NO2 VCD maps from (a) OMI SP, (b) CMAQ, and (c) downscaled OMI over Maryland on 29 July 2011. Panel (d) shows the difference between downscaled and standard tropospheric NO2 VCD data (c minus a). The gray areas represent pixels with an effective cloud fraction > 0.3.

Results and discussion

Comparison between in situ observations

Figure 6a and Table 3 summarize how the two airborne in situ NO2 tropospheric column measurements compare. We derive the column amount by first extending the NCAR and TD-LIF NO2 profiles to the same surface NO2 concentration measurements and then integrating the NO2 profiles. The only exception is at the Chesapeake Bay during the MD campaign, the only marine site used in this study; we extend a constant NO2 mixing ratio measured at the lowest aircraft altitudes to the surface. To compare with OMI and Pandora retrievals, NO2 amounts for the missing portion from the top of the aircraft altitude to the tropopause are added from the GMI simulation. This amount varied between 4.7×1014 and 1.2×1015 molec cm−2 and represented an average 5 % of the tropospheric NO2 columns but can reach up to 50.8 % for an individual profile. Overall, the two airborne in situ columns generally agree very well and exhibit excellent correlation (r = 0.87–0.99). The correlation and mean difference differ among the five campaigns, with TD-LIF higher than NCAR by 31.9 % in TX and 11.6 % in Korea but lower by ~ 10% in MD and CO. The observed difference in TX is much larger than the reported uncertainty of both NCAR and TD-LIF measurements. Analysis of individual profiles suggests that the data from TD-LIF are generally higher than NCAR at all altitudes, regardless of the NO2 pollution level (Fig. 7). The underlying cause of this difference is not clear, but it may be associated with the applied calibration standard or an interference issue for either or both of the two measurements. The small difference elsewhere could come from the lower measurement frequency of TD-LIF compared with the NCAR instrument.

Table 3.

Comparison between NCAR, TD-LIF, and Pandora NO2 observations.

Campaign	NCAR vs. TD-LIF		NCAR vs. Pandora		TD-LIF vs. Pandora
No. of profs.	Mean diff. (%)		Mean diff. (%)		Mean diff. (%)
(Pandora)	(TD-LIF - NCAR)	r	(Pandora - NCAR)	r	(Pandora - TD-LIF)	r
MD 21 (14)	−9.6	0.87	−24.5	0.42	−18.3	0.18
CA 25 (22)	7.2	0.93	11.1	0.65	4.8	0.58
TX 28 (26)	31.9	0.97	−39.1	0.94	−53.9	0.93
CO 26 (21)	−6.6	0.99	−2.8	0.81	4.2	0.78
Korea 11 (5)	11.6	0.99	20.3	0.95	7.5	0.94
All 111 (88)	8.3	0.99	−2.0	0.92	−9.8	0.90

Figure 7.

Vertical distribution of NO2 mixing ratios at different local solar time (LST) over Galveston (a, b, c) and Deer Park (d, e, f) in TX measured by the NCAR (light blue) and TD-LIF (orange) instruments. The circles in lighter colors represent 1 s measurements, and the solid lines show the mean values for NCAR (blue) and TD-LIF (red).

Comparison between Pandora and aircraft observations

Figure 6b–c and Table 3 show the comparison between Pandora and the two airborne tropospheric NO2 column measurements. We derive tropospheric columns from Pandora by subtracting collocated OMI stratospheric NO2 columns from the Pandora total column NO2 retrievals. The relationship between the aircraft and Pandora data is not as good as between the two aircraft measurements themselves. The use of OMI stratospheric NO2 columns to derive tropospheric columns from Pandora could impact the comparison between Pandora and aircraft observations; this approach is unlikely to be a significant factor over the polluted DISCOVER-AQ and KORUS-AQ campaign domains. The correlation ranges from fair (r = 0.42) to excellent (r = 0.95) for NCAR versus Pandora and poor (r = 0.18) to excellent (r = 0.94) for TD-LIF versus Pandora. The overall correlation coefficients between Pandora and the airborne NCAR and TD-LIF measurements are 0.94 and 0.91, respectively, with higher correlation in CO, TX, and Korea and lower correlation in MD and CA. Pandora data are about a factor of 2 lower than air craft measurements in TX. Elsewhere, Pandora data agree with aircraft measurements to within 20 % on average, although much larger differences are observed for individual sites. A larger discrepancy for Pandora data in TX is also reported by Nowlan et al. (2018), who used various NO2 measurements to evaluate GeoTASO NO2 retrievals. The reasons for such exceptionally large differences could include strong gradients in the NO2 field that are missed by aircraft spirals, errors in Pandora retrievals, or both.

Assessment of OMI NO2 retrievals

We compare OMI tropospheric NO2 columns with Pandora data and vertically integrated columns from aircraft spirals at 23 locations (Table 2) during the DISCOVER-AQ and KORUS-AQ field campaigns. We only analyze OMI pixels that overlap individual aircraft profiles. Spatially collocated aircraft and Pandora data are temporally matched to OMI by allowing only the measurements made within 1.5 h of the OMI overpass time. We infer tropospheric columns from Pandora by subtracting OMI-derived stratospheric NO2 from Pandora total columns. Figure 8a and b and Table A2 present tropospheric NO2 columns from the OMI Standard Product compared with integrated columns from the NCAR and TD-LIF instruments. Although the OMI and aircraft data are significantly correlated (r = 0.39–0.87), OMI NO2 retrievals are generally lower, with the largest difference in CO and the smallest difference in MD. OMI data are also lower than Pandora as shown in Fig. 8c. The magnitude of the difference and the degree of correlation with OMI vary for NCAR, TD-LIF, and Pandora measurements. This discrepancy between OMI, aircraft spiral columns, and Pandora local measurements is due to a combination of strong NO2 spatial variation, the size of OMI pixels, and the placement of the sites, but OMI retrieval errors arising from inaccurate information in the AMF calculation, such as a priori NO2 profiles, and potential errors in the validation sources themselves also contribute.

Figure 8.

Comparison of tropospheric NO2 columns from OMI with the data from NCAR (a, d, g), TD-LIF (b, e, h), and Pandora (c, f, i) instruments. OMI retrievals are performed using the default GMI (a–c) and observed NO2 profiles (d–i). In addition, OMI columns in (g)–(i) are downscaled with high-resolution (CMAQ and/or WRF-Chem) model simulations. Different colors represent the campaign locations.

Table A2.

Summary of NO2 comparison between the OMI Standard Product (OMISP) and NCAR, TD-LIF, and Pandora observations.

The mean difference is calculated as OMI minus observations.

Campaign	NCAR vs. OMI_SP		TD-LIF vs. OMI_Sp		Pandora vs. OMI_Sp
No. profs (Pandora)	Mean diff. (%)	r	Mean diff. (%)	r	Mean diff. (%)	r
MD 21 (14)	−40.7	0.39	−34.4	0.54	−21.8	0.21
CA 25 (22)	−53.8	0.77	−56.9	0.81	−58.5	0.24
TX 28 (26)	−54.9	0.65	−65.8	0.56	−26.9	0.65
CO 26 (21)	−67.5	0.73	−65.2	0.75	−68.2	0.72
Korea 11 (5)	−41.9	0.87	−47.9	0.87	−60.1	0.8
All 111 (88)	−51.9	0.82	−55.6	0.83	−54.6	0.84

Figure 8d–f and Table A3 show the comparison after partially accounting for OMI retrieval errors arising from a priori NO2 profiles taken from the GMI model. Replacing the model profiles with the NCAR and TD-LIF observed NO2 profiles in the AMF calculations addresses the issues related to model inaccuracies, although the measured profiles may not necessarily represent the true average NO2 over the entire OMI pixel (e.g., Fig. 4). Nevertheless, using observed profiles reduces OMI’s mean differences with NCAR by 8 %–29.2 %, TD-LIF by 8.7 %–24.4 %, and Pandora by 6.8 %–24.2 %. Changes are largest in TX and smallest in CA and Korea. Correlations are either improved or remain similar.

Table A3.

Same as A2, but for OMI using AMFobs (OMIobs).

Campaign	NCAR vs. OMI_SP		TD-LIF vs. OMI_Sp		Pandora vs. OMI_Sp
No. profs (Pandora)	Mean diff. (%)	r	Mean diff. (%)	r	Mean diff. (%)	r
MD21(14)	−23.7	0.61	−17.6	0.7	2.4	0.3
CA 25 (22)	−42.4	0.73	−45.8	0.75	−47.9	0.2
TX 28 (26)	−25.5	0.82	−41.3	0.76	21.6	0.81
CO 26 (21)	−54.2	0.7	−50.5	0.71	−55.2	0.69
Korea 11 (5)	−33.9	0.87	−39.2	0.86	−53.3	0.79
All 111 (88)	−37.5	0.82	−41.5	0.82	−39.2	0.84

Figure 8g–i and Table A4 show the comparison of OMI NO2 columns derived using observed profiles with NCAR, TD-LIF, and Pandora observations after accounting for spatial variation in the NO2 field as suggested by the CMAQ simulation. After downscaling, the agreement of OMI NO2 columns improves further with NCAR by 1.1 %–41.5 %, TD-LIF by 1.2 %–39.7 %, and Pandora by 1.2 %–33.2 %. The exceptions are MD for both aircraft and Pandora data and TX for Pandora data only. Changes are small in MD and Korea and large in CA and TX. The larger difference in TX is due to significant underestimation of NO2 by Pandora instruments. The correlation improves in MD and TX but is reduced in CA, CO, and Korea. These results suggest that downscaling helps explain some of the discrepancies between OMI, aircraft, and Pandora observations. Variations among campaign locations may also point to difficulty related to the fidelity of the CMAQ simulations.

Table A4.

Same as A2, but for OMIobs with downscaling (OMIDS).

Campaign	NCAR vs. OMI_SP		TD-LIF vs. OMI_Sp		Pandora vs. OMI_Sp
No. profs (Pandora)	Mean diff. (%)	r	Mean diff. (%)	r	Mean diff. (%)	r
MD21 (14)	−24.1	0.75	−18.0	0.85	0.8	0.31
CA 25 (22)	14.2	0.47	7.6	0.56	4.6	0.22
TX 28 (26)	9.5	0.94	−13.8	0.91	78.3	0.93
CO 26 (21)	−42.4	0.7	−37.7	0.71	−42.4	0.67
Korea 11 (5)	−32.8	0.73	−38.4	0.73	−52.1	0.48
All 111 (88)	−12.5	0.65	−18.0	0.68	−12.3	0.57

Figure 9 summarizes the comparison of OMI with aircraft and Pandora measurements. Here we present site mean columns observed from all measurements during the entire campaign periods. OMI captures the overall spatial variation in site means. In relatively cleaner places (NO2 VCD ≤ 5 × 1015 molec cm−2), OMI agrees well with NCAR and TD-LIF columns. OMI values are generally lower in polluted areas.

Figure 9.

Site mean tropospheric NO2 VCDs calculated from NCAR (blue), TD-LIF (orange), Pandora (green), and OMI (blue). The OMI data are derived using observed NO2 profiles and downscaled using high-resolution model simulations. The vertical bars represent the standard deviations.

Implications for satellite NO2 validations

NO2 measurements from a variety of instruments and techniques taken during the DISCOVER-AQ and KORUS-AQ field deployments provided a unique opportunity to assess correlative data and realize the strengths and limitations of the various measurements. Some of the techniques are still in a state of development and evaluation, and the data have not been fully validated. Additional complications arise when comparing measurements covering different areal extents. This is particularly true for a short-lived trace gas like NO2 that has a large spatial gradient, especially in the boundary layer. The NCAR and TD-LIF instruments onboard the same aircraft (P-3B during DISCOVER-AQ and DC-8 during KORUS-AQ) offer valuable insights on the vertical distribution of NO2, a critical piece of information needed for satellite retrievals. Despite their adjacent locations on the aircraft, they did not sample the same air mass throughout each profile due to their different NO2 measurement frequencies. Despite this, and even using independent measurement techniques with unique sources of uncertainties, NO2 measurements from the two instruments exhibit excellent correlation and very good agreement in most cases. However, varying discrepancies between the two instruments among campaigns with campaign-average differences reaching up to 31.9 % are unlikely to be related solely to the sampling issues; they are rather related to issues pertaining to measurement methods. It is crucial to reconcile these differences and improve the accuracy of these measurements for the meaningful validation and improved error characterization of satellite NO2 retrievals. In situ aircraft spirals miss significant portions of the tropospheric NO2 column, especially from the ground to the lowest level of the aircraft altitude, typically 200–300 m above ground level. In this analysis, we account for the missing portion above the aircraft profile by using coincidently sampled simulated NO2 profiles. For the portion below the aircraft profile we extrapolate to surface monitor data. The latter step can be a significant error source, given that it assumes spatial homogeneity over the spiral domain. Additional errors could come from the use of different types of monitors that were deployed during the DISCOVER-AQ and KORUS-AQ campaigns (see Sect. 2.1.2). In particular, NO2 data from molybdenum converter analyzers are biased high by variable amounts that are difficult to quantify and correct (e.g., Lamsal et al., 2008). The use of more accurate NO2 monitors, such as photolytic converter analyzers, together with balloon-borne NO2 sondes (Sluis et al., 2010) of similar accuracy would complement in situ aircraft profiles. While total column NO2 retrievals from the ground-based remote sensing Pandora instrument are useful to track temporal changes, their use for satellite validation or for comparing with aircraft spiral data can be onerous, particularly over locations with large NO2 spatial gradients, such as cities. Pandora’s field of view is so narrow that it serves as a point measurement. Additionally, Pandora data are subject to retrieval errors arising predominantly from the use of an incorrect reference spectrum as well as fixed temperature for the NO2 cross section in the spectral fitting procedure. Failure to apply a reference spectrum derived using weeks of measurements from the same site often yields systematic biases in the retrieved NO2 columns. Improved calibration and data processing are therefore needed to improve the Pandora data quality. Concurrent spatial NO2 observations from other ground-based (e.g., multi-axis differential optical absorption spectroscopy – MAX-DOAS; Vlemmix et al., 2010) or airborne (e.g., Geostationary Trace gas and Aerosol Sensor Optimization – GeoTASO; Nowlan et al., 2016; Judd et al., 2019) platforms would facilitate intercomparison among measurements of different spatial scales. The validation of NO2 observations from any satellite instrument, including OMI, is complicated by a variety of factors, principally the ground area covered by the instrument’s field of view. As discussed in Sect. 3.3, disagreement between partially (spatially and temporally) matched OMI NO2 and validation measurements made near sources may be reasonably anticipated and ought to be expected. Therefore, it may be necessary to use a proper validation strategy, such as downscaling of satellite data using either observed or modeled NO2 as presented in Fig. 8g–i and Table A4. It also underscores the need for comprehensive high-quality long-term observations for validation. Enhanced agreement with OMI retrievals revised using observed NO2 profiles is indicative of retrieval errors from model-based a priori vertical NO2 profile shapes (Fig. 8d–f, Table A3) and highlights the need for approaches to address the issue. Moreover, improved accuracy in other retrieval parameters, both surface and atmospheric, helps enhance the quality of satellite NO2 retrievals (Laughner et al., 2019; Vasilkov et al., 2017, 2018; Lorente et al., 2018; Lin et al., 2014, 2015; Liu et al., 2019; Noguchi et al., 2014; Zhou et al., 2011)

Conclusions

We conducted a comprehensive intercomparison among various NO2 measurements made during the five field deployments of DISCOVER-AQ and KORUS-AQ. The field campaigns were conducted in four US states (Maryland, California, Texas, and Colorado) and South Korea. The analyzed datasets were obtained from surface monitors, the NCAR and TD-LIF airborne instruments, ground-based Pandora instruments, and the space-based OMI. We investigated the data from 23 sites among the five campaigns when measurements from all these instruments were available. We focused on an analysis of tropospheric NO2 column amounts. NO2 mixing ratio measurements from the surface monitors and airborne instruments were merged and integrated to yield tropospheric columns, while the Pandora tropospheric columns were obtained by subtracting the OMI stratospheric column from Pandora total column observations. In order to compare OMI NO2 tropospheric columns with the available validation measurements, we used a combination of observed and simulated NO2 vertical profiles to recalculate tropospheric NO2 columns using the OMI Standard Product (OMNO2) version 3.1. To overcome the challenge of comparing OMI NO2 with its relatively large pixel size to the airborne and ground-based measurements with small spatial scales, we additionally applied a downscaling technique, whereby OMI tropospheric NO2 columns for each ground pixel are downscaled using high-resolution CMAQ (DISCOVER-AQ) or WRF-Chem (KORUS-AQ) model simulations. Therefore, the comparisons here include three kinds of OMI NO2 tropospheric columns: (1) OMI Standard Product, (2) OMI data recalculated using observed NO2 profiles, and (3) downscaled OMI NO2 data. The tropospheric columns from the NCAR and TD-LIF airborne instruments generally show good agreement, with a mean difference of 8.4 % and correlation coefficients in the 0.87–0.99 range. The Pandora columns also agree variably with the two airborne instruments, with the campaign-average difference in the range of 3 % to 54 %, but the correlation is not as good (r = 0.18–0.95) as between the two airborne instruments themselves. There are differences among the campaigns. In particular, all three instruments show the largest discrepancies in the TX campaign; TD-LIF is higher than NCAR by ~ 31.9 %, and Pandora data are lower by ~ 39 % and ~ 54 % compared to NCAR and TD-LIF measurements, respectively. All three OMI NO2 columns (Standard Product, based on observed NO2 profiles, and downscaled) exhibit good correlation with the airborne and ground-based measurements. In terms of quantitative agreement, the OMI SP column is smaller than airborne and ground-based measurements. Retrievals using observed NO2 profiles bring the OMI column closer to validation measurements. Applying downscaling to OMI data provides further improvement in agreement but little or insignificant change in correlation, perhaps due to the use of model simulations for downscaling. As discussed in Sect. 3.3, disagreement between the comparatively large OMI pixel and smaller-scale ground and aircraft measurements is to be expected due to the large spatial variability of NO2. Techniques such as the downscaling method shown here can reduce this discrepancy. However, the robust evaluation of NO2 tropospheric column retrievals is further confounded by the current lack of agreement among ground-based and in-situ measurements. Future validation strategies for satellite observations of tropospheric column NO2 will need to address these differences.

13 in total

1. Atmospheric NO2: in situ laser-induced fluorescence detection at parts per trillion mixing ratios

Authors:
Journal: Anal Chem Date: 2000-02-01 Impact factor: 6.986

2. Megacity emissions and lifetimes of nitrogen oxides probed from space.

Authors: Steffen Beirle; K Folkert Boersma; Ulrich Platt; Mark G Lawrence; Thomas Wagner
Journal: Science Date: 2011-09-23 Impact factor: 47.728

3. Increase in NOx emissions from Indian thermal power plants during 1996-2010: unit-based inventories and multisatellite observations.

Authors: Zifeng Lu; David G Streets
Journal: Environ Sci Technol Date: 2012-07-06 Impact factor: 9.028

4. Space-based constraints on spatial and temporal patterns of NO(x) emissions in California, 2005-2008.

Authors: Ashley R Russell; Lukas C Valin; Eric J Bucsela; Mark O Wenig; Ronald C Cohen
Journal: Environ Sci Technol Date: 2010-05-01 Impact factor: 9.028

5. Global partitioning of NOx sources using satellite observations: relative roles of fossil fuel combustion, biomass burning and soil emissions.

Authors: Lyatt Jaeglé; Linda Steinberger; Randall V Martin; Kelly Chance
Journal: Faraday Discuss Date: 2005 Impact factor: 4.008

6. Increase in tropospheric nitrogen dioxide over China observed from space.

Authors: Andreas Richter; John P Burrows; Hendrik Nüss; Claire Granier; Ulrike Niemeier
Journal: Nature Date: 2005-09-01 Impact factor: 49.962

7. The Modern-Era Retrospective Analysis for Research and Applications, Version 2 (MERRA-2).

Authors: Ronald Gelaro; Will McCarty; Max J Suárez; Ricardo Todling; Andrea Molod; Lawrence Takacs; Cynthia Randles; Anton Darmenov; Michael G Bosilovich; Rolf Reichle; Krzysztof Wargan; Lawrence Coy; Richard Cullather; Clara Draper; Santha Akella; Virginie Buchard; Austin Conaty; Arlindo da Silva; Wei Gu; Gi-Kong Kim; Randal Koster; Robert Lucchesi; Dagmar Merkova; Jon Eric Nielsen; Gary Partyka; Steven Pawson; William Putman; Michele Rienecker; Siegfried D Schubert; Meta Sienkiewicz; Bin Zhao
Journal: J Clim Date: 2017-06-20 Impact factor: 5.148

8. National satellite-based land-use regression: NO2 in the United States.

Authors: Eric V Novotny; Matthew J Bechle; Dylan B Millet; Julian D Marshall
Journal: Environ Sci Technol Date: 2011-04-26 Impact factor: 9.028

9. A measurement of total reactive nitrogen, NOy, together with NO₂, NO, and O₃ via cavity ring-down spectroscopy.

Authors: Robert J Wild; Peter M Edwards; William P Dubé; Karsten Baumann; Eric S Edgerton; Patricia K Quinn; James M Roberts; Andrew W Rollins; Patrick R Veres; Carsten Warneke; Eric J Williams; Bin Yuan; Steven S Brown
Journal: Environ Sci Technol Date: 2014-07-24 Impact factor: 9.028

10. Revising the slant column density retrieval of nitrogen dioxide observed by the Ozone Monitoring Instrument.

Authors: S Marchenko; N A Krotkov; L N Lamsal; E A Celarier; W H Swartz; E J Bucsela
Journal: J Geophys Res Atmos Date: 2015-06-04 Impact factor: 4.261

3 in total

1. Changes in the ozone chemical regime over the contiguous United States inferred by the inversion of NO_x and VOC emissions using satellite observation.

Authors: Jia Jung; Yunsoo Choi; Seyedali Mousavinezhad; Daiwen Kang; Jincheol Park; Arman Pouyaei; Masoud Ghahremanloo; Mahmoudreza Momeni; Hyuncheol Kim
Journal: Atmos Res Date: 2022-06-01 Impact factor: 5.369

2. Comprehensive evaluations of diurnal NO₂ measurements during DISCOVER-AQ 2011: effects of resolution-dependent representation of NO _x emissions.

Authors: Jianfeng Li; Yuhang Wang; Ruixiong Zhang; Charles Smeltzer; Andrew Weinheimer; Jay Herman; K Folkert Boersma; Edward A Celarier; Russell W Long; James J Szykman; Ruben Delgado; Anne M Thompson; Travis N Knepp; Lok N Lamsal; Scott J Janz; Matthew G Kowalewski; Xiong Liu; Caroline R Nowlan
Journal: Atmos Chem Phys Date: 2021-07-23 Impact factor: 7.197

3. Disentangling the Impact of the COVID-19 Lockdowns on Urban NO₂ From Natural Variability.

Authors: Daniel L Goldberg; Susan C Anenberg; Debora Griffin; Chris A McLinden; Zifeng Lu; David G Streets
Journal: Geophys Res Lett Date: 2020-09-05 Impact factor: 5.576

3 in total