How we measure background CO2 levels on Mauna Loa.

Pieter Tans and Kirk Thoning, NOAA Earth System Research Laboratory,
Boulder, Colorado
September, 2008


We have confidence that the CO2 measurements made at the Mauna Loa Observatory reflect truth about our global atmosphere. The main reasons for that confidence are:

  1. The Observatory near the summit of Mauna Loa, at an altitude of 3400 m, is well situated to measure air masses that are representative of very large areas.
  2. All of the measurements are rigorously and very frequently calibrated.
  3. Ongoing comparisons of independent measurements at the same site allow an estimate of the accuracy, which is generally better than 0.2 ppm.

Infrared absorption.

How does the CO2 analyzer work? Air is slowly pumped through a small cylindrical cell with flat windows on both ends. Infrared light is transmitted through one window, through the cell, through the second window, and is measured by a detector that is sensitive to infrared radiation. In the atmosphere carbon dioxide absorbs infrared radiation, contributing to warming of the earth surface. Also in the cell CO2 absorbs infrared light. More CO2 in the cell causes more absorption, leaving less light to hit the detector. We turn the detector signal, which is registered in volts, into a measure of the amount of CO2 in the cell through extensive and automated (always ongoing) calibration procedures.

Mole fraction in dry air

What do we need to measure? Most people assume that we measure the concentration of CO2 in air, and in communicating with the general public we frequently use that word because it is familiar. The quantity we actually determine is accurately described by the chemical term “mole fraction”, defined as the number of carbon dioxide molecules in a given number of molecules of air, after removal of water vapor. For example, 372 parts per million of CO2 (abbreviated as ppm) means that in every million molecules of (dry) air there are on average 372 CO2 molecules. The table below gives an example for 372 ppm CO2 in dry air. All species have been expressed as ppm, turning 78.09% nitrogen into 780,900 ppm. The rightmost column shows the composition of the same air after 3% water vapor has been added:

dry air3% wet air
Nitrogen780,900 757,473 ppm
Oxygen209,400 203,118
Water vapor 0 30,000
Argon 9,300 9,021
Carbon Dioxide 372 360.8
Neon 18 17.5
Helium 5 4.9
Methane 2 2
Krypton 1 1
trace species (each less than 1) 1 1
Total1,000,0001,000,000 ppm

Why do we express the abundance of CO2 as a mole fraction in dry air? The concentration of a gas is defined formally as the number of molecules per cubic meter. The goal of our measurements is to quantify how much CO2 has been added to, or removed from, the atmosphere. The concentration does not give us that information because it primarily depends on the pressure and temperature, and secondarily on how much the relative abundance of each gas has been diluted by water vapor, which is extremely variable. Only the dry mole fraction reflects the addition and removal of a gas species because its mole fraction in dry air does not change when the air expands upon heating or upon ascending to higher altitude where the pressure is lower. Nor does it change when water evaporates, or condenses into droplets. Why is this so important? Here is an example: The amount of CO2 is higher in the Northern than in the Southern Hemisphere as a result of the combustion of coal, oil, and natural gas. The measurement of this difference gives us crucial quantitative information about the emissions and removals of CO2. The concentration change produced by the addition of water vapor can be greater than the CO2 difference between the two hemispheres. In contrast, the difference in dry mole fraction does reflect the differences in emissions and removals between the hemispheres.

Calibration of the instrument.

The most important aspect of the measurements is the ongoing calibrations. Air flows continually through the instrument, after having first been dried in a cold trap where the water vapor freezes out as ice on the walls. Unfortunately, the absorption that we measure in the cell does not depend on the CO2 mole fraction, but on the total amount of CO2 in the cell. Therefore, we either have to extremely accurately control the temperature and pressure in the cell, as well as the flow rate, or we can control them less accurately while using frequent calibrations of the instrument with reference gas mixtures of CO2-in-dry-air spanning the expected range of the measurements. The reference gas mixtures are stored in high pressure aluminum cylinders. At Mauna Loa the calibration is done every hour by interrupting the flow of outside air through the cell, and replacing it with flows of three reference gas mixtures in succession, 5 minutes each. An example is shown in Figure 1. Each hour is divided up as follows:

Figure 1. Voltage output from the Mauna Loa CO2analyzer for November 17, 2006. The analyzer measures an hourly cycle of ambient air and three standard reference gases with known amounts of CO2.
  • minute 0 through 24: air intake line 1 (last 20 minutes is used)
  • minute 25-44: air intake line 2 (20 minutes used)
  • minute 45-49: high reference gas, which we call W3 (last 2 minutes is used)
  • minute 50-54: middle reference gas (W2) (last 2 minutes is used)
  • minute 55-59: low reference gas (W1) (last 2 minutes is used)

We know the mole fractions of CO2 in the reference gases very accurately. In this case they were 370.50 ppm (W1), 379.93 (W2), and 389.72 (W3). For each of these three we register a voltage, which establishes a relationship between voltage and CO2 mole fraction. The relationship is not linear because the sensitivity of the detector decreases at higher CO2 abundance, although the effect is not strong in the small range measured at Mauna Loa. We fit a quadratic function through the three points to provide a CO2 mole fraction number for every voltage in the range shown in Figure 1, from ~1.4 to ~2.3 volt. After each gas switch the first few minutes are ignored to eliminate switching transients. For example, the flow rate is 100 cc per minute, and it takes time for the previous air to be fully replaced with new air, and for the walls of the tubes, valves, and cell to equilibrate with the new mole fraction. During minutes 0 through 44 we use two separate air intake lines in succession, both from the top of a 38 m tall tower next to the observatory, to facilitate detection of a potential leak between in the intake lines bewteen the tower and the instrument. The tower was built to eliminate any influence on the measurements of human activities at the observatory.

Target gas.

The calibration strategy outlined above can be applied, with appropriate variations, to any separation/detection method, whether it is infrared absorption, gas chromatography, mass spectrometry, etc. Therefore the choice of measurement method is mostly one of convenience. The measurement results do depend very strongly on the control of the reference gas mixtures. If one of them has the wrong value, all results will be off. We provide for a check on the assigned values of the calibration mixtures by running what we call a “target gas”, treated as having an unknown mole fraction, every 25 hours through the measurement system. In reality the mole fraction of the target gas is very accurately known.

Figure 2. Voltage output from the Mauna Loa CO2 analyzer for November 17, 2006. The 'target gas' is being measured during hour 21. The different colors represent data that has been marked by the data selection criteria.

If one of the calibration gases has a wrongly assigned value the measurement of the target gas will produce the wrong result. An example of a daily target measurement is shown in Figure 2 where, during hour 21, the analyzer alternates between the three calibration gases and the target gas. The target gas has low mole fraction in this case, registering at ~1.1 volt. The repeat cycle of every 25 hours causes the target measurement to slowly move through the diurnal cycle because every day it is measured one hour later than on the previous day.

Data selection for background air.

In 1957 Dave Keeling, who was the first to make accurate measurements of CO2 in the atmosphere, chose the site high up on the slopes of the Mauna Loa volcano because he wanted to measure CO2 in air masses that would be representative of much of the Northern Hemisphere, and, hopefully, the globe. That goal has not changed. We still want to eliminate the influence of CO2 absorbed or emitted locally by plants and soils, or emitted locally by human activities. Dave Keeling also introduced the principle of a rigorous calibration strategy that we still employ today.

The observatory is surrounded by many miles of bare lava, without any vegetation or soil. This provides an opportunity to measure “background” air, also called “baseline” air, which we define as having a CO2 mole fraction representative of an upwind fetch of hundreds of km. Nearby emission or removal of CO2 typically produces sharp fluctuations, in space and time, in mole fraction. These fluctuations get smoothed out with time and distance through turbulent mixing and wind shear. A distinguishing characteristic of background air is that CO2 changes only very gradually because the air has been mixed for days, without any significant additions or removals of CO2. Another common word for emissions is “sources”, and for removals, “sinks”. Figure 2 shows an example of the data selection procedures we use to eliminate air that has likely been influenced significantly by nearby sources/sinks. First the time axis needs to be explained. The title of Figure 2 reflects that we keep date and time at the Mauna Loa Observatory in Universal Coordinated Time, abbreviated as UTC. UTC is the same across the entire world. It succeeded Greenwich Mean Time in 1961, and is widely used and unambiguous. Local time in Hawaii lags UTC by 10 hours, so that 10 am UTC corresponds to midnight locally. Figure 2 shows 17 November 2006 from midnight to midnight in Hawaii. In Figure 2 all hours are consecutively counted starting from 0 on 17 November 2006 UTC hour 0. Note that the last 10 hours (24-33), the afternoon and evening in Hawaii, are already part of the next day in UTC.

At Mauna Loa we use the following data selection criteria:

  1. The standard deviation of minute averages should be less than 0.30 ppm within a given hour. A standard deviation larger than 0.30 ppm is indicated by a “V” flag in the hourly data file, and by the red color in Figure 2.
  2. The hourly average should differ from the preceding hour by less than 0.25 ppm. A larger hour-to-hour change is indicated by a “D” flag in the hourly data file, and by the green color in Figure 2.
  3. There is often a diurnal wind flow pattern on Mauna Loa driven by warming of the surface during the day and cooling during the night. During the day warm air flows up the slope, typically reaching the observatory at 9 am local time (19 UTC) or later. The upslope air may have CO2 that has been lowered by plants removing CO2 through photosynthesis at lower elevations on the island, although the CO2 decrease arrives later than the change in wind direction, because the observatory is surrounded by miles of bare lava. In Figure 2 the downslope wind changed to upslope during hour 18. Upslope winds can persist through ~7 pm local time (5 UTC, next day, or hour 29 in Figure 2). Hours that are likely affected by local photosynthesis are indicated by a “U” flag in the hourly data file, and by the blue color in Figure 2. The selection to minimize this potential non-background bias takes place as part of step 4. At night the flow is often downslope, bringing background air. However, that air is sometimes contaminated by CO2 emissions from the crater of Mauna Loa. As the air meanders down the slope that situation is characterized by high variability of the CO2 mole fraction. In Figure 2, downslope winds resumed in hour 28. Hour 33 in Figure 2 is the first of an episode of high variability lasting 7 hours.
  4. In keeping with the requirement that CO2 in background air should be steady, we apply a general “outlier rejection” step, in which we fit a curve to the preliminary daily means for each day calculated from the hours surviving step 1 and 2, and not including times with upslope winds. All hourly averages that are further than two standard deviations, calculated for every day, away from the fitted curve ("outliers") are rejected. This step is iterated until no more rejections occur. These hours are indicated by an “A” flag in the hourly data file, and by the purple color in Figure 2, also indicated as “spline” in the legend. Spline is a curve fitting technique. Rejected hours occurring during times with upslope winds are given a “U” character in the data file.

Hours for which we do not have valid air measurement are flagged with an “I” character in the hourly data file, and by the orange color in Figure 2. The hours during which we do a target gas measurement are an example, but there are also other causes, such as zero flow in the analyzer.

Figure 3. The distribution of hours with background air after data selection.

No data are thrown away. Hourly means are calculated wherever possible, and how we use that data is indicated by the selection flags. Raw data are the voltages recorded for the air measurements as well as for the reference gas mixtures used for calibration and for the target gas.

On average over the entire record there are 13.6 retained hours per day with background CO2 mole fractions. The distribution is wide, as shown in Figure 3. Only those days with more than one remaining background hour have been plotted. There were zero days with 24 hours of background data after we introduced the target gas strategy, reducing the maximum number of background hours per day for almost all days to 23. Before we used the target gas we used 2 sets of reference gas mixtures, called working standards and station standards, as described in the references below. The number of days with 0 or 1 background hour comprises 6.5% of the total.

Auxiliary measured variables

System status variables measured and continuously recorded are: analyzer temperature, cold trap temperature, room temperature, sample flow rate, and both pressure and flow rate through both air intake lines from the tower.

How we calibrate the reference gases.

Since 1995 our laboratory has been the CO2 Central Calibration Laboratory of the World Meteorological Organization (WMO). Before 1995 that role was filled by the Scripps Institution of Oceanography of the University of California in San Diego. We maintain the WMO Mole Fraction Scale for CO2-in-air. The WMO Scale is based on 15 so-called Primary Standards that are calibrated in terms of fundamental quantities at intervals of ~1.5 year by manometric measurements. They have CO2 mole fractions that span the range of ambient air. The primaries are stored in high pressure aluminum cylinders. The Primary Standards themselves are only sparingly used, so that we can build up a calibration history for each of them over several decades. During such a calibration episode, lasting 2-3 months, each Primary is measured three times. Each time a sample of air is taken from the cylinder and its pressure and temperature inside a closed volume are measured very accurately, after temperature and pressure have stabilized in a temperature controlled environment. The volume of approximately 6 liters is accurately known. Once the pressure, temperature, and volume of a gas are known, the amount of gas can be calculated, taking the temperature dependent compressibility of the gases into account. Then the air, with the CO2 that is to be determined still in it, is slowly and completely flowed at low pressure over a cold trap cooled with liquid nitrogen. The condensate in the cold trap consists of carbon dioxide, nitrous oxide, and residual water vapor. The latter is low, corresponding to 1 ppm or so in the dry high pressure cylinder. After separation of the water vapor component, the carbon dioxide and nitrous oxide are transferred to a small volume, about 10 cc, which has also been very accurately calibrated. After stabilization, the pressure and temperature are once again recorded, and total amount of the two remaining gases calculated. At that point we have determined the combined mole fraction of carbon dioxide and nitrous oxide in the cylinder air. The latter typically comprises less than one thousandth of the total, and we correct for its contribution after measuring nitrous oxide separately on a gas chromatograph.

The accuracy of the WMO Scale has been estimated, based on the accuracy of the temperature, volume and pressure measurement, and the gas handling procedures, at 0.07 ppm. It has been compared several times to calibration scales established independently by other laboratories that were based on weighing a small amount of pure carbon dioxide that was then mixed into a large, also accurately weighed, amount of carbon dioxide-free air. We have also compared to the previous WMO Scale maintained by Scripps. These comparisons are compatible with our internal accuracy estimate.

The manometric calibrations are a very time consuming process, while we need to calibrate thousands of cylinders, many of which are used by other laboratories around the world. We transfer, or propagate, the WMO scale to other high pressure cylinders of CO2-in-air in a much more efficient way, by comparing them to each other on an infrared gas analyzer system. This is very similar to how we measure outside air by comparing the voltage response of unknown air to the response of air with known mole fractions. The repeatability of these transfer calibrations, done one week or more apart, is typically 0.01-0.02 ppm in the range of ambient air. The sequence is similar to that of target gas calibrations at the field observatories where all cylinders are measured repeatedly, one after the other, with the known gases interspersed between the unknown gases. Because we want to use the Primary Standards sparingly, we transfer the scale twice per year through these comparisons to Secondary Standards, which in turn get used to calibrate all other reference gases. The Secondaries typically last for 3-5 years.

An important aspect of the calibration strategy is that reference gas mixtures are not only calibrated before use, but also after their use when the pressure is low, but not close to zero. Reference gas mixtures are recalled for recalibration when ~20% of the air is still in the cylinder because experience has shown that the CO2 mole fraction tends to drift more when the pressure becomes low. Because the raw data consists of voltages, it is possible to re-calculate corrected mole fractions for the measured atmospheric air after it has become clear through recalibration that a particular reference gas has drifted. Fortunately this problem greatly diminished when we made the switch from using steel cylinders to aluminum cylinders.

Replication of the field measurements

We know from experience that careful calibration is necessary, but not sufficient, for making accurate measurements. There are many potential biases related to gas handling, drying, that are not covered through the calibration. The ultimate test of how accurate these measurements are likely to be is to compare them with other measurements, done independently, with different methods, and/or by different laboratories. At Mauna Loa we have both. In Figure 4 we show a comparison with two sets of grab samples of air in glass flasks, done weekly in pairs. One set, called “S”, are samples taken through the intake line used by the continuous analyzer, and diverted just before entering the analyzer. The second set, called “P”, are completely independent from the continuous analyzer system. They are taken with a battery operated pump, and extendable air intake mast, at a spot on the roof of one of the observatory buildings. The flask samples are sent to our laboratory in Boulder, Colorado, where they are analyzed together with thousands of other air samples from all over the world. There is a fair amount of “scatter” in the comparisons, partly because we are comparing a spot sample, a ~20 s average, to the hourly average of the continuous analyzer when the sample was taken. On average, the “P” flasks differ from the continuous analyzer by 0.11 ppm, and the “S” flasks by 0.05 ppm. Important are systematic biases that persist over a good part of a year and more, portrayed by the solid smooth lines fitted to the respective data sets. One can see that most of the time the biases are less than 0.2 ppm.

Figure 4. The figure shows differences between grab samples of air in glass flasks, and the corresponding hourly mean CO2 value from the Mauna Loa in-situ analyzer. There are two types of flask samples, the 'P' type collected using an independent flask sampling package (green pluses), and the 'S' type, collected using the in-situ analyzer air intake lines (black diamonds).

We have a similar comparison with the CO2 measurements performed by the Scripps Institution of Oceanography, the program started by David Keeling. They have always maintained their own manometric calibration scale, they use a different analyzer system, and their data selection techniques for selecting background air are different from, and independent of, our methods. They also compare on a regular basis with their own grab samples. A comparison of the Scripps monthly mean CO2 data (obtainable from shows that the average difference during 1974-2004 between the Scripps and our monthly means is 0.04 ppm, and that the standard deviation of the annual mean differences is 0.12 ppm.

The above comparisons give us confidence that the CO2 measurements are generally accurate to better than 0.2 ppm.

Observed variations of CO2 in the atmosphere.

We present two typical examples of measured CO2 mole fractions over the continent, to demonstrate what an excellent choice Mauna Loa is for making background measurements. Figure 5a shows one week in July 1999 of CO2 measurements made on a tall TV antenna in a forested area in Northern Wisconsin. Again, the time is recorded as UTC.

Figure 5a. Measurements of CO2 by a continuous in-situ analyzer at a tall tower during the summer in northern Wisconsin. The top plot show the CO2 values at six different heights on the tower.

In Wisconsin the local time is lagging UTC by 6 hours. The measurements are made at six height levels on the tower, from 11 m to 396 m above the ground. Near the ground there is a very large diurnal cycle which decreases with altitude. At sunset, when photosynthesis shuts down, the CO2 mole fraction shoots up, especially at the lowest levels, because plants and soils keep respiring during day and night, releasing CO2. Sunset occurs at around 1 am UTC, which equals 7 pm local time of the previous day at the tower. During the day photosynthesis is stronger than respiration, causing net removal of CO2 from the atmosphere, and the CO2 gradient along the height of the tower reverses. Now the 11 m level near the ground has the lowest CO2 mole fraction. The mole fraction differences between the levels are very small during the day because the air is vigorously mixed, typically up to altitudes of 1-2 km. During the night the ground cools and the atmosphere becomes stable, with little vertical mixing because the coldest (densest) air is near the ground. The respired CO2 is trapped in the stable boundary layer near the ground, which may have a thickness of only tens of meters. The buildup of respiratory CO2 near the ground is more strongly dependent on the atmospheric stability, driven by the weather, than on the rate of respiration. It is more difficult to quantify emissions/removals of CO2 over the continent than at background sites like Mauna Loa that average over very large areas. Figure 5a also shows slow variations of CO2 on the time scale of weather systems, several days to a week, with the wind bringing air masses from different directions.

Figure 5b presents one week in January which portrays a typical situation in the winter. There are no strong sources/sinks of CO2 in the vicinity of the tower, because we see fairly uniform mole fractions along the height of the tower. The variations on the time scale of weather systems remain, due to sources/sinks far away from the tower.

Figure 5b. Measurements of CO2 by a continuous in-situ analyzer at a tall tower during the winter in northern Wisconsin.

Further reading: