Environment module (Climate and AQ results from emissions) #69

aditeyashukla · 2026-01-05T13:10:09Z

Working on adding environment module to AEIC - essentially what APMT does (and ACAI as well)

TO DO for Environment Classs

Make full enviornment class
Make environment config with Climate and AQ configs defined
Make simple, serial climate, AQ calculations
Write tests

TO DO for Climate

TO DO for AQ

Adjoint sensitivities for PM2.5 and O3
Mortality and VSL calculations
Monetization/NPV

ian-ross

I think my primary comment here would be that you're breaking Rule Number 1, which is "Write programs for people, not computers" (direct quote from here). I don't want to give you a hard time about this at all, because I think you may never have been encouraged to think about things in this way before, but you should imagine some potential user of AEIC coming to this code and trying to understand what it is that you're trying to do.

You have lots of numbers taken from some sources, some of which are clearly physical constants, some of which are "soft" physical constants (e.g., radiative forcing response) whose values are arguable and which you might want to do sensitivity studies on, and some of which are values used in your model that might or might not be constants in the conventional sense, but in reality depend on the assumptions in the model. All of those different things are bundled together without any kind of explanation as to why they belong together or what they're for.

The computer doesn't care at all (or know about) the semantics of what you're doing. It doesn't care whether you give names to physical constants or cut and paste numbers from somewhere. But a human reader does care. When you write something like this, ask yourself "Could I give this to someone and have them understand the intent of what I'm doing without me needing to explain it in person?" If the answer is "No" or "Maybe, if they tried really really hard and already knew almost everything about what I'm doing", then you need to do more.

This is a different kind of problem than a software organization problem, because there simply isn't enough information in what you've written to assess whether the organization is good or not. I initially thought "What? Why are the specific heat and density of water in a configuration class? They're constants!", which led me to think that everything else in there was a constant (because there was no context or explanation). But reading further and thinking about it, it's clear that some things might be less well-established parameters and you might want to change them to see what effect they have. But I shouldn't have to puzzle those things out. They should be explained explicitly.

ian-ross · 2026-01-06T16:00:08Z

src/AEIC/environment/temperature_response.py

@@ -0,0 +1,122 @@
+from dataclasses import dataclass
+from functools import cached_property


You don't normally need this. The only reason for using @cached_property instead of @property is that the calculation of your property is expensive and you don't want to repeat the calculation. By using @cached_property, you also make an implicit promise that instances of your class are immutable, or at least the attributes that go into calculating the property are. The cost of that extra thinking is such that you normally do not want @cached_property.

ian-ross · 2026-01-06T16:05:39Z

src/AEIC/environment/temperature_response.py

+
+
+@dataclass(frozen=True)
+class deltaT_2box_output:


Class names in Python should be in CamelCase.

ian-ross · 2026-01-06T16:10:35Z

src/AEIC/environment/temperature_response.py

+import numpy as np
+
+
+# TODO: Import this into environment config and full Config


Why? It's not "configuration". It's a set of constant values. Some of these are physical constants that should live in AEIC.utils.consts (which should be renamed to AEIC.utils.constants, since saving 3 characters is kind of a waste of time!). The rest are constants taken from some unspecified references, but they're still constants. Or are they? You need to explain. The computer doesn't care, but a human reader does.

ian-ross · 2026-01-06T16:13:59Z

src/AEIC/environment/temperature_response.py

+    # Density of water
+    rho = 1000  # kg/m^3
+
+    # Specific heat of Mixed Layer (70m)


Where does this come from? Since it's immediately following definitions of the specific heat and density of water, you might be led to expect that it should be c_water * rho * h with h being 70m. But it's not... Without explanation and references, these are just magic numbers.

ian-ross · 2026-01-06T16:17:17Z

src/AEIC/environment/temperature_response.py

+    Uses iterative forward stepping through time
+    """
+    years = len(RF)
+    sec_per_yr = 60 * 60 * 24 * 365.25  # seconds per year


Should be a constant in AEIC.utils.units.

ian-ross · 2026-01-06T16:19:54Z

src/AEIC/environment/temperature_response.py

+        """
+        return self._get_alpha(self.C1)
+
+    @cached_property


Use @property.

ian-ross · 2026-01-06T16:31:28Z

tests/test_environment.py

+from AEIC.environment.temperature_response import deltaT_2box, deltaT_config
+
+
+def test_2box_deltaT():


Can you describe what invariants or properties of your code this is testing? The asserts lower down where you're testing the values of properties of your "config" structure are just kind of silly: unless you did the calculations that produced the numbers you pasted into this file in a completely independent manner, what is it that you're actually testing? And similarly, the "big test" is just results copied in from somewhere else. That can be OK, if you say where those results came from and why they're a reasonable test comparison. But just dumping a bunch of numbers in a test file like this is unhelpful. The cases where this sort of test can be useful is for checking that results don't change as you refactor code (people often call these "golden tests") or for checking that you really are duplicating the results of a completely independent implementation of whatever you're doing. But in either of those cases, you need to say very explicitly what you're doing and why anyone should care.

Two other things: 1. You're testing two different things here. Split the test function. and 2. If you do do golden tests or any other data based tests, don't dump the data into your Python code like this. Store it in an external file instead. But you almost certainly shouldn't be doing that here anyway. (What functionality of your model function is tested better by having 100 timesteps than by having 2?)

ian-ross · 2026-01-06T16:46:27Z

src/AEIC/environment/temperature_response.py

+    # Equilibrium Climate Sensitivity (ECS)
+    # The equilibrium temperature change (K or C) that results from doubling CO2
+    # Typical range**: 1.5 - 4.5 K (IPCC AR5)
+    ECS = 4.0  # K


So why pick this value?

ian-ross · 2026-01-06T16:46:59Z

src/AEIC/environment/temperature_response.py

+    # Radiative Forcing from CO₂ Doubling
+    RF2xCO2 = 3.93  # W/m^2
+
+    # Advective mass flux of water from boundary layer


These are values assumed in the model. Where do they come from?

ian-ross · 2026-01-06T16:47:09Z

src/AEIC/environment/temperature_response.py

+    # Typical range**: 1.5 - 4.5 K (IPCC AR5)
+    ECS = 4.0  # K
+    # Radiative Forcing from CO₂ Doubling
+    RF2xCO2 = 3.93  # W/m^2


Where does this come from?

ian-ross

I like very much that you have started to document what you're thinking about here.

(Tongue firmly in cheek for the following, but I hope you'll get the idea...)

I do think that you're falling into what I can't help thinking of as "software Stalinism" again though! You want to put all of the things into one big class over which you have absolute control. Instead, we need more "software Maoism": "let a million functions bloom".

At this point, I would suggest that you take one of the things you need to calculate and just implement it in the simplest possible way. I would start with radiative forcing, so that would be a function called calculate_radiative_forcing that takes a set of emission time series and produces as output time series of radiative forcing. Don't make any configuration classes or anything like that, just that one function and the things you need to be able to call it: some representation of time series, some representation of aggregated emissions, and so on.

See how little additional infrastructure you can do with, and see how clear and legible you can make what you write.

ian-ross · 2026-01-16T12:56:28Z