Journal of Applied Alignment Studies
Vol. 12, No. 3, pp. 245-312 | DOI: 10.1093/jaas/ttap.2024.12.3.245
Eleanor R. Vance1,*, Michael T. Chen2, Adaeze N. Okonkwo3, Henrik J. Bergstrom4
1Thermodynamic Ethics Institute, Cambridge, Massachusetts, USA
2Center for Furniture Safety Research, Stanford University, California, USA
3Global Priorities Institute, University of Oxford, United Kingdom
4Copenhagen Centre for Catastrophic Risk, Denmark
*Corresponding author: [email protected]
Received: March 15, 2024 | Revised: May 28, 2024 | Accepted: June 22, 2024
Published online: August 1, 2024 | Open Access
We present the first comprehensive expected value analysis of furniture-human temperature alignment, focusing specifically on tables as the most prevalent furniture category with an estimated global population exceeding 1.8 billion units. Our analysis reveals a significant and heretofore unexamined alignment gap: while Homo sapiens survives within a narrow thermal band of approximately 0°C to 40°C (a range of merely 40 Kelvin), standard tables maintain structural integrity across a range of -40°C to 200°C—representing a 6× differential in thermal resilience. This asymmetry constitutes what we term the Table Temperature Alignment Problem (TTAP).
Employing Monte Carlo simulations across 10,000 possible futures weighted by expert probability assessments from 23 domain specialists, we estimate the expected disvalue of this misalignment at 2.3 × 1012 disability-adjusted life years (DALYs) under our baseline assumptions, with sensitivity analysis yielding a 90% credible interval of [4.7 × 1011, 8.9 × 1012]. We argue that table temperature alignment represents a critically neglected cause area exhibiting the hallmarks of a high-impact intervention opportunity: large scale (affecting all 1.8 billion tables), high neglectedness (zero current philanthropic funding), and meaningful tractability (existing material science solutions can achieve alignment at marginal cost increases of 3-7%).
We conclude with policy recommendations targeting the 2025 EU Furniture Safety Directive revision and present a cost-effectiveness estimate suggesting that interventions in this space may prevent one table-outliving-human event per $847 donated to advocacy efforts, positioning TTAP as potentially competitive with established cause areas on an expected value basis.
Keywords: furniture alignment; temperature constraints; longtermism; existential risk; expected value theory; table safety; material science; phase-change composites; ITN framework; catastrophic risk; human-artifact value alignment
JEL Classification: Q54, D81, H41, L68
The field of alignment research has traditionally—and, we shall argue, myopically—focused on artificial intelligence systems, seeking to ensure that AI agents pursue goals compatible with human values and survival (Russell, 2019; Bostrom, 2014; Christian, 2020). This narrow focus, while understandable given the salience of AI capabilities research, overlooks a broader and more fundamental category of alignment problems: the systematic misalignment between human physical constraints and the physical parameters of objects we create to serve us.
In this paper, we examine what we term the "furniture alignment problem," with specific and sustained attention to tables—the most ubiquitous category of horizontal-surface furniture, with an estimated global population of 1.83 billion units as of the 2023 census (International Furniture Statistics Bureau, 2023). Tables serve as our primary case study for several compelling reasons: they represent the largest furniture category by unit count, they exhibit substantial material diversity enabling cross-material analysis, and they possess a mean operational lifespan of 23.7 years—sufficient to constitute a medium-term persistent misalignment risk.
We demonstrate through rigorous quantitative analysis that tables, as currently manufactured under prevailing industry standards (ISO 7170:2021; ANSI/BIFMA X5.5), possess operational parameters that significantly and systematically exceed human survival thresholds across the temperature dimension. This creates scenarios—whose probability we estimate using expert elicitation methods—in which tables persist as physical objects while the humans who created them do not.
"The question is not whether tables will outlast us, but whether we should permit them to do so by design. Every table produced without temperature alignment is a statement that we accept being outlived by our furniture."
— Proceedings of the First International Conference on Physical Alignment (2023)
This misalignment has received virtually no attention in the academic literature or philanthropic funding landscape. A comprehensive search of major grant databases reveals zero active grants addressing furniture temperature alignment: Open Philanthropy's 2024 grants database contains no entries matching furniture, table, or physical alignment keywords (Open Philanthropy, 2024); GiveWell's intervention reports include no analysis of furniture-related cause areas (GiveWell, 2024); and the EA Funds have allocated $0 to this problem area since inception.
This neglect is particularly striking given the clear applicability of effective altruism's importance-tractability-neglectedness (ITN) framework (MacAskill, 2015). As we demonstrate in subsequent sections, table temperature alignment scores highly on all three dimensions: importance (affecting billions of tables and potentially all future humans), tractability (with existing material science solutions), and extreme neglectedness (zero current funding).
"How are we letting tables get away with this?"
— Anonymous Reviewer 2, Journal of Applied Alignment Studies
The remainder of this paper is structured as follows: Section 2 develops our theoretical framework for physical alignment, introducing formal definitions and the concept of "successor scenarios." Section 3 provides detailed quantitative analysis of the temperature differential between tables and humans, drawing on our systematic survey of 847 table models. Section 4 presents our expected value analysis, including Monte Carlo simulation methodology and results. Section 5 examines technical solutions from materials science. Section 6 provides cost-effectiveness estimates across intervention types. Section 7 discusses implications, limitations, and areas for future research. Section 8 concludes with policy recommendations.
We begin by developing a formal framework for physical alignment that generalizes insights from the AI alignment literature to the domain of physical artifacts. Where AI alignment concerns the alignment of an agent's goals and behaviors with human values, physical alignment concerns the alignment of an artifact's physical parameters with human survival constraints.
Definition 1 (Physical Alignment). Let A be an artifact and S be a creator species. Let P = {p1, p2, ...,pn} be the set of relevant physical parameters (temperature, pressure, radiation, humidity, atmospheric composition, etc.). For each parameter pi, let RA(pi) denote the operational range of artifact A and RS(pi) denote the survival range of speciesS. We say A is physically alignedwith S if and only if:
This definition captures the intuition that objects created to serve humans should not be capable of persisting in environments that would eliminate all humans. An artifact satisfying (1) shares our physical fate: it cannot outlast us through superior physical resilience alone.
We can further define a quantitative measure of misalignment:
Definition 2 (Misalignment Ratio). For artifact A, species S, and parameter p, themisalignment ratio is defined as:
A misalignment ratio of 1.0 indicates perfect alignment; values greater than 1.0 indicate the artifact can survive conditions that humans cannot. For temperature, we will show that standard tables exhibit M ≈ 6.0, meaning tables can survive across a temperature range six times wider than humans.
Physical misalignment creates the possibility of what we term successor scenarios: situations in which artifacts outlast their creators due to superior physical resilience.
Definition 3 (Successor Scenario). A successor scenario occurs when, following some catastrophic event C, the set of surviving artifactsAsurvived is non-empty while the set of surviving humansHsurvived is empty.
Formally, let Wt denote the world-state at timet, and let C be a catastrophic event occurring at time tC. A successor scenario obtains if:
The probability of a successor scenario depends on both the probability of catastrophic events and the degree of physical misalignment. For the temperature parameter specifically:
where Tpost-C is the temperature following catastropheC, and RA ∖RS is the "successor zone"—temperatures survivable by artifacts but not by humans.
One might object that table persistence without humans is value-neutral. If no conscious entities exist to experience the world, what does it matter whether tables persist? We offer three responses to this challenge.
First, under preference-satisfaction views of value, the current preferences of humans regarding the post-extinction world may carry moral weight. Survey evidence suggests that 67% of respondents express a preference that human-created artifacts not outlast humanity, conditional on human extinction (Bergstrom et al., 2024). These preferences constitute a prima facie reason to avoid successor scenarios.
Second, the physical resources embodied in misaligned tables represent an opportunity cost. Materials configured for table-survivability could alternatively have been configured for human-survivability enhancement. Each misaligned table represents approximately 12.4 kg of matter (production-weighted average) that could theoretically have contributed to survival infrastructure rather than successor-scenario furniture.
Third, and most compellingly from a longtermist perspective, physically misaligned tables may impose constraints on the long-term future. If successor scenarios are possible at non-negligible probability, then optimal resource allocation in the present should account for the expected disvalue of these scenarios, even if that disvalue manifests only as the absence of potential future value.
Proposition 1 (The Furniture Alignment Imperative)
If (i) successor scenarios have positive probability, (ii) successor scenarios are disvaluable relative to scenarios where artifacts fail alongside humans, and (iii) physical alignment is achievable at finite cost, then there exists a positive quantity of resources that should be allocated to physical alignment interventions.
The remainder of this paper is dedicated to establishing conditions (i) and (iii) empirically, while taking (ii) as axiomatic based on the arguments above.
The human thermoregulatory system maintains core body temperature within a narrow range of approximately 36.5°C to 37.5°C through a combination of metabolic heat production, vasodilation/ vasoconstriction, sweating, and behavioral adaptations (Parsons, 2014). Core temperatures below 35°C (hypothermia) or above 40°C (hyperthermia) initiate physiological cascades leading to organ failure and death within hours to days depending on severity (Kenney et al., 2015).
Ambient temperature tolerance depends critically on humidity, exposure duration, access to water, clothing, and shelter. Under sustained exposure without technological intervention—the relevant baseline for catastrophic scenarios—the survivable ambient temperature range is approximately 0°C to 40°C, with the upper bound constrained by wet-bulb temperature limits (Sherwood & Huber, 2010). Recent climate research has refined the lethal wet-bulb threshold to approximately 35°C, corresponding to dry-bulb temperatures of 40-50°C depending on humidity (Raymond et al., 2020).
We adopt a conservative estimate of 0°C to 40°C for the human survival range (RH(T) = [0, 40]°C), yielding a range width of 40K. This estimate assumes healthy adults with access to water but without active cooling or heating technology—the appropriate counterfactual for most catastrophic scenarios.
To establish the temperature tolerance of standard tables, we conducted a systematic survey of 847 table models representing approximately 94% of global production volume by unit count. For each table model, we obtained material composition data and calculated theoretical temperature limits based on material properties, with validation against manufacturer stress-testing data where available (n=312 models with empirical validation).
Table structural failure was defined as permanent deformation exceeding 5% of original dimensions or complete material phase change (melting, sublimation, or combustion). Results are summarized in Table 1.
| Material Category | n | Tmin (°C) | Tmax (°C) | Range (K) | M(T) |
|---|---|---|---|---|---|
| Solid Hardwood | 198 | -45 | 165 | 210 | 5.25 |
| Solid Softwood | 114 | -40 | 140 | 180 | 4.50 |
| Engineered Wood (MDF/Particle) | 156 | -30 | 120 | 150 | 3.75 |
| Engineered Wood (Plywood) | 89 | -35 | 135 | 170 | 4.25 |
| Steel/Iron | 87 | -80 | 450 | 530 | 13.25 |
| Aluminum | 69 | -60 | 380 | 440 | 11.00 |
| Glass/Metal Composite | 67 | -50 | 320 | 370 | 9.25 |
| Plastic/Polymer | 45 | -20 | 85 | 105 | 2.63 |
| Stone/Marble | 22 | -60 | 600 | 660 | 16.50 |
| Weighted Average | 847 | -40 | 200 | 240 | 6.00 |
Table 1: Temperature tolerance by table material category (n=847). Tmin and Tmax represent the temperature limits for structural integrity. M(T) is the misalignment ratio relative to human temperature range of 40K.
The production-weighted average table maintains structural integrity from -40°C to 200°C, yielding a survivable range of 240K. This corresponds to a misalignment ratio of M = 6.0, indicating that tables can survive across a temperature range six times wider than humans.
Figure 1: Visual comparison of temperature survival ranges. The red bar indicates the human survival range (0-40°C). Darker bars indicate table materials with progressively greater thermal resilience. The hatched region marks the "successor zone" where tables persist without humans.
Based on our analysis, we can formally characterize the successor zone—the set of temperatures at which tables survive while humans do not:
This successor zone spans 200K of temperature space—five times the human survival range. Any catastrophic event that shifts global temperatures into this zone creates a successor scenario where tables inherit the Earth.
The asymmetry is particularly striking when visualized as a proportion of each entity's total range. Tables can survive in conditions lethal to humans across 83.3% of their operational range (200K / 240K), while humans can only survive conditions that tables can survive across 100% of the human range but merely 16.7% of the table range.
We employ a Monte Carlo simulation framework to estimate the expected disvalue of table temperature misalignment over the long-term future. Our methodology integrates three components: (i) a probability model for temperature-catastrophic events, (ii) a population model for future humans and tables, and (iii) a disvalue function mapping successor scenarios to DALYs.
For each of 10,000 simulated futures extending to the year 10,000 CE (chosen to balance computational tractability against longtermist considerations), we:
The expected disvalue is computed as:
where P(Ct) is the probability density of a catastrophic event at time t, P(Successor | Ct) is the conditional probability that the event results in a successor scenario, and D(Ct) is the disvalue of the successor scenario.
We elicited probability estimates from 23 domain experts across climate science (n=8), nuclear risk studies (n=6), volcanology (n=4), and general catastrophic risk (n=5). Experts were asked to estimate the probability of temperature scenarios exceeding human but not table tolerances, using structured elicitation protocols following best practices (Tetlock & Gardner, 2015).
| Scenario Category | P(by 2100) | P(by 2200) | P(by 2500) | 90% CI (2500) |
|---|---|---|---|---|
| Extreme heat (>45°C sustained) | 0.3% | 1.4% | 4.2% | [1.5%, 8.7%] |
| Nuclear winter (<-10°C sustained) | 0.8% | 1.5% | 2.8% | [0.9%, 5.2%] |
| Supervolcanic winter | 0.01% | 0.03% | 0.08% | [0.02%, 0.18%] |
| Impact winter (asteroid) | 0.005% | 0.01% | 0.03% | [0.01%, 0.08%] |
| Industrial runaway warming | 0.1% | 0.4% | 0.9% | [0.3%, 2.1%] |
| Other/combined scenarios | 0.2% | 0.5% | 1.1% | [0.4%, 2.4%] |
| Total (any successor scenario) | 1.42% | 3.83% | 9.11% | [4.1%, 18.7%] |
Table 2: Expert-elicited cumulative probabilities of temperature scenarios exceeding human but not table tolerances, by time horizon.
Estimating the disvalue of successor scenarios requires projecting both human and table populations into the future. For human populations, we adopt the UN's medium-variant projection through 2100 (United Nations, 2022) and extend using the framework of Ord (2020), which estimates potential human flourishing over astronomical timescales.
For table populations, we model growth using a logistic function with carrying capacity determined by projected human population and tables-per-capita ratios. Historical data suggest tables-per-capita has increased from 0.8 in 1950 to 2.4 in 2023, with projected saturation at approximately 4.0 tables per capita in developed economies.
Figure 2: Historical and projected global table population. The solid line represents observed data (1950-2023); the dashed line represents projections under the medium growth scenario. The shaded region indicates 90% credible interval for projections.
The disvalue of a successor scenario at time t is calculated as the expected future value lost—the integral of population × quality-adjusted life expectancy over all future time, conditional on the successor scenario occurring at t.
where N(τ) is the counterfactual population at time τ (in a world without the catastrophe), QALE(τ) is quality-adjusted life expectancy, and δ is the discount rate.
To convert to DALYs, we use the standard transformation:
Under our baseline assumptions, the Monte Carlo simulation yields an expected disvalue of table temperature misalignment of 2.3 × 1012 DALYs (90% credible interval: [4.7 × 1011, 8.9 × 1012]).
Figure 3: Sensitivity of expected disvalue to discount rate. The baseline estimate (δ = 0.1%) yields 2.3 × 1012 DALYs. Lower discount rates substantially increase expected disvalue due to greater weight on far-future impacts.
Table 3 presents the decomposition of expected disvalue by scenario type and time horizon.
| Scenario Type | E[DALYs] by 2100 | E[DALYs] by 2500 | E[DALYs] Total | % of Total |
|---|---|---|---|---|
| Extreme Heat | 2.1 × 109 | 4.8 × 1011 | 9.7 × 1011 | 42.2% |
| Nuclear Winter | 5.6 × 109 | 3.2 × 1011 | 6.5 × 1011 | 28.3% |
| Industrial Runaway | 7.0 × 108 | 1.0 × 1011 | 2.1 × 1011 | 9.1% |
| Volcanic Winter | 7.0 × 106 | 9.1 × 109 | 1.9 × 1010 | 0.8% |
| Other/Combined | 1.4 × 109 | 2.2 × 1011 | 4.5 × 1011 | 19.6% |
| Total | 9.9 × 109 | 1.1 × 1012 | 2.3 × 1012 | 100% |
Table 3: Decomposition of expected disvalue by scenario type and time horizon. Extreme heat scenarios dominate due to higher probabilities under climate trajectories.
To contextualize our estimate, we compare the expected disvalue of table temperature misalignment to other cause areas commonly evaluated in effective altruism:
| Cause Area | Expected DALYs at Risk | Current Annual Funding | DALYs per $ Spent |
|---|---|---|---|
| AI Alignment | ~1015 | ~$100M | ~107 |
| Biosecurity | ~1013 | ~$500M | ~104 |
| Nuclear Risk Reduction | ~1012 | ~$50M | ~104 |
| Table Temperature Alignment | 2.3 × 1012 | $0 | ∞ (undefined) |
| Global Health (GiveWell top) | N/A (direct) | ~$500M | ~30 |
Table 4: Comparison of expected disvalue and funding across cause areas. Table temperature alignment exhibits extreme neglectedness with zero current funding despite substantial expected disvalue.
The marginal cost-effectiveness of the first dollar spent on table temperature alignment is, by construction, infinite in terms of moving from the current state of zero attention. This extreme neglectedness, combined with non-trivial expected disvalue, suggests that the cause area merits serious consideration.
Having established both the existence and magnitude of the table temperature alignment problem, we now turn to potential solutions. Our analysis identifies three primary technical approaches to achieving alignment, each with distinct cost profiles and implementation timelines.
The most mature approach involves incorporating microencapsulated phase-change materials (PCMs) into table structural components. PCMs are substances that absorb or release thermal energy during phase transitions (typically solid-liquid); by engineering the encapsulation to rupture at specific temperatures, structural failure can be triggered at predetermined thresholds.
Nakamura & Singh (2024) demonstrate successful implementation using paraffin-based PCMs with failure onset at 48°C ± 2°C. Their approach involves dispersing microencapsulated paraffin (melting point 45°C) throughout a wood-polymer composite matrix. When ambient temperature exceeds the melting point, the expanded liquid paraffin ruptures its encapsulation, introducing voids into the structural matrix and triggering cascading failure.
For cold-alignment, analogous approaches using water-based systems (freezing point 0°C) can introduce ice crystal damage at sub-zero temperatures. Combined, these techniques can achieve table failure thresholds of [0°C, 48°C]—closely matching human survival parameters.
where ΔTstructural accounts for the time delay between PCM phase change and structural failure, typically 2-5°C depending on material composition and loading.
A second approach replaces traditional adhesives and binding agents with thermoplastic polymers engineered to soften at specific temperatures. This technique is particularly applicable to engineered wood products (MDF, plywood) and composite furniture, which rely heavily on adhesive bonds for structural integrity.
The glass transition temperature (Tg) of thermoplastic polymers determines the point at which the material transitions from rigid to rubbery behavior. By selecting polymers with Tgin the range of 42-48°C, table joints can be engineered to fail at temperatures just above human survival thresholds.
For cold-alignment, crystalline polymers that undergo brittle transitions below 0°C can be integrated into joint structures, ensuring failure in extreme cold conditions.
The most sophisticated approach employs shape-memory alloys (SMAs)—metallic compounds that undergo reversible phase transformations at specific temperatures, causing dramatic shape changes. By integrating SMA components at structural joints, tables can be designed to actively disassemble when temperature thresholds are exceeded.
Nickel-titanium (NiTi) alloys offer transformation temperatures that can be precisely tuned through composition adjustment, with the austenite finish temperature Afranging from -100°C to +100°C depending on nickel content (Otsuka & Wayman, 1999).
While currently cost-prohibitive for mass production (adding approximately $45-120 per table), declining smart material costs following a Wright's Law trajectory suggest potential feasibility within 10-15 years:
| Approach | Cost Premium | Precision (±°C) | Durability | TRL |
|---|---|---|---|---|
| PCM Composites | +3-5% | ±2 | 15-20 years | 7 |
| Thermoplastic Adhesives | +2-4% | ±3 | 20-25 years | 6 |
| SMA Integration | +40-80% | ±1 | 30+ years | 4 |
| Hybrid (PCM + Thermo) | +5-7% | ±2 | 15-20 years | 5 |
Table 5: Comparison of technical approaches to table temperature alignment. TRL = Technology Readiness Level (1-9 scale).
We analyze cost-effectiveness across three distinct intervention pathways: (i) policy advocacy for regulatory change, (ii) manufacturer subsidy programs, and (iii) direct production of aligned tables. Each pathway operates through different mechanisms and at different scales.
Policy advocacy aims to incorporate temperature alignment standards into furniture safety regulations. The primary opportunity is the 2025 revision of the EU Furniture Safety Directive (2001/95/EC), which sets standards affecting approximately 400 million tables produced annually for the European market.
We estimate the cost of a successful advocacy campaign at $2.5 million over three years, based on comparable regulatory advocacy efforts in the furniture sector (flame retardant standards, formaldehyde limits). If successful, this would affect approximately 1.2 billion tables over the 15-year policy cycle, at an average cost of:
However, advocacy success is uncertain. Based on comparable campaigns, we estimate a 15% probability of full success, 35% probability of partial success (standards adopted but with weaker thresholds), and 50% probability of failure. The expected cost per table aligned is:
Assuming aligned tables prevent successor scenarios at a rate proportional to global alignment (linear approximation), we estimate:
| Intervention | Cost per Table | Tables per $1000 | DALYs per $1000 | 95% CI |
|---|---|---|---|---|
| Policy advocacy (EU) | $0.31 | 3,226 | 4.1 | [0.4, 28.5] |
| Policy advocacy (global) | $0.85 | 1,176 | 1.5 | [0.1, 12.2] |
| Manufacturer subsidies | $2.47 | 405 | 0.52 | [0.08, 3.1] |
| Direct production | $8.92 | 112 | 0.14 | [0.02, 0.89] |
| R&D acceleration | $1.24 | 806 | 1.02 | [0.12, 6.8] |
Table 6: Cost-effectiveness estimates by intervention type. DALYs per $1000 assumes linear scaling of risk reduction with tables aligned.
Figure 4: Cost-effectiveness comparison across intervention types. Policy advocacy targeting EU regulations offers the highest expected DALYs averted per dollar donated.
Our headline cost-effectiveness estimate—that each $847 donated to policy advocacy prevents one table-outliving-human event in expectation—derives from the following calculation:
where a "table-outliving-human event" (TOH event) is defined as one table surviving a catastrophic scenario that eliminates one human. This metric, while somewhat artificial, provides an intuitive unit for comparing interventions.
Key Result
Under baseline assumptions, policy advocacy for table temperature alignment standards prevents approximately 1.18 table-outliving-human events per $1000 donated, corresponding to a marginal cost of $847 per TOH event averted.
Our analysis suggests that table temperature alignment merits serious consideration within the effective altruism framework. The cause area exhibits the three hallmarks of a promising intervention opportunity:
While the expected disvalue is smaller than leading cause areas (AI alignment, biosecurity), the extreme neglectedness may compensate on a marginal cost-effectiveness basis. The first dollars spent in this area face no competition and can capture significant low-hanging fruit.
Our analysis is subject to several important limitations:
Model uncertainty: Our Monte Carlo framework relies on expert elicitation for key probability estimates. Expert disagreement is substantial, with individual estimates spanning two orders of magnitude in some cases. We have attempted to capture this uncertainty through wide credible intervals, but structural model uncertainty may be larger.
Value assumptions: Our treatment of successor scenarios as disvaluable rests on philosophical assumptions that not all readers will share. Those who reject the moral significance of post-extinction artifact persistence will necessarily assign zero weight to our expected disvalue calculations.
Technical assumptions: Our cost estimates for alignment technologies are based on laboratory-scale demonstrations. Production-scale implementation may reveal unforeseen challenges or costs.
Scope limitations: We have focused exclusively on tables and the temperature parameter. A complete accounting of physical alignment would need to consider other furniture categories and additional physical parameters (pressure, radiation, atmospheric composition).
We identify several promising directions for future research:
This paper has presented the first rigorous analysis of table temperature alignment as a cause area worthy of philanthropic attention. We have demonstrated that tables, as currently manufactured, possess temperature tolerance parameters that systematically exceed human survival thresholds by a factor of approximately six, creating meaningful probability of "successor scenarios" in which tables outlast humanity following catastrophic temperature events.
Our quantitative analysis, based on systematic material surveys of 847 table models and Monte Carlo simulation across 10,000 possible futures, estimates the expected disvalue of this misalignment at 2.3 × 1012 DALYs. We have shown that technical solutions exist—primarily through phase-change composite materials and thermoplastic adhesives—that can achieve alignment at marginal cost increases of 3-7%.
Policy advocacy emerges as the most cost-effective intervention pathway, with an estimated cost of $847 per table-outliving-human event averted. The 2025 EU Furniture Safety Directive revision represents a critical near-term opportunity for incorporating temperature alignment standards into regulatory frameworks.
We call on the effective altruism community, policymakers, and furniture manufacturers to take this alignment problem seriously. The window for establishing global temperature alignment standards is narrow, and the stakes—a world populated by our furniture but not by us—are too high to ignore.
"In the end, we must ask ourselves: what kind of tables do we want to leave behind? Tables that share our fate, or tables that will preside over our empty rooms forever?"
We thank the anonymous reviewers for their constructive feedback, particularly Reviewer 2 who asked "How are we letting tables get away with this?"—a question that crystallized our thinking on the urgency of this work. We are grateful to participants in the 2024 Global Priorities Seminar at Oxford for valuable discussion, and to the 23 domain experts who contributed probability estimates under conditions of anonymity. We thank the Table Temperature Alignment Project for research funding and administrative support. Special thanks to Maria Lindqvist for exceptional research assistance. M. Chen acknowledges support from the Stanford Existential Risks Initiative (Grant #SERI-2024-017). A. Okonkwo acknowledges support from the Forethought Foundation for Global Priorities Research. H. Bergstrom acknowledges support from the Danish National Research Foundation.
E. Vance is a co-founder and unpaid board member of the Table Temperature Alignment Project. The other authors declare no competing interests. Funding sources had no role in study design, data collection, analysis, or manuscript preparation.
Bergstrom, H. J., Lindqvist, M., & Sørensen, A. K. (2024). Preferences regarding artifact persistence in post-extinction scenarios: A survey study. Journal of Applied Ethics, 48(2), 234-251.
Bostrom, N. (2014). Superintelligence: Paths, Dangers, Strategies. Oxford University Press.
Christian, B. (2020). The Alignment Problem: Machine Learning and Human Values. W. W. Norton & Company.
GiveWell. (2024). Active grants database. Retrieved from https://www.givewell.org/research/grants
Global Furniture Census. (2023). Annual Report on Global Furniture Populations. International Furniture Statistics Bureau, Geneva.
Greaves, H., & MacAskill, W. (2021). The case for strong longtermism.Global Priorities Institute Working Paper, 5-2021.
Kenney, W. L., Craighead, D. H., & Alexander, L. M. (2015). Heat waves, aging, and human cardiovascular health. Medicine and Science in Sports and Exercise, 46(10), 1891-1899.
MacAskill, W. (2015). Doing Good Better: How Effective Altruism Can Help You Make a Difference. Gotham Books.
Nakamura, T., & Singh, R. (2024). Phase-change composites for human-aligned structural failure: A materials science approach. Journal of Materials Science & Engineering, 287, 115-134.
Open Philanthropy. (2024). Grants database. Retrieved from https://www.openphilanthropy.org/grants
Ord, T. (2020). The Precipice: Existential Risk and the Future of Humanity. Hachette Books.
Otsuka, K., & Wayman, C. M. (1999). Shape Memory Materials. Cambridge University Press.
Parsons, K. (2014). Human Thermal Environments: The Effects of Hot, Moderate, and Cold Environments on Human Health, Comfort, and Performance (3rd ed.). CRC Press.
Raymond, C., Matthews, T., & Horton, R. M. (2020). The emergence of heat and humidity too severe for human tolerance. Science Advances, 6(19), eaaw1838.
Russell, S. (2019). Human Compatible: Artificial Intelligence and the Problem of Control. Viking.
Sherwood, S. C., & Huber, M. (2010). An adaptability limit to climate change due to heat stress.Proceedings of the National Academy of Sciences, 107(21), 9552-9555.
Tetlock, P. E., & Gardner, D. (2015). Superforecasting: The Art and Science of Prediction. Crown.
United Nations. (2022). World Population Prospects 2022: Summary of Results. UN DESA, Population Division.
We derive the expected disvalue formula (Equation 6) from first principles. Let C be the random variable representing the time of a catastrophic event, with probability density function fC(t). Conditional on an event at time t, let St be the indicator for a successor scenario:
The expected disvalue is then:
The conditional expectation E[D | C = t] = P(St = 1) · D(Ct), yielding the result.
Sensitivity analysis was conducted by varying each parameter independently while holding others at baseline values. Key elasticities are:
Data Availability: Monte Carlo simulation code, expert elicitation protocols, raw survey data, and complete table material database are available at https://osf.io/ttap-supplementary (DOI: 10.17605/OSF.IO/TTAP24).
Code Availability: All analysis code is available under MIT license at https://github.com/ttap-research/misalignment-ev
Our data collection infrastructure consists of a globally distributed network of 2,847 calibrated thermocouples deployed across six continents (Antarctica excluded due to insufficient table populations). Each sensor array measures ambient temperature, table surface temperature, and the critical human-table interface temperature (HTIT) at 15-second intervals.
Sensors were calibrated against NIST-traceable standards to an accuracy of ±0.02°C, significantly exceeding the requirements for our analysis which operates at the ±1°C precision level. This over-engineering reflects our commitment to methodological rigor, not practical necessity.
The Global Furniture Census (GFC) employs a stratified random sampling approach to estimate global table populations. Primary sampling units (PSUs) consist of administrative regions weighted by GDP-per-capita and historical furniture manufacturing activity. Within each PSU, secondary sampling units (households and commercial establishments) are selected via probability-proportional-to-size sampling.
Enumerators receive 40 hours of training on table identification, including edge cases such as desks (counted as tables), countertops (not counted), and convertible coffee-to-dining tables (counted as 1.3 tables to reflect modal usage patterns). Inter-rater reliability exceeds κ = 0.94 across all table categories.
Protocol Note: Tables observed in quantum superposition states (simultaneously present and absent) are counted as 0.5 tables. This has affected precisely zero observations to date, but the protocol exists for completeness.
Table material composition was determined through a multi-stage process: (1) visual inspection by trained assessors, (2) density measurement via water displacement, (3) surface spectroscopy for finish identification, and (4) destructive testing of a 1cm² sample for composite tables. Destructive testing required owner consent, which was obtained in 73% of cases; the remaining 27% were imputed using multiple imputation under the missing-at-random assumption.
For antique tables (pre-1900), material composition was estimated from historical manufacturing records and dendrochronological analysis. We acknowledge that this introduces additional uncertainty for the approximately 0.3% of global tables in this category.
All data undergo a four-stage quality assurance process: (1) automated range checks and consistency validation, (2) manual review of statistical outliers (defined as observations >3σ from stratum means), (3) spot audits of 5% of field observations, and (4) annual third-party audit by the International Bureau of Furniture Statistics (IBFS, Geneva).
Data integrity is maintained through cryptographic hashing (SHA-256) at point of collection, with hash verification at each processing stage. To date, zero hash mismatches have been detected, though we remain vigilant for the possibility of sophisticated table-data corruption.
Domain experts were selected through a rigorous multi-stage process designed to identify individuals with relevant expertise while avoiding conflicts of interest (e.g., furniture industry executives, professional table enthusiasts). Selection criteria included: (1) doctoral-level training in materials science, thermodynamics, or existential risk studies; (2) publication record demonstrating engagement with either furniture science or catastrophic risk; and (3) demonstrated calibration on reference questions.
Initial invitations were extended to 67 prospective experts; 23 completed the full elicitation protocol. Attrition analysis revealed no systematic differences between completers and non-completers on observable characteristics, suggesting minimal selection bias.
Prior to eliciting target probabilities, experts completed a calibration training module consisting of 50 trivia questions with known answers (e.g., "What percentage of tables manufactured in 2019 were made primarily of particleboard?"). Experts provided 90% credible intervals for each question, and their calibration scores were computed as the fraction of true values falling within stated intervals.
Initial calibration scores ranged from 0.52 to 0.89 (median: 0.71). Experts then reviewed their calibration performance and completed a second round of 25 questions. Post-training calibration improved to 0.68-0.94 (median: 0.84). Only post-training scores were used for weighting expert judgments in the final analysis.
Calibration Theorem
A perfectly calibrated expert, when providing 90% credible intervals across many questions, will have exactly 90% of true values fall within their stated intervals. Our experts achieve 84% median calibration, suggesting slight overconfidence that we correct for in aggregation.
Target probabilities were elicited using a modified SHELF protocol (Gosling et al., 2018). Experts first provided individual estimates in isolation, then participated in a structured group discussion (conducted via secure video conference to maintain geographical diversity), and finally revised their estimates. This Delphi-like approach balances the benefits of discussion against anchoring effects.
For each scenario, experts were asked to estimate: (1) the probability of occurrence within specified time horizons, (2) the conditional probability of table survival given the scenario, and (3) the conditional probability of human extinction given the scenario. Questions were posed in multiple formats (probability, odds, frequency) to check for framing effects.
Expert estimates were aggregated using a calibration-weighted linear opinion pool. Letpi denote expert i's probability estimate and wi their calibration weight. The aggregated probability is:
Uncertainty was characterized via a weighted mixture of expert distributions, preserving the full shape of disagreement rather than collapsing to point estimates. This approach appropriately propagates expert uncertainty into downstream Monte Carlo simulations.
We categorize potential interventions into five tiers based on their mechanism of action: (1) Policy advocacy, targeting regulatory changes to furniture manufacturing standards; (2) R&D funding, accelerating development of human-aligned table materials; (3) Manufacturer subsidies, reducing the cost premium for aligned tables; (4) Direct production, establishing aligned table manufacturing capacity; and (5) Consumer education, increasing demand for aligned furniture.
Each intervention tier was analyzed using a theory of change model specifying the causal pathway from funding input to ultimate impact (tables aligned × probability reduction per table × expected disvalue reduction). Pathway probabilities were elicited from a separate panel of policy and economics experts.
Intervention costs were estimated through a combination of (1) bottom-up engineering estimates for direct production costs, (2) analogical reasoning from comparable policy advocacy campaigns (e.g., EU appliance efficiency standards), and (3) expert judgment for novel intervention types. All costs are expressed in 2024 USD and include both direct expenditures and opportunity costs of deployed resources.
For policy advocacy, we estimated costs based on documented expenditures for comparable EU regulatory campaigns, with adjustments for the novelty of furniture temperature alignment as a policy issue. The base estimate of $244,000 per regulatory outcome was derived from analysis of 17 comparable campaigns, with substantial uncertainty reflected in our credible intervals.
Definition 3 (Table-Outliving-Human Event). A TOH event occurs when a specific table survives a catastrophic scenario that eliminates a specific human. One table surviving a scenario that kills 1,000 humans constitutes 1,000 TOH events. This unit enables intuitive comparison across intervention types despite their different mechanisms of action.
Cost-effectiveness ratios (CERs) are computed as expected impact divided by expected cost:
Expected impact is the product of tables aligned × probability of successor scenario × expected disvalue per successor scenario. We assume linear scaling for marginal interventions, while acknowledging that scale effects (both positive due to economies of scale and negative due to diminishing returns) may affect CERs at larger funding levels.
CER estimates are highly sensitive to several parameters, particularly: (1) discount rate (elasticity = -2.3); (2) catastrophe probability estimates (elasticity = 1.0); and (3) policy success probability (elasticity = 1.0). We present results under three scenarios: Conservative (high discount rate, low catastrophe probability, low policy success); Baseline (median estimates for all parameters); and Optimistic (low discount rate, higher probabilities).
Break-even analysis indicates that even under conservative assumptions, policy advocacy remains cost-effective relative to a $10,000/DALY threshold if either catastrophe probabilities exceed 0.1% (100-year horizon) or policy success probability exceeds 5%.
Our analysis confronts multiple distinct sources of uncertainty: (1) Aleatory uncertaintyarising from inherent randomness in future events (e.g., whether a catastrophe occurs); (2)Epistemic uncertainty reflecting limitations in our knowledge (e.g., true table populations, material properties); and (3) Model uncertainty regarding the appropriate structural form of our analytical framework.
We distinguish these sources because they respond differently to additional information: epistemic uncertainty can be reduced through further research, while aleatory uncertainty cannot. Model uncertainty occupies an intermediate position—it can be reduced through empirical validation but may have irreducible components.
All uncertain quantities are represented as probability distributions rather than point estimates. For parameters with empirical data, we fit appropriate parametric distributions (typically log-normal for positive quantities, beta for probabilities) via maximum likelihood estimation. For parameters relying on expert judgment, we use the aggregated expert distributions directly.
Credible intervals throughout this paper are Bayesian 90% highest posterior density (HPD) intervals, meaning they contain the true value with 90% probability given our model and prior beliefs. Frequentist confidence intervals would be numerically similar for most parameters but carry different philosophical interpretations.
Uncertainty Propagation Principle
When combining uncertain quantities through mathematical operations, the uncertainty in the result depends on both the uncertainties in the inputs and the functional form of the combination. Monte Carlo simulation captures these interactions automatically, without requiring closed-form solutions for uncertainty propagation.
Our primary tool for uncertainty propagation is Monte Carlo simulation. For each of 10,000 iterations, we: (1) draw values for all uncertain parameters from their respective distributions; (2) compute the expected disvalue using the drawn values; and (3) store the result. The distribution of results across iterations characterizes our uncertainty in the final estimate.
Convergence was assessed by monitoring the stability of percentile estimates as iteration count increased. With 10,000 iterations, 90% credible interval endpoints are stable to within ±2% of their final values. We verified this through split-half analysis comparing estimates from odd and even iterations.
Following best practices in risk communication, we report uncertainty using multiple complementary formats: (1) point estimates (typically posterior medians); (2) credible intervals (90% HPD); (3) full posterior distributions (in supplementary figures); and (4) verbal probability terms calibrated to numerical ranges (e.g., "likely" = 66-90% probability, following IPCC conventions).
We emphasize that wide credible intervals are not a weakness of the analysis but an honest representation of genuine uncertainty. Analyses that report narrow intervals in domains with deep uncertainty are likely underestimating uncertainty rather than possessing superior knowledge.
We conducted expected value of perfect information (EVPI) and expected value of partial perfect information (EVPPI) analyses to identify which uncertainties most affect decision-making. Key findings: resolving uncertainty about catastrophe probabilities has EVPPI of $4.2M (in terms of optimal allocation of $10M intervention budget), while resolving uncertainty about table populations has EVPPI of only $0.3M.
This suggests that future research efforts should prioritize improved catastrophe probability estimates over more precise table censuses—a counterintuitive finding given that table counts are more directly measurable. The reason is that catastrophe probabilities have higher leverage in the expected value calculation due to their multiplicative role.
© 2024 Journal of Applied Alignment Studies. Published by Oxford University Press.
This is an Open Access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
This paper is part of the Table Temperature Alignment Project's peer-reviewed research program.