Open Forest Watch
5 Events · 4,953 ha
Pipeline offline
HGE-OFW · METHODOLOGY v1.2

How Open Forest Watch Reasons About Forest Events

Open Forest Watch does not detect deforestation — Global Forest Watch, GLAD alerts, and Sentinel satellite feeds do that. What OFW adds is the reasoning layer above detection: structured hypothesis generation, multi-source evidence weighting, actor attribution, and regulatory flag assignment. This page documents how that reasoning works.

The Reasoning Gap

What detection layers provide
What OFW adds
Detects where forest is gone
Generates WHY it happened and WHO did it
Produces maps and alerts
Produces structured, auditable evidence bundles
Tells you an event occurred
Argues a causal chain with confidence scoring
Data for researchers and NGOs
Legal-grade reports for regulators and courts
Open detection layer
Open reasoning layer — the layer above detection

Pipeline Architecture

01
Detection Ingestion

OFW ingests alerts from Global Forest Watch GLAD-S2, RADD, and FIRMS active fire feeds via their public APIs. Each alert carries a bounding box, detection date, and confidence level. OFW does not re-derive deforestation — it consumes existing detection products.

GFW GLAD-S2FIRMS VIIRS/MODISRADD alerts
02
Multi-Spectral Evidence Collection

For each detected event, OFW queries Sentinel-2 (optical NDVI), Sentinel-1 (SAR backscatter), and PlanetScope (3m daily) composites for the event bounding box and evidence window. Evidence items are extracted as structured records with source, date, and supporting weight.

Sentinel Hub APIGoogle Earth EnginePlanet API
03
Cause Attribution

The HGE hypothesis engine analyses the spectral signatures, burn patterns, clearing geometry, and road network changes to generate a cause hypothesis with a confidence score. The cause taxonomy is: fire, logging, agriculture, mining, infrastructure. A secondary cause is generated when multiple mechanisms are present.

Spectral signature libraryHGE v3 classifier
04
Actor Hypothesis

Spatial proximity analysis against land concession registries, corporate databases, supply chain flow maps (Trase.earth), and OpenStreetMap road networks generates an actor hypothesis. Named entities are flagged only when a spatial overlap with a registered parcel or concession can be established. All actor attributions carry an explicit disclaimer and confidence score.

CAR/SICAR (Brazil)KPPU (Indonesia)World Bank concession dataTrase.earth
05
Regulatory Flag Assignment

Each event is automatically assessed against the EU Deforestation Regulation (EUDR), the Carbon Border Adjustment Mechanism (CBAM), REDD+ safeguards, CITES Appendix II forest species, ILO Convention 169 (indigenous rights), and applicable national forestry law. Flags are assigned based on biome, commodity linkage, and protected area status.

EUDR Annex I commodity listCITES Appendix IIWDPA protected areas
06
Confidence Scoring

The overall confidence score is a weighted composite of evidence item weights, cause attribution confidence, and data completeness. Uncertainty bounds are derived from cloud cover fraction over the evidence window, number of independent evidence sources, and concession boundary positional uncertainty. All uncertainty inputs are logged in the report.

HGE confidence model v1.2
07
Report Generation & Provenance Hash

The structured JSON report is generated, signed with a SHA-256 evidence hash over all evidence item content, and published to the public report library. The hash provides tamper-evident provenance: any modification to the evidence set changes the hash. The report is permanently citable by event_id.

HGE provenance graphSHA-256
THE HGE REASONING LAYER — CORE ALGORITHM

Bayesian Evidence Fusion (Log-Odds Form)

No tool in the market does this. GFW, RADD, and Planet tell you an alert occurred. OFW asks: given this alert and these independent evidence sources, what is the posterior probability that this specific cause explains the event? We use log-odds evidence fusion — the same mathematical framework used in medical diagnosis and intelligence analysis — to combine satellite alert confidence, spectral signatures, fire co-location, historical country priors, and supply chain linkage into a single calibrated probability estimate.

── Step 1: Satellite alert base probability ──────────────────────────────
  P_base = 0.88   if alert_confidence = "highest"   (multi-sensor agreement)
         = 0.72   if alert_confidence = "high"       (single sensor)
         = 0.55   otherwise

── Step 2: Log-odds initialisation ──────────────────────────────────────
  Λ₀ = log( P_base / (1 − P_base) )

── Step 3: Sequential evidence update (Dempster-Shafer inspired) ─────────
  for each evidence_item eᵢ (excluding primary alert):
    Λᵢ = Λᵢ₋₁ + α × log( wᵢ / (1 − wᵢ) )
    where wᵢ = evidence weight ∈ [0.01, 0.99]
          α  = 0.35  (partial update coefficient, guards against overconfidence)

── Step 4: Posterior conversion ─────────────────────────────────────────
  P_posterior = sigmoid(clamp(Λ_final, −4, +4))
              = 1 / (1 + exp(−Λ_final))

── Step 5: Contextual corrections ───────────────────────────────────────
  + 0.04  if area_ha > 2000 ha       (large events have fewer ambiguous causes)
  + 0.02  if area_ha > 500 ha
  − 0.04  if area_ha < 50 ha         (small events are ambiguous)
  + 0.05  if FIRMS fire co-located AND cause ∈ {fire, agriculture}

── Step 6: Uncertainty bounds ───────────────────────────────────────────
  half_width = max(4, 18 − 3 × N_independent_sources)
  CI_lower   = max(20, score − half_width)
  CI_upper   = min(96, score + half_width/2)
  (Asymmetric: upper bound is tighter — overconfidence is penalised more)
EVIDENCE SOURCES
5 types
GFW alert · NDVI · FIRMS · country prior · supply chain
UPDATE COEFFICIENT Α
0.35
Partial Bayesian update — guards against evidence double-counting
CLAMP RANGE
±4 log-odds
Prevents P→0 or P→1 from single overwhelming source

What Makes OFW Different

Capability
GFW / RADD
Planet Labs
OFW
Deforestation detection
Consumes GFW
Cause attribution (why)
✓ Bayesian
Actor hypothesis (who)
✓ Spatial inference
Regulatory flagging
Partial (EUDR only)
✓ EUDR/REDD+/CBAM/ILO
Evidence chain audit trail
✓ SHA-256 hash
Exportable structured report
Data download only
✓ JSON + cite
Public API (free tier)
EUDR compliance certificate
GFW Pro (paid)
✓ Free
Journalist citation format
✓ APA / BibTeX

Known Limitations

Actor attribution is probabilistic

OFW does not identify individuals. Actor hypotheses are spatial and supply-chain inferences, not legal findings. Named entities are flagged only when spatial overlap with a registered parcel can be established. All actor hypotheses carry an explicit disclaimer.

Cloud cover reduces evidence completeness

Tropical forest regions have persistent cloud cover. When optical satellite data is unavailable for more than 50% of the evidence window, the confidence score is automatically discounted and flagged in the report.

Concession boundary data quality varies by country

Brazil's CAR system is well-maintained. DRC, Cameroon, and Papua New Guinea have known boundary data gaps and positional uncertainties of up to 500 m. This is noted in relevant reports.

Regulatory flags are informational, not legal advice

Regulatory flag assignment identifies potential applicability of frameworks (EUDR, REDD+, etc.). It is not a legal opinion. Operators should seek independent legal counsel before acting on regulatory flags.

Open Science Commitment

The HGE reasoning methodology is published openly as a research paper and technical specification. The API is open for researchers to build on. This is the AlphaFold approach: open the science, close the infrastructure. Anyone can read how OFW reasons. Only Northflow runs the production engine at institutional scale. The methodology paper is available at doi:10.XXXX/ofw-methodology-v1.