Capstone Projects: From Raw Data to a Signed Decision

schedule15 min readfitness_center6 exercises

Every chapter so far taught one skill on clean data. The field does not hand you clean data, and it does not ask for a skill; it asks for a decision, signed, with your name on it: perforate here, book this many barrels, drill the next well or walk away. This chapter is where the book pays off. Four projects take messy, broken, real-shaped data and carry it all the way to a number an engineer would stake a development on.

The four are deliberately separate (a log interpretation, a reserves forecast, a drilling benchmark, a volumetric estimate), but each obeys the same discipline the rest of the book has drilled into you. (1) Quality-control the data before trusting it, because the first reading you believe is usually the broken one. (2) Report a range, not a point: P10/P50/P90, where by petroleum convention P90 is the conservative low estimate, P50 the middle, and P10 the optimistic high, because a single number off uncertain data is a lie with a decimal place. (3) Translate the answer into dollars and a decision, because barrels are not the deliverable. (4) Reconcile the statistical answer against the physics, because no reservoir engineer signs a model that disagrees with Archie or material balance. (5) Degrade honestly on a blind well, because a model that only works on the data it has seen has not been tested. That discipline, not any single algorithm, is what separates a portfolio from a pile of notebooks.

infoWhat You'll Learn

Build a QC-gated interpretation pipeline that detects, repairs, and quarantines bad log data before it poisons the answer
Carry uncertainty end-to-end: porosity → net pay → reserves → NPV, all as P10/P50/P90, never a single number
Translate every technical result into dollars and a go/no-go, with the price and discount sensitivities that flip the call
Reconcile machine-learning and statistical answers against the deterministic physics, and diagnose where they diverge
Benchmark and characterise at field scale: difficulty-normalised drilling performance and Monte-Carlo volumetrics with a value-of-information decision

lightbulbDatasets Used in This Chapter

Project 2 uses the field's real 24-month production export, embedded inline (faults and all). The other three projects generate synthetic-but-physical data with the verified generators from earlier chapters, so every cell runs offline. Each project is self-contained: run its cells in order.

Project 1: Automated Well Log Interpretation Pipeline

A new well, OD-007, has been logged. Before it can be added to the field model someone has to turn four curves into an answer: how many feet of pay, where, and how confident are you? Do it by hand and it takes a petrophysicist a day; do it wrong (miss a thin gas sand, or trust a washed-out density reading) and you perforate water or leave pay behind. We build the pipeline that does it in seconds, and refuses to trust data it should not.

The first cell builds the field: a forward rock-physics model that turns each rock type into the four wireline logs a tool would record, namely GR (gamma ray, high in shale), RHOB (bulk density), NPHI (neutron porosity), and RT (deep resistivity, high in hydrocarbons), giving us a known ground truth to invert against. Skim it top-down; you do not need to trace every term to use what it produces.

main.py

import numpy as np
import pandas as pd
from sklearn.ensemble import RandomForestClassifier, RandomForestRegressor, IsolationForest
from sklearn.metrics import accuracy_score, r2_score
from sklearn.preprocessing import StandardScaler

FACIES = {0: "Shale", 1: "Wet Sand", 2: "Pay Sand", 3: "Siltstone"}
ARCHIE = dict(a=0.81, m=2.0, n=2.0, Rw=0.04)   # Archie constants (Niger Delta sand)

def make_well(wid, seed, top=9000.0, n=420):
    """One depth-ordered well whose logs are consistent with facies, porosity AND
    Archie water saturation -- the single rock model the whole pipeline inverts."""
    rng = np.random.default_rng(seed)
    depth = top + 0.5 * np.arange(n)
    edges = np.sort(rng.integers(8, n - 8, rng.integers(8, 13) - 1))
    facies = np.zeros(n, int)
    for seg, f in zip(np.split(np.arange(n), edges),
                      rng.choice([0, 1, 2, 3], len(edges) + 1, p=[0.38, 0.22, 0.20, 0.20])):
        facies[seg] = f
    Vsh = np.clip(np.where(facies == 0, 0.82, np.where(facies == 3, 0.55, 0.15)) + rng.normal(0, 0.06, n), 0, 1)
    base = np.where(facies == 0, 0.06, np.where(facies == 3, 0.14, 0.24))
    bed_q = np.zeros(n)
    for seg in np.split(np.arange(n), edges):
        bed_q[seg] = rng.uniform(-0.10, 0.05)               # per-bed reservoir quality (some marginal)
    phi = np.clip((base + bed_q) * (1 - 0.4 * Vsh) + rng.normal(0, 0.018, n), 0.02, 0.34)
    Sw = np.where(facies == 2, rng.uniform(0.25, 0.68, n), np.where(facies == 1, 0.88, 0.95))
    Sw = np.clip(Sw + rng.normal(0, 0.05, n), 0.12, 1.0)
    gas = (facies == 2) & (Sw < 0.45)
    GR = 18 * (1 - Vsh) + 135 * Vsh + rng.normal(0, 7, n)
    RHOB = (2.65 + 0.03 * Vsh) * (1 - phi) + np.where(gas, 0.6, 1.0) * phi + rng.normal(0, 0.035, n)
    NPHI = phi + 0.30 * Vsh - np.where(gas, 0.08, 0.0) + rng.normal(0, 0.025, n)
    a, m, nn, Rw = ARCHIE.values()
    RT = np.clip(a * Rw / (np.clip(phi, 0.03, 1) ** m * Sw ** nn) * np.exp(rng.normal(0, 0.22, n)), 0.2, 2000)
    return pd.DataFrame({"WELL": wid, "DEPTH": depth, "GR": GR, "RHOB": RHOB, "NPHI": NPHI,
                         "RT": RT, "facies": facies, "PHI_true": phi, "SW_true": Sw})

def feats(df):
    X = df[["GR", "RHOB", "NPHI", "RT"]].copy()
    X["RT"] = np.log10(X["RT"])                 # resistivity on its log scale
    return X.values

# Train facies + porosity models on a 6-well field; score HONESTLY on a held-out well.
field = pd.concat([make_well(f"OD-{i:03d}", seed=i, top=9000 + i * 30) for i in range(1, 7)], ignore_index=True)
facies_clf = RandomForestClassifier(n_estimators=150, random_state=0).fit(feats(field), field.facies)
poro_reg   = RandomForestRegressor(n_estimators=200, random_state=0).fit(feats(field), field.PHI_true)

holdout = make_well("OD-006H", seed=60, top=9210.0)
print(f"trained on {field.WELL.nunique()} wells ({len(field)} samples)")
print(f"  held-out facies accuracy : {accuracy_score(holdout.facies, facies_clf.predict(feats(holdout))):.3f}")
print(f"  held-out porosity R2     : {r2_score(holdout.PHI_true, poro_reg.predict(feats(holdout))):.3f}")

Facies at 0.83 accuracy and porosity at R² 0.89, on a well the models never saw. Those are honest numbers. A pipeline that reports 0.99 here has almost certainly leaked adjacent half-foot samples across its train/test split (depth-correlated logs make a random split cheat); we scored on a separate well precisely to avoid that. Now the data fights back.

main.py

import matplotlib.pyplot as plt
from matplotlib.colors import ListedColormap

def archie_sw(phi, RT):
    """Water saturation from porosity and deep resistivity (Archie's equation)."""
    a, m, nn, Rw = ARCHIE.values()
    return np.clip((a * Rw / (np.clip(phi, 0.03, 1) ** m * RT)) ** (1 / nn), 0, 1)

# QC -- the new well OD-007 arrives broken. Inject four real tool faults so we
# know the ground truth, then detect and repair them before any interpretation.
new = make_well("OD-007", seed=42, top=9000.0)
truth_bad = np.zeros(len(new), bool)
washout = (new.DEPTH >= 9050) & (new.DEPTH < 9058)       # borehole caved -> density reads drilling mud
new.loc[washout, "RHOB"] = 1.6 + np.random.default_rng(1).normal(0, 0.05, washout.sum())
stuck = (new.DEPTH >= 9120) & (new.DEPTH < 9130)          # neutron tool stuck -> a dead flatline
new.loc[stuck, "NPHI"] = new.loc[stuck, "NPHI"].iloc[0]
new.loc[[30, 31, 180], "RT"] = [3500, 4200, 5000.0]      # electrical spikes on resistivity
truth_bad[np.where(washout)[0]] = True
truth_bad[np.where(stuck)[0]] = True
truth_bad[[30, 31, 180]] = True

# Detect: magnitude outliers (isolation forest) + a contextual flatline (rolling std).
Xq = StandardScaler().fit_transform(np.column_stack([new.GR, new.RHOB, new.NPHI, np.log10(new.RT)]))
mag = IsolationForest(contamination=0.08, random_state=0).fit(Xq).predict(Xq) == -1
flat = (new.NPHI.rolling(5, min_periods=5).std() < 0.004).fillna(False).values
qc_flag = mag | flat
recall = (qc_flag & truth_bad).sum() / truth_bad.sum()
print(f"QC: quarantined {qc_flag.sum()} of {len(new)} samples (caught {recall:.0%} of the injected faults)")

# Repair: impute each bad curve from its good neighbours, depth by depth -- so the
# interpretation runs on recovered logs, not on a conveniently regenerated well.
rep = new.copy()
d = rep.DEPTH.values
wash = rep.RHOB.values < 1.9                              # washout signature on the density log
rep.loc[wash, "RHOB"] = np.interp(d[wash], d[~wash], rep.RHOB.values[~wash])
fz = (rep.NPHI.rolling(5, min_periods=5).std() < 0.004).fillna(False).values
rep.loc[fz, "NPHI"] = np.interp(d[fz], d[~fz], rep.NPHI.values[~fz])
spk = rep.RT.values > 2500.0                             # legitimate RT tops out near 2000 ohm-m;
rep.loc[spk, "RT"] = np.interp(d[spk], d[~spk], rep.RT.values[~spk])      # production code: median + k*MAD
print(f"repaired by interpolation: {wash.sum()} washout, {fz.sum()} flatline, {spk.sum()} spike samples")

# Interpret the repaired well; carry porosity uncertainty through to net pay.
trees = np.stack([t.predict(feats(rep)) for t in poro_reg.estimators_])   # the forest's porosity spread
phi50, phi_sd = np.median(trees, 0), trees.std(0)
Vsh_gr = np.clip((rep.GR.values - 18) / (135 - 18), 0, 1)                 # shale fraction from gamma ray
rng = np.random.default_rng(7)
net = []
# Net-pay rule applied to the PREDICTED logs (the same cutoff drives the flag below):
# clean enough (Vsh < 0.40), porous enough (phi > 0.08), oil-bearing enough (Sw < 0.60).
# (The blind-well TRUTH later uses facies membership as the reservoir-quality proxy.)
for _ in range(300):                                     # Monte-Carlo net pay over the porosity uncertainty
    phi_r = np.clip(phi50 + rng.normal(0, 1, len(phi50)) * np.maximum(phi_sd, 0.005), 0.02, 0.34)
    pay_r = (Vsh_gr < 0.40) & (phi_r > 0.08) & (archie_sw(phi_r, rep.RT.values) < 0.60)
    net.append(pay_r.sum() * 0.5)
p90, p50, p10 = np.percentile(net, [10, 50, 90])         # P90 from the 10th percentile (low side)
print(f"net pay (Monte Carlo):  P90 {p90:.1f}  P50 {p50:.1f}  P10 {p10:.1f} ft")

# Reconcile ML porosity against the deterministic density-porosity transform.
dphi = (2.65 - rep.RHOB.values) / (2.65 - 1.0)
clean = make_well("OD-007", seed=42, top=9000.0)         # truth (clean curves) for the gas-zone diagnosis
gas = (clean.facies.values == 2) & (clean.SW_true.values < 0.45)
print(f"ML phi vs density-phi: {np.mean(phi50 - dphi):+.3f} overall, {np.mean((phi50 - dphi)[gas]):+.3f} in gas "
      "(density-porosity over-reads where gas lightens RHOB -- ML, trained on core, does not)")

# A fully blind well the pipeline never touched -- the only honest accuracy test.
blind = make_well("OD-099", seed=777, top=9000.0)
phib = poro_reg.predict(feats(blind))
Swb = archie_sw(phib, blind.RT.values)
Vb = np.clip((blind.GR.values - 18) / (135 - 18), 0, 1)
np_pred = ((Vb < 0.40) & (phib > 0.08) & (Swb < 0.60)).sum() * 0.5
np_true = (np.isin(blind.facies.values, [1, 2]) & (blind.PHI_true.values > 0.08) & (blind.SW_true.values < 0.60)).sum() * 0.5
print(f"blind well OD-099: predicted {np_pred:.1f} ft net pay vs true {np_true:.1f} ft (error {abs(np_pred - np_true):.1f} ft)")

# The deliverable -- a composite track a geologist can initial.
colors = ["#7f7f7f", "#9ecae1", "#CC4444", "#6B8E23"]
fp = facies_clf.predict(feats(rep))
fig, ax = plt.subplots(1, 4, figsize=(9, 8), sharey=True, gridspec_kw={"width_ratios": [1.2, 0.4, 1.2, 0.4]})
ax[0].plot(rep.GR, rep.DEPTH, "g", lw=0.7)
ax[0].set_xlabel("GR"); ax[0].set_ylabel("Depth (ft)"); ax[0].set_title("Log")
ax[1].imshow(fp.reshape(-1, 1), aspect="auto", cmap=ListedColormap(colors),
             vmin=0, vmax=3, extent=[0, 1, rep.DEPTH.iloc[-1], rep.DEPTH.iloc[0]])
ax[1].set_xticks([]); ax[1].set_title("Facies")
ax[2].fill_betweenx(rep.DEPTH, phi50 - 1.28 * phi_sd, phi50 + 1.28 * phi_sd, color="#ccc", label="P10-P90")
ax[2].plot(phi50, rep.DEPTH, "b", lw=0.8, label="ML")
ax[2].plot(clean.PHI_true, rep.DEPTH, "k--", lw=0.6, label="truth")
ax[2].set_xlabel("Porosity"); ax[2].legend(fontsize=7); ax[2].set_title("Porosity")
pay = (Vsh_gr < 0.40) & (phi50 > 0.08) & (archie_sw(phi50, rep.RT.values) < 0.60)
ax[3].imshow(pay.reshape(-1, 1).astype(int), aspect="auto", cmap=ListedColormap(["white", "#CC4444"]),
             vmin=0, vmax=1, extent=[0, 1, rep.DEPTH.iloc[-1], rep.DEPTH.iloc[0]])
ax[3].set_xticks([]); ax[3].set_title("Net pay")
ax[0].invert_yaxis(); fig.suptitle("OD-007 - ML Petrophysical Interpretation", y=0.99); fig.tight_layout()
plt.show()

Read the scorecard, then the band, then the reconciliation. The QC step catches the magnitude faults outright (the washout, where the borehole has caved and the density tool reads mud instead of rock, and the resistivity spikes, both flagged by the isolation forest) and catches most of the stuck-tool flatline with a contextual rolling-standard-deviation rule. That is about 74% of the injected bad samples overall; the rolling window inevitably misses a few at the dead zone's edges, which is exactly the precision/recall trade-off Exercise 19.1 makes you tune. It then repairs what it flags by interpolating across the good neighbours, so the interpretation runs on recovered curves, not on a conveniently regenerated well. Net pay, the cumulative feet that clear the reservoir cutoffs, comes back as 33.5 ft, bracketed 32–34.5: the porosity model's own uncertainty propagated through those cutoffs, so the asset team books a range, not a false-precision number. The reconciliation earns trust, and the direction is the whole point: gas is light, so it lowers the density log (RHOB), which makes the naive density-porosity transform read higher porosity than is really there. The ML porosity sits below that transform in the gas sand, and lower is the correct answer, because the model was trained on core, not fooled by the gas effect. And on a completely blind well the pipeline lands within 1.5 ft of truth, not the zero-foot error a leaky split would fake.

The signable output is the composite track above plus one line a geologist can initial: "OD-007: 33.5 ft net pay (32–34.5), gas-bearing sands flagged; recommend perforating the upper pay; blind-well net-pay error ~1.5 ft." That is the project, not the R².

Project 2: Production Forecasting and Reserves Estimation System

The reserves meeting is next week. Four producing wells, twenty-four months of history each, and the question on the table is worth nine figures: how many barrels do we book, and is the development still economic if oil drops to \$55? The data is the field's actual export, and like every real export, it is dirty. We build the system that QCs it, forecasts each well with honest uncertainty, and puts a P10/P50/P90 NPV on the table with the sensitivities that flip the decision.

main.py

import io
from scipy.optimize import curve_fit
import warnings; warnings.simplefilter("ignore")

def hyperbolic(t, qi, Di, b):
    return qi / np.power(1 + b * Di * t, 1.0 / b)

def modified_hyperbolic(t, qi, Di, b, Dmin):
    """Arps hyperbolic that switches to a terminal exponential decline at Dmin --
    without it, a b near 1 forecasts a near-infinite tail and over-books reserves."""
    t_sw = (Di / Dmin - 1) / (b * Di) if (b > 0 and Di > Dmin) else 1e9
    return np.where(t <= t_sw, hyperbolic(t, qi, Di, b),
                    hyperbolic(t_sw, qi, Di, b) * np.exp(-Dmin * (t - t_sw)))

# The field's real 24-month export (embedded). It arrives with faults, like every export does.
PROD = """well,date,oil_bopd
OD-001,2025-01,2347.9
OD-001,2025-02,2278.1
OD-001,2025-03,2276.2
OD-001,2025-04,2200.2
OD-001,2025-05,2151.0
OD-001,2025-06,2000.3
OD-001,2025-07,1827.7
OD-001,2025-08,1807.2
OD-001,2025-09,1657.7
OD-001,2025-10,1674.6
OD-001,2025-11,1570.1
OD-001,2025-12,1561.9
OD-001,2026-01,1532.1
OD-001,2026-02,1510.5
OD-001,2026-03,
OD-001,2026-04,1280.1
OD-001,2026-05,1333.5
OD-001,2026-06,1158.4
OD-001,2026-07,1224.0
OD-001,2026-08,1103.0
OD-001,2026-09,1138.6
OD-001,2026-10,1021.0
OD-001,2026-11,1095.7
OD-001,2026-12,965.1
OD-003,2025-01,3081.7
OD-003,2025-02,3007.0
OD-003,2025-03,2792.2
OD-003,2025-04,-200.0
OD-003,2025-05,2431.6
OD-003,2025-06,2247.2
OD-003,2025-07,2180.3
OD-003,2025-08,2171.0
OD-003,2025-09,1979.1
OD-003,2025-10,1788.0
OD-003,2025-11,1694.2
OD-003,2025-12,1576.2
OD-003,2026-01,1568.3
OD-003,2026-02,1428.9
OD-003,2026-03,1270.9
OD-003,2026-04,1315.6
OD-003,2026-05,1207.4
OD-003,2026-06,1177.8
OD-003,2026-07,1074.7
OD-003,2026-08,960.8
OD-003,2026-09,968.1
OD-003,2026-10,899.4
OD-003,2026-11,806.9
OD-003,2026-12,770.1
OD-005,2025-01,1807.2
OD-005,2025-02,1737.1
OD-005,2025-03,1720.5
OD-005,2025-04,1644.8
OD-005,2025-05,1593.0
OD-005,2025-06,1539.4
OD-005,2025-07,1535.7
OD-005,2025-08,15000.0
OD-005,2025-09,1450.4
OD-005,2025-10,1329.5
OD-005,2025-11,1324.6
OD-005,2025-12,1331.2
OD-005,2026-01,1266.3
OD-005,2026-02,1234.4
OD-005,2026-03,1129.9
OD-005,2026-04,1123.2
OD-005,2026-05,1135.1
OD-005,2026-06,1080.7
OD-005,2026-07,1084.6
OD-005,2026-08,1061.8
OD-005,2026-09,1038.6
OD-005,2026-10,1004.2
OD-005,2026-11,885.3
OD-005,2026-12,890.1
OD-007,2025-01,2871.6
OD-007,2025-02,2862.5
OD-007,2025-03,2779.9
OD-007,2025-04,2584.2
OD-007,2025-05,2406.8
OD-007,2025-06,2317.7
OD-007,2025-07,2126.1
OD-007,2025-08,2064.2
OD-007,2025-09,1929.2
OD-007,2025-10,1911.9
OD-007,2025-11,1777.9
OD-007,2025-12,1671.4
OD-007,2026-01,1627.4
OD-007,2026-02,1509.9
OD-007,2026-03,1353.1
OD-007,2026-04,1332.1
OD-007,2026-05,1272.4
OD-007,2026-06,1204.4
OD-007,2026-07,1175.7
OD-007,2026-08,1082.7
OD-007,2026-09,1110.0
OD-007,2026-10,1099.1
OD-007,2026-11,953.7
OD-007,2026-12,980.7
"""
prod = pd.read_csv(io.StringIO(PROD))

# QC: flag missing, negative, and outlier-spike oil rates, one well at a time.
def clean_mask(q):
    med = np.nanmedian(q)
    return np.isfinite(q) & (q > 0) & (q < 3 * med)     # drop NaN, negatives, and 10x spikes
total_bad = 0
ECON, DAYS, DMIN = 20.0, 30.4, 0.06 / 12                # economic limit, days/month, 6%/yr terminal decline

# Fit each well's decline curve, then bootstrap a P10/P50/P90 EUR (Estimated
# Ultimate Recovery -- the total barrels the well will make over its life).
def eur_from(qi, Di, b):
    t = np.arange(0, 360); q = modified_hyperbolic(t, qi, Di, b, DMIN); q = q[q >= ECON]
    return np.sum(q * DAYS)
fits = {}
for w in sorted(prod.well.unique()):
    q = prod[prod.well == w].oil_bopd.values.astype(float)
    ok = clean_mask(q); total_bad += (~ok).sum()
    t = np.arange(len(q)); tt, qq = t[ok], q[ok]
    # Arps bounds are physics, not tuning: b in (0, 1] for a real hyperbolic; b -> 1 is the over-booking edge.
    popt, _ = curve_fit(hyperbolic, tt, qq, p0=[qq[0], 0.05, 0.5], bounds=([0, 0, 0.01], [8000, 1, 0.99]), maxfev=5000)
    resid = qq - hyperbolic(tt, *popt); eurs = []
    rb = np.random.default_rng(0)
    for _ in range(300):
        qb = np.maximum(hyperbolic(tt, *popt) + rb.choice(resid, len(tt), replace=True), 1)
        try:
            pb, _ = curve_fit(hyperbolic, tt, qb, p0=popt, bounds=([0, 0, 0.01], [8000, 1, 0.99]), maxfev=3000)
            eurs.append(eur_from(*pb))
        except Exception:
            eurs.append(eur_from(*popt))
    p90, p50, p10 = np.percentile(eurs, [10, 50, 90])
    fits[w] = dict(qi=popt[0], Di=popt[1], b=popt[2], p90=p90, p50=eur_from(*popt), p10=p10)
    print(f"  {w}: qi={popt[0]:.0f} Di={popt[1]:.3f} b={popt[2]:.2f} | EUR {fits[w]['p50']/1e6:.2f} "
          f"(P90 {p90/1e6:.2f} / P10 {p10/1e6:.2f}) MMbbl")
print(f"\nQC flagged {total_bad} bad oil readings (a missing month, a -200 reading, and a 15,000 spike)")
print(f"FIELD EUR  P90 {sum(f['p90'] for f in fits.values())/1e6:.1f} / "
      f"P50 {sum(f['p50'] for f in fits.values())/1e6:.1f} / "
      f"P10 {sum(f['p10'] for f in fits.values())/1e6:.1f}  MMbbl")

The cell prints the three faults it caught (a missing month, a negative rate, and a 15,000-bopd spike) and removes all three before a single curve is fit. That last one matters most: a misplaced decimal, left in, drags the whole decline fit upward and over-books the well. The terminal-decline cap is the other safeguard, though on this field it is insurance that never has to pay out: all four wells decline steeply (fitted b ≈ 0.01–0.10), so they hit the economic limit long before the cap would ever fire, and it changes the booked EUR by nothing. It earns its place for the other kind of well: an unconstrained hyperbolic with b near 1 forecasts a tail that never dies and books reserves that will never be produced (the classic over-booking trap that Exercise 19.3 builds on purpose). Capping the decline at a 6%/yr floor keeps that trap shut. Now turn barrels into the only thing the meeting cares about.

main.py

PRICE, OPEX_VAR, OPEX_FIX, DISC = 75.0, 18.0, 45000.0, 0.10   # $/bbl, $/bbl, $/well/month, annual discount
def npv_from(qi, Di, b, price=PRICE, disc=DISC):
    t = np.arange(0, 360); q = modified_hyperbolic(t, qi, Di, b, DMIN); q = q[q >= ECON]
    cf = q * DAYS * (price - OPEX_VAR) - OPEX_FIX                # monthly cash flow, $
    return np.sum(cf / np.power(1 + disc, np.arange(len(q)) / 12.0))

field_npv = sum(npv_from(f['qi'], f['Di'], f['b']) for f in fits.values()) / 1e6
print(f"FIELD NPV (P50, $75/bbl, 10%): ${field_npv:.0f} MM")
print("tornado (what moves the decision):")
for label, kw in [("oil $55/bbl", dict(price=55)), ("oil $95/bbl", dict(price=95)),
                  ("discount 15%", dict(disc=0.15)), ("discount 5%", dict(disc=0.05))]:
    v = sum(npv_from(f['qi'], f['Di'], f['b'], **kw) for f in fits.values()) / 1e6
    print(f"  {label:14s}: ${v:.0f} MM   ({v - field_npv:+.0f})")
breakeven = min(p for p in range(20, 80) if sum(npv_from(f['qi'], f['Di'], f['b'], price=p) for f in fits.values()) > 0)
print(f"break-even oil price (field NPV = 0): ~${breakeven}/bbl")

fig, (axL, axR) = plt.subplots(1, 2, figsize=(11, 4.5))
ws = list(fits); p50s = [fits[w]['p50'] / 1e6 for w in ws]
axL.bar(ws, p50s, color="#2E8B57")
axL.errorbar(ws, p50s, yerr=[[ (fits[w]['p50']-fits[w]['p90'])/1e6 for w in ws], [(fits[w]['p10']-fits[w]['p50'])/1e6 for w in ws]],
             fmt="none", ecolor="k", capsize=4)
axL.set_ylabel("EUR (MMbbl)"); axL.set_title("Per-well EUR with P10-P90 band"); axL.grid(axis="y", alpha=0.2)
tor = [("oil price\n$55-$95", sum(npv_from(f['qi'],f['Di'],f['b'],price=55) for f in fits.values())/1e6,
        sum(npv_from(f['qi'],f['Di'],f['b'],price=95) for f in fits.values())/1e6),
       ("discount\n5-15%", sum(npv_from(f['qi'],f['Di'],f['b'],disc=0.15) for f in fits.values())/1e6,
        sum(npv_from(f['qi'],f['Di'],f['b'],disc=0.05) for f in fits.values())/1e6)]
for i, (lab, lo, hi) in enumerate(tor):
    axR.barh(i, hi - lo, left=lo, color="#4682B4"); axR.text(field_npv, i, " base", va="center", fontsize=8)
axR.axvline(field_npv, color="k", ls="--", lw=1); axR.set_yticks(range(len(tor))); axR.set_yticklabels([t[0] for t in tor])
axR.set_xlabel("Field NPV ($MM)"); axR.set_title("Tornado - what flips the decision"); axR.grid(axis="x", alpha=0.2)
fig.tight_layout(); plt.show()

The reserves-review packet writes itself from here: book P50 ≈ 7.5 MMbbl across the four wells (range 7.1–9.0), field NPV ≈ \ $341 MM at \$ 75/bbl. The tornado is the headline an asset manager reads first: oil price is the swing factor (±\ $125 MM), the discount rate barely matters, and the project stays positive down to roughly \$ 21/bbl, so yes, it survives the \$55 stress case comfortably. That sentence, not the EUR, is what gets signed.

Project 3: Drilling Performance Benchmarking Tool

Logs and production told us how much is down there and what it is worth; drilling asks a different question: how well we got to it. Six wells, six different drillers, and a capital meeting that wants to know which well to copy and which to never repeat. The trap is brutal and common: rank them by raw cost-per-foot and you will crucify the team that drilled the deepest, hardest hole and reward the one that drilled a shallow, soft one slowly. Real benchmarking normalises for difficulty first, then ranks what is left (the part the crew actually controls) and puts a dollar figure on the gap, with a confidence interval so one lucky well does not set the target.

main.py

# Six wells: TD (ft), formation hardness, drilling efficiency (0-1), and non-productive-time fraction.
SPEC = [("A", 9000, 1.0, 0.92, 0.32), ("B", 11000, 1.6, 0.78, 0.44), ("C", 9500, 1.1, 0.95, 0.30),
        ("D", 12500, 1.9, 0.97, 0.28), ("E", 8800, 0.9, 0.70, 0.52), ("F", 10200, 1.4, 0.85, 0.38)]
DAYRATE = 280000.0   # $/day full rig spread
rows = []
for wid, td, hard, eff, npt in SPEC:
    rop = 110 / hard * eff                               # on-bottom ROP depends on rock AND practices
    days = (td / rop / 24) / (1 - npt)                   # non-productive time inflates calendar days
    cost = days * DAYRATE
    rows.append(dict(well=wid, td=td, hard=hard, eff=eff, npt=npt, days=days,
                     cost=cost, cpf=cost / td, cpf_norm=(cost / td) / hard))   # divide difficulty OUT
df = pd.DataFrame(rows)
best_eu = (df.eff * (1 - df.npt)).max()                  # technical limit = best efficiency x uptime
vals = df.cpf_norm.values
print("well | TD     hard NPT  | raw $/ft | norm $/ft | pctile (80% CI)  | recoverable")
for _, r in df.sort_values("cpf_norm").iterrows():
    pct = (vals < r.cpf_norm).mean() * 100
    boots = [(np.random.default_rng(k).choice(vals, len(vals), replace=True) < r.cpf_norm).mean() * 100 for k in range(300)]
    lo, hi = np.percentile(boots, [10, 90])
    tl_cost = ((r.td / (110 / r.hard) / 24) / best_eu) * DAYRATE   # cost if drilled at the field's best efficiency
    rec = max(r.cost - tl_cost, 0) / 1e6
    tag = "TECHNICAL LIMIT" if (r.eff * (1 - r.npt)) == best_eu else f"${rec:.1f} MM"
    print(f" {r.well}   | {r.td:5.0f} {r.hard:4.1f} {r.npt:3.0%} | {r.cpf:7.0f}  | {r.cpf_norm:8.0f}  | {pct:3.0f}% ({lo:.0f}-{hi:.0f})    | {tag}")

worst = df.sort_values("cpf_norm").iloc[-1]
print(f"\nVERDICT: raw $/ft would flag well {df.sort_values('cpf').iloc[-1].well} (just the hardest hole). "
      f"Difficulty-normalised, the real underperformer is well {worst.well}: "
      f"{worst.eff:.0%} efficiency + {worst.npt:.0%} NPT, not depth.")

fig, ax = plt.subplots(figsize=(9, 4.5)); x = np.arange(len(df))
order = df.sort_values("cpf_norm")
ax.bar(x - 0.2, order.cpf, 0.4, color="#bbb", label="raw $/ft")
ax.bar(x + 0.2, order.cpf_norm, 0.4, color="#2E8B57", label="difficulty-normalised $/ft")
ax.set_xticks(x); ax.set_xticklabels("well " + order.well); ax.legend(); ax.grid(axis="y", alpha=0.2)
ax.set_ylabel("$/ft"); ax.set_title("Raw vs. Difficulty-Normalised Cost per Foot")
fig.tight_layout(); plt.show()

The two bars tell the whole story. By raw cost-per-foot, well B looks worst, but B drilled the second-deepest, second-hardest hole, and punishing it would teach the wrong lesson. Normalise difficulty out and the real laggard is well E: a shallow hole drilled at just 70% efficiency with 52% non-productive time, the least efficient hole in the field, ranked 83rd-percentile despite being the easiest. The technical limit is set, fittingly, by well D, the deepest, hardest hole, drilled best. Two distinct findings fall out: E is the crew to coach (worst efficiency), while B carries the largest absolute recoverable spend (≈\ $1.6 MM, because it is both deep and inefficiently drilled). What the capital meeting gets is the ranked table with a bootstrap CI on each rank and a next-well target: coach E's practices toward the technical limit, chase B's ≈\$ 1.6 MM of recoverable spend, and do not punish the depth that B and D earned.

Project 4: Reservoir Characterization and Volumetric Uncertainty

The first three projects each answered a question; the last one decides whether to go looking for a better answer. The final question is the biggest: how much oil is in this reservoir, and should we drill the next well? Stripped down, the volumetric equation is grade-school (area × thickness × porosity × oil saturation ÷ shrinkage) and its product is the STOIIP (Stock-Tank Oil Initially In Place: the total barrels in the ground before any is produced). The engineering is in admitting that every one of those inputs is uncertain, propagating that uncertainty with Monte Carlo, reconciling the answer against a deterministic check, and, the part that separates a reservoir engineer from a calculator, turning the uncertainty into a decision about acquiring more data.

main.py

# Field-scale inputs as DISTRIBUTIONS, not point estimates (the whole point).
rng = np.random.default_rng(7); K = 10000
area_acres = rng.triangular(700, 1150, 1900, K)            # mapped area; sparse well control -> wide, right-skewed
net_pay    = rng.normal(34, 3.5, K)                        # ft, field-wide reservoir thickness (cf. P1's per-well band)
porosity   = rng.normal(0.24, 0.025, K)
sw         = rng.normal(0.30, 0.05, K)                     # water saturation
bo         = rng.normal(1.25, 0.05, K)                     # formation volume factor

def stoiip_of(area, net, phi, sw, bo):
    return 7758 * area * np.clip(net, 0, None) * np.clip(phi, 0, 1) * \
           (1 - np.clip(sw, 0, 1)) / np.clip(bo, 1.05, None) / 1e6     # MMSTB (7758 = bbl per acre-ft)

inputs = dict(area=area_acres, net=net_pay, phi=porosity, sw=sw, bo=bo)
stoiip = stoiip_of(**inputs)
p90, p50, p10 = np.percentile(stoiip, [10, 50, 90])
deterministic = stoiip_of(area_acres.mean(), 34, 0.24, 0.30, 1.25)    # mean-input "best estimate"
print(f"Monte-Carlo STOIIP:  P90 {p90:.0f}  P50 {p50:.0f}  P10 {p10:.0f}  MMSTB")
print(f"deterministic (mean inputs): {deterministic:.0f} MMSTB  -- a single number that hides the {p10-p90:.0f}-MMSTB range")
print(f"P50 reserves @ 28% recovery: ~{p50*0.28:.0f} MMbbl")

# Value of information: freeze each input at its mean and see how much the P10-P90
# range collapses -- the input that shrinks it most is the one worth paying to learn.
base = p10 - p90
removed = {}
for name in inputs:
    lo, hi = np.percentile(stoiip_of(**{**inputs, name: inputs[name].mean()}), [10, 90])
    removed[name] = base - (hi - lo)                       # range removed by knowing this input exactly
print("what drives the spread (freeze each input at its mean, measure the range it removes):")
for name in sorted(removed, key=removed.get, reverse=True):
    print(f"  {name:4s}: removes {removed[name]:4.1f} of {base:.0f} MMSTB  ({removed[name]/base:.0%} of the spread)")
print("  -> area dominates: drill one appraisal well to tighten it before committing the development.")

fig, ax = plt.subplots(figsize=(8, 4.5))
ax.hist(stoiip, bins=50, color="#4682B4", alpha=0.8)
for p, lab, c in [(p90, "P90", "#888"), (p50, "P50", "k"), (p10, "P10", "#888")]:
    ax.axvline(p, color=c, ls="--", lw=1.5); ax.text(p, ax.get_ylim()[1]*0.92, f" {lab}={p:.0f}", fontsize=9)
ax.set_xlabel("STOIIP (MMSTB)"); ax.set_ylabel("frequency"); ax.set_title("Monte-Carlo STOIIP - the uncertainty is the answer")
fig.tight_layout(); plt.show()

A manager who asks "how much oil is there?" wants to hear "44 million barrels." The honest answer is "between 30 and 60, most likely 43", and the gap between those numbers, not the midpoint, is the decision. The freeze-one-input test makes the value-of-information argument concrete rather than asserted: knowing the area exactly removes far more of the P10–P90 range than any other input, because well control is sparse and the mapped extent is the real unknown. (These shares do not sum to 100% because STOIIP is a product of uncertain factors, so their ranges compound rather than add; it is the ranking, not the absolute percentages, that points to what is worth measuring.) That converts directly into a recommendation: the uncertainty is worth more to reduce than to ignore, so drill one appraisal well to pin the area before committing hundreds of millions to the full development. What ships to the drill-or-wait decision is the distribution above plus that value-of-information call. A coloured map would have looked more finished; this is more useful.

Exercises

These extend the four projects. Each one forces a decision to change, not just a number to recompute.

fitness_center

Exercise 19.1Practice

: Tune the QC Net

In Project 1, the isolation-forest contamination controls how aggressively the pipeline quarantines log samples. Sweep it from 0.02 to 0.15, and for e...

arrow_forward