From statsmodels¤

This page maps common statsmodels patterns to their glmax equivalents. The two libraries share most of the same statistical concepts — GLM families, link functions, offsets, robust standard errors — but differ in API shape.

The key structural difference: statsmodels bundles fitting, inference, and prediction into a single result object. glmax separates them into explicit verbs that return explicit nouns. There's no .summary() method; inference lives in glmax.infer.

Fitting a GLM¤

statsmodels exposes a mix of family-specific constructors (sm.OLS, sm.Logit, sm.Poisson) and a generic sm.GLM. glmax uses one entry point for everything — the family is an argument, not part of the class name.

statsmodels:

import statsmodels.api as sm
import numpy as np

X = sm.add_constant(X_raw)   # statsmodels adds the intercept for you

result = sm.OLS(y, X).fit()                                      # Gaussian
result = sm.Logit(y, X).fit()                                    # Binomial / logit
result = sm.Poisson(y, X).fit()                                  # Poisson
result = sm.GLM(y, X, family=sm.families.Gamma()).fit()          # Gamma
result = sm.NegativeBinomial(y, X).fit()                         # Negative Binomial

glmax:

import jax.numpy as jnp
import glmax

# glmax does not add an intercept automatically — include it in X
X = jnp.column_stack([jnp.ones(n), X_raw])

fitted = glmax.fit(glmax.Gaussian(), X, y)
fitted = glmax.fit(glmax.Binomial(), X, y)
fitted = glmax.fit(glmax.Poisson(), X, y)
fitted = glmax.fit(glmax.Gamma(), X, y)
fitted = glmax.fit(glmax.NegativeBinomial(), X, y)

Fitting returns a FittedGLM noun. Inference is a separate step:

result = glmax.infer(fitted)
result.params.beta   # coefficients  (cf. result.params)
result.se            # standard errors  (cf. result.bse)
result.stat          # test statistics  (cf. result.tvalues)
result.p             # p-values  (cf. result.pvalues)

fitted.mu            # fitted means  (cf. result.fittedvalues)
glmax.predict(fitted.family, fitted.params, X_new)   # out-of-sample  (cf. result.predict)

Fitted values are on the response scale

Like statsmodels fitted values, fitted.mu and glmax.predict(...) are response-scale means. For Binomial(n_trials=N), the inverse link gives a success probability \(p\), but fitted values are expected success counts \(Np\) when y is encoded as counts.

For Negative Binomial the overdispersion parameter lives in fitted.params.aux rather than appended to the coefficient vector.

Overriding the link function¤

statsmodels:

result = sm.GLM(
    y, X,
    family=sm.families.Binomial(sm.families.links.Probit())
).fit()

glmax:

fitted = glmax.fit(glmax.Binomial(glmax.ProbitLink()), X, y)

The link is passed directly to the family constructor. See Families & Links for all supported combinations.

Offsets and Exposure¤

Both libraries support offsets: fixed terms added to the linear predictor before converting \(\eta\) to response-scale fitted values.

statsmodels:

result = sm.GLM(
    y,
    X,
    family=sm.families.Poisson(),
    offset=np.log(exposure),
).fit()

glmax:

fitted = glmax.fit(glmax.Poisson(), X, y, offset=jnp.log(exposure))

For Poisson and Negative Binomial models with a log link, offset=log(exposure) models expected counts:

\[ \mu = \exp(X\beta + \log(\mathrm{exposure})) = \mathrm{exposure} \cdot \exp(X\beta). \]

glmax.predict(...) returns expected counts for the supplied exposure. Divide by exposure to get rates.

Do not pass raw exposure as offset

statsmodels has some model APIs that distinguish offset= from exposure=. glmax keeps one primitive: offset. For exposure workflows, pass jnp.log(exposure).

Robust (sandwich) standard errors¤

statsmodels:

result = sm.Poisson(y, X).fit(cov_type="HC3")

glmax:

result = glmax.infer(fitted, stderr=glmax.HuberError())

Residuals¤

statsmodels:

result.resid_pearson    # Pearson residuals
result.resid_deviance   # deviance residuals

glmax:

pearson  = glmax.check(fitted, diagnostic=glmax.PearsonResidual())
deviance = glmax.check(fitted, diagnostic=glmax.DevianceResidual())
quantile = glmax.check(fitted, diagnostic=glmax.QuantileResidual())

Quantile residuals are also available — they're randomised probability integral transform residuals and are especially useful for discrete families like Poisson and Binomial, where Pearson and deviance residuals don't follow a clean reference distribution.

Goodness-of-fit statistics¤

statsmodels:

result.deviance          # residual deviance
result.pearson_chi2      # Pearson χ²
result.aic               # AIC

glmax:

gof = glmax.check(fitted, diagnostic=glmax.GoodnessOfFit())
gof.deviance       # residual deviance
gof.pearson_chi2   # Pearson χ²
gof.aic            # AIC

What glmax doesn't have (yet)¤

A few things statsmodels provides that glmax doesn't yet support:

.summary() formatted tables — inference results are structured arrays, not formatted text. Use result.params.beta, result.se, etc. directly.
Dispersion inference — standard errors on the dispersion parameter for Gaussian and Gamma are on the roadmap.
Weights — observation weights are not part of the core grammar yet.

What glmax adds¤

Beyond the API differences, glmax is built on JAX, which gives you:

Batched fitting — fit the same model to hundreds of response vectors simultaneously with eqx.filter_vmap.
Gradients through fit — jax.grad and jax.jvp work through the fit itself via an Implicit Function Theorem-based custom derivative rule.
JIT compilation — all verbs compile on first call and run fast on CPU, GPU, or TPU thereafter.

See JAX Transformations for examples.