48 changes: 48 additions & 0 deletions CONTRIBUTING.md
@@ -0,0 +1,48 @@
# Contributing to bayes-kit

We welcome contributions to the project and we could really use your help to:

* Investigate and fix reported bugs

* Improve the workflow

* Improve the documentation

* Increase test coverage


## Code format

We are using [Black](https://black.readthedocs.io/en/stable/), the uncompromising code formatter.


## Typing

We are using the [mypy](https://mypy.readthedocs.io/en/stable/) static type checker.

## Unit testing

We are using [PyTest](https://docs.pytest.org/en/stable/).

## Git model

Our development process involves the following steps to add code.

1. create an issue on GitHub
2. develop the issue on a branch from `main`
* code
* unit tests
* documentation
3. create a pull request for the branch
4. until accepted,
* get a code review
* fix the pull request according to the reviewer's requests


## Documentation strings

bayes-kit uses [Sphinx](https://www.sphinx-doc.org/en/master/) to generate documentation.

We are following the

* [Google Style Python Docstrings](https://sphinxcontrib-napoleon.readthedocs.io/en/latest/example_google.html#example-google)
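
For example, a minimal Google-style docstring looks like the following (the function and its names here are purely illustrative, not part of the codebase):

```python
import math


def log_density(theta: float) -> float:
    """Return the standard normal log density evaluated at theta.

    Args:
        theta: Parameter value at which to evaluate the log density.

    Returns:
        The log density at theta.

    Raises:
        ValueError: If theta is not finite.
    """
    if not math.isfinite(theta):
        raise ValueError(f"theta must be finite; found {theta=}")
    return -0.5 * theta * theta - 0.5 * math.log(2 * math.pi)
```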
58 changes: 49 additions & 9 deletions README.md
@@ -4,6 +4,9 @@
and posterior analysis with minimal dependencies for maximal
flexibility.

This documentation is for end users; if you would like to contribute code, see
[Contributing to bayes-kit](CONTRIBUTING.md).

## Example

The following example defines a model `StdNormal`, samples 1000 draws
@@ -71,24 +74,38 @@ draws from the target log density, which may be used for Monte Carlo
estimates of posterior expectations and quantiles for uncertainty
quantification.

#### Random-walk Metropolis sampler
#### Metropolis sampler

Metropolis is a diffusive sampler that requires a target log density
function and a symmetric pseudorandom proposal generator.

* **Random-Walk Metropolis (RWM)**: uses a zero-centered normal proposal,
resulting in Markov chains that are random walks
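
As an illustration of the idea only (a minimal sketch with hypothetical names, not bayes-kit's `Metropolis` API), a random-walk Metropolis loop can be written in a few lines:

```python
import numpy as np


def random_walk_metropolis(log_p, theta0, scale=0.5, num_draws=1000, seed=None):
    """Minimal random-walk Metropolis sampler (illustration only)."""
    rng = np.random.default_rng(seed)
    theta = np.asarray(theta0, dtype=float)
    lp = log_p(theta)
    draws = np.empty((num_draws, theta.size))
    for n in range(num_draws):
        # Symmetric, zero-centered normal proposal around the current value.
        proposal = theta + scale * rng.normal(size=theta.size)
        lp_proposal = log_p(proposal)
        # Accept with probability min(1, p(proposal) / p(current)).
        if np.log(rng.uniform()) < lp_proposal - lp:
            theta, lp = proposal, lp_proposal
        draws[n] = theta
    return draws
```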

#### Metropolis-Hastings sampler

Random-walk Metropolis (RWM) is a diffusive sampler that requires a
target log density function and a symmetric pseudorandom proposal
generator.
Metropolis-Hastings (MH) is a sampler that requires a target log
density function, a (not necessarily symmetric) proposal generator,
and a way to evaluate a proposal's log density.
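
The only change relative to the symmetric case is the proposal-density correction in the acceptance ratio; a minimal sketch with hypothetical names (not bayes-kit's `MetropolisHastings` API):

```python
import numpy as np


def mh_accept(rng, log_p, log_q, theta, proposal):
    """Return True if `proposal` is accepted, given the current value `theta`.

    log_p(x) is the target log density; log_q(x, y) is the log density of
    proposing y from x and need not be symmetric in its arguments.
    """
    log_alpha = (
        log_p(proposal) + log_q(proposal, theta)
        - log_p(theta) - log_q(theta, proposal)
    )
    return np.log(rng.uniform()) < log_alpha
```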

#### Metropolis-adjusted Langevin sampler
* **Metropolis-adjusted Langevin (MALA)**: uses a random proposal with a
gradient adjustment, as sketched after this list. MALA requires a target
log density and gradient function.

Metropolis-adjusted Langevin (MALA) is a diffusive sampler that adjusts
proposals with gradient-based information. MALA requires a target log
density and gradient function.
* **Affine-Invariant Walker**: maintains an ensemble of parameter values
and uses complementary values to condition an MH proposal.
Affine-invariant walkers require only a target log density function.
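
The gradient adjustment mentioned for MALA above amounts to drifting the proposal along the gradient and then correcting for the resulting asymmetry. A minimal sketch with hypothetical names (not bayes-kit's `MALA` API), assuming `log_p` and `grad_log_p` callables and an array-valued `theta`:

```python
import numpy as np


def mala_step(rng, log_p, grad_log_p, theta, eps=0.1):
    """One Metropolis-adjusted Langevin step (illustration only)."""
    # Langevin proposal: gradient drift plus Gaussian noise.
    mean_fwd = theta + 0.5 * eps**2 * grad_log_p(theta)
    proposal = mean_fwd + eps * rng.normal(size=theta.size)
    mean_bwd = proposal + 0.5 * eps**2 * grad_log_p(proposal)
    # The proposal is not symmetric, so correct with forward/backward densities.
    log_q_fwd = -np.sum((proposal - mean_fwd) ** 2) / (2 * eps**2)
    log_q_bwd = -np.sum((theta - mean_bwd) ** 2) / (2 * eps**2)
    log_alpha = log_p(proposal) + log_q_bwd - log_p(theta) - log_q_fwd
    if np.log(rng.uniform()) < log_alpha:
        return proposal
    return theta
```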

#### Hamiltonian Monte Carlo sampler

Hamiltonian Monte Carlo (HMC) simulates Hamiltonian dynamics with a
potential energy function equal to the negative log density. It
requires a target log density, gradient function, and optionally a
metric.
metric. Technically, HMC composes a Gibbs kernel that refreshes the
quadratic (i.e., normally distributed) momentum with a deterministic
Metropolis sampler, the proposals for which are generated by following
the Hamiltonian dynamics.
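
A minimal sketch of one such transition with an identity metric (hypothetical names, not bayes-kit's `HMCDiag` API), assuming `log_p` and `grad_log_p` callables and an array-valued `theta`:

```python
import numpy as np


def hmc_step(rng, log_p, grad_log_p, theta, eps=0.1, steps=10):
    """One HMC transition with an identity metric (illustration only)."""
    # Gibbs refresh of the momentum from a standard normal.
    rho = rng.normal(size=theta.size)
    theta_new, rho_new = theta.copy(), rho.copy()
    # Leapfrog integration of the Hamiltonian dynamics.
    rho_new += 0.5 * eps * grad_log_p(theta_new)
    for _ in range(steps):
        theta_new += eps * rho_new
        rho_new += eps * grad_log_p(theta_new)
    rho_new -= 0.5 * eps * grad_log_p(theta_new)
    # Metropolis accept/reject on the proposed (position, momentum) pair.
    h_current = -log_p(theta) + 0.5 * rho @ rho
    h_proposal = -log_p(theta_new) + 0.5 * rho_new @ rho_new
    if np.log(rng.uniform()) < h_current - h_proposal:
        return theta_new
    return theta
```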


### Sequential Monte Carlo samplers

@@ -108,6 +125,29 @@ p(theta | y)^t[n] * p(theta),
where the temperature `t[n]` runs from 0 to 1 across iterations.
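
A minimal sketch of one common tempering path (hypothetical names, not bayes-kit's `TemperedLikelihoodSMC` API), in which the likelihood is raised to the temperature so that `t = 0` gives the prior and `t = 1` gives the posterior:

```python
import numpy as np


def tempered_log_density(log_prior, log_likelihood, theta, t):
    """Log density of the prior times the likelihood raised to the power t.

    At t = 0 this is the prior; at t = 1 it is the posterior (up to a constant).
    """
    return t * log_likelihood(theta) + log_prior(theta)


# A simple, evenly spaced temperature ladder from prior to posterior.
temperatures = np.linspace(0.0, 1.0, num=11)
```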



### Posterior analysis

#### R-hat

R-hat is a statistic over multiple Markov chains that converges to 1
if the chains have the same stationary distribution (under some
mild assumptions).
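
A minimal numpy sketch of the common split-chain variant (illustration only; not necessarily how bayes-kit computes it):

```python
import numpy as np


def split_rhat(chains):
    """Split R-hat for `chains` of shape (num_chains, num_draws) (illustration only)."""
    m, n = chains.shape
    # Split each chain in half so within-chain drift also inflates R-hat.
    halves = chains[:, : 2 * (n // 2)].reshape(2 * m, n // 2)
    within = np.mean(np.var(halves, axis=1, ddof=1))
    between = halves.shape[1] * np.var(np.mean(halves, axis=1), ddof=1)
    var_plus = ((halves.shape[1] - 1) * within + between) / halves.shape[1]
    return np.sqrt(var_plus / within)
```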


#### Effective sample size

The effective sample size of a Markov chain for estimating a specific
expectation is the number of independent draws that would lead to the
same standard error.
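
Equivalently, the effective sample size discounts the nominal sample size by the chain's autocorrelation. A crude single-chain sketch (illustration only, with a naive truncation rule), assuming `draws` is a 1-D numpy array for a single parameter:

```python
import numpy as np


def ess_sketch(draws):
    """Crude effective sample size for one chain and one parameter."""
    n = draws.size
    centered = draws - draws.mean()
    # Empirical autocorrelations at lags 0, 1, ..., n - 1.
    autocov = np.correlate(centered, centered, mode="full")[n - 1 :] / n
    rho = autocov / autocov[0]
    tau = 1.0
    for lag in range(1, n):
        if rho[lag] < 0.05:  # naive cutoff for the autocorrelation sum
            break
        tau += 2.0 * rho[lag]
    return n / tau
```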

#### Standard error

The standard error of an unbiased estimate of an expectation gives the
scale of the estimate's approximately normal error distribution.
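
For Markov chain estimates of a posterior mean, the standard error is commonly estimated by replacing the raw sample size with the effective sample size; a minimal sketch, assuming an ESS estimate such as the one above:

```python
import numpy as np


def mcmc_standard_error(draws, ess):
    """Monte Carlo standard error of the mean of `draws` (illustration only)."""
    return np.std(draws, ddof=1) / np.sqrt(ess)
```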



## Dependencies

`bayes-kit` only depends on a single external package,
3 changes: 2 additions & 1 deletion bayes_kit/__init__.py
@@ -1,13 +1,14 @@
from .hmc import HMCDiag
from .mala import MALA
from .metropolis import Metropolis, MetropolisHastings
from .ensemble import Stretcher
from .ensemble import AffineInvariantWalker
from .smc import TemperedLikelihoodSMC

__all__ = [
"HMCDiag",
"MALA",
"Metropolis",
"MetropolisHastings",
"AffineInvariantWalker",
"TemperedLikelihoodSMC",
]
224 changes: 164 additions & 60 deletions bayes_kit/ensemble.py
@@ -1,68 +1,172 @@
from typing import Callable, Optional, Tuple
from typing import Any, Iterator, Optional, Union
from numpy.typing import NDArray
import numpy as np

from .model_types import LogDensityModel

Sample = NDArray[np.float64]


class AffineInvariantWalker:
"""The affine-invariant ensemble sampler with stretch updates.

References:
Goodman, J. and Weare, J., 2010. Ensemble samplers with affine invariance.
*Communications in Applied Mathematics and Computational Science*
5(1):65--80.

Attributes:
_model (LogDensityModel): The statistical model being sampled.
_dim (int): The number of model dimensions.
_a (np.float64): The upper bound of interpolation ratio sampling (must be > 1, default 2).
_sqrt_a (np.float64): The square root of `_a`.
_inv_sqrt_a (np.float64): The inverse square root of `_a`.
_num_walkers (int): The number of ensemble members.
_half_num_walkers (int): Half the number of walkers.
_drawshape (tuple[int, int]): The number of walkers by the number of dimensions.
_thetas (NDArray[np.float64]): The ensemble of draws (`_num_walkers` x `_dim`).
_lp_thetas (list[float]): The log densities of the walkers (length `_num_walkers`).
_first_range (range): Indexes of the first half of `_thetas`.
_second_range (range): Indexes of the second half of `_thetas`.
_rng (np.random.Generator): The pseudorandom number generator.

class Stretcher:
"""
Goodman, J. and Weare, J., 2010. Ensemble samplers with affine invariance.
*Communications in Applied Mathematics and Computational Science*
5(1):65--80.
"""

# def __init__(
# self,
# model: LogDensityModel,
# a: Optional[float] = None,
# walkers: Optional[int] = None
# init: Optional[NDarray[np.float64]] = None)
# ):
# self._model = model
# self._dim = self._model.dims()
# if a != None and a < 1:
# raise ValueError(f"stretch bound must be greater than or equal to 1; found {a=}")
# self._a = a
# self._sqrt_a = np.sqrt(a)
# self._inv_sqrt_a = 1 / self._sqrt_a
# if walkers != NONE and (walkers <= 0 or walkers % 2 != 0) :
# raise ValueError(f"walkers must be strictly positive, even integer; found {walkers=}")
# self._walkers = walkers or 2 * self._dim
# self._halfwalkers = a / 2
# self._drawshape = (self._walkers, self._dim)
# if init != None and init.shape != self._drawshape:
# raise ValueError(f"init must be shape of draw {self._drawshape}; found {init.shape=}")
# self._thetas = init or np.random.normal(size=self._drawshape)
# self._firsthalf = range(halfwalkers)
# self._secondhalf = range(halfwalkers, walkers)

# def __iter__(self):
# return self

# def __next__(self):
# return self.sample

# def draw_z(self):
# """Return random draw z in (1/a, a) with p(z) propto 1 / sqrt(z)"""
# return np.square(np.random.uniform(self._inv_sqrt_a, self._sqrt_a))

# def stretch_move(self, theta_k: NDarray[np.float64], theta_j: NDarray[np.float64]):
# z = self.draw_z()
# theta_star = theta_j + z * (theta_k - theta_j) # (1 - z) * theta_j + z * theta_k
# log_q = (self._dims - 1) * np.log(z) + self._model.log_density(theta_star) - self._model.log_density(theta_k)
# if np.log(np.random.uniform()) < log_q:
# return theta_star
# return theta_k

# def sample(self) -> NDarray[np.float64]
# js = np.random.choice(secondhalf, size=self._halfwalkers)
# for k in firsthalf:
# self._thetas[k] = stretch_move(self._thetas[k], self._thetas[js[k]])
# js = np.random.choice(firsthalf, size=self._halfwalkers)
# for k in secondhalf:
# self_thetas[k] = stretch_move(self._thetas[k], self._thetas[js[k]])
# return self._thetas


# TODO(carpenter): cache log density rather than recomputing for self
def __init__(
self,
model: LogDensityModel,
a: Optional[float] = None,
num_walkers: Optional[int] = None,
init: Optional[NDArray[np.float64]] = None,
seed: Union[None, int, np.random.BitGenerator, np.random.Generator] = None,
):
"""Initialize the sampler with model, and optionally bounds, size, and initial values.

The class instance stores the model, bounds on the proposal on
the square root scale, and the number of walkers. The
initialization is used for the value of the parameters *before*
the first draw; the initialization will *not* be returned as
one of the draws.

Arguments:
model (LogDensityModel): The class used to evaluate log densities.
a (Union[None, float]): The upper bound on the interpolation ratio proposal (must be > 1, default 2).
num_walkers (Union[None, int]): An even number of walkers to use (default: twice the dimensionality of `model`).
init (Union[None, NDArray[np.float64]]): An array of shape `num_walkers` x `dim` of initial values (default standard normal).
seed (Union[None, int, np.random.BitGenerator, np.random.Generator]): Pseudo-RNG seed or generator (default system generated).

Raises:
ValueError: If `a` is provided and is not strictly greater than 1, if `num_walkers` is provided and is not a strictly positive even integer, or if `init` is provided and is not an `NDArray` of shape `num_walkers` x `dim`.
"""
# if not isinstance(model, LogDensityModel):
# raise TypeError("model must follow the protocol LogDensityModel")
if not (a is None or isinstance(a, float) or isinstance(a, int)):
raise TypeError(f"a must be None, float, or int, found {a=}")
if not (num_walkers is None or isinstance(num_walkers, int)):
raise TypeError(f"num_walkers must be int, found {type(num_walkers)=}")
if not (init is None or isinstance(init, np.ndarray)):
raise TypeError("init must be None or NDArray")
if not (
seed is None
or isinstance(seed, int)
or isinstance(seed, np.random.BitGenerator)
or isinstance(seed, np.random.Generator)
):
raise TypeError(
"seed must be None, int, np.random.BitGenerator, or np.random.Generator; found {type(seed)=}"
)
self._model = model
self._dim = self._model.dims()
if a is not None and np.float64(a) <= 1:
raise ValueError(
f"stretch bound must be strictly greater than 1; found {a=}"
)
self._a = np.float64(a or 2.0)
self._sqrt_a = np.sqrt(self._a)
self._inv_sqrt_a = 1 / self._sqrt_a
if num_walkers is None:
self._num_walkers = 2 * self._dim
else:
if num_walkers < 2 or num_walkers % 2 != 0:
raise ValueError(
f"number of walkers must be strictly positive, even integer; found {num_walkers=}"
)
self._num_walkers = num_walkers
self._half_num_walkers = self._num_walkers // 2
self._drawshape = (int(self._num_walkers), self._dim)
self._rng = np.random.default_rng(seed)
if init is None:
self._thetas = self._rng.normal(size=self._drawshape)
else:
if init.shape != self._drawshape:
raise ValueError(
f"init must be shape of draw {self._drawshape}; found {init.shape=}"
)
self._thetas = init
self._lp_thetas = [self._model.log_density(theta) for theta in self._thetas]
self._first_range = range(0, int(self._half_num_walkers))
self._second_range = range(int(self._half_num_walkers), int(self._num_walkers))

def __iter__(self) -> Iterator[Sample]:
"""Return an infinite iterator for ensemble sampling.

Returns:
An iterator generating samples.
"""
return self

def __next__(self) -> Sample:
"""Return the next ensemble sample (`_num_walkers` x `_dim`).

Returns:
The next sample.
"""
return self.sample()

def _draw_z(self) -> np.float64:
"""Return a random draw of `z` in `(1/a, a)` with `p(z) propto 1 / sqrt(z)`.

Returns:
A random draw of `z`.
"""
draw: np.float64 = np.square(
self._rng.uniform(self._inv_sqrt_a, self._sqrt_a)
)
return draw

def _stretch_move(self, k: int, j: int) -> Any:
"""Update the walkers with a single stretch move.

Arguments:
k (int): walker to update
j (int): complementary walker with which to interpolate/extrapolate
"""
theta_k = self._thetas[k]
lp_theta_k = self._lp_thetas[k]
theta_j = self._thetas[j]
z = self._draw_z()
theta_star: NDArray[np.float64] = theta_j + z * (theta_k - theta_j)
lp_theta_star = self._model.log_density(theta_star)
log_q = (self._dim - 1) * np.log(z) + lp_theta_star - lp_theta_k
log_u = np.log(self._rng.uniform())
if log_u < log_q:
self._thetas[k] = theta_star
self._lp_thetas[k] = lp_theta_star

def sample(self) -> Sample:
"""Return an ensemble draw (`_num_walkers` x `_dim`).

Returns:
An ensemble draw.
"""
js = self._rng.choice(
self._second_range, size=self._half_num_walkers, replace=False
)
for k in self._first_range:
self._stretch_move(k, js[k])
js = self._rng.choice(
self._first_range, size=self._half_num_walkers, replace=False
)
for k in self._second_range:
self._stretch_move(k, js[k - self._half_num_walkers])
return self._thetas