Description
We're duplicating a lot of code and a lot of effort by having a bunch of sampler (or rather, InferenceAlgorithm) implementations in Turing.jl itself.
There are a few reasons why this is / was the case:
- (1) The old approach to Gibbs sampling required samplers to hook into the assume and observe statements and to mutate the varinfo in a particular way, even if the functionality of the sampler itself (when used outside of Gibbs) didn't require it.
- (2) The samplers in Turing.jl often offer more convenient constructors, while the sampler packages themselves, e.g. AdvancedHMC.jl, offer more flexible but also more complicated interfaces.
- (3) InferenceAlgorithm allows us to overload the sample call explicitly to do some "non-standard" things, e.g. use chain_type=MCMCChains.Chains as the default instead of chain_type=Vector, which is the default in AbstractMCMC.jl (sketched just below).
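To make (3) concrete, here is a rough sketch of the difference in defaults; the demo model below is made up purely for illustration:

```julia
using Turing  # re-exports sample from AbstractMCMC.jl and Normal from Distributions.jl

@model function demo()
    x ~ Normal()
end

# With the InferenceAlgorithm overloads in Turing.jl, this returns an MCMCChains.Chains:
chn = sample(demo(), NUTS(0.65), 100)

# Opting back into the plain AbstractMCMC.jl behaviour gives the raw vector of transitions:
raw = sample(demo(), NUTS(0.65), 100; chain_type=Vector)
```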
Everything but (3) is "easily" addressable (i.e. only requires dev-time, not necessarily any discussion on how to do it):
- (1) is being addressed in Replace old Gibbs sampler with the experimental one #2328 (issue ref: Remove old Gibbs sampler, make the experimental one the default #2318). This should therefore be addressed very soon.
- (2) should be addressed by simply moving any convenience constructors from Turing.jl itself into the respective sampler package. There's no reason why we should keep convenience constructors in a different package (Turing.jl in this case) than the package implementing the samplers. Effort has been made towards this, e.g. Convenience constructors AdvancedHMC.jl#325, but we need to go through all the samplers and check which are missing "convenience" constructors (a short sketch of the idea follows this list). Related issues that should be addressed in downstream packages: Improve documentation AdvancedMH.jl#107, Can we remove DensityModel? AdvancedMH.jl#108.
- (3) is somewhat tricky. There are a few aspects to this that we need to handle: a) how to default to chain_type=Chains for Turing.jl models (ref: Remove overly specialized bundle_samples AbstractMCMC.jl#120, Default bundle_samples is quite annoying AbstractMCMC.jl#118), b) how to allow extraction of other interesting information than just the realizations of the variables from sample calls, and c) extraction of the parameter names used to construct the chain. See the section below for a more extensive discussion of this issue. Relevant issue: Remove hmc.jl and mh.jl in light of upstreamed "getparams" into AbstractMCMC #2367.
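To make (2) a bit more concrete, this is the kind of convenience constructor we'd like to live in the sampler package itself rather than in Turing.jl. Treat it as a sketch of the idea only; the exact constructor and keyword names in AdvancedHMC.jl may differ:

```julia
using AdvancedHMC

# A one-line constructor of the kind Turing.jl users are used to, but provided by
# AdvancedHMC.jl itself (cf. Convenience constructors AdvancedHMC.jl#325); here 0.8 is
# the target acceptance rate, with the metric/integrator/adaptor defaults filled in:
spl = AdvancedHMC.NUTS(0.8)

# ...as opposed to manually assembling the metric, Hamiltonian, integrator, kernel,
# and adaptor via AdvancedHMC.jl's lower-level (more flexible but more verbose) interface.
```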
Removing the InferenceAlgorithm type (3)
Problem
Currently, most of the code for the samplers in Turing.jl lives outside of Turing.jl, and inside Turing.jl we define a "duplicate" which is not an AbstractMCMC.AbstractSampler (as typically expected by AbstractMCMC.sample) but instead a subtype of Turing.Inference.InferenceAlgorithm:
Turing.jl/src/mcmc/Inference.jl, lines 91 to 95 in c0a4ee9:

```julia
abstract type InferenceAlgorithm end
abstract type ParticleInference <: InferenceAlgorithm end
abstract type Hamiltonian <: InferenceAlgorithm end
abstract type StaticHamiltonian <: Hamiltonian end
abstract type AdaptiveHamiltonian <: Hamiltonian end
```
But precisely because these are not AbstractMCMC.AbstractSamplers, we can overload sample calls to do more than what sample does for a generic AbstractSampler.
One of the things we do is to make chain_type=Chains rather than chain_type=Vector (as is the default in AbstractMCMC.jl):
Turing.jl/src/mcmc/Inference.jl, lines 337 to 359 in c0a4ee9:

```julia
function AbstractMCMC.sample(
    rng::AbstractRNG,
    model::AbstractModel,
    sampler::Sampler{<:InferenceAlgorithm},
    ensemble::AbstractMCMC.AbstractMCMCEnsemble,
    N::Integer,
    n_chains::Integer;
    chain_type=MCMCChains.Chains,
    progress=PROGRESS[],
    kwargs...,
)
    return AbstractMCMC.mcmcsample(
        rng,
        model,
        sampler,
        ensemble,
        N,
        n_chains;
        chain_type=chain_type,
        progress=progress,
        kwargs...,
    )
end
```
Another is to perform some simple model checks to stop the user from doing things they shouldn't, e.g. accidentally using a model twice (this is done using DynamicPPL.check_model):
Turing.jl/src/mcmc/Inference.jl, lines 296 to 306 in c0a4ee9:

```julia
function AbstractMCMC.sample(
    rng::AbstractRNG,
    model::AbstractModel,
    alg::InferenceAlgorithm,
    N::Integer;
    check_model::Bool=true,
    kwargs...,
)
    check_model && _check_model(model, alg)
    return AbstractMCMC.sample(rng, model, Sampler(alg, model), N; kwargs...)
end
```
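For reference, a usage sketch of the check_model keyword handled by the overload above (demo is the illustrative model from earlier, and MH() is just an arbitrary choice of sampler):

```julia
chn = sample(demo(), MH(), 100)                     # runs the DynamicPPL.check_model check first
chn = sample(demo(), MH(), 100; check_model=false)  # opts out of the check
```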
However, as mentioned before, having to repeat all these sampler constructors just to go from an AbstractSampler to an InferenceAlgorithm so that we can do these things is a) very annoying to maintain, and b) very confusing for newcomers who want to contribute.
Now, the problem is that we cannot simply start overloading sample(model::DynamicPPL.Model, sampler::AbstractMCMC.AbstractSampler, ...) calls, since sampler packages might define something like sample(model::AbstractMCMC.AbstractModel, sampler::MySampler, ...) (note that DynamicPPL.Model <: AbstractMCMC.AbstractModel), which would give rise to a host of method ambiguities.
Someone might say "oh, but nobody is going to implement sample(model::AbstractMCMC.AbstractModel, sampler::MySampler, ...); they're always going to implement a sampler for a specific model type, e.g. AbstractMCMC.LogDensityModel", but relying on that is not great for two reasons: a) "meta" samplers, i.e. samplers that use other samplers as components, might want to be agnostic to the underlying model, since the "meta" sampler doesn't interact directly with the model itself, and b) if we do so, we're claiming that DynamicPPL.Model is in some way a special and more important model type than all other subtypes of AbstractModel, which is the exact opposite of what we wanted to achieve with AbstractMCMC.jl (we wanted it to be a "sampler package for all, not just Turing.jl").
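To make the ambiguity concrete, here is a minimal sketch; MySampler and both method definitions are hypothetical:

```julia
using AbstractMCMC, DynamicPPL

struct MySampler <: AbstractMCMC.AbstractSampler end

# What a sampler package might reasonably define:
function AbstractMCMC.sample(model::AbstractMCMC.AbstractModel, spl::MySampler, N::Integer)
    # ... sampler-specific implementation ...
end

# What Turing.jl would like to define for all DynamicPPL models:
function AbstractMCMC.sample(model::DynamicPPL.Model, spl::AbstractMCMC.AbstractSampler, N::Integer)
    # ... Turing-specific conveniences ...
end

# Calling sample(some_dynamicppl_model, MySampler(), 100) is now ambiguous: neither
# method is more specific, since DynamicPPL.Model <: AbstractMCMC.AbstractModel and
# MySampler <: AbstractMCMC.AbstractSampler.
```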
externalsampler, introduced in #2008, is a step towards this, but in the end we don't want to require externalsampler to wrap every sampler passed to Turing.jl; we really only want it to wrap samplers which do not support all the additional niceties that Turing.jl's current sample provides.
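For context, a rough usage sketch of externalsampler, assuming a random-walk Metropolis-Hastings sampler from AdvancedMH.jl (the exact proposal construction may vary between versions):

```julia
using Turing
using AdvancedMH
using LinearAlgebra: I

@model function demo2()
    x ~ Normal()
end

# A plain AbstractMCMC sampler from AdvancedMH.jl, wrapped so that Turing.jl's sample
# machinery (Chains output, parameter names, etc.) can be used with it:
spl = AdvancedMH.RWMH(MvNormal(zeros(1), I))
chn = sample(demo2(), externalsampler(spl), 100)
```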
Solution 1: rename or duplicate sample
The only true solution I see, which is very, very annoying, is to either
- (1) Not export AbstractMCMC.sample from Turing.jl, and instead define and export a separate Turing.sample which is a fancy wrapper around AbstractMCMC.sample.
- (2) Define a new entry point for sample from Turing.jl with a different name, e.g. infer or mcmc (or even reuse the internal mcmcsample name from AbstractMCMC.jl, but make it public).
Neither of these is ideal, tbh.
(1) sucks because so many packages use StatsBase.sample (as we do in AbstractMCMC.jl) for this very reasonable interface, so diverging from it is confusing, and we'll easily end up with naming collisions in the user's namespace, e.g. using Turing, AbstractMCMC would immediately cause two different sample functions to be imported.
(2) is also a bit annoying, as it would be a highly breaking change. And, well, sample is a much better name 🤷
IMHO, (2) is the best option here though. If we define a method called mcmc or mcmcsample (ideally we'd do something with AbstractMCMC.mcmcsample) which is exported from Turing.jl, we could do away with all of InferenceAlgorithm and its implementations in favour of a single overload (or a few overloads) of this method.
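To make that concrete, here is a very rough, hypothetical sketch of what such an entry point could look like; the name mcmc, the keyword handling, and the direct call to DynamicPPL.check_model are all just placeholders for discussion:

```julia
using Random
using AbstractMCMC, DynamicPPL, MCMCChains

# Hypothetical single entry point exported from Turing.jl, replacing the
# InferenceAlgorithm-specific sample overloads:
function mcmc(
    rng::Random.AbstractRNG,
    model::DynamicPPL.Model,
    sampler::AbstractMCMC.AbstractSampler,
    N::Integer;
    check_model::Bool=true,
    chain_type=MCMCChains.Chains,
    kwargs...,
)
    check_model && DynamicPPL.check_model(model)
    return AbstractMCMC.sample(rng, model, sampler, N; chain_type=chain_type, kwargs...)
end

# Convenience method without an explicit RNG:
mcmc(model, sampler, N; kwargs...) = mcmc(Random.default_rng(), model, sampler, N; kwargs...)
```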