gpytorch.settings

class gpytorch.settings.cg_tolerance(value)[source]

Relative residual tolerance to use for terminating CG.

(Default: 1)
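
Every entry in this module is a context manager: the value only applies inside the corresponding with block. A minimal sketch (the toy data and model below are illustrative, not from the docs):

    import torch
    import gpytorch

    train_x = torch.linspace(0, 1, 100)
    train_y = torch.sin(train_x * 6.28) + 0.1 * torch.randn(100)

    class ToyGP(gpytorch.models.ExactGP):
        def __init__(self, train_x, train_y, likelihood):
            super().__init__(train_x, train_y, likelihood)
            self.mean_module = gpytorch.means.ConstantMean()
            self.covar_module = gpytorch.kernels.ScaleKernel(gpytorch.kernels.RBFKernel())

        def forward(self, x):
            return gpytorch.distributions.MultivariateNormal(
                self.mean_module(x), self.covar_module(x)
            )

    likelihood = gpytorch.likelihoods.GaussianLikelihood()
    model = ToyGP(train_x, train_y, likelihood)
    mll = gpytorch.mlls.ExactMarginalLogLikelihood(likelihood, model)
    model.train()

    # Tighter CG tolerance for this loss evaluation only. CG is only used once
    # the kernel matrix exceeds max_cholesky_size; on a toy problem of this
    # size, Cholesky would normally be used instead.
    with gpytorch.settings.cg_tolerance(0.01):
        loss = -mll(model(train_x), train_y)

Later sketches in this section reuse model, likelihood, mll, train_x, and train_y from this example.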

class gpytorch.settings.cholesky_jitter(float=None, double=None, half=None)[source]

The jitter value used by psd_safe_cholesky when using Cholesky solves.

  • Default for float: 1e-6
  • Default for double: 1e-8

class gpytorch.settings.cholesky_max_tries(value)[source]

The max_tries value used by psd_safe_cholesky when using Cholesky solves.

(Default: 3)

class gpytorch.settings.ciq_samples(state=True)[source]

Whether to draw samples using Contour Integral Quadrature or not. This may be slower than standard sampling methods for N < 5000. However, it should be faster with larger matrices.

As described in the paper:

Fast Matrix Square Roots with Applications to Gaussian Processes and Bayesian Optimization.

(Default: False)
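
A minimal sketch (reusing the toy model from the cg_tolerance example above): draw posterior samples through CIQ, paired with num_contour_quadrature.

    model.eval()
    test_x = torch.linspace(0, 1, 51)
    with torch.no_grad(), gpytorch.settings.ciq_samples(True), \
            gpytorch.settings.num_contour_quadrature(15):
        posterior = model(test_x)
        samples = posterior.rsample(torch.Size([8]))  # 8 samples of length 51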

class gpytorch.settings.debug(state=True)[source]

Whether or not to perform “safety” checks on the supplied data (for example, that the correct training data is supplied in Exact GP training mode). Disabling these checks gives fewer data checks and fewer warning messages, at the cost of possibly supplying incorrect data or accidentally leaving the model in the wrong mode.

(Default: True)

class gpytorch.settings.detach_test_caches(state=True)[source]

Whether or not to detach caches computed for making predictions. In most cases, you will want this, as this will speed up derivative computations of the predictions with respect to test inputs. However, if you also need derivatives with respect to training inputs (e.g., because you have fantasy observations), then you must disable this.

(Default: True)
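
A minimal sketch (reusing the toy model above): with the default setting, gradients of the predictive mean with respect to the test inputs are still available; disable the setting only when gradients must also flow back to the training inputs.

    model.eval()
    test_x = torch.linspace(0, 1, 20).requires_grad_(True)
    pred = likelihood(model(test_x))      # caches are detached by default
    pred.mean.sum().backward()            # d(mean)/d(test_x)
    grad_wrt_test = test_x.grad

    # If training-input gradients are needed (e.g. fantasy observations):
    # with gpytorch.settings.detach_test_caches(False): ...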

class gpytorch.settings.deterministic_probes(state=True)[source]

Whether or not to resample probe vectors every iteration of training. If True, we use the same set of probe vectors for computing log determinants each iteration. This introduces a small amount of bias into the MLL, but allows us to compute a deterministic estimate of it, which makes optimizers like L-BFGS more viable choices.

NOTE: Currently, probe vectors are cached in a global scope. Therefore, this setting cannot be used if multiple independent GP models are being trained in the same context (i.e., it works fine with a single GP model)

(Default: False)
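
A minimal sketch of the intended use with L-BFGS (reusing the toy model and mll from above); fixing the probe vectors makes the stochastic logdet estimate deterministic across closure evaluations.

    model.train()
    optimizer = torch.optim.LBFGS(model.parameters(), max_iter=20)

    def closure():
        optimizer.zero_grad()
        with gpytorch.settings.deterministic_probes(True):
            loss = -mll(model(train_x), train_y)
        loss.backward()
        return loss

    optimizer.step(closure)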

class gpytorch.settings.eval_cg_tolerance(value)[source]

Relative residual tolerance to use for terminating CG when making predictions.

(Default: 1e-2)

class gpytorch.settings.fast_computations(covar_root_decomposition=True, log_prob=True, solves=True)[source]

This feature flag controls whether or not to use fast approximations to various mathematical functions used in GP inference. The functions that can be controlled are:

  • covar_root_decomposition
    This feature flag controls how matrix root decompositions (\(K = L L^\top\)) are computed (e.g. for sampling, computing caches, etc.).
    • If set to True,
      covariance matrices \(K\) are decomposed with low-rank approximations \(L L^\top\), (\(L \in \mathbb R^{n \times k}\)) using the Lanczos algorithm. This is faster for large matrices and exploits structure in the covariance matrix if applicable.
    • If set to False,
      covariance matrices \(K\) are decomposed using the Cholesky decomposition.
  • log_prob
    This feature flag controls how to compute the marginal log likelihood for exact GPs and log_prob for multivariate normal distributions.
  • solves
    This feature flag controls how to compute the solves of positive-definite matrices.
    • If set to True,
      Solves are computed with preconditioned conjugate gradients.
    • If set to False,
      Solves are computed using the Cholesky decomposition.

Warning

Setting this to False will compute a complete Cholesky decomposition of covariance matrices. This may be infeasible for GPs with structured covariance matrices.

By default, approximations are used for all of these functions (except for solves). Setting any of them to False will use exact computations instead.

See also

  • linear_operator.settings.max_root_decomposition_size
    (to control the size of the low rank decomposition used)
  • linear_operator.settings.num_trace_samples
    (to control the stochasticity of the fast log_prob estimates)
covar_root_decomposition

alias of _fast_covar_root_decomposition

log_prob

alias of _fast_log_prob

solves

alias of _fast_solves
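
A minimal sketch (reusing the toy model and mll from above): disable all three approximations to force exact, Cholesky-based computations, e.g. when sanity-checking results on a small problem.

    with gpytorch.settings.fast_computations(
        covar_root_decomposition=False, log_prob=False, solves=False
    ):
        exact_loss = -mll(model(train_x), train_y)

    # The aliases listed above can also be used individually, e.g.
    # with gpytorch.settings.fast_computations.log_prob(False): ...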

class gpytorch.settings.fast_pred_var(state=True, num_probe_vectors=1)[source]

Fast predictive variances using Lanczos Variance Estimates (LOVE). Use this for improved performance when computing predictive variances.

As described in the paper:

Constant-Time Predictive Distributions for Gaussian Processes.

See also: gpytorch.settings.max_root_decomposition_size (to control the size of the low rank decomposition used for variance estimates).

(Default: False)
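
A minimal sketch of the usual test-time pattern (reusing the toy model above):

    model.eval()
    likelihood.eval()
    test_x = torch.linspace(0, 1, 51)
    with torch.no_grad(), gpytorch.settings.fast_pred_var():
        observed_pred = likelihood(model(test_x))
        mean = observed_pred.mean
        lower, upper = observed_pred.confidence_region()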

class gpytorch.settings.fast_pred_samples(state=True)[source]

Fast predictive samples using Lanczos Variance Estimates (LOVE). Use this for improved performance when sampling from the predictive posterior.

As described in the paper:

Constant-Time Predictive Distributions for Gaussian Processes.

See also: gpytorch.settings.max_root_decomposition_size (to control the size of the low rank decomposition used for samples).

(Default: False)
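
A minimal sketch (reusing the toy model and test_x from above): combine the LOVE caches for variances and samples at test time.

    with torch.no_grad(), gpytorch.settings.fast_pred_var(), \
            gpytorch.settings.fast_pred_samples():
        posterior = model(test_x)
        samples = posterior.rsample(torch.Size([16]))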

class gpytorch.settings.lazily_evaluate_kernels(state=True)[source]

Lazily compute the entries of covariance matrices (set to True by default). This can result in memory and speed savings, e.g. if cross-covariance terms are not needed or if you only need to compute variances (not covariances).

If set to False, gpytorch will always compute the entire covariance matrix between training and test data.

(Default: True)

class gpytorch.settings.linalg_dtypes(default=torch.float64, symeig=None, cholesky=None)[source]

Whether to perform less stable linalg calls in double precision or in a lower precision. Currently, the default is to apply all symeig calls and cholesky calls within variational methods in double precision.

(Default: torch.double)

class gpytorch.settings.max_eager_kernel_size(value)[source]

If the joint train/test covariance matrix is less than this size, then we will avoid as much lazy evaluation of the kernel as possible.

(Default: 512)

class gpytorch.settings.max_cholesky_size(value)[source]

If the size of a LinearOperator is less than max_cholesky_size, then root_decomposition and solve of LinearOperator will use Cholesky rather than Lanczos/CG.

(Default: 800)
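
A minimal sketch (reusing the toy model and mll from above): lower the threshold to force the iterative (CG/Lanczos) path, or raise it to force Cholesky everywhere.

    with gpytorch.settings.max_cholesky_size(0):        # always use CG/Lanczos
        loss_iterative = -mll(model(train_x), train_y)

    with gpytorch.settings.max_cholesky_size(10000):    # always use Cholesky
        loss_cholesky = -mll(model(train_x), train_y)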

class gpytorch.settings.max_cg_iterations(value)[source]

The maximum number of conjugate gradient iterations to perform (when computing matrix solves). A higher value rarely results in more accurate solves – instead, lower the CG tolerance.

(Default: 1000)

class gpytorch.settings.max_lanczos_quadrature_iterations(value)[source]

The maximum number of Lanczos iterations to perform when doing stochastic Lanczos quadrature. This is ONLY used for log determinant calculations and computing \(\operatorname{Tr}(K^{-1} \, dK/d\theta)\).

(Default: 20)

class gpytorch.settings.max_preconditioner_size(value)[source]

The maximum size of preconditioner to use. 0 corresponds to turning preconditioning off. When enabled, a value of around 10 usually works fairly well.

(Default: 15)
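
A minimal sketch (reusing the toy model and mll from above) of knobs that are typically tuned together when CG struggles on a large, ill-conditioned kernel matrix:

    with gpytorch.settings.max_preconditioner_size(10), \
            gpytorch.settings.max_cg_iterations(2000), \
            gpytorch.settings.cg_tolerance(0.1):
        loss = -mll(model(train_x), train_y)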

class gpytorch.settings.max_root_decomposition_size(value)[source]

The maximum number of Lanczos iterations to perform. This is used when 1) computing variance estimates, 2) drawing samples from MVNs, or 3) performing kernel multiplication. Higher values result in higher accuracy.

(Default: 100)

class gpytorch.settings.memory_efficient(state=True)[source]

Whether or not to use Toeplitz math with gridded data and grid inducing point modules. Pros: memory efficient, faster on CPU. Cons: slower on GPUs with < 10000 inducing points.

(Default: False)

class gpytorch.settings.min_preconditioning_size(value)[source]

If the size of a LinearOperator is less than min_preconditioning_size, then we won’t use pivoted Cholesky based preconditioning.

(Default: 2000)

class gpytorch.settings.min_variance(float=None, double=None, half=None)[source]

The minimum variance that can be returned from MultivariateNormal#variance. If variances are smaller than this, they are rounded up and a warning is raised.

  • Default for float: 1e-6
  • Default for double: 1e-10
  • Default for half: 1e-3

class gpytorch.settings.minres_tolerance(value)[source]

Relative update term tolerance to use for terminating MINRES.

(Default: 1e-4)

class gpytorch.settings.num_contour_quadrature(value)[source]

The number of quadrature points used to compute CIQ.

(Default: 15)

class gpytorch.settings.num_gauss_hermite_locs(value)[source]

The number of Gauss-Hermite quadrature locations to use when computing expected log likelihoods under the latent GP. This is used in variational inference and training.

(Default: 20)

class gpytorch.settings.num_likelihood_samples(value)[source]

The number of samples to draw from a latent GP when computing a likelihood. This is used in variational inference and training.

(Default: 10)
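
A minimal sketch, assuming an approximate GP model vgp with a likelihood and an ELBO objective elbo (e.g. built with gpytorch.mlls.VariationalELBO); these names are illustrative. More likelihood samples give a lower-variance ELBO estimate at extra cost.

    with gpytorch.settings.num_likelihood_samples(32):
        loss = -elbo(vgp(train_x), train_y)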

class gpytorch.settings.num_trace_samples(value)[source]

The number of samples to draw when stochastically computing the trace of a matrix. More samples result in more accurate trace estimates. If the value is set to 0, then the trace will be deterministically computed.

(Default: 10)

class gpytorch.settings.preconditioner_tolerance(value)[source]

Diagonal trace tolerance to use for checking preconditioner convergence.

(Default: 1e-3)

class gpytorch.settings.prior_mode(state=True)[source]

If set to true, GP models will be evaluated in prior mode. This allows evaluating any Exact GP model in prior mode, even if it has training data / targets.

(Default: False)
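
A minimal sketch (reusing the toy model above): evaluate the model under its prior, ignoring the training data it was constructed with.

    model.eval()
    test_x = torch.linspace(0, 1, 51)
    with torch.no_grad(), gpytorch.settings.prior_mode(True):
        prior_dist = model(test_x)                      # prior at test_x
        prior_samples = prior_dist.rsample(torch.Size([5]))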

class gpytorch.settings.sgpr_diagonal_correction(state=True)[source]

If set to true, during posterior prediction the variances of the InducingPointKernel will be corrected to match the variances of the exact kernel.

If false then no such correction will be performed (this is the default in other libraries).

(Default: True)

class gpytorch.settings.skip_logdet_forward(state=True)[source]

This feature does not affect the gradients returned by gpytorch.distributions.MultivariateNormal.log_prob() (used by gpytorch.mlls.MarginalLogLikelihood). The gradients remain unbiased estimates, and therefore can be used with SGD. However, the actual likelihood value returned by the forward pass will skip certain computations (i.e. the logdet computation), and will therefore be an improper estimate.

If you’re using SGD (or a variant) to optimize parameters, you probably don’t need an accurate MLL estimate; you only need accurate gradients. So this setting may give your model a performance boost.

(Default: False)

class gpytorch.settings.skip_posterior_variances(state=True)[source]

Whether or not to skip the posterior covariance matrix when doing an ExactGP forward pass. If this is on, the returned gpytorch MultivariateNormal will have a ZeroLinearOperator as its covariance matrix. This allows gpytorch to not compute the covariance matrix when it is not needed, speeding up computations.

(Default: False)
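
A minimal sketch (reusing the toy model above): request posterior means only; the returned covariance is a ZeroLinearOperator.

    model.eval()
    test_x = torch.linspace(0, 1, 51)
    with torch.no_grad(), gpytorch.settings.skip_posterior_variances(True):
        mean_only = model(test_x).mean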

class gpytorch.settings.terminate_cg_by_size(state=True)[source]

If set to true, CG will terminate after n iterations for an n x n matrix.

(Default: False)

class gpytorch.settings.trace_mode(state=True)[source]

If set to True, we will generally try to avoid calling our built-in PyTorch functions, because these cannot be run through torch.jit.trace.

Note that this will sometimes involve explicitly evaluating lazy tensors and various other slowdowns and inefficiencies. As a result, you really shouldn’t use this feature unless you are calling torch.jit.trace on a GPyTorch model.

Our hope is that this flag will not be necessary long term, once https://github.com/pytorch/pytorch/issues/22329 is fixed.

(Default: False)

class gpytorch.settings.tridiagonal_jitter(value)[source]

The (relative) amount of noise to add to the diagonal of tridiagonal matrices before eigendecomposing. root_decomposition becomes slightly more stable with this, as we need to take the square root of the eigenvalues. Any eigenvalues still negative after adding jitter will be zeroed out.

(Default: 1e-6)

class gpytorch.settings.use_toeplitz(state=True)[source]

Whether or not to use Toeplitz math with gridded data and grid inducing point modules. Pros: memory efficient, faster on CPU. Cons: slower on GPUs with < 10000 inducing points.

(Default: True)

class gpytorch.settings.verbose_linalg(state=True)[source]

Print out information whenever an expensive linear algebra routine (e.g. Cholesky, CG, Lanczos, CIQ, etc.) is run.

(Default: False)
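
A minimal sketch (reusing the toy model and mll from above): log every expensive linear-algebra routine triggered by a single loss evaluation.

    model.train()
    with gpytorch.settings.verbose_linalg(True):
        loss = -mll(model(train_x), train_y)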