CONTENTS
Calculus
dy
dx
Derivative of y with respect to x.
∂y
∂x
Partial derivative of y with respect to x
∇
x
y Gradient of y with respect to x
∇
X
y Matrix derivatives of y with respect to X
∇
X
y
Tensor containing derivatives of
y
with respect to
X
∂f
∂x
Jacobian matrix J ∈ R
m×n
of f : R
n
→ R
m
∇
2
x
f(x) or H(f)(x) The Hessian matrix of f at input point x
Z
f(x)dx Definite integral over the entire domain of x
Z
S
f(x)dx Definite integral with respect to x over the set S
Probability and Information Theory
a⊥b The random variables a and b are independent
a⊥b | c They are are conditionally independent given c
P (a)
A probability distribution over a discrete variable
p(a)
A probability distribution over a continuous vari-
able, or over a variable whose type has not been
specified
a ∼ P Random variable a has distribution P
E
x∼P
[f(x)] or Ef(x) Expectation of f(x) with respect to P (x)
Var(f(x)) Variance of f(x) under P (x)
Cov(f(x), g(x)) Covariance of f(x) and g(x) under P (x)
H(x) Shannon entropy of the random variable x
D
KL
(P kQ) Kullback-Leibler divergence of P and Q
N(x; µ, Σ)
Gaussian distribution over
x
with mean
µ
and
covariance Σ
xiii