In the mathematical theory of probability, a Doob martingale (named after Joseph L. Doob,[1] also known as a Levy martingale) is a stochastic process that approximates a given random variable and has the martingale property with respect to the given filtration. It may be thought of as the evolving sequence of best approximations to the random variable based on information accumulated up to a certain time. 
When analyzing sums, random walks, or other additive functions of independent random variables, one can often apply the central limit theorem, law of large numbers, Chernoff's inequality, Chebyshev's inequality or similar tools. When analyzing similar objects where the differences are not independent, the main tools are martingales and Azuma's inequality. 
  Definition
 Let  be any random variable with
 be any random variable with ![{\displaystyle \mathbb {E} [|Y|]<\infty }](./_assets_/4a58804990411da7658910c500286795687474ff.svg) . Suppose
. Suppose  is a filtration, i.e.
 is a filtration, i.e.  when
 when  . Define
. Define 
 ![{\displaystyle Z_{t}=\mathbb {E} [Y\mid {\mathcal {F}}_{t}],}](./_assets_/f74edc01fe1ddb6ff7e76c0c37ada4f444c3adbd.svg) 
then  is a martingale,[2] namely Doob martingale, with respect to filtration
 is a martingale,[2] namely Doob martingale, with respect to filtration  .
.  
To see this, note that 
 ![{\displaystyle \mathbb {E} [|Z_{t}|]=\mathbb {E} [|\mathbb {E} [Y\mid {\mathcal {F}}_{t}]|]\leq \mathbb {E} [\mathbb {E} [|Y|\mid {\mathcal {F}}_{t}]]=\mathbb {E} [|Y|]<\infty }](./_assets_/ac4e9b95baa128c4274c72d390c0326acf7cbb22.svg) ; ;
![{\displaystyle \mathbb {E} [Z_{t}\mid {\mathcal {F}}_{t-1}]=\mathbb {E} [\mathbb {E} [Y\mid {\mathcal {F}}_{t}]\mid {\mathcal {F}}_{t-1}]=\mathbb {E} [Y\mid {\mathcal {F}}_{t-1}]=Z_{t-1}}](./_assets_/bdcd9547b0344362e65e07be883f3ed7ddf85807.svg) as as . .
In particular, for any sequence of random variables  on probability space
 on probability space   and function
 and function  such that
 such that ![{\displaystyle \mathbb {E} [|f(X_{1},X_{2},\dots ,X_{n})|]<\infty }](./_assets_/5cc9276d89ff13f00efdb672d59521e5a9fc6e2d.svg) , one could choose
, one could choose  
  
and filtration  such that
 such that 
  
i.e.  -algebra generated by
-algebra generated by  . Then, by definition of Doob martingale, process
. Then, by definition of Doob martingale, process  where
 where 
 ![{\displaystyle {\begin{aligned}Z_{0}&:=\mathbb {E} [f(X_{1},X_{2},\dots ,X_{n})\mid {\mathcal {F}}_{0}]=\mathbb {E} [f(X_{1},X_{2},\dots ,X_{n})],\\Z_{t}&:=\mathbb {E} [f(X_{1},X_{2},\dots ,X_{n})\mid {\mathcal {F}}_{t}]=\mathbb {E} [f(X_{1},X_{2},\dots ,X_{n})\mid X_{1},X_{2},\dots ,X_{t}],\forall t\geq 1\end{aligned}}}](./_assets_/e26744a2522547f8affafbd91491b6bf0cd64c73.svg) 
forms a Doob martingale. Note that  . This martingale can be used to prove McDiarmid's inequality.
. This martingale can be used to prove McDiarmid's inequality. 
 McDiarmid's inequality
  The Doob martingale was introduced by Joseph L. Doob in 1940 to establish concentration inequalities such as McDiarmid's inequality, which applies to functions that satisfy a bounded differences property (defined below) when they are evaluated on random independent function arguments. 
A function  satisfies the bounded differences property if substituting the value of the
 satisfies the bounded differences property if substituting the value of the  th coordinate
th coordinate  changes the value of
 changes the value of  by at most
 by at most  . More formally, if there are constants
. More formally, if there are constants  such that for all
 such that for all ![{\displaystyle i\in [n]}](./_assets_/1b389a8f1ad8a43d2bcf5194acf34e934f806311.svg) , and all
, and all  ,
, 
  
 McDiarmid's Inequality[1]—Let  satisfy the bounded differences property with bounds
 satisfy the bounded differences property with bounds  .
. 
Consider independent random variables  where
 where  for all
 for all  . Then, for any
. Then, for any  ,
, 
 ![{\displaystyle {\text{P}}\left(f(X_{1},X_{2},\ldots ,X_{n})-\mathbb {E} [f(X_{1},X_{2},\ldots ,X_{n})]\geq \varepsilon \right)\leq \exp \left(-{\frac {2\varepsilon ^{2}}{\sum _{i=1}^{n}c_{i}^{2}}}\right),}](./_assets_/dcec32c891eaf7e6f8fb7c99d8749928ad1636e1.svg) 
![{\displaystyle {\text{P}}(f(X_{1},X_{2},\ldots ,X_{n})-\mathbb {E} [f(X_{1},X_{2},\ldots ,X_{n})]\leq -\varepsilon )\leq \exp \left(-{\frac {2\varepsilon ^{2}}{\sum _{i=1}^{n}c_{i}^{2}}}\right),}](./_assets_/7e8acf5ba6be8dd3cf27f282bc328b17bfd23a93.svg) 
and as an immediate consequence, 
 ![{\displaystyle {\text{P}}(|f(X_{1},X_{2},\ldots ,X_{n})-\mathbb {E} [f(X_{1},X_{2},\ldots ,X_{n})]|\geq \varepsilon )\leq 2\exp \left(-{\frac {2\varepsilon ^{2}}{\sum _{i=1}^{n}c_{i}^{2}}}\right).}](./_assets_/a60a6951869cfdf72d6bf5c6f1f2f2b38abef530.svg) 
  See also
  References