Add "Analyzing the Ito Differential Mapping"

victorliu5296 · Oct 16, 2024 · a9cb2ec · a9cb2ec
1 parent 5e0f122
commit a9cb2ec
Showing 1 changed file with 330 additions and 0 deletions.
diff --git a/source/content/posts/Analyzing the Ito Differential Mapping.md b/source/content/posts/Analyzing the Ito Differential Mapping.md
@@ -0,0 +1,330 @@
+---
+title: 'Itô''s lemma and the Itô Differential Mapping'
+date: 2024-10-15T22:08:14-04:00
+summary: "Deriving Itô's Lemma and analyzing the Itô 'd' operation as a formal linear mapping."
+math: katex
+categories:
+  - Stochastic Calculus
+tags:
+  - Stochastic Analysis
+  - Ito Calculus
+  - Itô Calculus
+  - Itô's Lemma
+  - Itô's Formula
+  - Itô Chain Rule
+  - Itô Differential Mapping
+weight: 100
+draft: false
+---
+
+# **Motivating the Itô Chain Rule: Brownian Motion and Quadratic Variation**
+
+In classical calculus, the **chain rule** tells us how to differentiate a function of another function. For example, if \( X(t) \) is a differentiable function of time, and we have another function \( f(X(t)) \), the chain rule states:
+\[
+\frac{d}{dt} f(X(t)) = f'(X(t)) \, X'(t).
+\]
+However, when dealing with **stochastic processes** such as **Brownian motion**, the behavior of these processes is fundamentally different from deterministic functions. This introduces the need for a new type of calculus—**Itô calculus**—with its own chain rule, known as **Itô’s Lemma**.
+
+---
+
+## **1. Brownian Motion and Its Unique Properties**
+
+A **Brownian motion** \( W_t \) (also called a Wiener process) is a random process that models continuous-time stochastic behavior. It has the following properties:
+
+1. **Initial Condition**: \( W_0 = 0 \).
+2. **Independent Increments**: For any \( 0 \leq s < t \), the increment \( W_t - W_s \) is independent of the past process \( \{ W_u : u \leq s \} \).
+3. **Gaussian Increments**: For any \( 0 \leq s < t \), the increment \( W_t - W_s \) is normally distributed with mean 0 and variance \( t - s \):
+   \[
+   W_t - W_s \sim N(0, t - s).
+   \]
+4. **Continuous but Non-Differentiable Paths**: Almost every sample path of \( W_t \) is continuous but nowhere differentiable. This makes it impossible to apply the usual tools of calculus.
+
+---
+
+### **1.1 Non-Trivial Quadratic Variation**
+
+One of the most critical differences between Brownian motion and deterministic functions is that **Brownian motion has non-trivial quadratic variation**. For a smooth deterministic function \( f(t) \), the squared increment \( (f(t + \Delta t) - f(t))^2 \) becomes vanishingly small as \( \Delta t \to 0 \). In contrast, for Brownian motion:
+\[
+(W_{t + \Delta t} - W_t)^2 \approx \Delta t \quad \text{(as } \Delta t \to 0).
+\]
+This means that the quadratic variation of Brownian motion over a small interval \( [t, t + \Delta t] \) is proportional to the length of the interval:
+\[
+\lim_{\Delta t \to 0} \sum_{i=1}^n (W_{t_i} - W_{t_{i-1}})^2 = T.
+\]
+Thus, **quadratic variation** for Brownian motion does not vanish as \( \Delta t \to 0 \), unlike for deterministic functions.
+
+---
+
+### **1.2 Heuristic Rule: \( dW_t^2 = dt \)**
+
+This non-trivial quadratic variation introduces a fundamental difference between Brownian motion and smooth functions. To develop intuition, we adopt the following **heuristic rule** in Itô calculus:
+\[
+dW_t^2 = dt.
+\]
+This means that the infinitesimal change in the square of the Brownian increment \( dW_t \) behaves like the deterministic time increment \( dt \). This result will play a central role in deriving the **Itô chain rule**.
+
+---
+
+### **1.3 The Need for a New Chain Rule**
+
+The standard chain rule assumes that higher-order terms (like \( (dX)^2 \)) are negligible. However, since \( dW_t^2 = dt \) is non-zero, the usual chain rule from deterministic calculus does not apply directly to functions involving stochastic processes. We need a new chain rule that accounts for the quadratic variation of Brownian motion. This leads us to the **Itô chain rule**, or **Itô’s Lemma**, which corrects the usual calculus to handle these stochastic effects.
+
+---
+
+## **2. Deriving the Single-Variable Itô Formula**
+
+We now derive the Itô chain rule for a function \( f(t, X_t) \), where \( X_t \) is a stochastic process that evolves as:
+\[
+dX_t = \mu(t, X_t) \, dt + \sigma(t, X_t) \, dW_t.
+\]
+This equation describes a process with both a deterministic drift term \( \mu(t, X_t) \) and a stochastic component \( \sigma(t, X_t) \) driven by Brownian motion \( W_t \).
+
+We want to determine the differential of \( f(t, X_t) \)—that is, how the function \( f(t, X_t) \) changes as both time and the process \( X_t \) evolve. 
+
+---
+
+### **2.1 Taylor Expansion with Quadratic Terms**
+
+We begin with the Taylor expansion of \( f(t, X_t) \) in two variables: \( t \) and \( X_t \). The first-order Taylor expansion is:
+\[
+df = \frac{\partial f}{\partial t} \, dt + \frac{\partial f}{\partial X} \, dX_t + \frac{1}{2} \frac{\partial^2 f}{\partial X^2} \, (dX_t)^2 + \text{higher-order terms}.
+\]
+In deterministic calculus, the higher-order terms (like \( (dX_t)^2 \)) are ignored. However, in stochastic calculus, we cannot ignore the term \( (dX_t)^2 \) because it contributes non-trivially to the result.
+
+---
+
+### **2.2 Computing \( (dX_t)^2 \)**
+
+From the SDE:
+\[
+dX_t = \mu(t, X_t) \, dt + \sigma(t, X_t) \, dW_t,
+\]
+we can compute the square of the differential:
+\[
+(dX_t)^2 = \left( \mu(t, X_t) \, dt + \sigma(t, X_t) \, dW_t \right)^2.
+\]
+Expanding this product:
+\[
+(dX_t)^2 = \mu^2(t, X_t) \, (dt)^2 + 2 \mu(t, X_t) \sigma(t, X_t) \, dt \, dW_t + \sigma^2(t, X_t) \, (dW_t)^2.
+\]
+Using the heuristic rules \( (dt)^2 = 0 \), \( dW_t \, dt = 0 \), and \( (dW_t)^2 = dt \), we simplify:
+\[
+(dX_t)^2 = \sigma^2(t, X_t) \, dt.
+\]
+
+---
+
+### **2.3 Substituting Back into the Taylor Expansion**
+
+Now that we know \( (dX_t)^2 = \sigma^2(t, X_t) \, dt \), we substitute this into the Taylor expansion:
+\[
+df = \frac{\partial f}{\partial t} \, dt + \frac{\partial f}{\partial X} \, dX_t + \frac{1}{2} \frac{\partial^2 f}{\partial X^2} \, \sigma^2(t, X_t) \, dt.
+\]
+Rearranging the terms, we obtain the **Itô chain rule** (Itô’s Lemma) for a single-variable function \( f(t, X_t) \):
+\[
+df = \left( \frac{\partial f}{\partial t} + \mu(t, X_t) \frac{\partial f}{\partial X} + \frac{1}{2} \sigma^2(t, X_t) \frac{\partial^2 f}{\partial X^2} \right) \, dt + \sigma(t, X_t) \frac{\partial f}{\partial X} \, dW_t.
+\]
+
+---
+
+## **3. Summary of the Single-Variable Itô Formula**
+
+The single-variable Itô formula (Itô’s Lemma) shows how to compute the differential of a function \( f(t, X_t) \) when the underlying process \( X_t \) evolves according to a stochastic differential equation. The key takeaway is that the quadratic variation of Brownian motion leads to an additional second-order term involving the second derivative of the function \( f \). This correction is essential for accurately modeling the behavior of stochastic systems.
+
+The final result is:
+\[
+df = \left( \frac{\partial f}{\partial t} + \mu(t, X_t) \frac{\partial f}{\partial X} + \frac{1}{2} \sigma^2(t, X_t) \frac{\partial^2 f}{\partial X^2} \right) \, dt + \sigma(t, X_t) \frac{\partial f}{\partial X} \, dW_t.
+\]
+
+Let’s now extend the **Itô Lemma** to the **multivariate case**, where the underlying stochastic process involves several interrelated variables and multiple sources of randomness. We’ll first derive the **multivariate Itô formula** in **summation notation** and then convert it into a more compact **matrix form**. Finally, we’ll explore the deep connection between the second-order correction terms in Itô calculus and concepts from **linear algebra** and **multivariate calculus**.
+
+---
+
+# **Multivariate Itô Lemma**
+
+## **1. Multivariate Itô Formula in Summation Notation**
+
+Suppose we have a function \( f(t, \mathbf{X}(t)) \) that depends on time \( t \) and an \( n \)-dimensional vector-valued process \( \mathbf{X}(t) = (X_1(t), \dots, X_n(t))^\top \). Each component \( X_i(t) \) evolves according to its own stochastic differential equation:
+\[
+dX_i(t) = \mu_i(t, \mathbf{X}(t)) \, dt + \sum_{j=1}^m \sigma_{ij}(t, \mathbf{X}(t)) \, dW_j(t),
+\]
+where \( W_j(t) \) are independent **Brownian motions**. Here:
+- \( \mu_i(t, \mathbf{X}(t)) \) is the **drift** of the \( i \)-th process.
+- \( \sigma_{ij}(t, \mathbf{X}(t)) \) is the **diffusion coefficient** describing how the \( j \)-th Brownian motion affects the \( i \)-th component of the process.
+
+We now seek the differential \( df(t, \mathbf{X}(t)) \) using the **Itô chain rule**.
+
+---
+
+### **1.1 Taylor Expansion in the Multivariate Setting**
+
+The multivariate Taylor expansion of \( f(t, \mathbf{X}(t)) \) gives:
+\[
+df = \frac{\partial f}{\partial t} \, dt + \sum_{i=1}^n \frac{\partial f}{\partial X_i} \, dX_i + \frac{1}{2} \sum_{i=1}^n \sum_{k=1}^n \frac{\partial^2 f}{\partial X_i \partial X_k} \, dX_i \, dX_k.
+\]
+As in the single-variable case, the second-order terms \( dX_i \, dX_k \) are non-negligible because of the quadratic variation of Brownian motion.
+
+---
+
+### **1.2 Computing the Cross-Terms \( dX_i \, dX_k \)**
+
+From the SDE \( dX_i(t) = \mu_i \, dt + \sum_{j=1}^m \sigma_{ij} \, dW_j(t) \), the cross-product of two differentials is:
+\[
+dX_i \, dX_k = \left( \mu_i \, dt + \sum_{j=1}^m \sigma_{ij} \, dW_j \right) \left( \mu_k \, dt + \sum_{l=1}^m \sigma_{kl} \, dW_l \right).
+\]
+Expanding the product and using the rules \( (dt)^2 = 0 \), \( dW_j \, dt = 0 \), and \( dW_j \, dW_l = \delta_{jl} \, dt \) (where \( \delta_{jl} \) is the Kronecker delta), we obtain:
+\[
+dX_i \, dX_k = \sum_{j=1}^m \sigma_{ij} \sigma_{kj} \, dt.
+\]
+Thus, the quadratic variation term between two processes \( X_i(t) \) and \( X_k(t) \) is given by:
+\[
+\operatorname{Cov}(dX_i, dX_k) = \sum_{j=1}^m \sigma_{ij} \sigma_{kj}.
+\]
+
+---
+
+### **1.3 The Multivariate Itô Formula in Summation Notation**
+
+Now, substituting the quadratic terms \( dX_i \, dX_k \) into the Taylor expansion, the full differential \( df(t, \mathbf{X}(t)) \) becomes:
+\[
+df = \frac{\partial f}{\partial t} \, dt + \sum_{i=1}^n \frac{\partial f}{\partial X_i} \, dX_i + \frac{1}{2} \sum_{i=1}^n \sum_{k=1}^n \frac{\partial^2 f}{\partial X_i \partial X_k} \sum_{j=1}^m \sigma_{ij} \sigma_{kj} \, dt.
+\]
+This is the **multivariate Itô formula** in summation notation.
+
+---
+
+## **2. Multivariate Itô Formula in Matrix Form**
+
+We now express the Itô formula in a more compact **matrix notation**. Let:
+- \( \mathbf{X}(t) \in \mathbb{R}^n \) be the vector of stochastic processes.
+- \( \boldsymbol{\mu}(t, \mathbf{X}(t)) \in \mathbb{R}^n \) be the drift vector.
+- \( \boldsymbol{\sigma}(t, \mathbf{X}(t)) \in \mathbb{R}^{n \times m} \) be the diffusion matrix.
+- \( \mathbf{W}(t) \in \mathbb{R}^m \) be the vector of independent Brownian motions.
+
+The SDE for \( \mathbf{X}(t) \) in matrix form is:
+\[
+d\mathbf{X}(t) = \boldsymbol{\mu}(t, \mathbf{X}(t)) \, dt + \boldsymbol{\sigma}(t, \mathbf{X}(t)) \, d\mathbf{W}(t).
+\]
+
+The **Itô differential** of a function \( f(t, \mathbf{X}(t)) \) is:
+\[
+df = \left( \frac{\partial f}{\partial t} + \nabla_\mathbf{X} f \cdot \boldsymbol{\mu} + \frac{1}{2} \operatorname{Tr} \left( \boldsymbol{\sigma}^\top \nabla^2_\mathbf{X} f \, \boldsymbol{\sigma} \right) \right) \, dt + \nabla_\mathbf{X} f \cdot \boldsymbol{\sigma} \, d\mathbf{W}(t),
+\]
+where:
+- \( \nabla_\mathbf{X} f = \left( \frac{\partial f}{\partial X_1}, \dots, \frac{\partial f}{\partial X_n} \right)^\top \) is the gradient of \( f \).
+- \( \nabla^2_\mathbf{X} f \) is the **Hessian matrix** of second-order partial derivatives of \( f \).
+- \( \operatorname{Tr}(\cdot) \) denotes the **trace** of a matrix.
+
+This matrix form is not only compact but also makes the quadratic structure of the second-order correction explicit through the product \( \boldsymbol{\sigma}^\top \nabla^2_\mathbf{X} f \, \boldsymbol{\sigma} \).
+
+---
+
+## **3. Connection to Multivariate Calculus and Linear Algebra**
+
+In deterministic multivariate calculus, the **Taylor expansion** for a function \( f(t, \mathbf{X}) \) involves both linear and quadratic terms:
+\[
+df = \frac{\partial f}{\partial t} \, dt + \nabla_\mathbf{X} f \cdot d\mathbf{X} + \frac{1}{2} \, d\mathbf{X}^\top \nabla^2_\mathbf{X} f \, d\mathbf{X}.
+\]
+The term \( d\mathbf{X}^\top \nabla^2_\mathbf{X} f \, d\mathbf{X} \) is a **quadratic form** involving the Hessian matrix. 
+
+In **Itô calculus**, the second-order correction plays a similar role, but with the important difference that the quadratic variation of Brownian motion forces us to keep the second-order terms:
+\[
+dW_t^2 = dt.
+\]
+Thus, the quadratic correction in Itô’s Lemma corresponds to a **quadratic form in linear algebra**, where the Hessian of the function interacts with the diffusion coefficients to influence the dynamics of the process.
+
+---
+
+## **Itô’s Lemma as a Linear Mapping: A Formal Perspective**
+
+In this final section, we reframe the **Itô differential transformation** as a type of **linear mapping** between function spaces, integrating (pun intended) insights from calculus, linear algebra, and stochastic processes. This perspective will help us understand how the Itô transformation operates and what spaces it maps between.
+
+---
+
+## **1. Function Spaces Involved in the Itô Transformation**
+
+To formalize the Itô transformation, we need to identify the spaces it maps between:
+
+1. **Input Space**:  
+   The function \( f(t, \mathbf{X}(t)) \) belongs to the space \( C^{1,2}([0, T] \times \mathbb{R}^n) \).  
+   - \( C^{1,2}([0, T] \times \mathbb{R}^n) \) is the space of functions that are:
+     - **Continuously differentiable** in time \( t \).
+     - **Twice continuously differentiable** in the spatial variables \( \mathbf{X}(t) \).
+
+2. **Output Space**:  
+   The output is a **stochastic differential equation (SDE)**, which involves both:
+   - **Deterministic terms** in \( L^2([0, T], dt) \), the space of square-integrable functions over time.
+   - **Stochastic terms** in \( L^2(\Omega \times [0, T], \mathcal{F}, \mathbb{P}) \), the space of square-integrable stochastic processes, where:
+     - \( \Omega \) is the sample space of possible outcomes.
+     - \( \mathcal{F} \) is a filtration representing the information up to time \( t \).
+     - \( \mathbb{P} \) is the probability measure.
+
+Thus, the Itô differential transformation maps:
+\[
+\boxed{C^{1,2}([0, T] \times \mathbb{R}^n) \quad \to \quad L^2(\Omega \times [0, T], \mathcal{F}, \mathbb{P})}.
+\]
+
+---
+
+## **2. Defining the Itô Transformation as a Linear Mapping**
+
+In deterministic calculus, the **differential mapping** acts linearly on smooth functions by mapping them to their derivatives. Similarly, the **Itô transformation** acts on smooth functions \( f(t, \mathbf{X}(t)) \), but its output is a differential equation with both deterministic and stochastic components.
+
+We can represent the Itô transformation \( \mathcal{I} \) as:
+\[
+\mathcal{I}(f)(t, \mathbf{X}(t)) = \underbrace{A(f)(t, \mathbf{X}(t)) \, dt}_{\text{Deterministic part}} + \underbrace{B(f)(t, \mathbf{X}(t)) \, d\mathbf{W}(t)}_{\text{Stochastic part}},
+\]
+where:
+1. \( A(f) \) represents the deterministic part of the differential:
+   \[
+   A(f)(t, \mathbf{X}(t)) = \frac{\partial f}{\partial t} + \nabla_\mathbf{X} f \cdot \boldsymbol{\mu}(t, \mathbf{X}(t)) + \frac{1}{2} \operatorname{Tr}\left( \boldsymbol{\sigma}^\top \nabla^2_\mathbf{X} f \, \boldsymbol{\sigma} \right).
+   \]
+
+2. \( B(f) \) represents the stochastic part:
+   \[
+   B(f)(t, \mathbf{X}(t)) = \nabla_\mathbf{X} f \cdot \boldsymbol{\sigma}(t, \mathbf{X}(t)).
+   \]
+
+The output of the Itô transformation is thus a **linear combination** of the deterministic and stochastic components.
+
+---
+
+## **3. Linearity of the Itô Transformation**
+
+The Itô transformation is **linear** with respect to the function \( f(t, \mathbf{X}(t)) \). That is, for two functions \( f \) and \( g \) in \( C^{1,2}([0, T] \times \mathbb{R}^n) \), and for constants \( a, b \in \mathbb{R} \), the Itô transformation satisfies:
+\[
+\mathcal{I}(a f + b g) = a \, \mathcal{I}(f) + b \, \mathcal{I}(g).
+\]
+This linearity holds because both the deterministic and stochastic components are linear in the partial derivatives of \( f \).
+
+---
+
+## **4. The Role of the Hessian: A Quadratic Form**
+
+The second-order correction term in Itô’s Lemma can be viewed as a **quadratic form** involving the **Hessian matrix** \( \nabla^2_\mathbf{X} f \). Specifically:
+\[
+\frac{1}{2} \operatorname{Tr}\left( \boldsymbol{\sigma}^\top \nabla^2_\mathbf{X} f \, \boldsymbol{\sigma} \right).
+\]
+This term reflects how the curvature of the function \( f \) (encoded by the Hessian matrix) interacts with the **diffusion coefficients** \( \boldsymbol{\sigma}(t, \mathbf{X}(t)) \). In linear algebra, a quadratic form is given by:
+\[
+Q(\mathbf{z}) = \mathbf{z}^\top \mathbf{A} \mathbf{z},
+\]
+where \( \mathbf{A} \) is a symmetric matrix. In the context of Itô calculus, the quadratic form captures how the random fluctuations (encoded in \( \boldsymbol{\sigma} \)) affect the second-order behavior of the function \( f \).
+
+---
+
+## **5. Summary: Itô Transformation as a Linear Mapping Between Spaces**
+
+The **Itô transformation** can be viewed as a linear mapping:
+\[
+\mathcal{I}: C^{1,2}([0, T] \times \mathbb{R}^n) \to L^2(\Omega \times [0, T], \mathcal{F}, \mathbb{P}),
+\]
+which transforms a smooth function \( f(t, \mathbf{X}(t)) \) into a stochastic differential equation involving both deterministic and stochastic components. The transformation’s linearity ensures that the Itô calculus framework behaves predictably under linear combinations of functions, much like classical calculus.
+
+However, unlike classical differential operators, the Itô transformation produces **stochastic differentials** that belong to a different type of space—one that incorporates randomness through **Brownian motion**. The quadratic form involving the Hessian matrix reflects the non-trivial second-order behavior introduced by the stochastic terms.
+
+---
+
+## **6. Conclusion**
+
+The **Itô transformation** serves as a bridge between deterministic calculus and stochastic processes. While it shares some similarities with classical differential operators, it is more accurately described as a **linear mapping** between the space of smooth functions \( C^{1,2}([0, T] \times \mathbb{R}^n) \) and the space of stochastic differentials \( L^2(\Omega \times [0, T], \mathcal{F}, \mathbb{P}) \). This transformation accounts for both linear and quadratic behavior, reflecting the dual impact of deterministic drift and stochastic noise on a system's evolution.