Research Notebook

Notes: Vazquez (2011)

October 26, 2011 by Alex

1. Introduction

In this note, I outline the main results in Scale Invariance, Bounded Rationality and Non-Equilibrium Economics (WP, 2009) by Sam Vazquez for use in a 5min presentation in Prof. Sargent‘s reading group.

This paper presents an agent based model (i.e., there are a finite number of agents) which makes 2 basic assumptions:

  1. Agents have a scale invariant utility function–i.e., consuming 1 gallon of milk is as satisfying as consuming 3.8 litres of milk.
  2. Agents are boundedly rational and have beliefs about the exchange rates between a limited number of assets.

From these 2 assumption, the author defines an ensemble of economic models using a variety of tools from physics (in particular, symmetry analysis, coarse graining and operator methods). This paper presents a new set of tools to address well known economic problems and illustrates the mathematical symmetry across the class of economic operators commonly employed by economic analysts.

2. An Analogy

To motivate this analysis for people with an economics background, I begin this note with an illustrative analogy. Physicists are used to modeling complex phenomena, e.g., think about the task of computing the pressure a gas such as oxygen exerts against the walls of a box such as a classroom. There are far too many individual oxygen atoms to count and keep track of at a micro level; however, scientists can compute the macro level properties of the gas such as the average pressure exerted on its chamber walls by subscribing to 2 main modeling tricks.

First, they look for a key symmetry of the problem. This is a bit more of an art than a science. In the thermodynamics example above, this symmetric would be the fact that a particular atom of oxygen ought to behave in the exact same way regardless of where it is in the room. For instance, there is no such thing as a “near-the-floor” or “by-the-window” oxygen atom. This symmetry puts restrictions on the functional form of the equations we can use to model the movements and interactions of oxygen atoms and reduces the number of potential state variables.

Second, they look for an appropriate reference neighborhood within which to study the movement of oxygen atoms. For example, we know that the behavior of oxygen atoms in 1 corner of the room are going to have a negligible impact on the behavior the oxygen atoms at the far corner of the room. Thus, we can study the local behavior of the atoms taking the boundaries of its neighborhood as given, and then integrate up across all neighborhoods. This procedure is called coarse graining and will affect the scope of the approximations rather than the functional form of the equations.

Now, let’s consider how to apply these principles to a financial model and follow the lead of Vazquez (2009). First, consider the problem of finding a symmetry. Vazquez chooses a scale symmetry whereby the unit of measure should not affect the utility of an agent. This assumption will put a functional form restriction on the space of viable utility functions. Next, consider the problem of coarse graining. In standard economic models, agents see and trade all assets. By analogy, this would be equivalent to directly linking all oxygen atoms in a room regardless of their distance. To break these connections and allow for coarse graining, Vazquez assumes that agents only carry around an information set containing a subset of all asset pairs which he calls a “what-by-what” matrix. Thus, these 2 key assumptions allow Vazquez to use the statistical mechanics of fields to characterize the behavior of economic agents.

3. Economic Model

In this section, I tackle the basic modeling framework. Time is discrete and moves in integer steps.

3.1 Assets

There are \bar{a} different kinds of assets labeled by a = 1, 2, \ldots, \bar{a} with \mathcal{A} denoting the set of all assets. Agents get (possibly time dependent) utility from holding different combinations of the \bar{a} assets; however, this utility generating mechanism is left unspecified.

3.2 Agents

There are a finite number of \bar{n} agents indexed by n= 1, 2, \ldots, \bar{n} with the set of all agents denoted by \mathcal{N}. Every agent has an inventory of products in the quantity x_{n,a} for asset a with X_n denoting the vector of holdings for agent n. The state space of the economy at any point in time is given by the matrix \mathbf{X} = \left\{ X_n \mid n \in \mathcal{N} \right\}. Let \mathcal{X} denote the set of all possible states with \mathcal{X} = \mathbb{R}_+^{\bar{a} \times \bar{n}}.

Let \zeta_a \in \mathbb{R}_+ denote an asset specific positive scalar constant. Then, by scale invariance I mean that the economy should be unchanged if for every agent n \in \mathcal{N}, we multiply the agent’s holdings of asset a by \zeta_a:

(1)   \begin{align*} x_{n,a} \mapsto \zeta_a \cdot x_{n,a} \end{align*}

This is tantamount to saying that, if we counted all lengths in centimeters instead of meters so that \zeta_a = 100, no actual real outcomes should be changed. This restriction will imply that all essential functions will be homogeneous of degree 1.

3.3 Information

Agents have (possibly different) beliefs about how future of the economy will play out; i.e., about how \mathcal{X} will evolve. Let \mathcal{I}_n denote the beliefs of agent n. Each agent’s information may be biased, narrow or wrong. At each point in time, agents have in their mind an exchange rate matrix \mathbf{M}_n \in \mathcal{I}_n with entries denoted by m_{n,a:b} which denotes the number of units of good a that agent n would accept in exchange for a unit of good b. Let \mathcal{A}_n denote the set of assets for which agent n has an entry in her \mathbf{M}_n matrix. Thus, \mathcal{I}_n is a \sigma-algebra over matrices.

Each agent’s \mathbf{M}_n matrix has the following 3 properties:

  1. Reciprocality: Each agent is willing to buy and sell at the same price. i.e., there is no bid ask spread.

    (2)   \begin{align*} m_{n,a:b} &= \frac{1}{m_{n,b:a}} \end{align*}

  2. Transitivity: There are no profit generating trade combinations. i.e., there are no arbitrage opportunities.

    (3)   \begin{align*} m_{n,a:b} &= m_{n,c:b} \cdot m_{n,a:c} \end{align*}

  3. Scale Symmetry: Adjusting prices by the ratio \zeta_a/\zeta_b of scalar constants used to renormalize the asset units leaves the equilibrium allocations unchanged.

    (4)   \begin{align*} m_{n,a:b} &\mapsto \left( \frac{\zeta_a}{\zeta_b} \right) \cdot m_{n,a:b} \end{align*}

The \mathbf{M}_n matrices capture the idea that each agent’s field of vision or attention is bounded.

3.4 Preferences

Given his information set \mathcal{I}_n, each agent behaves rationally in accordance with von Neumann and Morgenstern axioms of decision theory1 yielding an index of satisfaction V_n as defined below:

(5)   \begin{align*} V_n &: \mathcal{X}_n \times \mathcal{I}_n \mapsto \mathbb{R} \end{align*}

where V_n = \mathbb{E} \left[ U_n \mid \mathcal{I}_n \right] with U_n as agent n‘s utility function. V_n has the properties that for each agent n \in \mathcal{N}, the partial derivatives are \partial_a V_n > 0 and \partial_a^2 V_n < 0 for each asset a \in \mathcal{A} where \partial_a \equiv \partial / \partial x_{n,a} for brevity. Also, suppose that between period t and t+1, an agent n changes his asset holdings from X_n to X_n', then I will abbreviate the corresponding change in happiness as:

(6)   \begin{align*} \Delta V_n &= V_n \left( X_n' \right) - V_n \left( X_n \right) \end{align*}

The fact that the economy must be scale invariant implies additional restrictions on the utility function of each agent. In particular, it must be the case that the utility function can vary at most by a constant due to a change in scale. i.e., we have that the utility function must be CRRA/log-like in nature such as:

(7)   \begin{align*} U_n \left( X_n \right) &= \phi \cdot \ln \left[ \Psi_n^{\top} X_n \right] \end{align*}

where \Psi_n is a \bar{a} \times 1 vector of free parameters. For instance, consider the utility specification below which equates agent n‘s utility with the value of his asset holdings in terms of a numeraire good denoted by b=1:

(8)   \begin{align*} U_n &= \ln \left[ \sum_{a \in \mathcal{A}} \left( x_{n,a} \cdot m_{n,a:1} \right)^{\phi} \right] \end{align*}

Here, note that scaling up the numeraire on 1 of the assets will have no affect on the first order condition. However, a key assumption in this class of models is that utility is myopic and depends only on the current period’s asset holding which fits nicely with Vazquez’s log utility assumption.

3.5 Economic Operators

Economic movements are classified as operators which are mappings which change agents’ portfolio holdings. For instance, if \mathbb{H} is an arbitrary economic operator, then we have that:

(9)   \begin{align*} \mathbb{H} &: \mathcal{X} \mapsto \mathcal{X} \end{align*}

For example, think about the following examples:

  1. A consumption operator \mathbb{C} which removes asset holdings but increases utility,
  2. A production operator \mathbb{Y} which recombines asset holdings at a net surplus,
  3. A trader operator \mathbb{T} which exchanges asset holdings between agents, or…
  4. A depreciation operator \mathbb{D} which removes asset holdings but does not compensate agents with a utility boost.

Using this general framework, we can talk then about similarities and differences across economic operators. For instance, the trade operator \mathbb{T} preserves the aggregate asset proportions as it simply transfers assets between 2 agents. Thus, we can think about trade as a conservative operator. Alternatively, the consumption, production and depreciation operators are not conservative with respect to the aggregate asset proportions.

In the analysis below, I overload the \Delta terminology used to denote changes in satisfaction due to changes in holdings to be operator specific. In particular, for an arbitrary economic operator \mathbb{H} I define \Delta_{\mathbb{H}} as:

(10)   \begin{align*} \Delta_{\mathbb{H}} V_n &= \mathbb{E} \left[ U \left( \mathbb{H} X_n \right)  - U \left( X_n \right) \mid \mathcal{I}_{\alpha} \right] \end{align*}

4. Examples

In Section 2 above, I defined the basic elements of a class of economic models and then stopped just short of giving an equilibrium definition. In this section, I now look at 2 examples of equilibria from this class of models. Each of these examples will essentially represent a different take on what the equilibrium price object will look like. I do not consider any models with production, consumption or depreciation decisions.

4.1 Fixed Exchange Rate

First consider a world with a single exchange rate for each asset wich is set by fiat. In this world, we have an equilibrium definition:

Definition (Equilibrium): An equilibrium is a \bar{n} \times \bar{a} matrix of allocations \mathbf{X} as well as a \bar{a} \times \bar{a} symmetric matrix of exchange rates \mathbf{M} with a unit diagonal representing \bar{a} \cdot (\bar{a} - 1)/2 unique elements such that given the exchange rate matrix \mathbf{M}, we have that:

  1. For each agent n \in \mathcal{N}, the allocation X_n satisfies:

    (11)   \begin{align*} 0 &= \frac{\partial \left( \Delta_{\mathbb{T}} V_n \right)}{\partial \left( \Delta x_{n,a}^* \right)} \end{align*}

  2. Markets clear such that for each a \in \mathcal{A}:

    (12)   \begin{align*} 0 &= \sum_{n \in \mathcal{N}} \Delta x_{n,a} \end{align*}

In such an economy, we can characterize the equilibrium allocations as follows:

Proposition (Equilibrium w/ Single Exchange Rate): Near the equilibrium, the amount of assets a and b that agent n would trade given the agent independent exchange rate m_{a:b} is given by:

(13)   \begin{align*} \Delta x_{n,a} &\approx \kappa_{n,a:b} \cdot \left\{ \partial_a V_n - m_{a:b} \cdot \partial_b V_n \right\}^2 \\ \Delta x_{n,b} &\approx - m_{a:b} \cdot \Delta x_{n,a} \end{align*}

where \kappa_{n,a:b} is a agent dependent positive constant.

This result follows directly from a first order Taylor expansion of the utility gain to trading around the fixed point of \Delta x_{n,a} = 0:

Proof: Suppose that the trade operator \mathbb{T} changes agent n‘s asset holdings in asset a by depositing \Delta x_{n,a}. Then, an equilibrium would represent a fixed point such that the following 2 properties hold:

(14)   \begin{align*} \frac{\partial \left( \Delta_{\mathbb{T}} V_n \right)}{\partial \left( \Delta x_{n,a}^* \right)} &= \left\{ \partial_a V_n - m_{a:b} \cdot \partial_b V_n  \right\} = 0 \\ \frac{\partial^2 \left( \Delta_{\mathbb{T}} V_n \right)}{\left\{\partial \left( \Delta x_{n,a}^* \right) \right\}^2} &< 0 \end{align*}

The first order condition says that agent n no longer wants to trade and the second order condition says that we are at a local optimum. A Taylor expansion of \Delta_{\mathbb{T}} V_n around its true fixed point yields first and second order terms:

(15)   \begin{align*} \Delta_{\mathbb{T}} V_n &\approx \Delta x_{n,a} \cdot \left\{ \partial_a V_n - m_{a:b} \cdot \partial_b V_n  \right\} \Big\vert_{\Delta x_{n,a} = 0} \\ &\qquad \qquad + \frac{\left(\Delta x_{n,a}\right)^2}{2} \cdot \left\{ \partial_a^2 V_n + m_{a:b}^2 \cdot \partial_b^2 V_n - 2 \cdot m_{a:b} \cdot \partial_a \partial_b V_n \right\}\Big\vert_{\Delta x_{n,a} = 0} + \ldots \end{align*}

Taking the first order condition with respect to asset a and solving for \Delta x_{n,a} and \Delta x_{n,b} in its mirror image yields the equations above.

4.2 Barter

Next consider the case of pairwise trading between agents which I refer to as bartering. In this world, I assume that the bargaining process is exogenously specified and the agents take the split of the gains to trade as given. What’s more, because agents are perfectly myopic in their utility specifications, they have no concern for the matching process which assigns traders to new partners each period.

Definition (Equilibrium): An equilibrium is a \bar{n} \times \bar{a} matrix of allocations \mathbf{X} as well as a set of no more than \bar{n}! pairwise exchange rates m_{n:n',a:b} such that given each exchange rate, we have that:

  1. For each agent n \in \mathcal{N}, the allocation X_n satisfies:

    (16)   \begin{align*} 0 &= \frac{\partial \left( \Delta_{\hat{\mathbb{T}}} V_n \right)}{\partial \left( \Delta x_{n,a}^* \right)} \end{align*}

  2. Each pairwise market clears:

    (17)   \begin{align*} 0 &= \Delta x_{n,a} + \Delta x_{n',a} \end{align*}

In such a world, we have the following perturbation equilibrium reult:

Proposition (Equilibrium w/ Barter): Near the equilibrium, the amount of product a and b that 2 agents n and n' exchange as a result of a bargaining process \hat{\mathbb{T}} is given by:

(18)   \begin{align*} \Delta x_{n,a} &\approx = \hat{\kappa}_{n,a:b} \cdot \left\{ \partial_a \left( V_n - V_{n'} \right) \cdot \partial_b \left( V_n + V_{n'} \right) -  \partial_a \left( V_n + V_{n'} \right) \cdot \partial_b \left( V_n - V_{n'} \right) \right\} \\ \Delta x_{n,b} &\approx - \left( \frac{\partial_a \left( V_n + V_{n'} \right)}{\partial_b \left( V_n + V_{n'} \right)} \right) \cdot \Delta x_{n,a} \end{align*}

This result builds on the equilibrium characterization from above:

Proof: Since the agents are still (somewhat unrealistically) price takers in this world, we can use the equilibrium formulae from the proposition above; however, now we have a additional information which we can use to further restrict the demand functionals. In particular, we know that every pairwise trade nets out to 0:

(19)   \begin{align*} 0 &= \Delta x_{n,a} + \Delta x_{n',a} \end{align*}

As a result, we can solve for the fixed exchange rate by treating agents n and n'‘s equilibrium demand functionals as a system of 2 equations with 1 unknown:

(20)   \begin{align*} m_{n:n',a:b} &= \frac{\kappa_{n,a:b} \cdot \partial_a V_n + \kappa_{n',a:b} \cdot \partial_a V_{n'}}{\kappa_{n,a:b} \cdot \partial_b V_n + \kappa_{n',a:b} \cdot \partial_b V_{n'}} \end{align*}

Substituting this formula back into the equation for each agent’s demand function given an exogenous exchange rate yields the equilibrium result.

  1. See von Neumann and Morgenstein (1944). ↩

Filed Under: Uncategorized

Notes: Carr and Wu (2009)

October 19, 2011 by Alex

1. Introduction

In this note, I outline the main results in Variance Risk Premiums (RFS, 2009) by Peter Carr and Liuren Wu for use in a 5min presentation in Prof. Sargent‘s reading group.

Carr and Wu (2009) develops a method for quantifying the risk premium that investors demand for holding an asset that has a higher return variance. This method is based on the concept of a variance swap—a zero cost, derivative contract between 2 traders, call them Alice and Bob, in which Alice pays Bob a fixed swap rate over the life of the contract and Bob pays Alice the realized volatility of the underlying asset. In this example, Alice is buying return variance from Bob. While variance swaps are not commonly traded, their payouts can be synthesized via a linear combination of call option contracts at different strike prices. Using this basic derivative pricing framework, the Carr and Wu (2009) then use data on 35 stocks and 5 indicies to study the behavior of the variance risk premium during the period from 1996 to 2003.

I chose to review this article because this synthetic derivative methodology is an interesting tool for thinking about asset pricing inequalities. For example, this same approach should allow you to synthesize Sharpe Ratio or Entropic1 swap contracts via options prices to the study risk-return boundaries predicted by different theories.

2. Variance Swaps

I begin by defining a variance swap in more detail and showing how to replicate its state contingent menu of payouts using traded options with different strike prices. Just like a more standard interest rate swap, a variance swap is defined by 4 objects:

  1. Underlying
  2. Swap rate
  3. Maturity
  4. Notional Amount

The underlying is the asset whose variance is being tracked, the swap rate is the fixed rate which is paid by the buyer of the variance to the seller each period and the maturity is the number of periods over which this transaction occurs. For instance, suppose that Alice and Bob enter into a \$1\mathtt{M}, 31 day, variance swap contract on the S&P 500 at a rate of 15\%. Then, on day 1, Alice would pay Bob \$1,000,000 \times 0.15 = \$150,000 with certainty and Bob would pay Alice \$1,000,000 \times V_1 where V_1 represents the realized variance of the S&P 500 on the first day of the contract. So, if the the first day happened to have a high annualized variance of 20\%, then Bob would have to pay Alice \$200,000. Over the life of the contract, Bob will pay Alice a net amount X_{t \to T} given by:

(1)   \begin{align*} X_{t \to T} &= \left( V_{t \to T} - S_{t \to T} \right) \cdot L \end{align*}

where L represents the notional amount of the contract which was \$1\mathtt{M} in the example above.

Since the contract has \$0 cost at the onset, we can characterize the swap rate at the expected present value of the variance payments adjusted for risk by the prevailing stochastic discount factor:

(2)   \begin{align*} S_{t \to T} &= \mathbb{E} \left[ M_{t \to T} \cdot V_{t \to T} \right] \end{align*}

Since variance swaps are not traded in a liquid market with public prices, I now show how to synthesize this contract using a riskless bond and call options on futures contracts. Suppose that the price of a futures contract follows a geometric Brownian motion process:

(3)   \begin{align*} dF_t &= \mu_t \cdot F_t \cdot dt + \sigma_t \cdot F_t \cdot dW_t \end{align*}

Suppose that the \sigma_t process is restricted to ensure that the futures price is always positive and is predictable2 with respect to the filtration \mathcal{F}_t. Suppose that the riskless rate process is deterministic so that the futures and forward prices are identical. Thus, we have that the annualized quadratic variation on the log futures return from time t to time T is given by:

(4)   \begin{align*} V_{t \to T} &= \frac{\int_t^T \sigma_s^2 \cdot ds}{T-t} \end{align*}

Using this futures return formulation, I can characterize the swap rate using a linear combination of out of the money European options:

Proposition (Variance Swap Rate): Given no arbitrage, the time t value of the quadratic variation of an asset from time t to time T can be approximated by a portfolio of out of the money, European call option prices for all strike prices K > 0 with the same T - t maturity using the formula:

(5)   \begin{align*} \mathbb{E} \left[ M_{t \to T} \cdot V_{t \to T} \right] &\approx \frac{2}{T - t} \cdot \int_0^\infty \frac{Y_{t \to T}^{(k)}}{k^2 \cdot B_{t \to T}} \cdot dk \end{align*}

where B_{t \to T} denotes the time t price of a bond that pays out \$1 at time T and Y_{t \to T}^{(k)} denotes the time t price of an out of the money European option with strike price k and end date T where Y represents a call option if k > F and a put if k < F.

Roughly speaking, if you can buy and sell a bunch of state contingent pay outs that deliver \$1 if and only if the return on the underlying crosses some threshold, then via the de Moivre-Laplace convergence of the binomial distribution to the Gaussian under the twisted risk neutral measure, we should be able to characterize the price of the realized variance using these same assets.

Proof: By Ito’s lemma under the risk neutral measure we have that:

(6)   \begin{align*} \ln \left[ F_T \right] &= \ln \left[ F_t \right]  + \int_t^T \frac{1}{F_s} \cdot dF_s - \frac{1}{2} \cdot \int_t^T \sigma_s^2 \cdot ds \end{align*}

Thus, we can compute V_{t \to T} as:

(7)   \begin{align*} V_{t \to T} &= 2 \cdot \left( \frac{F_T}{F_t} - 1 - \ln \left[ \frac{F_T}{F_t} \right] \right) + 2 \cdot \int_t^T \left( \frac{1}{F_s} - \frac{1}{F_t} \right) \cdot dF_s \\ &= \frac{1}{T - t} \cdot \int_t^T \sigma_s^2 \cdot ds \end{align*}

If we use a Taylor expansion of \ln \left[ F_T \right] around \ln \left[ F_t \right], we get:

(8)   \begin{align*} \ln \left[ F_T \right] &= \ln \left[ F_t \right] + \left( \frac{1}{F_t} \right) \cdot \left( F_T - F_t \right) -  \frac{1}{k^2} \cdot \int_0^{F_t} \left( k - F_T  \right)^+ \cdot dk +  \frac{1}{k^2} \cdot \int_{F_t}^\infty \left( F_T - k \right)^+ \cdot dk \end{align*}

Substituting this expansion back into the formulation of the realized variance above yields:

(9)   \begin{align*} V_{t \to T} &= \frac{2}{k^2} \cdot \left\{ \int_0^{F_t} \left( k - F_T  \right)^+ \cdot dk +  \int_{F_t}^\infty \left( F_T - k \right)^+ \cdot dk \right\} + 2 \cdot \int_t^T \left( \frac{1}{F_s} - \frac{1}{F_t} \right) \cdot dF_s \\ &= \left\{ \text{Out of money options.} \right\} \cdot \left( \frac{2 \cdot dk}{k^2} \right) \\ &\qquad \qquad \qquad + \int_t^T \left\{ \text{Reciprocal futures price spread.} \right\} \cdot \left( 2 \cdot B_s \right) \end{align*}

3. Empirical Results

In the section above, I showed how to synthesize the payout to a variance swap rate using options on futures contracts. In the section below, I show how to empirically compute the swap rate and realized variance rates. First, consider the swap rate S_{t \to T}. Since options contracts on futures are generally traded at 30 day horizons with most contracts starting on the 15th of every month, there will not generally be a set 0f futures contracts with the appropriate maturity at each date. Thus, the authors interpolate the intermediary dates using the formula below:

(10)   \begin{align*} S_{t \to T} &= \frac{1}{T - t} \cdot \left( \frac{S_{t \to T_1} \cdot \left( T_1 - t \right) \cdot \left( T_2 - T \right) + S_{t \to T_2} \cdot \left( T_2 - t \right) \cdot \left( T - T_1 \right)}{T_2 - T_1} \right) \end{align*}

The realized variance is a direct application of the theoretical value listed above using futures prices:

(11)   \begin{align*} V_{t \to T} &= \frac{365}{T-t} \cdot \sum_{i=1}^{T-t} \left( \frac{F_{(t + i) \to T} - F_{(t + i - 1) \to T}}{F_{(t + i - 1) \to T}} \right)^2 \end{align*}

The figure below gives the relationship between the mean and standard deviation of the swap rate and realized variance rates of 35 stocks and 5 indicies over the period from 1996 to 2003:


Pairwise scatterplot depicting the mean and standard deviation of the realized variance v, variance swap rate s, and log variance risk premium ln(v/s) for the 40 assets presented in Tables 2 and 3 of Carr and Wu (2009).

The authors also investigate the relationship between the log variance risk premia \ln \left[ V_{t \to T,j} / S_{t \to T,j} \right] computed as the spread between the log realized variance and the log swap rate and various market factors denoted by Z_{t \to T} using the regression specification below:

(12)   \begin{align*} \ln \left[ \frac{V_{t \to T,j}}{S_{t \to T,j}} \right] &= \alpha_j + \beta_j \cdot Z_{t \to T} + \varepsilon_{t \to T,j} \end{align*}

Carr and Wu (2009) find that investors price variance risk premia as measured by Z_{t \to T} equal to the realized variance on the S&P 500; however, classic risk factors such as the excess return on the market or the Fama and French (1993) 3-factor model do not account for the excess returns on this portfolio as the variance risk premia displays both a negative market \beta as well as statistically significant negative \alpha‘s.

  1. See Backus, Chernov and Zin (2010). ↩
  2. In the stochastic calculus sense. ↩

Filed Under: Uncategorized

Notes: Hassan and Mertens (2011)

October 5, 2011 by Alex

1. Introduction

In this note, I outline the main results in Hassan and Mertens (2011)1 for use in a 5min presentation in Prof. Sargent‘s reading group.

This paper asks the question: “Suppose that you air dropped a bit of noise into equity prices, how much worse off would everyone be in the context of a macroeconomic model with production?” As it turns out, adding a bit of noise can have a first order effect on welfare as noising up equity prices causes agents to disinvest in firms in favor of riskless debt, leading to lower production and thus lower consumption as agents can only consume what has been produced. To answer this question, the authors overcome 2 main challenges. First, they define a new equilibrium concept that allows for heterogeneous beliefs. Second, they solve for the equilibrium asset prices which vary non-linearly with their information set.2

To my knowledge, this paper is the first to address the costs of noisy asset prices in a dynamic general equilibrium setting; however, other papers such as Angeletos and Pavan (2007) have looked at the social costs and benefits of correct prices. Veldkamp (2011, Ch 8) also provides a good reference for solving noisy rational expectations models.

2. Model

Consider a standard linear homogeneous production function that produces output Y_t using inputs of capital K_t and labor L_t. \eta_t denotes the total factor productivity and evolves according to a diffusion process with mean - \sigma_\eta^2/2 and variance \sigma_\eta^2:

(1)   \begin{align*} Y_t &= e^{\eta_t} \cdot F(K_t, L_t) \end{align*}

Capital evolves according to the law of motion below where \delta is the depreciation rate and I_t is the aggregate investment level:

(2)   \begin{align*} K_{t+1} &= K_t \cdot \left( 1 - \delta \right) + I_t \end{align*}

There are a continuum of households indexed on the real line by i \in [0,1]. At the beginning of every period, each household i gets a signal about tomorrow’s TFP with noise \nu_t(i) that is distributed normally with mean 0 and variance \sigma_\nu^2:

(3)   \begin{align*} s_t(i) &= \eta_{t+1} + \nu_t(i) \end{align*}

I refer to s_t(i) as the “knowledge” of agent i about tomorrow’s productivity. Given this knowledge, at time 0 each agent then has a value function V_t(i) over consumption streams \left\{ C_t(i) \right\}_{t=0}^\infty given his relative investment in stocks rather than bonds \left\{ \omega_t(i) \right\}_{t=0}^\infty:

(4)   \begin{align*} \begin{split} V_t(i) &= \max_{ \left\{ C_t(i) \right\}_{t=0}^\infty, \left\{ \omega_t(i) \right\}_{t=0}^\infty} \mathcal{E}_{i,t} \left[ \sum_{s=t}^\infty \beta^{s-t} \cdot \log C_s(i) \right] \\ &\text{s.t.} \\ W_{t+1} &= \left[ (1 - \omega_t(i)) \cdot (1 + r) + \omega_t(i) \cdot \left( 1 + \tilde{r}_{t+1} \right) \right] \\ &\qquad \times \left( W_t(i) + w_t \cdot L - C_t(i) \right) + \tau_t(i) \end{split} \end{align*}

In the optimization program above, \mathcal{E}_{i,t} denotes agent i‘s subjective views on how the future will unfold given his knowledge s_t(i), W_t(i) denotes agent i‘s financial wealth at time t, \tilde{r}_{t+1} denotes the return on stocks, \omega_t denotes the wage rate for all agents, L denotes the amount of labor supplied by each household and \tau_t(i) denotes the payments from the contingent claims trading. Equity prices are given by Q_t which is related to dividends D_t via the market return \tilde{r}_{t+1}:

(5)   \begin{align*} 1 + \tilde{r}_{t+1} &= \frac{Q_{t+1} \cdot (1 - \delta) + D_{t+1}}{Q_t} \end{align*}

Let \mathbb{E}_{i,t} be the “true” rational expectations operator which is linked to each agent’s subjective expectations via a common mean 0 shock \tilde{\varepsilon}_t which has volatility \sigma_{\varepsilon}^2:

(6)   \begin{align*} \begin{split} \mathbb{E}_{i,t} \left[ \cdot \right] &= \mathbb{E} \left[ \cdot \mid s_t(i), Q_t, K_t, B_{t-1}, \eta_t \right] \\ \mathcal{E}_{i,t} \left[ \eta_{t+1} \right] &= \mathbb{E}_{i,t} \left[ \eta_{t+1} \right] + \tilde{\varepsilon}_t \end{split} \end{align*}

The representative firm solves the maximization program below and tries to produce as much output as possible given the production technology e^{\eta_t} \cdot F(K_t,L_t) and factor prices w_t and R_t:

(7)   \begin{align*} \Pi_t &= \max_{K_t^d,L_t^d} \left\{ e^{\eta_t} \cdot F\left( K_t^d, L_t^d \right) - w_t \cdot L_t^d - R_t \cdot K_t^d \right\} \end{align*}

All profits are dispersed immediately to shareholders. Firms take the market price of capital Q_t as given and then invest in capital accumulation according to the rule below where Q_t \cdot I_t represents the value of acquiring I_t units of capital, -I_t represents cost from selling I_t units of consumption good and - (\chi/2) \cdot (I_t^2/K_t) represents an adjustment cost:

(8)   \begin{align*} A_t &= \max_{I_t} \left\{ Q_t \cdot I_t - I_t  - \left( \frac{\chi}{2} \right) \cdot \frac{I_t^2}{K_t} \right\} \end{align*}

Thus, we can characterize the dividends paid per share as:

(9)   \begin{align*} \begin{split} D_t &= R_t + \left( \frac{\chi}{2} \right) \cdot \frac{I_{t+1}^2}{K_{t+1}^2} \\ I_t &= \left( \frac{K_t}{\chi} \right) \cdot \left( Q_t - 1 \right) \end{split} \end{align*}

With this machinery in place, I can now define an equilibrium in which agents have heterogeneous beliefs about firm productivity:

Definition (Equilibrium): Given a time path of shocks \left\{ \eta_t, \tilde{\varepsilon}_t, \nu_t(i) \right\} an equilibrium in this economy is a time path of quantities \left\{ C_t(i), B_t(i), W_t(i), \omega_t(i), \tau_t(i) \right\} \left\{ C_t, B_t, W_t, \omega_t, K_t^d, L_t^d, Y_t, K_t, I_t \right\}, signals \left\{ s_t(i) \right\} and prices \left\{ Q_t, r, R_t, w_t \right\} with the properies:

  1. \left\{ C_t(i), \omega_t(i) \right\} solve the households’ maximization problem given the vector of prices, initial wealth and the random sequences \left\{ \tilde{\varepsilon}_t, \tilde{\nu}_t(i) \right\};
  2. \left\{ K_t^d, L_t^d \right\} solve the representative firm’s maximization problem given the vector of prices;
  3. \left\{ I_t \right\} is the investment sector’s optimal policy given the vector of prices;
  4. \left\{ w_t \right\} clears the labor market, \left\{ Q_t \right\} clears the stock market and \left\{ R_t \right\} clears the market for capital services;
  5. There is a perfectly elastic supply of the consumption good and of bonds in world markets where bonds pay the interest rate r and the price of the consumption good is normalized to 1;
  6. \left\{ \tau_t(i) \right\} are defined such that all households enter each period with the same amount of wealth; and
  7. \left\{ B_t(i), C_t, B_t, W_t, \omega_t \right\} are given by the identities:

    (10)   \begin{align*} B_t(i) &= \left[ 1 - \omega_t(i) \right] \cdot \left( W_t(i) - C_t(i) \right) \\ X_t &= \int_0^1 X_t(i) \cdot di, \ \ \forall X \in \left\{ B, C, W \right\} \\ \omega_t &= \frac{Q_t \cdot K_{t+1}}{W_t - C_t} \end{align*}

This definition collapses down to the standard noisy rational expectation equilibrium in the case of \sigma_{\tilde{\varepsilon}} = 0:

Definition (Rational Expectations Equilibrium): A rational expectations equilibrium is an equilibrium as defined above in which the volatility of inference shocks \sigma_{\tilde{\varepsilon}} = 0 so that \mathcal{E}_{i,t} = \mathbb{E}_{i,t}.

However, it also has a sensible interpretation even when agent’s have a common component to their belief heterogeneity:

Definition (Near Rational Expectations Equilibrium): A near rational expectations equilibrium allows for \sigma_{\tilde{\varepsilon}} > 0. Such an equilibrium is k\% stable if the welfare gain to an individual household of obtaining rational expectations is less that k\% of consumption.

3. Financial Economy

In this section, I consider a world in which all agents hold beliefs that are slightly wrong in the same way. To formalize this idea, I define a new error term \varepsilon_t which represents the difference between the average expectation of the total factor productivity at time t in a world where \sigma_{{\varepsilon}} > 0 and the average expectation in a world where \sigma_{\tilde{\varepsilon}} = 0:

(11)   \begin{align*} \varepsilon_t &= \gamma \cdot \tilde{\varepsilon}_t \\ &= \int_{\tilde{\varepsilon}_t>0} \mathcal{E}_{i,t} \left[ \eta_{t+1} \right] \cdot di - \int_{\tilde{\varepsilon}_t=0} \mathcal{E}_{i,t} \left[ \eta_{t+1} \right] \cdot di \end{align*}

In particular, by expressing \gamma = \varepsilon/\tilde{\varepsilon} as a ratio we can ask: How much market forces amplify (\gamma > 1) the existing common error?

To solve for the equilibrium values in the financial economy, I start with the 2 Euler equations linking consumption in subsequent periods to the riskless rate of return and the rate of return on equities. Let \Theta_t = \left\{ K_t, B_{t-1}, \eta_t, \eta_{t+1}, \tilde{\varepsilon}_t \right\} denote the state space. Then, I can express these 2 Euler equations as:

(12)   \begin{align*} \begin{split} C_t \left( \Theta_t, \nu_t(i) \right)^{-1} &= \beta \cdot \mathcal{E}_{i,t} \left[ \left( 1 + \tilde{r}_{t+1} \right) \cdot C_{t+1} \left( \Theta_{t+1}, \nu_{t+1}(i) \right)^{-1} \right] \\ C_t \left( \Theta_t, \nu_t(i) \right)^{-1} &= \beta \cdot \mathcal{E}_{i,t} \left[ \left( 1 + r \right) \cdot C_{t+1} \left( \Theta_{t+1}, \nu_{t+1}(i) \right)^{-1} \right] \end{split} \end{align*}

Next, given these 2 Euler equations, I now face a complicated problem of trying to figure out households’ optimal allocation of consumption, stock purchases and bond purchases given that 1) households’ expectations depend non-linearly on their beliefs about the current \eta_t and 2) asset prices depend non-linearly on the average belief about future productivity. In a related paper, Mertens (2009) shows how to execute a non-linear change of variables that transforms the problem into a optimization program that resembles a more standard noisy rational expectations equilibrium. After this change of variables, I write the stock price as:

(13)   \begin{align*} \begin{split} \hat{q}_t &= \int \mathcal{E}_{i,t} \left[ \eta_{t+1} \right] \cdot di \\ &= \int \mathbb{E}_{i,t} \left[ \eta_{t+1} \mid \hat{q}_t , s_t(i) \right] \cdot di + \tilde{\varepsilon}_t \end{split} \end{align*}

i.e., the stock price equals the market expectation of \eta_{t+1}.

I now proceed down the lines of the solution to a standard noisy rational expectations model with value \eta_{t+1} and noise term \tilde{\varepsilon}_t. I conjecture that the price \hat{q}_t is an affine function of the value and the noise term with parameters \pi_0, \pi_1 and \gamma:

(14)   \begin{align*} \hat{q}_t &= \pi_0 + \pi_1 \cdot \eta_{t+1} + \gamma \cdot \tilde{\varepsilon}_t \end{align*}

Thus, we can interpret \gamma as the loading on the noise term in this pricing rule. This is quite natural as the price \hat{q}_t is the average expected value of the total factor productivity \eta_{t+1} and \int_{\tilde{\varepsilon}_t=0} \mathcal{E}_{i,t} \left[ \eta_{t+1} \right] \cdot di is the price in a world with no common error \sigma_{\tilde{\varepsilon}} = 0. Thus, the loading on the noise term should be \gamma, the amplification parameter defined above.

Next, using results from Mertens (2009), I write the true rational expectation of tomorrow’s total factor productivity \mathbb{E}_{i,t} \left[ \eta_{t+1} \right] as an affine function of each agent’s beliefs and the prevailing price using the coefficients A_0, A_1 and A_2 which derive from the change of variables procedure which I do not outline here:

(15)   \begin{align*} \mathbb{E}_{i,t} \left[ \eta_{t+1} \right] &= A_0 + A_1 \cdot s_t(i) + A_2 \cdot \hat{q}_t \end{align*}

By adding in the common noise term and then summing across agents i \in [0,1], I get another affine expression below which links the average expectation of tomorrow’s total factor productivity to its true value and the amplified noise term:

(16)   \begin{align*} \begin{split} \int \mathcal{E}_{i,t} \left[ \eta_{t+1} \right] \cdot di &= \left( A_0 + A_2 \cdot \pi_0 \right) \\ &\qquad \qquad  + \left( A_1 + A_2 \cdot \pi_1 \right) \cdot \eta_{t+1} \\ &\qquad \qquad \qquad \qquad + \left( A_2 \cdot \gamma + 1 \right) \cdot \tilde{\varepsilon}_t \end{split} \end{align*}

Using this expression, I can then show that as households look more and more to the price of capital when forming prices, the amplification of common shocks to expectations via the pricing mechanism grows:

Proposition (Inference Shock Amplifier): The more weight the households place on the market price of capital when forming their expectations about \eta_{t+1}, the larger is the error in market expectations relative to \tilde{\varepsilon}_t:

(17)   \begin{align*} \gamma &= \frac{1}{1 - A_2} \end{align*}

The proof of this proposition follows directly from taking first order conditions:

Proof: Combining the equations for \hat{q}_t and \int \mathcal{E}_{i,t} \left[ \eta_{t+1} \right] \cdot di above yields a system of 3 equations and 3 unknowns:

(18)   \begin{align*} \pi_0 &= A_0 + A_2 \cdot \pi_0 \\ \pi_1 &= A_1 + A_2 \cdot \pi_1 \\ \gamma &= 1 + A_2 \cdot \gamma \end{align*}

Solving for the unknowns yields:

(19)   \begin{align*} \pi_0 &= \frac{A_0}{1-A_2} \\ \pi_1 &= \frac{A_1}{1 - A_2} \\ \gamma &= \frac{1}{1 - A_2} \end{align*}

Next, I look at how much information prices can hold in regimes that vary based on the the magnitude of the underlying common shock. For instance, consider the following thought experiment. Think about a world in which there was no common shock. If you knew nothing about asset values and then looked at prices, how precisely could you now pin down those values? To within \$5? \$1? \$10? Now consider a world in which \tilde{\varepsilon} > 0 so that agents experience a common shock to their preferences. Now, for instance, agents might revise their expectations upward, hold more stocks as a result, and then influence others to do the same for capital gains purposes as the price depends on the average value of agent’s expectations. In such a world with a positive feedback loop, how much worse are prices at conveying the true value of the asset? If in the world with no common shock, you could see a price of \$30 and know that the asset was worth something between \$29 and \$31, in a world with a common shock have the bounds now expanded to \$30 \pm \$5?

The proposition below dictates that the bounds will be increasing in the volatility of the common shock:

Proposition (Information Absorbency): The amount of information aggregated in the stock prices decreases with the volatility of the common shock \sigma_{\varepsilon}:

(20)   \begin{align*} 0 &> \frac{\partial \pi_1}{\partial \sigma_{\tilde{\varepsilon}}} \end{align*}

The proof comes via Mathematica by taking partial derivative of \pi_1 = A_1/(1 - A_2) with respect to \sigma_{\varepsilon} where \sigma_{\varepsilon} is bound up in the A_1 and A_2 terms so I omit it here.

4. Real Economy

In the previous section I showed how to derive equilibrium values for the financial economy and that the market became substantially less efficient as the volatility of the common shock increased due to the feedback loop between beliefs and asset prices. I now turn to the real economy and show that common errors in expectations distort aggregate consumption in the steady state as well. First, suppose that in equilibrium stock prices are log-normally distributed with variance \sigma^2. Then we know that the optimal consumption plan will be myopically proportional to his wealth as agent’s have log utility:

(21)   \begin{align*} C_t(i) &= \left( 1 - \beta \right) \cdot W_t(i) \end{align*}

What’s more, his optimal stock holdings will also follow the standard log utility rule:

(22)   \begin{align*} \omega_t(i) &= \frac{\mathcal{E}_{i,t} \left[ 1 + \tilde{r}_{t+1} \right] - \left( 1 + r\right)}{\sigma^2} \end{align*}

Next, I characterize the steady state levels of stock holdings as well as the marginal product of capital:

Proposition (Steady State): The equilibrium has a unique stochastic steady state if and only if \beta < (1 + r)^{-1}. At the steady state, the aggregate degree of stock holdings is:

(23)   \begin{align*} \omega_{ss} &= \sqrt{\frac{1}{\sigma^2} \cdot \left( \frac{1 - \beta}{\beta} - r \right)} \end{align*}

and the stochastic steady state capital stock is characterized by:

(24)   \begin{align*} F_K(K_{ss},L) &= \left( 1 + \delta \cdot \chi \right) \cdot \left( r + \omega_{ss} \cdot \sigma^2 + \delta \right) \end{align*}

The restriction on the time preference parameter \beta makes sure that agents do not want to perpetually accumulate capital. The steady state stock holdings links the gap between the time preference parameter and the riskless rate to agent’s depend to hold capital with a volatility of \sigma. The full proof of this result is given below:

Proof: First, I characterize the steady state marginal product of capital. To do this, I start by linking it to the dividend payout rate:

(25)   \begin{align*} D_{t+1} &= e^{\eta_{t+1}} \cdot \left( \frac{d}{dK} F(K_{t+1}, L) \right) \\ \mathbb{E}_{ss} \left[ D_{ss} \right] &= \left( \frac{d}{dK} F(K_{ss}, L) \right) \end{align*}

Next, I observe that the excess return on capital can be linked to the marginal product of capital via the relationship:

(26)   \begin{align*} r + \omega_{ss} \cdot \sigma^2 &=  - \delta + \left( \frac{1}{1 + \delta \cdot \chi} \right) \cdot \left( \frac{d}{dK} F(K_{ss},L) \right) \end{align*}

This last statement yield the desired result after rearranging. Next, I turn to the steady state asset holdings which falls out of the budget constraint. First, I rewrite the budget constraint so that it only contains terms from 2 periods:

(27)   \begin{align*} \left( 1 + r \right) \cdot B_{t-1} + &\left( Q_t \cdot \left( 1 - \delta \right) + D_t \right) \cdot K_t = Q_t \cdot K_{t+1} + B_t + C_t \\ &= Q_t \cdot \left( 1 - \delta \right) \cdot K_t + Q_t \cdot I_t + B_t + C_t \end{align*}

Next, I write the aggregate wealth, consumption and borrowing terms as functions of the steady state asset holdings:

(28)   \begin{align*} C_{ss} &= (1 - \beta) \cdot W_{ss} \\ \beta \cdot W_{ss} &= K_{ss} \cdot (1 + \delta \cdot \chi) + B_{ss} \\ B_{ss} &= \beta \cdot W_{ss} \cdot \left( 1 - \omega_{ss} \right) \end{align*}

Finally, I substitute these terms back in and rearrange to yield a result linking the steady state asset holdings to the constants:

(29)   \begin{align*} \left( 1 + \delta \cdot \chi \right) \cdot K_{ss} &= \beta \cdot W_{ss} \cdot \omega_{ss} \\ B_{ss} &= \left( \frac{1 - \omega_{ss}}{\omega_{ss}} \right) \cdot \left( 1 + \delta \cdot \chi \right) \cdot K_{ss} \end{align*}

Next, I look at the steady state capital accumulation rate of the economy conditional on the volatility of the stock market:

Proposition (Capital Accumulation): An increase in the conditional variance of stock returns decreases the level of capital stock and total output:

(30)   \begin{align*} 0 &> \frac{\partial K_{ss}}{\partial \sigma} \end{align*}

This result says that the economy will acquire less and less capital as the volatility of stock returns increases. This result stems directly from the full derivative of the steady state capital level:

Proof: First, start with the equation above characterizing the steady state stock holdings \omega_{ss} and then take the full derivative:

(31)   \begin{align*} \frac{d}{d\sigma} K_{ss} &= \frac{1 + \delta \cdot \chi}{\frac{d^2}{(dK)^2} F(K_{ss},L)} \cdot \sqrt{\frac{1 - \beta}{\beta} - r} \end{align*}

Rearranging terms and signing the RHS yields the necessary relationship.

Thus, in this section I showed that the economy is worse off when stock market volatility increases as a result of a common error in expectations.

  1. See Tarek Hassan and Thomas Mertens ↩
  2. By contrast, in the standard noisy rational expectations setup (e.g., see here) all outcomes are linearly related. ↩

Filed Under: Uncategorized

Notes: Budish (2011)

September 21, 2011 by Alex

1. Introduction

In this note, I outline the results in Budish (2011)1 for a presentation in Prof. Sargent’s reading group. Here are the slides themselves.

This paper introduces a new concept for clearing markets with indivisible goods and asset complementarities which is similar to a competitive equilibrium with equal incomes (CEEI)2. In particular, Budish (2011) extends this idea from the baseline setting with independent and perfectly divisible goods by allowing for both approximate market clearing as well as slightly unequal incomes. He then shows that the resulting equilibrium and market design concepts have the desirable qualities of being Pareto efficient, fair and strategy-proof (…given that these last 2 concepts have been appropriately adjusted).

2. Terminology and Notation

There are a set of N students denoted by \mathcal{S} = \{1,2,\ldots,N\}. There is a set of M courses, \mathcal{C} = \left\{ 1, \ldots, j, \ldots, M \right\} with q_j \in \mathbb{Z}_+ seats in each course j. A consumption bundle x_i for student i is a binary vector in \{0,1\}^M with 1 meaning that the student took the course and 0 meaning that the student didn’t. In this notation, the set of all possible courses is the powerset of \mathcal{C}: 2^{\mathcal{C}}. Each student i has von Neumann Morgenstern preferences over allocations defined as u_i(\cdot): \{0,1\}^M\mapsto \mathbb{R}_+. Let \Psi \subset 2^{\mathcal{C}} be the set of admissible schedules where the utility from an inadmissible schedule is 0. Formally, an economy is an object:

Suppose that each student gets assigned an endowment b_i \in \mathbb{R}_+ of fake currency and the market clears at prices p_j \in \mathbb{R} for each class j \in M.

3. Approximate CEEI

With this terminology in hand, I now define the equilibrium concept:

Definition (Approximate CEEI): Take an economy \mathcal{E}. For this economy, the allocation \mathbf{x}, budgets \mathbf{b} and prices \mathbf{p} constitute a (\alpha,\beta)-Approximate CEEI if the following 3 conditions hold:

(1)   \begin{align*} x_i &= \arg \max_{x \in 2^{\mathcal{C}}} \left\{ u_i(x) \mid \mathbf{p} \ x \leq b_i \right\} \quad \forall i \in \mathcal{S} \\ \alpha &\geq \left\Vert z_1(\mathbf{p}),z_2(\mathbf{p}),\ldots,z_M(\mathbf{p}) \right\Vert_2 \\ 1 + \beta &\geq \max_{i \in \mathcal{S}} b_i \geq 1 = \min_{i' \in \mathcal{S}} b_{i'} \end{align*}

where z_j(\mathbf{p}) = \sum_{i \in \mathcal{S}} x_{i \to j} - q_j.

In words, \alpha is controlling the approximation of the market clearing condition and \beta is controlling the approximation of the equal income condition. For example, say that \alpha = 10, and then there are at most 2 classes with 7 too many students: \sqrt{7^2 + 7^2} \approx 10. Alternatively, there could be 1 class with 10 students too many students. Turning to \beta, suppose that we have a \beta = 0.05, then the student with the least fake class-seat-buying currency has 1 dollar while the most wealthy student has no more that 1.05 dollars.

Next, I give the central existence result:

Proposition (Equilibrium Existence): Define the following 2 new variables:

(2)   \begin{align*} k &= \max_{i \in \mathcal{S}} \max_{x \in \Psi} \vert x \vert \\ \sigma &= \min \left\{ 2 \cdot k, M \right\} \end{align*}

First, we have that for any \beta > 0, there exists a (\sqrt{\sigma \cdot M}/2 , \beta)-Approx CEEI.

Second, it is also the case that for any \beta > 0, any budget vector \mathbf{b} that lies in the [1,1+\beta] interval and any \varepsilon > 0, there exists a (\sqrt{\sigma \cdot M}/2 , \beta)-Approx CEEI with budgets \mathbf{b}^* that satisfy the condition:

(3)   \begin{align*} \varepsilon &> \vert b_i^* - b_i \vert \quad \forall i \in \mathcal{S} \end{align*}

What is this proposition saying in words? The first part of the result says that for any arbitrarily small distortion of wealth inequalities, there exists an (\alpha,\beta)-approximate CEEI where the market clearing approximation is bounded by the amount \sqrt{\sigma \cdot M}/2. The second part of the result captures the fact that, for any budget vector \mathbf{b} in the [1,1+\beta] interval that a school administrator might choose, there is a (\alpha,\beta)-Approximate CEEI really close too it. Put differently, a school administrator can just randomly assign the initial budget vectors. Thinking ahead, this second clause indicates that the budget dispersion rather than the allocations themselves plays a key role.

Before I sketch out the proof of this result, let me first walk through an illustrative example:

Example (2 Diamonds, 2 Rocks): Suppose that there are 2 buyers, Bob and Alice, who want to split 4 items between them:

(4)   \begin{align*} \mathcal{C} &= \left\{ {\Large \text{Diamond}}, {\small \text{Diamond}}, {\Large\text{Rock}}, {\small\text{Rock}}  \right\} \end{align*}

If both Bob and Alice have an equal endowment, then there will not exist a price vector for each stone that clears the market. At each price point, both buyers would want the exact same bundle bundle making prices a useless tool for dividing up the goods. However, suppose instead that Bob has \$1 while Alice has \$(1+\varepsilon). Then, Alice would be able to buy the big diamond at a price of \$(1 + \varepsilon). Given that Alice would then have no more money, Bob could then afford to purchase both the small diamond and the big rock:

Thus, by noising up the wealth distribution a little bit, prices can become a useful tool for dividing up indivisible goods. What’s more, although Bob still envies Alice’s big diamond, his envy is limited to a single good making this division as fair as possible given the inherent indivisibility of the goods.

I now turn to sketching the existence proof out in more detail below:

Proof: I want to show that there exists a tuple of prices, budgets and allocations (\mathbf{p},\mathbf{b},\mathbf{x}) that satisfy the 3 conditions in the definition above so long as \alpha \leq \sqrt{M \cdot \sigma} / 2 and \beta > 0. If we were in the standard Econ 101 world of Mas-Collel et al. (2001) Ch 16 with Walrasian equilibria, then we could use the smoothness and convexity of the problem to guarantee existence and derive the welfare theorems. Here though, I need to overcome a critical problem: due to indivisibilities and asset complementaries, the excess demand function is discontinuous and possibly non-convex. Thus, I need to bound the size of the aggregate impact of these discontinuities, make the problem convex, and then finally apply well known fixed point theorems to show that an equilibrium must exist:

The result is in essence that, if people are given budgets with a little bit of noise from person to person, then there will exist a price vector \mathbf{p} such that no one would be strictly better off with a pairwise switch and markets almost clear. I prove a restricted version of the result showing the existence of (M \cdot \sqrt{\sigma},\beta)-Approximate CEEI. The jump from this baseline result to the tighter \sqrt{\sigma \cdot M}/2 result is purely mathematical, so for details please refer to the paper.

(Bound Discontinuities) The finiteness of the problem and the income inequality bound the aggregate size of the demand discontinuities. First, note that at worst a small change in prices can make an agent switch his entire portfolio from one bundle x_i to an entirely different bundle x_i' with \emptyset = x_i \cap x_i'. The Euclidean distance in the change in the aggregate demand due to such a shift would be \sqrt{\sigma}. To get a clearer idea of this magnitude, consider the M = 2 case below:

Next, if all agents had the same income b_i = b_{i'} \ \forall i \in \mathcal{S}, such a shift might lead to an N \cdot \sqrt{\sigma} change in the aggregate demand vector. However, if every agent has a unique budget b_i, then in general there will be at most M agents affected by a tiny price fluctuation. Why? Due to the variation in budget sets, there will always be some agent i_j who is closest to his budget constraint hyperplane in the j^{th} direction:

Thus, a little change in prices will in general only cause an M \cdot \sqrt{\sigma} shift in the aggregate demand vector.

(Convexify) Next, I want to convexify the problem. Due to the demand discontinuities and asset complementarities, it may be the case that moving prices by \varsigma might make you demand less of asset j, but moving them by 2 \cdot \varsigma might make you demand more as the good you would have liked to buy is now too expensive and you have to settle. To do this, I use a limiting result. Consider a tatonnement process f(\mathbf{p}) such that:

(5)   \begin{align*} f(\mathbf{p}) &= \mathbf{p} + \mathbf{z}(\mathbf{p}) \end{align*}

where a fixed point of f(\cdot ) would represent a set of market clearing prices. Then convexify the mapping f(\cdot) as follows:

(6)   \begin{align*} F(\mathbf{p}) &= \mathtt{co} \left\{ \mathbf{y} \mid \exists \mathbf{p}_n \to \mathbf{p} \text{ s.t. } f(\mathbf{p}_n) \to \mathbf{y} \right\} \end{align*}

Cromme and Diener (1991) show that F(\cdot) is upper hemi-continuous and convex. Thus, if I can find a fixed point for F(\cdot) I know in some sense that there is a market clearing set of prices for the true economy nearby.

(Apply Fixed Point Theorems) Finally, I can apply standard fixed point theorems to this object F(\cdot) in order to show that an equilibrium exists. In particular Cromme and Diener (1991) also show that if P \subset \mathbb{R}^n is compact and convex and f: P \mapsto P is any mapping, then we have that:

(7)   \begin{align*} \alpha &\geq \left\Vert f(\mathbf{p}) - \mathbf{p} \right\Vert \end{align*}

From intuition above, we know that \sqrt{\sigma} \cdot M > \alpha.

This error of \alpha \propto M \cdot \sqrt{\sigma} is particularly nice because it doesn’t depend on either q_j or N. Thus, even as the market gets really large, the market clearing error remains the same size.

4. Efficiency

Given that the equilibrium exists, we also want to know its relationship to the first best outcome:

Proposition (Analogue to First Welfare Theorem): Suppose \left( \mathbf{x}, \mathbf{b}, \mathbf{p} \right) is an (\alpha,\beta)-Approx CEEI of the economy \mathcal{E}. Then, the allocation \mathbf{x} is a Pareto efficient allocation in \mathcal{E}.

This result comes almost for free and follows the standard First Welfare Theorem line of argument:

Proof: Suppose that there is some feasible allocation \mathbf{x}' that Pareto improves on \mathbf{x} for an agent i. Since we know that x_i was in agent i‘s feasible utility maximizing set, then if x_i' \neq x_i we have that \mathbf{p} \ x_i' > \mathbf{p} \ x_i. However, this means that the consumption bundle \mathbf{x}'is infeasible as:

(8)   \begin{align*} \sum_{i = 1}^N \mathbf{p} \ x_i' &> \sum_{i=1}^N \mathbf{p} \ x_i \end{align*}

yielding a contradiction.

5. Fairness

In some sense, in a world with indivisible goods there will always be some unfairness. Bob or Alice can have the big diamond in the example above, but not both; thus, either Bob or Alice will envy the other’s allocation. In order to properly discuss the relative fairness of different equilibrium concepts, we need to introduce a measuring stick concept in order to compare degrees of unfairness:

Definition (Envy Bounded by a Single Good): An allocation \mathbf{x} satisfie envy bounded by a single good if, for any i,i' \in \mathcal{S}, either:

(9)   \begin{align*} u_i(x_i) &\geq u_i(x_{i'}) \end{align*}

…or, there exists some good j \in x_{i'} such that

(10)   \begin{align*} u_i(x_i) &\geq u_i(x_{i'} \setminus \{ j\}) \end{align*}

If an equilibrium satisfies the condition of envy bounded by a single good, then we could play the following game. You get to pick any 2 agents i and i' in the economy where u_i(x_{i'}) > u_i(x_i) and give them to me. Then, I will show you the good j in agent i'‘s allocation such that if you removed that good, agent i would no longer envy agent i'‘s allocation. For instance, in the example above, we could remove the big diamond from Alice’s allocation and then Bob would be perfectly happy with his little diamond and big rock allocation. Below I give the main result showing that if income inequality is small enough, then an (\alpha,\beta)-Approximate CEEI will display envy bounded by a single good:

Proposition (Bounded Envy): For any economy \mathcal{E}, if \left( \mathbf{x}, \mathbf{b}, \mathbf{p} \right) is an (\alpha, \beta)-Approx CEEI with

(11)   \begin{align*} \beta &< \frac{1}{k-1} \end{align*}

…then \mathbf{x} satisfies the condition of envy bounded by a single good.

To get an idea of where this result is coming from, return to the example with Alice and Bob above. In this example, by giving Alice a little bit more income in the beginning she was able to buy the big diamond; however, because she no longer had any money left, Bob could then buy the small diamond and the big rock. This intuition would break down if we gave Alice, say, 2 + \varepsilon times the amount of money as Bob so that she could buy both the big and small diamonds. Thus, we should suspect that envy bounded by a single good will fall out of an (\alpha,\beta)-Approximate CEEI in a world where the income inequality is not so large. The proof below makes this intuition a bit more precise:

Proof: Suppose a contradiction where agent i envies agent i'‘s allocation and this envy is not bounded by a single good. Let k' \leq k denote the number of objects envied in bundle x_{i'} by agent i. Thus, for each of these goods, we could remove just 1 of them and agent i would still prefer this bundle to his own:

(12)   \begin{align*} u_i(x_{i'} \setminus \left\{ j_1 \right\} ) &> u_i(x_i) \\ &\vdots \\ u_i(x_{i'} \setminus \left\{ j_{k'} \right\} ) &> u_i(x_i) \end{align*}

What’s more, it has to be the case that agent i cannot afford any of these bundles otherwise he would have bought them in the first place:

(13)   \begin{align*} b_{i'} &\geq \mathbf{p} \ (x_{i'} \setminus \left\{ j_1 \right\} ) > b_i \\ &\vdots \\ b_{i'} &\geq \mathbf{p} \ (x_{i'} \setminus \left\{ j_{k'} \right\} ) > b_i \end{align*}

By summing up these budget constraint inequalities we get the result that:

(14)   \begin{align*} (k'-1) \cdot b_{i'} &\geq (k'-1) \cdot (\mathbf{p} \ x_{i'}) > k' \cdot b_i \\ \frac{b_{i'}}{b_i} &\geq \frac{k'}{k'-1} \end{align*}

Since k' > k we can replace k'/(k'-1) in the above inequality with the term k/(k-1). However, we know that at most the ratio of b_{i'}/b_i can be 1+\beta so have that if (k-1)^{-1} > \beta there is a contradiction!

6. Strategy Proof-ness

Finally, I turn to strategy proof-ness. As this is a property of the actual algorithm for assigning agents to allocations and not of the stopping condition for this algorithm (i.e., the equilibrium), I first need  to outline this algorithm:

Definition (Approximate CEEI Mechanism): Define the algorithm below as the Approximate CEEI Mechanism:

  1. Each agent i reports her utility function \hat{u}_i.
  2. Check for (0,0)-Approximate CEEI’s.
  3. If non-empty:
    1. Choose random (\mathbf{x}, \mathbf{b}, \mathbf{p}).
  4. If empty:
    1. Choose target budget \mathbf{b}' uniformly from [1,1+\beta] with \beta < \min \left\{ N^{-1}, (k-1)^{-1} \right\}.
    2. Set \varepsilon \approx 0, \delta < 1 - N \cdot \beta and \alpha \leq \sqrt{\sigma \cdot M}/2.
    3. Compute set of feasible (\alpha,\beta)-Approx.\ CEEI’s.
    4. Choose random (\mathbf{x}, \mathbf{b}, \mathbf{p}) from set with minimum \alpha and \Vert \mathbf{b} - \mathbf{b}' \Vert small.

In words, this algorithm says to first solicit preferences and check for exact market clearing with prices. If no such solution exists, compute the set of (\sqrt{\sigma \cdot M}/2, \beta)-Approximate CEEI with \beta sufficiently small and choose a market clearing set of prices and allocations randomly from this set.

Next, I want to show that this algorithm is strategy proof if all of the agents behave as price takers. This seems like a reasonable assumption given that the equilibrium chooser (…think about a business school administrator) randomly selects from the set of admissible equilibria at the end of the period. One way to formalize this price taker idea would be to think about replacing each agent in the economy with a unit mass of identical agents:

Definition (Continuum Replication): The continuum replication of an economy \mathcal{E} written as:

(15)   \begin{align*} \mathcal{E}^\infty &= \left( \mathcal{S}^\infty, \mathcal{C}, \left( q_j \right)_{j=1}^M, \left( \Psi_i \right) _{i \in \mathcal{S}^\infty}, \left( u_i \right)_{i \in \mathcal{S}^\infty} \right) \end{align*}

can be constructed as by replacing each agent in the original economy with a unit mass of identical agents. \mathcal{S}^\infty = (0,N] so that agent 1 is replaced with the mass (0,1], agent 2 with the mass (1,2], agent 3 with the mass (2,3] and so on…

Then, given this replacement, we can define a strategy proof-ness concept for price taking agents:

Definition (Strategy Proof in the Large): A mechanism is strategy proof in the large if it is exactly strategy proof in the continuum replication of any finite economy.

This yields the final result:

Proposition (Strategy-Proofness): The Approximate CEEI Mechanism is SPITL.

Proof: Pick an economy \mathcal{E} and consider its continuum replication \mathcal{E}^\infty. Consider agent i \in \mathcal{S}^\infty and fix all other agent’s reports. Agent i has measure 0 so cannot affect prices. By definition of approximate-CEEI, agent i does best by truth telling given budget b_i otherwise the maximization condition would be violated.

7. Conclusion

I chose this paper to present because it has 2 interestign take aways: First, this paper outlines the actual mechanics of market clearing in a common real world setting. In standard macroeconomic and financial theory, this process is a bit of black box. While the goods usually considered in these matching models are things like class and job allocations3, elementary school allocations4 and organ donor matching5, you could think about alternative settings that might be fitting in macro-finance like buy-sell trade matching at high frequencies or even capital good allocations at lower frequencies in a market with financial constraints where it is difficult to get loans to make side payments.

Second, the paper presents a model in which in order to constrain the extent of the supply-demand mismatch, the model demands a bit (…a quantity which I define in more detail below) of inequality in order to achieve the equilibrium. Heterogeneity in wealth actually allows agents to equitably decide who gets indivisible assets using prices. Thus, wealth inequality has a new and different role in the effectively clearing markets.

  1. Website: Eric Budish at Chicago Booth. ↩
  2. See Varian (1974). ↩
  3. See Hylland and Zeckhauser (1979), Roth (1982) and Roth and Peranson (1999). ↩
  4. See Abdulkadiroglu et al. (2005), Abdulkadiroglu et al. (2005) or Pathak and Sonmez (2008). ↩
  5. See Roth et al. (2004). ↩

Filed Under: Uncategorized

Financial Econometrics Software

September 13, 2011 by Alex

1. Introduction

In this note I outline the basic facts and rules of thumb about financial econometric software that is relevant for Rob Engle‘s Fall 2011 Financial Econometrics (2) PhD course.1 First, I give a quick introduction to EViews and point to some more in depth examples. Next, I outline the benefits and limitations of using EViews and suggest that econometric derivations should be checked via numerical simulations in a more suitable programming/statistical language such as MATLAB, R or Scientific Python. My general feeling is that EViews is an excellent point of departure for doing exploratory data analysis and initial hypothesis testing; however, when doing more complicated statistical procedures a MATLAB, R or SciPy are better choices. Finally, I conclude with some helpful resources for documenting and warehousing code.

2. Using EViews

EViews is a statistical/econometric software package that is useful for getting quick summaries of financial time series using a WYSIWYG/point-and-click interface. To my mind, EViews is to MATLAB or R what a Google Doc2 is to a LaTeX file. In the same way that you should veer towards LaTeX rather than a Word Document if you are interested in writing a very technical article with lots of equations, you should head towards MATLAB or R if you want to do any serious simulations or custom econometrics; however, no one in their right mind would bother writing a quick note or to do list in TeX. If you want to do an initial exploration of the properties of a time series, EViews is a good place to start.

Below I take a quick look at the sort of procedures EViews is well equipped to tackle. For a set of more in depth examples, I found that Dick Startz and Dave Smant had good introductions and sample code to play with.

First, I load in a \mathtt{.wf1} EViews workspace file containing the adjusted closing price of BNP Paribas. After loading the data and 2 mouse clicks (selecting “show” and then “graph”), I can display the time series along with the unconditional price distribution.

Adjusted close for BNP Paribas from 2003 to 2011.

The summary statistics for this unconditional distribution are available using the “stats” menu icon on the upper right. This option gives ou the standard mean, median, min, max and standard deviation of the observations just as in MATLAB, R or STATA. It also gives you 2 higher moments, skewness and kurtosis, as well as the Jarque-Bera test for normality as the software is tailor made for financial econometrics. As a reference, this is a similar print out to the one available using the RMetrics suite in R.

Summary statistics for the daily adjusted closing price of BNP Paribas over the period from 2003 to 2011.

If you want to look at returns rather than prices, then in the command line you would enter the command below to generate (“\mathtt{genr}“) the new series:

(1)   \begin{equation*} \mathtt{genr} \ \mathtt{ret\_bnp} \ \mathtt{=} \ \mathtt{dlog} \mathtt{(} \mathtt{bnp\_adj\_close} \mathtt{)} \end{equation*}

This new series can then be inspected in the exact same way as the original price time series:

Summary statistics for the daily return series for BNP Paribas for the period from 2003 to 2011.

3. Benefits of Simulation

EViews is a nice tool for exploratory data analysis and quick intuition building experiments. However, for more serious econometric work we need to move to a more complete programming language. What’s more, because EViews includes so many out of the box statistical tests (e.g., the Jarque-Bera test for normality), it is not a great environment within which to learn econometric theory.3 At the end of the day, if you want to learn the econometrics, you need to be able to code it up.

To be more concrete, consider problem 4 in the first homework assignment on blackboard which asks you to consider the ARCH model below:

(2)   \begin{align*} r_t &= \sqrt{h_t} \cdot z_t \\ h_t &= \omega + \delta \cdot h_{t-1} + \alpha \cdot \frac{r_{t-1}^2 - h_{t-1}}{\sqrt{h_{t-1}}} \end{align*}

where h_t is the conditional volatility and r_t is a return, and then work out some properties of the estimator \mathbb{E}_t\left[ r_{t+2}^2\right].

The analytical solution to this problem can be derived by recursively substituting in the 1 step ahead formula for returns and then applying the law of iterated expectations. However, this procedure involves a fair bit of algebra4, which tends to obscure the intuition and lead to silly errors. For this sort of question, you really want to be able to check your work in a constructive way, and this is where numerical simulations come in extremely handy as they allow you to both check your work and visualize comparative statics. There is no reason why you should ever deliver a homework question of this sort with a wrong answer. Use R, MATLAB, SciPy or rocks.

As an example, take a look at my previous post in which I walk through the properties of the Hodrick (1992) standard errors for overlapping returns.

4. Document and Share Your Code!

How many additional citations and article views does John Cochrane get simply because he posts the data and code to all his papers in a easy to find location (here) on his website?

There are a variety of ways to dynamically document your code as you work. For instance see, Doxygen  for Python and Roxygen for R. Use Cell-Mode for MATLAB. If you use Emacs, take a look at Org-Mode with Org-Babel as an IDE for any language you could ever want. For EViews you really don’t have many options. All you can do is copy and paste the output into a word document.

  1. As always, the usual disclaimer applies: these are my thoughts and opinions. ↩
  2. You can actually use LaTeX equations in Google Docs now, though only in a restricted sense. Nevertheless, for short projects this seems like a nice tool for collaborating with distant coauthors as Google can take care of the version control. ↩
  3. This section has been updated in response to a comment. Upon re-reading, I was harsher phrasing than I intended. ↩
  4. Use Mathematica or SageMath for symbolic results. Don’t screw up simple algebra. You want to focus on the economics/finance, not the algebra. ↩

Filed Under: Uncategorized

« Previous Page
Next Page »

Pages

  • Publications
  • Working Papers
  • Curriculum Vitae
  • Notebook
  • Courses

Copyright © 2026 · eleven40 Pro Theme on Genesis Framework · WordPress · Log in

Loading Comments...

You must be logged in to post a comment.