# Marginal Continuous Distributions {#marginal-continuous}
## Motivating Example {-}
In Lesson \@ref(normal) on the normal distribution, we saw that there is no closed-form expression for the
antiderivative
\[ \int ce^{-z^2/2}\,dz, \]
where $c$ is a constant.
The definite integral must be computed numerically. This means that we can compute the integral to
any precision we like, but exact values are, in general, impossible.
How, then, can we be sure that the constant $c$ must be $1 / \sqrt{2\pi}$ for the total probability
to be 1? That is,
\begin{equation}
\int_{-\infty}^\infty \frac{1}{\sqrt{2\pi}} e^{-z^2/2}\,dz = 1.
(\#eq:gaussian-integral)
\end{equation}
Using numerical integration, we might be able to figure out that $c \approx .3989423$ (or as
many decimal places as desired), but how do we know it is $1 / \sqrt{2\pi}$ exactly?
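For a quick numerical check, here is a minimal R sketch using the built-in `integrate` function (this code is our illustration, not part of the derivation):

```{r}
# Numerically approximate the constant c that makes c * exp(-z^2 / 2)
# integrate to 1 over the real line.
total <- integrate(function(z) exp(-z^2 / 2), lower = -Inf, upper = Inf)
1 / total$value  # approximately 0.3989423
1 / sqrt(2 * pi) # agrees to numerical precision, but is it exact?
```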
It turns out that the definite integral \@ref(eq:gaussian-integral) can be evaluated exactly, but to see this,
we need to use joint distributions.
## Theory {-}
The definition for the marginal p.d.f. mirrors the definition of the marginal p.m.f. for
discrete distributions \@ref(def:marginal-discrete), except with sums replaced by
integrals and the joint p.m.f. replaced by the joint p.d.f.
```{definition marginal-continuous, name="Marginal Distribution"}
Suppose we have the joint p.d.f. $f(x, y)$ of two continuous random variables $X$ and $Y$.
The marginal p.d.f. of $X$ is the p.d.f. of $X$ alone and is obtained by _integrating_
the joint p.d.f. over all the values of $Y$.
\begin{equation}
f_X(x) = \int_{-\infty}^\infty f(x, y)\,dy.
(\#eq:marginal-x-continuous)
\end{equation}
Likewise, the marginal p.d.f. of $Y$ is the p.d.f. of $Y$ alone and is obtained by _integrating_
the joint p.d.f. over all the values of $X$.
\begin{equation}
f_Y(y) = \int_{-\infty}^\infty f(x, y)\,dx.
(\#eq:marginal-y-continuous)
\end{equation}
```
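To make the definition concrete, here is a short R sketch that recovers a marginal p.d.f. by numerical integration. The joint p.d.f. $f(x, y) = x + y$ on the unit square is chosen purely for illustration:

```{r}
# Illustrative joint p.d.f.: f(x, y) = x + y for 0 < x < 1, 0 < y < 1.
# Integrating out y gives the marginal f_X(x) = x + 1/2.
f_joint <- function(x, y) ifelse(x > 0 & x < 1 & y > 0 & y < 1, x + y, 0)

marginal_x <- function(x) {
  # Integrate the joint p.d.f. over all values of y, per the definition above.
  # (The p.d.f. is zero outside 0 < y < 1, so we integrate over its support.)
  integrate(function(y) f_joint(x, y), lower = 0, upper = 1)$value
}

sapply(c(0.25, 0.5, 0.75), marginal_x)  # 0.75, 1.00, 1.25 (i.e., x + 1/2)
```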
If $X$ and $Y$ are independent, then their joint p.d.f. is the product of their
marginal p.d.f.s, just like it was for discrete random variables (Theorem \@ref(thm:joint-independent)):
```{theorem joint-pdf-independent, name="Joint Distribution of Independent Random Variables"}
If $X$ and $Y$ are independent, then
\begin{equation}
f(x, y) = f_X(x) \cdot f_Y(y)
\end{equation}
for all values $x$ and $y$. But only if $X$ and $Y$ are independent!
```
In the following examples, we construct the joint p.d.f. of two independent random variables $X$ and $Y$
using Theorem \@ref(thm:joint-pdf-independent). Then, we integrate the joint p.d.f. to obtain probabilities,
as usual.
```{example, name="Symmetry of Continuous Random Variables"}
The lifetimes of two lightbulbs, $X$ and $Y$, are independent $\text{Exponential}(\lambda)$ random variables.
What is the probability the second lightbulb lasts longer than the first, $P(Y > X)$?
First, we write down the joint p.d.f. using Theorem \@ref(thm:joint-pdf-independent):
\[ f(x, y) = \begin{cases} \lambda e^{-\lambda x} \cdot \lambda e^{-\lambda y} & x, y \geq 0 \\ 0 & \text{otherwise} \end{cases}. \]
Now, we just need to integrate this p.d.f. over the region where $y > x$. We sketch a bird's-eye view below.
```{r, echo=FALSE, engine='tikz', out.width='50%', fig.ext='png', fig.align='center', engine.opts = list(template = "tikz_template.tex")}
\begin{tikzpicture}[scale=3]
\draw[->] (-.2, 0) -- (1.3, 0);
\draw[->] (0, -.2) -- (0, 1.3);
\draw (1, -.04) -- (1, .04);
\draw (0.5, -.04) -- (0.5, .04);
\draw (-.04, 0.5) -- (.04, 0.5);
\draw (-.04, 1) -- (.04, 1);
\node[anchor=west] at (1.3, 0) {$x$};
\node[anchor=south] at (0, 1.3) {$y$};
\node[anchor=east] at (0, 1) {$2$};
\node[anchor=north] at (1, -.04) {$2$};
\node[anchor=north] at (0.5, -.04) {$1$};
\node[anchor=east] at (0, 0.5) {$1$};
\node[anchor=west,blue] at (0.05, 1.1) {$y > x$};
\fill [blue,fill opacity=.3] (0, 0) -- (1.2, 1.2) -- (0, 1.2);
\draw[blue,dashed,<->] (-.2, -.2) -- (1.3, 1.3);
\node[blue,rotate=45] at (1.15, 1.05) {$x = y$};
\end{tikzpicture}
```
Now we integrate over the above region:
\begin{align*}
P(Y > X) &= \int_0^\infty \int_x^\infty \lambda^2 e^{-\lambda x - \lambda y}\,dy\,dx \\
&= \int_0^\infty \lambda e^{-\lambda x} \underbrace{\int_x^\infty \lambda e^{-\lambda y}\,dy}_{e^{-\lambda x}}\,dx \\
&= \int_0^\infty \lambda e^{-\lambda x} e^{-\lambda x}\,dx \\
&= \int_0^\infty \lambda e^{-2\lambda x}\,dx \\
&= \frac{1}{2}.
\end{align*}
(One way to see the last equality is to recognize $f(x) = 2 \lambda e^{-2\lambda x}; x > 0$ as the p.d.f. of an
$\text{Exponential}(2\lambda)$ distribution, so we multiply and divide by $2$ to make the integrand match
this p.d.f., which then must integrate to 1.)
In hindsight, the probability _has_ to be $1/2$ by symmetry. We know that either $X > Y$ or $X < Y$,
since the probability that $X = Y$ (i.e., the two lightbulbs have _exactly_ the same lifetime) is zero.
Because $X$ and $Y$ are independent with the same distribution, there is no reason for one to be preferred
over the other, so $P(X > Y) = P(X < Y) = \frac{1}{2}$.
In fact, by the exact same argument, this is true for _all_ independent and identically distributed (i.i.d.)
continuous random variables with a joint p.d.f.
```
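As a quick sanity check, here is a small Monte Carlo sketch in R (our own addition). We set $\lambda = 1$ arbitrarily, since the answer does not depend on $\lambda$:

```{r}
# Estimate P(Y > X) for independent Exponential(lambda) lifetimes by simulation.
set.seed(1)
lambda <- 1  # arbitrary choice; the probability is 1/2 for every lambda
n <- 100000
x <- rexp(n, rate = lambda)
y <- rexp(n, rate = lambda)
mean(y > x)  # approximately 0.5
```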
The next example answers the question posed at the beginning of this lesson.
```{example, name="The Gaussian Integral"}
The p.d.f. of a standard normal random variable $Z$ is
\[ f(z) = c e^{-z^2/2}, \]
where $c$ is a constant to make the p.d.f. integrate to 1.
Surprisingly, the easiest way to determine $c$ is to define
_two_ independent standard normal random variables $X$ and $Y$,
and use the fact that their _joint_ p.d.f. must integrate to 1.
By Theorem \@ref(thm:joint-pdf-independent), the joint p.d.f. of $X$ and $Y$ is
\[ f(x, y) = ce^{-x^2 /2} \cdot ce^{-y^2 / 2} = c^2 e^{-(x^2 + y^2) / 2}. \]
We know from Lesson \@ref(joint-continuous) that the total volume under any joint p.d.f. must be 1:
\[ 1 = \int_{-\infty}^\infty \int_{-\infty}^\infty f(x, y)\,dx\,dy = \int_{-\infty}^\infty \int_{-\infty}^\infty c^2 e^{-(x^2 + y^2) / 2}\,dx\,dy. \]
Remarkably, this double integral can be evaluated exactly (even though the single integral could not). To evaluate the
double integral, we convert to [polar coordinates](https://tutorial.math.lamar.edu/classes/calciii/DIPolarCoords.aspx), using the substitutions $r^2 = x^2 + y^2$ and $dx\,dy = r\,dr\,d\theta$:
\begin{align*}
c^2 \int_{-\infty}^\infty \int_{-\infty}^\infty e^{-(x^2 + y^2) / 2}\,dx\,dy &= c^2 \int_{0}^{2\pi} \int_{0}^\infty e^{-r^2 / 2} r\,dr\,d\theta \\
&= c^2 \int_0^{2\pi} \underbrace{\int_0^\infty e^{-u}\,du}_{1} \,d\theta & \text{(using $u=r^2/2$)} \\
&= c^2 (2\pi).
\end{align*}
Setting this equal to 1, we see that $c^2 = \frac{1}{2\pi}$. In other words, $c = \frac{1}{\sqrt{2\pi}}$.
```
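The polar-coordinate calculation can also be confirmed numerically. Below is a rough sketch using nested calls to R's `integrate`; the `inner` helper is our own construction for iterated integration:

```{r}
# Numerically evaluate the double integral of exp(-(x^2 + y^2)/2) over the plane.
inner <- function(y) {
  # For each y, integrate over x; sapply handles the vector of y values
  # that integrate() passes in.
  sapply(y, function(yy)
    integrate(function(x) exp(-(x^2 + yy^2) / 2), -Inf, Inf)$value)
}
integrate(inner, lower = -Inf, upper = Inf)$value  # approximately 6.283185
2 * pi                                             # the exact value, 2*pi
```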
## Essential Practice {-}
1. Let $X$ and $Y$ be continuous random variables with joint p.d.f.
\[ f(x, y) = \begin{cases}
\frac{1}{12} (1 + \frac{1}{18}(x - 2)(2y - 3)) & 0 < x < 4, 0 < y < 3 \\
0 & \text{otherwise}
\end{cases}. \]
a. Find the marginal p.d.f.s of $X$ and $Y$. Are these named distributions? (Feel free to
set up the integral and use Wolfram Alpha to do the rest.)
b. Are $X$ and $Y$ independent? (_Hint:_ According to Theorem \@ref(thm:joint-pdf-independent),
what would their joint p.d.f. have to be if they were independent?)
2. Harry and Sally independently go to Katz's Deli for lunch on weekdays from their
respective jobs. Let $H$ be the time that Harry arrives and $S$ be the time that
Sally arrives (in minutes after 12 p.m.). $H$ and $S$
are independent $\text{Uniform}(a=0, b=90)$ random variables.
a. What is the joint p.d.f. of $H$ and $S$? Sketch a bird's-eye view of the joint p.d.f.
b. If they each take 20 minutes to eat their lunch after they arrive,
what is the probability that Harry meets Sally...?
(_Hint:_ The two are able to meet if they arrive within 20 minutes of each other.
Sketch the region of times that correspond to Harry and Sally meeting.
You can calculate this probability by geometry, without using integration.)
c. What is the probability that Harry arrives after Sally arrives?
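If you would like to check your answers to problem 2, here is an optional simulation sketch (our own addition, not part of the problem):

```{r}
# Simulate arrival times to check your answers to problem 2.
set.seed(1)
n <- 100000
h <- runif(n, min = 0, max = 90)  # Harry's arrival time
s <- runif(n, min = 0, max = 90)  # Sally's arrival time
mean(abs(h - s) <= 20)  # part b: they arrive within 20 minutes of each other
mean(h > s)             # part c: Harry arrives after Sally
```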