A //**linear operator**// $\hat{A}$ on a vector space is a function that maps vectors to vectors, $|\psi \rangle \rightarrow \hat{A}|\psi\rangle$ such that \[\hat{A} \left ( a |\psi \rangle + b|\phi \rangle \right ) = a\hat{A} |\psi\rangle + b\hat{A} |\phi \rangle .\] Given a linear operator $\hat{A}$ on an inner product space, we can also define an action of a linear operator on a dual vector. The dual vector $\langle \psi | \hat{A}$ is defined to be the unique dual vector such that \[\left ( \langle \psi | \hat{A}\right ) |\phi \rangle = \langle \psi | \left ( \hat{A} |\phi \rangle \right ), \] for all vectors $|\phi \rangle$. Note that, given this definition, an expression like $\langle \psi | \hat{A} | \phi \rangle$ is unambiguous. We will always get the same result regardless of whether we first apply $\hat{A}$ to $|\phi \rangle$ and then take the inner product with $|\psi \rangle$, or first apply the operator $\hat{A}$ to the dual vector $\langle \psi |$ and then apply it to $|\phi \rangle$. ===== Examples of Linear Operators ===== * The //**identity operator**// $\hat{I}$: \[\hat{I} \ket{\psi} = \ket{\psi}, \qquad \text{for all }\ket{\psi}\] * The //**position operator**// $\hat{x}$: We define its action on the position basis as \[\hat{x} \ket{x} = x\ket{x},\] where $\hat{x}$ is an operator on the left hand side and $x$ is a scalar on the right hand side. We can deduce its action on an arbitrary vector $\ket{\psi}$ by writing it in the position basis as \[\ket{\psi} = \int_{-\infty}^{+\infty} \psi(x) \ket{x}\,\D x,\] and then \[\hat{x}\ket{\psi} = \int_{-\infty}^{+\infty} x\psi(x) \ket{x}\,\D x.\] Note, when we are working exclusively in the position basis it is common to abuse notation and write \[\hat{x}\psi(x) = x\psi(x),\] which can be confusing because $\hat{x}$ acts on abstract vectors $\ket{\psi}$ rather than their components $\psi(x)$ in the position basis. However, it causes no ambiguity when only this basis is being used. * The //**derivative operator**// $\hat{\frac{\D}{\D x}}$: We again define its action in the position basis. If $\ket{\psi} = \int_{-\infty}^{+\infty} \psi(x) \ket{x}$ then \[\hat{\frac{\D}{\D x}}\ket{\psi} = \int_{-\infty}^{+\infty} \frac{\D\psi}{\D x} \ket{x}.\] Again, abusing notation, we sometimes write \[\hat{\frac{\D}{\D x}} \left ( \psi (x) \right ) = \frac{\D\psi}{\D x},\] or, even more naughtily, we might forgo the hat on the operator and write \[\frac{\D}{\D x} \left ( \psi (x) \right ) = \frac{\D\psi}{\D x}.\] * The //**linear momentum operator**// $p$: This is defined in the same way in the momentum representation as $x$ is defined in the position representation, i.e. \[\hat{p} \ket{p} = p\ket{p}.\] If write write a general vector in the momentum basis as \[\ket{\psi} = \int_{-\infty}^{+\infty} \phi(p) \ket{p}\,\D p,\] then \[\hat{p}\ket{\psi} = \int_{-\infty}^{+\infty} p\phi(p) \ket{p} \,\D p.\] Once we have understood the connection between the position and momentum bases in more detail, we will prove that \[\hat{p} = -i\hbar \hat{\frac{\D}{\D x}}.\] * The //**parity operator**// $\hat{\mathcal{P}}$: This operator is very useful for understanding the symmetries of one-dimensional quantum systems, and we will be using it in module 4. It is defined by its action on the position basis as \[\hat{\mathcal{P}}\ket{x} = \ket{-x}.\] From this, we can determine its action on the coefficients $\psi(x)$ of a vector $\ket{\psi}$ in the position basis. If $\ket{\psi} = \int_{-\infty}^{+\infty} \psi(x) \ket{x}\,\D x$ then \[\hat{\mathcal{P}}\ket{\psi} = \int_{-\infty}^{+\infty} \psi(x) \ket{-x}\,\D x.\] We can now make the substitution $x'=-x$, which gives $\D x' = -\D x$, and $x' = -\infty$ when $x' = +\infty$, $x'= +\infty$ when $x=-\infty$. This gives \begin{align*} \hat{\mathcal{P}}\ket{\psi} & = - \int_{+\infty}^{-\infty} \psi(-x') \ket{x'}\,\D x' \\ & = \int_{-\infty}^{+\infty} \psi(-x') \ket{x'}\,\D x' \\ & = \int_{-\infty}^{+\infty} \psi(-x) \ket{x}\,\D x, \end{align*} where, the minus sign is removed in the second line because we changed the order of the integration limits and we can replace $x'$ with $x$ in the third line because $x'$ is a dummy variable. The upshot is that the coefficients $\psi(x)$ get transformed to $\psi(-x)$ by the parity operator. As you may have guessed by now, we will sometimes abuse notation and write \[\hat{\mathcal{P}} \psi(x) = \psi(-x).\] * The //**Laplacian operator**// $\hat{\nabla}^{2}$: Although we have not really discussed three-dimensional systems yet, the one-dimensional position basis $\ket{x}$ is generalized to a three-dimensional position basis $\ket{\vec{r}}$ where the basis vectors are now labelled by three-dimensional position vectors $\vec{r}$. In this basis, a general vector can be decomposed as \[\ket{\psi} = \int \psi(\vec{r}) \ket{\vec{r}}\,\D V,\] where the components $\psi(\vec{r})$ are now a scalar function of the position vector and $\D V$ is the three-dimensional volume element $\D V = \D x\D y\D z$. The Laplacian operator then acts as \[\hat{\nabla}^2 \ket{\psi} = \int \left ( \frac{\partial^2 \psi}{\partial x^2} + \frac{\partial^2 \psi}{\partial y^2} + \frac{\partial^2 \psi}{\partial z^2} \right ) \ket{\vec{r}}\D V.\] * //**Outer Products**//: In Dirac notation, an outer product is an operator of the form $\ketbra{\phi}{\psi}$, which acts as \[\left ( \ketbra{\phi}{\psi}\right ) \ket{\chi} = \ket{\phi}\braket{\psi}{\chi}.\] In other words, yo\u take the inner product of $\ket{\psi}$ with whatever the input vector is and output the vector $\ket{\phi}$ scaled by this inner product. The direction of the output vector is always the same as that of phi $\ket{\phi}$, but its length is scaled by the inner product. Note that an inner product $\braket{\phi}{\psi}$ is a scalar, whereas an outer product $\ketbra{\phi}{\psi}$ is an operator. ===== The Dirac Notaty ===== We will now learn the most useful thing to know in Dirac notation. It is so important that I will call it a theorem, even though I am a physicist and I prefer to just ramble on informally most of the time. **Theorem** Let $\ket{e_1}$, $\ket{e_2}$, $\cdots$ be any orthonormal basis. Then, \[\sum_j \proj{e_j} = \hat{I},\] where $\hat{I}$ is the identity operator. Proving this theorem is an in class activity. The reason that this theorem is so useful is that we can insert the identity operator in front of any vector without changing anything, but then decomposing it in a basis as $\hat{I} = \sum_j \proj{e_j}$ allows us to determine the how things are represented in a particular basis. Conversely, if we recognize $\sum_j \proj{e_j}$ somewhere in an expression we can remove it because it is just the identity. Inserting the identity operator, moving terms around, and removing identity operators is such a useful proof technique that I have given it a name: **//The Dirac Notaty//**. It also has a song, which is sung to the tune of the Hokey Cokey (the British version of the Hokey Pokey which has an additional chorus). Here are the lyrics: