A bunch of people have been asking me about the interpretation of QM. Now, every interpretation of QM predicts (or claims to predict) the same experimental results in any experiment (or at least, any realistically feasible experiment). Otherwise they wouldn’t be rival interpretations, they would be rival theories, and we would just do an experiment to see who is right.
So before discussing what QM actually means, it’s good to get the ground rules down—the ones that all physicists agree are the right ones to use to predict the results of actual experiments.
Let’s suppose we’re doing a physics experiment, which I am going to describe in an extremely abstract way, because that’s the kind of person I am. A (somewhat idealized) way to describe a certain class of experiments is as follows: We start out by preparing the initial configuration of the apparatus to be in some particular configuration (or “state”), let’s call it A. For example, we start with a radioactive atom. We can let the experimental apparatus evolve on its own, isolated from the rest of the world, until it reaches some final configuration. Then we look inside, e.g. 10 minutes later, and check what the current state of the experiment is. Perhaps the atom has now decayed into something else. Let’s call this final configuration B.
Several aspects of this description are clearly idealizations. There are always some limitations in our ability to control and/or know the initial condition A; the system is never going to be completely isolated from the rest of the world no matter how hard I try. And in some experimental setups this may be a good thing—that is, we may want to deliberately reach in to measure and/or adjust the system, part way through its “time evolution”. (Unlike biologists, we physicists use the word evolution any time anything changes!) And, at the end of the process, we’re never going to be able to measure the final outcome with perfect precision either.
But I’m a theorist so I can ignore the messiness of real life, whenever it pleases me to do so.
Now if the laws of physics were deterministic (and if we know what they are, and we know the initial state completely precisely…) then in principle we could simply solve all of the relevant equations and find out what exactly the final state would be. So after 10 minutes, any given A will become some particular B with probability 1.
(In practice, this calculation is often impossible because of phenomena like chaos where (in some systems, not others) the final outcome depends very, very sensitively on the initial conditions. For chaotic systems, you need to know the initial conditions to exponential precision in order to predict the future. This is why we can’t predict the weather accurately for more than about a few days out, because the number of digits accuracy you’d need to measure things to is proportional to the number of days!)
But the actual laws of physics are stranger than that. They are not deterministic. I think I’ve read that some Philosophers of Science claimed that Determinism was an important assumption underlying the possibility of doing Science at all. Well, Determinism is false, and yet we scientists still have jobs. I know, 20-20 hindsight, but it was still a dumb thing to say (if anyone in fact ever said it, which I haven’t done the research to confirm…).
So let’s try again. Once again, we’ll set up our experiment in a particular initial state A. But now, there are several possible final states B, B’, B” etc. Let’s suppose we want to calculate the probability for some specific one: B. So the sane, sensible way of doing this, would be to think of all the different ways that A could evolve in time to become B. To actually do calculations, you apply the rules of probability theory:
NORMAL PROBABILITY THEORY:
- For any particular process by which A can evolve to B (a history), we survey all the events which happened in that history, and calculate the probability of each individual event (using our knowledge of the laws of physics, as worked out from experiment or theory) Then we multiply those probabilities to calculate the probability of that particular history.
. - If there is more than one distinct history going from A to B, then we add up the probabilities of each history (since each of them are separate possible ways to get B), to get the total probability of observing B.
. - At the end of the day, the probabilities for all possible final outcomes should add up to 1.
Note that, since probabilities are between 0 and 1, multiplying them makes them smaller, as befits situations where multiple unlikely things need to happen to get from A to B. On the other hand, adding them makes them bigger, as makes sense if there’s more than one way for something to happen.
This is the sort of probability theory which would make sense a priori to our rational minds. The kind from which you can prove sensible results like Bayes’ theorem. But the universe doesn’t really work that way either!
One way to think about QM is that it’s like a Behind the Looking Glass version of probability theory, where things almost work like how you expect them to, but not quite. The basic weird idea of quantum mechanics that instead of assigning each path from A to B a probability (which is a real number between 0 and 1) you assign to each path an amplitude (which is a complex number whose absolute value is less than or equal to 1).
A complex number can be thought of as just a vector lying in a two-dimensional plane. In order to specify it, you need to know how long it is (the “absolute value” of the complex number) and what direction it points in (the “phase” of the complex number). Of course, if the absolute value is zero, then the phase is meaningless, since the complex number is just 0.
In QM, the absolute value squared of an amplitude represents the probability for an event to happen. This is called the Born rule, and it is the necessary interface for getting actual predictions about the world out of the theory.
So let’s suppose you have two different possible ways to go from A to B. (A classic example is the double slit experiment, where a particle passes through a screen which has two holes in it, and then reaches one of several possible locations on the detector.)
If the two possible histories have the same phase, then they constructively interfere and the probability of B happening is more than you would expect, from adding up the probabilities of the two histories. On the other hand, if the two possible histories have opposite phases, then they destructively interfere, and the final probability is less than you would expect. In fact, if the amplitudes are equal and opposite, then the total probability of getting to B is exactly 0!
(More generally, amplitudes constructively interfere if they are at an acute angle in the complex plane, and destructively interfere if they are at an obtuse angle. For right angles, the Pythagorean Theorem + the Born Rule tells you that you get the naive expected answer from just adding up the probabilities.)
So, to summarize, instead of doing the thing that makes sense, you do this instead:
QUANTUM PROBABILITY THEORY:
- For any particular process by which A can evolve to B (a history), we survey all the events which happened in that history, and calculate the amplitude for each individual event (using our knowledge of the laws of physics, as worked out from experiment or theory.) Then we multiply those amplitudes to calculate the probability of that particular history.
. - If there is more than one history going from A to B, then we add up the amplitudes of each history (since each of them are separate possible ways to get B), to get the total probability of ending up at B.
. - The probability of observing B is given by taking the absolute value squared of the total amplitude. Unlike amplitudes, this is always a real number between 0 and 1. Also, the laws of physics are chosen so that, at the end of the day, the probabilities of all possible final outcomes still add up to 1. (This requirement is called unitarity). QM may be weird, but it’s not that weird.
(You may wish to go back and compare this, point by point, with the Not-Batshit-Crazy-Probability-Theory earlier in the post.)
So, if you have a system with N different initial states (and therefore N possible final states), you can specify the time evolution over any given time $$t$$ by writing all of the possible transition amplitudes from each possible initial state A, A’, A”… to B, B’, B”… in an N x N matrix $$U(t)$$, with complex numbers in each slot. If you know about the math of matrices, this matrix is required to be unitary: $$UU^\dagger = U^\dagger U = I$$. That’s what enforces unitarity, the rule that probabilities add to 1 no matter which state you start with.
Now, if you wanted to know which specific states are allowed, or which specific unitary matrix to use, then you need to specify a particular quantum mechanical theory, e.g. a harmonic oscillator, or Quantum Electrodynamics., or the Standard Model. QM is a framework for constructing theories, not a specific theory. Just like Newton’s Law $$F = ma$$, or the rules of classical physics, are a general framework; only experiments can tell you which particular forces actually exist in Nature.
In the next post of the series, I’ll spell out some of the implications of this framework, and then maybe I’ll be in a position to talk about interpretation.
14 Responses to Quantum Mechanics I: Interference