MIMO - an Historical Tutorial

Click here to go to our 5G page (new for October 2016)

New for September 2009! This page was contributed by Tom Hogan from LEUS Networks, on a topic he is passionate about. We're not experts on MIMO (multiple input multiple output, a "smart" wireless technology), so we're extremely grateful Tom took the time to mentor us. Many thanks! From here down is Tom's writing.

MIMO, an Historical Tutorial (Part I)

MIMO is a new term to describe the old concept of spatial multiplexing. Many young engineers today ebulliently regale us older folks with how MIMO shatters the Shannon Limit and how it will solve every problem in communications. Of these young engineers I ask one question: "When do you decide to turn MIMO off and concentrate all of your transmit power and receive resources to deliver a single stream of information instead?"
Some engineers will answer a strident "never", while other engineers will think a bit and answer: "When the channel is line of sight, MIMO does not work so it needs to be turned off". Neither answer is correct and begets a historical tutorial on spatial multiplexing.

Spatial multiplexing began in the late 1950s as a way to increase capacity of microwave telephone relay links. The very first form of spatial multiplexing involves the use of cross polarized antennas. In a free space channel, as is approximated by a line-of-sight microwave link, a transmission from a vertically polarized antenna can not be received by a horizontally polarized antenna and a transmission from a horizontally polarized antenna can not be received by a vertically polarized antenna. They are orthogonal the same way the left and right channels of sound in a stereo phonograph recording are orthogonal. If we want to double the capacity of any link we can set up a pair of transmitters, one sending vertically polarized and the other sending horizontally polarized signals over exactly the same spectrum. At the receive end, the vertically polarized antenna only receives the stream sent from the vertically polarized transmitter and the horizontally polarized antenna only receives the stream sent from the horizontally polarized transmitter. (NOTE 1)

Murphy's law tells us that in any ointment there will always be a fly. The fly in this ointment is that in order to double capacity, we not only split the information into two streams but we also doubled our transmit power. If our original transmitter was already operating at the regulatory limit, Big Brother is not going to permit us to double the transmit power even if we split the power between two polarizations or even two separate antennas. We are faced with having to send each of the two new streams at ½ the power of the original stream. To answer the question "When is this worth the bother?" we have to gauge the full benefit of all of the extra equipment if it was employed to maximum effect for the conveyance of a single stream of information. In this case we have two antennas at the transmitter and two at the receiver, and we can use the pair at the receiver in a phased array to gain 3 dB. At the transmit side we are still limited to a single antenna to stay within the ERP regulatory limit. The net effect is a 6 dB increase, 3 dB for antenna gain 3 dB for using the full transmit power in the single stream. 6 dB of course is a net increase of a factor of four.

To get a Kentucky windage estimate on any AM modulation system where the SNR (signal to noise ratio) rises and falls linearly with transmit power (FM systems do not necessarily comply with this rule), the capacity difference from sending a single stream versus sending two streams can be evaluated using Shannon's famous equation:

Capacity = Bandwidth x log2(1 + Signal/Noise)

If the SNR of the original stream is better than 8:1 or 9 dB then it is worth splitting the signal into two streams each with an SNR of 4:1 or 6 dB. If the original SNR is worse than 9 dB then it is better to concentrate all of the transmit power and receive resources into a single stream.

At this point in the tutorial, I usually have won over a few nodding heads, but there are frequently holdouts who want an example of "true" spatial multiplexing rather than polarization multiplexing.

As coincidence would have it, spatial multiplexing without polarization multiplexing was also tried as a way to increase the capacity of microwave links.

Let's take the example of a 4 GHz link with a separation of 20 miles between 200 foot towers. On top of each tower is a vertically polarized antenna. 114 ft down from the top of each tower is a second vertically polarized antenna.

MIMO - an Historical Tutorial
Figure 1

Analyzing this drawing using just the Pythagorean theorem will show that the path from the top transmit antenna (TTA) to the top receive antenna (TRA) is ¼ wavelength shorter than the path from TTA to the bottom receive antenna (BRA). Vice versa the path from the bottom transmit antenna (BTA) to the BRA is ¼ wavelength shorter than the path from BTA to the TRA . If we split the signal from each receive antennas into 2 paths, delay the signal from TRA by ¼ wavelength and sum with the signal from BRA, we null the signal from the BTA and maximize the signal from the TTA. Vice versa, if we delay the signal from BRA ¼ wavelength and sum with the signal from TRA, we null the signal from the TTA and maximize the signal from the BTA.

The situation above is the quintessential optimal MIMO alignment where the antennas are fully decorrelated and the peak of the desired signal occurs at the exact same point as the null of the undesired signal. If an LNA is used at each receive antenna to take the losses from the splitter/combiners out of the noise figure calculations the only penalty from dividing the transmit power of a single stream into two streams is the 3 dB loss in transmit power itself. Though the benefit of splitting into two streams decrease with decreasing SNR, there is no SNR value where there is an advantage to using a single stream.

Using only four two-way splitter/combiners and two ¼ wavelength pieces of coax, spatial multiplexing was achieved. The effect of this multiplexing was exactly the same as doubling the available bandwidth. At the time, it seemed like a license to print money -until the day it rained.

When adding signals, nulls and peaks behave differently. In Figure 1 (above) we could reduce the separation between the antennas at both sides of the link from 114 feet to 57 feet, lengthen the coaxial delay cables to 3/8 wavelength each, still achieve a null and only suffer a 3 dB reduction in the amplitude of the desired signal. A deep null is another story altogether. A deep null requires precise amplitude and phase matching. A trombone line in the coax delay lines was needed to calibrate the phase and variable attenuators were needed to calibrate the gain differences in the antennas. Even a 1 dB mismatch between the antennas meant you were only able to null ¾ of the power of the cross talking stream leaving you with a SINR (signal to interference noise ratio) of 6 dB. To maintain a usefully deep null required matching on the order of 0.1 dB, which proved impossible with "set and forget" hardware when rain was dripping down the radome. A pilot tone and an automated trombone line and variable attenuator were needed to track and hold the null.

At the time of this testing, the lust for capacity was driven by the need to convey video over these microwave links. FM transmission of video required at best slightly over twice the bandwidth of AM-VSB (amplitude modulation, vestigial side band) and in practice more like four times the bandwidth. If the goal was to maximize the available spectrum, part of the requirement was to transmit AM-VSB video over these links to conserve bandwidth and employ spatial multiplexing as well.

Murphy again made his existence known when a funny thing was observed even when the nulls were perfectly calibrated. Calibration was done with just the interfering transmitter active sending just a CW signal and the trombone line and attenuator were adjusted to achieve a deep null. When unique TV signals were sent on both transmitters, what you saw on each picture tube was black lines at the beginning and end of the sync pulses from the interfering video stream sweeping across the picture of the stream you wanted to maximize. The appearance of the crosstalk was similar to what you might have seen up to a few months ago on a cable system where a strong local analog TV station shared the same channel and the cable shield was imperfect. When you turned off one of the transmitters and looked at the nulled station all by itself, the signal on the oscilloscope looked like a differentiated video signal. Every fast powerful transition from light to dark and vise versa looked like a ringing sawtooth. There was no setting of the trombone line or the attenuators that would null this ringing out. After some experimentation, it was determined that the ringing was due to multipath from the ground clutter between the links. Whenever the transmitter made a step change, this resulted in a chaotic change in amplitude and phase as all of the old multipath rays died out and were replaced by rays at the new amplitude. As each new multipath ray arrived, the nulling trombone line and attenuator would need to be adjusted to a new null. It was hopelessly complex to equalize the null quickly and accurately enough to maintain the null while the signal was changing. The ideal modulation format would have been a digital one with a slow symbol rate and a guard interval at the boundary between symbols where the receiver would simply ignore what was being received until all of the multipath chaos settled out. Such a modulation scheme, namely DMT (discrete multi-tone), or more bombastically, OFDM (orthogonal frequency-division multiplexing), was decades in the future so spatial multiplexing just sat on the shelf gathering dust with the other white elephants until it was recently rediscovered.

NOTE 1: In practice, polarization multiplexing often used circular polarization in microwave links to eliminate odd order multipath. This fact is omitted from the tutorial for clarity. Polarization multiplexing is still employed today to maximize the spectrum available for "Turdbird" TV (a.k.a. Rupert Murdoch's satellite television company).

MIMO, an Historical Tutorial (Part II)

At this point in the tutorial, some of the engineers who are able to think without a computer realize that MIMO is just the summation of two vectors with the phase and amplitude optimized to null out one of the streams.

Some engineers at this point apparently uncomfortable that the emperor is sporting perhaps only a fig leaf, want to know what all this 50 year old work has to do with modern OFDM systems operating in a non line of sight channel.

As mentioned at the end of the previous part of this tutorial, the ideal modulation format for spatial multiplexing is DMT (OFDM by any other name). DMT uses a guard interval so that the receiver can ignore the transitions between symbols when multipath rays from the old symbol are being replaced with multipath rays from the new symbol. During this chaotic changeover time between symbols, the DMT receiver is simply ignoring what is happening. At the end of the guard interval, the receiver evaluates the signal using a FFT (fast Fourier transform) and converts each tone (sub-carrier) into a vector representing amplitude and phase. Each subcarrier, taken on its own, looks just like the signal from one of the receive antennas in Figure 1 of the first part of the tutorial. If the modern receiver has 2 antennas and 2 receive chains then each receive chain outputs a vector for each subcarrier. Instead of using trombone lines and attenuators as was done in prehistoric times, the modern receiver simply phase shifts and amplitude scales the vectors in DSP (digital signal processing) to achieve the desired null.

The steps we use to train a modern system for MIMO may make this clearer: training is performed on each subcarrier taken as an individual.

MIMO - an Historical Tutorial
Figure 2

Figure 2 above shows transmit antenna A and transmit antenna B as well as receive antenna 1 and receive antenna 2. Transmit antennas A and B are located on the same tower and receive antennas 1 and 2 are located on the same handheld.

Step 1. Turn on transmitter A, wait until the end of the guard interval and observe the amplitude and phase of stream A received on antennas 1 and 2.

Step 2. Choose the receive antenna where signal A is the quietest and subtract from this an attenuated and phase shifted signal from the louder antenna adjusted to cancel signal A. This subtraction will yield stream B.

Step 3. Turn off transmitter A and turn on transmitter B, wait until the end of the guard interval observe the amplitude and phase of stream B received on antennas 1 and 2.

Step 4. Choose the receive antenna where signal B is the quietest and subtract from this an attenuated and phase shifted signal from the louder antenna adjusted to cancel signal B. This subtraction will yield stream A.

Step 5. Training complete, switch on the two independent streams

From the training description, it should be apparent that as opposed to the line of sight example from pre-history, in a rich multipath environment there may be an amplitude difference between the streams as seen at the receive antennas. It should also be noted that unlike maximum ratio combining there is no telling what the phase of the vectors of the desired signal will be as this phase shift before the subtraction is dictated by the requirements of the null. Sometimes the summation will be beneficial other times it will be detrimental. As for when it is beneficial and when it is not, during any training period is it possible for the receiver to determine whether it is advantageous to use MIMO or whether it is better to use all of the transmit power and receive resources to convey a single stream. As for how often the choice swings towards MIMO, there are many papers written on this subject. The trends revealed in these papers are that when the antennas at each end of the link have greater separation, this is beneficial to MIMO. When the transmitter and the receiver are closer together, this is beneficial to MIMO. When the multipath arrives from a wide range of angles, this is beneficial to MIMO.

So far we have discussed the most common form of MIMO, where the processing to create the null is performed at the receiver. It is also possible to do exactly the same thing at the transmitter. Going back to Figure 1 in the first part of this tutorial, it is possible to put the ¼ wave delay lines and the trombone lines at the transmitter side of the link. An observer at the receive end of the link could simply radio back (by walkie-talkie) instructions to a colleague at the transmit end to adjust the trombone lines and attenuators to null the signal in such a way that stream 1 would appear on the top receive antenna (TRA) and stream 2 would appear on the bottom receive antenna (BRA). This sort of MIMO processing at the transmitter has been dubbed SDMA (spatial division multi-access) in the modern world.

Another form of MIMO processing at the transmitter that is of particular interest these days is MU-MIMO (multiple user MIMO). MU-MIMO is just MIMO where the transmit and the receive antennas are all located in different places. Referring to Figure 3 below: the transmit antennas are located on different towers and the receive antennas are each in a unique handheld. For our discussion we will refer to this technique as "a taste of the hair of the dog that bit you". An example will help understand how this technique works. Refer to Figure 3, this time transmitter antenna 1 (TA1) and transmitter antenna 2 (TA2) are on separate towers and receive antennas 1 (RA1) and receive antenna 2 (RA2) are on separate handhelds. Assume we have a high bandwidth backhaul connection to each tower and both towers know exactly what message the other is going to send. Our training steps are as follows:

Step 1. Turn on transmitter A, wait until the end of the guard interval and observe the amplitude and phase of stream A received on the antennas of both handheld 1 and 2.

Step 2. Turn off transmitter A and turn on transmitter B, wait until the end of the guard interval and observe the amplitude and phase of stream B received on the antennas of both handheld 1 and 2.

Step 3. Handheld 1 and 2 convey the amplitude and phase information they have observed from both transmitters back to transmitters A and B

Step 4. Transmitter A chooses the receiver it delivers the strongest signal to. Transmitter B chooses the receiver it delivers the strongest signal to.

Step 5. Transmitter A sends the message intended for its chosen receiver and adds to it a little of the message transmitter B is going to send to the other receive. Transmitter A adjusts the amplitude and phase of transmitter B's message to just cancel what would be received as interference on its chosen receiver from Transmitter B.

Step 6. Transmitter B sends the message intended for its chosen receiver and adds to it a little of the message transmitter A is going to send to its chosen receiver. Transmitter B adjusts the amplitude and phase of transmitter A's message to just cancel what would be received as interference on its chosen receiver from Transmitter A.

Using a training sequence like this the channel state information is conveyed to the transmitters. This information not only can be used for MIMO processing it can also be used to choose the optimal subcarriers to communicate with a particular receiver. It could even be used to decide how much information each subcarrier can convey and pack more information into some subcarriers than others.

MIMO - an Historical Tutorial
Figure 3

If we were deploying wireless networks on the dark side of the moon where nothing moves there would very little more to say about MIMO. Unfortunately here on earth, things move. The worst case is when the mobile station itself moves.
Lets consider the situation shown in the Figure 4 below:

MIMO - an Historical Tutorial
Figure 4

A car is traveling from point A towards point B at 60 miles per hour. Transmitters are located at point A and point B and have established a null over the car. Transmitter A is sending a 1000 byte packet at 8 Mbps. The packet takes 1 ms from start to finish to send. During the 1 ms packet duration the car moves about an inch. At 1 GHz, the wavelength is 1 foot and in the example above, the distance between a null and a peak in the radiation pattern is about 3 inches. The movement of an inch takes us out of the null and 1/3 of the way towards the peak. Remember that the peak is twice as loud as the original interference because we are actively transmitting a cancellation signal. A rotation of 1/3 of the way from null to peak is the point where the sum of the original interference and the cancellation signal is the same amplitude as if we had done no cancellation at all. The channel state information we learned during the training period at the beginning of the packet is hopelessly stale by the end of the packet. The rate at which the training information gets stale increases linearly with increasing carrier frequency, if we had chosen 3 GHz as the carrier frequency in this example the motion of the car would have taken us all the way from the null at the start of the packet to the peak by the end of the packet.

To mitigate this staleness problem we use a series of educated guesses, generally referred to as channel estimation. The first educated guess is that during the training period, we evaluate the Doppler shift of the incoming signals. From the Doppler shift we can get a sense of the speed and direction of the mobile station. This information can be used to predict what the channel will do next so long as there is no accelerating, decelerating or change of direction of the mobile station. Pilot subcarriers are also used for channel estimation. Pilot subcarriers convey no end data but instead convey a sequence of predetermined data known to the receiver. By observing how the pilot tones change in amplitude and phase from their expected values it is possible to calculate a correction factor between their observed amplitude and phase and their expected values. By applying this same correction factor to subcarriers in nearby spectrum, it is possible to compensate for changes in the channel as things physically move. This compensation is never exact and the approximation gets worse the farther in frequency the compensated subcarrier is from the pilot subcarrier.

Also, when the MIMO processing is done at the transmitter this channel estimation information must be transferred from the receiver to the transmitter and this transfer must be as timely as possible. Even with accurate training and the best channel estimation possible there is a performance overhead to performing all of the communication needed for this compensation and this overhead increases directly with the speed at which the channel is changing which itself increases linearly with increasing carrier frequency. Even with every optimization, there will be a carrier frequency/speed of channel change above which MIMO, particularly MIMO processed at the transmitter, where the overhead becomes excessive.

MIMO, an Historical Tutorial (Part III)

No discussion of history is complete without a respectful attempt to conceive of what will occur next.

Just as the lust for capacity was driven by the need to convey video 50 years ago, the same is true today. 4G business plans assume that customers will be able to enjoy the same YouTube experience connected to the 4G public networks as they have at home connected to their private WiFi network. To gauge the magnitude of this challenge, consider that MediaFlo , transmitting an eye popping 50,000 watts of ERP, using 5 MHz of spectrum in the vaunted 700 MHz band is only able to deliver 20 unique streams of video. Consider too that in a big city, like New York, the population density can be upwards of 40,000 people per square mile. Couple these two realities and it becomes conceivable that WiMax and LTE may swell the list of technology disappointments including G.lite, HomePNA 1.0. Broadband over Powerlines, WiMedia and Metro WiFi.

Success for 4G lies not just in optimizing technology but more importantly in understanding how the customers will use the service. In large cities, most of the customers are not in the streets and most of those who are have their visual centers otherwise engaged and will not be downloading video. The customers for video services are inside of the buildings and stationary for the most part.

Using the working intuitive understanding of MIMO developed in the first two sections of the tutorial and realizing that MIMO is simply the summation of vectors with the amplitude and phase adjusted to null out interfering streams, we can imagine the technology needed to serve this market.

Consider the classic three sector cellular reuse pattern shown in Figure 5 below. The available spectrum is broken up into three equal sized chunks: orange, blue and green. Imagine an indoor low power base station or Picocell by any other name, located at the center of one of the orange hexagons. It does not take an excess of creativity to appreciate that this Picocell can operate on the blue and green spectrum so long as it employs low enough power to not interfere.

MIMO - an Historical Tutorial

Figure 5

A much more magical solution can be imagined when considering how this Picocell can optimize the orange spectrum. So long as the Picocell shares a reasonably accurate common notion of time with the tower and has a high bandwidth backhaul so that it has a copy of the packet the tower intends to send, the Picocell can send the same packet as the tower, time aligned so that the guard intervals are reasonable well overlapped. In this way Picocells can augment the coverage of the towers without interference. To a receiver, the same packet sent from multiple transmitters simply appears as a packet sent from a single source that has strong multipath component. Picocells located anywhere within the coverage hexagon can work together to reinforce the signal from the towers. This augmentation is not MIMO per-se but rather an additional exploitation of the OFDM guard interval.

Within any building there will likely be multiple (perhaps many) Picocells. For Picocells located in the red hexagon away from the coverage fringe, the green and blue spectrum can be reused at low power. If all of the Picocells are coordinated, MU-MIMO can be used to share this spectrum without interference. MU-MIMO is more suited to indoor channels than outdoor ones because people move slower than cars Since the rate of change in channel characteristics is directly proportional to the speed of the objects moving around within the channel and also to the carrier frequency, operation around 700 MHz has clear advantages compared to operating in higher frequencies bands. In most cases, the channel state learned during the training interval will remain adequately fresh throughout the packet to obviate the need to send updates to the transmitter during the packet.

If the building containing the Picocell is located towards the coverage fringe where adjacent cell site's coverage patterns overlap, the Picocell is able to reinforce the coverage from multiple towers but not reuse the frequencies to convey independent data. Fortunately, only a small area exists where there is fringe overlap of all three chunks of spectrum.

MIMO, an Historical Tutorial (Conclusion)

Hopefully, through our understanding of the history of spatial multiplexing we can more clearly appreciate how the various flavors of MIMO can contribute to success in wireless networking. Where the first commercial deployments of MIMO in WLANs involved MIMO processed at the receiver, the variations of MIMO where the processing occurs at the transmitter show great potential to permit 4G networks to deliver a service that lives up to customers' expectations.

Should any questions arise from the reading of this article please contact me. I am always happy to discuss the history, the possibilities and the limitations of MIMO.

Author : Tom Hogan