Digital Collections

Nanometers and Gigabucks

  • Gordon E. Moore, Intel Corporation

  • 1996-Jan-26

These captions and transcript were generated by a computer and may contain errors. If there are significant errors that should be corrected, please let us know by emailing digital@sciencehistory.org.

Transcript

00:00:30 Today, I want to talk about a very important industrial technology of the last half of

00:00:55 the 20th century, the technology for making integrated circuits. These are the backbone

00:01:02 of modern electronics. They're what have brought you low-cost computers and a variety

00:01:08 of electronic controls spreading to almost all parts of industrial activity and in everything

00:01:16 around the home. This technology has been very powerful. It has allowed us to reduce

00:01:24 the cost of electronics dramatically over the last few decades. In fact, today you

00:01:30 can buy an integrated circuit containing millions of transistors for the price of a single transistor

00:01:36 when I first got in the business in the late 50s. I know of no other technology that has

00:01:41 made million-fold reductions in cost. This is one of the things that has made it so powerful.

00:01:48 In fact, to use an analogy, one I've used quite often, if the auto industry had progressed

00:01:55 at the rate of the semiconductor industry as a result of this technology over the last

00:02:01 35 years, today you could buy a car that would travel comfortably at several million miles

00:02:07 an hour, get a million miles per gallon of gasoline, and would be more economical to

00:02:14 throw away your Rolls-Royce and buy a new one and to park it downtown for the evening.

00:02:19 I thought that was pretty good until once somebody in the audience pointed out to me

00:02:22 it would only be about an inch tall and two inches long. It wouldn't be much good for

00:02:26 your morning commute. Recently, this analogy was updated by Professor Jeffrey Rayport of

00:02:33 Harvard and applied to the aircraft industry instead. By his calculations, a Boeing 767

00:02:42 would cost about $500 and would circle the earth in 20 minutes using five gallons of

00:02:50 fuel. Clearly, there's something unique about this semiconductor technology. To me, what

00:02:57 we're doing is exploiting a violation of Murphy's Law. By making things smaller, everything

00:03:05 gets better simultaneously. The circuits become higher performance because the transistors,

00:03:12 as they become smaller, will operate at higher frequencies. The power consumption drops considerably.

00:03:21 Systems become much more reliable, but most importantly, the cost of doing things electronically

00:03:27 drops dramatically. Let me show you a couple of the early devices that were really the

00:03:35 beginning of this technology. First, what I consider to be the beginning, the planar

00:03:41 transistor. This picture shows a top view of a single transistor structure that covered

00:03:50 a silicon wafer in those days about three-quarters of an inch in diameter. There were tens of

00:03:59 these identical transistor structures on the silicon wafer. Beyond this stage, they got

00:04:08 cut up into individual transistor structures, wires attached to the two aluminum electrodes

00:04:14 that you can see here, the one in the center and the circle of light-colored material with

00:04:21 the little blob on the corner of it. The third contact of the transistor being the bottom

00:04:27 of the piece of silicon where it was soldered onto the package material. I have a fond recollection

00:04:35 of this particular device. It's probably the only product I ever designed personally that

00:04:39 went into production. The peculiar dimension, 764 microns, reflects merely the fact we used

00:04:48 to work in the English system and this was 30 thousandths of an inch. This is the conversion

00:04:53 from English to metric that gives the unusual number. The next slide shows the first planar

00:05:01 integrated circuit made at Fairchild Semiconductor in 1961. I'm sorry that the picture that's

00:05:11 been preserved with such a bad example, clearly this one is quite dirty and the pads around

00:05:18 the edge to which the connections were to be made are fairly badly damaged in the process

00:05:25 of etching the round piece of silicon so it would fit in the package, the kind we used

00:05:31 at the time. These were the first two examples, the basic technology that I'm going to be

00:05:39 describing. The planar transistor gave us a structure where we could interconnect devices

00:05:47 by running metal films up over the sensitive parts of the transistors called junctions

00:05:54 and could connect one to the other. The idea of the integrated circuit was developed initially

00:06:02 I guess at Texas Instruments where Jack Kilby built an integrated circuit and then at Fairchild

00:06:08 where Bob Noyce had the availability of these planar transistors. He saw how to extend that

00:06:14 technology to make a practical integrated circuit and that has been the basic technology

00:06:19 that has been evolving ever since. In the early 1960s, I was asked to write an article for the

00:06:28 35th anniversary of Electronics Magazine in which I was supposed to predict the course of

00:06:34 semiconductor components over the next 10 years. In that, I took the existing integrated circuits,

00:06:43 there were only relatively few at that time, looked at how the complexity had been increasing

00:06:48 and thought that was a trend that was going to continue and I drew the curve that is shown in

00:06:55 the slide. This extrapolated from the most complex circuit with something like 60 transistors and

00:07:04 resistors in it to something that would be a thousand times as complex by 1975. This thousand

00:07:13 fold extrapolation corresponding to a doubling of the number of components in an integrated

00:07:19 circuit every year was what got termed Moore's Law at some later date. Moore's Law has been

00:07:26 applied to a lot of other things and I'm happy to take credit for all of them, but this was

00:07:31 really the origin of that idea. Well, the next slide shows a sprinkling of some of the more

00:07:42 complex devices that were built during that decade and as you can see, they track the line

00:07:46 really quite closely, certainly much better than I could ever have expected them to do when I

00:07:52 originally published the curve. The most complex device was within a factor of two of the thousand

00:08:01 fold increase in complexity that I had predicted. Clearly, the technology was extending quite

00:08:07 rapidly and had a long way to go. Now, it is useful to look at this and to say what changed

00:08:17 in the technology that allowed this huge increase in complexity and of course, with the increase

00:08:22 in complexity, the decrease in the cost per electronic function since we could sell an

00:08:30 integrated circuit for not much more than the price of the individual transistor. This curve

00:08:36 can be resolved into a few different contributions. As you see on the next slide, during this time

00:08:43 period, the size piece of silicon, the die size as we call it today, that is the size of the little

00:08:51 square into which we cut the round wafers, grew somewhat in area. It also grew exponentially but

00:08:59 was nowhere near the thousand fold growth that we saw for the complexity of the device. So,

00:09:07 another something else had to be happening. We had to be increasing the density of electronic

00:09:12 circuitry on the silicon wafer. That is, we had to be able to pack more transistors and resistors in

00:09:20 the same amount of real estate and indeed, that's what was happening. Again, as a result of more

00:09:27 than one factor. The next slide shows what happened just because of the process density. As

00:09:36 I mentioned earlier, this technology makes finer and finer features to get all of the benefits

00:09:42 that we've described. This shows the result of making finer features in increasing the density

00:09:50 of the components on the wafer. It goes obviously is the square of the linear dimension. Since the

00:09:57 line width and spacings weren't always the same, we've used the average rather than the individual

00:10:04 numbers on the ordinate. The abscissa showing time and again, if you neglect the first point,

00:10:12 which was the first planar transistor, it falls quite well on an exponential curve and that first

00:10:18 point can legitimately be neglected because on the first planar transistor we had a lot of other

00:10:24 problems to worry about. Working at the minimum dimension was not one of the things we considered

00:10:30 important. Now, we have two things now. We have the increasing size of the die and we have the

00:10:38 decreasing size of the features on the die. We can compare those two contributions to the total

00:10:46 of what happened during this time period. As you see on this curve where I've plotted first of all

00:10:52 each of the two contributions separately, that is that due to the increasing die size and that due

00:10:59 to the reduction in dimensions. You can take those two curves and add them or since this is logarithmic

00:11:06 logarithmic scale, you're really multiplying those two effects. We get this intermediate line,

00:11:11 the product of die size and dimensions. You see while it accounts for something like two-thirds

00:11:19 of the increase in the complexity of the circuits, there's still a significant chunk left which I've

00:11:26 ascribed here to device and circuit cleverness. That is people learn to design circuits that packed

00:11:34 more closely on the piece of silicon. It was room that was left for invention. Now, I had done this

00:11:46 in 1975 and looking forward one thing was of concern. While we could still see increasing

00:11:55 the density by making smaller dimensions and increasing the die size, the room for this

00:12:01 cleverness had really disappeared because the last point plotted for 1975 was a particular kind

00:12:09 of a device that packed transistors as closely together as they would fit geometrically on the

00:12:14 wafer. There was no more room for cleverness. So, at that time we predicted that the slope would

00:12:21 change and we would no longer be doubling every year but it's something somewhat longer than that

00:12:27 corresponding only to the other two factors that we had available. Still though, this is a very

00:12:33 rapid rate of progress in a technology and allows the complexity of the circuits to double every

00:12:41 year and a half rather than every year, slowing down the rate of progress somewhat but still

00:12:47 maintaining something that is unprecedented with any other industrial technology that I can find.

00:12:53 What's happened since? The next slide shows the die size history up until the present time for

00:13:07 both microprocessors and for the large volume dynamic random access memories that are prevalent

00:13:15 in most computing systems today. This continues on the same slope as the line from the early 60s

00:13:24 through 1975, suggesting that nothing has really changed in our ability to make larger and larger

00:13:31 silicon die economically. This trend has continued. Similarly, on the next slide we see what has

00:13:40 happened on the increase in density due to line widths. Here again, the slope has continued the

00:13:49 same as it was previously with the average line width decreasing about a factor of two every six

00:13:57 years. The same trend that we've been on since 1960. So, the increase in circuit density should

00:14:09 proceed as it did previously with only the factor for cleverness removed. The next slide shows what

00:14:19 I calculated for a 1975 update of Moore's law. Unfortunately, it's not what I published because

00:14:26 I thought things were going to carry on a little further with a contribution from cleverness. There

00:14:34 was a good technical reason why it didn't happen. The devices I was looking at turned out not to be

00:14:39 practical, having some fairly fundamental problems. But the calculation shows that the line is very

00:14:46 close to what was actually achieved over this period. Tracking with the most complex devices,

00:14:53 the DRAMs, tracking very closely the calculated curves. Now, the question looking forward is can

00:15:05 this be expected to continue? What has to be done for it to continue? And let's look at a little bit

00:15:12 of those things. First of all, for it to continue in this line, we have to obviously be able to

00:15:20 continue to make larger and larger dice and to continue to reduce our feature sizes. But we get

00:15:27 to the point where there's several other considerations that we also have to consider as we

00:15:33 look at the evolution of the technology. For example, in looking at microprocessor technology,

00:15:41 here's a list of a few things that has to be taken into account as well as just the sheer

00:15:48 complexity. Performance is one of the major drivers for pursuing new processors and performance

00:15:58 requires that several things be considered. For example, the transistors themselves have to be

00:16:05 made faster and faster, which happens fairly naturally as we go to smaller dimensions. But

00:16:10 increasingly, we have to worry about the interconnection structures, how their performance

00:16:16 affects the performance of the total circuit. And of course, how close we can get things together, the

00:16:21 density of things on the circuit itself. Reliability is extremely important. We have to be sure we

00:16:28 don't get to a region where our lines are so narrow or layers so thin that we have problems with the

00:16:36 basic integrity of the structure. And we have to worry about high volume manufacture. This has

00:16:45 become a very important industry. Products are made in millions, 10s of millions, and even

00:16:50 hundreds of millions for certain kinds of devices every year. And the economical and repeatable

00:16:57 manufacture is something that is always a major concern, and one that is increasingly a

00:17:02 consideration. Let's look at a few of these in somewhat more detail. First, let's take the

00:17:10 minimum feature size, the thing that's necessary to keep us on this curve, continuing to violate

00:17:16 Murphy's Law. This shows Intel's minimum feature size over an extended period of time, mostly on

00:17:26 processor rather than on memory devices. And it shows the roadmap that was laid out by the

00:17:34 Semiconductor Industry Association, suggesting how this is going to continue to go over the next

00:17:39 several years. As you can see, the slope we've been on tends to continue. In fact, it's

00:17:48 extrapolated at least into the early part of the next century. This corresponds to the halving of

00:17:57 the minimum dimension every six years. And since technology generations, that generally takes two

00:18:06 technology generations to make such a halving of the minimum dimension, the steps we show here as

00:18:13 points correspond to something that is about the square root of that. So every one of these steps

00:18:22 between every technology generation, say, for example, from the one shown above about 1989 to

00:18:29 the one shown above 1992, cuts the area a given function requires in half. Since if you square that

00:18:42 square root of two, you get the factor of two that corresponds to the shrinking die size. I'll show

00:18:49 that a little more clearly later on. Now, there are a lot of concerns one has to have about the

00:18:57 ability to continue this. Some technological and some that relate to the basic physics of the

00:19:04 devices. For example, do transistors continue to behave as transistors as the minimum dimensions

00:19:12 get of the order of 10s of microns. The production devices today are around 0.35 microns minimum

00:19:23 dimensions. This predicts a continuous improvement down to the range of perhaps 0.18 and maybe another

00:19:32 generation to 0.13 microns. Laboratory devices have been made that show the electrical characteristics

00:19:41 still are useful for making circuits in this dimensional range. It becomes more questionable as

00:19:50 one gets down to around 0.03 or 0.05 microns. There it appears that some of the physics may actually make

00:19:59 the devices so they cannot be produced as part of a major circuit. But that gives us a long time to worry

00:20:07 about that. And frankly, I'm more worried about the technological problem than I am about the basic

00:20:12 physical limit in this case, because making these narrow features is increasingly difficult. At one time, it was

00:20:22 generally accepted in the industry that it would probably not be possible to make these structures by optical

00:20:30 lithography, the standard production technique, below about one micron. We were using light of wavelength around a

00:20:38 half micron, and getting the high resolution required over a large field size, we thought might require using

00:20:48 dimensions twice the wavelength of light. In fact, we have been able to pass that one micron barrier without ever

00:20:58 seeing a change in the slope of the curve. This was done partially by using shorter wavelengths. We've moved to

00:21:05 ultraviolet, we'll move to increasingly hard ultraviolet as we move down the curve. But at least as big a contributor

00:21:14 has been the progress has been made in optics. The systems that do this today use the state of the art in optics.

00:21:22 These are huge lenses or very sophisticated mirror systems that allow us to make structures over very usable areas

00:21:33 where the structural features are of the order of a wavelength and in some cases by taking special precautions less than the

00:21:40 wavelength of light. We have to continue to push in that direction to stay in this curve because frankly, once we can no longer

00:21:49 use optical lithography, we're faced with some choices that are not very attractive. In fact, we have about three equally

00:21:58 unattractive choices, the use of x-rays, the use of direct electron beam writing or the writing with ion beams, all of which

00:22:07 will increase the cost dramatically and will negate much of the learning that we have accomplished over this period of time

00:22:17 using optical lithography. Staying on this minimum feature size curve is one of the real challenges for the industry. To show you

00:22:27 what some of these little devices look like, the next page shows a cross section through a production transistor today. This is a

00:22:37 transistor made with a 0.35 micron technology and is used in making Intel's most advanced Pentium and Pentium Pro

00:22:45 microprocessors. The silicon substrate is the pedestal upon which the various features are sitting. Actually, the only part of the

00:22:59 silicon wafer that seems really active in these structures is the top micron or so, the rest of it just acts as a mechanical support for

00:23:10 the structure. The two heavy pillars that you see, one on either side of a central structure, are plugs of tungsten that are used to

00:23:24 make contacts to the source and drain of this MOS transistor. The gate electrode being the little structure on top of the center

00:23:35 feature. The caps on top of the tungsten pillars are aluminum interconnection lines that run in or out of the picture in order to hook this

00:23:49 transistor to the rest of the circuit. The entire gray area surrounding the tungsten plugs and the silicon substrate is silicon oxide, which in

00:24:01 this scanning electron micrograph shows up as rather featureless. If we move to the next generation of technology or in fact, two generations

00:24:13 down to the kind of things that are just being made experimentally today, you see the transistor on this slide is showing 0.18 micron minimum

00:24:24 feature size and the related structure. This is really pushing our capability now. In fact, this is a laboratory capability and it will be several years

00:24:36 before this can be done reproducibly in production. I mentioned that more than just density was important in looking at technology appropriate for

00:24:48 processors in the future. The next slide shows, for example, how performance is affected by the transistors and by the interconnections. As you see, the

00:25:03 performance of the transistor continues to improve. That is the delay time related to the transistor gets shorter and shorter as we go to smaller and smaller

00:25:13 minimum dimensions. On the other hand, the delay time and the delay time I'm talking about here for the interconnection is that for a two millimeter line

00:25:24 connecting devices on the integrated circuit increases. Now up until the current generation, which is the 0.35 generation of technology, the delay from the

00:25:38 interconnects is still significantly less than that from the transistors. While it has to be taken into account, it's not yet the limiting factor. As we move to the next

00:25:50 generation of technology, however, at a quarter micron, the delay from the interconnection starts to dominate the delay from the transistor and looking at a little

00:25:59 further in time, far dominates it. So it's something that requires a tremendous amount of attention and continue to evolve the technology so we can take maximum

00:26:09 advantage of it. The reason the interconnect contributes this delay is that as we make smaller and smaller features to take advantage of them, we have to make narrower and

00:26:23 narrower interconnection lines. As we do that, the resistance goes up. Even if we keep the thickness of the film the same, as the line become narrower, its resistance

00:26:33 increases. And as they get closer together, the capacitance from the line to its neighbors also grows. So the RC time constant, the resistance times the

00:26:44 capacitance of these lines is what limits their performance. Obviously, there's an opportunity here to look for materials that lower the resistance or lower the

00:26:59 capacitance. It's hard to find them. Typical interconnections are made of aluminum. There are better conductors than aluminum, for example, copper and silver, but they

00:27:09 present tremendous technological problems and neither has been satisfactorily incorporated into reliable integrated circuits. Similarly, with respect to the

00:27:18 capacitance, there are materials with lower dielectric constant than the silicon dioxide, but not much lower. And they require abandoning a lot of the technology

00:27:30 that has developed here over the decades. Not an easy change to make for a small incremental improvement. There is one thing, however, we can do, though. We can work hard to keep

00:27:40 things close together. And one way to do that is to increase the number of layers of metal. So instead of running distances in one layer, we can run up into an upper story

00:27:52 along and down again. The next slide shows a modern structure. In fact, this structure is actually the one we use in our 0.35 micron technology. The silicon shows just barely at the

00:28:09 bottom of the picture. The section on the right shows four layers of conductor, labeled M1 through M4, with the intervening plugs, typically of tungsten, that connect one layer to the

00:28:25 other. This technology for the multilayer interconnection has moved very rapidly over the last several years, primarily as the result of a technology called planarization, where we're able to restore a flat surface between each series of steps. So we're always working on a surface where we don't have to be too concerned about things like depth of field or about films running up and down.

00:28:55 Some topology that might be relatively difficult. This will continue to be an approach that will help alleviate the problems of performance relating to interconnections. It will allow us to increase the density. But what it does is complicate the process considerably. Each of these layers of metal has an underlayer that makes the metal interface to the insulator.

00:29:25 work well. It has the basic aluminum layer or aluminum copper alloy, typically, and then special metals to make contact to the tungsten in one thing or another. So the number of process steps grows dramatically as we add layers. But the benefit of being able to maintain the very high density and to continue to increase it certainly justifies movement in this direction.

00:29:56 Another problem we have to contend with is one of the minimum layer thicknesses that we use in the technology. In particular, concern about the minimum insulator layer. This is also an important parameter in determining transistor performance. And it's one where we actually are approaching some sort of a physical limit.

00:30:19 I've taken this data from something published by Professor Mead at Caltech. And he shows a point that is sub-tenth micron and about 30 angstroms of being in production. I don't know where that's produced. That's certainly far beyond anything we're doing in both respects.

00:30:39 On the other hand, this trend is one that the industry has been on for some time. While it looks like the trend toward always narrower lines as the minimum feature size, it has a real physical limit that relates to quantum mechanical tunneling.

00:30:59 At 50 angstroms, at a few volts, electrons will tunnel through a material. So even a good insulator like silicon oxide will actually show up as conducting.

00:31:11 The rate of tunneling increases exponentially as the thickness decreases. So at 30 angstroms, tunneling is a very important problem and is something that has to be contended with at even very low voltages.

00:31:29 Now in fact, we've utilized this to make some special kinds of devices where we actually tunnel through the insulator in order to charge or discharge electrodes.

00:31:40 On the other hand, for conventional transistors, it's a problem and the kind of thing that requires us to take precaution. Probably the precaution will be that the layers will tend not to get any thinner than this.

00:31:54 Of course, you have to recognize that a 30 angstrom layer is only a few atomic layers thick, you know, maybe

00:32:04 five to 10 layers of silicon oxide. So we're really getting down pretty close to the atomic nature of matter. Clearly a fundamental limit there. On the other hand, we can continue to exploit the other

00:32:19 improvements on the technology, such as decreasing feature size, even though we're getting down in a range where we can no longer decrease the thickness of the insulator.

00:32:33 As we increase complexity, other things happen. There's a lot going on in a microprocessor with 10 million transistors.

00:32:43 This requires a fair amount of power, and indeed the power consumption has grown as the complexity of the devices has grown.

00:32:53 Partially just because of the increase in the number of transistors that have to be active, but more importantly, because this gets combined with an increase in frequency.

00:33:05 And the power consumption in these devices is typically that of charging capacitors and then discharging them to ground. So you get kind of a CV squared impact on the

00:33:20 total power consumption. That is, if the capacitance goes up, the total amount of stuff on the chip goes up, the power increases, and as the voltage squared, so

00:33:31 the voltage at which the circuits operate is clearly an extremely important parameter.

00:33:39 This curve shows that processors today are dissipating of the order of 20 watts, and indeed that's what Intel's leading edge

00:33:49 processors do. The extrapolation is what the Semiconductor Industry Association roadmap suggests we should be able to do looking forward.

00:34:04 Now,

00:34:05 it takes a lot of work to achieve even this, and the last point out here is nearly 60 watts, an awful lot of power to get out of a package containing one of these

00:34:19 processors, and a real problem for things like laptop computers, where batteries can't support 60 watt processors for very long.

00:34:29 In order to stay on this curve, it's necessary that we scale the voltage of operation.

00:34:36 This was almost a constant for many years.

00:34:40 Five volts was the standard operating voltage for the families of integrated circuits that were used conventionally, and from the 60s clear into the 90s,

00:34:55 five volt operation was where almost all products were designed.

00:35:01 As we got into these power problems, however, the attractiveness of scaling voltage became

00:35:08 higher and higher, and also by scaling voltage we could take advantage of the decrease in feature size,

00:35:17 since the ability to support voltages decreases as we go to smaller and smaller structures.

00:35:25 We've taken the step now from five volts to 3.3 volts, and expect to see it step down fairly rapidly to some kind of an ultimate limit.

00:35:33 While there's no fundamental limit we can point to clearly, the engineering consensus seems to be that somewhere around one volt is where this will stop being productive.

00:35:46 Indeed, getting from where we are to reliable high-performance one volt circuit requires a lot of really good engineering.

00:35:55 There's a lot still to get out of the technology, but we have to look at a lot of directions simultaneously.

00:36:02 Here I've talked about a few of the technical ones, now I want to

00:36:08 change over and talk a bit about some of the economic ones. In particular, let me talk about cost.

00:36:16 There are a variety of things that contribute to the cost of a microprocessor or other complex

00:36:22 integrated circuit. There's certainly the design cost, the product cost, and then the ongoing support cost.

00:36:28 I'm going to say a little bit about each of these. Certainly the cost to design these products

00:36:35 has not gone up as fast as the complexity, but it has gone up at a significant rate.

00:36:41 Where we've increased the complexity orders of magnitude, we've only increased the design cost

00:36:48 by a fraction of that because of the improvement in the computer age that we use.

00:36:55 On the other hand, the design of a modern microprocessor can be several hundreds of millions

00:37:00 of dollars now. If only a small quantity is made, that design cost amortized across the whole

00:37:09 production life of the device can be a significant contributor to the total cost.

00:37:14 For example, if you spent 200 million dollars to design a microprocessor and only made 200,000

00:37:21 of them over the lifetime, which is a large number for a processor dedicated to a single application

00:37:28 or a single company, the cost of the design alone adds a thousand dollars each to the

00:37:35 microprocessors that have been built. A huge increment when you look at personal computers

00:37:42 selling for less than that. Similarly, the design and engineering for the production process

00:37:53 is in the multiple hundreds of millions of dollars range this time for each of these

00:37:57 generations of technology. Not counting the cost of replicating that technology to build it in

00:38:04 volume, just the cost of the engineering necessary to bring it to fruition.

00:38:09 Product cost, we have to look at a variety of contributors. The cost of the piece of silicon

00:38:15 itself, the die, or the silicon chip, the cost of the package and packaging, the cost of testing,

00:38:22 and then these things always require some ongoing support. Let's look at a couple of these things

00:38:30 Let's look at a couple of these things in more detail.

00:38:35 This shows the relative cost for developing generations of microprocessors at Intel.

00:38:42 You see that the Pentium Pro is well over a hundred times the cost of developing the 8080

00:38:50 in the early 70s, and that the cost per generation is something like doubling. I

00:38:57 suppose here's another corollary to Moore's law that I ought to expound upon, but I don't plan to

00:39:04 at this time. This is only one of the factors of the one-time cost we have to consider, but I want

00:39:13 to spend more time talking about the cost of practicing the technology itself. That is,

00:39:19 the economics of building these batch produced devices. By batch produced, I mean

00:39:27 the case where we take a silicon wafer and we cover the entire silicon wafer with identical

00:39:32 structures, so that in the process of building one of these structures, we build tens or hundreds of

00:39:39 them, so that the cost of processing the wafer can be amortized over all the good chips we get

00:39:46 out of that wafer. Then these get cut up and packaged individually, and I want to take you

00:39:52 through a very simple model of how this works, so you get an idea of the kinds of things we have to

00:39:58 be concerned with in practicing this technology in manufacturing. On the next slide, I show the

00:40:08 simple function of how many squares can you get out of one of these circular wafers. That is,

00:40:15 depending on the size of the square, and I'm measuring the size of the square here

00:40:20 as the edge of a square die. That is, a 0.5 die is a half inch by a half inch,

00:40:29 and if you pack those on an 8-inch circular wafer, you can get 220 or whatever that point is there

00:40:37 on the wafer geometrically. It's not just a case of the ratio of the areas and squares don't pack

00:40:43 perfectly in a circle. You have to wiggle them around to see how it fits best, but this is a

00:40:49 straight geometric relationship of how many square die of a certain edge can you get on an 8-inch

00:40:55 wafer. The thing you see right away is that you get well over four times as many half-inch die

00:41:03 as you get one-inch die, which has to do with the efficiency of packing squares in a circle.

00:41:09 Clearly, if what you're interested in doing is making a cheap piece of silicon,

00:41:14 you want as small a die as possible because you want to get as many as you can,

00:41:19 but this far understates reality because, in fact, not all the dice are good. If you look

00:41:27 at the silicon wafer as containing some kind of a random distribution of defects,

00:41:33 and should one of those squares include one of those defects, it's no good. We have to reject it.

00:41:39 You can see that a bigger piece of silicon, a bigger die, has a higher probability of picking

00:41:46 up a defect for a given defect density than a small one does. So, not only do we get more of

00:41:52 them to start out with by making them small, we get a higher percentage of them that are good,

00:41:57 and I have done some calculations along that way on the next slide showing the yield,

00:42:08 that is the percentage of their good, for various die sizes, for various defect densities.

00:42:14 The defect densities here are stated in average defects per square centimeter. Excuse my mixed

00:42:20 dimensions. We use inches for die, centimeters for defects. It just happens to be the conventional

00:42:29 units that we've grown up to use at Intel, and if I translate them, I lose some of the relations

00:42:37 I'm used to being familiar with. Anyhow, this reflects a range of defects from probably

00:42:46 something better than the best production processes today to something comparable with the worst,

00:42:52 because any company that's beyond the one defect per square centimeter now probably is no longer

00:42:58 in business. As you can see, these curves diverge quite rapidly as you go to the right.

00:43:06 So, the defect density is an extremely important parameter here, as is the die size in determining

00:43:15 what we get. In fact, the next slide shows how many good die you get, where we combined the yield

00:43:24 from the previous slide with the number of possible from two slides previous, and see the total number

00:43:35 of good die one gets for various defect densities and various die sizes. Here we see that that 7

00:43:43 tenths inch die that I was describing indeed does give us about three die per wafer should we

00:43:52 have the high density, the one defect per square centimeter in our process, but we might get as

00:44:00 many as 70 or so die from the same wafer if we were operating at a tenth of a defect per square

00:44:08 centimeter. The factor of 20 difference in how many good units we'd get off a wafer, depending

00:44:14 only on the quality of our processing is measured by defects per square centimeter. So, there's a

00:44:21 tremendous premium on low defects in the processing and in squeezing to the smallest possible die.

00:44:32 Another way to look at this is cost. If I take those numbers of die off a wafer and reflect a

00:44:40 cost of a process wafer of something like $3,000, which is certainly in the ballpark, although I'm

00:44:45 sure every company in the semiconductor business would quibble with it one way or the other,

00:44:51 you see that it's very easy to get a die just from the process wafer that costs over $100.

00:45:01 Taking our 7 tenths inch die again, in fact, you have to operate a very clean process,

00:45:08 low defect density, in order to keep the die cost below $100. On the other hand, if I could get

00:45:14 the same function on a piece of silicon that was only a half inch on a side, I get down to $100

00:45:20 cost, even with the poorest manufacturing, and something in the range of $20 with the very best

00:45:26 manufacturing. So, there is tremendous leverage in that direction. And that's the reason you see the

00:45:33 kind of evolution you do in product lines. For example, on the next slide I've shown

00:45:39 photographs of three different generations of Intel's Pentium processor.

00:45:47 The original one made on 8 tenths micron minimum feature size technology.

00:45:53 The one that was a workhorse in 1994, most of 1995, the 0.6 micron technology.

00:46:01 And the current one we're shipping mostly today, the 0.35 micron technology.

00:46:09 The die size is about half the area in each of these. That is, the edge is the square root of two

00:46:16 down in each case. So, the 1995 die is only a quarter as large as the 1992 die.

00:46:25 From that alone, we get four times as many off a wafer. When you take into account the improved

00:46:31 manufacturing in the meantime, we clearly get a lot more than that. So, the cost of one of these

00:46:37 on the right is a lot less than the cost on the left. On the other hand, if you look here, you see

00:46:43 that the maximum frequency is doubled. The 66 megahertz maximum we get off the 8 tenths micron

00:46:50 technology. We now ship 133 megahertz devices. Twice the performance from that alone. The power

00:46:59 has dropped a factor of three. This is that violation of Murphy's law coming in, where we've

00:47:04 made the same device on three generations of technology and have realized the advantages of

00:47:09 going to the smaller structure on the new technology. It clearly benefits in a single generation.

00:47:18 And when looked at over several generations, the benefits are even more startling.

00:47:24 On the next slide, I've showed Intel's entire family of 32-bit microprocessors from the original

00:47:33 8386 to today's Pentium Pro. In fact, I've even shown some here that don't exist yet,

00:47:39 particularly the last generation of Pentium Pro showing on quarter micron technology,

00:47:46 since Intel has no quarter micron technology in production yet. This shows the trend though.

00:47:53 If you look at the first generation of each of the product families, you see that the area is

00:47:58 actually increasing. The Pentium Pro in the first generation is bigger than the Pentium was in its

00:48:04 first generation, even though it's on one generation later technology. Notice the size of the Pentium

00:48:14 and the Pentium Pro, for example, both on the 3 tenths micron technology. The Pentium is

00:48:21 significantly less than half the size of the Pentium Pro. So what we've done is we've taken

00:48:27 advantage of this higher density technology in two different ways here. One place, we've taken

00:48:32 advantage of it to make the same device, lower cost, higher performance. And the other way,

00:48:40 we've taken the advantage of the new technology to make a much more complex device, actually even

00:48:45 somewhat larger than the previous one, but with the expectations of shrinking it onto at least

00:48:51 two subsequent generations of technology. This is the power of this ability to continue to make

00:48:58 smaller and smaller structures, and it's why the industry is driving so hard in that direction.

00:49:06 Now to do this has become increasingly difficult, and as it's become increasingly difficult,

00:49:14 the equipment has gotten far more sophisticated and far more expensive.

00:49:20 We've evolved from a very people-intensive industry to one that is very capital intensive.

00:49:28 I can show this a bit by looking at the cost of the equipment to build 5,000 wafers a week

00:49:36 at various times, various generations of technology, where the minimum feature size

00:49:42 is shown along the curve and the wafer size is shown along the top. I chose 5,000 wafers per week

00:49:49 because that's about the size manufacturing facility that takes full use of the equipment,

00:49:55 where you can balance the output from one type of machine with the next type of machine by

00:50:01 having two of these and seven of those or whatever it takes. So equipment typically doesn't have to

00:50:06 sit idle very long. When Intel started in 1968, we not only equipped our first manufacturing

00:50:14 facility, we developed our first products and processes all for three million dollars.

00:50:19 Today that doesn't buy a single production tool. In order to build a reasonably efficient factory

00:50:29 at the three-tenths or 0.35 micron generation, you see we're well over a billion dollars for

00:50:36 the equipment alone. You add that to a building that probably costs three to five hundred million

00:50:41 dollars and you see you're in a factory that is the order of a billion and a half in order to

00:50:48 have a reasonable efficient manufacturing facility and that's continued to grow. In fact,

00:50:53 the growth to quarter micron is rather significant and before long we have to consider another

00:51:02 increment in wafer size. There's an industry trend toward 12-inch wafers instead of 8-inch wafers.

00:51:10 If you look at previous changes in wafer size, for example, the change from the one and a half

00:51:16 micron technology to the one micron technology, you see the capital cost more than doubles.

00:51:21 Similarly from the 0.8 micron to the 0.5, which goes from six to eight inch wafers, it doubles or

00:51:27 so. I guess from this I ought to speculate that a 12-inch wafer facility in the next generation of

00:51:34 technology, the equipment alone will be in the two and a half billion dollar range.

00:51:40 This is really a capital intensive industry. That's far more than a modern automobile assembly

00:51:46 factory, for example. The reason is the cost of the equipment. Well, twofold. The cost of the

00:51:52 equipment and the fact we're increasing the number of process steps in order to do all these nice

00:51:57 things like add more metal layers to maintain the density. For example, the lithography equipment,

00:52:03 the basic machine for printing patterns on the silicon wafer has escalated in cost from

00:52:09 $12,000 per machine when Intel got started to something like $10 million per machine for the

00:52:16 0.18 micron technology. That's not a firm number yet. There's some speculation it might be closer

00:52:26 to 15. The quarter micron technology, however, with about $7 million cost is something that

00:52:33 we're already spending to put the capacity in place. So from $12,000 to $7 million, a 500-fold

00:52:42 increase in the cost of a machine to process about the same number of wafers an hour. The wafers are

00:52:48 bigger. The quality of the process is a lot better, but basically you put about the same number

00:52:54 through in a given time. Well, what can a fab turn out? The world market for personal computers

00:53:05 is predicted to hit $100 million sometime later this century. It's something well over half that

00:53:13 now, maybe $60 million plus last year and growing this year. We estimated what it would cost to

00:53:19 build 100 million of the kind of personal computers that sold last year for the CPUs, the

00:53:26 memory products, the rest of the semiconductor products that go in there, and discovered it

00:53:32 would take some 40 of these wafer fab facilities of the kind we were talking about. These are wafer

00:53:38 fab facilities that we said cost about a billion and a half apiece. It requires about a $60 billion

00:53:45 capital investment in order to make just the semiconductor devices for 100 million PCs.

00:53:54 Who is going to invest this $60 billion? Well, a lot of people will. Another way of looking at it

00:54:03 though is to look at something called Rock's Law. Arthur Rock being one of Intel's directors from

00:54:08 the very beginning, he pointed out that the cost of processing a single wafer is growing exponentially

00:54:17 with time as well. But who's going to make all these investments? Well, if we add up what people

00:54:25 have spent over the last few years and what they are projected to spend for the next few, we see

00:54:31 that the industry is willing to step up to this kind of investment. This is now a worldwide industry

00:54:37 where the contributions from North America, Asia, excluding Japan, and Japan are all major contributors

00:54:49 to the capacity. In 1995, the industry invested some $35 billion in capital expansion and with

00:54:59 the announced facilities in place, it looks like we will average in the $40 to $50 billion a year

00:55:04 over the next several years, assuming there's not some major turnaround. This is a huge amount of

00:55:12 money and for a very capital-intensive business, ought to be expected to add probably $1 in revenue

00:55:21 for each incremental dollar spent on capital. That is, if we spend $40 billion on capital this year,

00:55:29 the revenue ought to increment by $40 billion, more or less, to make it a reasonable investment.

00:55:38 So we can look at what's expected to happen in the growth of the worldwide market.

00:55:42 This shows the projections from the official agency that collects this kind of data.

00:55:50 And lo and behold, they are showing increments that are at least of the order of $30 billion

00:55:55 anticipated for the market looking forward. This is a very large rate of growth for an industry of

00:56:04 this size. Last year, we were about a $150 billion industry, growing a very large amount

00:56:15 in that particular year and anticipating that this is going to continue.

00:56:21 So we have the bits and pieces in place. I think the technology says we can continue to make

00:56:26 considerable progress by working the same way we have. We have a few more turns of the screws,

00:56:32 generations of technology, before we run across any real problems. The industry responds by saying

00:56:39 we're willing to make the investment to continue to grow like that. So how concerned should we be?

00:56:45 So how concerned should we be? Well, I'm concerned more on the economic end, frankly,

00:56:52 than I am on the technical end. I think we've looked at the technical limits long enough.

00:56:58 We recognize we'll get to some eventually, but the cost of getting close continues to escalate.

00:57:06 And then I look at the requirements and the continued growth of the industry here

00:57:10 for us to make the progress we've made in the past by keeping our investment the same.

00:57:15 And on the next slide, I have plotted that along with an estimate of the

00:57:22 world's gross domestic product summed together. And just looking at semiconductors now,

00:57:28 not even the finished products that incorporate semiconductors, and extrapolating the growth rate

00:57:33 we've come to depend on, the 15% per year sort of thing, and you see that by about the year 2040,

00:57:40 there is nothing but semiconductors. We're 100% of the world's GDP. Well, clearly that's not a

00:57:47 very likely outcome. And I don't think it's very likely that the GDP will actually grow

00:57:53 significantly as a result of our approaching it. I think the solution to this problem has to be a

00:58:00 slowing in the growth of the semiconductor industry to leave room for some of the other

00:58:05 things we need like food and shelter, clothing. In fact, I don't know what a maximum percentage

00:58:14 would be, but on this basis, by something like the end of the first quarter of the next century,

00:58:23 we would be 10% of the world's GDP, which also seems high. In fact,

00:58:29 by about the end of this century, if we stay on this slope, will be 1%. And that's about as big

00:58:35 as most industries seem to get before their growth slows dramatically. The auto industry got to a few

00:58:42 percent at one time. Of course, we represent a subset of the electronics industry. I think

00:58:50 electronics is such a fundamental capability in the information age that it might grow to

00:58:56 if not a double digit percent, at least a large single digit percent. But I don't think

00:59:01 semiconductors can be more than a quarter or a third of that. So I think just the economy of

00:59:08 the world is going to put some limitations on how fast we can grow. In the meantime,

00:59:14 we can continue to enjoy the benefits of what is a phenomenal technology,

00:59:19 one unprecedented in human history. For example, in your PC, just over the last few years,

00:59:28 you can see performance advantages where it was $225 per million instructions per second in 1991.

00:59:37 It's only $11 in 95. And we can look forward to $7 per MIPS this year. Nothing else

00:59:47 gives you this kind of improvement with time. It's a technology that I think will continue to

00:59:52 be extremely important, if not forever, at least for many decades looking forward.

00:59:59 And I appreciate the opportunity to share some of my excitement about the technology with you today.

01:00:16 Thank you.

01:00:46 Thank you.