1996 IEEE NSREC IEEE Nuclear and Space Radiation Effects Conference Short Course
Radiation Effects Challenges for 21st ...
34 downloads
1196 Views
18MB Size
Report
This content was uploaded by our users and we assume good faith they have the permission to share this book. If you own the copyright to this book and it is wrongfully on our website, we offer a simple DMCA procedure to remove your content from our site. Start by pressing the button below!
Report copyright / DMCA form
1996 IEEE NSREC IEEE Nuclear and Space Radiation Effects Conference Short Course
Radiation Effects Challenges for 21st Century Space Systems
WI/
July 15,1996 Renaissance Esmeralda Resort Indian Wells, California Sponsored by IEEE NPSS Radiation Effects Committee Cosponsored by Defense Nuclear Agency Sandia National Laboratories Phillips Laboratory NASA Goddard Space Flight Center Jet Propulsion Laboratory
—
1996 IEEE NSREC IEEE Nuclear and Space Radiation Effects Conference
Short Course
Radiation Effects Challenges for 21st Century Space Systems
July 15, 1996 Renaissance Esmeralda Resort Indian Wells, California
Copyright @ 1996 by The Institute of Electrical and Electronics Engineers, Inc. All rights reserved. Instructors are permitted to photocopy isolated articles for noncommercial classroom use without Ifee. For all other copying, reprint, or replication permission, write to Copyrights and Permissions Department, IEEE Publishing Services, 445 Hoes Lane, Piscataway, NJ, 08855-1331.
Table of Contents
SECTION
I
FOREWORD Ronald
.............................................................................I 1-4
L. Pease
RLP Research,
Inc.
SECTIONII
SPA CECRAFTANOMALIES AND FUTURE TRENDS ............................................................. . . . . 111-52 James C. Ritter Naval SECTION
Research
Laboratory
III
TOTAL DOSE RESPONSE OF BIPOLAR MICROCIRCUITS ..................+......................................... . . . .. 1111-78 David W. Emily Naval Surface Warfhre Center, Crane Division SECTIONIV
CATASTROPHIC SINGLE-EVENT EFFECTS IN THE NATURALS PACEENVIRONMENT Kenneth F. Galloway and Gregory H. Johnson The University of Arizona SECTION
................ . . . ..IV 1-72
V
DESIGN ISSUES FOR RADIA TION TOLERANT MICROCIRCUITS FOR SPACE ....................................... . . . . . v 1-54 David R. Alexander Mission Research Corporation
1996 NSREC SHORT COURSE
SECTION1
FOREWORD
Ronald L. Pease RLP Research, Inc.
FOREWORD
This is the seventeenth year that a Short Course has been offered at the IEEE Nuclear and Space Radiation Effects Cotierence. The Short Course format provides the opportunity to cover topics in more depth than is possible with contributed papers, but is intended to be tutorial in nature. The theme for the 1996 Short Course is Radiation Effects Challenges for 21st Century Space Systems. The general approach for this year’s course, which has been taken with many of the past short courses, is to have topics on systems, ionizing radiation effects, heavy ion effects and hardening solutions. For each of these areas material has been selected that either has not been covered in the past or has not been covered in depth. The four topics relate to the overall theme in the following manner. The first topic is divided into two parts and sets the stage for the topics to follow. Background irdiormation is given on satellite anomalies to demonstrate that, to date, availability of hardened parts and conservative design practices have resulted in few system failures resulting from degradation or fictional failure of microelectronic parts. In the second part of the first topic trends for fbture satellites are discussed, showing that, unless system designers remain vigilant, a greater number of failures maybe expected in fiture systems. In the second topic, a new challenge in the area of ionization damage is presented for bipolar linear circuits, which are still widely used in space systems. In the third topic a challenge in the area of heavy ion phenomena is presented for catastrophic failure of microelectronic devices and circuits. In the final topic the challenge of designing with fewer radiation hardened components is addressed. A solution using a radiation tolerant design and layout methodology with a commercial foundry is presented. Much of the material presented this year is quite recent and covers research areas that are still very active. However, the intent has been to provide sufficient background ifiormation to make the material appeal to a wide audience. As mentioned above, the course is divided into four sections. The first section, Spacecraft Anomalies and Future Trends, discusses the various types of effects which produce anomalies in space systems, giving historical and other examples of such anomalies. A description is given for how system engineers search for the cause of the aberrant behavior of the system, and the accompanying phenomena which ofien provide valuable clues to the cause. Because the causes of many anomalies are never determined, it is difficult to accumulate comprehensive statistics on the distribution of causes of anomalies or failures in satellites. How anomalies are handled through onorbit comections or reprogramming is described, as well as how they are handled for fiture generations of the same space system. The second subject covered under this topic is trends in fiture space systems, and how those trends are tiected by radiation effects in microelectronics and photonic devices and systems. The future holds new types of systems such as cellular telephones and data transfer systems, and, equally important, greatly increasing numbers of satellites. The second section, Total Dose Response of Bipolar Microcircuits, discusses total dose effects in bipolar microcircuits, with emphasis on the recently discovered low dose rate sensitivity of many linear devices. Because past short courses have focused on CMOS microcircuits, the section begins
I-1
.
with fimdamentals of bipolar transistors operation and presents basic processing and device design details, contrasting the difference between vertical, lateral and substrate transistors. Basic total dose effects in bipolar transistors and digital and linear integrated circuits are presented as an introduction to the important low dose rate sensitivity in linear circuits. The remainder of the section is devoted to a discussion of the response of bipolar linear transistors and microcircuits as a fimction of dose rate and irradiation temperature. Recent theories to explain the dose rate mechanism are presented along with implications for hardness assurance. The third section, Catastrophic Single-Event Effects in the Natural Space Radiation Environment, covers catastrophic single particle effects in semiconductor devices and microcircuits. The section begins with an introduction to the space particle environment and the interaction of energetic particles with devices and circuits. The remainder of the section is divided according to the various catastrophic effects: burnout, gate rupture, dielectric rupture and latchup. Although many of these effects have been discussed briefly in previous short courses, none have been addressed in depth. For each of these effects the status of the experimental data is presented, followed by a discussion of modeling to understand the physical mechanisms. Based on the understanding of mechanisms, techniques for reducing susceptibility are presented. The section concludes with a summary of the important features of each effect as it relates to space applications. The fourth section, Design Issues for Radiation Tolerant Microcircuits for Space, presents design-related issues for space applications of radiation tolerant microcircuits, The section begins with a discussion of a general methodology for assessing the impact of different design approaches on the radiation hardness of integrated circuits. A typical microcircuit is partitioned into its fictional blocks, and the primary failure mechanisms associated with each fictional block are identified. Both heavy ion and total dose ionizing radiation effects are addressed. Several electrical and layout design alternatives for each fictional block are evaluated for their contributions to radiation tolerance. The general approach is illustrated for digital CMOS microcircuits, but suggestions for application to other integrated circuit types is also given. The section concludes with an application of the methodology to a radiation tolerant microcircuit. I would like to thank the five authors/presenters, Jim Ritter, Dave Emily, Ken Galloway, Greg Johnson, and Dave Alexander, for their efforts in making this short course a success. A great deal of personal time, on weekends and evenings, is required to meet the schedules and still filfill the commitments of a family and full time job. Although most of the authors have probably had second thoughts about accepting the responsibility of preparing the manuscripts and presentation material, I hope that, on reflection, they will be very satisfied with the results, as I’m sure are all of the short course attendees and readers of these notes. I would like to thank Lew Cohn of DNA for his efforts in reviewing the manuscripts and assuring that the material was cleared for public release. I would also like to thank the DNA Printing OffIce for printing the notebook. Without their efforts this Short Course Notebook would not have been possible. Ronald L. Pease Albuquerque, New Mexico
I-2
Biographies Ronald L. Pease Short Course Organizer RLP Research, Inc. Ronald L. Pease received his B. S. degree in physics from Indiana University in 1965 and pursued graduate studies in physics at the University of Washington, Seattle the following year. He joined NAD Crane (now NSWC-Crane Division) in 1966, where he performed radiation testing of missile components and headed the DNA Bipolar Program. From 1977-1979 he was with the BDM Corporation in Albuquerque, NM. In 1979 he joined Mission Research Corporation in Albuquerque where he was Manager of the Microelectronics Division. At MRC he was the principal investigator on several hardening and hardness assurance programs. In 1993 he formed his own company, RLP Research, where he is now a technical advisor and radiation effects analyst. Mr. Pease has been involved in the NSREC for many years, having served in a number of positions. He has over 40 publications and has won several outstanding and meritorious paper awards at the conference. James C. Ritter Naval Research Laboratory James C. Ritter is head of the Radiation Effects Branch of the Naval Research Laboratory where he has worked since 1962. For ten years at NRL he pursued basic research in nuclear physics. In 1971 he began peflorming and directing research in radiation effects in microelectronics. He has worked on the radiation hardening of satellite systems and has participated in revising the Joint Chiefs of Staff Guidelines for hardening satellites. He has been a Program Manager or Principal Investigator on a number of space experiments such as the Microelectronics and Photonics Test Bed and the Combined Release and Radiation Effects Satellite Microelectronics Experiment. He currently directs an extensive research program in radiation effects in semiconductor and superconductor devices and materials. Mr. Ritter was an instructor for the 1989 NSREC Short Course. David W. Emily Naval Surface Warfare Center, Crane Division David W. Emily received his B. S. degree with honors in electrical engineering from Purdue University in 1976. He has been employed by the Naval Surface Warfare Center, Crane Division since 1973. Initial assignments included reliability assessments and failure analysis of microelectronics. He has been involved in radiation effects research and testing since 1980. Responsibilities have included radiation hardness assurance evaluation of strategic missile components, development and testing of advanced technologies, and research into radiation effects on bipolar and BiCMOS processes. He currently manages the Technology Development Branch which support the research development and testing of radiation hardened microelectronics. Mr. Emily is active with NSREC and has served as official reviewer, session chairman and finance chairman. He has authored several papers in radiation effects, including the 1983 Outstanding Conference Paper.
I-3
Kenneth F. Galloway and Gregory H. Johnson The University of Arizona Kenneth F. Galloway is currently serving as a Professor and Department Head of Electrical and Computer Engineering at the University of Arizona. Prior to joining the University of Wlzona, Dr. Galloway held appointments at Indiana University(1966-’72), the Naval Weapons Support Center (1972-1974), the University of Maryland (1980-1986), and the National Bureau of Standards (19741986). He joined the University of tilzona in 1986. Dr. Galloway’s research interests include solidstate devices and semiconductor technology. He has authored or co-authored more than 100 technical publications. He was elected an IEEE Fellow in 1986 for “Contributions to the study of radiation effects in microelectronics”. He received the Medal of Honor from the University of Monpellier II. He has served in many capacities with the NSREC and RESG, including the positions of 1985 General Conference Chairman and Chairman of the RESG. Dr. Galloway received the B. A. degree from Vanderbilt University in 1962 and the Ph. D. degree from the University of South Carolina in 1966. Gregory H. Johnson received the B. S., M. S., and Ph. D. degrees in Electrical Engineering from the University of Arizona in 1988, 1990, and 1992, respectively. He held a National Research Council Post-Doctoral Fellowship at the USAF Phillips Laborato~ for two years following his graduate studies. Dr, Johnson is currently a Research Assistant Professor at the University of Arizona, where his research interests include radiation effects on microelectronics, microelectronic device physics, and long term aging effects on microelectronic device reliability. Gregory has presented several papers at previous NSRECS, including the Outstanding Cotierence Paper in 1991. David R Alexander Mission Research Corporation David R. Alexander received his B. S. in Electrical Engineering from the U. S. Air Force Academy in 1968 and his M. S. in Electrical Engineering from the University of New Mexico in 1973. From 1968 to 1973 he was an Air Force officer assigned to the Air Force Weapons Laboratory in Albuquerque, New Mexico, In 1973, he joined the BDM Corporation and was the principal investigator for several programs in radiation response modeling of microcircuits. In 1980, he became a member of the technical staff at %ndia National Laboratories. He is currently with Mission Research Corporation and has been Manager of MRC’s Microelectronics Division since 1993. At MRC, he has been responsible for applying computer-aided design and modeling practices to microcircuits. Mr. Alexander has been active in the NSREC for several years and has served in several positions. He has numerous teclin.ical publications and was a recipient of the Distinguished Poster Paper Award in 1988.
I-4
1996 NSREC SHORT COURSE
SECTIONII
SPA CECRAFTANOMALIES FUTURE TMNDS
James C. Ritter Naval Research Laboratory
AND
SPACECRAFT
ANOMALIES
AND FUTURE
TRENDS
JAMES C. RITTER NAVAL RESEARCH LABORATORY RADIATION EFFECTS BRANCH 1.0 2.0
3.0 4.0
5.0
6.0
7.0 8.0
Abstract Introduction Definition of Anomaly 2.1 Importance of Anomalies to Space Systems 2.2 2.3 Causes of Anomalies Space Environment Data Types of Anomalies 4.1 Radiation-Induced Anomalies 4.1.1 Spacecraft Charging 4.1.2 Single Event Effects 4.1.3 Total Dose 4.1.4 Displacement Damage Non-Radiation-Induced Anomalies 4.2 4.2.1 Mechanical 4.2.2 Software Learning from Anomalies Anomalies Can Lead to New Understanding 5.1 Preventing Software Anomalies 5.2 Publishing Anomaly Analyses 5.3 Future Trends 6.1 New Types of Military Space Systems Projected Increases in Numbers and Types of Commercial Spacecraft 6.2 6.3 Space Experiments to Get High Tech to Space in Record Time New Technologies and Radiation Effects Challenges 6.4 6.5 Use of COTS in Future Spacecraft Testing, Hardness Assurance and Shielding 6.6 6.7 Use of Plastic Parts New Technologies and new Effects in Future Systems 6.8 Conclusions and Summary References 1.0 ABSTRACT
This short course paper will discuss the various types of effects which produce anomalies in space systems giving historical and other examples of such anomalies and the results they have on the space systems. It will describe how system engineers search for the cause for the aberrant behavior of the system and the accompanying phenomena which often provide valuable clues to the cause (such as knowledge that a solar flare had recently occurred or that the spacecraft was in a geostationary orbit and the event happened shortly after 01996
IEEE II-1
,,
,
midnight). Because the causes of many anomalies are never determined, it is difficult to accumulate comprehensive statistics on the distribution of causes of anomalies or failures in satellites. It will describe how anomalies are handled through on-orbit corrections or reprogramming and how they are handled for future generations of the same space system. The second subject covered in this talk will be trends in future space systems and how those trends will be affected by radiation effects in microelectronic and photonic devices and systems. The future holds new types of systems such as cellular telephones and data transfer systems and, equally importantly, greatly increasing numbers of satellites. Both these trends and the introduction of new technologies will provide a variety of new radiation effects and new challenges for us and for future generations of scientists and engineers. 2.0 INTRODUCTION There have been spacecraft anomalies almost as long as there have been spacecraft. There were early examples of unexplained behavior and failure of spacecraft devices, subsystems and even of entire spacecraft dating from the late 1950’s when spacecraft were first placed into orbit. A number of the early failures occurred while we were learning to build space systems which had to do a large number of complex operations remotely. This is an expected part of the learning curve in any complicated operation. Many of these failures occurred in the start-up phase of the spacecraft following launch and injection into the proper orbit while the solar arrays and antennas were being deployed, the batteries charged, and subsystems checked out. Nevertheless even after we learned how to build space systems that worked reasonably reliably and far beyond the design lifetime, we were still plagued by a number of occurrences of what appeared to be random upsets or failures in spacecraft after they had been operating for some time. These anomalies particularly seemed to be noticed in the early 1970’s There may have been several reasons for this. By then we felt we knew how to build satellites and we expected the number of problems to decrease dramatically. At the same time, satellites were becoming much more complex which produced many more possible Microelectronic devices were also becoming much more (and more subtle) ways to fail. capable, but yet smaller, faster and lower power. This increased the vulnerability of the devices to effects produced by the natural space environment, and increased the probability of radiation-induced anomalies. 2.1 DEFINITION
OF ANOMALY
Definition: An anomaly is...an abnormality...an regular arrangement, general rule, or usual method.
irregularity...a
deviation from the
The above definition was taken from Webster’s Dictionary. In the scientific community, the above definition applies quite well; the only thing we might add is: ...an unexpected problem or failure. 2.2 IMPORTANCE
OF ANOMALIES
TO SPACE SYSTEMS
Sometimes anomalies can be just small perturbations on an otherwise perfectly operating system and can be unimportant in the larger scheme of things. For example, a single event upset (SEU) can occur in a sensor or memory which contains a lot of data. By its very nature, the error is sometimes obvious, or unimportant. A picture that contains one erroneous II-2
pixel would probably not even be detectable. One erroneous count out of 1000in data, introduces a O.10/0 error in the result and is generally negligible. An upset can also occur in a circuit which has an error detection and correction (EDAC) circuit in it. These types of anomalies cause little, if any, problem. Unfortunately, anomalies can also be of enormous importance to space systems. If an anomaly occurs and the solution isn’t found quickly, the affected device or subsystem or even the entire spacecraft can be lost. Anomalies present very tough problems to space system operators, because, by their very nature they are unexpected, and by Murphy’s Law, they always happen to the most important subsystem and at the worst possible time. The anomaly itself can cause a very serious problem, or the spacecraft operators can not understand the anomaly fully, and take an action which would ordinarily be safe, but which is not safe in the presence of the anomaly. The worst case examples of anomalies can involve turning on a thruster, or spinning up a reaction wheel, misprinting a critical antenna, producing excessive radiation damage in a solar array, or even causing an explosion. Good design practice generally excludes actions like turning on a thruster by a single anomaly. Serious actions like turning on a thruster require a series of steps like activating the thruster control circuit, applying power to the circuit, then giving the command. Such an action would require three separate anomalies and, hence, are very unlikely, but not unheard of as we shall see later. Excess radiation damage to the solar arrays, caused by a large solar flare, is, unfortunately, not as easily excluded by good design; the array is sized to accommodate the expected number of large flares and one can rarely afford to oversize the array, j ust in case a larger number of flares than expected, should occur. 2.3 CAUSES OF ANOMALIES Determining the cause of the anomaly is often not easy. There are usually a variety of clues that can be of significant help. The information needed to analyze the anomaly is listed in Table 1, along with some examples. The first thing to do is to describe just what happened and determine as closely as possible when it was first observed, noting what the spacecraft was doing at the time and what instructions it was carrying out. This often limits the type of anomaly, and the time is useful in later steps. The type of orbit and where the satellite was in the orbit when the anomaly occurred is a very valuable clue. The first example given in Table 1. under the orbit is a geostationary orbit. If the local time is between midnight and six AM, local satellite time, then there is an increased probability that the anomaly is an electrostatic discharge from surface charging, as we will see in the next section; if it occurs in the early afternoon, it is more likely to be caused by a deep dielectric discharge. Determining what the satellite’s radiation environment was at the time of, and shortly before the anomaly occurred, can provide information about the particle that produced the anomaly. If it occurred in the peak of the proton belts or in the South Atlantic Anomaly, it is very likely it was caused by a radiation belt proton. One of the most valuable clues is to look for a disturbance in the radiation environment during, or shortly before the anomaly occurred such as a large solar flare. Large solar flares can produce a number of effects in spacecraft, such as SEU’S and solar cell damage. Previous anomalies on the same satellite or simultaneous anomalies on other satellites in similar orbits are very useful in determining the cause. In the next sections we will illustrate these and other examples by a more detailed discussion.
II-3
,.
3.0 SPACE ENVIRONMENT
!
DATA
The space radiation environment around the time of an anomaly is often of a great deal of importance. It will be useful to describe briefly the space radiation environments which produce many anomalies. A detailed description of the space radiation environments would not be useful here. What is is an needed order of magnitude understanding of each of important the For most environments. _ 1 The Earth’s Radiation <s. effects, the most important environments are the Earth’s radiation belts, sometimes known as the Van Allen belts after their discoverer. The Earth’s radiation belts are shown pictorially in Figure 1 [After II-4
The figure shows the Stassinopoulosz. Earth’s proton and electron belts. One point to note is the asymmetry on the right mouse ear lobe which appears just off the coast of South America. This is known as the South Atlantic Anomaly (SAA). It is caused by the fact that the geometric center of the Earth’s magnetic field is not exactly at the center of the Earth and the field is inclined at 11 degrees to the equatorial plane. When spacecraft pass through the SAA they encounter large increases in particle fluences and this produces many anomalies. Figure 2 shows the integral fluences of protons with energy less than 30 MeV in units of protons/cm2-day as a function of satellite altitude for circular orbits of selected inclinations based on the AP-8 Minimum Mode13. Note that the belts peak at about 3300 km (1800 nmi). Most spacecraft operate either above or below these intense belts to reduce the undesired radiation unless the mission requires it. Figure 3 shows the
ELECTRON CIRCULAR
INTEGRAL
FLUSNCES
SARl?i
ORBITS
(SASED
ON AE4
AVERAGED
OF VARIOUS AND AH
OVER
INCLINATIONS
MODELS}
,@14
I
3
1-
-i
L4z!--1 “-03
F@re
I
1 z
346
3 Earth’s Electron Belts.
10
15
20
PROTON lNlE&RUmUc3
AVERAGED
EARTH ORBITS W VARIOUS INCLINATIONS (BASED ON APWIN NODEL) r, ~ L I
ALmJDEmousmDE oPsAuncALuus)
Figure 2 Earth’s Proton Belts
integral fluences of electrons with energy greater than 40 keV and also 1 MeV in units of electrons/cm2-day as a function of satellite altitude for circular orbits of selected inclination based on the AE-6 and AE-4 NASA Models4. The electron belts peak at about 5600 km (3000 nmi) without any material present. About 80 roils of aluminum will absorb one MeV electrons, so the lower curves can be used to get an indication of how the peak intensity shifts inside a spacecraft. F@we 4 shows the integrated proton fluences intercepted by a spacecraft in a Low Earth Orbit (LEO) when it crosses the SAAS. Notice how much time a LEO spacecraft spends in the SAA, nearly 30 minutes out of a 90 minute orbital period for a 1111 km orbit. The size and shape of the SAA is a strong function of orbital altitude and also of particle energy. If an anomaly occurs while a spacecraft is crossing the SAA, the anomaly was very likely caused by a proton.
II-5
We have discussed the Earth’s radiation belts now let us describe the cosmic ray and solar flare environments. Figure 5 shows the relative intensity of cosmic ray protons for solar maximum and minimum and a solar proton flare for comparison. The cosmic rays are much more energetic than solar flare protons. It is important to recognize that the solar minimum cosmic ray environment is more intense than that of the solar maximum. the cosmic rays penetrate more easily when the sun is less active. The I I I 1 I I I \!u\ Uv , J ,Dimlnm main point of this figure, however, is that if a “ ammmmmm 9@ mmlWcwu’rnmww~ solar flare occurs it dominates the cosmic ray environment by five or six orders of magnitude. Two very intense anomalously -4 Roton Fluence in the SAA [after large flares are shown in Figure 6, the very Stassinopoulos, Ref 5] energetic flare of February 26, 1956 and the very intense flare of August 4, 19727. When the term “ten percent worst case flare” is used, it means that there is only a 10 % chance of encountering a higher solar flare environment than it. Note that for the worst case environment the envelope of the 1972 flare is used above the intersection of the two curves, and the 1956 curve is used below that point. This envelope represents an extremely intense, high energy flare which has not actually been seen in nature. Figure 7 shows that even very ,.9 INTEGRAL PRow4
lW*O
‘
, ,.
i
w
t
1],,,
~hERGY SPECTRA
!,
lLW
ENERGY {MAO
-6 Solar Flare Spectra [After Adarns, Ref 71
Figure 5 Comparison of Cosmic Rays and Solar Flares [after Adarns, Ref q II-6
large amounts of shielding (up to 2 inches of aluminum) do not change the cosmic ray spectrum by very muchs. Large variations in the Earth’s radiation environment are caused by changing conditions on the sun and by the magnetic fields in the solar system which modify the access of solar and galactic particles to the near-Earth regions of space. The sun has an d over-all pattern of activity with an 11 year ml * ,,WI ml , cycle. This is shown in Figure 8 which shows lid 10-’ Id I& 10” D’ ~’ 10 I the pattern for both sunspot numbers and for LET In MeV em’ Is magnetically disturbed days (in which the geomagnetic index, Ap, is greater than 40)9. The incidence of many types of spacecraft _ 7 Cosmic My Spectra Behind anomalks such as electrostatic discharges from Various Shielding [After Adams Ref 8] surface charging follows this over-all pattern too. Figure 9 shows a more detailed look at the solar flare distribution with timel”. It is clear that solar flares are less likely to occur at solar minimum and they seem to have a higher probability of occurring on the leading and trailing edges of the solar cycle. The seasonal distribution of magnetic disturbances is shown in Figure 10’1. Note that the magnetic storms vary by about a factor of two over the year and are highest at the spring and fall equinoxes. This m-ak& upsets more probable during those seasons. ‘This same seasonal pattern is seen in spacecraft anomalies caused by surface charging.
‘t-
SunSPOts WMJ MwnctkStormDaw
,.
. Iwa
li?o
lwa
107s
mm
mm
&
-8 SunSpOtS and Magnetic Storms [After Allen Ref 9]
* 9 Solar Flare Protons for Cycles 20-22 [After Barth Ref 10]
II-7
4.0 TYPES OF ANOMALIES Anomalks can be separated into two for our purposes, broad categories non-radiationradiation-induced and induced. It seems clear that the Nuclear and Space Radiation Effects Conference attendees would be primarily interested in radiation-induced anomalies, and I will concentrate on them. Nevertheless, other anomalies will also be discussed because they are often important causes of the upset or loss of major space systems and it is necessary to have some understanding of them in order to gain some perspective. One important example of a mechanical anomaly will be discussed in some detail because it probably caused the loss of the Mars Observer spacecraft.
14% Major
[
M agnetlc
Storms:
1932-1991 11.Bn
8.1%
“-———–
I Jmwmr&r
MwJlm
JLd Au48a00sImIoac
Figure 10 Monthly Distribution of Magnetic The various types of spacecraft Storms [After allen, Ref 11] anomalies are shown in Table 2. The include anomalies radiation-induced electrostatic discharges that occur as a result of either surface or deep dielectric charge that builds up on the spacecraft. When such discharges occur, they can cause changes of state in memories, telemetry resets, breaMown paths across dielectrics, or they can short out a device. A second important type of anomaly is single event effects. SEE’s include single event upsets (SEU), single event Iatchup (SEL), single event burnout (SEB), and single event gate rupture (SEGR), among the better known effects. AU single event effects occur as the result of a single particle depositing energy along the path of a particle either directly or by means of a nuclear interaction with the material which can produce other particles. SEU often occurs in memories where it changes the state of one or more bits in the region where the particle strikes. SEUS can also give an undesired instruction to a microprocessor or even jump it to an undefined state. SELS can cause a device to burn out if the latched state draws enough current. A large pulse of current produced by a single particle can also cause a gate under bias to ruptu~ destroying the device (SEGR). Total ionizing dose can cause threshold voltage shifts or increased leakage current in CMOS devices which drives them out of specifications and causes failure. It can also cause failure in bipolar devices (operation outside of specifications) by reducing the gain or by increasing the leakage current. Displacement damage results from particles interacting directly with the atoms of the lattice and displacing them, causing charge trapping and recombination centers. This results, for example, in gain reductions in bulk devices and reductions in charge transfer efficiency in Charge Coupled Devices (CCD’S). We will discuss the four types of radiation-induced anomalies listed in Table 2 in order.
II-8
4.1 RADIATION-INDUCED
ANOMALIES
4.1.1 Spacecraft Charging Spacecraft anomalies such as upsets, switching errors and resets had been occurring regularly in some space systems in the early 1970’s. Just what was causing them was not understood. They seemed to be happening often to our major communication satellite systems which operate in a geostationary orbit. It was happening to both military systems such as the Defense SateUite Communication System (DSCS II) and the Defense Support Program (DSP) spacecraft and to commercial satellites as well such as Intelsat and Telesat. An example of the
II-9
,,
,
type of anomalies occurring in synchronous satellites is shownn in Table 3. The anomalies include a number of examples of logic or control circuit switching and other erroneous operations. It was noticed that when the locations of the anomalies were plotted as a function of when they occurred in terms of satelMe local time, the anomalies clustered into one large group (see Figure ll)n. -use a synchronous satellite has a 24-hour orbit, it circles the earth once per day and therefore stays over a fixed point on the equator. The time at that fixed point on 1 the earth is the spacecraft local time]. An v DSP LOGIC UPSEIS n i 0 OscsII RCAUPSETS unusually large fraction of the spacecraft v lMlfLSM IV o INIELSA1 Ill anomalies occurred in the midnight to six a.m. quadrant in the satellite’s local time (much larger than the expected 12). The -11 Local Tme dependence of Upset [After McPherson, Ref 13] probability that 31 out of 47 “anomalous events” would occur at random in a single quadrant was calculated as 3 chances in 109. Other work by DeForest14 indicated that synchronous satellites were probably being charged to high potentials by particles. The explanation suggested was that events occur preferentially just after midnight because the spacecraft passes through the geomagnetic tail where electrons, accelerated by geomagnetic activity, are injected and drift from midnight toward dawn. In addition, if the spacecraft is out of view of the sun, then solar photoelectrons can no longer provide a discharge mechanism. This problem was viewed as serious enough that an experimental spacecraft, was designed and launched specifically to study spacecraft charging, the Spacecraft Charging AT High Altitudes satellite, (SCATHA). One signature of dielectric discharges caused by spacecraft surface charging is the high probability of finding a large number of events occurring between midnight and dawn local time in a spacecraft having a high altitude orbit such as geosynchronous. If the discharges group in the afternoon local time there is a good probability that they result from deep dielectric discharges. These examples illustrate how environmental information can provide the necessary clues to recognize the source of the problem causing the anomaly. Figure 12 shows the insulator pulses (ESD’S) detected on the Internal Discharge Monitor instrument that were recorded onboard the Combined Release and Radiation Effects Satellite (CRRES) in a carefully
Figure 12 Orbital Distribution of CRRES Insulator Pulses [After Violet, Ref 15J 11-10
The flux of electrons with energy greater than 300 keV is also shown designed experiments. for comparison. The CRRES orbit ranges from about 200 nmi to 20,000 nmi in a geostationary transfer orbit inclined at 18 degrees to the equatorial plane. The x-axis is given in Earth radii in units of L, the Mclllwain L parameterlf, where L=l is the surface of the Earth. The geostationary orbit is at about L = 6.6. The spacecraft starts at the left side of the figure and moves down through the radiation belts until near L = 1 it passes under the belts then climbs back through them ending near its geosynchronous apogee on the right. The ESDS correlate well with the electron flux shown in the top part of the figure. The SEUS shown at the bottom nearly all occurred in the proton belts. It was a surprise that over 90 ‘A of the upsets were produced by protons. Before CRRES was launched it was predicted that most of the upsets would be produced by cosmic rays. 4.1.2 Single Event Effects Another effect which produces a large number of anomalies in space systems is Single Event Upset (SEU). The history of single event upsets is interesting. It started with a paper by Wallmark and MarcusI’, of RCA, published in 1962, in which the authors speculated that cosmic rays would be able to produce upsets in microcircuits as the microminiaturization trend continued and that cosmic ray upsets would set the ultimate limit on how small a device could be made. Their paper, however, was so far ahead of its time that it was largely overlooked. The next significant event took place 13 years later, in 1975, when Binder, Smith and Ho1man18, of the Hughes Corp., observed unexpected triggering of digital circuits in an operating sateUite and postulated that the bipolar flip-flop circuits were being upset by charge collected from the dense ionization track of an energetic heavy ion cosmic ray passing through an individual transistor. Binder, Smith and Holman calculated an upset rate which was within a factor of two of the observed results. This paper too was ahead of its time and did not awaken the community to the potential seriousness of the problem. In 1978, May and Woods19, of Intel Corp., discovered that anomalous upsets which were occurring in dynamic random access memories on the ground, were being caused by alpha particles emitted from trace amounts of thorium and uranium in the materials from which the device’s packages were made In 1978 also, Pickel and Blandford20, of the Rockwell Corp., studied anomalous upsets which were being obsemxl in NMOS dynamic RAMs used in a satellite system and concluded that these upsets were due to heavy ion cosmic rays. They also developed an approach for calculating upset rates to be expected in a cosmic ray environment. The Pickel and Blandford paper finally alerted the radiation effects community to the problem of heavy ion-induced cosmic ray upsets in microcircuits. It quickly became clear that both protons and neutrons could also induce SEU’S (Guenze#l and McNult#). Since the discovery of SEU’S, the number of Single Event Effects (SEE) has expanded greatly to Single Event Latch-up (SEL), Single Event Burnout (SEB), etc. One particular device has become famous (or infamous) for causing an unusually large number of upsets in operational spacecraft, the 93422 or 93L422. This device has been responsible for producing upsets in TDRS, LANDSAT, and DSCS and potential latchup in this device also caused the costly redesign of the Galileo spacecraft. The effect of a large solar flare such as that of October 1989 on the SEU rate is dramatic Solar flares are made up of a large number of particle types but are predominately protons. As shown in the top of Figure 13, the GOES-7 spacecraft detected x-rays emitted by the Sun during that solar flare ‘. The x-rays travel in a straight line toward the Earth. The solar flare particles are bent by the magnetic fields between the Sun and the Earth and they 11-11
,,
may or may not couple well to the- Earth, arriving somewhat later. Notice how the SEUS shown in the center of the figure follow the solar particle fluence. The fact that the SEUS track the proton fluences rather than the x-ray rates is illustrated clearly in Figure 14 for the March, 1991 solar flare event?4. Figure 15 shows the SEU rate observed on the TDRS 1 illustrating spacecraft the cosmic ray and solar flare It is important to effects2s. notice that the SEU rate peaks during solar minimum indicating that those SEU’S were produced by cosmic rays. There are also spikes on the curves cortwponding to the large solar flares of 1989 and 1991. Upsets on the UOSAT satellite observed from 1988 to 1992 in TMS 4416 64 bit NMOS dynamic RAMs are shown in Figure 1626. The upsets occur predominantly as the satellite passes through the SAA. This is typical behavior for a LEO satellite. Let us now look at single event latchup in an NEC 64 K CMOS RAM used in a geostationary and a LEO satellite. The details of the observations are given in Table 4 for the Engineering Test Satellite V launched by the NASDA into a geostationary orbit. The eight devices produced about 2.4 SEL per week and 0.76 SEU per week as shown during a period of 3.5 years. During the large solar flares of September 29, 1989 and October 19,1989 satellite 80-180
TDRS-l SEUS ●mdENERGETIC October W’
*$-7
-.
l-m z-
. Day
!6”
~~
SOLAR PARTICLSS
1989
19
20 ,
#
“
*
,C
.
.
_ 13 TDRS Memory Upsets With GoES-7 Data [After Allen, Ref 23]
Proton
SOLAR-TERRESTRIAL ENVIRONMENT March 1991
;
10”
-E
Id
/&L---—l+%————+ -
Id
.I.-.
MIMWN● LOdnna
urmw9
_ 14 CAMS-7Solar-Terrestrial Environment for March 1991 showing SEU’S [After Allen, Ref 24] 11-12
SEL and 50-70 SEU were observed respectively. The onorbit data are shown in Figure 1727. A more careful look at the data (Figure 18)28 shows that nearly 80 SEL’S were accumulated in a little over a day, an increase of over 200 in SEL rate. This same device (in a slightly different version) was also used in the ESA Earth Resources Satellite-1, a LEO satellite launched in 1991. This satellite had an on-board experiment called the Precision Rate and Range Range Experiment which only operated for five days before being destroyed by an overcurrent in the power supply shown in Table 5. Before shutdown, a number of SEU’S were observed and a processor reboot was required29 . The experimenters decided that UOSAT-Z
OHC
MEMORY
lJIWETS
: TEXAS
SEUSOBSERVEDON TDRS1
l$X16
1,987
1~88
45
: . d .
a
TMS4416
16k
. 4-I?lT
-0S
D=
-+5
-90 -100
–90
1,991
F@re 15 SEU’S Observed on TDRSS-1 Bariliot, Ref 25]
w
L . t
1$89 1,990 YEAR
o L.ngitud.
90
100
7995 Jh.mts in 1364 Dmy. Sq)f.~bae “SS - hlqy ,92
Figure 16 Dynamic RAM Upsets on UOSAT-1 [After Dyer, Ref 261
l@92
[After
the burnout was possibly caused by a proton induced They therefore Iatchup. the Engineering tested Development Model and Iatchups found that occurred in the 64 K RAM at the fluences shown. The burnout currents required in the ground tests matched the from seen those telemetry, were consistent with the total accumulated fluence and occurred in the SAA as shown in Figure 1930. SEL is going to be discussed in a great deal more detail in the later talk by Ken Galloway.
4.1.3 Total Dose The first anomaly in operation (and later, the loss) of a spacecraft due to total dose effects occurred very unexpectedly as shown in Table 6. A Bell Telephone Labs satellite called II- 13
,,
II -14
Telstar was launched into a 515 nmi by 3044 nmi orbit with 44.8 degrees of inclination on July 10, 1962. This is an orbit that passes through intense regions of the Van Allen belts. Telstar was launched just one day after a very large nuclear weapon (over 1 Megaton) was detonated at a high altitude of about 400 km. This Starfish nuclear test produced a large number of beta particles (electrons) which became trapped in the Earth’s magnetic field causing an intense, artificial radiation belt to form which lasted until the early 1970’s. This new radiation belt increased the expected total accumulated dose in the Telstar spacecraft by a factor of over 1000 This additional dose produced what was probably the first instance of the loss of a spacecraft due to total ionizing dose damage in a semiconductor device. The command decoder on Telstar I, consisting of diodes and narrow base transistors was not sensitive to displacement damage+ but was sensitive to total ionizing dose In mid-November, it became sluggish and on November 24, 1962 it failed. A series of creative steps were taken and it eventually recovered its function, but again failed, this time permanently, on February 21, 1963. Ground tests on two transistors with the same date code showed failure after 0.6 and 2.3 megarads and were consistent with the dose observed on The failure behavior Telstar. was also the same and it was concluded by Mayo et. al.31 that the failure was caused by surface damage to the Command
ON-ORBIT LATCH-UP ANDUPSETINNECM KCMOS R4MSONTHENASDAETS-VSPACECRAFT
F&n 17 On-Orbit Latchup and Upset of DRAMs [After Goka, Ref 271
ON-ORBIT LATCH-UP ANDUPSETIN NEC64K CMOS RAMSONTHENASDAETS-VSPACE~
Figure 18 On-Orbit SEUS and Latchup-in During Solar Flare [After Goka, Ref 28] II- 15
Dfi
Decoder caused by radiation enhanced from the Van Allen belts. In this case, the radiation unusual environment provided the clue to the cause of the anomaly and ground tests provided the proof or at least confirmed that a similar amount of radiation could cause a very similar effect in the suspected device. When the STARFISH burst was detonated in 1962 it destroyed a total of seven satellites within seven months, primarily due to solar cell damage. It gives pause to one contemplate how many military and
ON-ORBIT LATCH-UP IN AN NEC 64 K CMOS RAM ON THE ESA ERS-1 SPACECRAFT’
Figure 19 Orbii Positions of On-orbit Latchups in DRAMs [After Adarus, Ref 30] II -16
commercial satellites such a burst would destroy today considering the much larger number of spacecraft in orbit now and how rapidly the number is rising! 4.1.4 Displacement
Damage
Displacement damage is also sometimes the cause of spacecraft anomalies. Most displacement damage was historically associated with massive events such as large solar flares and it generally took place in analog rather than digital devices. It is often easier to determine the cause of damage in analog devices because they change in a continuous manner, rather than in one step, and when unexpected behavior starts to occur, it can often be monitored. The monitoring of a slow, but continuous change in an electronic part’s operational parameters is easier to detect than spotting a single, random event, such as a SEU. There is another reason that it was often easier to pin down in the past; it often caused problems in the spacecraft power system which is usually monitored in real time so that the anomaly was observed as it was happening. If you can see the power reducing as a solar flare occurs it is pretty easy to determine the cause. In this section we will begin by discussing some examples of classic displacement damage caused by charged particles such as protons or other heavier particles. There are many examples, but not much mystery. We will then go on to a more recent example where there aren’t many examples, but there is still a good deal of mystery. Solar cells are exceptionally vulnerable because they are on the outside of the spacecraft where they do not benefit from inherent shielding. They typically have 6-30 mil cover slips on the front and a light weight aluminum backing on the back The first example of displacement damage causing the loss of a space system from solar cell damage occurred in 1962. The Starfkh nuclear 70 weapon test, discussed r above under total dose, produced electrons and redistributed protons which caused premature degradation and failure in the solar arrays of three satellites in low-altitude orbits at the time. The three satellites, Transit 4B, Traac and Ariel became inoperable because of degradation in their solar cell power systems32. One I failed in 4 days, one in 24 and a third in 36 days. The solar cell output as a function of time in orbit is shown for Transit 4B and Traac33 in Figure 20. This
30332334536630 1s81 I
4080 196?.
co
loo1201401so lAomom’4oM ON )nMeEll
2s0300
Flgum 20 TRAAC and TRANSIT Solar Cell Outputs vs Time After Starfiih Burst [After Fwhell, Ref 33] II- 17
,.
,
rTTT7Tn II 41
I“”rlll 44
23
24
25
26
27
MARCH1%1
Figure 21 GOES-6 and GOES-7 Solar Cell Degradation [After Allen, Ref 3Sl
Produced by a Solar Flare
figure demonstrates the dramatic effects of Startlsh. With the quality of telemetry shown in Figure 20, it was pretty clear that solar panel degradation was responsible for the loss of the spacecraft, but the cause was a little more obscure. Did the trapped electrons from the weapon cause the degradation or was it caused by the protons which were redistributed by the weapon’s disturbance to the Earth’s magnetic field? One can tell whether damage is caused primarily by electrons or by protons by looking at how damage depends on the depth of shielding material. It happened that on the TELSTAR satellite (launched just after Startlsh) there were a series of solar cells with different coverslip thicknesses. By comparing the damage observed to that expected from electrons or protons of appropriate energies, the experimenters34 deduced that most of the damage was caused by protons. For comparison, we will now look at the more recent effects produced by a solar flare in the GOES-6 and GOES-7 spacecraft during the March 1991 solar flare, shown35 in Figure 21. As the Figure shows, in just nine days, the solar array lost as much power as was expected for three years of operation in a “normal” geostationary orbit environment. We now turn to some rather interesting new results of damage caused by very small fluences of protons. I have placed this under the displacement damage section because it is H- 18
caused by protons and not by Co 60 gamma rays. The problem is that this damage takes place at a very low dose or fluence level, much below that of classic displacement damage. The damage mechanism is not yet understood. The first anomaly appeared in 1993 when the Remote Command Unit (RCU) on a spacecraft failed after about three years in orbit. Fortunately there were redundant RCU’S on board. This same RCU was used on other spacecraft and after about three years in a similar orbit an RCU on the second spacecraft also failed. A year and half later, in 1995, a second RCU on the first spacecraft again failed (there are a number of RCU’S on board) but this time the failure was captured in real time telemetry. Since these RCU’S are used on a number of satellites the problem had a lot of urgency to be solved before another spacecraft with this type of RCU on board was launched. There was another, even more urgent reason for the Responsible Engineer to solve the problem quickly. When a spacecraft anomaly occurs, someone’s name and phone number is on the top of the list to be called at any time of the day or the night. When the first failure occurred, he was in California on business. When the second failure occurred, he was on a trip celebrating his anniversary with his wife in the wilderness. When the third anomaly occurred, he was golfing in Florida. Now, disturbing a business trip is annoying and ruining an anniversary trip with your wife is even worse, but being called back from a golf trip is unthinkable! Clearly something had to be done to solve this problem, and quickly!
3 r—
~~-nm
1-- --r--l
I
I I I I
.-. . . I ●
. 1700
B@kir
650
c!
o
2 -
❑
o
.G t!
0
0
●
1 – v
Vvvo Vo
v
() ~UJ~ .1
1
500
❑
o
450 100
lo
Total Dose (KRad(SI)) GAIN DEGIUDATION IN TRW 4NZ4A OP7YMSGLATORS AS A FUNCTION OF 63 MeV PROTON DAMAGE. SPBC MEANS THE CURRENT TRANSFER RATIO WAS MEASURHI WITH NO LOAD; FLIGHT MEANS m wAs MEASURED As mmm wm A LOAD (FROM REFERENCE 16).
l?igure 22 Opto-isolator Ref 36]
Gain Degradation
vs Dose [After Stapor,
II -19
The basic information for the for solution the problem was by now available. The units started to fail after a fixed time on orbit. That could be either a time or a radiation related failure. The anomalies investigation by the Tiger Team indicated that the failures were related to power switching in The the RCU’S. power switching was done by a TRW 4N24A Gtis optoisolator. This device is used to turn the unit on or off. With optical coupling, the chance of parasitic noise getting into the unit is minimized. The 4N24A devices had been tested by
the contractor and shown to work well, 4 l_ll_lHlll_f_’_7-~ l_T17’1--T” TT-rTl 11 ~ even after a dose of 1= 1 hundreds of kilorads & ; :“ v of Cobalt 60 gamma ● *O’ 800 rays. But there was 3 0 0 one disturbing bit of information; tests - 700 ❑ 0 i several conducted ❑ years earlier on a o slightly different - 600 ❑ 0 version had shown o 4N24A’s the that 1 “w v 500 141nAt3@t started failing after n o a lomspec only the equivalent of Q o 2mAspec o 0 5-10 Krads of protons. 1 1 1 1 t 1I 11 I 1 1 1 1 t 111 0 & t I 1 111 01 al A new set of 4N24’s 1 1 lo W was tested (along with TOTM DOSEOQad@O) a large variety of optoGAIN DM3RADAT30NIN TRW 4N24AOFTCHSOLA*RS “d isolators from other A FUNCITON OF 63 MeV FROTON DAMAGE IN THE MODIFIED manufacturers) at the III(3HT CIRCU~. SPECMEANSTHE CURREJ’TT’ TIUNSFER RATIO WAS MEASUREDWITH NO LOAQ FLIOHT MEANS IT WAS U.C. Davis cyclotron MEASUREDAS FLOWN W~ A LOAD (FROM REFERENCE16). using 63 MeV protons. The data is shown~ in which Figure 22. _ 23 Opt-isolator Gain Degradation Vs Dose for Higher shows how rapidly the Drive Currents [After Stapor, Ref 371 gain degrades after a 63 MeV proton dose of only a few Krads, especially in circuits which draw low drive currents. Further testing with higher drive currents (Figure 2337) or with other devices has shown methods of reducing the problem. As a result of the tesq two frees were identified: 1) modify the flight circuit to draw more drive current (20 ma, rather than 2 ma), and 2) shift to other devices e.g., the 4N49 from different manufacturers (OPTEK and Micropac).
I
This example has shown how sensitive opto-electronic devices can be to proton displacement damage compared to total ionizing dose from Cobalt 60 gamma rays. With the use of more opto-electronic devices in the future, this looks like an increasingly important problem for new space systems. Charge transfer devices - Charge Coupled Devices (CCD’S), Charge Injected Devices (CID’s), etc., have similar vulnerabilities at the same low levels and will also be a potential problem for future spacecraft. It seems probable that as we put more electro-optical devices on spacecraft, we will begin to have more of these anomalies caused by low fluences of protons or other particles. The Tiger Team believes that it has solved the problem which they believe was a radiation failure. I am not so sure, because, as shown in Table 7., I have also noted a coincidence which has occurred with 100VO frequency in this example~, namely that the Responsible Engineer has been out of town every time the anomaly has occurred! I suspect that it is correlated with time+ mther than with radiation, and that the solution is to keep him home and on the job! [In this example it has been necessary to delete a number of details (such as names) in order to protect the gu .... er, innocent] II -20
*
**
Ground tests show failures in 4N24A’s at similar doses. Emergency phone calls verify that Responsible Engineer is not at work
4.2 NON-RADIATION-INDUCED
ANOMALIES
Non-radiation-induced anomalies can result from random failures in devices, operator errors, such as sending an improper command, a malfunction in a mechanical subsystem, such as a reaction wheel, a software error through a transmitted or stored command, or from a variety of other causes. 4.2.1 Mechanical Determining the cause of an anomaly can be a very serious problem requiring a large team of experienced satellite designers and software experts. This is particularly true if a spacecraft is currently non operational and a Tiger Team is assembled to try to recover the spacecraft or if a failure has occurred and the failure is being investigated. A good example of a group assembled to determine the cause of a mission failure is illustrated in the Mars Observer Mission Failure Investigation Board39 which investigated what appears to be a failure in one of the mechanical subsystems, the propulsion subsystem. The Mars Observer program was conceived starting in 1981 and initiated as a new start in 1985. The scientific objectives of the project were to study the surface and atmosphere of Mars, its topography and magnetic field. The details of the program are shown in Table 8. The total program cost was expected to be about $415 M. It was originally planned for a shuttle launch, but after the Challenger explosion, the launch vehicle was shifted to a Titan III, and the planned launch was delayed for 26 months (the next Earth-Mars proper orientation). The Mars Observer was launched on 25 September, 1992 into a low altitude parking orbit and then it was injected into an interplanetary trajectory to begin its 11 month cruise phase (see Figure 24)40. 11-21
TABLE $415 M PROGRAM
!3
MARS OBSERVER
PROGRAM
TO STUDY SURFACE AND ATMOSPHERE
OF MAR$
LAUNCHED 25 SEPT 1992 INTO LEO THEN INJECTED INTO INTERPLANETARY TRAJECTORY TOWARD MARS AT 0021 ON 22 AUG 1993 BEGAN MARS ORBIT INSERTION UNDER CONTROL OF SOFTWARE NO DATA RECEIVED FROM SPACECRAFT
MANEUVERS
SINCE SEQUENCE
INITIATED
FAILURE REVIEW BOARD ASKED TO INVESTIGATE
Figure 24 Mars Observer Minion II -22
[after Ref 40]
All went well during the cruise phase, with only a few anomalies noted until the spacecraft was to begin the first step of its Mars orbit insertion. At 21 minutes after midnight GMT on 22 August 1993 a series of events was started to begin the Mars orbit insertion maneuvers. The system was under control of a sequence of software commands previously stored in the spacecraft computer. During all other events on the journey, a serious anomaly would activate a maneuver abort command, but at this critical time of insertion into the Mars orbit, which had to occur at a precise time, an abort could not be tolerated and hence all spacecraft abort criteria were temporarily disabled. The first step of the sequence was to pressurize the propulsion system. Because of concern that the traveling wave tubes in the RF power amplifiers might be damaged by the shock caused by the firing of the pyro valves, the RF amplifiers were turned off, as planned, just before the sequence was initiated. Unfortunately this meant that there would be no telemetry during the propellant tank pressurization sequence. No data have been received from the spacecraft since the sequence was initiated. Within three weeks of the telemetry failure of the Mars Observer, Daniel S. Goldin, NASA Administrator had requested Dr. Timothy Coffey, NRL’s Director of Research to establish a Failure Investigation Board to look into the failure. The Board consisted of eight other members and 29 technical advisom from NRL, NASA, NOAA, AFPL, the US Navy and the DMSP program ofilce. The Board was broken into six teams responsible for the spacecraft subsystems, namely: Electrical Power, Attitude and Articulation Control, Command and Data Handling, Mechanical, Telecommunications and Software. The Board presented its results after a four month investigation. The Board’s problem was to find the probable cause of the mishap when there was neither telemetry data nor any physical evidence available within 19 light minutes. The Board’s approach was to identify the possible failure modes that could have resulted in the immediate loss of telemetry. Many of these were random failures. The Board considered random failures to be highly improbable since the spacecraft had operated for nearly a year and a random failure would have had to occur during the 14 minute interval during which the telemetry was off and in which a very critical process was occurring. Accordingly, random failure modes were eliminated. Next the Board eliminated failure modes which were not compatible with the detection capabilities of the NASA Deep Space Network and the fact that they had received no further telemetry from the spacecraft. Finally the Board focussed on the commands, actions and software associated with the pressurization sequence which was This effort resulted in an evaluation of 59 to take place during the 14 minute interval. possible scenarios. The justifications for these conclusions are presented in Table 9. The first potential cause was a power supply electronics (PSE) power diode insulation failure. The PSE contains several diodes connected to the power bus. Inspection of the spare PSE box revealed several discrepancies including a misalignment of three often stud-mounted power diodes permitting the diode to come into, or very close to, direct contact with the chassis. On some diodes, a thin kapton insulator had embedded metal particles and scratches in a thermal insulator on both sides of the kapton. The shock from firing the pyro valves could have caused a breakthrough in the insulation causing a permanent short in any of these diodes which would short-circuit the power supply, destroying the spacecraft. Based on the inspectionof the sparePSE, the Board believes that this scenm”o must be retained as a possible cause of the loss of the downlink. II -23
*
**
Noted on spare PSE box. ESA and NASA initiators designed to same specifications,
but are not identical.
The second potential cause was a pressure regulator failure. If the pressure regulator failed in the open position it would cause a rapid over-pressure and a rupture of the nitrogen tetroxide tank, destroying the spacecraft. Failure of the regulator could have been caused by nitrogen tetroxide frozen in the regulator orifice, by contamination blocking the orifice, or by contamination in the seat causing leaking. Since nopost-assemblytestshadbeen cam”edout to vemfithatthesystemwascleanandthe temperatureof the regulatorwasnot known,theBoard reta”ned thisscenarioasapossiblecausefor the loss of the downlink The third potential cause was the failure of a pyro valve charge initiator. Some European Space Agency (ESA) initiator tests have resulted in initiator ejection from the valve body at speeds of 200 m& The ESA initiators are designed to the same specifications as the NASA initiators, but are not identical. The ESA initiators failure mode is from thread erosion. The Board had the acceptance test lot of NASA initiators examined and found erosion of about 50’% of their threads, though none had failed by initiator ejection. For these reasons the Board believes that pyro valvecharge initiator failure is apossiblecauseof loss of the downlink.
II -24
The cause listed as most probable was leaking of nitrogen tetroxide through the check valves which separated the nitrogen tetroxide from the rest of the pressurization side of the system. Since the valves were cold for most of the 11 month cruise, a scenario was proposed in which nitrogen tetroxide leaked through the check valves and condensed on the cold tubing upstream. The nitrogen tetroxide could then mix rapidly with monomethylhydrazine when the pressurization sequence was initiated, causing a rapid heating, rupturing of tubing, or an explosion. Tests indicated that as much as one to two grams of nitrogen tetroxide could have migrated through the valves, much more than enough to cause an explosion. Because nitrogen tetroxz”dewas shown in tests to be able to m“grate through check valves posing a n“sk of explosion, because of the correlation with the pressurization sequence, and due to the lack of other more compelling scenarios, the Board determined that the most probable cause was the unintended mixing of nitrogen tetroxide and monomethylhydrm”ne in the titanium tubing on the prasu~~”on sib of thepropufswnsystimcausedby m“gratkn through MO check vtdves during the 11 month cruisefrom Earth to Mam This example illustrates just how difficult it is to pin down completely the cause of a spacecraft anomaly, especially when telemetry or tapes of the event are not available. 4.2.2 Software The Clementine spacecraft was developed by the Naval Research Laboratory for the Ballistic Missile Defense Organization and it involved the participation of Lawrence Livermore National Laboratory, NASA, and the Jet Propulsion Laboratory as shown in Table 10. The program required only 22 months from program initiation to its launch on January 25,1994 and itdemonstrated the use of advanced technology sensors, light weight components and new computer and digital processing technologies. The program involved making a detailed map of the lunar surface then a fly-by of the near-Earth asteroid Geographos. Clementine provided the first space-based photos of the moon since the last Apollo lunar landing in 1972, taking over 1.8 million multi-spectral images in the visible, infrared, and
ON MAY 7 AN ANOMALY
OCCURRED DURING A TEST
TELEMETRY LOST FOR 20 MINUTES, WHEN RECOVERED SPINNING AT 82 rpm
II -25
CLEMENTINE
WAS
ultraviolet. The first part of the mission, the 70-day mapping of the moon, was remarkably successful as illustrated in Figure 25 which is a map of the south polar region of the moon taken through a red filter by Clementine41. Following completion of the lunar mapping phase, Clementine was supposed to leave the Earth-moon system and head for Geographos. On May 4, 1994 Clementine left lunar orbit and performed a series of maneuvers to make a gravity assisted fly-by of Geographos. New software for the Geographos portion of the mission was uploaded and tested. On May 7, during a centroiding test, the spacecraft experienced an onboard anomaly at 13:39:432 and all telemetry data was lost from the housekeeping processor for no reason. apparent For about twenty minutes the ground controllers struggled F@UW 25 CJementine Map of the South Polar Region of the Moon to recover telemetry. [After Ref 41] When the telemetry was finally recovered it was discovered that the spacecraft was in a rapidly spinning mode (at 82 RPM) and that the attitude control fuel supply was exhausted. The sequence of command events surrounding the anomaly is shown in Table 11. After six weeks of intensive investigation the cause of the anomaly was determined42 to be in the flight software. The anomaly was caused by software in the housekeeping processor. The software defect was in an error handling code that only showed up when a) an arithmetic underflow condition occurred and b) when certain lunar software routines were present. The defect caused the housekeeping processor to become corrupted and placed its instruction counter in an indeterminate location and allowed data execution to go forward from that point. This permitted indeterminate data to be placed on the housekeeping bus which under certain conditions fired the attitude control thrusters during autonomous operations. In the asteroid encounter which was being simulated at the time, the thrusters were purposely enabled to allow for autonomous operations. II -26
II -27
Before any action is taken on a spacecraft it is first simulated on a test bed which is an identical system on the ground which is supposed to be programmed with an identical set of instructions. Unfortunately, the housekeeping processor had been undergoing some unexpected resets during the mission and one reset had occurred on May 4. The spacecraft operators had been trying to discover the source of these resets. In doing so they had decided not to include two software patches in the spacecraft computer configuration. When they later loaded the centroiding program they inadvertently also left out three other patches. It was discovered later that none of these patches was present in the spacecraft computer, but they were present in the ground-based operations test bed, violating the assumption of identical In the dry run on the test bed which was carried out before software configurations. attempting the asteroid encounter simulation, the test bed (with the patches loaded) did not fire the thrusters and hence gave no warning of what was about to occur on the spacecraft. In later simulations without the Datches loaded it was seen that under certain conditions the test bed also gave the order to fire the thrusters. The Clementine example just discussed illustrates just how complex some spacecraft software anomalies can be. It only occurred under circumstances in which a latent defect in the program was permitted to take an unintended action because some other program which would protect the spacecraft was not in place. Under luckier circumstances, the program would have been ground-tested without the patches in place and would have shown that the thrusters could fire under these conditions. After investigation, the defect would have been removed or it would have then been required to check the spacecraft computer configuration for the patches before initiation of the asteroid fly-by simulation. Alternately, if the patches had been loaded in the spacecraft processor, the thrusters would not have fired and the defect in the code may never have been discovered. It is also possible that if the Program had had more time than 22 months, the program defect might have been discovered and corrected before launch. 5.0 LEARNING
FROM ANOMALIES
5.1 ANOMALIES
CAN LEAD TO NEW UNDERSTANDING
The history of the observation of anomalies in space systems began shortly after Sputnik I was launched on October 4, 1957. The first successful U.S. spacecraft was Explorer I, launched on January 31, 1958. A Geiger counter experiment put on board by J. A. Van Allen~ suddenly stopped counting. This anomalous behavior was investigated and it was found that the reason that the Geiger counter stopped counting was that it was, in fact, saturated by an extremely high count rate (much higher than had been expected) Van Allen concluded that the high count rate was caused by a very large number of charged particles trapped in the Earth’s radiation belts. Resolution of this anomaly led to the discovery of the earth’s radiation belts (now called the Van Allen belts). Anomalies which are carefully investigated can result in new discoveries such as the Earth’s radiation belts or in new radiation effects or damage mechanisms. 5.2 PREVENTING
SOFTWARE ANOMALIES
Preventing software anomalies is, almost by definition, very dif!lcult since they are unexpected. One way is to thoroughly check out the software. By repeatedly running the 11-28
program under a variety of conditions, potential defects can be discovered through ground simulation before launch. Even if a software defect escapes detection before launch, it can be found using the ground simulator which is kept in the same configuration as the space computer. If an anomaly is discovered in this way, it can be corrected before an action which makes use of the command containing the defect is taken by the spacecraft computer. The defect can even be discovered after activating the code, provided the action inadvertently taken by the spacecraft computer is not catastrophic. Software fixes can then be uploaded to prevent future recurrences. Even if a spacecraft is damaged or lost due to a software error, there is still a reason for investigating it thoroughly so that it can be corrected in other versions of the same or similar spacecraft or in the next generation of the spacecraft. 5.3 PUBLISHING
ANOMALY
ANALYSES
All spacecraft anomalies are taken very seriously because of their potential for producing a disaster. If the result of a spacecraft anomaly is not very visible however, e. g. if a device just upsets and can be recovered without a problem, the incident is likely to be examined briefly and noted in the log book (including the necessary corrective action), then forgotten. It is only when a spacecraft is lost, nearly lost, or if its status remains uncertain for an extended period of time that a Tiger Team is called in to trouble shoot the problem or a Board is convened to discover the cause. Very few anomalies or anomaly analyses will ever be published. There are several reasons for this. In most cases the cause is not found or pinned down completely. Many anomalies are unique to the particular spacecraft and reporting them through publication would not be useful to anyone other than the spacecraft operators. In addition most SPOS are not prone to publishing their errors or problems. In many ways it would be better if more anomalies were resolved and the results published. These anomalies could then be compiled and the probabilities of each type of anomaly could then be assessed for guidance to future spacecraft designers. 6.0 FUTURE TRENDS 6.1 NEW TYPES OF MILITARY
SPACE SYSTEMS
Future military space systems are facing an unusual opportunity now. With the decline of the cold war, the radiation hardness levels required for military systems are being dramatically reduced. At the same time, the Mid-east War has demonstrated the substantial advantages of space based communication, navigation and surveillance systems, and has also greatly increased performance requirements on such systems. For example, as shown in Table 12, before Desert Storm in the cold war era, there was little tactical use of space. Surveillance satellites were used primarily for intelligence purposes and so a slow revisit time of a few days or even weeks was considered quite acceptable as long as the pictures were of high quality and long term tendencies in the Soviet Union’s development of new weapons or facilities were detected in a timely manner. These satellite systems had large area coverage at low or moderate resolution and could make use of small area coverage at high resolution if a target was found to be of high interest. They also transferred their data to other satellites or to the ground at relatively low data rates. Communication among U.S. strategic assets and with the Navy’s fleet was handled primarily through the use of survivable geosynchronous communication satellite systems such as II -29
FLEETSATCOM, DSCS II and III, UHF FOLLOW-ON, and MILSTAR The Global Positioning System (GPS) was nearing completion and was operational, but its tremendous advantages for self location by ground troops was not fully realized and small, cheap, hand sets were not very available. The satellite systems built in this era were very large, heavy, capable, billion dollar systems. II -30
After Desert Storm, the military has become highly dependent on GPS for location and tracking of troops and equipment and for other uses as well. Space surveillance is now needed for tactical purposes (such as locating SCUDS), therefore high resolution coverage of large areas (such as entire battlefields) is desired and Field Commanders want continuous, rather than intermittent, coverage which in practice means very fast revisit times. In addition, we now expect to interface our satellite systems directly with our ground and sea weapons systems in real time. For example in overcoming the SCUD problem, we wanted to detect the (hidden) SCUDS or spot them on the launch pad, detect their launch, track them, determine their trajectory at a ground-based computer in the United States, and communicate with a missile launcher on the battle field, supplying proper coordinates in time to launch a Patriot missile to intercept and destroy them. The whole process could take no more than a few minutes. According to most reports that didn’t work very well, in general, but it did work at least in part, sometimes! Now it will be expected to work correctly, all the time. MILSTAR has recently demonstrated a successful crosslink between satellites. This technology will also be used more in the future. With the changes in the political and military landscape of Europe (including Bosnia), the dissolution of the Soviet Union, and the development of new and changing alliances, there will be demands for increased flexibility in future military space systems. The prospect of shrinking defense budgets assures that the solutions must be cost effective. In this environment of reduced threats and budgets, but enhanced demand for increased capabilities, higher data rates, better coverage, and more flexibility, there will be a great deal of pressure on the military to develop an entirely new generation of more capable, but cheaper, satellite systems. Furthermore, the reduced budgets and new policies on procurement of commercial parts whenever possible may require that the future systems be developed with commercial microelectronics and maybe using some, or even all, commercial spacecraft. This new generation of satellite systems will almost surely not be as large or as heavy, or as expensive as present systems, but it will have to be more capable and flexible. 6.2 PROJECTED SPACECRAFT
INCREASES
IN NUMBERS
AND
TYPES
OF COMMERCIAL
During the last several years, there has been an explosion in interest in commercial satellite systems. They have been proposed for world-wide cellular telephones, data transmission, faxing, direct broadcast television, movies and video conferencing as shown in Table 13. The projected investment in these new satellite systems is truly phenomenal, and the companies involved propose to build well more than 1000 spacecraft in the next ten years. The largest proposed system is that of Teledesic with 840 satellites plus spares for a total of nearly 1000 satellites in that one system alone. The cost of this system is projected at $9 B. Teledesic and other commercial communication satellite systems are shown in Table 14. The five systems described above in Table 14. are by no means the complete complement of proposed systems; there are at least 10-15 more that regularly are reported in newspapers. It is doubtful that all of these satellite systems will be able to obtain funding, FCC licenses, and be built. The five shown above are the larger cellular telephone satellites and are in general more likely to be developed than some others. As illustrated above, world-wide cellular telephone systems require very large numbers of spacecraft to provide global coverage and hence they require very large investments. There are certain trade-offs which are critical 11-31
,,
II -32
,
to the design of the system. The lower the orbit the smaller the ground antenna and the lower the power required for good signal strength. For hand held personal communications this is a critical need. Unfortunately, the lower the orbit, the larger the number of satellites required to obtain adequate coverage and the expense of the system goes up. To reduce the number of satellites, a designer must raise the orbit and this means larger antennas and more powerful transmitters. This type of system is more suitable for large, fixed ground stations. Of course as the orbits rise up into the intense radiation belts, the radiation-induced anomalies will increase too. There are other classes of commercial satellites including navigation, meteorological and remote imaging spacecraf~ The direct broadcast television system, which recently became operational is a satellite system designed to compete with cable TV. These commercial space systems are, in general, substantially smaller and lighter than military systems, but are remarkably capable. Not all of these systems will be economically viable and some of them will not be completed, but in the long run, competition for business will determine which ones succeed and which ones fail. This intense competition will generate strong pressure to use the latest state of the art devices in new systems to gain increased capability, but it will also require using devices which are sufficiently hard to survive in the orbits in which the spacecraft operate. Note that some of the systems shown in Table 14 operate in very severe orbits from a radiation standpoint. Any commercial space systems which develop a large number of anomalies will have serious problems if the anomalies cannot be resolved quickly and without significant impact on their users. Future commercial users will be even less understanding about anomalies, downtimes, and failures than present military and NASA users are. 6.3 SPACE EXPERIMENTS
TO GET HIGH TECH TO SPACE IN RECORD TIME
In the current design and construction of space systems, it is typical for it to require five to seven years from the time that the technology of choice is frozen until the spacecraft is actually launched. As shownain Figure 26, the devices must be chosen early in the program in order to provide time to order the devices, radiation test them, build an engineering model, build a flight qualified model, build a flight model, perform flight qualification testing, integrate the subsystem into the spacecraft, and launch it. The lead time gets even worse if there is a delay in launching the satellite. Satellite systems would benefit greatly if we could find a way to reduce the amount of time between the technology freeze date and the start of actual satellite operations. The use of smaller, less complex, less costly satellite systems will help reduce these times somewhat. There is a tremendous advantage of using the latest state of the art devices in space systems. These devices may represent breakthroughs in technology which can increase performance by several orders of magnitude, or they maybe just the latest device which has a factor of two or less gain in performance. In either case, there is a strong reason to try to get the devices or technology into space as rapidly as possible. There is only one small problem; namely, space System Project Offices (SPO’S) have been badly burned in the past by using new technologies in which the new vulnerabilities were not fully understood. This has a tendency to make them very sensitive to new technologies and to make them reluctant (with good reason) to rapidly introduce new technologies into their systems if they have not been For small changes, such as the newest generation of previously flight qualified. II -33
F~re
26 Typical Times Required to Build Space Hardware
[After Fox, Ref 44]
microelectronics, the problem is rather easily solvable. Thorough testing of the new devices at ground-based radiation facilities is generally suftlcient. The rest of the space qualification occurs as a part of qualification testing for the system. For the breakthroughs in technology, however, it is an entirely different matter; flight qualification is essential. The problem is, of course, how does a new technology get space qualified if no one is willing to fly it first? The only viable answer for the future, in my opinion, is to fly the newest technology in space experiments. In the future, space experiments will be used to demonstrate the performance gains of the new technology, and to show that they do not have any radiation effects or anomaly show stoppers. In this way we can get our latest generation of new devices and technologies into space in the most rapid time possible and increase the performance (and 11-34
reduce costs) of our next generation of space systems. In a properly designed space experiment, devices can be operating in space for the first time about one year after their initial acquisition. Before spacecraft System Program OffIces (SPOS) would be comfortable flying new technologies, the risk of using such technologies will have to be reduced. To reduce risk will require space testing of the modern generation of devices (sub-micron feature size) with concurrent more accurate ground testing and modelling programs and the development of prediction methods which more accurately predict the on-orbit performance of sub-micron devices. Better models are, of course, also needed for each significant change in technology It will also require better measurements of the space environment which occurs. simultaneously with measurement of space upset rates to reduce uncertainties. By using more accurate tests, models and environments we can reduce the over-all uncertainty associated with these new devices or technologies and therefore reduce the safety factors required for assurance of space system survivability and operability with a minimum of anomalies. If the uncertainties in predictions of radiation effects can be reduced, then spacecraft can be safely flown with much smaller safety margins. This will permit more modern devices to be flown. Carefully designed space experiments can make it possible for the next generation of space systems to have significant increases in performance, while maintaining the same reliability and operability. 6.4 NEW TECHNOLOGIES
AND RADIATION
EFFECTS CHALLENGES
When considering what the new technologies for space are going to be, it is never a good bet to bet against silicon. Silicon has run up against limit after limit and the Gurus have predicted it would lose out to a new technology only to see it overcome this new hurdle and out-perform its rival. It is a safe bet to say that for the foreseeable future silicon will be the technology of choice for the great bulk of the applications as shown in Table 15. There is a new emphasis on developing very large silicon solid state memories to replace less capable and less reliable tape units in spacecraft. These new solid state memories are like a loaf of bread made up of many slices. Each slice, however, is the latest state of the art memory unit available, currently a 16 Mbit or 64 Mbit Dynamic Random Access Memog (DRAM). There may be as many as 50-100 slices to produce multi-gigabit memory units. These large memories are extremely capable, but they are also very vulnerable to total dose, SEE, and probably to future anomalies. Large RAM’s, gate arrays, application specific integrated circuits (ASIC’S), and other logic circuits are the most used digital parts in space systems today. Gate arrays are finding a multitude of new uses in space. It appears likely that in the future we will have a very large number of new, high technology space systems. The effects of new technologies on new military and NASA space systems can already be seen. There is a shift from copper wired spacecraft with very large, heavy and expensive cable harnesses to a new generation of spacecraft which use Fiber Optic Data Busses (FODB) in place of wires as shown in Table 16. This can provide data transfer systems which are not only much lighter and less expensive, but also much higher speed (up to a few gigabits now) and more flexible. These new FODB’S will also contribute greatly to the ease of assembly of new spacecraft. The various spacecraft subsystems can be connected much more easily with fiber optic connections, rather than the more expensive and complex wiring harnesses. Furthermore, the wiring harnesses have to be soldered into place and if any II -35
connections need to be changed, it is very difllcult to make those reconnection, whereas, with a FODB system, recoupling is easy. It is expected that spacecraft will be able to be assembled in a matter of days rather than weeks, using the fiber optic technology. A typical ring data bus is i11ustrated4Sin Figure 27. The shift toward the use of fiber optic devices in space and toward faster subsystems and higher data rates has forced us to move from slower silicon devices to faster GaAs devices and very recently to even faster InP devices. Such moves can have large consequences from
II -36
the standpoint of vulnerability to SEE phenomena, total dose effects, and the occurrence of spacecraft anomalies. For example, GaAs has been shown to be somewhat less sensitive than silicon to total dose effects, but extremely sensitive to SEE. A new technique for growing GaAs at low temperatures (LT GaAs) has recently been shown to produce GaAs devices which have many orders of magnitude lower SEU cross sections~. A buffer layer is first grown at low temperatures of a few hundred degrees C, then annealed at higher temperature forming arsenic precipitates. These form charge trapping and recombination centers which do not allow any charge which may be deposited in the device to be collected thus providing a reduced SEU cross section. As illustrated in Table 17, these LT GaAs devices have the advantage of high speed performance, less sensitivity to backdating, Figure 27 TRW_HONEYWELL FODB Ring and low SEU cross sections similar to those [After Dale, Ref 45] of silicon devices. Figure 28 illustrates47 the reduction in sensitivity of LT GaAs over regular GaAs. The standard GaAs cross section is shown for a 336 stage shift register as a function of LET. When the same tests were made for an identical device with a LT GaAs buffer layer, no upsets were observed up to an LET of 90!
-d
‘!jijbk?. FODBRing
lnP has great potential for use in space. It is useful both as solar cells and as high speed devices. Its characteristics are shown in Table 18. Figure 29 showsa the radiation response of a selection of silicon, GaAs and InP solar cells. The beginning of life etllciency of both GaAs and InP are comparable and both have higher etliciency than silicon. Notice how much flatter the InP curves are than the silicon or GaAs curves. InP on silicon is not as well developed as InP on InP so its beginning of life efllciency is not as high but it is a much better material and much less expensive. InP has the potential to be a technology enabling technology because it II -37
HONE~LL G NQ 5
v ‘0-6
~
10-’
[+”’0
o
~ *= dl
“m
v
E 0 v
v
v C-12
98
MeV
FI-19 130 MoV Cl-35 209 MoV NI-58 260 M@V 1-127 320 MeV
10-8
1?
IJJ .* m
GaAs SEIUAL
m
8
9
10-5
~ 8 8 d
336 STAGE STANDA~ SHIFT REGISTERS
100
10”9
0
Serial
Mbps
m 40
m 20 LET
Data
Rata 1 60
9 80
100
(Me V*cm2/mg)
NO UPSETS OBSERVED ON DEVICES WITH LT BUFFER LAYERS UP TO AN LET OF 90
Figure 28 Reduction in Sensitivity in LTGaAs Shift Registers Compared to GaAs Devices [After Marshall, Ref 47] can be operated anywhere in the Earth’s radiation belts. This will permit satellites to be launched ‘much more-cheaply into a LEO orbit, then powered by low thrust thrusters to reach a geosynchronous orbit in weeks or even months and yet still have excellent solar power when they get there. InP is also a useful material for use in devices with its low SEU cross section, total dose insensitivity, and its very high speed performance; its only problem right now is that it is not easy to produce and package. InP is even less sensitive to total dose effects, and has been recently shown to be very insensitive to SEU49 as well. Figure 30 shows the cross section as a function of proton energy for an InP data sequence generator chip compared to unhardened GaAs. As the speed of a device increases its upset cross section also increases as shown in the II -38
figure for GaAs. The GaAs operates at 400 M bps, whereas the InP device operates at 13 G bps and yet has a SEU cross section three orders of magnitude lower.
Radiation Response of lnP/lnP lnP/Si, GaAslGe, and Si Solar Cells .
-20 $
AMO
c 6.5 USE OF COTS IN FUTURE SPACECRAFT
.{) .; w 10 c o -
In June 1994 Defense Secretary William ● InP/inP Perry issued a directive to . lnP/Si (2x4 cm2, Spire, 1995) t! military to use COTS a?s - ■ G~Ge parts in military systems (Solar Cell Radiation Handbook Addendum) c whenever possible. v Csi Regardless of the 6 (Solar Cell Radiation Handbook) , u directive, it is becoming a :.,2’ “’’;.,3’ “’1’014’ “’1 0,6’ 10’6 10’7 diftlcult problem to build spacecraft because of the 1 MeV Electron Fluence (cm-2) lack of hardened parts. .,. Military, NASA, and commercial spacecraft Figure 29 Radiation Response of InP Solar Cells Compared to builders may be forced to Silicon and GaAs Cells [After Walters, Ref 48] use COTS, even if they don’t want to, in future space systems. However, use of COTS in space systems contains significant risk. Let us examine the use of COTS parts in future space systems. ●
The advantages of using COTS devices in future space systems are shown in Table 19. The primary reason for wanting to use COTS devices is that they are state of the art devices,
II -39
whereas rad hard devices are hardened several years later than the same COTS device and hence Hughes lnP\ Proton Upset Test Results use several year old This means technology. I 1 1 1 10-9 that COTS devices are an Hughes 13 Gbps’ lnP ve. Unhardened Motorola GaAs ! order of magnitude more capable than rad hard devices. The choice is between a COTS Pentium or a rad hard 286-386 The microprocessor. choice is between a 64 M bit DRAM or a 1 M bit ,o-f4~ In addition, SR4M. 70 10 are COTS devices less substantially Proton Energy (Mev) expensive than rad hard devices. The worst case difference in price which I have encountered is for an RH 3000 which we wanted to buy for a space Figure 3(J Cross Section vs Proton Energy for InP Sequence The experiment. Generator Chip Compared to GaAs [After Marshall, Ref 49] microprocessor plus four or five glue chips would alone was $17 K and there was a lon~ and have cost nearly $100 K. The micromocessor . uncertain delivery time. We chose a less capable rad hard 3081 microprocessor, but in the 1
H -40
I
I
1
1
1
right orbit, we could have purchased a 64 bit IDT 4650 microprocessor for a few $ K, and had a more capable device. Commercial software is often much more available for COTS devices than for rad hard devices. Software development is a very costly part of building space systems today and is growing rapidly in complexity. If rad hard devices are not identical to an earlier version of a commercial device, then space system designers will have to develop much, if not all of the software for it. This is both expensive and time consuming. In addition, non-commercial software must be checked extra carefully for any errors and for its response to unexpected situations. It could become a prime candidate fo-rlater spacecraft anomalies. The U. S. rad hard electronics market was < 0.4°/0 of the commercial market and is shrinking rapidly. In 1991, there more were than 20 manufacturers of rad hard devices. Now, it is hard to count more than three or four (see Figure 31). The reasons why rad hard manufacturers are going out of business (or have gone) are illustrated in Table 20. The Cold War is over. Large and costly strategic systems such as nuclear missiles and radiation hardened satellite systems are going to be built in much fewer numbers. This leads to a perceived loss of future in the field and so it does not encourage investment in the area. Secondly, because of increases in commercial electronics orders (including those destined for commercial
CURRENT UNITED STATES RADIATION-HARDENED DEVICE SUPPLIERS ● ●
Honaywell Natkrlal
●
Loral
●
UTMC
DEVICE SUPPLIERS WHO NO LONGER SUPPORT THE RAD-HARD MARKET LSI ● TI
●
● ● ● ● ●
TRw Rockwell Im GE (RCA) sandia
● ●
Hughaa Raythaon
●
Martin-Marbtta
●
McDonnalkDougka
●
Motorola
● ●
Ford Mlcroekotronks AT&T
F@re 31 Current Radiation Hard Device Suppliers vs Suppliers in 1991
II -41
,.
,
space systems), the demand for commercial devices is beyond the current manufacturing capability. In addition, the commercial market is more profitable. When a company has unfulfilled orders for a high profit item and very few orders for a low profit item, it doesn’t take a rocket scientist to predict their course of action. Finally, equipment for each new generation of devices is becoming more expensive. A major change in device technology may cost up to $1 B or more for new equipment. Device manufacturers have to make these large investments every few years. Such an investment is not made lightly. In this environment, a large investment would certainly not be made for what could be perceived as a niche market. We have outlined the reasons for wanting to use COTS devices in future space systems, but there are many reasons why it will be difficult to use COTS devices in space. Some of the reasons are shown in Table 21. Survivability in even the natural space environment would now be ditllcult with many COTS devices, some of which are harder than 100 Krads and some of which are softer than 10 Krads. Of course, it depends on the orbit, and requirements for a system may vary from 10-20 Krads to 100 Krads or more. Many COTS devices are soft now, and advanced devices will be softer. In addition, COTS hardness varies somewhat making COTS hardness assurance an important new research area. Furthermore, while the reliability of COTS devices used in normal environments is no problem, their reliability in stressing environments (radiation, thermal, vibration, etc.) is uncertain. SPO’S will need to perform a large amount of measurements and testing to gain confidence in the space reliability of COTS devices in stressing environments. In order to be able to use COTS devices in future satellite systems, SPOS will need to buy them in large lots, possibly directly from the manufacturer, test them for radiation and other stresses, and place them in bonded storage until used (effectively space qualify the devices).
TABLE
21.
COTS
IIISAI)VANTAGES
Survivability In Many Desirable Orbits is Difficult Using Some COTS Devices, Other COTS Devices Will Survive in Most Orbits Many COTS Devices Fail Now at 10-50 Krads - Advanced Devices Will Be Softer Cots Hardness Is Not Controlfed and Varies Widely Reliability Of COTS Devices In Stressing Environments
Is Uncertain
COTS Devices Will Need To Be Tested And Placed Into Bonded Storage Cots Devices Will Have To Be Space Qualified There Is A New Generation Of COTS Devices Every 6-18 Months Safetv Factors Currentlv Used Are Too 13i@hfor Manv COTS
~evices
Another problem which will have to be faced is that COTS devices change rapidly; there is a new generation every 6-18 months, and small changes are made more often than that. This is both good and bad news. The good news is how rapidly a new device becomes available which is faster, more capable, or lower power. The bad news is that if you need a few II -42
more devices, after you have built the system, you may not be able to buy them because they are obsolete. Finally, the large safety factors often used in satellite systems are frequently too big for using COTS devices. For one space system we are working on, in the early 1980’s we somewhat arbitrarily set a specification of 100 Krads. We did this because it was not very difficult to get 100 K rad parts, and they were safe to use. Gradually we began getting more and more exceptions to the specification. We have recently reexamined the specification and now sometimes use 30 K rad parts and sometimes even less. Our on-orbit dosimeters show an expected dose of about 10 Krads in the spacecraft lifetime This safety factor has been reduced over the years and having an on-orbit dosimeter has certainly helped us to make that decision. We still buy 100 K rad parts whenever they are available. Any satellite system that plans to use COTS devices will have to depend on smaller safety factors; safety factors of ten will become increasingly diftlcult to obtain. 6.6 TESTING,
HARDNESS
ASSURANCE
AND SHIELDING
A space program which plans to use COTS devices has several alternatives for implementing radiation survivability. (It must be said at the outset that the thinking for implementing such a program has not been completed yet). First as shown in Table 22, an extensive testing program must be planned and implemented. It will be necessary to test a larger number of COTS devices to get a selection of devices which is suitable for use in a space program. It will be necessary to test devices from several manufacturers. It will also be necessary to invest in large lot buys of the devices so that if a device tests out well, or if the device manufacturer makes a change in the device there will be sutlicient devices for the space system. Poor lots will have to be used in breadboards, engineering development models, flight In many cases use of shielding will be qualified models, ground systems or discarded. required. If the system requirement is 50 Krads and only 25 K rad devices are available, then shielding is perhaps an optimal solution. Shielding is much cheaper than redesign. If only a
TABLE
EXTENSIVE
22.
USE OF COTS IN SPACE SYSTEMS
TESTING PROGRAM
REQUIRED
LOT BUYS SHIELDING ON-ORBIT UN-POWERED
SPARES
LATCHUP SCREENS EDC OR TMR MAY BE REQUIRED MUST DEVELOP NEW TECHNIQUES
FOR HARDENING
II -43
CRITICAL DEVICES
small number of devices with insutlicient hardness are to be used, then spot shielding is the preferred option. If a large number of devices with insufficient hardness are to be used in the system, then box shielding is likely to be more effective. The tendency toward building smaller, monopurpose spacecraft such as proposed by NASA may acerbate this problem. A possible solution which has not yet been proven, but which may be employed in future systems, is the use of on-orbi~ unpowered spares. To the extent that unpowered devices damage much less in a radiation environment, it may be possible to fly a small number of devices, e.g. two to four, in place of one very sensitive device. As the device damages a new, previously unpowered, device is switched in as a replacement. This would be a high risk procedure at present and the concept must be tested in a space experiment before it could be implemented in an operational space system. It may also be necessary to use a small number of devices for which none of the above approaches will work For example if the devices contemplated for the system have a measurable Single Event Latchup (SEL) cross section, most SPOS would not permit their use in their system. A device with a high SEU cross section or a very low total dose tolerance could also result in the proscription of its use. Alternatives such as Error Detection and Correction (EDC) codes or Triple Modular Redundancy (TMR) could certainly be used to help reduce the SEU problem in many cases, but at a cost of system efficiency. There would almost certainly remain a number of devices for which the above solutions would not work but yet were necessary to operate the system. In order to use these devices the SPO would have to harden them to total dose, SEU, and SEL. 6.7 USE OF PLASTIC PARTS At present many SPOS are being forced to consider the use of parts in plastic packaging because so many commercial parts come that way. There is still the feeling that these plastic parts still have lower reliability. Table 23 shows some of the considerations involved in making a decision to use plastic parts. There is currently a large disagreement among SPOS about the use of parts in plastic packages in their space systems. Some SPOS say they would redesign the whole system rather than use a plastic part. Others say that in the future they may have no choice but to use plastic parts because of the unavailability of parts in space qualified packaging. In the past parts in plastic packaging have not been as reliable as parts in ceramic or metal containers. There are some indications that this is changing, but the question is will they be reliable in stressing environments such as space with high radiation levels, a wide temperature range, high vibrational levels, etc.? This information simply is not known at present. Furthermore a new problem has recently surfaced. Devices have been shownW to exhibit different behavior in a radiation environment after burn-in than they do if they are not burned-in as shown hJ Figure 32. For devices in ceramic or metal packaging the problem is small, but for devices in plastic packaging, the problem appears critical. The flight qualified parts must be treated in exactly the same manner as the parts which will be flown. 6.8 NEW TECHNOLOGIES
AND NEW EFFECTS IN FUTURE SYSTEMS
Whenever new technologies are introduced into a new radiation environment such as space, new effects and anomalies are discovered. Furthermore, as shown in Table 24, the II -44
advances in technology in recent years has resulted in devices which have: (1) smaller feature sizes; (2) faster operation; (3) lower voltage and lower All of these power. desirable goals lead to devices which are more vulnerable to the effects of radiation. Smaller feature size means less capacitance, for example, and hence information is stored with less charge. Faster operation means that a decision that a zero or a one should be stored is made with less charge for the same current. Use of lower voltage or lower power devices means that less charge or energy is required to store
Cilii9
ICCJ-IMax vs. Dose/AnrIealfor Four Groups at High Dose Rate
Ckm
_Maxbnum
ICCHSrIK
+mE-oa 1.DOE44
1.00E48
1-* imlE-09 low-n 1.WE.11
PmRad
m Oow
(1%
180
Ambd
Figure 32 Difference in Behavior of Devices in a Radiation Environment Before and After Bum-in [After Clark, Ref 50]
II -45
,.
information. In each casq the information is stored using a smaller number of electrons. Each of these effects makes the device more vulnerable to radiation and means that small effects which were once negligible such as a cosmic ray or a proton passing through the devices are now much more likely to produce upset or damage. In the future this can only get worse. In this environment military, NASA, and commercial spacecraft builders will have to find costeffective ways of using devices which have become more vulnerable while at the same time controlling the effects of radiation-produced anomalies, upsets, and failures. If a new, serious, radiation effects problem were to come up as a result of reduced feature sizes or new technology, for example, then it is clear that we will not have the enormous resources at our disposal that DoD put into radiation hardening in the 1970’s and 1980’s.
II -46
Suppose the trends that we have been discussing come into practice in the near future. be? This question is addressed in Table 25, The device changes What will the consequences listed seem unavoidabk The technology changes and implementation choices are options for the SPO to decide, although he may ultimately have little choice about COTS or plastic packaging
use.
If the changes shown in Table 25 occur then there are going to be consequences which rmult direct4y from these changes. These projected consequence are shown in Table 26. Most of the changes will result in an increase in the number of anomalies and effects which have been hidden by the use of large safety factors will begin to show up, such as dose rate effects. One should not be unduly alarmed by the consequences listed in Table 26. Introduction of changes into new space systems is a self limiting process. If SPOS begin to get in trouble, they will become more consemative and introduce fewer changes. If they introduce a large number of these changes and still do not have insurmountable problems, they will end up with very flexible, high performance systems. The radiation effects community needs to provide them with a variety of good alternatives.
TABLE
26.
PROJECTED
PROBLEMS
FOR
FUTURE
SPACE
SYSTEMS
I
ACTION
CONSEQUENCE INCREASE
DEVICE
IN SINGLE
EVENT EFFECT$
MORE TOTAL DOSE EFFECTS
CHANGES
MORE DISPMCEMENT DEVICE AND TECHNOLOGY CHANGES
DAMAGE
I INCREASE IN ANOMALIES
I NEW
TYPES OF ANOMALIES
R4DIATION HARD DEVICE AVAHABILITY CHANGES
INCREASE IN ALL RADIATION INDUCED ANOMALIES AND FAILURES
IMPLEMENTATION
TNCREA$E IN ANOMALIES
CHOICES
I DOSE RATE EFFECTS APPEAR
7.0
CONCLUSIONS
AND SUMMARY
The first part of the paper has discussed the various effects which can produce anomalies in space systems concentrating on radiation-induced anomalies. The types of radiation-induced anomali~ discussed were ESD-surface and deep dielectric, SEE, total dose, and displacement damage effects. Other non radiation-induced anomalies were also discussed including operator error, mechanical malfunction, and software errors. The impact that anomalies can have on a space system has been illustrated with a number of examples. The II -47
causes of the anomalies are always important, but are usually very difficult to pin down. There are a number of clues to the causes such as the description and time of the anomaly, orbit, position in orbit, radiation environment and the previous anomalies observed on the spacecraft (or in similar spacecraft). Space environment data provides very valuable clues as to the causes particularly if a solar flare is present at the time or if the spacecraft is in an intense part of the radiation belts when the anomaly occurs or if the spacecraft is in a geosynchronous orbit and the anomalies occur predominately at a specific local time range. The Mars Observer and Clementine anomalies were discussed in some detail to illustrate how a Failure Review Board works and how many possible causes there can be for a complex anomaly. The second part of the paper discussed the new types of military, NASA, and other space systems which are being built or are in the planning stages. The use of new technology was described and the use of COTS and devices in plastic packages was discussed. The impact of the use of new technologies and devicm was also described.
II -48
8.0 REFERENCES
1.
Webster’s New Unive~l Unabridged Dictionary, and Schuster, New York, New York, p 75, 1983
2.
E. G. Stassinopoulos,
3.
D. M. Sawyer and J. I. Vette, “AP-8 Tnapped Proton Environment for Solar Maximum and Solar Minimum,” NSSDC/WDC-A-R&S 7(X)6, GSFC, Greenbelt, MD, (Dee 1976).
4.
M. J. Teague, K. W. Chan and J. I. Vette, “AE:6, A Model Environment of Trapped Electrons for Solar Maximum, ” NSSDC/WDC-A-R&S 76-04, GSFC, Greenbelt, MD, (May 1976) and G. W. Singley and J. I. Vette, “The AE-4 Model of the Outer Radiation Zone Electron Environment, HNSSDC 72-06, GSFC Greenbelt, MD, (August 1972). E. G. Stassinopoulos,
Second Edition,
Sirnon
NASA Goddard SFC, Private Communication,
NASA Goddard SFC, Private Communication,
1978.
1984.
J. H. Adams Jr., R. Silberberg, and C. H. Tsao, “Cosmic Ray Effects on Microelectronics, Part I: The Near Earth Particle Environment, ” NRL Nlemorandum Report 4506, (1981). 7.
J. H. Adams Jr. and A. Gelman, “ The Effects of Solar Flar~ on Single Event Upset Rates”, IEEE Tnm.s. Nut. Sci., NS-31, 1212, (1984).
8.
J. H. Adams Jr. and A. Gehnan, HThe Effects of Solar Flares on Single Event Upset Rat~”, IEEE Trans. Nut. Sci., NS-31, 1212, (1984).
9.
J. H. Allen and D. C. Wilkinson, “Solar-Terrestrial Activity Affecting Systems in Space and on Earth” PUblished in So lar-Terrestrial R-e clicticmsIV Proceedings, Vol 1, pp 75-107.
10.
J. H. Allen and D. C. Wilkinson, ‘Solar-Terrdrial Activity Affecting Systems in Space and on Earthn Pub lished in Solar-Terrestrial PredictionslV Proceed imzs, Vol 1, pp 75-107.
11.
J. H. Allen and D. C. Wilkinson, ‘Solar-Terre&rial Activity Affecting Systems in Space and on Earth” PUblishe(l in So lar-Terr@rial PreclictionsIV Proceedin~s, Vol 1, pp 75-107.
12.
D. A. hlcphe~on and D. R. Schober, “Spacecraft Charging at High Altitudes: The SCATHA Satellite Program, ” presented at the AIAA 13th Aerospace Scienc= Meeting, Pasadena, CA, January 20-22, 1975.
13.
D. A. McPhemon and D. R. Schober, “Spacecraft Charging at High Altitudes: The SCATHA Satellite Program, ” presented at the AIAA 13th Aerospace Scienc~ Meeting, Pasadena, CA, January 20-22, 1975. II -49
14.
S. E. DeForest, “Spacecraft charging at synchronous Gecm hysical Resea rch, Vol. 77, 651 (1972)
15.
M. D. Violet and A. R. Fredrickson, “Spacecraft anomali= on the CRRES Satellite Correlated With the Environment and Insulator Samples”, IEEE Tnm.s. Nut. Sci.,NS-40, Dec. 1993
16.
C. E. McIllwain, “Coordinates for Mapping the Distribution of Magnetically Tmpped Particles”J. Geophys. Res., Vol 66, p 3681, (1%1).
17.
J. T. Wallmark and S. M. Marcus, “Minimum Size and maximum Packaging Density of Nonredundant Semiconductor Devices”, Proc. IRE, ~, 286 (1962).
18.
D. Binder, E. C,. Smith, and A. B. Hohnan, “Satellite Anomalies From Galactic Cosmic Rays, ” IEEE Trans. Nut. Sci., NS-22, 2675 (1975).
19.
T. C. Nlay and M. H. Woods, “Alpha Particle Induced Soft Errors in Dynamic Memories, ” IEEE Trans. Elec. DeV. ED-26, 2 (1979).
20.
J. C. Pickel and J. T. Blandford, “Cosmic Ray Induced Errors in MOS Memory Cells, n IEEE Trans. Nut. Sci., NS-25, 1166 (1978).
21.
C. S. Guenzer, E. A. Wolicki, and R. G. Allas, “Single Event Upset of Dynamic RAMS by Neutrons and Protons, n IEEE Trans. Nut. Sci., NS26, 5048(1979)
22.
R. C. Wyatt, P. J. McNulty, P. Toumbas, P. L. Rothwell, and R. C. F&, “Soft Errors Induced by Energetic Protons, ” IEEE Trans. Nut. Sci., NS26, 4905(1979).
23.
J. H. Allen and D. C. Wilkinson, “Solar-Terrestrial Activity Affecting Systems in Space and on Earth” PUblished in So lar-Terrest rial PredictionslV Proceed inm, Vol 1, pp 75-107
24.
J. H. Allen and D. C. Wilkinson, “Solar-Terr@rial Activit y Affecting Systems in Space and on Earth” PUbli,shed in Solar-Terrestrial PreclicticmsIV Proceed inm, Vol 1, pp 75-107
25.
C. Barillot, 1995.
26.
C. Dyer, ‘In-Flight
27.
T. Goka, S. Kuboyama, Y. Shimano, and T. Kawaanishi, “The On-Orbit hle.asurements of Single Event Phenomena By ETS-V Spacecraft”, IEEE Trans. Nut. Sci., NS-38, 1693 (1991).
“In Flight Observed Anomalies,
Experiments,
II -50
” RADECS
orbits, ” Journal of
RADECS 95 Short Course,
95 Short Course,
1995.
28.
T. Goka, S. Kuboyama, Y. Shimano, and T. Kmwaanishi, “The On-Orbit Measurements of Single Event Phenomena By ETS-V Spacecmift”, IEEE Trans. Nut. Sci., NS-38, 1693 (1991).
29.
L. Adams, E. J. Daly, R. Harboe-Sorensen, R. NickSon, J. Haines, W. Schafer, M. Conrad, H. Griech, J. Merkel, T. Schwall, and R. Henneck, ‘A Verified Proton Induced btchup in Space”, IEEE Trans. Nut. Sci. NS-39, 1804 (1992).
30.
L. Adams, E. J. Daly, R. Harboe-Sorensen, R. Nickson, J. Haines, W. Schafer, M. Commd, H. Griech, J. Merkel, T. Schwall, and R. Henneck, “A Verfiled Proton Induced btchup in Space”, KEEE Trans. Nut. Sci. NS-39, 1804 (1992).
31.
J. S. Mayo, H. Mann, F. J. Witt, D. S. Peck, H. K. Gurnmel, and W. L. Brown, “The Command System Malfunction of the TeLstar Satellite, ” NASA SP-32, Vol. 2, June 1%3.
32.
E. Weuaas, “Spacecraft Charging Effects on Satellites Following the Starf~h Event, ” CSC Report H/3-J-280, February 1978.
33.
R. E. Fischell, “ANNA-lB Solar Cell Damage Experiment, ” Tram ril)t _ of jhe Photovoltaic Suecialists Co fere ce. April 10, 1963, Washington D.C. (Available from Defense Docu;enta;ion Center as PIC-SOL 209/3 July 1963).
34.
W. L. Brown, J. D. Gabbe and W. Rosenzweig, ‘Results Radiation Experiments”, NASA SP-32, Vol 2. June 1%3.
35.
J. H. Allen and D. C. Wilkinson, “Solar-Terrestrial Activity Affecting Systems in Space and on Earth” N blishecl in So lar-Terrestrial Predicticmsrv proceed inm, Vol 1, pp 75-107.
36.
W. J. Stapor, A. B. Campbell, J. Golba, and P. T. McDonald, “Spacecraft Anomalies and Testing: OptO-isolators”, to be published.
37.
W. J. Stapor, A. B. Campbell, J. Golba, and P. T. McDonald, “Spacecraft Anomalies and Testing: OptO-isolators”, to be published.
38.
W. J. Stapor has assisted in the analysis of this correlation. Communication.
39.
Report of the Marx Observer Mission Failure Investigation Board. A Report to the Administrator, National Aeronautics and Space Administration on the Investigation of the August 1993 mission failure of the Itlars Observer spacecraft. Submitted by the Mars Observer Mission Failure Board. 31 December 1993.
11-51
of the Telstar
Private
40.
Report of the Man Observer Mission Failure Investigation Board. A Report to the Administrator, National Aeronautics and Space Administration on the Investigation of the August 1993 mission failure of the Mars Observer spacecraft. Submitted by the Mars Observer Mission Failure Board. 31 December 1993.
41.
“A Clementine
42.
Clementine Report
43.
J. A. Van Allen and L. A. Frank, ‘Radiation Radial Distance of 107,400 km”, Nature, Vol J. A. Van Allen, C. E. Mcllwain, and G. H. Observations With Satellite 1958 Epsilon,” J. (1959).
44.
A. J. Fox, NRL, Private Communication.
45.
C. J. Dale, NRL Private Communication.
46.
C. J. Dale, NRL Private Communication.
47.
P. Marshall,
48.
R. Walters, NRL, Private Communication.
49.
P. hlarshall,
50.
S. D. Clark, J. P. Bings, M. C. Maher, M. K. Williams, D. R. Alexander and R. L. Pease, “Plastic Packaging and Burn-in Effects on Ionizing Dose Response in CMOS MicrocircuitsH, IEEE Trans. Nut. Sci., Vol 42, No. 6, (Dee, 1995).
Collection”,
Spacecraft
NRL/SFA
NRL/SFA
NRL/PU/1230-94-261,
Failure
Report,
June, 1994
June 24, 1994 (IX-aft Version) NRL
Around the Earth to a 183, p 430, (1959) and Ludwig, ‘Radiation Geophys. Res. Vol 64, p 271,
Inc. Private Communication.
Inc. Private Communication.
II -52
1996 NSREC SHORT COURSE
SECTION
III
TOTAL DOSE RESPONSE OF BIPOLAR MICROCIRCUITS
Naval Stiace
David W. Emily Warfh.re Center, Crane Division
TOTAL DOSE RESPONSE OF BIPOLAR MICROCIRCUITS
David W. Emily Crane Division Naval Surface Warfare Center
This work was supported in part by the Naval Surface Warfare Center, Crane Division, and the Defense Nuclear Agency
III-1
III-2
TOTAL
DOSE RESPONSE OF BIPOLAR
MICROCIRCUITS
David W. Emily Naval Surface Warfare Center, Crane Division Technology Development Branch 1.0
2,0 3.0
4.0
5.0
6.0
7.0
8.0
Introduction Bipolar Integrated Circuits in Modern Space Systems 1.1 Historical Perspective 1.2 Fundamentals of Bipolar Transistor Operation Bipolar Transistor Structures Vertical, Lateral, and Substrate Devices 3.1 Fabrication 3.2 Circuit Implementation 3.3 Transistor Total Dose Effects 4.1 Substrate, Sidewall, and Surface Inversion Gain Degradation 4.2 Integrated Circuit Response Digital 5.1 Linear 5.2 Low-Rate Effects Introduction 5.3 Low-Rate Effects Basic Mechanisms Theories 6.1 Transistor and Circuit Response 6.2 6.2.1 Bias Effects 6.2.2 Temperature Effects 6.2.3 Anneal Effects Modeling 6.3 SEM Irradiation 6.4 Hardness Assurance Test Method Development High-Temperature Irradiation 7.1 Post-Irradiation Anneal 7.2 Overtest 7.3 Future Trends and Conclusions
1.0 INTRODUCTION This session of the short course will discuss total dose effects in bipolar devices and microcircuits. Research into total dose effects in bipolar technology has long been overshadowed by CMOS research, especially since CMOS became the dominant integrated circuit (IC) technology for digital circuits during the 1980s. However, bipolar technology has continued to play an important role in mixed-signal and analog microcircuits used in current space systems. Recently, there has been a renewed interest in bipolar total dose effects due to the discovery of a
III-3
low dose rate effect in bipolar transistors and circuits. Considerable information has been presented recently on complex time-dependent effects in numerous bipolar processes. These may cause substantially lower failure levels at low dose rates with possible serious implications for space systems employing such devices. The response of bipolar microcircuits to total ionizing dose can be complex compared to CMOS. This complex response is due to several factors. Several types of bipolar transistors, each with a different total dose response, can be present in a single circuit. Additionally, different methods of transistor fabrication by manufacturers and the sensitivity of analog and mixedsignal circuit designs to small shifts in transistor parameters all contribute to complex IC responses. This discussion will cover the spectrum of bipolar total dose effects information, from basic transistor operation to the most recent microcircuit low-rate response theories, to provide a comprehensive review of total dose effects in bipolar microcircuits. The discussion will start with an overview of the important applications that bipolar integrated circuits have in modern spacecraft power, control, and signal processing systems. A historical perspective of radiation effects in bipolar devices over the years will then be presented. The fundamental operation of the bipolar transistor will be reviewed, along with different bipolar transistor structures, fabrication techniques, and typical circuit design implementations. Next, the dominant total-dose degradation mechanisms for bipolar transistors and how they affect the response of various integrated circuit types will be discussed. This will be used to introduce the recently reported excess degradation of bipolar integrated circuits when irradiated at low dose rates. Several case studies will be presented. Potential basic mechanisms responsible for the lowrate effect will be explored. Techniques used to investigate the low-rate response and results will be presented. These include temperature, bias and anneal tests, circuit modeling, and scanning electron microscope spot irradiation. Progress in the development of a hardness assurance test method for the low-rate effect will be reviewed. This short course section will conclude with a discussion of future bipolar technology trends and the potential effect on total-dose hardness. 1.1 Bipolar Integrated Circuits in Modern Space Systems CMOS has dominated as the fabrication process for digital ICS since the 1980s. For present-day digital microcircuits, bipolar is primarily limited to niche applications such as highspeed cache memories and limited mixed-technology areas such as output drivers for highWith the continual improvement in performance BiCMOS memories and microprocessors. CMOS performance, even these niche applications are rapidly converting to entirely CMOS. Bipolar remains the dominant process for linear and mixed-signal circuits, and BiCMOS is becoming an important process for high-performance analog-to-digital converters (ADCS) and The BiCMOS processes used for ADCS are considerably other mixed-signal microcircuits. different than the digital BiCMOS processes in that they are primarily bipolar or fully integrated bipolar-CMOS processes rather than CMOS with a sub-optimum bipolar device added. Bipolar is the dominant linear process due to performance advantages of higher voltage operation and current drive capability, lower noise, better linearity, and superior device matching. Figure 1 illustrates applications for bipolar microcircuits in a typical space satellite signal processing system. Bipolar microcircuits are the primary ICS used in the modern satellite power, signal processing, and control systems. III-4
The parts lists of five current production satellite systems were reviewed to identify commonly used bipolar linear ICS [1]. This review identified 162 different high-usage bipolar part types for the five satellites. Examples of the major functions that are used include operational amplifiers, analog-to-digital converters, comparators, digital-to-analog converters, analog switches, multiplexer, voltage regulators, voltage references, and pulse width modulators. Bipolar ICS account for 40?Z0to 50?Z0of the total microcircuits used in the average satellite or strategic missile system requiring radiation-hardened components. The electrical performance and parametric degradation of bipolar linear ICS are critical in many system applications. Input bias current or offset voltage is critical for an application such as a sensor buffer amplifier. Unlike digital ICS, where parametric degradation may need to be significant before having an impact on system performance, an offset voltage shift of millivolts or input bias current increase of microamps may cause serious system performance degradation. These small parametric shifts make testing linear and mixed-signal microcircuits response challenging.
ANALOG
& MIXED
Cryo Readout Multiplexer & Signal Processing
.-. r Ocal
High Precision Op-Amps
Signal
Processing
SIGNAL
APPLICATIONS Clock &
Analog-toDigital
Timing
Con;erter
GeneraIOr
Plane A may
High-Speed lnpuUOutput Interface Buffer m
Precision voltage Reference
Cfvlos Memory & Data Processor
/ J 28 Volts In –
Signal PrOcessin~COmputer Power Converter
Figure 1:
voltage Regulator
Bipolar
Microcircuit
Usagein TypicalSatelliteSystems.
1.2 Historical Perspective of Bipolar Total Dose Effects It is interesting to review the development, over the years, of information concerning total dose effects in bipolar devices and microcircuits. The evolution of bipolar technology, especially linear processes and product, has proceeded at a slow pace when compared to CMOS. To calibrate ourselves, the first commercially available monolithic bipolar amplifier to gain wide acceptance (the 709) was first introduced 31 years ago in 1965 [2]. This was followed shortly by the LM 101A amplifier in 1968[3]. This same part, with minor changes in circuit design and fabrication process, is still being designed into current and future satellites 28 years later in 1996.
III-5
As we go through this historical overview, it is also interesting to note that many of the problems then are still problems now. Research and testing performed on devices and circuits from about 30 years ago are still very relevant to the testing and research being performed in 1996. A compendium of papers primarily from the December IEEE Transactions on Nuclear Science concerning total dose effects in bipolar devices and partially used in this review is listed in Appendix A. 1967- Acceptor-Like Surface Recombination States Identified as Dominant Mechanism for NPN Transistor Gain Degradation[4] In this paper by Maier of the Naval Radiological Defense Laboratory, it was reported that, until very recently, the accumulation of positive charge has been reported as the dominant mechanism responsible for the surface effect in oxide-passivated devices. A bipolar planar npn transistor with MOS field electrodes was used to verify the existence of the positive charge effect and to introduce the existence of acceptor-like surface recombination states. The additional surface recombination states were shown to be the dominant mechanism for loss in gain. 1968- Prediction and Selection System for Radiation Effects in Planar Transistors to Identify and Eliminate “Maverick” Devices[5] In this paper by Poch and Holmes-Siedle of RCA, an irradiate-anneal test is developed to identify “maverick” transistors. During testing of many bipolar devices under Cobalt-60 gamma radiation, infrequent cases were noted where the surface damage levels exceeded those predicted. Some devices were anomalously sensitive, with damage levels falling well outside the normal statistical distribution of sensitivity. It was, thus, important to identify a method of detecting these rare, but excessively sensitive, units. These maverick devices could occur even within a production lot, all manufactured and processed in the same manner. The method developed was a 50 krad(SiOz) irradiation followed by a 250”C bake for 16 hours to return the transistor parameters to approximately pre-radiation value. The devices behaved the same on a subsequent irradiation; thus, the irradiation-anneal was a stressing method for pre-selection. 1970- Radiation-Insensitive
Silicon Oxynitride Films for Use in Silicon Devices [6]
A silicon oxynitride film deposited on the surface of a planar npn transistor and then annealed in hydrogen was shown to improve the total dose hardness by up to 30 times in this paper by Schmidt and Ashner of Bell Telephone Laboratories. 1970- Selective Irradiation to Identify Sensitive Junctions, Hardening by Elevated Temperature Irradiation and Eff:cts of Plastic Packaging on Total Dose Hardness[7] A number of topics are discussed in this paper by Bauerlein of Siemens AG Research Laboratories. Selective irradiation of npn and pnp planar bipolar transistors was performed with a narrow electron beam to show that the increase in base current occurs only if the beam is directed to the immediate vicinity of the emitter-base pn-junction. Irradiation of the remainder of the transistor surface had no effect on the current gain. In a second topic, it was reported that npn bipolar planar transistors could be total-dose-hardened if they were irradiated to a high dose at
III-6
the same time the temperature biased.
of the devices is held between 200° and 250°C while forward-
In a third topic, the effects of plastic versus hermetic metal can packaging on the total dose hardness of planar npn and pnp bipolar transistors under various bias conditions were investigated. It was reported that, in all cases, there was no additional degradation of the gain for the plastic-packaged transistors. For the npn transistors packaged in hermetic metal cans and irradiated under collector-to-base reverse voltage, it was reported that there were increased degradation in current gain and increased scatter between individual test units. For the pnp transistors packaged in metal cans and irradiated under an emitter-to-base reverse voltage, it was reported that there was a considerable increase in the degradation of current gain, 1972- Roles of Charge Accumulation Degradation[8] [9]
and Interface States in NPN Surface
The relative roles of the radiation-induced interface states and oxide charges on the surface degradation of npn bipolar planar transistors are explained in these two papers by Sivo of Boeing. Sivo reports that, in general, the new interface states are the more important factor in the surface degradation of npn transistors as long as a strong inversion of the base surface is not developed that causes a “channel” between the emitter and base contact. Charge buildup would then become the primary factor if extensive channeling did develop, which would primarily occur at high doses. It was also reported that a strong inversion of the surface by a high enough positive oxide charge would be substantially retarded by a coincident high density of interface states due to the buildup of negative charges on the acceptor-like surface states in inversion. This effect will cause an arumrent saturation in the charge accumulation with dose. The nonuniform nature of the charge buildup and the charge of the interface states can contribute significantly to low-current gain degradation at low and medium doses. At high doses, once the channeling occurs, the high-cument gain can be seriously affected as well. 1974- Use of a Scanning Electron Microscope for Screening Bipolar Surface Effects[ 10] This is the first reported use of a Scanning Electron Microscope (SEM) to irradiate pnp transistors in a paper by Lipman, Bruncke, and Crosthwait of Texas Instruments and Galloway and Pease of the Naval Ammunition Depot. Correlation of gain degradation was demonstrated between Cobalt-60 and the 20-keV electrons from the SEM.
1975- Total Dose Response of Bipolar Digital Devices [11][12][13] The first papers concerning total dose effects in digital bipolar microcircuits discuss the total dose results on high-speed emitter coupled logic (ECL) gates [11] by Daniel and Coppage of Sandia. Two papers report the effects of total dose gamma irradiation on the electrical characteristics of integrated injection logic (1%) devices and circuits [12] [13] by Pease of Naval Weapons Support Center and Raymond of Northrop Research and Technology Center.
III-7
1975- Testing and Hardening of Linear Microcircuits
14]
Ionizing radiation results are reported for a number of linear circuits, including the LM101A,LM102,LM108,LM111 , LM124, LM139, HA2520, HA2600, HA2620, and HA2700. The irradiate and anneal (IIWN) hardening technique was attempted on a number of part types and was reported to be an acceptable method for a few specific part types, while, in others, there were problems such as excessive degradation for the second irradiation. 1976- Process Investigations
of Total Dose Hard, 108 Op Amps[15]
Testing, analysis, and total dose hardening were performed on a 108 op amp in this paper by Palkuti of NRL and Sivo and Greegor of Boeing. The analysis was initiated with a total dose characterization of 108-type op amps that indicated that total dose sensitive devices could be radiation-hardened by modifying their standard processing. Extensive lot sampling over extended periods of time indicated significant lot-to-lot variation in the total dose sensitivity. The lot sampling indicated that devices from about 30% of the tested wafer lots exhibited Subsequent failure analysis by selective circuit significantly greater radiation sensitivity. irradiation with a SEM identified the critical op amp failure modes as primarily the result of super-beta-gain loss and leakage increase. A process-flow analysis with MOS capacitors on the selected baseline fabrication sequence was followed by a study of specific processing steps on the 108 hardness. Specific process steps were modified and the results presented. Using the defined process, it was possible to fabricate 108-type devices with minimal degradation at 1 Mrad(Si02). 1976- Irradiate-Anneal
Screening of Total Dose Effects[16]
The identification and elimination of maverick devices that exhibit much greater total dose sensitivity than the normal lot hardness distribution is again a topic in this paper by Stanley and Price of JPL. The effects of irradiate-anneal were measured for a number of linear bipolar devices, where normal and anomalous values obtained after the first irradiation and annealing were determined. It was found that, in almost all cases, reirradiation produced substantially greater shifts than the first irradiation. Unusual annealing behavior was also observed on some devices, such as the LM1 11. 1977- Analysis of Total Dose Response in Linear Integrated Circuits[17][18][
19]
A number of papers, through various analysis techniques, attempt to explain the response of linear integrated circuits to total dc~e irradiation. The first by Stanley and Gauthier of JPL [17] uses SEM irradiation at various levels of magnification to identi& sensitive areas and components that are responsible for the total dose degradation. Also discussed is the large variation in total dose response from device to device within the same date code. The second by Galloway and Roitman [18] of the National Bureau of Standards discusses a number of factors that must be considered in using the SEM, such as the depth-dose distribution of kilovolt electrons, dose rate uniformity, and the importance of biasing. The third paper by Johnston of Boeing [19] discusses neutron degradation in four common linear integrated circuits, but the circuit analysis discussed is very relevant to total dose degradation.
III-8
1977- Discussion of a Hard Off-the-Shelf SG1524 PWM and Total Dose Hardness Variation in LM 139s from Four Manufacturers[20] In this paper by Newell and Picciano of Ford Aerospace, the off-the-shelf total dose hardness of the SG1 524 pulse width modulator is analyzed and an attempt made to explain what is responsible for the significant total dose hardness. The wide variation in hardness of LM 139s from four different manufacturers is discussed, along with the test results and proposed explanation for the hardness differences. 1977- Evaluation of Integrated Injection Logic[21][22] Total dose results and analysis are reported on I% devices from five different manufacturers in this paper by Raymond of MRC and Pease of NWSC Crane. The use of bipolar processes for digital circuits, including memories and microprocessors, is rapidly increasing; and this paper and one by Donovan, Simons, and Burger of RTI discuss the future of LSI technologies. A subjective comparison of IS technologies in [21] rates FL slightly ahead of CMOS as the superior process in performance and hardness. Paper [22] also projects an optimistic future for 12L, with CMOS/SOS a close competitor. 1978- Hardness Assurance Considerations Structures[23]
for Total Dose Effects on Bipolar
This work assessed the theoretical understanding of long-term total dose effects in bipolar devices in support of the development of a hardness assurance technique in a paper by Hart, Smyth, van Lint, Snowden, and Leadon of MRC and IRT. The principal effort was directed at studying transistor gain degradation mechanisms by use of models relating semiconductor physical and electrical parameters to surface properties. Total dose effects on surface properties were used to identify critical physical parameters for use in hardness assurance procedures. Model implications and predictions were compared with existing data to evaluate their accuracy and usefulness as a tool. 1979-
Total Dose Homogeneity Study of the 108A Operational Amplifier[24]
Identification of maverick devices exhibiting a significantly lower total dose hardness was the subject of this paper by Johnston and Lancaster of JPL. This study investigated the homogeneity of the radiation response of components and complete 108A circuits at various levels of traceability, including diffusion lot-to-lot, wafer-to-wafer, and sub-wafer. Significant differences were found in the radiation hardness of the different diffusion lots. Even the variability of devices from a single wafer was larger than expected and was comparable to variability of devices from a single diffusion lot. Examination of the circuit data for large numbers of devices from single diffusion lots revealed that a small number of devices, approximate y 1?ZO, had a much different radiation response than the rest of the devices. In some cases, different failure mechanisms were responsible; but some devices simply had a different sensitivity to total dose. No electrical parameter could be identified that could be used to screen these devices.
III-9
,.
1981- SEM Analysis of Total Dose EffectsinAD571 Converter[25]
.
1% Analog-to-Digital
This paper by Gauthier of JPL investigated the total dose degradation mechanisms in the AD571 ADC, Sensitive areas of the circuit were identified by SEM irradiation, and then responsible components were identified. 1981- Total Dose Hardening of 108 Amplifier with Nitride Passivation[26] This paper by Condito, Lambert, and Schwartz of Precision Monolithic investigates the effect of different processing steps on the total dose hardness of the OP- 108A precision operational amplifier. Process lots with and without a silicon nitride surface passivation layer demonstrated that the nitride layer is the dominant step in assuring radiation-resistant circuits. Standard processed parts with the nitride layer demonstrated significantly lower degradation by a factor of 10 to 18 than the parts manufactured without it. 1981- Total Dose Hardness of Integrated Schottky Logic [27][28] Two papers by Johnson of Raytheon and Blice of NWSC Crane reported the total dose hardness of integrated Schottky logic transistors and circuits to be 10 Mrad(Si02). 1982- Total Dose Hardness of FL Logic Devices and SBP9989 Microprocessor[29]
[30]
Total dose hardness of six custom digital circuits designed and fabricated in an 1% process for the Global Position System is reported in this paper by Poblenz of Texas Instruments[29]. The circuits were found to be relatively hard to total doses of 1 Mrad(Si02). Similar hardness results were reported for the SBP9989 microprocessor in a paper by Woods of MIT[30]. 1983- Total Dose Effects in Recessed Oxide Digital Bipolar Microcircuits[3 1][32] Total dose failure levels of as low as 5 krad(Si02) are first reported in oxide-isolated In the first paper by Buschbom of Texas bipolar technologies in these two papers. Instruments[3 1], the failure of an advanced oxide isolated process is identified to be inversion of the base region along the oxide sidewall. In the second paper by Pease of MRC[32], low total dose failure levels of 10 to 100 krad(SiOz) are reported in oxide-isolated bipolar processes from five manufacturers. Failure modes are identified as (1) inversion of the p+ region at the bottom of the recessed oxide, causing channeling between adjacent buried layers; (2) inversion of the ptype base region along the recessed oxide sidewall, causing channeling between the emitter and collector of the npn transistor; and (3) an increase in surface recombination velocity along the sidewall due to fast surface states, causing an increase in p region sidewall currents. 1983- Comparison of Total Dose Effects in Linear ICS from Cobalt-60 and Electrons[33] Total ionizing dose response for fourteen linear IC types from eight manufacturers using Cobalt-60 and 2.2-MeV electrons sources is reported by Gauthier and Nichols of JPL. Electrons
III-10
are reported to almost always cause greater degradation Cobalt-60.
and lower device failure levels than
1984- Degradation Analysis of Lateral PNP Transistors[34] Degraded current gain due to x-ray irradiation in lateral pnp transistors is analyzed by Kato of Hitachi CentraI Research Laboratory. Base current is evaluated for two regions -- the depletion region of the emitter-base junction and the nondepleted charge-neutral region of the base surface. 1985- Total Dose Induced Hole Trapping and Interface State Generation in Bipolar Recessed Field 0xides[35] In an extension of the previous year’s reported work[32], the total-dose-induced trappedhole density, N.t, and interface state density, Nit, are investigated in bipolar recessed field oxides using test structures. New results include the effects of pn junction fringing fields on inversion voltage shift, 1987- Models for Total Dose Degradation of Linear Integrated Circuits[36] Mechanisms for total dose degradation of linear integrated circuits are discussed, including bulk effects, oxide charge buildup, and recombination at the Si-SiOz interface in this paper by Johnston and Plaag of Boeing. The dependence of damage on bias, dose, particle type, and energy is used in conjunction with two-dimensional modeling to identify the failure mechanism in a specific linear device type. The importance of surface recombination, along with the absence of bias dependence, is discussed in this paper. Substantial differences between Cobalt-60 and electrons are shown to be due to bulk darnage added to the surface darnage in the wide-base lateral and substrate pnp transistors. 1987- Total Dose Failure Levels for Circuits Using Bipolar Recessed Field Oxide Processes[37] [38] [39] The total dose failure levels for circuits fabricated in bipolar processes using recessed field oxide isolation are discussed in three papers. The papers by Schi~37] and Maurer[38] both discuss the hardness of these processes at low dose rates typical of space operation. Both papers report significantly higher failure threshold when the imadiation tests are conducted at low dose rates. This is due to the recessed field oxide failure mechanism and the annealing behavior. In the paper by Titus[39], the total dose failure threshold uniformity is mapped for several wafers. These include both standard processed wafers and ones with an enhanced field implant to improve total dose hardness. 1988- Bipolar Integrated Circuits Fabricated on Oxygen Implanted Silicon-on-Insulator Wafers[40] This paper by Platteter of NWSC Crane and Cheek of Texas Instruments describes the radiation improvements obtained by fabricating bipolar digital integrated circuits on oxygenMultiple low dose oxygen implants were used to implanted silicon-on-insulator substrates. 111-11
fabricate the wafers. Total dose hardness was reported to be significantly circuits compared to the bulk circuits.
higher for the SOI
1988- Comparison of Heavy Ion, Electrons, and Cobalt-60 Degradation Effects on an Advanced Digital Bipolar Process[41] Results are presented in this paper by Zoutendyk and Goben of JPL and Bemdt of Honeywell on measurements of the degradation effects of bromine heavy ions, electrons, and Cobalt-60 radiation on the current gain of bipolar transistors fabricated in an advanced process. Bromine was reported to produce the most degradation and Cobalt-60 the least. 1989- Total Dose Effects in Trench Isolation[42] Total dose effects in polysilicon-fflled trenches used for component isolation in an advanced commercial bipolar technology are reported on for the fust time in this paper by Enlow and Pease of MRC and Combs and Platteter of NWSC Crane. Test structures were designed and processed in an advanced trench-isolated bipolar process to study the total dose basic mechanisms. The generation and annealing behavior and effect on bias on Nil and NOt are reported. 1990- Radiation Hardening of BCDMOS Technology[43] The hardening of a power integrated circuit technology, containing bipolar, CMOS, and DMOS components, is reported in this paper by Desko, DarWish, Dolly, and Goodwin of AT&T and Titus of NWSC Crane. 1991- Low Dose Rate Effects in Advanced Bipolar Processes[44][45] The first report of increased gain degradation in bipolm devices irradiated at low dose rates is reported in this paper by Enlow and Pease of MRC, Combs of NWSC Crane, and Schrimpf and Nowlin of University of Arizona[44]. Implications of this finding on the 1019.4 test method are discussed in a paper by Fleetwood, Winokur, and Meisenheimer of Sandia[45]. In the paper by Enlow, total-dose-induced gain degradation in polysilicon and crystalline emitter The effects of bias, dose rate, and anneal temperatures are bipolar transistors is investigated. The bipolar test structures reported on are from four commercial technologies. discussed. Polysilicon emitter transistors show improved hardness over crystalline emitter transistors. Two of the three polysilicon technologies tested showed a continued decrease in gain during room temperature annealing, which may be due to post-imadiation buildup of interface traps. The gain degradation at a given total dose significantly increased as the dose rate was lowered, suggesting the existence of a dose rate phenomenon. The difference in dose for a given gain degradation at 1,1 versus 300 rad(SiOz)/s irradiations was near 50 for one of the polysilicon technologies. Both this paper and Fleetwood[45] suggest that Test Method 1019.4 hardness assurance test may not represent a worst case test for space environments.
III-12
1992- Trends in the Total Dose Response of Modem Bipolar Transistors[46] Factors that influence the total dose response of bipolar transistors, including emitter bias, transistor polarity, emitter technology, emitter geometry, base design, and irradiation dose rate, are reported in this paper by Nowlin and Schrimpf of University of Arizona, Erdow of MRC, and Combs of NSWC Crane. Advanced polysilicon and standard emitter transistors of both npn and pnp polarity processed in a trench-isolated, silicon-on-insulator technology were the subject of this study. Findings that were reported (for this technology) include: npn transistors degrade more than pnp transistors; devices with higher surface concentration degrade less; devices with smaller emitter perimeter-to-area ratios degrade less; collector bias does not affect gain degradation; reverse bias on the emitter is worst-case bias; increases in base current are larger at smaller base-emitter voltages; poly-ernitter devices are initially harder than standard emitter devices at low doses, but may become worse at large total doses; and degradation is worse at low dose rates. 1993- Charge Separation and Low Rate Effects h“Bipolar Transistors[47] [48] The role of net positive oxide-trapped charge and surface recombination velocity on excess base current in bipolar transistors is the subject of the paper by Kosier of University of Arizona[47]. Using test structures and transistors, two simple approaches for separating the effects of oxide charge and surface recombination velocity in bipolar transistors is described. Both are based on analysis of the log plot of the excess base current versus the emitter-base voltage. In the paper by Nowlin of the University of Arizona[48], the dose rate dependence of bipolar current-gain degradation is mapped over a wide range of dose rates for the fwst time. Annealing experiments following irradiation showed negligible change in base current at room temperature, but significant recovery at 100”C and above. Implications reported are that Test Method 1019.4 irradiate-anneal will not predict the worst case gain degradation. 1993- Converting a Bulk Radiation-Hardened Dielectrically Isolated Process[49]
BiCMOS Technology into a
The development and radiation test results of a dielectrically isolated radiation-hardened BiCMOS process are reported in a paper by DeLaus of Analog Devices, Emily and Mappes of NSWC Crane, and Pease of RLP Research. The process is fabricated on a bonded-wafer siliconon-insulator (S01) substrate and employs deep trenches for complete oxide isolation. Total dose hardness of the process was not affected, and the dose rate hardness exhibited substantial improvement. 1994- Low Rate Total Dose Response of Bipolar Transistors[50][5
1][52][53]
Investigations into identifying, understanding, and assessing the implications of low dose rate effects in bipolar transistors increased greatly in 1994 with three papers on this topic. A fourth paper reported on synergetic effects of radiation stress and hot carrier stress on the current In the first paper by Kosier of University of Arizona[SO], it is gain of bipolar transistors. reported that the excess base current in an irradiated bipolar transistor increases superlinearly with total dose at low total dose levels. In this regime, the excess base current depends on the charge-trapping properties of the oxide that covers the emitter-base junction. At higher total dose III-13
.
,,
-..
levels, the excess base current saturates at a value that is independent of how the charge accumulates. In the second paper by Fleetwood of Sandia[5 1], a physical model of the mechanisms responsible for enhanced low-rate gain degradation in bipolar devices is presented. The model suggests the presence of slowly transiting or metastably trapped holes in the bulk of the oxides. These act in tandem with more deeply trapped holes near the Si-Si02 interface to reduce the charge yield in the bulk of the oxide and increase the number of compensated holes in the oxide above the emitter-base junction. In the third paper by Nowlin[52], an apparent saturation of the dose rate dependence for gain degradation at dose rates below 10 rad(Si02)/s is reported. The results on elevated temperature irradiation are discussed and proposed as a possible hardness assurance test method. 1994- Low Rate Effects in Bipolar Circuits[54][55] [56][57] Four papers discuss the total dose response of bipolar linear and mixed-signal microcircuits, including total dose dose rate sensitivities. In the paper by Johnston[54], total dose degradation and dose rate effects in conventional transistors and linear integrated circuits are discussed. It was reported for some circuits and processes that the dose rate dependence may continue as low as .005 rad(Si02)/s. It was also presented that elevated temperature irradiation of a microcircuit at 60”C produced more degradation than at the same dose rate and normal temperature, but did not bound the low-rate degradation. In a paper by Beaucour[55], total dose response of LM 137 voltage regulators from four manufacturers is reported. Enhanced low dose rate degradation is reported in two of the manufacturers’ circuits. Selective SEM irradiation and mechanical probing were used to identi~ a lateral multi-collector pnp transistor as the sensitive circuit element. In a paper by McClure[56], total dose response of three bipolar linear microcircuits is reported at various dose rates, In all cases, lower dose rates produced greater degradation. In some cases, anneals following the high dose rate irradiation caused further degradation, but did not bound the low rate response. In a paper by Lee[57], the total dose response of three high-resolution BiCMOS analog-to-digital converters is reported at various dose rates. The dominant failure mode reported was due to the thick gate oxides. Rebound effects caused the response to be markedly different at high and low dose rates. 1995- Low Rate Effects in Bipolar Transistors[58] [59] [60][61] Low rate effect studies continue to be strong with three papers on the topic of low rate effects in bipolar transistors and a fourth on the effect of irradiate and anneal cycling. In the paper by Schmidt[58], the total-dose-induced gain degradation of lateral, substrate, and vertical pnp bipolar transistors is reported at various dose rates. Physical mechanisms are proposed to explain the greater degradation at low rates. In the paper by Schrimpf159], the gain degradation of a lateral pnp transistor is measured after irradiation at various elevated temperatures and several dose rates and then annealed at various temperatures. It is reported that the excess base “current for irradiation at 125°C and 167 rad(Si02)/s is nearly equal to that at 0.1 rad (Si02)/s. In the Belyakov[60] paper, a MOS test structure is used to identi~ the mechanism believed responsible for the increased low dose rate degradation in bipolar transistor gain, Shallow electron traps are reported to influence an increase in the positive charge yield. In the paper by Witczak[61], it is reported that gain degradation due to ionizing radiation in complimentary single-crystalline emitter bipolar transistors grew progressively worse after repeated cycles of
I-II-14
.
irradiation and anneal. A correlation is drawn between this degradation and mechanical stress in the oxide. 1995- Low Rate Effects in Bipolar Microcircuits[62] [63] The total dose response of five bipolar linear microcircuits from several manufacturers and irradiated at various dose rates and bias conditions is presented and analyzed in the paper by Johnston[62]. In one circuit, saturation damage is reported to be about 10 times greater at low dose rates than high dose rates, In another device, the dose rate response was reported to be very nonlinear with only minimal degradation occurring at 50 and .005 rad(Si)/s, but large degradation occurring at .002 rad(Si)/s. In other devices, the enhanced low rate degradation was reported to be very bias-dependent. In a paper by Carrii3re[63], total dose irradiation and anneal response is reported on several bipolar linear devices. Enhanced low rate degradation was reported on several devices, but post-irradiation anneal did not correlate with low rate sensitivity. 1995- Total Dose Response of BiCMOS and SiGe Processes[64][65 ][66] The total dose response of high-resolution analog-to-digital converters is reported in a paper by Lee[64]. Significant total dose hardness variation was reported with date code and dose rate. The cause of the low dose rate sensitivity was reported to be due to the CMOS circuit elements. The total dose response of a DC/DC Converter fabricated in a high-voltage bipolarCMOS process is reported in a paper by Titus[65]. Minimal degradation is reported to a total dose of 9 Mrad(Si02). In a third paper by Babcock, total dose response of high-perfommnce SiGe heterojunction bipolar transistors is reported. Minimal gain degradation was observed to 1 Mrad(Si) for transistors irradiated at 300”K and 77”K.
2.0 FUNDAMENTALS
OF BIPOLAR TRANSISTOR
OPERATION
This section will give a basic overview of bipolar transistor operation and commonly used For a more detailed description of the operation of bipolar terminology and definitions. transistors, there are many excellent sources of information, such as Physics of Semiconductor Devices by Sze. For comparison, MOS transistors are a majority carrier device with the current flowing through a channel formed between two junctions and controlled by an electric field from a gate electrode. By contrast, bipolar junction transistors are minority carrier devices with current flow through a base region between two semiconductor junctions and controlled by charge injection into the base region. Figure 2 shows the symbols and nomenclatures of the two polarities of bipolar junction transistors. In the following discussion, the more common npn transistor will be described; the results are applicable to the pnp transistor with an appropriate change of polarities.
rrI-15
,.
EMITTER
,
BASE COLLECTOR
~
-’
(a)
p-n-p
TRANSISTOR
(b)
n-p-n
TRANSISTOR
Figure2 Symbolsand Nomenclaturesof npn and pnp Transistors.
The normal operating conditions for an npn transistor are with the base emitter junction forward-biased and the collector base junction reverse-biased. Under these conditions, the transistor is said to be operating in the active region. For our transistor, this mearts that minority carrier electrons are injected into the p-type base region. The predominant current flow in the transistor is from collector to emitter and is controlled by the small base current due to the forward-biased base-emitter junction and which increases strongly as the forward voltage at the emitter junction increases, The gain (Q of a transistor is determined by the percentage of the electrons that escape recombination and transit the base region from collector to emitter divided by the base current. The base current is determined by a number of factors, but is primarily determined by the rate at which holes are lost from the base by injection across the emitter junction and the rate of hole recombination with electrons in the base, In each case, the lost holes must be resupplied through the base current 1~. For our purposes, the base current IB is divided into two components to analyze the effects of radiation -- a bulk component (_&ulk)and a surface component (I,Uti.u). For radiation effects concerns, changes in the bulk component of IB are primarily due to recombination centers in the base region, such as from neutron or electron displacement damage effects. The primary concern for total ionizing dose damage is the increase in the surface component of the base current. This will be discussed in more detail in section 4.2. B= ~/IB = ~(I~ul~+ ItiW=)
(1)
A Gummel plot is the primary means of showing the base and collector currents. In a Gummel plot, log 1, and log k are plotted versus V,,. The transistor gain ~fl,) is derived from the Gummel plots and is plotted as current gain versus V~ or L. Examples of these plots are shown in Figure 3.
III-16
1.0
IN 1 0-s
.
0.8
8
Iv 10-7
1 o~
lN 10-10
)
10-11
0.0
10-12
0.4
0.5
0.6
0.7
0.6
0.9
0.4
VEB(-v)
I
I
I
I
0.5
0.6
0.7
0.6
0.9
Vm (v)
Figure 3 (a) ExampleGummelPlots of IBand ~; (b) DerivedCurrentGainCurve.
3.0 BIPOLAR TRANSISTOR
STRUCTURES
Due to the long product lifetimes of some bipolar processes, there are currently many different bipolar processes being used to fabricate linear and mixed-signal microcircuits. The radiation response of the different processes and different transistors within each process may vary significantly. Factors that may influence the radiation response include transistor vertical geometry, layout, presence of electrical fields due to field plates and other vertical fields, fringing fields, surface doping concentration, surface oxide quality and thickness, and many other factors. This chapter will attempt to give a brief overview of the principal transistor devices currently used, fabrication differences that may influence the radiation response, and common circuit implementations. One rule that should be applied throughout this study is to always assume there is an exception to every rule. There are many variations to the devices and processes that are discussed in this chapter, so firsthand knowledge of the device of interest is important to understand the radiation response. 3.1 Vertical,
Lateral,
and Substrate
Transistors
The first major class of devices that we will discuss is the vertical, lateral, and substrate transistors used in the classical junction-isolated, medium-voltage, linear process. This process has evolved slowly since it was introduced in the late 1960s and is still commonly used to
III-17
.,
,,
,
fabricate the standard 40-volt linear product such as the LM- and OP-series of microcircuits. Figure 4, from [36], shows a cross section of the three common transistors implemented in this process.
EmitterBase Collector o
0
0
Emitter
Collector
0
0
w
p+
Base
1 p+
n-epi n+ p-substrate
p-substrate
Vertical npn
Lateral pnp
(a)
(b)
Emitter ?
Base ?
P2ss25! Substrate pnp (c)
Figure 4 Cross Sectionof TransistorStructuresUsed in MediumVoltageLinearICS
ShowingDominantCurrentFlow Paths.[36]
The vertical npn transistor shown in Figure 4(a) is typically the most common circuit element used in the medium-voltage linear circuits. The npn transistor is a high-gain vertical structure with minimal base surface area. The dominant current flow in this device is vertically, so surface oxides are expected to affect only the minor lateral component of the base current. The lateral pnp transistor shown in Figure 4(b) is a suboptimal device using the same diffusions that were optimized for the vertical npn. The lateral pnp transistor is formed with a buried layer that inhibits vertical current flow to the substrate (for this structure the substrate transistor is a parasitic transistor that reduces the gain). Typically. 95% to 97% of the current is in the lateral direction. Most lateral transistors use a collector ring or square that surrounds most of the emitter and confines the base area to the region between the two diffusions. Because the base surface in the active transistor region is predominantly low surface concentration n-epi, a field plate is commonly used to control surface effect. The field plate was first developed due to the low-quality surface oxides in the early days of IC fabrication and the tendency for ionic contamination to invert the base region, The field plate is typically an extension of the emitter metallization over the base region, but separated from it by the surface oxide. The presence or absence of a field plate and the surface layout and bias can have a strong effect on the total dose response of the lateral pnp. The substrate pnp transistor shown in Figure 4(c) is designed so that the dominant current flow occurs vertically. No buried layer is used. However, current can also flow laterally to the isolation diffusion, which is also the collector of this transistor. Depending on the base width of the vertical pnp component versus the lateral pnp component, the ratio of
III-18
vertical-to-lateral current flow can vary. Typically, the vertical component of the current flow is about 80% and the lateral about 20’-ZO.The substrate transistor also typically has a larger base surface area than the lateral transistor and so is sensitive to surface damage. Field plates are also used on some substrate devices to control surface effects and, thus, can also be a significant influence on the total dose response. A more modern bipolar process used to fabricate linear and mixed-signal ICS capable of operating at 10 volts or less and at circuit frequencies of up to 1 GHz is shown in Figure 5. This class of processes typically contains both high-performance vertical npn and pnp transistors. The Lateral cross section in Figure 5 shows devices fabricated on buried oxide SOI substrates. isolation is achieved with etched vertical trenches that have a regrown liner oxide and which are then refilled with polysilicon. Variations of this process, and especially earlier processes, may be processed on bulk wafers and employ recessed field oxide isolation. Although not common, a lateral pnp transistor may be implemented in these processes when a higher voltage device is required than the vertical npn or pnp transistor can support. The vertical npn and pnp transistors shown in Figure 5 typically have cutoff frequencies (f-r) from 2 to 10 GHz. The transistors have much shallower junctions and narrower bases than the previously discussed process due to the use of ion implantation, rather than diffusions, to form the junctions. The base is, typically, formed as a two-step process with a shallower and lighter implanted intrinsic base to form the active base region and a higher-doped extrinsic base to decrease base resistance and improve efficiency. This extrinsic base also gives a higher-doped surface concentration to reduce surface effects. The surface oxide at the critical emitter-base junction can vary greatly, depending on the process, resulting in much different radiation response. This will be discussed in more detail later.
PNP
NPN
1
I
Figure 5 DeviceCross Sectionof a Trench/SOIComplementaryBipolarProcess.[67]
111-19
,.
,
A major advance in bipolar technology has been the development of polysilicon emitters. Figure 6[44] shows the difference between a typical crystalline emitter npn transistor Fig. 6(a) and a polysilicon emitter npn transistor Fig. 6(b). The speed of the polysilicon emitter transistors, as measured by fT, is increased to 15 to 25 GHz for a 5-volt device. For 3.3-volt devices, the fTcan be increased to 30 to 50 GHz by scaling. N+ Polysilicon ilicon
Collector
N+
N+
Collector
(a)
(b)
(a) TypicalCross Sectionof a CrystallineEmitternpn Transistorand (b) TypicalCross Sectionof a PolysiliconEmitternpnTransistor.[44]
Figure 6
3.2 Bipolar Transistor Fabrication The principal fabrication factors that contribute to the total dose response of bipolar transistors are the quality and thickness of the surface oxide, especially at the emitter base junction and over the base area; the base and emitter doping concentration at the Si-Si02 interface; and, for displacement damage effects, the base width and lifetime. For the medium-voltage linear process shown in Figure 4, successive depositions (or, in some cases, implants) are performed followed by high-temperature diffusions. Successive layers of oxides that have been exposed to the different dopants are grown during these hightemperature steps and are, thus, heavily damaged. For example, the junction isolation diffusion that drives the p-isolation to the substrate is performed at 1200”C for five hours. Whether the oxide grown during this lengthy process step is removed for subsequent processing has been demonstrated to have a significant effect on total dose hardness[ 15]. Other fabrication steps that have been demonstrated to affect the total dose hardness are densification of the CVD after emitter formation, the use of a forming gas during emitter anneal, and the use of phosphorusdoped glassivation. It has been demonstrated that replacement of the surface oxide with a silicon nitride surface passivation layer can significantly improve total dose hardness [26]. There is no standard bipolar process, and variations between manufacturers may cause significantly different total dose responses for electrically similar transistors.
111-20
Crystalline emitter transistors are fabricated by direct ion implantation of the base and emitter regions. Direct implantation limits how shallow the junctions can be formed and also the base width that can be achieved. Also, metal contacts cannot be used on junctions that are less than 1 pm in depth. Typically, the surface concentrations of ion-implanted bipolar transistors are higher than diffused junctions. The oxide over the emitter-base junction region is also thinner due to the shorter process times associated with ion-implanted junctions compared to diffused junctions. One factor that varies between manufacturers is the amount of damage that is done to the surface oxide during the implant steps. In some processes, the surface oxide is removed in the emitter base region and replaced with a composite grown and deposited spacer oxide. This can create a large difference in radiation response. Polysilicon emitter transistors have several performance advantages over the crystalline Polysilicon can contact shallower emitters than with metal contacts. As emitter transistors. vertical device scaling produces shallower emitters, base current increases due to carrier Polysilicon emitters eliminate this high recombination at the metal.lsilicon interface[67]. recombination interface and also can be used as a diffusion source to form ultra-shallow emitters. The most advanced bipolar processes employ a self-aligned, double-polysilicon transistor. The double-polysilicon transistor uses a second layer of polysilicon to contact the base region. Advantages include reduced base area and, therefore, reduced collector-base capacitance. This also allows closer base-emitter contact and reduced base resistance. The radiation tolerance of polysilicon transistors has been demonstrated to be significantly better than crystalline emitter transistors[46] [50]. 3.3 Circuit Implementation Circuit design can have a significant effect on the total dose response of linear and mixedsignal circuits. Especially important is how the total dose sensitive lateral and substrate transistors are used in the circuit design. Some circuits, such as the LM1 11, LM139, and LM 124, use a substrate or lateral pnp directly in the input stage. The LM111 input stage is shown in Figure 7[36] along with the increase in input bias current. In this type of design, the input bias current k inversely proportional to the substrate transistor gain so that, as the transistor gain decreases, the input bias current increases.
111-21
~ +
Cobalt-60 2 MeV Electon
150
150
LM1l 1 Input Stage
Vm
100 A
50
Input
:
0
o 0
50
100
150
200
250
Total Dose (krad(Si)) Figure 7 LM111Input Bias Stageand InputBias Currentvs. Total Dose.[36]
Lateral and substrate pnp transistors are often used in current sources that are internal to the input or output stage and, thus, are difficult to monitor for degradation. These current sources are designed to tolerate a wide range of gain. However, even in these applications, once the gain drops below a threshold level, the circuit will fail suddenly and catastrophically. This has been reported for the LM 108 operational amplifier[36] and the LM 137 voltage regulator[55]. In the case of the LM 137, a multi-collector lateral pnp, used as a current source in the startup stage, dropped below a critical gain value of 8, causing sudden failure. In many linear circuits, careful matching of components is required for proper circuit operation. In these circuits, identical transistors are laid out symmetrically to minimize any It has been reported[ 17] that small unbalancing due to temperature and loading effects. differential changes in gain degradation for these balanced pairs can cause substantial degradation of circuit performance. Different gain degradation in identical matched devices may be due to the influence of internal bias differentials during irradiation or to nonhomogeneous transistor radiation response. It has been reported that the assumption of homogeneous wafer response has limited validity [24]. In many cases, the uniformity of the total dose response of devices from a single wafer was substantially larger than expected and no better than that of several wafers from the same diffusion lot.
4.0 TRANSISTOR
TOTAL DOSE EFFECTS
Transistor total dose response is dependent on the type of process used for fabrication. In this section, the transistor response to total dose irradiation for each of the major processes The first section will deal with surface inversion discussed previously will be presented.
III-22
mechanisms, transistors.
and the second section will deal with gain degradation
in the different types of
4.1 Substrate, Sidewall, and Surface Inversion Inversion can occur in several locations in an oxide-isolated process. Typically, it occurs where a lightly doped, p-type silicon layer is adjacent to a thick field oxide. Figure 8 [35] shows a bipolar transistor with recessed field oxide isolation and several potential inversion regions.
=zz
X= N+ BURIED LAYER
N+ BURIED LAYER
~~ P SUBSTRATE
Figure 8 Typical Cross Section of Recessed Field Oxide Bipolar Transistor with Walled Emitters.[35]
Location 1 of Figure 8 is the inversion of the p+ region at the bottom of the recessed oxide, which causes channeling between adjacent buried layers. Inversion of p-type silicon at an Si02 interface is due to positive-type charge trapped in the SiOz next to the interface that depletes the p-type silicon surface to a maximum value. This was reported in five digital bipolar processes in [35] with failure thresholds as low as 5 krad(SiOz). Location 2 of Figure 8 is the inversion of the p-type base region along the recessed oxide sidewall, causing channeling between the collector and emitter of the npn transistor. Figure 9 shows a detailed view of the inversion area. Similar positive charge trapping in the recessed field oxide, as with the substrate inversion, causes the sidewall inversion. This failure mode was reported in two digital bipolar processes in [35] and was recently reported in [75] for a BiCMOS process where inversion of the base region of a walled npn transistor occurred. The tran~istor layout was redesigned to nest the emitter so that the emitter implant falls entirely within the active area. Devices with the nested emitter were fabricated and total dose testing performed. No collector-to-emitter leakage was observed to 1 Mrad(SiOz) for the redesigned, walled, bipolar transistor.
III-23
Inversion
n+
Emitter
p-Base
n-Epi
n+DUF
)
p-Substrate
Figure 9 Inversion Mechanism in Walled EmitterTransistor.[31]
Inversion can also occur in more advanced isolation methods and has been reported in trench isolation structures [42]. Figure 10 [42], shows a deep trench that has been formed by reactive ion-etching a deep groove through the n-epi and n+ buried layers and into the p-substrate. A p+ channel stop is implanted into the bottom of the trench, and then a thin liner oxide is grown. Under ionizing radiation, positive charge is The trench is finally refilled with poly-silicon. generated and trapped in the liner oxide, which -- depending on oxide thickness, trapping characteristics, and the doping concentration of the p-substrate and p+ channel implant -- can invert the p areas and cause a channel between the two n+ buried layers. In all cases of inversion, the formation of an inversion layer is strongly bias-dependent and is aided by a positive electric field.
Surface Field Oxide
Metallization
";j;;jjjj;~j.!!!; .:.".`.'}.`.'.".f {{ 101 cm-3). Thus, in this structure, the effects of Nox and Vsutioppose rather than enhance each other; and the dependence of ~B with dose is sublinear rather than superlinear. However, the conventional diffused/impla.nted emitter of the lateral pnp shown in Figure 4b has a surface doping density comparable to the base Thus, in the conventional lateral pnp transistor, MB is region of a vertical npn transistor. expected to consist of a term, due to depletion of the emitter, that would be comparable to the MB of the npn. Since the principal component of emitter current is lateral rather than vertical, MB is expected to be much higher than in the npn transistor, resulting in much greater gain degradation. Efforts are underway to model this structure in PISCES. The dependence of/& and P on dose at high dose rates has been studied as a function of the process, device design, and test parameters in [46]. Each one will be addressed individually here, but remember that there may be complex interactions in a real device. Transistor polarity is the first factor considered. From the previous discussion, the positive oxide charge and interface states interact over the p-type base in the npn transistor to cause significant base current increase. In the pnp case, the positive charge and interface traps offset and result in less base current decrease. In general, with all other factors being equal, a pnp transistor will degrade less than an npn transistor. Figure 14 [46] shows the relative difference between similar vertical npn and pnp transistors.
III-28
1.0
-u----..
-::
-0.
-.
0.0 i
10
-..
“.
-,
L
-----u. -0.
‘.
. .
.
-u .
- v.
1
●
1 0.6
$ 0.4 0.1
02
0.0
0.01
10
Im
Total Dose (krad(Si)) Figure 14 Total Dose Response of Similar
Vertical
npn and pnp Transistors.[46]
A second factor is oxide thickness, especially over the emitter-base junction area. In general, the thicker the oxide is over the base and base-emitter junction areas, the greater the total trapped charge and the larger the increase in base current. This has been reported in [62]. A third factor is the charge-trapping characteristics of the oxide over the emitter-base junction area. It has been shown by several[ 15][51 ] that oxide damaged during the fabrication process and left over the emitter-base area can have significantly higher charge-trapping efficiency than an oxide grown after most of the processing steps. This was demonstrated in [15] when the oxide that was present during the isolation diffusion process ( 1200”C for five hours) was stripped and regrown. It has also been demonstrated in [51] screen oxides that were implanted through and characterized. A fourth factor is the surface doping concentration, especially for the base and emitter areas. The more heavily doped the base or emitter surface is, the less depletion effect that the trapped charge will have on the surface. This is shown in Figure 15(a) [46] where a highly doped p+ ring was added to the base. More improvement would be obtained by increasing the doping of the entire base surface, but this would have a negative impact on the breakdown voltage. Figure 15(b) [46] shows the improvement with a higher-doped emitter. Since these are two different emitter technologies, other factors may also be influencing the improvement in hardness, such as improved emitter efficiency.
III-29
10
1.0
0.8 1 0.6
0.6
$
9’0.4
0.4
0.1
0.1
0.2
0.2 a
0.0
,
I
r
I , ,I
10
,
8
,
0.O1
AIBflBoPoly-Emitler
0.0
100
0.01 10
100
Total Dose (krad(Si))
Total Dose (krad(Si))
Figure 15 (a)Effect of Increased Base Doping Ring on Improved Hardness (b) Comparison of Poly-Emitter with Silicon Emitter.[46]
A fifth factor Since the majority of the perimeter-to-area Figure 16 [46] shows
that influences transistor hardness is the emitter perimeter-to-area ratio. the increased base current occurs at the base-emitter perimeter, minimizing ratio will result in less base current increase and less gain degradation. the improvement from reducing the perimeter-to-area ratio.
1.0
$ ..::
0.8
e: ::....
L
o.. -0.
---
10
- n.
O.--”.q ..
-m
. ●
1 0.6
Q*
z
n=l
0.4
4 7
0.1
0.2 (
0.01
0.0 ~ 10
100
Total Dose (krad(Si)) Figure 16 The Emitter Perimeter-to-Area Effect on the Total Dose Response of an npn Transistor.[46]
111-30
Transistor geomehy can be a significant factor in transistor total dose response. Especially important is the ratio of lateral current flow to surface current flow in the base area. In general, vertical transistors will be harder than surface devices. Figure 17 shows the total dose response of a vertical, substrate, and lateral pnp. In the vertical device, almost all of the current flow is in the vertical direction; and only a few percent is at the surface. For the substrate device, we previously showed that about 80?10of the current flow was vertical and 209i0 lateral. For the lateral device, the majority of the current flow is in the lateral direction. In Figure 17, we see that the vertical pnp transistor has the least degradation; the substrate pnp transistor degradation is second; and the lateral pnp transistor has the most degradation.
1.0
‘“O~ c
~
0.5
EzEl 0.0
~
LPNP(167
rsd(SIOJ/s)
~
SPNP(167
rsd(SIOJ/s)
~
VPNP(156
rsd(SIOJ/s)
I 1“0
1
I
101
l&
Total Dose
f
0.0
1P
(krad(SiOJ)
Figure 17 Normalized Current Gain (13)vs. Total Dose for Vertical, Substrate, and Lateral pnp.
Electric field in the oxide can be a significant factor in the total dose response. Determining the electric field can be quite complex, since there are nonuniform fringing fields induced by the junctions and vertical fields formed by any metallization or polysilicon layers that may lay on top of the device or area of interest. It has been shown previously that the field plates that are on most lateral and substrate devices can substantially improve the hardness. Many early experiments with gated transistor structures showed a strong dependence on total trapped charge and applied electric field. Figure 18 [46] shows the effect of different biases on a vertical npn For this transistor and layout, reverse bias during irradiation caused larger transistor. One must carefully analyze the device being tested to make sure all “parasitic” degradation. vertical fields that may be induced by overlying conductors are accounted for, because they may dominate a fringing field. Another factor in the total dose hardness is the injection level at which the gain degradation is measured. In almost all cases, gain degradation is worst at lower injection levels.
III-3 1
This occurs because there are increased surface effects at low injection levels. This can be observed in the L degradation at low V,E values for the Gummel characteristics in Figure 19, where the effects of characterizing a device at different injection levels is shown. One last factor that is important, but will be discussed in Section 6, is the dose rate and temperature at which irradiation is performed. It will be shown later, depending on many of the factors discussed here, that there may be increased degradation at low dose rates. In summary, factors which affect bipolar transistor hardness include: ● Transistor polarity (npn or pnp) ● Oxide thickness over base-emitter region ● Oxide trap efficiency ● Vertical electric field ● Fringing electric field ● Base surface concentration ● Emitter surface concentration ● Emitter perimeter-to-area ratio ● Transistor geometry (ratio of lateral to vertical current flow) ● Injection level ● Dose rate ● Temperature
1.0 m ::
=Z
k 10
@%:; ;8 ---
: : .
0.8
0-
P/P. - VBE= 0.5V
- . ❑ . plpo -
- -
0.6
~ . -a-
:
.
0.
‘%::
VBE= -2.OV
. ..Q . .
1
AIBnEKJ - VBE= 0.5V Al&
- VBE = -2.OV
Al&O-v~~=O.OV
0.4
0.1
0.2
[
0.0 ~ 10
0-01 100
Total Dose (krad(Si)) Figure 18 The Effects of Bias During Irradiation.[46]
III-32
1.0
Q. . . . -.
L
0..
-
10
-.
0.8
. .
1 0.6 $
s .
0.4
.
0.1 .0.
0.2
~/~0- Measurement
. ❑ . ~/~0- Measurement +
AI#Bo
Bias.
0.6V
Bias=
0.7V
- Measurement
Bias = 0.6V
0.0
0.01 10
100
Total Dose (krad(Si)) Figure 19 The Effect of Characterizing a Device at Different Injection Levels.[46]
5.0 INTEGRATED
CIRCUIT RESPONSE
This section will discuss the response of bipolar microcircuits to ionizing radiation. The first section will discuss the response of digital microcircuits that are primarily fabricated in oxide-isolated processes. The second section will discuss the more complex response of linear ICS and introduce low rate effects. 5.1 Total Dose Response of Digital Microcircuits Many of the early bipolar digital processes that have been reported on have been discontinued. These include integrated injection logic (12L) and integrated Schottky logic (ISL). Other processes, such as recessed field oxide bipolar processes, are still in limited production and are used for some mixed-signal applications. The failure modes for these processes are primarily associated with inversion associated with the recessed field oxide isolation, as discussed in Section 4.1. The radiation sensitivity of the circuits is found to be very sensitive to bias conditions during irradiation. In several cases, the failure mechanism is identified as substrate inversion, which causes a parasitic leakage path to form between the input transistor and a guard ring. When the input is biased high during irradiation, parametric failure occurs in 100% of the samples by 100 krad(Si). When the inputs are grounded during irradiation and normal power bias applied, all parameters and functionality are nearly constant and within specification up to 10 Mrad(Si). Failure analysis of these devices identifies a metallization run over the field oxide between the input structure and the guard ring. This metallization run acts as a gate for the
III-33
,,
,.
parasitic structure shown in Figure 8. The substrate will not invert unless a sufficient vertical field induced by this metallization run is present. Unless a careful analysis is performed to identi~ potential leakage paths, then testing should be performed with multiple bias conditions. Another common failure mode in digital bipolar circuits is sudden failure due to inversion of walled-emitter transistors. Testing of circuits that contain these transistors shows a gradual increase in supply current and then sudden circuit failure. With this type failure, in-situ bias was less critical. Circuits require bias to fail, but no pattern sensitivity was observed for specific structures. This is most likely due to the short channel that is required to invert the sidewall and the presence of adequate fringing field in most cases. Wafer mapping of the recessed field oxide failures[39] shows uniformity across the wafer, and from wafer to wafer within the same lot, to be good. Failure distributions were found to be within two sigma from wafer to wafer, indicating that a statistical sampling program can be used to predict the radiation response of the devices from lot sampling. Failure of digital bipolar devices fabricated in soft recessed field oxide processes is ve~ similar to CMOS fabricated in similar processes. The failure modes are predominantly inversion under or along the recessed oxide isolation and appear to have the same time response as CMOS. Test method 1019.4 should provide a conservative total dose test to predict the performance of these devices in space. 5.2 Linear Integrated Circuit Total Dose Response One of the earlier papers on total dose response of bipolar linear integrated circuits was published in 1975[ 14] and concerns the characterization and screening of devices for the Mariner Jupiter/Saturn spacecraft launched in 1977. Many of the linear integrated circuits of interest then are still being evaluated for current designs. These include the LM101A, LM108, LM124, HA2600, HA2620, and HA2700. Much of the concern at that time was identifying and screening out “maverick” devices that exhibited significantly greater total dose sensitivity, even though they were from the same lot and were processed no differently. No pre-irradiation electrical parameters could be identified that could screen out these devices. Considerable effort at that time was spent on developing an Irradiate-Anneal (IRAN) test method to irradiate devices to a low-to-moderate level and then perform a high-temperature anneal to anneal the radiation damage out. It was then projected that the devices would behave the same the second time and could be used as mission parts. Over the years, as data were collected on this test method, it was found that some devices did not respond the same on reirradiation or that the initial irradiation had to be so high that the damage could not be annealed out. This test method seemed to die out in the late 1970s. There was considerable information published on maverick devices [5] [ 14][ 15] [ 16][ 17][24] until 1980, but the cause of the unusual sensitivity or an effective screen was not identified in open publications. The primary test method for bipolar linear devices is to perform a step stress irradiation and characterize all ac and dc parameters at each level. Parameters that typically degraded or failed were input bias and offset currents and output offset voltage. A considerable amount of spot irradiation was performed in the late 1970s to identify the sensitive components that were causing the normal degradation 17][25]. As expected, the lateral and substrate pnp transistors III-34
were identified as being the sensitive elements in most designs. An excellent study by Johnston published in 1979[24] evaluated breakout transistors and LM108A operational amplifiers to investigate subwafer, wafer-to-wafer, and lot-to-lot variability. Again, the lateral pnp and substrate pnp were identified as being the sensitive elements. Maverick devices were still a problem with 1% identified as having a significant and sometimes different degradation than the normal lot distribution. No electrical parameter was identified that could be used to screen these devices. Some lots showed a very tight distribution in parameter degradation, while others showed a very broad distribution. Figure 20[24] shows this type distribution for two of the parameters on three lots.
35
35
~
mLots
A&B
I
o
L
o 0
10
20
30
40
50
60
01234567891011121314
AIB (lA)
AVO~(mV)
Figure 20 Distribution of Input bias Current and Output Offset Voltage Response to Total Dose for Three Lots of LM 108A.
Significant differences in response to electron irradiation compared to Cobalt-60 irradiation were reported in [33] [36] [41]. Previously, Figure 7 showed 2-MeV electrons producing three times the change and no signs of saturation at 250 kRad(Si) in input bias current for the LM111 comparator. Cobalt-60 saturated at 100 kRad(Si). This was identified as bulk damage in the wide-base substrate and lateral pnp. However, [41] reports an enhanced electron effect for an advanced bipolar technology with a fairly narrow base. If electrons are the predominant environment of concern, then irradiation in an electron source maybe advisable. 5.3 Low-Rate
Effects Introduction
It was first reported in 1991 by Enlow[44] that an advanced bipolar technology exhibited increased degradation when irradiated at lower dose rates. These initial data reported that the effect was observed for npn transistors in two processes from one manufacturer. Figure 21 shows the data from this first paper on the low-rate effect. Since the publication of this first paper, more than fifteen papers have been published on low-rate effects in bipolar transistors and microcircuits, showing bipolar processes and circuits from multiple manufacturers exhibit the Figure 22 is a well-published summary plot of the low-rate effect by low-rate effect. Johnston[54]. Figure 22 shows data on microcircuits from two manufacturers. For one circuit (LM108), no dose rate effect is observed. For another circuit (LM 101), the low-rate effect is
III-35
.,
observed to saturate at a factor of approximately 2x at dose rates below 1 rad(Si)/s. For other circuits (LM111 and LM324), no saturation in the effect is observed down to dose rates of .01 rad(Si)/s with factors of 5x and greater in degradation compared to irradiation at 50 rad(Si)/s. The remainder of the course will be spent on the recent research performed on low-rate effects in transistors and circuits and the progress toward identifying the basic mechanisms responsible and acceptable hardness assurance test methods. ,
().1 i
,, , . lff~i /
F
t
, *.1
i-
❑
, ~.,
,
,,. . v’.,
i
,
,
.
,
,
.
. 43’
n“
.
,
,fj.,
,
Q
.
,0.3
- a+ - 0-e-
.
0
},,,,,,,
1 - 1.1 2 -1.1 1 -300 2-300
Process
Procese Process Process
I , (36
,05
Total
Dose
radls radh radls radk
I
[
t
,0.4
107
(rad(SiOZ))
Figure 21 Change in A1/f3for npn in Two Processes vs. Total Dose at Two Dose Rates.[44]
* +
● A ■
LM108(npn) LM1o I (npn) LM1ll (pnp) LM324 (pnp)
-5
+ –4
-3
-2
-1
Discrete Transistors
0.001
0.01
0.1 Dose
Rate
1
10
100
(rad(Si)/see)
Figure 22 Effect of Dose Rate on Total Dose Damage Normalized to 50 Rad(Si)/s.
III-36
6.0 BIPOLAR LOW-RATE EFFECTS Enhanced total dose darnage at low dose rates has been studied extensively the last four years in bipolar processes, transistors, and microcircuits from multiple manufacturers. Test structures, transistors, or microcircuits are defined to exhibit enhanced low dose rate degradation when a measured parameter exhibits increased degradation at lower irradiation dose rates compared to the degradation at the same total dose at higher dose rates. The low-rate effect has been found to be nonexistent to minimal in some bipolar processes and severe in others. Even for processes that are known to exhibit the low-rate effect, only certain circuits maybe affected; and only some parameters may exhibit the enhanced degradation. It has also been observed to be bias-sensitive in some circuits, exhibit lot-to-lot variations, and -- in some devices -- to be extremely nonlinear. If one statement can be made to summarize the low rate effect, it is that it appears to be consistently inconsistent. In this section of the course, we will attempt to bring some consistency to our understanding of the dose rate effect. 6.1 Basic Mechanisms The enhanced low dose rate effect is a “true” dose rate effect rather than a timedependent effect as seen in MOS oxides. Thus, a post-irradiation anneal following a high dose rate irradiation will not produce the same results as observed at the end of a low dose rate irradiation. The “true” dose rate effect has been studied using MOS capacitors fabricated using a bipolar base oxide. A complete description of the study maybe found in the literature [51]. The major results of this investigation are: (1) the enhanced low dose rate effect occurs for a near zero electric field in the oxide during irradiation; and (2) the net trapped-hole density at low rate is larger, even though large numbers of holes and electrons are trapped at high and low dose rate. In addition to the “true” dose rate effects that occur as a result of different transport, recombination, and trapping properties during irradiation, there are also time-dependent effects following irradiations, especially after high dose rate irradiations. There are two models that have been proposed to explain the “true” dose rate effects [51 ][60]. A common feature of these models is that, if the irradiations are performed at either high electric field or at elevated temperature, the enhanced low dose rate effect should be minimized. The first model [51] suggests that, “during higher-rate irradiation, the large number of defects in the bulk of the oxide evidently retard the hole transport process by several decades over its normal duration. The slowly transporting or metastably trapped holes in the bulk of the oxide act in conjunction with the building space charge due to more deeply trapped holes near the Si/ SiOz interface to reduce the charge yield in the bulk of the oxide in the high-rate case as compared to the low-rate case. This reduction is due to the decreased local potential gradient (decreased local time-dependent electric field) between the trapped holes near the silicon interface and the gate, caused by increased positive charge in the oxide bulk or near the gate interface, as shown in Figure 23(a). The slowly transporting and/or metastably trapped holes III-37
.!
,.
.
in the bulk of the oxides also provide additional electrostatic fields during highrate irradiation that causes holes to be trapped, on average, a little closer to the Si/Si02 interface than during low-rate irradiation.”
m
@
++ + + +++ + +++++++++ +-
+
Delocalized Hole / Centers\
++
+ +
+
+
Holes in Deep Traps
+ ~++
+-
+- 4
-e-h Dipoles
1 —
+ ++ b +-
+++++++
I —
(a)
(b)
High Rate
Low Rate
Figure 23 Schematic Illustration of Mechanisms Contributing to Enhanced Net Positive Charge in Bipolar Screen Oxides at Low Dose Rates. Figure (a) Refers to High-Rate Irradiation and Figure (b) to Low-Rate Exposure. Mechanisms Apply to Soft Oxides at Low ElectricFields.[51 ]
“The reduction in charge yield at high rates due to these space charge effects, coupled with the relatively slow trapped-hole neutralization rate at O V for these devices, evidently accounts for the increase in trapped-hole density at low rates. That some holes are forced by the bulk space charge to be trapped a little closer to the silicon at high rates than they otherwise would be at lower rates facilitates the formation of neutral trapped-hole/electron dipoles near the silicon. Some dipoles will function electrically as border traps, and others will be electrically indistinguishable from annealed holes. This leads to the enhanced density near the Si/SiOz interface during the high-rate trapped-electron irradiations. Finally, space-charge and electrostatic back-pressure effects only dominate device response at low fields because the applied electric field dominates local fields for higher-field exposure.” The second model proposed in [60] suggests that “electron-trapped charge is smaller for large electric fields in oxides and low dose rates. For low dose rate conditions, the total positive charge in the oxide depends of the number of holes that escape recombination during dispersion transport. It is usually assumed that a hole combines with a free electron, but the interaction of a free hole with trapped electron must be taken into account. Thus, the presence of
III-38
negative-trapped charge in oxide can dramatically decrease the hole yield because of free hole recombination. The electron traps m important in bipolar screen oxides because the following conditions are satisfied. First, the electric field in the oxide is small. Therefore, the radiation-induced electron traps are occupied. Second, the conventional screen oxide is not radiation-hard, The capturing of holes takes place in the bulk of the oxide; therefore, the electron traps are generated along the entire path of the holes to the Si/Si02 interface. Under low dose rate irradiation, the occupation of electron traps is smaller than for moderate dose rates. It causes the decrease of hole loss due to the capture of holes at occupied electron traps, which leads to the increase of positive charge buildup. It means that the positive oxide-trapped charge in screen oxides increases with the decrease of dose rate. The thermal annealing of positive oxide-trapped charge at room temperature during long-time, low dose rate irradiation cannot compensate their growth caused by the increasing of positive charge yield.” Another model has been suggested [62] that may account for the time-dependent effects following the higher dose rate irradiation, such as the continued gain degradation in lateral and substrate pnp transistors. This model proposed in [62] suggests that “the hole transport, as described by the CTRW (continuous time random walk) model, can be applied to the thick field oxides and low electric fields present over the base-emitter junctions of bipolar transistors. Very thick oxides (= 1pm) are present over the base-emitter junction of substrate and lateral pnp transistors. It is assumed that the lowest dose rate for which devices still exhibit dose rate effects is due to the time required for charge to transport to the interface, which is extended to much longer time periods under low field conditions and also increases with oxide thickness. Previous field oxide work also showed that the fraction of the charge that remains trapped in the oxide bulk region is strongly This may explain why pnp transistors in the affected by the oxide thickness. junction-isolated processes exhibit more damage than npn transistors; oxides over the emitter-base region of pnp devices are approximately three times as thick as for npn transistors. Based on npn and pnp oxide thickness of 3500 and 10000 angstroms, respectively, the npn transistors should be affected by dose rates down to 1 rad(Si)/s and the pnp transistors down to .008 rad(Si)/s. Based on limited oxide thickness results, it predicts that the relative damage in npn transistors will be approximately a factor of two or three higher when bulk trapping is taken into Still account at low dose rates, compared to trapped charge at the interface. higher relative damage is expected for the thicker oxides in the pnp transistors, but it is not possible to estimate the magnitude of increased damage from the limited data available from field oxides. One possible explanation for the dose rate dependence is radiation-induced recombination, which would reduce the fraction of holes that are trapped in the bulk of the oxide. In order to test this assumption, an experiment was performed using the LM111 comparator under mixed dose rate conditions to examine the effect of nonconstant dose rates on enhanced damage.
III-39
One set of devices was initially irradiated at a low dose rate, increasing the dose rate to a much higher level after the initial irradiation was completed. The opposite dose rate scenario was used for a second set of devices. The results are shown in Figure 24, along with earlier results from a different group of devices irradiated at high rates. From the figure, it can be seen that high- and low-rate damage effects appear to be independent. These data suggest that, at least in thick oxides, once charge is trapped within the body of the oxide, little recombination occurs from the excess electron density produced by ionization. The implication is that charge in the bulk region acts independently from the mobile charge, a rather surprising result.”
500
500
50
rad(Si)/s
400
300 m
m “i
200
200
t t 100
50 rad(Si),
0
0
lRS$ v ~ ~ , A– RW
1A+
(1)
where 1A is the anode current, IRS is the substrate current, and ZRWis the well current [12]. The substrate current level depends on the gain of the vertical BJT, arid the well current level depends on the well geometry and contact spacing.
In most cases, the gain product of the parasitic BJTs is
large enough to allow latchup. The parasitic shunting resistances act to reduce the current regeneration and help to circumvent latchup in devices with a gain product that would ordinarily result in latchup. 5.2.2.
Charge Collection When an energetic particle passes through the device structure, electron-hole pairs are gener-
ated along the track length. The manner in which this charge is coll~ted
is important in under-
standing latchup behavior. There are two basic charge collection mechanisms:
drift and diffusion.
The amount of charge collected by drift often exceeds the amount deposited within the volume of the ionization track through the equilibrium depletion region. This increased drift collection mechanism is called funneling, and the region of the track involved in drift collection is called the funnel
IV-58
B
PROMpT (Q. +
ICR
‘F)
c“’)
0.2
o
0.4
1
10
100
TIME (ns)
Figure 5.6. Illustration of funneling and difision [15].
I $ ,,, ,11
5
r ,,, ,,,
mechanisms for churge collection following an ion strike after
11
,,*, , , ,ITml
4-
g3
“
~2
.
f
/
/
/
/
, ‘,8,*I ,
,n-1-m
Bulk process He ------/
-
Epitaxial prooess
/
1-
0 ~~-i2
,.-10
10-8
f@
Time (s)
5.7. Three-dimensional substrates after [12].
Figure
modeling used to calculate the time-dependence
Iv – 59
of co!lected chacge in diodes with n-
[83-84]. Carrier densities in excess of the surrounding majority carrier levels (i.e., high injection) are necessary for the funneling mechanism.
When carrier densities drop below the level required
for funneling, any remaining charge in the depletion region is collected by drift and any charge outside the depletion region is collected by difision. There are a couple of important differences between the funneling mechanism collection and charge collection by diffusion. track onto the hit node.
for charge
Funneling directs the collected charge along the ion
Diffusion can distribute the deposited charge along many nodes.
This
phenomenon is illustrated in Figure 5.6 [84], where the charge collected by funneling and diffusion is shown versus radial distance from the strike. Also, since funneling is a drift process, charge is transported more rapidly by funneling than by diffusion. used by Dodd, et. al., to calculate the time-dependence
Three-dimensional
modeling has been
of collected charge in diodes with n-sub-
strates [85]. In this work they showed that for more heavily doped substrates (typical of submicron CMOS devices), drift collection could occur as long as several nanoseconds after the ion strike, and that drift and diffusion components were comparable in such structures.
Typical results from that
work are shown in Figure 5.7. Analytical forms of the charge collection mechanism are given in [84, 86-89], and numerical simulation studies of the charge collection mechanism are given in [9093].
Figure 5.8. A typical test-structure geometry for studing latchup ajier [12].
IV -60
5.2.3.
lliggering
SEL
Atypical test-structure geometry for studying latchup is shown in Figure 5.8 [12]. The important dimensions shown include the spacing between the anode and cathode (affects parasitic BJT gain), and the distance between the well and substrate contacts (determines shunting resistances).
values of parasitic
Single-event latchup (SEL) is triggered when one of the two parasitic BJTs
is turned on following an energetic particle strike. The p-n-p-n structure can be triggered into SEL in two ways: anode triggering (the vertical transistor is turned on first), and cathode triggering (the lateral transistor is turned on first). In a typical CMOS process, more charge is required to trigger the lateral BJT than the vertical one - making anode triggering the more sensitive SEL mechanism [94]. Anode triggering of SEL involves four different steps that will now be discussed [12]. First, the transient current generated by the energetic particle strike sets up a transient current in the wellsubstrate junction.
The current flows from the well contact to the substrate contact resulting in a
voltage drop within the well that depends on the distance from the ion strike location to the well contact (the further from the well contact - the higher the voltage).
Second, if the voltage drop is
sufficient, the vertical transistor becomes forward biased and produces a much larger current due to its gain that flows from the anode to the substrate.
Third, the voltage drop that results from the
increased current flow (second step) forward biases the lateral transistor causing it to turn on. Once the second transistor turns on, the regenerative current mechanism begins. Fourth, the regenerative current mechanism causes both parasitic transistors to enter the saturation region of operation. The structure remains latched until the power supply is interrupted. gering mechanism
Numerical simulations of the trig-
indicate that the subtleties of the currents involved more complex than that
outlined here, but the essence is the same [95-96]. 5.2.4.
Temperature Dependence of SEL SEL exhibits a very strong temperature dependence [97-98]. Figure 5.9 shows the SEL tem-
perature dependence on (a) the test structure and (b) a 64 K SRAM [98]. The threshold LET for SEL drops by a factor of two and the SEL cross-section highest temperature compared to room temperature.
(or sensitive device area) increases at the
The decrease in threshold LET is due to two
properties of the parasitic BJT structures: the = -2 mV/OC change in base-emitter voltage, VBE; and the increase in well resistance at elevated temperature.
The increase in the SEL cross-section
is due
to the decrease in threshold LET. As the threshold LET drops, more of the well area can contribute to the cross-section at a given LET. Since it is possible that some devices that don’t exhibit latchup at room temperature may exhibit latchup at elevated temperatures, the temperature dependence of SEL is important and should not be overlooked.
IV–61
,.
10-3
F
I
saturation cross-section
~:
>>
0 G ,()-6
~
10
o
20
30
O
10
5
15
20
25
LET [MeV-cm?lmg]
LET [MeV-cm?mg]
Figure 5.9. The single-event latchup temperature dependence on (a) the test structure and (b) a 64 K SRAM after [12].
5.2.5.
Holding Current and Voltage
The holding
current,
ZH, and holding voltage, VH, are generally much easier to model than the
SEL triggering mechanism.
The values for ZHand VH do not depend on how the latchup condition
is triggered, so models for ZHand VH that have been developed for electrically-induced often used by the radiation effects community.
latchup are
In general, IH and VH both increase as the spacing
between parasitic BJT is increased. In typical CMOS structures most of the current through the p-n-p-n parasitic structure flows from the anode to the substrate contact, making IH very sensitive to substrate properties.
The
holding current in epitaxial structures is much greater than that of bulk structures. This is due to the lower resistance of the bulk substrate decreases the parasitic resistance of the lateral transistor, and because the epitaxial structure lowers the transistor gain. It has been shown that the holding current is =5-10 times greater for epitaxial structures (p-substrates) increased significantly,
VH is
[99-100].
Even though ZH can be
usually the more effective parameter to monitor. This is because, in
most circuits sufficient current is available from the power supply to support latchup, even if ZHis very high. Designing internal structures with the holding voltage higher than the available power supply voltage is away to prevent SEL. If the holding voltage cannot be maintained, then the latch cannot be sustained.
Chatterjee et al., have shown that the holding voltage for epitaxial structures scales as
Lltepi, where
L is the anode-cathode spacing of an n-well epi process, and tePi is the thickness of the
epitaxial layer [101]. A larger anode-cathode
spacing and a thinner epitaxial layer will result in a
higher holding voltage. The holding voltage can also be affected by choices in the process.
IV – 62
Thin, retrograde wells
(have a much higher doping level near the bottom of the well) have a much lower sheet resistivity.
The lower sheet resistivity acts to increase the holding voltage of the structure [102-103]. Trench isolation has also been shown to be an effective means to increase the holding voltage [104- 105]. It is possible to eliminate latchup by using oxide isolation rather than junction isolation; however, technical difficulties in producing large-scale devices have limited the development of this technology. 5.3.
Reducing SEL Susceptibility To
summarize this section, possible ways of reducing SEL susceptibility will be reviewed.
The threshold LET for SEL drops and the SEL cross-section (or sensitive device area) increases at elevated temperatures compared to room temperature. So, to reduce the sensitivity to latchup, the device should not be operated at elevated temperatures. The most effective means to reduce or eliminate SEL susceptibility is to increase the holding voltage higher than the power supply voltage. This can be accomplished by careful layout, through the use of a thin epitaxial layer, or by using dielectric isolation rather than junction isolation.
IV-63
-,
.!
6.
Sumww In this Short Course, catastrophic single-event phenomena in the natural space radiation envi-
ronment have been reviewed. Single-event burnout (SEB) of power MOSFETS and power BJTs, single-event gate rupture (SEGR) of power MOSFETS, single-event dielectric failure of SRAMS and DIL4.Ms, and single-event latchup (SEL) of CMOS technologies have been discussed.
For
each of these phenomena laboratory testing procedures, physical models, and possible hardening solutions have been presented. As devices continue to scale to greater complexities, some singleevent phenomena will no longer be of concern; however, challenges for developing electronic devices immune to catastrophic single+vent effects will remain.
7.
ACKNOWIXDG~
The authors would like to thank the following individuals for technical insights, guidance, and
useful discussion of topics related to these Short Course Notes: Ron Schrimpf, Mark Allenspach, Johu Brews, Marie-Catherine Calvet, Philipp Calvel, Lew Cohn, Charles Dachs, Dave Emily, Jean Gasio~ Jakob Hohl, Allan Johnston, Roc& Koga, Ken LaBel, Isabelle Mouret, Jean-Marie Palau, Ron Pease, Dale Platteter, Frank Roubaud, Piene Taste4 Jeff Titus, Frank Wheatley, and T&l Wrobel. Over the course of several years, research on SEE in power devices at the University of Arizona has been sponsored by the Defense Nuclear Agency, NSWC-Crane, Espace, and A&ospatiale.
We greatly appreciate the encouragement
and our colleagues.
IV-64
NASA-Goddar4
Alcatel
and advice of our sponsors
8.
REFERENCES
1.
A.E. Waskiewicz, J.W. Groninger, V.H. Strahan, and D.M. Long, “Burnout of Power MOS Transistors with Heavy Ions of Califomium-252,” L%%l.3Trans. Nucl. Sci., vol. NS-33, pp. 1710-1713, 1986.
2.
J.L. Titus, G.H. Johnson, R.D. Schrimpf, and K.F. Galloway, “Single Event Burnout of Power Bipolar Junction Transistors,” IEEE Trans. Nucl. Sci., vol. NS-38, pp. 1315-1322, 1991.
3.
T.A. Fischer, “Heavy-Ion-Induced Gate-Rupture Sci., vol. NS-34, pp. 1786-1791, 1987.
4.
T.A. Fischer, “SEU-Hard Power MOSFETS,” in Government Microcircuit Applications ference Digest of Papers, pp. 131-134,1987.
5.
W.A. Kolasinski, J.B. Blake, T.K. Anthony, W.E. Price, and E.C. Smith, “Simulation of Cosmic-Ray Induced Soft Effors and Latchup in Integrated-Circuit Computer Memories,” IEEE Trans. Nucl. Sci., vol. NS-26, pp. 5087-5091, 1979.
6.
K. Soliman and D.K. Nichols, “Latchup in CMOS Devices from Heavy Ions,” IEEE Trans. Nucl. Sci, vol. NS-30, pp. 4514-4519, 1983.
7.
D.K. Nichols, W.E. Price, M.A. Shoga, J. Duffey, W.A. Kolasinski, R. Koga, “Discovery of Heavy-Ion Induced Latchup in CMOS/Epi Devices,” IEEE Trans. Nucl. Sci., vol. NS-33, pp. 1696, 1986.
8.
Y. Moreau, H. de la Rochette, G. Bruguier, F. Pelanchon, C. Sudre, R. Ecoffet, “The Latchup Risk of CMOS Technologies in Space,” IEEE Trans. Nucl. Sci., vol. NS-40, pp. 1831-1837, 1993.
9.
R. Harboe-Sorenson and e. al., “Radiation Testing of Flight Lots for MARS-94 Covering Semiconductor ~pes as 4 M-Bit DRAM, 256 K-Bit SRAM, 256 K-Bit EEPROM, and a 53C90 SCSI Controller,” Proceedings of the 1993 RADEC’S Conference, vol. pp. 490, 1993.
10.
D.K. Nichols, J.R. Coss, R.K. Watson, H. R. Schwartz, R.L. Pease, “An Observation of Proton-Induced Latchup,” ZEEE Trans. Nucl. Sci., vol. NS-39, pp. .1654- 1656, 1992.
11.
L. Adarns and e. al., “A Verified Proton Induced Latchup in Space,” L?3EETrans. Nucl. Sci., vol. NS-39, pp. 1804- 18XX, 1992.
12.
A.H. Johnston, “The Influence of VLSI Technology Evolution on Radiation-Induced in Space Systems,” IEEE Trans. Nucl. Sci., vol. NS-43, pp. 505-521, 1996.
13,
T.R. Oldham, K.W. Bennett, J. Beaucour, T. Carriere, C. Polvey, and P. Gamier, “Total Dose Failures in Advanced Electronics from Single Ions,” IEEE Trans. Nucl. Sci., vol. NS-40, pp. 1820-1830, 1993.
IV – 65
in Power MOSFETS,” IEEE Trans. Nucl.
Con-
Latchup
14.
G.M. Swift, D.J. Padgett, and A.H. Johnston, “A New Class of Single Event Hard Errors,” IEEE Trans. ~ZJCz. ~ci., vol. 41, pp. 2043-2048, 1994.
15.
J. Schwank, “Basic Mechanisms of Radiation Effects in the Natural Space Environment,” 1994 IEEE NSREC Short Course Notes, vol. pp. II-1 to 11-109, 1994.
16.
E.G. Stassinopoulos and J.P. Raymond, “The Space Radiation Environment for Electronics,” Proc. IEEE, vol. 76, pp. 1423-1442, 1988.
17.
E.G. Stassinopoulos, “Radiation Environments in Space,” in Notesfor the 1990 ZEEENuclear and Space Radiation Effects Conference Short Course, 1990.
18.
J.H. Adams, Jr., “The Natural Radiation Environment Inside a Spacecraft,” IEEE Trans. Nucl. Sci., vol. NS-29, pp. 2095-2100, 1983.
19.
J.F. Ziegler, J.P. Biersack, and U. Littmark, The Stopping and Range of Ions in Solids. Elmsford, NY Pergamon Press, Inc., 1985.
20.
H. Dussault, J.W. Howard Jr., R.C. Block, M.R. Pinto, W.J. Stapor, and A.R. Knudson, “High Energy Heavy-Ion-Induced Single Event Transients in Epitaxial Structures,” IEEE Trans. Nut. Sci., vol. NS-41, pp. 2018-2025, 1994.
21.
W.J. Stapor and P.T. McDonald, “Practical Approach to Ion Track Energy Distribution,” z@d. Phys. , vol. 64-9, pp. 4430-4434, 1988.
22.
W.J. Stapor, P.T. McDonald, A.B. Campbell, A.R. Knudson, and B.G. Glagola, “Charge Collection in Silicon for Ions of Different Energy but Same Linear Energy Transfer,” ZEEE Trans. NUCL Sci., vol. 35-6, pp. 1585-1590, 1988.
23.
R.C. Martin, N.M. Ghoniem, Y. Song, and J.S. Cable, “The Size Effect of Ion Charge Track on Single-Event Multiple-Bit Upset,” IEEE Trans. NUCLSci., vol. 34-6, pp. 1305-1309, 1987.
24.
T.L. Criswell, D.L. Oberg, J.L. Wert, P.R. Measel, and W.E. Wilson, “Measurements of SEU Thresholds and Cross-Sections at Fixed Incidence Angles,” IEEE Trans. Nucl. Sci., vol. 34-6, pp. 1316-1321, 1987.
25.
P.J. McNulty, “Predicting Single-Event Phenomena in Natural Space Environments,” in Notes for the 1990 IEEE Nuclear and Space Radiation E#ects Conference Short Course, 1990.
26.
F.W. Sexton, “Measurement of Single Event Phenomena in Devices and ICS,” in Notes for the 1992 IEEE Nuclear and Space Radiation E#ects Conference Short Course, 1992.
27.
L.W. Massengill, “SEU Modeling and Prediction Techniques,” in Notes for the 1993 IEEE Nuclear and Space Radiation E#ects Conference Short Course, 1993.
28.
E. Normand, “Single-Event Effects in Systems Using Commercial Electronics in Harsh Environments, “ in Notes for the 1994 IEEE Nuclear and Space Radiation E#ects Conference Short Course, 1994.
IV – 66
J.
29.
W.J. Stapor, “Single-Event Effects Qualification, “ in Notes for the 1995 IEEE Nuclear and Space Radiation Effects Conference Short Course, 1995.
30.
D.A. Grant and J. Gowar, Power MOSFETS: Wiley & Sons, 1989.
31.
G.H. Johnson, J.H. Hohl, R.D. Schrimpf, and K.F. Galloway, “Simulating Single-Event Burnout of N-Channel Power MOSFETS,” IEEE Trans. Electron Devices, vol. 40, pp. 1001-1008, 1993.
32.
J. Titus and C.F. Wheatley, “Experimental Studies of Single-Event Gate Rupture and Burnout in Vertical Power MOSFETS,” IEEE Trans. Nucl. Sci., vol. NS-43, pp. 533-545, 1996.
33.
T.A. Fischer, “Heavy-Ion Induced Gate-Rupture Sci., vol. NS-34, pp. 1786-179, 1987.
34.
D.K. Nichols, K.P. McCarty, J.R. Coss, A. Waskiewicz, J. Groninger, D. Oberg, J.Wert, P. Majewski, and R. Koga, “Observations of Single-Event Failure in Power MOSFETS,” 1994 IEEE Radiation E#ects Data Workshop Record, vol. pp. 41-54, 1994.
35.
C.F. Wheatley, J.L. Titus, and D.I. Burton, “Single-Event Gate Rupture in Vertical Power MOSFETS; An Original Empirical Expression,” ZEEE Trans. NM-4. Sci., vol. 41, pp. 21522159, 1994.
36.
J.L. Titus, L.S. Jarniolkowski, and C.F. Wheatley Jr., “Development of Cosmic Ray Hardened Power MOSFETS,” IEEE Trans. NUCL Sci., vol. NS-36, pp. 2375-2382, 1989.
37.
G.H. Johnson, R.D. Schrimpf, K.F. Galloway, and R. Koga, “Temperature Dependence of Single-Event Burnout in N-Channel Power MOSFETS,” IEEE Trans. NUCL Sci., vol. 39, pp. 1605-1612,
Theory and Applications.
New York: John
in Power MOSFETS,” IEEE Trans. Nut.
1992.
38.
D.L. Oberg and J.L. Wert, “First Nondestructive Measurements of Power MOSFET Single Event Burnout Cross Sections,” IEEE Trans. Nucl. Sci., vol. NS-34, pp. 1736-1741, 1987.
39.
F. Roubaud, C. Dachs, J.M. Palau, J. Gasiot, and P. Tastet, “Experimental and 2-D Study of the Single-Event Burnout in N-Channel Power MOSFETS,” ZEEE Trans. Nut. Sci., vol. NS40, pp. 1952-1958, 1993.
40.
C. Dachs, F. Roubaud, J.M. Palau, G. Bruguier, J. Gasiot, and P. Tastet, “Evidence of the Ion’s Impact Position Effect on SEB in N-Channel Power MOSFETS,” IEEE Trans. Nut. Sci., vol. NS-41, pp. 2167-2171, 1994.
41.
C. Dachs, F. Roubaud, J.M. Palau, G. Bruguier, J. Gasiot, M.C. Calvet, P. Calvel, and P. Tastet, “Evidence of the Sensitivity Inhomogeneity of Power MOSFETS’ Cells to Single Event Burnout,” RADECS-95, 1995.
42.
S. Kuboyama, S. Matsuda, T. Kanno, and T. Ishii, “Mechanism for Single-Event Power MOSFETS,” IEEE Trans. Nut. Sci., vol. NS-39, pp. 1698-1703, 1992.
IV – 67
Burnout of
43.
A.E. Waskiewicz and J.W. Groninger, “Burnout Thresholds and Cross-Sections of Power MOS Transistors with Heavy Ions;’ Rockwell International Report No. DNA-MIPR-88-507, Feb., 1990.
44.
M.N. DarWish, M.A. Shibib, M.R. Pinto, and J.L. Titus, “Single-Event Gate Rupture of Power DMOS Transistors;’ ZEDM Tech. Digest, vol. pp. 1993.
45.
P. Calvel, C. Peyrotte, A. Baiget, and E.G. Stassinopoulos, “Comparison of Experimental Measurements of Power MOSFET SEBS in Dynamic and Static Modes:’ IEEE Trans. Nucl. Sci., vol. 38, pp. 1310-1314, 1991.
46.
P. Tastet, J. Gamier, H. Constans, and A.H. Tizon, “Burnout Sensitivity of Power MOSFETS Operating in a Switching Converter,” in Proc. RADECS 93, 1993, pp. 452-457.
47.
J.H. Hohl and K.F. Galloway, “Analytical Model for Single Event Burnout MOSFETS,” IEEE Trans. NUCL Sci., vol. NS-34, pp. 1275-1280, 1987.
48.
G.H. Johnson, J.M. Palau, C. Dachs, K.F. Galloway, and R.D. Schrimpf, “A Review of the Techniques Used for Modeling Single-Event Effects in Power MOSFETS,” IEEE Trans. NUCL Sci., vol. NS-43, pp. 546-560, 1996.
49.
T.F. Wrobel, F.N. Coppage, G.L. Hash, and A.J. Smith, “Current Induced Avalanche in Epitaxial Structures;’ IEEE Trans. NUCL Sci., vol. 32-6, pp. 3991-3995, 1985.
50.
T.F. Wrobel, “On Heavy Ion Induced Hard-Errors in Dielectric Structures,” IEEE Trans. Nut. Sci., vol. NS-34, pp. 1262-1268, 1987.
51.
T.F. Wrobel and D.E. Beutler, “Solutions to Heavy Ion Induced Avalanche Burnout in Power Devices,” ZEEE Trans. Nut. Sci., vol. NS-39, pp. 1636-1641, 1992.
52.
J.H. Hohl and G.H. Johnson, “Features of the Triggering Mechanism for Single Event Burnout of Power MOSFETs~’ IEEE Trans. NUCL Sci., vol. NS-36, pp. 2260-2266, 1989.
53.
C. Dachs, “Study and Modelization of the Heavy Ion Induced Burnout in N-Channel Power MOSFETS,” PhD, University de Montpelier, 1995.
54.
Technology Modeling Associates, “MEDICI: llvo-Dimensional vol. 1 and 2, 1992.
55.
S. Kuboyarna,
Single-Event 1879,1993.
of Power
Device Simulation Prograrn~’
S. Matsuda, M. Nakajima, T. Kanno, and T. Ishii, “Numerical Analysis of Burnout of Power MOSFETS,” IEEE Trans. Nut, Sci., vol. NS-40, pp. 1872-
56.
C. Dachs, F. Roubaud, J.M. Palau, G. Brugier, J. Gasiot, P. Tastet, M.C. Calvet, P. Calvel, “Simulation Aided Hardening of N-Channel Power MOSFETS to Prevent Single Event Burnout,” IEEE Trans. Nut. Sci., vol. NS-42, pp. 1935-1939, 1995.
57.
C. Sudre, C. Dachs, H. La Rochette, F. Roubaud, G. Bruguier, F. Pelanchon, J.M. Palau, and J. Gasiot, “Application of Device Simulations to Radiation Hardening Studies,” ESSDERC95,1995. IV -68
58.
J.L. Titus, C.F. Wheatley, D.I. Burton, M. Allenspach, J. Brews, R.D. Schrimpf, K.F. Galloway, I. Mouret, and R.L. Pease, “Impact of Oxide Thickness on SEGR Failure in Vertical Power MOSFETS; Development of a Serni-Empircal Expression,” L??EE Trans. Nuc1. Sci., vol. 42, pp. 1928-1934, 1995.
59.
D.K. Nichols, J.R. Coss, and K.P. McCarty, “Single-Event Gate Rupture in Commercial Power MOSFETS,” RADECS 93 Conference Proceedings, pp. 462-467, 1993.
60,
I. Mouret, M.C. Calvet, P. Calvel, P. Tastet, M. Allenspach, K.A. LaBel, J.L. Titus, C.F. Wheatley, R.D. Schrimpf, and K.F. Galloway, “Experimental Evidence of the Temperature and Angular Dependence in SEGR,” RADECS 95 Conference Proceedings, 1995.
61.
I. Mouret, P. Calvel, M. Allenspach, J.L. Titus, C.F. Wheatley, K.A. LaBel, M.C. Calvet, R.D. Schrimpf, and K.F. Galloway, “Measurement of a Cross-Section for Single-Event Gate Rupture in Power MOSFETS,” Elect. Dev. Lett., vol. 17, pp. 163-165, 1996.
62.
J.R. Brews, “A Charge-Sheet 345-35, 1978.
63.
J.R. Brews, M. Allenspach, R.D. Schrimpf, K.F. Galloway, J.L. Titus, and C.F. Wheatley, “A Conceptual Model of Single-Event Gate Rupture in Power MOSFETS,” IEEE Trans. iWcL Sci., vol. 40, pp. 1959-1966, 1993.
64.
Silvaco International,
65.
M. Allenspach, J.R. Brews, I. Mouret, R.D. Schrimpf, and K.F. Galloway, “Evaluation of SEGR Threshold in Power MOSFETS,” IEEE Trans. iVucL Sci., vol. 41, pp. 2160-2166,1994.
66.
M. Allenspach, I. Mouret, J.L. Titus, C.F. Wheatley Jr., R.L. Pease, J.R. Brews, R.D. Schrimpf, and K.F. Galloway, “Single-Event Gate-Rupture in Power MOSFETS; Oxide Thickness Dependence and Computer Simulated Prediction of Breakdown Biases,” IEEE Trans. ZVuc.Sci., vol. NS-42, pp. 1922-1927, 1995.
67.
Silvaco International,
68.
I. Mouret, M. Allenspach, R.D. Schrimpf, J.R. Brews, K.F. Galloway, and P. Calvel, “Temperature and Angular Dependence of Substrate Response in SEGR,” IEEE Trans. NUCL Sci., vol.
41, pp. 2216-2221,
Model of the MOSFET,” Solid-State Electronics, vol. 21, pp.
“ATLAS: Device Simulation Software,” 1995.
“ATHENA:
2D Process Simulation Software,” 1995.
1994.
69.
M. Allenspach, C. Dachs, G.H. Johnson, E. Lorfevre, R.D. Schrimpf, J.M. Palau, J.R. Brews, and K.F. Galloway, “SEB and SEGR in N-Channel Power MOSFETS,” IEEE Trans. NUCL Sci., vol. 43, to be published, 1996.
70.
T.R. Oldham and J.M. McGarrity, “Ionization of Si02 by Heavy Charged Particle,” IEEE Trans. Nucl. Sci., vol. NS-28, pp. 3975, 1981.
71.
R. Koga, W.R. Crain, K.B. Crawford, D.D. Lau, S.D. Pinkerton, B.K. Yi, and R. Chitty, “On the Suitability of Non-Hardened High Density SRAMS for Space Applications,” IEEE Trans. NUCL Sci., vol. NS-38, pp. 1507, 1991.
IV – 69
!,
72.
C. Dufour, P. Gamier, T. Carriere, J. Beaucour, R. Ecoffet, and M. Labrunee, “Heavy Ion Induced Single Hard Errors on Submicronic Memories,” IEEE Trans. Nucl. Sci., vol. NS-39, pp. 1693, 1992.
73.
D.C. Shaw G.M. Swift, D.J. Padgett, A.H. Johnston, “Radiation Effects in Five Volt and Advanced Lower Voltage DRAMs,” IEEE Trans. Nuc1. Sci., vol. NS-41, pp. 2452-2458,1994.
74.
S.E. Diehl-Nagle, “A New Class of Single-Event 31, Pp. 1145, 1984.
75.
T. Hirose, “A 20-ns 4-Mb CMOS SRAM,” IEEE J. Sol. St. Circuits, vol. 25, pp. 1068, 1990.
76.
A. Tanabe and e. al., “A 30-ns 64-Mb DRAM,” IEEE J. Sol. St. Circuits, vol. 27, pp. 1525, 1992.
77.
P.K. Chatterjee and G.B. Larabee, “Gagibit-Age Microelectronics and Their Manufacture,” IEEE Trans. on Very Lurge Integration Systems, vol. 1, pp. 7, 1993.
78.
C. Hu, “Future CMOS Scaling and Reliability,” Proc. of the IEEE, vol. 81, pp. 682, 1993,
79.
R.R. Troutman, Latchup in CMOS Technology: demic Publishers, 1986.
80.
N. Khurana, “Pulsed Infrared Microscopy for Debugging Latchup on CMOS Product,” Proc. of 1984 IEEE International Reliability Symposium, pp. 122, 1984.
81.
C. Canali, F. Corsi, M. Muschitiello, and E. Zanoni, “Infrared Microscopy Study of Anomalous Latchup Characteristics Due to Current Redistribution in Different Parasitic Paths,” IEEE Trans. Elec. Dev., vol. ED-36, pp. 969, 1989.
82.
R. Fang and J. Moll, “Latchup Model for the Parasitic p-n-p-n Path in Bulk CMOS,” IEEE Trans. Elec. Dev., vol. ED-31, pp. 113, 1984.
83.
C.M. Hsieh, P.C. Murley, and R.R. O’Brien, “A Field-Funneling Effect on the Collection of Alpha-Particle-Generated Carriers in Silicon Devices:’ IEEE Electron Device Lett., vol. EDL2, pp. 103-105, 1981.
84.
F.B. McLean and T.R. Oldham, “Charge Funneling in N- and P-Type Si Substrates,” ZEEE Trans. Nucl. Sci., vol. NS-29, pp. 2018-2023, 1982.
85,
P.E. Dodd, F.W. Sexton, and P.S. Winokur, “Three-Dimensional Simulation of Charge Collection and Multiple-Bit Upset in Si Devices,” IEEE Trans. Nucl. Sci., vol. NS-41, pp. 2005, 1994.
86.
G.C. Messenger, “Collection of Charge on Junction Nodes from Ion Tracks,” ZEEE Trans. Nucl. Sci., vol. NS-29, pp. 2024-2031, 1982.
87.
C. Hu, “Alpha-Particle-Induced Field and Enhanced Collection of Carriers,” ZEEE Electron Device Lett., vol. EDL-3, pp. 31-34, 1982.
IV – 70
Soft Errors,” IEEE Trans. Nucl. Sci., vol.
The Problem and Its Cure.
Kluwer Aca-
88.
T.R. Oldham and F.B. McLean, “Charge Collection Measurements for Heavy Ions Incident on N- and P-Type Silicon,” IEEE Trans. Nucl. Sci., vol. NS-30, pp. 4493-4500, 1983.
89.
T.R. Oldham, F.B. McLean, and J.M. Hartman, “Revised Funnel Calculations for Heavy Particles with High dE/dx,” IEEE Trans. Nucl. Sci., vol. NS-33, pp. 1646-1650, 1986.
90.
C.M. Hsieh, P.C. Murley, and R.R. O’Brien, “A Field-Funneling Effect on the Collection of Alpha-Particle-Generated Carriers in Silicon Devices,” IEEE Electom Device Left., vol. EDL2, pp. 103-105, 1981.
91.
C.M, Hsieh, P.C. Murley, and R.R. O’Brien, “Dynamics of Charge Collection from AlphaParticle Tracks in Integrated Circuits,” Proc. Reliability Phys. Con., pp. 38-42, April 1981.
92.
C.M. Hsieh, P.C. Murley, and R.R. O’Brien, “Collection of Charge from Alpha-Particle Tracks in Silicon Devices,” IEEE Trans. Elect. Dev., vol. ED-30, pp. 686-693, 1983.
93.
H.L. Grubin, J.P. Kreskovsky, and B.C. Weinberg, “Numerical Studies of Charge Collection and Funneling in Silicon Devices,” IEEE Trans. Nucl. Sci., vol.NS-31, pp. 1161-1166, 1984.
94.
G. Goto, H. Takahashi, and T. Nakamura, “Modeling and Analysis of Transient Latchup in Double-Well Bulk CMOS,” IEEE Trans. Elect. Dev., vol. ED-33, pp. 1341, 1986.
95.
T. Aoki, “Dynamics of Heavy-Ion Latchup in CMOS Structures,” IEEE Trans. Elect. Dev., vol. ED-29, pp. 292, 1982.
96.
J. Rollins, “Numerical Simulation of SEU-Induced NS-33, pp. 1565, 1986.
97.
W. Kolasinski, R. Koga, E. Schnauss, and J. Duffey, “The Effect of Elevated Temperature on Latchup and Bit Errors in CMOS Devices,” IEEE Trans. Nucl. Sci., vol. NS-33, pp. 1605, 1986.
98.
A.H. Johnston, B.W. Hughlock, and R.E. Plaag, “The Effect of Temperature on Single-Particle Latchup,” IEEE Trans. Nucl. Sci., vol. NS-38, pp. 1435, 1991.
99.
D. Takacs, “Comparison of Latchup in p- and n-Well CMOS Circuits,” IEDM Technical Digest, pp. 159, 1983.
Latchup,” IEEE Trans. Nucl. Sci., vol.
100. E. Sangiorgi, “Two-Dimentional Numerical Analysis of Latchup in a VLSI CMOS Technology,” IEEE Trans. Computer-Aided Design, vol. CAD-4, pp. 561, 1985. 101. A. Chatterjee, “Direct Evidence Supporting the Premise of a Two-Dimensional Diode Model for the Parasitic Thyristor in CMOS Circuits Built on Thin Epi,” IEEE Elect. Dev. Left., vol. 9, PP. 509, 1988. 102. Y. Taur, “A Self-Aligned l-Lm-Channel CMOS Technology with Retrograde n-Well and Thin Epitaxy,” IEEE J. Solid-State Circuits, vol. 20, pp. 123, 1985.
IV–71
103. R.R. Rung, C.J. Dell’Oca, and L.G. Walker, “ARetrograde p-Well for Higher Density CMOS,” IEEE Trans. Elect. Dev., vol. 28, pp. 1115, 1981. 104. H.P. Zappe, “Advanced Techniques for CMOS Performance Enhancement and Latchup Control,” Ph. D., University of California, Berkeley, 1989. 105. P.V. Gilbert, P.E. Crabtree, and S.W. Sun, “Latch-Up Performance of a Sub-O.5 Micron InterWell Deep Trench Technology,” IEDM Technical Digest, pp.731, 1993.
IV -72
1996 NSREC SHORT COURSE
z
SECTION v
DESIGN ISSUES FOR RADIATION TOLEMNT MICROCIRCUITS FOR SPACE
David R. Alexander IWsslon Research Corporation
Design Issues for Radiation Tolerant Microcircuits
in Space
David R. Alexander and David G. Mavis Mission Research Corporation 1720 Randolph Road Albuquerque, New Mexico 87106 Charles P. Brothers and Joseph R. Chavez U.S. Air Force Phillips Laboratory Kirtland Air Force Base, New Mexico 87117
The views expressed in this article are those of the authors and do not reflect the official policy or position of the Department of Defense or the U.S. Government. This work supported in part by U.S. Air Force Phillips Laboratory Contract F29601 -89-C-0014.
v-1
-v. Design Issues for Radiation Tolerant Microcircuits Space
in
David R. Alexander and David G. Mavis Mission Research Corporation 1720 Randolph Road Albuquerque, New Mexico 87106 Charles P. Brothers and Joseph R. Chavez U.S. Air Force Phillips Laboratory Kirtland Ah- Force Base, New Mexico 87117 1.0
Introduction
2.0
Domain of Radiation Tolerant Design Practices The Impact of Radiation Effects on the Microcircuit Design Hierarchy Total Ionizing Dose Affects on Parasitic Elements Single Ion Effects (Latchup) on Parasitic Elements Radiation Response Mechanisms Affecting Transistors Radiation Response Mechanisms for Parasitic Edge Transistors Single Particle Strike Mechanisms Affecting Transistors Total Ionizing Dose Mechanisms Affecting Primitive Cells
3.0 4.0 5.0 6.0 7.0 8.0 9.0
10. 11. 12. 13. 14. 15. 16. 17.
Single Particle Effects Affecting Primitive Cells Total Ionizing Dose Mechanisms Affecting Macrocells Single Particle Effects Affecting Macro-Cells Demonstration of Radiation Tolerance Enhancement Through Design Test Structures for Supporting Radiation Tolerant Design Application of Radiation Characterization Data to Cell Design Performance Characteristics of the Design Tolerant Gate Array Summary 1.0 Introduction
The objective of this short course presentation is to provide an overview of the kinds of design (electrical and layout) issues that contribute to the radiation tolerance of commercial parts and parts fabricated in commercial foundries. The developer of electronics for space applications in the late 1990s is challenged with acquiring microelectronics which will reliably perform his mission without breaking his parts budget or derailing his schedule. He is no longer able to rely on the parts base developed and maintained by strategic weapons systems. Those systems have ended their production phase, and their replacements have generally been postponed for an indeterminate time. Furthermore, there is an ongoing emphasis within the government to reduce costs by eliminating specifications and encouraging the use of parts built with best commercial v-3
practices. The result is a greatly reduced vendor pool for radiation hardened parts and a widening gap between the performance and cost of microcircuits used in commercial electronics and those developed to fimction in a radiation environment. Consequently, the space applications developer is forced to take a much more active role in the evaluation of microcircuits and to accept a much greater risk in parts selection. One view of the parts acquisition process is depicted in the flow chart in Figure 1. It begins with a clear description of the radiation environment to be encountered and functional and performance The radiation environment varies significantly with the requirements of the microcircuits. altitude, inclination of the orbit, and solar activity. More benign environments and shorter duration missions obviously expand the candidates for the application. However, the developer is well advised to first consider the availability of radiation hardened parts even for modest radiation exposures. The savings in characterization testing, data reduction and analysis, and hardness assurance activities can far outweigh any initially perceived differences in item costs. The remaining manufacturers of QML/RHA (Qualified Manufacturers’ List/Radiation Hardness Assurance) [l] [2][3] microcircuits provide products with a legacy of built-in radiation hardness derived from a balanced technology flow which considers design, fabrication, packaging, and testing issues. The resulting components cannot be matched for robust performance in the space environment. In the event radiation hardened parts are not an available option, the applications developer has the alternatives of 1) selecting a commercial part and performing testing to determine its radiation tolerance level or 2) designing a radiation tolerant part. If a commercial part is selected, a radiation test strategy must be developed to determine its tolerance. Many commercial microcircuits have large numbers of terminals, and the selection of a test approach can be a challenging task. An approach based on considering the dominant ftilure mechanisms typical of specific parts of the commercial design can be useful in performing a test which will yield valuable insight into radiation performance of the microcircuit. There are two major risks in selecting a commercial microcircuit for use in space. First, the portions of the fabrication process that determine radiation hardness may not be tightly controlled by the vendor. Hence, the radiation tolerance of commercial parts typically have large standard deviation to mean ratios[4]. A second major risk stems from design and process changes which may be implemented at any time. Either of these may drastically change the radiation tolerance of the part. Indeed, a common industry practice is to bring out a product at a conservative feature size, and then perform a dimension shrink to improve yield and increase the number of die per wafer once a market has been established. Consequently, different revision numbers of the same part can exhibit widely varying radiation tolerance, especially with respect to single event latchup (SEL), which is very sensitive to spacing between elements on the microcircuit. The system developer can reduce these risks by acquiring all his part population from a single processing lot or wafer and by testing enough parts to obtain a valid statistical estimate of the population. Such acquisition strategies may be difficult to implement for small quantity purchases. In addition, this is often hard to accomplish because commercial tracking of parts may not reflect this wafer or even lot information.
v-4
Space Environment
& Microcircuit
No
.
t Yes +
Develop Rad Tolerant Part
I
Perform Hardness
Insert in
PEl_JkE!!4 Figure 1. Simplified Part Selection Flow Chart In the absence of a radiation hardened part and as an alternative to commercial parts, the space application developer may choose to design a radiation tolerant part. This option may be particularly attractive if a commercial function can be combined with application specific circuitry in a single ASIC (application specific integrated circuit) die. Many commercial functions are available in synthesizable VHDL or Verilog models. With appropriate design discipline in the macrocell library, a synthesized circuit can achieve radiation tolerance consistent with the requirements of many space missions. In general, the type of design discipline required trades-off packing density and pre-irradiation electrical performance to enhance radiation tolerance. 2.0 Domain of Radiation
Tolerant Design Practices
As illustrated in Figure 2, traditional radiation hardened microcircuit development has been based on contributions from rad hard processes, rad hard electrical design practices, and rad
v-5
hard layout rules. However, in this presentation, hardening approaches will be restricted to electrical design and layout practices. Only a commercial CMOS process is considered to be available for fabrication. This means that no specific hardening provisions can be relied upon for the gate oxide (GOX), the field oxide (FOX), or the transition region between gate and field oxides (edge oxides). Note however, that the industry trend is toward thinner gate oxides (e.g., 170~ for a 0.8 pm technology) as features scale into submicron dimensions. Since the oxide trapped charge is roughly proportional to tOX-2[5] in this oxide thickness range, its contribution to total ionizing dose degradation becomes less important with advancing technology. This is especially the case since tunneling mechanisms are effective in annihilating trapped charge within approximately 50 ~[6] of the Si02/Si interface. This does not necessarily pertain to interface state creation. No process-oriented provisions for single event effects hardening will be considered. This means that no high resistivity polysilicon is available for inserting high resistance elements in the cross coupling path for memory elements and latches. Also, no retrograde diffision profiles or silicon-on-insulator technology can be relied upon for latchup suppression. However, many commercial technologies are built on wafers with a lightly doped epitaxial (epi) layer on a heavily doped substrate to assist in electrically induced latchup suppression. Often, the use of epi technology is not advertised and can only be discovered by performing a destructive physical analysis on sample parts. Where epi substrates are used, they can be very beneficial in SEL suppression. Although process hardening is not a topic for this presentation, it should be pointed out that even a modest process hardening effort can produce great benefits in developing radiation tolerant parts for space. The chief benefit lies in tightening the distribution of radiation tolerance
Figure 2. Domain of radiation tolerant design practices.
V-6
This will permit less conservative design practices to be used with for the population. subsequent improvements in performance and packing density. The Defense Nuclear Agency programs[7] directed at modest improvements in hardness (Total dose = 50-200 Krad(Si), SEL LET =50- 120 MeV cm2/mg, and SEU LET = 40-65 MeV cm2/mg) in baseline commercial technologies have the potential to greatly benefit satellite applications. Radiation tolerant electrical design practices are related to the organization and interconnection of electrical elements (transistors, diodes, capacitors, and resistors) to perform the required function. Typically, the electrical design is represented by schematics or by the equivalent netlist, which is a textual description of the graphical information in the schematic. In complex microcircuits, the electrical design is depicted hierarchically. The top level describes the information processing organization within the device in terms of macrocell functional blocks and data buses. Lower levels are oriented toward primitive cell interconnections needed to form the functional blocks, and ultimately to transistor-level interconnections required to form primitive cells. Top-level netlists are typically written as behavioral blocks in a hardware description language (e.g., Verilog or VHDL). Intermediate levels may be described in register transfer language (RTL) or as pure structural language in which each macrocell is modeled by its Boolean fimction. Usually, the lowest-level netlist is written in a SPICE format in which each electrical element and node is modeled. Radiation tolerant design practices will be identified at each level of the hierarchy. They should be considered from the inception of the design process in order to be used most effectively. Radiation tolerant layout practices deal with the polygon representation of the microcircuit electrical design. The schematic/netlist description must be translated into entities which can be physically created through the photolithographic and fabrication technology to construct a microcircuit. By restricting the geometrical shapes and dimensions of the electrical elements and controlling the spacing and number of contacts between elements, substantial In general, a microcircuit designed with improvements can be made in radiation tolerance. radiation tolerant design practices will be larger and slower than the same fiction designed with no restrictions. The increased size typically translates to a lower yield and more expensive parts. Also, the complexity of functions which can be fabricated using only rad tolerant design practices is restricted by the die size and operating speed. Most likely, one would not attempt to build a state-of-the-art microprocessor or a large SRAM with rad tolerant design practices, but a moderately complex ASIC function could be appropriate.
3.0 The Impact of Radiation Effects on the Microcircuit
Design Hierarchy
To effectively employ radiation-tolerant design practices, the designer needs a clear understanding of the interactions among radiation sensitive elements at each level of the design hierarchy -depicted in Figure 3. Actually, many of the radiation effects which limit the hardness of a microcircuit are associated with parasitic elements which are typically not considered as part of the design process. These elements are an inherent part of the semiconductor technology chosen, but their influence is often not experienced until they are activated by the radiation environment. One of the major differences between radiation-hardened and commercial fabrication technologies is the attention given to controlling parasitic elements. Two parasitic elements that are particular problems for space applications are (1) field oxide (FOX) N-channel
v-7
transistors which provide leakage paths between adjacent gate oxide (GOX) N+hannel devices and between Vdd and Vss, snd (2) the four-layer (PNPN) SCR (silicon controlled rectifier) devices that consist of P-source/dr@ N-well, P-substrate, and N-source/drain. The parasitic SCR can be turned on by a heavy ion strike and produce a latchup condition that may catastrophically damage the microcircuit. These two effects often constitute the primary failure mectisms for co--ercial microcircuits.
MICROCIRCUIT FUNCTION
I
1
I
I
I
MACROCELLS
I
I
I
PRIMMVE
I CELLS
EIEHEHEIEIEI TRANSISTORS m
EEl
I
I
I
I
PARASMCS
‘-sEzClm
Figure 3. The microcircuit design hierarchy. The radiation effects associated with the GOX transistors also play a dominant role in determining radiation tolerance. Total ionizing dose effects produce both increases in the “offstate” leakage in N-channel devices and changes that tiect the switching point and “on-state” I/V (current/voltage) characteristics of both N-channel and P-channel transistors. A single particle strike can produce an ionization track which generates a current pulse that may temporarily discharge or charge the drain nodes of N-channel and P-channel transistors, respectively. Since transistors form the basic switching elements, changes in their operating characteristics are manifested as variations in performance throughout the microcircuit. The radiation-induced changes in tisistor characteristics and the activation of parasitic elements require the designer of a radiation tolerant part to modi~ the layout and electrical design of primitive logic cells such as gates (NAND, NOR, etc.), latches (D type, J-K etc.), and memory cells. For total ionizing dose, he must consider the changes in propagation delay and V-8
. ..—.
. . . .
.. . . . .++
,_
imbalance propagation from low-to-high (t@lh) and high-to-low (t@l). He may also limit the fan-in (i.e., the number of inputs to a cell), fan-out (i.e., the number of other cells connected to the output), and the maximum load attached to the output (fan-out load plus interconnect load). Primitive cell types (e.g., multiplexer) which are particularly sensitive to leakage current and/or crosstalk between logic states of adjacent devices may require significant changes in transistor geometry and cell layout for radiation effect mitigation. Similarly, single particle effects usually require changes in primitive cell design. Spacing between elements which constitute potential Iatchup paths may be changed, additional well and substrate contacts may be added, and guard bands may be used. Additional elements such as capacitors and resistors may be added to memory and latch cells to increase the charge associated with a node logic state or to filter the high frequency ionization transient. These restrictions cause the electrical circuit and layout topography of primitive cells for radiation tolerant designs to be much different than those found in commercial implementations. The importance of using radiation tolerant design principles extends to the highest level of the design hierarchy. The organizational structure of the design is affected by preference for such approaches as (1) synchronous designs over asynchronous, (2) logic cell implementations of state machines rather than ROM implementations, (3) limitation of connections to data and address buses, and (4) addition of bits to data path elements to support error detection and correction. Issues associated with post-irradiation timing control make the design of a robust clock driver and clock distribution scheme particularly important to ensure that clock skew problems (i.e., one block of the circuit receiving the clock before another block) do not occur. In general, radiation tolerant design practices for both single event effects and total ionizing dose restrict design flexibility and typically require larger die to perform the same microcircuit function. A few specific examples in the following paragraphs will help to highlight the interdependence of radiation tolerant design practices at all levels of the design hierarchy. 4.0 Total Ionizing Dose Effects on Parasitic
Elements
An illustration of the structure of the parasitic FOX transistor is shown in Figure 4. The leakage paths are formed when positive charge is trapped in the field oxide and inverts the surface of the P-type material[8]. A P-epi technology is depicted in the figure, hence the leakage path is associated with the substrate region. If a N-substrate/P-well technology were used, the leakage would be associated with the P-well. Two types of leakage paths are possible. The first is between the N-well (which is typically connected to Vdd), and an N-plus source (which is connected to Vss). The leakage path connects Vdd to Vss, producing a change in supply current with increasing radiation. Usually, the supply current increases to a maximum value as the trapped charge builds up and then decreases to a relatively constant value as interface states increase, offsetting the trapped positive charge effects. The development of the leakage path is aggravated by polysilicon interconnects overlaying the leakage path region. The pol ysilicon acts as a gate to the parasitic FOX and, if biased at Vdd, produces a field which increases hole trapping. Radiation tolerant design practices should not permit polysilicon to extend over the well-to-substrate boundary. Field oxide leakage paths can also span the N-plus source/drain regions between adjacent N-channel transistors. This will increase the Vdd to Vss leakage, and is particularly problematic v-9
if dynamic logic is used. It will bleed off charge which is used to represent a logic state between refresh cycles. Since this can result in logic errors, radiation tolerant design practices should not permit dynamic logic or logic states that do not swing all the way to the rail voltages (Vdd and Vss). Furthermore, adjacent N-plus source/drain regions should not be allowed without an intervening channel stop. A channel stop is a more heavily doped P-region implanted at the interface between the field oxide and the silicon. The increased doping makes the channel stop region more difficult to invert and effectively breaks the leakage path. Most commercial processes include some type of channel stop or field threshold adjust step to control mobile ion effects. However, the step may not be carefully monitored, and the tendency of the P-type implant (i.e., boron) to be leached out during the field oxide growth process results in considerable variability in lot-to-lot leakage characteristics in commercial microcircuits.
Vdd
Vss
Q
Q
P-epitaxial
p+
layer
‘
substrate
Figure 4. Field oxide leakage path from N-well to N+ source. The results of field oxide leakage characterization of a typical commercial, submicron technology are shown in Figure 5. Prior to irradiation the parasitic field oxide transistor required approximately 15 volts to turn on the leakage path. The turn-on characteristic is clearly degraded at low doses. An accumulation of 10 Krad(Si) produces significant leakage at 5 volts, and 30 Krad(Si) produces leakages which would render the microcircuit inoperable. Field oxide inversion is the dominant failure mechanism in many commercial microcircuits. The failure may be due to supply current greatly exceeding its specification (e.g., SRAM standby current), input and output leakage currents exceeding parametric specification (e.g., output tri-state leakage), or catastrophic fictional failure. Large increases in static supply “current with dose are indicative of field inversion. Since there is an interaction between trapped holes and interface states which causes the supply current to peak and then decrease, the supply current must be monitored at intermediate doses as well as the maximum dose required. For commercial parts, a test sequence with intermediate read points at 5, 10, 20, 40, 80, 160 Krads, etc. is convenient and samples the radiation effects frequently enough to estimate the peak Idd response. To detect logic errors in microcircuits which may contain dynamic logic, the slowest specified clock and refresh rate should be used to allow the leakage path to discharge data nodes. v-lo
.
Commercial
Submicron FOX l/V Characteristics
le4 1e-5
:88 ■
,..-
●’*
I
@ WV
1e-6
v
1e-7
10 Krad(Si)
v
1e-9 le-10
-:
le-11
-
6 Krad(Si)
#
Vv
v
v
1e-6
-m
~fl
Vv
““’
v v
r’ 8~*
● °
●
‘rad(si) W Pre-rad
●
le-12
le-13
-5
0
10
5
Field Oxide Transistor
Figure
Gate Voltage
5. Total dose effects for a typical commercial,
20
15
25
(Volts)
submicron CMOS technology.
In space radiation environments, the total ionizing dose is built up very slowly. Since there is some compensation of trapped positive charge by interface states, a low- dose rate test or a high dose rate test with room temperature anneal may by more representative of the performance to be expected in the actual space application[9]. However, caution should be exercised in interpreting the data, since the time dependent mechanisms associated with trapped charge annealing and interface state formation in field oxides with low electric fields are not well understood.
5.0 Single Ion Effects (Latchup) on Parasitic Elements The latchup path structure for a P-epi technology is shown schematically in Figure 6[1 O]. The path has been shown in terms of the traditional cross coupled transistor model of an SCR. Under normal bias conditions the P-substrate is held to the lowest potential in the circuit, and the N-well is held to the highest potential. None of the junctions associated with the PNPN structure are forward biased, and the SCR is off. However, an ionization track associated with single particle strike can produce a current transient which will inject charge into either the cathode or anode gate regions[l 1]. The voltage drop associated with this current flowing to either the well or the substrate contacts can be sufficient to forward bias a local portion of the junction. This
v-l]
results in bipolar transistor action that rapidly becomes regenerative as the parasitic SCR turns on. The latch path will conduct until the voltage across the path fidls below the holding voltage, typically Vh, + V=* Since this is a low impedance path and often occurs between adjacent Vdd and Vss contacts, the currents may be large enough to burn out metallization and catastrophically damage the microcircuit. In huge microcircuits, the resistance in the power bus Such an may be large enough to limit the cument to a level below the burnout threshold.
occurrence is called a micro-latch and is often encountered in testing commercial microcircuits. Although the micro-latch is not catastrophic in the sense that burnout occurs within a matter of minutes, its impact on reliability will depend on the cell layout snd metalhzation dimensions. As a general rule, high current density in microcircuit metdli.zation should be avoided if possible.
Vss
Vdd
Vdd N+ contact
p+
sourw
souru
Ca~hode
Agode
9
N+
Vss p+ conta~ ?
N-well )
L
P*Di P-substrate
Figure 6. Latch path for P-epi CMOS technology (Ochoa and Dressendorfer,
8 1).
Test procedures to determine the susceptibility to latchup of commercial microcircuits must be performed carefidly to ensure that they are thoroughly evaluated. Testing should be performed at the maximum temperature to be experienced in the application, since devices are more susceptible to latchup at elevated temperature. Power supply voltage should be kept at its maximum operational value. The power supply current should be monitored carefully, and the clock frequency should be kept as low as possible so that operating currents do not mask the As noted previously, different revisions of the same microcircuit may be latch currents. implemented using significantly different feature sizes, wafer material, and processing, and hence, have different latchup susceptibility. Care must be taken to ensure that the same version that has been tested is used for the flight parts. There are several design practices which can reduce susceptibility to latchup. A cross coupled transistor model of the parasitic SCR suggests that if the gain product of the parasitic NPN and PNP transistors is reduced below 1 (i.e., ~m~mP < 1) over the possible range of collector curren~ the latchup condition cannot be sustained. Also, if the anode or cathode gate junctions are shorted so that a Vh. cannot be maintained, the path will not latch. This model for latchup maybe overly simplistic for modem microcircuits with small fkature size. Research by
V-12
Sleeter and Enlow[l 1] has suggested a functional relationship between holding voltage, holding current, epitaxial layer thickness, N-plus to P-plus spacing, and doping profiles. For example, Figure 7 contains the results of an analysis for 10 pm spacing between anode and cathode in a typical CMOS process (single well without retrograde). It indicates that an epi thickness less than 3.5 pm may be necessary to maintain holding voltages above Vdd even for 3.3 volt technologies. Their model suggests the drift conduction processes play a more important role than would be suggested by the transistor-based model. However, the design practices employed to reduce susceptibility remain the same. They include (1) increasing the spacing between the Nplus and P-plus source/drain regions and the well edge, (2) adding N-plus guardbands in the Nwell and P-plus guardbands in the P-substrate to reduce the gain of the parasitic transistors and control the potential of the well and substrate in the latch patlz and (3) increasing the number of well and substrate contac~ and decreasing the distance between the contacts and the latch path (see Figure 22 in Section 13).
Latchup Holding Points for Various Epi Thicknesses 1000.0
r
r
100.0
10.0
No epi 1.0
0,1
0
1
2
3
4
Holding
voltage
5
e
7
8
(volts)
Figure 7. Radiation induced shifts if field oxide transistor
characteristics (Sleeter and Erdow, 1992) 6.0 Radiation
Response Mechanisms
Affecting
Transistors
The radiation effects mechanisms affecting the intrinsic MOS transistor include hole trapping in the gate oxide and interface state generational 3]. Positive charge trapping in the gate oxide results in threshold voltage shifts which push N-channel transistors toward depletion mode operation and P-channel devices toward enhancement mode. Interface state buildup shifts both N-channel and P-channel devices toward enhancement and decreases the slope of the
V-13
,..
.
..
.
subthreshold L/V (current/voltage) characteristic. In addition, the mobility of the devices is decreased resulting in a reduction in drain cument as a fimction of gate voltage (i.e., reduced transconductance). These effkcts are readily observable in the pre- and post-imadiation IN characteristics as shown in Figures 8 and 9 which display N-channel and P-channel data from The reader should note that the N-channel commercially processed 0.8~m technology. measurement has been taken on a re-entrant transistor (i.e., an annular gate with no edge). This permits the radiation effixts of the intrinsic gate oxide transistor to be obsenwd independently from the edge efkcts which will be discussed in Section 7.0.
Re-antrant
-1
0
N-channel
in 0.81.LCMOS
1
2
3
Technology
4
5
Gate Voltage (volts)
Figure 8. Total ionizing dose data for a re-entrant N-channel transistor from a commercial CMOS technology. Manifestations of the radiation mechanisms include: (1) increased leakage due to a decrease in the slope of the subthreshold characteristic, (2) time dependent changes in leskage
and drive current as N-channel threshold voltage initially shifts toward depletion mode under the influence of oxide trapped charge aad then toward enhancement mode as interface states buildup, (3) mismatch in N-channel snd P-channel drive current as trapped charge and interfhce states move the P-channel further into enhancement. Ultimately, the changes in the I/V characteristics of the transistors will decrease the frequency of operation aud the functionality of the microcircuit.
V-14
.-,
—..
Subthreshold Characteristic for 0.8 micron P-channel -5 1e-3
?
1e-4
r
4
-3
I
-2
0 r
-1
,
1 ~ 10-3
3
, 10+
1e-5
, 1e-5 f
~
10-6
: .
,*7
z
- 10-6
la-6
g
1
*
ii! G
1e-7
r
● la-9
le-lo
●
60t@ad(Si)
w
160 Krad(Si) Post-anneal
v
a-9
w
P
a-lo
*
!E
1
w s
10-11
j
10-12 ~ 10-13 f
-5
e-s
*
Pre-rad
e-11 10-12
v 1
,
,
1
-4
-3
-2
-1
10-13
0
1
Gate Voltage (volts)
Figure 9. Total ionizing technology.
dose data for a P-channel
transistor
from a commercial
CMOS
To observe the worst case effects of these changes on microcircuit operation, testing must consider the effects of dose, time, and bias on the post-irradiation results[l 4]. In most laboratory testing the dose rates used in accumulating total dose are usually quite high (50 to 300 rad(Si)/s for CO-60 and 104 rad(Si)/s or greater for X-ray irradiation sources). This may be conbasted to typical space irradiation rates of 1 mrad(Si)/s or less. The high dose rates tend to maximize the effects of trapped charge, especially emly in the irradiation cycle. Effects fi-om interface states, which require a longer time to build up, are most observable later in the irradiation. In order to determine the maximum effects of interfme states, a high-temperature anneal (1OO”C for 168 hours is specified in Mil Std 883 Method 1019.4 [15]) is typically performed to significantly reduce the amount of ~sitive trapped charge. The results of such an anneal for both the Nchannel and P-channel transistors can be seen in Figures 8 and 9. The post-anneal characteristics bound the changes expected from interface state effects resulting from lowdose-rate space irradiations. Biasing voltages also significantly affect the post-irradiation characteristics. A positive voltage on the gate with respect to the substrate enhances the sepmation of hole and electron pairs generated by the ionizing radiation and causes the centroid of trapped charge to be located closer to the oxide/siliccm interface where it has the greatest effect on charge carriers in the silicon. Bias voltage effects on trapped chmge and interface states are summmizd in Table 1. V-15
Table 1. Bias Condition Effects on Total Ionizing Dose Mechanisms DeviceType
Max Trapped
Min Interface
Charge
Min Trapped Charge
Max Interface
Terminal
statm
states
NMOS - Gate
Vdd
Gnd
Vdd
Gnd
NMos - Drain
Gnd
Vdd
Gnd
Gnd
NMos - source
Gnd
Vdd
Gnd
Gnd
Gnd
Gnd
Gnd
Gnd
PMOS - Gate
Vdd
Gnd
Vdd
Gnd
PMOS - Drain
Gnd
Vdd
Gnd
Vdd
PMOS- Source
Gnd
Vdd
Gnd
Vdd
PMOS - Substrate
Vdd
Vdd
Vdd
Vdd
NMOS - Substrate
As noted previously, the trend in commercitd CMOS technologies is toward thinner gate oxides for the intrinsic transistors. This translates into improved radiation tolerance since the trapped charge is proportional to the square of the reciprocal of the oxide thickness. However, interface states do not exhibit the same strong correlation and exhibit a complex dependence on post-processing conditions (temperature and atmosphere). Commercial processes may not Consequently, interface state effects show maintain tight control on such conditions. considerable variability from lot to lot. Space systems using commercial parts should design their test program to monitor microcircuit pefiormance changes due to interkce state effects[16].
7.0 Radiation Response Mechanisms
for Parasitic Edge Transistors
As illustrated in Figure 10, MOS transistors are typically designed for a self aligned process in which the polysilicon gate material is deposited over a thin oxide region. The source/drain implant is then performed and fills the region not covered by field oxide and poly. This process is very manufacturable and produces very dense circuits. Unfortunately, the material at the transition between the field oxide and the thin oxide produces a parasitic transistor that is very susceptible to total ionizing dose effects.[17] The silicon dioxide in this region (known as the bird’s beak) is under mechanical stress produced by the dynamics of the oxide growth process and the transition from thin to thick oxide. It is also subject to implant damage from exposure to the heavy sourceklmi.n implants. The transition region oxide is of variable thickness and experiences a relatively high electric field from the combination of poly gate bias and the fringing fields fi-om the source to drain bias. When this region is exposed to ionizing radiatiom
affects the W characteristic increased by approximately increasing doses, more of the leakage will quickly rise to transistor.
significant hole trapping occurs and as shown in Figure 11. At 150 Krad(Si), the edge leakage has three orders of magnitude over its pre-irradiation value. At edge parasitic will become involved in the conduction path, and the become roughly equivalent to the on-state current of the intrinsic
V-16
.,
.
-.
.
BIR=AK “N=AL”
FIELD-OXIDE REGION
REGION CHANNEL REGION POSITIVE TRAPPEDCHARGE INDUCED CURRENT LEAKAGE PATH
Figure 10. MOS transistor cross section indicating charge trapping in the bird’s beak region (McLean and Oldham, 1987).
Conventional
N-Channel
FET (#11)
1 e-3
I
1 e-4 1 e-5
z
1 e-6
1 le-14
I
--—F–~–
Prerad 150K rad Anneal
~
-2-1012345
Gate Voltage (V) Figure 11. Conventional commercial technology 2-edge N-channel transistor exhibiting edge effects induced by total ionizing dose.
V-17
Edge leakage increasing total dose, intinsic Transistor. becomes permanently
effects are typically manifested as a rapid increase in supply current with as the parasitic edge transistors shunt the source-todrain current around the They may also produce fictional failure as the composite N-channel “on”.
Parasitic edge effits
can be distinguished
from field oxide leakage and intrinsic transistor effects by testing transistors of equivalent width in two edge and multi-edge (i.e., paralleled 2-edge transistors) varisnts. Alternatively, two edge transistor characteristics can be compared with re-entrant transistors which have no edge connecting the source to drain. Commercial foundries pay little attention to the characteristics of the edge transition region since the edge parasitic is not turned on in commercial applications. Thus, edge parasitic in commercial devices are quite susceptible to radiation effects, and there is large variability in radiation tolerance since there is little control of the process parameters af%eting hardness. Therefore, edge effect mitigation for commercial processes usually requires layout modification. Three radiation tolerant layout approaches are illustrated in Figure 12. The most effkctive layout uses the re-entrant design for the N-channel transistor which totally eliminates the thin-tothick oxide edge between source and drain. Chily the N-channel requires a re-entrant design since the P-channel does not experience edge inversion. There are several disadvantages to using a re-entrant layout including: (1) increased area requirements in comparison to a 2-edge, (2) increased capacitance (gate and source/drain), (3) difficult width scaling due to comers and differences in inside and outside perimeters, (4) lack of symmetry between source/drain and drain/source characteristics (i.e., not bilateral), and (5) complication in sizing a P-channel tmmsistor with equivalent drive strength. The least intrusive layout mitigation technique is the dog bone design. It widens the poly at the step over the edge. This produces a longer parasitic channel and decreases the leakage cument by changing the effective width-to-length ratio. It also reduces some of the tinging field intensity associated with drain-to-source bias. The net effect is a reduction in leakage current but not an elimination. Other disadvantages include: (1) a decrease in the effective width of the intrinsic transistor, (2) increased gate capacitance, and (3) additional complication in layout. A third layout alternative is the use of nested transistors. It employs a mask-defied source/drain implant region nested inside the thin oxide. One end of the poly may also be nested inside the thin oxide as depicted in Figure 7. In concept, edge leakage is eliminated since only thin oxide interfaces with the source and drain, In practice, edge leakage usually occurs along the edge of the poly to the bird’s beak region and back along the opposite poly edge to connect the source/drain. This is a much longer path length than a normal 2-edge device and significantly delays the onset and magnitude of the leakage current. However, many commercial processes do not include provisions for a nested source/drain. Other disadvantages include: (1) increased capacitance, and (2) reduced circuit density. Edge leakage poses a serious limitation microcircuits, and layout mitigation techniques complexity for increased hardness.
V-18
on the radiation tolerance of commercial require significant tradeoff of area and
Thin oxide boundary _./ a. Conventional
2 edge NMOS
Thin oxide boundary~ b. Nested 2 edge NMOS
Thin oxide boundary / c. Dog bone 2 edge NMOS N+ some
I
Polysilicon gate \
Thin oxide boundary d. Re-entrant NMOS
N+ drain
J
Figure 12. MOS transistor layout alternatives.
V-19
.,
8.0 Single Particle Strike Mechanisms
Affecting
Transistors
At the transistor level, a single ion strike is experienced as a cument transient appearing at the sourcddrah node [18]. If the magnitude of the charge in the transient is sufllciently higl-q the tiormation stored as charge on the node may be lost. The amplitude and duration of the transient is determined by the ion species, its energy, and the fabrication technology of the microcircuit. Experimental and predicted waveforms of single ion strikes on silicon diodes for heli~ silico~ and iron are shown in Figure 13 [19]. A normally incident iron ion of 100 MeV has an LET (linear energy transfer) = 28 MeV-cm2/mg. Since the effective LET scales as sec9 (where 0 is the incident angle of the ion with respect to the perpendicular), the iron ion incident at 45° and 60° would have LETs =40 and 56 MeV-cm2/mg, respectively. These values cover the low end of acceptable SEU and SEL tolerance. While these waveforms represent a good starting point for simulation of SEU in a design, the waveforms in actual commercial CMOS microchcuits may be much different. The shape and amplitude of the transient will be dated by the existence and thickness of an epitaxial layer, the doping profile and depth of the well, and the lateral spacing between adjacent transistors, Consequently, experimental evah.ution of LET thresholds and cross sections is important to determine the SEU tolerance of the design. Testing should be performed on arrays of cells which are typical of the layout geometries to be used in microcircuit designs. Where the designs are based on a cell library (e.g., gate array and standard cell designs), arrays of flip flops can make an effective test structure. Ex amination of the waveforms in Figure 13 reveals the FWHM (W width at half maximum) pulse width to be greater than 100 ps for the fastest transient. Typically, the propagation delay for simple logic gates in submicron CMOS can vary from 50 to 200 ps depending on the design approach used. This means that single event transients can be propagated through combinational logic quite readily. Therefore, the effects of propagated upsets must be considered in the design of the radiation tolerant microcircuit [20]. Several approaches exist which may be considered for mitigating single ion transients for better SEU performance. They begin with selection of a technology with a structure to minimize charge collection. An epi process with the thinnest epi layer and the highest substrate doping is most desirable to reduce fi.umehng effects (i.e., early enhancement of charge collection by the acceleration of charge carriers in the cylindrical ionkmtion sheath by the distorted electrical field from the junction depletion region) [21]. The funneling effat is proportional to LETn (where q is an empirical value between 1 and 1.33) [22] [23]. For an LET of 28, the maximum enhancement due to fimneling will be a factor of 3 (i.e., enhancement= 281’33/28).
An alternate technology selection approach would be to choose an SOI technology. Access to SOI foundries is becoming more generally available to designers. In addition to commercial foundries in Europe, projects by DARPA with IBM and MIT/Lincoln Labs and by Sematech may result in commercial SOI foundries in the fhture. Caution is required in selecting a commercial U.S. SOI foundry, because the technology may not make provisions for body ties. In that case, the parasitic bipolar gain multiplication of the initial current transient may negate the benefits of the reduced collection volume [24].
V-20
●O
– fi~v FtSCE8
+ DATAI 1.0
●a@
●.1sO.*O -0.06
0.00 0
-0
100
1-
Zoo
a
8a
400
TIME (p,) 10 —P18cu
I
L o
200
100
TM!
Figure 13. Cunenttransients
I
fromion
aoo
400
600
(p@
strikes (Qudson
and Cmpbell,
l99l).
Layout modifications can also be used to mitigate single ion transients at the transistor level. Forexample, thearea of thedrain region can beminimized toreduce tocross section of vulnerable nodes. For conventional 2-edge designs, this means eliminating extended drains used as crossunders or intermediate routing layers. This is particularly important where silicided source/drain regions provide low resistivity material. For re-entrant designs, it means using the interior of the annulus as the drain because of its smaller area, and using a single contact per drain. Since a significant amount of the charge in an ionization path can be collected by diffision (particularly for high LET strikes), layout modifications to increase the distance between adjacent devices and to introduce recombination regions can be helpful. Introduction of v-2 1
,.
substrate and well contacts between devices can help prevent multiple bit upsets from a single particle strike. In general, SEU is extremely challenging for radiation tolerant designs using commercial technologies. The transient amplitude at the transistor level will be prirnarily determined by the starting material and the process used by the foundry. Most of the improvement in tolerance will have to come from design at the primitive cell, macrocell, and system level.
9. Total Ionizing Dose Mechanisms
Affecting
Primitive
Cells
The transistor-level radiation effkcts mechanisms and mitigation approaches discussed in the preceding sections are the foundation for designing primitive cells with the required radiation tolerance. Additional discipline in the design process at the primitive cell level can minimke or In general, the radiation sensitive intensi@ the manifestations of radiation degradation. parameters of interest at the primitive cell level are associated with the DC and AC performance DC parameters include supply curren~ input noise mar~ and output drive characteristics. levels as a function of Vdd, temperature, output loL@ and radiation dose. AC parameters include risetime, fdltime, and propagation delay. The degree to which these primitive cell parameters are affected depends on the circuit topography selected for the cell as well as the changes in transistor characteristics. Two extremes of cell topography are represented by the basic NOR and NAND gates as illustrated in Figure 14, The NOR gate combines a series connection of P-channel transistors and a parallel connection of N-channel transistors to perform the logic fimction. As the N-channel transistors are exposed to an ionizing radiation dose (in a time frame where oxide trapped charge effkcts dominate), their leakage current increases, and their threshold voltage moves toward depletion operation. The net effect on NOR gate pefiormance is to (1) increase the supply current in the high state, (2) decrease VWin (tiput high state threshold), (3) lower VOH (output high state voltage), (4) decrease the fall time, and (5) reduce ~~l. A the P-channels are exposed to an ionizing radiation dose, their threshold voltage moves further into enhancement and their maximum drive current decreases. The effect on NOR performance is to (1) increase the supply current switching transient, (2) decrease V1~~w (input low state threshold), (3) lower Vo~, (4) increase the rise time, and (5) increase ~~. . These effects are enhanced by the parallel connection of the N-channel transistors and series connection of P-channels. Larger NOR gate fan-ins aggravate the effects by pitting more series Pdm.nels against more parallel N-channels. This is often designate-d as a “stacking height” problem in reference to the stack of P-channels in series. In radiation tolerant designs, stacking height is usually restricted to 4 transistors as part of the design discipline. Typically, NAND primitives are somewhat less sensitive than NOW to total ionizing dose degradation. The parallel combination of the P-channels helps compensate for their reduced drive capability, and the series combination of the N-channels compensates for their increased leakage and lower threshold voltage. For this reaso~ NAND configurations are often chosen for radiation tolerant designs. However, the impact of specific Wnsistor radiation degradation mechanisms for a selected technology must be considered in selecting a topography.
v-22
NAND Topography
NOR Topography Vd In 4 d h 3 4 In 4
Vdd
1,10$% (n G % 2
Vdd
d$
Vdd
dgld+i
ln_l e
-i
—. Q
4 ln_2
=
out
Ss
=
cn G c-l x ~.
4
ln_l ~
w~w~w~
0“’
‘n+ In_4
; ;ss _._[
~
Vss
Figure
14. NOR and NAND topographies illustrating transistor stacking height.
The effects of cell topography on post-irradiation performance should also be considered in selecting worst case test conditions for primitive cell. Both the NOR and NAND should be irradiated with a high state on all inputs. However, the worst case post-irradiation test condition for leakage will be with all inputs held low for the NOR and all but one input held high for the NAND. The discussions
above have only considered
oxide trapped charge effects.
The postirradiation behavior will be quite different in the regime where interface states dominate. Both regimes (as appropriate to the mission application) should be analyzed as part of the design process and tested for worst case performance [25]. The full testing and annealing sequence specified in Mil Std 883 Method 1019.4 should be conducted to bound the oxide trapped charge and interface state effects. As a general rule, radiation tolerant designs should prefer NAND-type topographies. Even in those configurations, the stacking height should be limited to prevent significant disparities in high-to-low and low-to-high switching characteristics: Other configurations to be avoided can be deduced from a consideration of the interaction between the manifestations of radiation damage in the transistor and the function to be performed by the circuit. For example, the use of N-channel pass gates in cells such as multiplexer and flip flops should be avoided due to their tendency to develop source-to-drain leakage paths. P-channel pass gates also create problems because of their increase in on-state resistance with total dose. Careful primitive cell designs
can mitigate
many of the radiation
effects introduced
10. Single Particle Effects Affecting
at the transistor
Primitive
level.
Cells
The generation of bit errors which are detected as single event upsets occur at the primitive cell level as the circuit interacts with the current transient generated by the ion strike. Consequently, the electrical design and layout of the cell have a major impact on SEU tolerance. Similarly, the layout rules used at the cell level determine placement ‘of well and- substrate contacts
and the spacing
between
adjacent
elements
V-23
that
can participate
in a latchup
path.
Therefore, most of the design work directed at reducing the occurrence of single event effects occurs at the cell level. Commercial CMOS circuits designed for the highest possible packing density will minimize the spacing between N-channel and P-channel sources and the well edge and use infrequent well and substrate contacts. These practices do not pose a problem for commercial applications, but are highly likely to produce SEL problems. Testing at a particle accelerator is the only way to conclusively determine the SEL susceptibility. However, inspection of the die surface can provide valuable insight into the layout practices used. Also, a spreading resistance measurement can provide information on the existence and thickness of an epi layer and the depth of the well. The metallization layers must be removed to observe the relationship of the well edges to the -istor sources, but the polysilicon layer should be left in tact to facilitate identification of transistors. Commercial ftilure analysis laboratories can perform the delayering and photography for approximately $400. Spreading resistance measurements can be made for $150 per site. Typically, two sites are suilicient to determine well depth and epi thickness. For $700 (approximately the cost of one hour of accelerator time), a great deal of information can be
gathered on critical design practices, In the opinion of the authors, any radiation testing of commercial microcircuits should be preceded by die surface inspection and spreading resistance measurements. Furthermore, this Mormation should be documented in the test report so that fbture applications of the microcircuit can be verified to be the same version as previously tested. The critical design issues for improved tolerance to SEU can be identified by considering Petersen’s equation for estimating errors per bit day [26]. R= 5x10-10 abc2/QC2 where R = error rate in errors per bit day ab = area for the critical node in square microns c = collection depth in microns Q.= critical charge in picocolombs.
The area of the critical node is under the layout designer’s control and should be kept as small as possible within the constraints of the design rules for the technology, Extended drains should especially be avoided since they increase the sensitive node area. The collection depth is determined by the processing technology and is not within the control of the designer after the processing foundry has been chosen. The collection depth is essentially the epitaxial layer thickness, which can be determined from spreading resistance measurements. For 0.8 to 1.2 pm CMOS commercial technologies, a typical epi thickness is 12.5 microns. Since the error rate goes as the square of the collection depth, choosing the process technology with the thinnest epi is clearly advantageous. The designer has the most direct control of the critical charge required to upset the cell. By choosing transistor dimensions and cell topography, he can determine the value of Q,. In ASIC designs, many different types of latches can be used, and their design requires carefid attention. The latch shown in Figure 15 was selected by a commercial standard cell designer for a data register. It uses transmission gates to control the presentation of the data to the latch. In
V-24
the first section, the principal inverter is kept relatively small to permit it to track the input data quickly. The drive current of the cross coupled inverter is kept low (W/L = 0.4) so that it is easily overpowered by the input driver. While the clock is low, the first stage tracks the data. When the clock goes high, the first stage is decoupled from the input and connected to the second stage. The high clock also breaks the cross coupling between second stage inverters so that the main second stage inverter is driven by the main first stage inverter. When the clock goes low again, the cross coupling is reactivated in the second stage and the state is latched. The schematic of the latch cell contains all of the resistive elements associated with the source/drain regions and the poly used for interconnect. The SPICE model used for the SEU evaluation also contained all the parasitic capacitances associated with the silicon, poly, and metal layers. ~---..~T..__
,
6
I
II
5
1
—
r
REGISTER
d
i .—
.7
I
—
—
2
‘~’m
CELL
. ,, i w 41~u LE -1 ~1 -r “k>
I--l w% M9 64
NH05
10
R4 5*
;:
H?B PH 5
el
25
“2! Ff10
I
I
“:3kJlLYi
‘tit
R23
32
12.5
25
1
5
61
2
“2
_&“
22
301
72
.24
1-
Rt2
1413 NMR
;;5
(00
VCL,
I(1, yo!
!!10 IIM05
5
PI(D5
“
*YG 75
?2
~
18
R13
N*O5
116
190
,13
llt4
R14 1 ma
182
v
~,q
KB
64
MMU5
!6
“,>
101
u
[1 11
I
---1
I
!!,,.,..R..,.r.h
C0..0,.,4..
1??*
d
Ra”dalmh
1,,1.
—
S.E.
RCGISIER slz.
~.=u
~...!
m=.,
~
Figure
1
6
I
—
5
— 1
4
s,.!.-..
81t B6_
N.” H,.,25 ~m). The upset response for a simulated strike on node 132 is shown in the figure. Although there is sufficient charge in the f$EU transient to deeply discharge the node, significant replacement current is provided by the PMOS transistor M26. Indeed, the node begins to recover, but the state transition has already begun in the primary inverter, and the feedback drives the latch to a complete state transition. Additional simulation shows that a strike at node 18 is similarly effective in initiating an upset. Integrating the waveforms yields a critical charge of approximately 200 fC. The area of the nodes is 16.6 ~m2, and Peterson’s equation gives an
V-25
estimate of 3 .2x 10-5 errors per bit day. This would be an unacceptable error rate for many space systems. However, many latch designs used in commercial microcircuits upset at even higher rates. Although the topography of the example design is not good for an SEU tolerant latch, relatively simple modifications could be made to decrease the error rate by an order of magnitude. These include: (1) increasing the size of the restoring transistors M25 and M26, (2) rearranging the layout to reduce parasitic resistance in the restoring path, and (3) increasing the time constant in the feedback loop of the second stage.
Date/n
..,. J,
Ime run:
07/28/93
15:10
●
ECWI/LPESPICE FILE : WTRE6 [tlOS+CW Temrmml”re
!05
:
27,0
...............
Woul
400!4
mou
a
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
--------.-7...-.. ........... ..............................--------------..........................................------------------------------
I(IINI ‘“ .................
6,0
,,
40
“\ .
0
-10
.............. ..L..........* .......................................................................................+ ............................. 4 n,
2“, V[12)
● V(13)
6.s
8.,
10”s
A V(1321 Time Ezzzx%l
Figure 16. Results from SEU analysis of commercial latch. Although SPICE analyses are helpful in identifying design issues relevant to SEU tolerance, they are not definitive. In the example used here, the ionization transient has been treated as a simple photocurrent source involving only the drain. In reality, the ionization track is a complex, three-dimensional phenomenon involving dimensions similar to the layout dimensions of CMOS transistors. Consequently, the response of the transistor may be complex and involve the turn-on of “off’ devices due to lowering of the potential at the source. Furthermore, the conduction mechanisms associated with the ionization track are quite complex [28] [29]. Our analysis has used wave shapes from measurements on simple silicon diodes. However, CMOS structures have four layers of material including the sourcejdrain implant, the well, the lightly doped epi layer, and the heavily doped substrate. The ionization track can form
V-26
.,
a shunt path through the layers, transferring charge between layers at different potentials. only does this tied transient arnplities, it also significantly tiects wave shape.
Not
Testing for SEU and SEL must be performed to detenn.ine definitive cross sections and LET thresholds [30]. Where radiation tolerant designs are being implemented in commercial processes, test chips should be characterized early in the development effort. circuits representing expected sourceklmin and well spacing, well contact spacing, and substrate contact arrangement should be included for SEL evaluation. Arrays of typical latch designs and memory Comparison of empirical LET cells should be included for SEU error rate determination. threshold and cross section data with doping profiles, layout dimensions, and SPICE predictions can be used to reil.ne modeling procedures for fiture designs. In general, worst case SEL tests should be conducted at the maximum temperature and Parasitic bipolar gain increases with temperature, supply voltage expected in the application. making the device more susceptible to latchup. Maximum voltage results in wider depletion regions and more charge collection to give the worst case transient amplitude. Some SEL testing should be performed at the lowest possible clock rate. Since the supply current is proportional to clock flequency, operating circuits at high speed may produce supply currents that are much higher than micro-latch currents. Reducing the operating frequency increases the likelihood of observing small supply current increases from micro-latches. Worst case SEU testing should be conducted at elevated temperatures and the lowest voltage expected in the application. Elevated temperatures reduce the drive current in MOS transistors. Consequently, less restoring current is available to overcome the SEU transient. Since the cument of a CMOS transistor in saturation is proportional to (V~ - V~2, the restoring current at vdd = 4.5 volts will be 2/3 of the current at 5.5 volts.
There are numerous electrical and layout design techniques which can be used to improve SEL and SEU tolerance. They include: (1) the use of guard bands, (2) closely spaced well and substrate contacts, (3) selection of robust latch topographies, and (4) maximizing replacement currents while minimizing critical node area, Some of these techniques will be illustrated in the design example at the end of this article. 11.
Total Ionizing Dose Mechanisms
Affecting
Macrocells
Integrated circuits are often referred to a systems-on-a-chip; many of the design disciplines historically associated with systems must now be applied to ICS. As shown in Figure 17, microcircuits are typically organized as blocks of rnacrocells which interact with each other at well defined interfaces. Functions such as register files, arithmetic logic units, decoders, barrel shifters, mukiplexers, etc. are referred to as macrocells. They are also referred to as data path elements, megacells, and several other names. The design and layout of these macrocells and their interconnect networks can have a significant impact on their radiation tolerance. At the macrocell level most of the unique total ionizing dose issues have to do with timing. Three general mechanisms can be identified which manifest themselves as timing problems. They include: (1) non-uniform changes in rise time, fall time, and propagation delay in different cell types, (2) changes in clock drivers which reduce the maximum drive current, and (3) changes in leakage current which degrade the charge stored on a node.
V-27
,,
.
TMS320C40
CPU BLOCK DIAGRAM
ON-CHIP 20na CACHE. RAM ANO ROM
.“,%ww%%ki?
MBYWSISWDIRECT PROCESSOR TO PROCESSOR COMMUNICATION
120
Figure 17. Block diagram of Texas Instruments 320C40 digital signal processor showing major fictional blocks. As discussed previously, transistor-level degradation mechanisms can be mitigated or aggravated by different logic cell topographies (e.g., NAND vs NOR gates). Differences in the degree of degradation among different cell types can also be enhanced by different biasing conditions [31 ]. If a particular instantiation of a NOR is predominantly biased in a worst case condition (all inputs high) while a NAND is biased in the best case condition, differences in postAdditional disparities in performance can result irradiation performance will be exacerbated. from differences in stacking height, fanout, and interconnect capacitance. If the gates with asymmetric propagation delays are located in different data paths that converge on a logic cell, an undesirable “glitch” can occur at the output. It is solely the result of differences in path delays and not an error in logic. Nevertheless, the glitch can propagate and be interpreted down stream Such an error is known as a race condition (also as valid data with unknown consequences. referred to as a hazard in some systems of nomenclature). These effects may occur in either combinational or sequential systems. A race condition is possible in a sequential system any time the transition from the PRESENT_STATE to the NEXT_STATE involves two or more bits (state variables) [32]. In general, race condition problems are alleviated (not eliminated) by designing synchronous systems in which data are only considered valid at fixed intervals as defined by a clock. This has the effect of allowing the “noise” associated with glitches to be filtered out by setting the interval to be longer than the maximum accumulated propagation delay. Clearly, the clock interval must be set to accommodate worst case cell propagation delays as affected by temperature, fanin/fanout, loading, and process variation (as reflected in transistor I/v characteristics). For radiation tolerant microcircuits, the affects of total ionizing dose as affected by dose rate and irradiation bias condition must also be included. V-28
Despite its obvious benefits in mitigating race condition errors iu both combinational and sequential designs, the clock can also be a source of post-irradiation problems. These problems stem from generating the clock waveforms and distributing them throughout the die. Most complex microcircuits use a two-phase, non-overlapping clock system. A two-phase clock is used to permit combinational operations on PRESENT_STATE vectors to be completely settled prior to latching in NEXT_STATE values. The issues are illustrated by the Mealy model of a finite state machine with a two-phase clock and annotated clock wave form shown in Figure 18.
Phase’1
clock
Phasb 2 clock
Clock Phase 1 Clock Phase 2 I I
Epoch
I
I
I Phaae
1
;
T12
~
2
~T2;
I
I
‘1 time-
Phaae
I
I I
I I I I
Delay
;
I
* -‘=-wmmd Icglc 4—-——————————
-
delay
d
—————
L-
-~
P.aet
time
4
clock period
Figure 18. Two-phase clocking scheme illustrated for a Mealy machine (Mead & Conway 1980). During the Phase 1 epoch, the PRESENT_STATE vectors are latcheci into the inputs of the combinational logic block. The duration of the Phase 1 epoch must be sufficient to ensure that valid transfer has occurred. It is followed by the Phase 1 non-overlap epoch, T12, which ensures that race condition noise does not ripple through to corrupt the PRESENT_STATE vectors. The Phase 2 epoch allows the results of the combinational operations to settle at the
V-29
,.
.,
input of the NEXT_STATE latch. Its duration must be long enough to accommodate the combinational logic propagation delay and the preset data requirements of the latch. The Phase 2 non_overlap epoch, T21, is unproductive system time and is included to accommodate skew in the distribution of the clock. Skew is defined as the “variation in the effective tival time required of the clock at different clocked elements” in the microcircuit [33]. It is a function of the interconnection system and the drive strength of the clock. Commercial designers are motivated to reduce T21 to the shortest duration to improve performance. Unfortunately, total ionizing dose irradiation will affect clock skew by unsymmetrically changing the drive strength of the clock, A NOR implementation of the two-phase clock generation would produce the worst change, but it is the con.llguration found in many commercial designs. Radiation tolerant designs prefer NAND implementations for the clock electrical design. Careful attention should also be given to layout of the clock distribution network to ensure roughly equivalent interconnect distances between the clock and all clocked elements is an important consideration for radiation tolerance. While the discussion of race conditions and clock skew seem very straight forward when viewed from the perspective of a conceptual state machine and cloclq we must remember that the sequential and combinational elements are widely distributed over the surface of the microcircuit die and that communication is via data and address busses. In this context, leakage currents contribute to timing problems. Data buses are parallel interconnects connecting logic blocks and permitting data words to be transferred horn one block to the next. Typically, those blocks which can drive data onto the bus connect to its individual lines via tri-state bufkrs. When a logic block is authorized to place data on the bus, its buffkrs act as nomml gates, driving the line to either a high or low state. Otherwise, the buiTer is placed in a high impedance @i-Z) condition in which both transistors connected to the bus are turned off or disconnected from Vdd and Vss. Many logic blocks may be connected to a bus, but only one is authorized to drive it at any time. Unfortunately, after irradiation, N-channel transistors in the buiTers can develop serious leakage problems due to turn on of field oxide paths, parasitic edge transistors, or intrinsic ~istor threshold voltage shift into depletion mode. Therefore, the bufTer driving the bus must provide cument to charge the bus to the correct logic state and to overcome all the leakage currents. In the extreme case, the buffer may not be able to drive the bus to the correct logic state at all. More frequently, it takes longer to attain the required state, and the increased time may exceed the allocation of the clocking scheme. In some microcircuits the data bus is maintained in its last state by bus keeper latches until the next logic block is authorized to write new data. By their nature, bus keeper latches are small and easily overpowered by the driver buffers. For that same reaso~ they may be overcome by leakage currents. In either case, the resulting errors may be intermitten~ occurring for only some data vectors, power supply voltages, clock rates, and temperatures. Clearly commercial microcircuits, with their emphasis on packing density, high speed performance, and maximum fimctionality, are likely to be susceptible to timing errors from any of the three sources described above. Test strategies must be carefi.dly chosen to reveal such errors. Consideration must be given to irradiation bias conditions for both the I/O terminals and the internal logic. Internal logic bias should be selected for worst case conditions for those circuits susceptible to timing, leakage, or combined failure mechanisms. The irradiation test sequencing is also critically important. Some errors may only be observed under conditions of V-30
maximum hole trapping, while others occur only under rebound (interface-state-dominated) conditions. Techniques such as monitoring the supply current during irradiation may help the experimenter determine when leakage current effects are dominating. In-situ measurements of key parameters at intermediate radiation levels are essential to ensuring that subtle timing errors are not overlooked. Since in situ test resources are usually limited, analyses should be performed to select the most critical timing tests. These should be supplemented with thorough postirradiation characterization using the entire test vector suite. The designer developing radiation tolerant designs for fabrication in a commercial foundry must be especially meticulous in performing the design timing evaluation at the Robust clock designs with adequate margin to account for total dose macrocell level. degradation are especially important. Layouts should emphasize the shortest possible clock distribution nets and well matched interconnect lengths to all blocks. Clock duration and nonoverlap intervals should be selected to account for post-irradiation propagation delays. The number of connections to data buses (i.e., bus drops) should be minimized. Driver buffers and bus keepers should be sized to supply post-irradiation leakage current. Certainly, there are many more design techniques which can be employed to mitigate total ionizing dose effects at the macrocell level. These depend on the type of design architecture being implemented and the choice of macrocell building blocks. Most importantly, the designer must realize that decisions made at the highest level of the design can significantly affect postAlmost certainly, there will be some reduction in performance and irradiation performance. density required to achieve the needed radiation tolerance.
12. Single Particle Effects Affecting
Macro-Cells
The design objective at the macrocell level is to manage the effects of SEU bit errors generated at the primitive cell level. The designer must be concerned with the propagation of SEU transients through combinational logic, the capture of propagated errors in memory elements, and the effects of memory errors. The memory elements in question may be either data registers (latches) or MM (random access memory). In their discussion
of SEU effects in complex logic systems, Diehl-Nagle
et al. [34]
identified four criteria for a single event transient in combinational logic creating an error. They include: (1) the voltage transient created by the single event must be capable of propagating into
the local circuitry, (2) a critical logic pathway must exist between the struck node and a latch, (3) the transient must have sufficient amplitude and duration to write the latch input (i.e. exceed the noise margin), and (4) there must be a coincidence of the transient and the latch write enable. Figure 19 illustrates the circuit context for these criteria in tracing a node upset through the logic to a latch. Consideration of the four criteria can be helpfid in designing macrocells with improved SEU tolerance and in selecting test approaches for commercial microcircuits.
V-31
{*’D ,: ‘_ Yi5iEiElw CK
Controlinput
DQ
so -.
clocked D,atch
S1 xl YI
.8
c
Figure 19. SEU error propagation paths in a 2 bit fidl adder (Diehl-Nagle and Vinson 1984) As discussed in Section 8, the duration and amplitude of SEU transients are typically sufficient to permit their propagation through primitive cells. As feature sizes are reduced with advancing microcircuit technology, the cells become fmter, and transients are more likely to be passed as logic signals. The cell design practices for reducing the amplitude and duration of SEU transients which were discussed in Section 8 for latches are generally applicable for combinational cells. Namely, the ~istor dimensions and layout should be chosen to provide as much restoring current as possible to the struck node. Also, the capacitance of the cell inputs should be as large as is consistent with the performance requirements of the circuit. Fortunately, the very nature of a combimtional logic system prevents some transients horn propagating. If a node that is in a low state is struck such as to create a discharging transient no enor propagation will occur, because there is no state change. Similarly, an AND gate will only propagate a 1-to-O error if its other terminals are in a high state. Thus, the probability of the cell output being upset by an SEU transient on an input terminal is usually less than one. The actual probability is the product of the probabilities of physical upset for each state and the probability of occurrence of each state. The probability that an error will propagate from the struck node to a given latch is the cumulative probability from all the intervening cells. The assignment of cell upset probabilities and their use in cumulative path upset rate estimates are illustrated in Figure 20. Baze, et al. [35] have developed a formalism for estimating error propagation probabilities which has been demonstrated for static errors and is being extended to
V-32
.,
..
,,..,,
transient errors. Such a tool will be extremely helpfkl to the designer in identi~ing those data paths which limit SEU tolerance. In the interim, the designer should select the cells with the most robust SEU tolerance to drive latch inputs.
C /D
Ao-
D
p—
OUT
)-O
E
BO A
B
c
D
E
OUT
OUT-A
OUT-B
STATE PROB
0
0
0
0
0
0
0
1
0.1
0
0.1
0
1
0
1
1
1
1
1
0.4
0.4
0.4
1
0
0
0
0
0
0
0
0.1
0
0
1
1
1
0
1
0
1
0
0.4
0.4
0
umulatwe
.
IC
EOUT= POUT-AEA+ POUT-BEB = 0.8EA + Figure 20. Output error rate calculation for combinational
‘OUT-A
P~uT.~
.
1
O*SEB
network (Baze, et al 1995)
In order for a transient to be latched into a memory element, it must be able to charge (or discharge) the input capacitance to the erroneous state. If the capacitance can be made large enough to keep the node from charging (or discharging) to the noise margin during the transient, SEU tolerance can be improved. The transient can be expected to have undergone some wave shaping as part of the propagation process so that it may appear as a typical digital noise glitch at the latch input. If the input capacitance of the latch can be sized to make the RC time constant too long for an SEU-induced transient to reach the noise margin threshold, improved SEU tolerance to propagated errors can be expected. Any change in input capacitance for a cell (combinational or sequential) will slow its performance and usually increase its layout dimensions. The clocking scheme can provide some errors. As we saw in Figure 18, a system using a with effects that persist into the preset time. Any affect the latch. Transients in Phase 2 will only error that persists to the onset of preset. None of the design efforts upsets. Consequently, the designer may manifest themselves as corrupt errors may produce corrupt state processing proceeds. In processors, in the program
control
registers
benefit in discriminating against propagated two-phase clock is only susceptible to glitches transients in the Phase 1, T12, or T21 will not affect the latch if they have generated a logic
mentioned above or previously in Section 8 will eliminate must be prepared to manage the resulting errors. The errors information anywhere in the system. In state machines, the vectors which lead to inappropriate event sequencing as the errors may corrupt the data in latches or the information
(i.e., program
counter,
v-33
stack pointer,
and processor
status)
[36].
While data errors can certainly be serious, upsets in the program control registers can be particularly disastrous, since they control the sequencing of processor functions. Usually, a great deal of attention is given to upset in main memory (MM) since those upsets can affect both data and program information. Management of those errors is typically the role of the system designer. However, many processors and ASICS contain significant amounts of on-chip memory used as cache, supplements to data registers, and dual port buffers for block data transfers with off-chip circuits. The chip designer must determine his strategy for managing errors in the on-chip R4M. Since commercial designers are unconcerned about SEU-generated errors (mitigation of alpha particle effects are handled with selection of packaging material and die overcoating), engineers tasked with radiation characterization of commercial parts must ensure that the test effort addresses worst case macrocell affects, This includes considering the effects of ion selection and clocking on propagated transients. Since the duration of the SEU transient typically increases with the LET of the particle, worst case propagated SEU should be conducted with the largest LET available at the source. Also, a maximum clock frequency will produce the greatest propagated SEU vulnerability since the period of vulnerability (around preset) will be a larger percentage of the clock cycle time, The test program should be designed to detect errors occurring in program and data registers as well as on-chip memory. Designers developing radiation tolerant circuits for fabrication in commercial foundries have several design options available at the macrocell level to manage SEU errors. Synchronous designs with a two-phase clock are clearly advantageous. Particular care should be given to the design of a robust clock. An SEU transient on the clock distribution network is sure to cause malfunctions throughout the chip. Carefid engineering of the input time constants for latches can balance improved SEU tolerance with performance. For certain critical registers (e.g., program counter), a special design of a non-upsettable latch may be justified. Several circuit topographies have been recommended which can enhance SEU tolerance by (1) storing redundant bits in separate physical locations, (2) designing feedback into the cell to ensure that a node is forced to recover to its previous state following an ionization transient, and (3) using the inherent SEU immunity of certain devicektate relationships (low state data stored by an “on” N-channel transistor cannot be upset by an Nchannel strike) [37]. These designs invariably require more layout are% but they can be very valuable in protecting critical registers. Error detection and correction (EDAC) can be an extremely effkctive management approach for SEU errors. In some applications, error detection may be all that is required. It may be used to initiate reset sequences or even a cold reboot procedure. The penalty for error detection is at least an additional bit and the extra circuitry required to generate and check for parity. Full correction and detection has been estimated to impose a 17V0 to 50’% area penalty [38], Therefore, it must be used judiciously. Other similarly area intensive schemes, such as voting redundant elements, are also alternatives which can be employed for crucial functions. In summary, macrocell level design must include consideration of SEU error management. However, the selection of specific mitigation techniques must consider the LET
v-34
spectrum likely to be encountered during the mission and the trade off between the effect of errors and the performance penalties imposed by the hardened design.
13. Demonstration
of Radiation Tolerance Enhancement
Through Design
An example of the application of radiation tolerant design practices to a problem may help to illustrate many of the concepts presented in the previous sections. In a recent project, the USAF Phillips Laboratory wished to develop a design-hardened gate array to support fielding small space experiments investigating advanced signal processing and packaging techniques. The concept was to design and build fret-turn-around, low-cost experimental packages with low size, weight and power requirements. As opportunities arise, the packages would be added to launch vehicles which could accommodatethe extra payload. Since the launch vehicle and satellite may be targets of opportunity, the experimental electronics must be sufficiently robust to operate in a wide variety of orbits and to survive in minimally shielded placement in the satellite. The target radiation tolerance levels were 100 Krad(Si) total ionizing dose over all dose rates and no single event latchup. In addition, the gate array should support design of SEU tolerant cells. The functions to be implemented in the gate array were to be relatively low performance ASIC applications such as instrumentation controllers and aggravations of glue logic to replace SS1 and MSI die in multi-chip modules. Only a few die (15 to 20) of each gate array personalimtion were expected to be needed. Since there was likely to be short notice of the availability of a launch platiorm, total cycle time for an experiment was to be kept under six months. Also, the cost for the design and fabrication of a new personalization was targetted for less than $20,000. Before discussing the application of the hardened design practices to the gate array and test chip, we will first discuss the commercial foundry alternative. With the cost, production quantity and schedule constraints on the program, the MOSIS (MOS Implementation Service) clearing house developed by DARPA and managed by 1S1 (~ormation Sciences Institute) was selected as an attractive alternative for die fabrication [39], MOSIS subcontracts with several different commercial foundries to process lots on a periodic basis. As the scheduled time for the lot start arrives, MOSIS assembles microcircuit layouts submitted by their customers into a reticule and passes it to the foundry. The foundries are high volume commercial lines capable of very high performance. The Hewlett Packard 0.8 ~ CMOS foundry is particularly attractive for a design hardened gate may. A process doping profile for a section through the P-plus source/dr@ N-well, P-epi, and P-plus substrate is shown in Figure 21. The epi layer is relatively thick (= 12.5 pm), but it will still be helpful in suppressing latchup. The process uses one level of polysilicon and three levels of metal. The poly and source/drain regions are silicided with a resistivity of approximately 2 ohms per square, The gate oxide thickness is 170 ~ thickness. Design rules for the technology are available through Intemet at I?p.mosis.edu. Fabrication costs for the HP foundry are determined by the die area at the rate of approximately $600 and $650 per mmz for government and non-government organizations, respectively. The customer is guaranteed 25 die for this cost. Dedicated wafers (150 mm) can be purchased with a minimum of three wafers costing $88,700. The reticule size is 17mm x 17mm, and there is a leverage of 40 reticules per wafer. MOSIS purchases the wtiers to the foundry’s parametric specifications. The customer is responsible for ensuring that his design works within v-35
the range of the parametric limits. Layouts for the HP foundry must be submitted by the 4th Thursday of each month, and the fabrication cycle is 8 to 9 weeks. Packaging is available for a nominal fee and the cost of the packages. Table 2 gives the cost scaling for a number of different size die. The size of the Phillips Laboratory gate array (total number of primitive cells) and the number of pads (Vdd, Vss, and 1/0) are indicated. Columns show the cost for the initial 25 units and additional increments of 25 parts. Packaging costs for appropriately sized PGAs (pin grid arrays) are included in the table entries. By using design hardening practices, the cost and schedule benefits of the MOSIS/HP foundry service can be used to support space experiments. The specifics of the layout hardening approach can be seen in the primitive cell in Figure 22. It consists of two re-entrant N-channel transistors and two 2-edge P-channel transistors sized for equivalent drive strength. As discussed in Section 7, edge leakage effects are eliminated by the re-entrant design. The area Penaltv associated
with the re-entrant
transistors
.
is apparent.
p
~ti (au-3)
P
m-miusms
Figure 21. Doping profile for HP commercial CMOS process.
V-36
.
Ta e 2. Cost Scaling for Gate Array Fabrication Through MOSIS Die Dimensions
Size of Gate Army
Coststo non-Gov’t Organizations
Cost to Gov’t Organizations
inrnm
(Number of L/0)
1st25 Units
1st25 Units
(Additional 25 Units)
(Additional 25 Units)
2K
$3600
$3125
(36)
($3000)
($2605)
12K
$11,400
$9,725
(76)
($9,000)
($7645)
31K
$24,400
$20,725
(1 16)
($19,000)
($16,045)
57K
$44,850
$38,125
(156)
($35,250)
($29,805)
92K
$69,000
$58,425
(196)
($54,000)
(45,425)
2x2
4x4
6x6
8x8
10X1O
The field oxide leakage is eliminated by enclosing the N-channel thin oxide regions (source/drain) with a channel stop composed of the P-plus source/drain implant of the P-channel transistors. The P-plus sourcehkain must be used because there is no other mask definable layer for a P-type channel stop available through MOSIS. The polysilicon gate cannot be allowed to cross the P-plus, because it would create a gap in channel stop since the sourcdmin implants are self alligned to the poly. The gap would provide a possible source-to-well leakage path. Obviously, a significant area penalty has been paid for the channel stop and elimination of polysilicon interconnect to P-channel gates. However, all of the potential total-dose-induced leakage paths have been mitigated with the exception of those associated with the intrinsic transistor. The channel stop also acts as a guardband for latchup suppression. It reduces the gain of any surface lateral NPN transistor by introducing a P-plus region in the base. It also keeps the base of the NPN very close to ground potential by providing a low impedance path to Vss. The HP process is silicided so the resistivity of the polysilicon and the sourceMrain regions is approximately 2 ohms per square. The guardband has five Vss contacts located along the right edge. To be doubly sure that no SEL problems would be encountered, an N-plus guardband was placed in the N-well around the P-channel transistors. This spoils the gain of any surface lateral PNP transisistor and ties the PNP base to Vdd through a low impedance path to five contacts located along the left edge. Clearly, the double guardbands result is an additional area penalty, but the area tradeoff was considered worthwhile to minimize the prospects of latchup.
v-37
P-channels Sized forBalanced Drive Current -7
Re-entrant N-channel transistors
7
Frequent N-well & Guard Ring Contacts