Electronics Testing into the 21st Century:
Success in Test Is in Capabilities, Not Specifications

by Kirk Gray and Wayne Tustin

Development of electronics, with increasing shorter market windows, and the rapid pace of electronics invasion into almost every appliance and machine manufactured, is requiring that electronics be reliable and mature at market introduction. There may not be enough time in the market to improve a poor design. Your customers are not willing to risk it, to take another risk to purchasing another electronic product from you, and they will probable tell others about your products poor quality. Yet, today there are extremely efficient and cost effective methods to prevent field failures. These methods have been around for at least fifteen years. The evidence of the effectiveness is overwhelming, yet unpublished, because of its effectiveness. Would you publish methods that reduced your field failures by a factor of ten, letting all your competitors in on it? Probably not, and that's why the electronics testing community is still very reluctant to accept accelerated stress testing and screening as a standard approach to reliability improvement.

The methods that we are referring to are called Highly Accelerated Life Test or HALT and Highly Accelerated Stress Screening or HASS. These are methods of testing that take a fundamentally different approach. The difference is in finding the actual, not specified, operation and destruct limits and then driving those margins to the fundamental limit of the technology. The fundamental limit of the technology is the point at which the product margins cannot be extended without the use of exotic materials or methods. An example is the melting point of wire insulation or component packaging. Most electronic components are typically heated to well over 175°C for solder re-flow and can withstand temperatures much higher. Yet, most testing occurs at temperatures around operational specifications based on the end-use environment. Materials in electronics are very capable of operation in environments (typically -40 °C to +110°C) that are well beyond most electronic operation specifications (typically 0 °C to +55 °C). Those component and design weak links that limit large operational margins are the same components and design limits that will have a very significant effect on field reliability. Only by taking the products to the limits through increasing steps of stress, investigating the root cause and understanding the physics of failure, then improving that weak link, can you make a robust system in the shortest possible time. A robust system will be capable of short effective intensive, but safe, environmental screens using combined environments while being powered and monitored.

Electronics was, for many years, fragile. Glass tubes, filaments also had inherent wear-out modes that gave the electronics a limited life. Early solid state devices had mechanisms that would also cause failures in time, such as chemical contamination, metallization defects, and packaging defects that resulted in corrosion and delamination. A large percentage of these defects were accelerated by high temperature, giving rise to the successful use of "burn-in" to weed out "infant mortality".

Statistical prediction in the 1960's and '70's was accepted because designs at that time were made of mostly discrete components and statistical estimates the life of a new designs had a reasonable correlation to the actual MTBF. This was most likely due to, relative to today, a small number of devices, manufacturers, and manufacturing techniques. Today, hundreds of new electronic components are introduced to the market every week, and at the same time hundreds are being taken off the market. It is no longer possible or reasonable to even attempt statistical estimates of reliability based on a summation of components reliability, even if accurate data on current components was available. It would be virtually impossible to and useless to obtain because variation in the huge number of applications and end-use conditions.

Today's components do not have wear-out modes that are within most electronics technologically useful life. Therefore, the vast majority, if not all, of electronics failures are due to defects, either in the design or introduced in manufacturing. The most significant effects on reliability are caused by unplanned events during manufacture causing a lowering in the operating margins. It can be an EC, a change in machine operators, or a change in your vendors manufacturing capabilities that introduces a decrease in margins and a resulting increase field failures. Engineers must quit wasting time and resources trying to statistically calculate estimates of reliability. The future is unpredictable and in electronics, predictive reliability cannot be done to any accuracy that would be beneficial to a designer. Only through discovering the real capabilities and the root causes of the weak links in the design or manufacturing process, and improving them, is significant improvement in reliability realized.

The relation of field stresses and inherent field strength of a product is illustrated in Fig. 1. The variation in manufacturing the product is generally much less variable than the end use environment. Failures occur when the weakest units are subject to the highest stresses as shown in Fig. 2.

Fig. 1

 

Fig. 2

It is important to remember that when a electronic product is manufactured in volume, there is a distribution of its inherent strength around its original designed strength. The end-use environment is even more uncontrolled and has much wider distributions in most cases. No matter how you specify the end-use conditions, your customers will push those limits. By developing a robust design, the product can better survive these extremes. The difference in costs between a robust, well centered design, and one with design weaknesses is usually very small, if any. Changing the orientation of a component, location, or using a more capable component, is very cost effective in the design phase. It can even be very cost effective after the product has been in production for some time. An example would be the case of one AcceleRel Engineering client, we'll call Company A. Company A had a product with a 5% annualized field return rate. For reliability testing they had an elevated temperature (90 °F) burn-in room. The product was powered, but not monitored, for 24 hours. HALT testing found that the operating limit, also the destruct limit was as low as 15°C above the design specification (35°C). The limit was caused by one failing component (two per unit). By replacing the component with the same type component of higher current capabilities, the operating and destruct limit was move to 90°C. With the new operating margins, a short HASS process, lasting one hour for two units was developed. The product was powered and monitored while applying 10gRMS (200 - 2KHz) of random multi-axis vibration and four rapid thermal transitions of -30°C to +70°C at a rate of 60°C/ minute (measured on the product). The field return rate dropped to 0.5%. To no ones surprise, Company A has made this process company confidential, and has implemented it across most all products in production.

Another example of how easy it can be to increase an operating margin was again with client Company A. The UUT had an operating limit of 60°C, with the small +15 vdc auxiliary power supply inside a RF power supply. The limiting component was found to be a small regulating diode located next to a heat sink, but not touching it. By bending over the component to make contact with the heatsink the operation limit was raised by 30°C to 90°C. Large margin improvements can be made sometimes by just repositioning a component. This would not have been found if the operating limit had not been discovered through step stress. The product had easily passed its operation specifications when originally design.

Starting and continuing a HALT and HASS program is a major commitment of resources for any company, but the ROI in reduced warranty costs, redesign, rework, and lost sales is tremendous. Developing a robust design using HALT, even if screening cannot be implemented, is extremely valuable tool for reaching a rapid design maturity and should be a standard evaluation for all new designs. HASS is a process to precipitate and detect defects and shifts in margins that end up being field failures.

It is important to realize that even though the concept of testing to find limits, basing the testing on actual material capabilities, not specifications, and extending those capabilities to the best possible with current technology is really very simple. HALT will not find every and all defects, but it will find greater than 95% that cause field failures. Convincing designers that improving margins beyond what was originally specified is a difficult task for those who are not experienced in this new approach. Once the benefits are demonstrated, companies readily embrace the HALT and HASS processes. It is taking that first step of finding the limits and following through with changes to improve them. Designers and companies that have already discovered that failures above specifications are relevant to the field reliability, and spend the time and effort to improve the margins, will be the most successful at meeting the reliability expectations of customers for the 21st Century.

Kirk Gray has over 21 years in the electronics manufacturing industry and the last 11 years in the application of HALT and HASS processes. Mr.Gray began his career in electronics at the semiconductor, thin-film processing level and continued to proceed down the electronics assembly path to reliability testing. He discovered the speed and other benefits of HALT and HASS processes for reliability when he started a HALT and HASS team as ESS Process Engineering Manager at StorageTek in 1989. Kirk is Vice Chairman of the Denver Chapter of the IEEE Reliability Society , Chairman of the IEEE/CPMT Technical Committee 7 on Reliability, and Registration Chairman for the annual IEEE/CPMT Workshops on Accelerated Stress Testing (AST) held in the fall each year. If you would like to contact him, please send an email to gray@equipment-reliability.com

Wayne Tustin's vibration and shock teaching schedule is posted at http://www.vibrationandshock.com. Right now he is "studying up" on the vibration and shock measurement and testing issues pertaining to computer hard drives, preparing for a private course. Have you heard about intense noise developmental testing and production screening of printed wiring boards? It's an alternate method (to electrodynamic shakers and to pneumatic repetitive-shock machines) for flexing PWBs during HALT, HASS, ESS, etc. You can send him an email to tustin@equipment-reliability.com if you want to learn more. Or you can phone him at 805/564-1260. He would like to hear from you.

 
Free Newsletter
Company
First Name
Last Name
E-mail


To subscribe to ERI News - your reliability newsletter, just fill in the form above and click on
"Sign up now!". You will receive an e-mail message to confirm your subscription. Just click on the link provided on the message and you'll be all set.

Your privacy is respected and your information will not be sold or shared with others.

 

You can easily and inexpensively advertise in this space. Please visit our ads page for more information.
 


Check our Vibration and Shock Glossary. You will find important words and their definitions. This list evolved from Wayne's 50 years of work experience and it's updated frequently.

 


Are you interested in dynamics (vibration, mechanical shock, noise, acoustics, automotive buzz, squeak, rattle, etc.)? Please click here to visit vibrationandshock.com, the other ERI web site.

 


ERI is seeking expertise and a desire to teach about Life Cycle Environmental Profiling, as described near the front of MIL-STD-810F. Please contact Wayne Tustin (805/564-1260).


Send e-mail to webmaster@equipment-reliability.com with questions or comments about this web site.
Copyright © 2000-2008 Equipment Reliability Institute. All rights reserved.
ERI's Privacy Statement and Terms of Use