Breaking News
News & Analysis

Intel has late metal fix for design error

1/31/2011 11:12 PM EST
15 comments
NO RATINGS
Page 1 / 2 Next >
More Related Links
View Comments: Oldest First | Newest First | Threaded View
Page 1 / 2   >   >>
goafrit
User Rank
Manager
re: Intel has late metal fix for design error
goafrit   2/1/2011 3:57:44 AM
NO RATINGS
This is a very unfortunate event and many must have lost their jobs. It is indeed very embarrassing to say the least.

MAVERICK230
User Rank
Rookie
re: Intel has late metal fix for design error
MAVERICK230   2/1/2011 9:41:52 AM
NO RATINGS
Intel admitting that it is a "design oversight" is really unfortunate. This is the risk involved in having a basic design issue as it seeps into other blocks making its impact catastrophic. Surely many heads would have rolled down in the aftermath of this. But certainly it is " Better Late than Never".

DrizztVD
User Rank
Rookie
re: Intel has late metal fix for design error
DrizztVD   2/1/2011 11:09:45 AM
NO RATINGS
You'd have to be a very messed up manager to want to fire your engineers for something they had no control over. This is not a case of negligence, it is a case of straightforward trail-and-error learning. And it's not embarrassing either, these things happen all the time. Kudos to Intel for doing the right thing and withholding chips until they have reliable chips to sell. Lesser companies would have tried to cover it up.

design_for_life
User Rank
Rookie
re: Intel has late metal fix for design error
design_for_life   2/1/2011 11:27:20 AM
NO RATINGS
I fully agree with DrizztVD. What kind of world are we living in? Mistakes are only human and all of us make mistakes whether we admit it or not. And if a company fires its employees for making a mistake, its only creating fear and in future nobody will be eager to do new things. Its not unfortunate to have errors, its unfortunate not to admit it.

krisi
User Rank
CEO
re: Intel has late metal fix for design error
krisi   2/1/2011 3:52:42 PM
NO RATINGS
I have been designing silicon chips for longer than I wish to admit. Making metal revision to a mask set is a very standard process. Usually the errors are detected in testing in the lab. Thinking that you have a final product and later finding from a customer that it still contains a bug happens frequently too. You would expect it happens less at large company as Intel due to an army of design verifiers they have nevertheless this clearly happens from time to time (there was a big Intel recall few years back). The truth is that a sheer complexity of microprocessor or number of permutations required in testing is so large that you actually never know for sure that silicon is working all the time!...dr Kris

yalanand
User Rank
Rookie
re: Intel has late metal fix for design error
yalanand   2/1/2011 5:20:04 PM
NO RATINGS
The previous instance when Intel had this kind of bug was in 1994 (the infamous FPU bug). I guess intel has learnt lesson and didnt wanted to take chance this time around. Hence they are taking necessary steps rather than ignoring the bug.

jnissen
User Rank
Manager
re: Intel has late metal fix for design error
jnissen   2/1/2011 5:48:03 PM
NO RATINGS
Reliability issues are tough to catch unless there is significant design reviews and all. It can be easy for large teams to assume someone else has checked this or that. Can sneak up and infect the best of teams. Electromigration and/or NBTI are my best guess what they are dealing with but we may not know the details for a while. Those are tricky and many of the tool will not adequately predict the outcome.

Tom Mariner
User Rank
Rookie
re: Intel has late metal fix for design error
Tom Mariner   2/1/2011 7:47:24 PM
NO RATINGS
There seems to be a grand tradition in the chip design world of fessing up to your boo boos. Possibly because in the future when you say it is not in your section of the IC, you will be believed. Once found a problem in earlier layers of a TI DSP chip -- it seems as though noone had written software that used the entire chip at once in the three years it had been released. (If I don't give my company / customer the best the hardware will do, it leaves an opening for a competitor to them, and I don't let my customers lose!) They could have pointed the finger at me for a firmware glitch, but instead thanked me in front of my customer and put the fix into a wafer partially done to get the revised parts out in record time. Class tells -- and in both the Intel and TI cases, it tells me that if they claim it ain't the silicon, I'm looking elsewhere.

SiFarmer (Ret.)
User Rank
Rookie
re: Intel has late metal fix for design error
SiFarmer (Ret.)   2/2/2011 6:58:16 PM
NO RATINGS
Thanks Tom Mariner, "If they claim it ain't the silicon, I'm looking elsewhere". A new classic quote! I assume the writer means that if the supplier doesn't admit there's a problem with the silicon, the customer should look elsewhere for a better, more honest, chip supplier. Since it is only degradation, may not be e-migration. Anyone remember the "Fast Cadillac" reliability problem with a small percentage of Delco's first cruise control chips? Cause was a mask defect on a contact print mask.

ash9
User Rank
Rookie
re: Intel has late metal fix for design error
ash9   2/3/2011 1:09:23 AM
NO RATINGS
Both of these statements cant be true!!! Intel mentioned that after it had built over 100,000 chipsets it started to get some complaints from its customers about failures. Intel expects that over 3 years of use it would see a failure rate of approximately 5 - 15% depending on usage model. Remember this problem isn’t a functional issue but rather one of those nasty statistical issues, so by nature it should take time to show up in large numbers (at the same time there should still be some very isolated incidents of failure early on). asH

Page 1 / 2   >   >>
Top Comments of the Week
August Cartoon Caption Winner!
August Cartoon Caption Winner!
"All the King's horses and all the KIng's men gave up on Humpty, so they handed the problem off to Engineering."
5 comments
Like Us on Facebook

Datasheets.com Parts Search

185 million searchable parts
(please enter a part number or hit search to begin)
EE Times on Twitter
EE Times Twitter Feed
Radio
LATEST ARCHIVED BROADCAST
David Patterson, known for his pioneering research that led to RAID, clusters and more, is part of a team at UC Berkeley that recently made its RISC-V processor architecture an open source hardware offering. We talk with Patterson and one of his colleagues behind the effort about the opportunities they see, what new kinds of designs they hope to enable and what it means for today’s commercial processor giants such as Intel, ARM and Imagination Technologies.
Flash Poll