Intel's Shen: 'microprocessor researchers at a crossroads'








EE Times


AUSTIN, Texas — The trade-off between instructions per cycle and the increasing emphasis on microprocessor clock frequency needs a thorough re-examination, said John Shen, director of Intel Corp.'s microarchitecture lab.

In a keynote speech here Tuesday (Sept. 25) at the International Conference on Computer Design (ICCD), Shen said microprocessor researchers are at a crossroads, moving toward deeper pipelines with higher frequencies, a trend that can reduce instruction-level parallelism and overall processing efficiency.

Since the days of the 50-MHz Intel 486, processor performance has improved by 75 times as frequencies have multiplied by about 50 times. Improvements in process technology, supporting faster clock speeds, have accounted for 13x of that 75x improvement, and improved microarchitectures for another 4x. Instruction-level parallelism has shown only a 1.5x improvement over that time, as companies have emphasized deeper pipelines and higher clock frequencies.
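One way to check how these figures fit together is to treat performance as the product of clock frequency and instructions per cycle. The short Python sketch below uses only the multipliers quoted above. The variable and function names are illustrative, and reading the 13x and 4x factors as jointly explaining the roughly 50x frequency gain is an assumption, not something the speech states outright.

```python
# Multipliers quoted in the article, relative to the 50-MHz Intel 486.
frequency_gain = 50.0   # clock frequency multiplier ("about 50 times")
ilp_gain = 1.5          # instruction-level parallelism multiplier
process_gain = 13.0     # attributed to process technology
microarch_gain = 4.0    # attributed to improved microarchitectures


def performance_gain(freq_gain, ipc_gain):
    """Performance modeled as frequency times instructions per cycle."""
    return freq_gain * ipc_gain


# Frequency gain times ILP gain reproduces the overall 75x figure.
total = performance_gain(frequency_gain, ilp_gain)

# Assumption: process and microarchitecture factors together roughly
# account for the frequency gain (13 * 4 = 52, close to "about 50").
freq_from_factors = process_gain * microarch_gain
```

Under this reading, nearly all of the 75x gain comes from clock frequency, which is the imbalance Shen was pointing to.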

Instruction-level parallelism requires a wider pipeline, which increases the complexity of each stage, rather than the thinner, multiple stages of a deep pipeline. "As you slice the pipeline thinner, you increase the latency and lower the instruction-level parallelism," Shen said, noting that his views were personal and did not represent Intel Corp.'s future product road map. Shen taught at Carnegie Mellon University for 18 years, heading up the microarchitecture research team there before joining Intel Labs about a year ago. Based in Santa Clara, Calif., Shen manages Intel Labs staff located in Santa Clara, Hillsboro, Ore., and in Austin, Texas.

The Pentium 4 processor has struck a good balance, he claimed, with a pipeline depth of around 25, frequencies moving past 2 GHz, and support for instruction-level parallelism as well.

"With the P4, Intel spent a lot of time incorporating instruction-level parallelism. Intel made the right decision: The higher frequency resulted in higher performance," he said.

Performance is usually defined as frequency times the instructions per cycle. "There is a point at which we don't want to pipe that deep," Shen said, adding that researchers are asking how deep a pipeline can go.

One answer, according to a predictive model by Intel processor architect Ed Grochowski, is that microprocessor pipelines may go to 57 stages before performance gains start to fall off. The 57-stage number — a take-off on ketchup maker Heinz' 57 Varieties — is not to be taken as a hard number. "It is 57, plus or minus 20, but the key is that we may not be able to go out past 50 pipeline stages without losing performance," Shen said.
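The article does not describe Grochowski's model, so the following Python sketch is not that model. It is a generic toy trade-off, with every parameter value chosen arbitrarily for illustration: splitting a fixed amount of logic into more stages shortens the clock cycle, but each stage adds latch overhead, and the stall cost per instruction is assumed to grow linearly with depth. The function names and defaults are hypothetical.

```python
import math


def time_per_instruction(depth, logic_delay=4000.0, latch_overhead=40.0,
                         base_cpi=1.0, stall_per_stage=0.04):
    """Average time per instruction for a pipeline with `depth` stages.

    All defaults are made-up illustrative values, not measured data.
    """
    # Cycle time: the logic is split evenly, plus a fixed per-stage overhead.
    cycle_time = logic_delay / depth + latch_overhead
    # Cycles per instruction: assumed to grow linearly with depth (hazards).
    cpi = base_cpi + stall_per_stage * depth
    return cycle_time * cpi


def best_depth(max_depth=200):
    """Brute-force search for the depth that minimizes time per instruction."""
    return min(range(1, max_depth + 1), key=time_per_instruction)


def closed_form_depth(logic_delay=4000.0, latch_overhead=40.0,
                      base_cpi=1.0, stall_per_stage=0.04):
    """Analytic minimum of the same toy model."""
    return math.sqrt(logic_delay * base_cpi / (latch_overhead * stall_per_stage))
```

With these made-up defaults, `best_depth()` lands at 50 stages, and performance falls off on either side of that point. The specific optimum depends entirely on the assumed parameters; the point of the sketch is only that a model of this shape has a finite optimum, which is the kind of result Shen described.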

"There are a number of approaches, and perhaps the best way of saying it is that there are tensions between each of these approaches. For example, there are tensions between hardware complexity and doing more in the compiler. But while it may take several years to develop new hardware, to recompile all of the existing software may take much longer. Between a new microarchitecture and a new compiler, there may be a kind of impedance mismatch" in terms of how long it takes to tune existing applications to a new compiler, Shen said.

Other tensions exist. Some argue that putting several processors on the same die will prove advantageous as several billion transistors become available. Others argue that a single massive core can run multiple threads, but the challenge then is to develop applications that support multi-threading.

Chip-level multiprocessing (CMP) and simultaneous multi-threading (SMT) can be combined to process multiple threads on multiple cores, but then validation time may increase. And since microprocessor teams must hit a certain level of performance within a certain time window, the time to validate the processor is growing more critical.

Also, in an era when power consumption may dictate the course that MPU design teams take, Shen said, "It is not clear whether SMT or CMP is more efficient in terms of power. In the end the solution may come from all different angles."

Many engineers are working to improve memory pre-fetch capabilities and the efficiency of the branch prediction engine. It may be possible to append pre-fetch threads to support what Shen called "speculative pre-computation."

"Forces are pulling us in various directions, and it is not clear what is the obvious path," Shen said.

Microprocessor designers have engaged in a decade-long discussion about the relative merits of pushing frequency or instruction-level parallelism, known as the "speed demon versus Brainiac" debate.

"My point is that it is not so much one against the other. We need both instructions per cycle and frequency, and it is a real delicate balancing act. The question I am raising is, in what new and clever ways can we combine the two?"

As the debate continues, power dissipation may become an overriding concern. Shen said that as processors evolve over the rest of this decade into huge chips incorporating several billion transistors operating at frequencies of 10 to 30 GHz, power consumption will increase exponentially, putting greater emphasis on processor efficiency rather than brute frequency increases.










