Architecture of the Pentium microprocessor

doi:10.1109/40.216745

Journal ArticleDOI

Architecture of the Pentium microprocessor

D. Alpert, +1 more

- 01 May 1993 -

IEEE Micro

- Vol. 13, Iss: 3, pp 649-659

TLDR

The techniques of pipelining, superscalar execution, and branch prediction used in the Pentium CPU, which integrates 3.1 million transistors in 0.8-mu m BiCMOS technology, are described in this article.

Abstract:

The techniques of pipelining, superscalar execution, and branch prediction used in the Pentium CPU, which integrates 3.1 million transistors in 0.8- mu m BiCMOS technology, are described. The technology improvements associated with the three most recent microprocessor generations are outlined. The Pentium's compatibility, performance, organization, and development process are also described. The compiler technology developed with the Pentium microprocessor, which includes machine-independent optimizations common to current high-performance compilers, such as inlining, unrolling, and other loop transformations, is reviewed. >

Citations

PDF

Open Access

More filters

Journal ArticleDOI

Compiler transformations for high-performance computing

David F. Bacon, +2 more

- 01 Dec 1994 -

ACM Computing Surveys

TL;DR: This survey is a comprehensive overview of the important high-level program restructuring techniques for imperative languages, such as C and Fortran, and describes the purpose of each transformation, how to determine if it is legal, and an example of its application.

...read moreread less

Journal ArticleDOI

Larrabee: a many-core x86 architecture for visual computing

Larry D. Seiler, +13 more

TL;DR: This article consists of a collection of slides from the author's conference presentation, some of the topics discussed include: architecture convergence; Larrabee architecture; and graphics pipeline.

...read moreread less

Proceedings ArticleDOI

Optimization of instruction fetch mechanisms for high issue rates

Thomas M. Conte, +3 more

TL;DR: Results show that compiler optimization can significantly enhance performance across all schemes and the collapsing buffer supplemented by compiler techniques remains the best-performing mechanism.

...read moreread less

Proceedings ArticleDOI

Wrong-path instruction prefetching

Jim Pierce, +1 more

TL;DR: wrong-path prefetching performs better than the other prefetch algorithms studied in all of the cache configurations examined while requiring little additional hardware and is applicable to both multi-issue and long L1 miss latency machines.

...read moreread less

Proceedings ArticleDOI

Instruction fetch mechanisms for VLIW architectures with compressed encodings

Thomas M. Conte, +4 more

TL;DR: This report uses the TINKER experimental testbed to examine instruction fetch and instruction cache mechanisms for VLIWs and a new i-fetch mechanism using a silo cache is found to have the best performance.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Proceedings ArticleDOI

An analysis of MIPS and SPARC instruction set utilization on the SPEC benchmarks

Robert F. Cmelik, +3 more

TL;DR: Although MIPS and SPARC each have strengths and weaknesses in their compilers and library routines, the combined effect of compilers or library routines does not give either MIPS or SPARC a clear advantage in these areas.

...read moreread less

Book

386 Sx Microprocessor Programmer's Reference Manual

Corporate Intel Corp.

Journal ArticleDOI

The i486 CPU: executing instructions in one clock cycle

J.H. Crawford

- 01 Jan 1990 -

IEEE Micro

TL;DR: The author discusses the design goals of the i486 development program, which were to ensure binary compatibility with the 386 microprocessor and the 387 math coprocessor, increase performance by two to three times over a 386/387 processor system at the same clock rate, and extend the IBM PC standard architecture of the 386 CPU with features suitable for minicomputers.

...read moreread less

Proceedings ArticleDOI

The Intel386 CPU family-architecture and performance analysis

T. Chadha, +1 more

TL;DR: The authors present a synopsis of the various performance metrics related to the i386 CPU architecture, and shows how an average number like CPI can vary over the run time of the job.

...read moreread less

Architecture of the Pentium microprocessor

Citations

Compiler transformations for high-performance computing

Larrabee: a many-core x86 architecture for visual computing

Optimization of instruction fetch mechanisms for high issue rates

Wrong-path instruction prefetching

Instruction fetch mechanisms for VLIW architectures with compressed encodings

References

An analysis of MIPS and SPARC instruction set utilization on the SPEC benchmarks

386 Sx Microprocessor Programmer's Reference Manual

The i486 CPU: executing instructions in one clock cycle

The Intel386 CPU family-architecture and performance analysis

Related Papers (5)

Computer Architecture: A Quantitative Approach

A study of branch prediction strategies

Branch Prediction Strategies and Branch Target Buffer Design

Two-level adaptive training branch prediction

The superblock: an effective technique for VLIW and superscalar compilation