Lehrbehelf für Prozessregelung und Rechnerverbund Mikroprozessoren und Mikrocomputer, Teil 2



Download 226.48 Kb.
Page4/5
Date19.10.2016
Size226.48 Kb.
#4081
1   2   3   4   5

Digital Signal Processors (DSPs)


DSPs unterstützen high-performance, sich wiederholende, numerisch aufwendige Applikationen. DSPs eignen sich für Anwendungen mit vielen Kanälen, z.B. gleichzeitig abzuwickelnde Telefongespräche und hohen Datendurchsatzraten.
DSPs sind leistungsstarke Mikrocomputer mit leistungsfähigen Echtzeitbetriebssystemen und SW-Modulen für spezifische Applikationen wie Handys, GSM Sendestationen, PDAs, Kameras, KFZ Anwendungen, wireless switches, gateways, telephone switches, DSL Modems, etc.
Eigenschaften: optimiert für Parallel Instruktionen, komplexe arithmetische Operationen, Realtime.



Applications specific SW-Module: fast time-to-market, reduced development risks, reduced development costs.

Beispiele: TMS 320 von TI, DSP56XX von Motorola, ADSP 21xxx von Analog Devices, TriCore und Carmel von Infineon, ...


Beispiel: TMS 320C6416 (Texas Instruments)


TMS 320C6416 Highlights:

  • 400-, 500-, 600-MHz Clock Rate

  • up to Eight 32-Bit Instructions/Cycle

  • up to Twenty-Eight Operations/Cycle

  • 3200, 4000, 4800 MIPS

  • Six ALUs (32-/40-Bit), Each Supports Single 32-Bit, Dual 16-Bit, or Quad 8-Bit Arithmetic per Clock Cycle

  • Two Multipliers Support Four 16 x 16-Bit Multiplies (32-Bit Results) per Clock Cycle or Eight 8 x 8-Bit Multiplies

  • Enhanced Direct-Memory-Access (EDMA) Controller (64 Independent Channels)

  • Host-Port Interface (HPI)

  • Three Multichannel Buffered Serial Ports (McBSPs)

    • Direct Interface to T1/E1, MVIP, SCSA Framers

    • Up to 256 Channels Each

    • ST-Bus-Switching-, AC97-Compatible

    • Serial Peripheral Interface (SPI)



PROMs, PLDs, PLAs, GALs, ...


300 bis 2M Logik Gatter, die durch Programmierung verbunden werden können.

Beispiel: Grundstruktur eines PALs






ASICs (Application Specific Integrated Circuits)


ASICs= many chips on a single package.

Vorteil: kleiner

Anwendungen: Videoboards, Modems


Auswahl von Prozessoren


  • Datenbus: 4, 8, 16, 32, 64, 128, 256, ... Bit

  • Adressbus: 16, 32, 64, … Bit

  • Rechenleistung, Datendurchsatz, ...

  • CPU Leistung: MIPS, …

  • Anwendung: µC, µP, DSP, ...

  • Speicher: interner/ externer RAM, ROM, EPROM, EEPROM, Flash EPROM, HDD, FDD, CDROM, Cache, …

  • I/Os: interner/ externe serielle, parallele, I²C, CAN, ADU, DAU, Netzwerk, … Interfaces

  • I/Os: Displays, LCD Panels, Tastaturen, ...

  • I/Os: Businterfaces (AGP, PCI, ISA, ...)

  • Entwicklungstools: Compiler, Emulator, Debugger, Assembler, …

  • Verfügbare SW-Module: OS, RTOS, Industrie spezifische Applikationen, ...

  • Energieverbrauch, Größe, Stückzahl, Kosten, Verfügbarkeit, Kompatibilität, …

  • Befehlsstruktur: RISC, CISC

  • Anwendung: KFZ, Office, Haushaltsgerät, Steuerung, ...



Moderne Mikroprozessoren

INTEL Pentium ® 4



12 K

8 K

2 f= 6,12 GHz

Abbildung: Blockschaltbild Pentium 4


Abbildung: Blockschaltbild Pentium 4 mit 1,5 GHz; mit 3,06 GHz: 98 GB/s L1-L2 speed



Abbildung: Blockschaltbild Pentium 4 mit externer Beschaltung


(Pentium 4, Host Bus bis 533 MHz in 2002, 800 MHz in 2003)
Daten Durchsatz Pentium 4 – Northbridge bei 400/100 MHz Busgeschwindigkeit: 3,2 GB/s

(synchrone Datenabfrage, quad pumped data bus: 4 * 100 * 64/8 = 3,2 GB/s).


Wichtige Eigenschaften:

2002: bis 3,06 GHz.

The 3,06 GHz operating speed makes the Pentium 4 equal to the 64 bit Itanium 2.

Intel® NetBurst™ microarchitecture, 0,13-micron technology, 800/ 533/ 400 MHz system bus, Hyper-pipelined technology, Rapid execution engine, Execution trace cache, Advanced transfer cache, Advanced dynamic execution, Enhanced floating point/multimedia, Streaming SIMD extensions 2 instructions.

Cache:
512-KB L2 Advanced Transfer Cache available with speeds up to 3.06 GHz.


Integrated caches include 12-K micro-op trace cache and 8-KB L1 data cache.

HyperThreading makes each processor appear to software as two virtual processors, speeding up some applications. HyperThreading also slows down some software.
IT Week:
Intel said HyperThreading can boost performance by up to 30 percent. But tests prove the performance of some applications drops by half when HyperThreading is enabled, partly because each virtual processor has access to only half the chip's resources, such as cache memory and bandwidth-to-memory. Operating systems and applications must support HyperThreading to benefit from it. Linux and Windows XP are compatible with the feature, but Windows 95, 98 and ME are not.
According to Intel the Pentium 4 is best suited for:

Digital music, 3D gaming, digital imaging and video, DVD authoring and MPEG4 video, internet audio, streaming video, image processing, video content creation, speech, 3D, CAD, games, multimedia, and multi-tasking user environments.



For desktop PCs as well as for entry-level workstations.
Intel® NetBurst™ Microarchitecture (= Überbegriff)
Intel NetBurst microarchitecture delivers a number of innovative features including Hyper-Threading Technology, hyper-pipelined technology, 800 MHz, 533 MHz or 400 MHz system bus, Execution Trace Cache, and Rapid Execution Engine, as well as a number of enhanced features such as Advanced Transfer Cache, Advanced Dynamic Execution, enhanced floating-point and multimedia unit, and Streaming SIMD Extensions 2 (SSE2). Many of these innovations and advances were made possible with improvements in processor technology, process technology, and circuit design and could not previously be implemented in high-volume, manufacturable solutions. The features and resulting benefits of the microarchitecture are defined below.
Hyper-Threading Technology
Hyper-Threading Technology (HT Technology) is ground breaking technology from Intel that changes the landscape of processor design by going beyond GHz to improve processor performance. It allows software programs to "see" two processors and work more efficiently. This new technology enables the processor to execute two series, or threads, of instructions at the same time, thereby improving performance and system responsiveness. The Pentium 4 processor supporting Hyper-Threading Technology is specially designed to deliver immediate increases on performance and system responsiveness with existing applications in multitasking environments (i.e. where two or more things are running at the same time) and with many stand-alone applications today. Furthermore, the Pentium 4 processor supporting HT Technology provides performance headroom for the future.

Hyper-Pipelined Technology
The hyper-pipelined technology of the Intel NetBurst microarchitecture doubles the pipeline depth compared to the P6 microarchitecture used on today's Intel® Pentium® III processors. One of the key pipelines, the branch prediction / recovery pipeline, is implemented in 20 stages in the Intel NetBurst microarchitecture, compared to 10 stages in the P6 microarchitecture. This technology significantly increases the performance, frequency, and scalability of the processor.
800 MHz or 533-MHz or 400-MHz System Bus
The Pentium 4 processor's 800 MHz system bus supports Intel's highest performance desktop processor by delivering 6.4 GB of data-per-second into and out of the processor. This is accomplished through a physical signalling scheme of quad pumping the data transfers over a 200 MHz clocked system bus and a buffering scheme allowing for sustained 800 MHz data transfers. The Pentium 4 processor's 400-MHz system bus supports Intel's performance desktop processor by delivering 3.2 GB of data-per-second into and out of the processor. This is accomplished through a physical signalling scheme of quad pumping the data transfers over a 100-MHz clocked system bus and a buffering scheme allowing for sustained 400-MHz data transfers. This compares to 1.06 GB/s delivered on the Pentium III processor's 133-MHz system bus.
Level 1 Execution Trace Cache
In addition to the 8-KB data cache, the Pentium 4 processor includes an Execution Trace Cache that stores up to 12-K decoded micro-ops in the order of program execution. This increases performance by removing the decoder from the main execution loop and makes more efficient usage of the cache storage space since instructions that are branched around are not stored. The result is a means to deliver a high volume of instructions to the processor's execution units and a reduction in the overall time required to recover from branches that have been mis-predicted.
Rapid Execution Engine
Two Arithmetic Logic Units (ALUs) on the Pentium 4 processor are clocked at twice the core processor frequency. This allows basic integer instructions such as Add, Subtract, Logical AND, Logical OR, etc. to execute in one-half a clock cycle. For example, the Rapid Execution Engine on a 3.06 GHz Pentium 4 processor runs at 6.12 GHz.
512-KB Level 2 Advanced Transfer Cache
512-KB L2 Advanced Transfer Cache (ATC) is available with speeds 1.80A, 2A, 2.20, 2.26, 2.40, 2.50, 2.53, 2.60, 2.66, 2.80 and 3.06 GHz. The Level 2 ATC delivers a much higher data throughput channel between the Level 2 cache and the processor core. The Advanced Transfer Cache consists of a 256-bit (32-byte) interface that transfers data on each core clock. As a result, the Pentium 4 processor at 3.06 GHz can deliver a data transfer rate of 98 GB/s. This compares to a transfer rate of 16 GB/s on the Pentium III processor at 1 GHz. Features of the ATC include:

  • Non-Blocking, full speed, on-die level 2 cache

  • 8-way set associability

  • 256-bit data bus to the level 2 cache

  • Data clocked into and out of the cache every clock cycle


Advanced Dynamic Execution
The Advance Dynamic Execution engine is a very deep, out-of-order speculative execution engine that keeps the execution units executing instructions. The Pentium 4 processor can also view 126 instructions in flight and handle up to 48 loads and 24 stores in the pipeline. It also includes an enhanced branch prediction algorithm that has the net effect of reducing the number of branch mis-predictions by about 33% over the P6 generation processor's branch prediction capability. It does this by implementing a 4-KB branch target buffer that stores more detail on the history of past branches, as well as by implementing a more advanced branch prediction algorithm.
Enhanced Floating-Point and Multimedia Unit
The Pentium 4 processor expands the floating-point registers to a full 128-bit and adds an additional register for data movement which improves performance on both floating-point and multimedia applications.
Streaming SIMD Extensions 2 (SSE2) Instructions
With the introduction of SSE2, the Intel NetBurst microarchitecture now extends the SIMD capabilities that MMX technology and SSE technology delivered by adding 144 instructions. These instructions include 128-bit SIMD integer arithmetic and 128-bit SIMD double-precision floating-point operations. These instructions reduce the overall number of instructions required to execute a particular program task and as a result can contribute to an overall performance increase. They accelerate a broad range of applications, including video, speech, and image, photo processing, encryption, financial, engineering and scientific applications.
Data Prefetch Logic
Functionality that anticipates the data needed by an application and pre-loads it into the Advanced Transfer Cache, further increasing processor and application performance.

HyperThreading:

Der neue Pentium 4 mit 3,06 GHz bietet zwei logische CPU-Kerne.

Intel hat den Prozessor so ausgestattet, dass der eine Teil immer weiß, was der andere gerade tut und gegebenenfalls seine eigenen Aktionen auf die des anderen abstimmt.

Der HyperThreading Pentium 4 hat größere Caches (zusätzlicher Fill-In-Buffer für den Trace-Cache), sowie eine zweite Register-Alias-Table integriert.

Multithreaded - Betriebssysteme: Windows XP und Linux.



Multithreaded Applikationen: Audio-Komprimierung, Video-Encoden mit MPEG Encoder 3.5 von LSX, Cinema 4D


Processor Core Speeds Up to 3.06 GH

Maximum performance for a wide range of emerging Internet, PC and workstation applications

Hyper-Threading Technology

Improves performance and system responsiveness in today's multitasking environments by enabling the processor to execute multiple instruction threads in parallel.

New 0.13u process technology

Enables higher frequency and lower power


Intel® NetBurst™ Micro-architecture

Designed to deliver highest performance in video, graphics, multimedia and other sophisticated applications

Up to 800 MHz System Bus

High bandwidth between the processor and the rest of the system improves throughput and performance

512KB L2 Cache (for 2A GHz and Faster) or 256KB Cache (for 2 GHz and Slower)

Enhances performance by providing fast access to heavily used data and instructions



Hyper-Pipelined Technology

Extended pipeline stages increase overall throughput


Streaming SIMD Extensions 2

144 new instructions accelerate operation across a broad range of demanding applications

Rapid Execution Engine

Arithmetic Logic Units run at twice the core frequency, speeding execution in this performance critical area

128-Bit Floating Point Port


Floating Point performance boost provides enhanced 3D visualization, life-like gaming and scientific calculations

SIMD 128-bit Integer

Accelerates video, speech, encryption and imaging/photo processing

Execution Trace Cache

Greatly improves instruction cache efficiency, maximizing performance on frequently used sections of software code

Advanced Dynamic Execution

Improved branch prediction enhances performance for all 32-bit applications by optimising instruction sequences

Built-in Self Test (BIST)



Provides single stuck-at fault coverage of the microcode and large logic arrays, plus testing of the instruction cache, data cache, Translation Lookaside Buffers, and ROMs

IEEE 1149.1 Standard Test Access Port and Boundary Scan

Enables testing of the Pentium® 4 processor and system connections through a standard interface


The Pentium® 4 processor is designed to enrich users’ digital experience by providing the high performance necessary for gaming, digital music, digital imaging and video, and personal productivity.


The Pentium® III processor continues to deliver adequate performance for older applications. However, it lacks the advanced technologies to efficiently perform on today’s rich media and gaming applications.
The Pentium® II processor ushered the PC into the multimedia era with the capability to handle simple graphics, CD-audio and 2D games. The performance of this processor, however, is inadequate for today’s advanced applications


Download 226.48 Kb.

Share with your friends:
1   2   3   4   5




The database is protected by copyright ©ininet.org 2024
send message

    Main page