Intel® Intel® Software Development Products for Intel® Platforms and Technologies
image
Intel® C++ Compiler 10.1, Professional and Standard Editions for Linux*
image image
Overview

Features in Depth Print Print
Features

Compatability
New in This Release System Requirements
image image

image Overview image
Intel® C++ Compiler 10.0 for Linux*

Intel® C++ Compiler Professional Edition offers the best support for creating multi-threaded applications. Only the Professional Edition offers the breadth of advanced optimization, multi-threading, and processor support that includes automatic processor dispatch, vectorization, auto-parallelization, OpenMP*, data prefetching, and loop unrolling, along with highly optimized C++ templates for parallelism, math processing, and multimedia libraries.

The Professional Edition combines a high performance compiler, which now includes support for Debian*, Ubuntu*, and Fedora* 7, with Intel® Threading Building Blocks (Intel® TBB), Intel® Integrated Performance Primitives (Intel® IPP), and Intel® Math Kernel Library (Intel® MKL). While these libraries are available separately, the Professional Edition creates a strong foundation for building robust, high performance parallel code at significant price savings.

New – Intel® Compiler Suite Professional Edition for Linux. This suite includes all the features of the Intel C++ Compiler Professional Edition, but also includes the Intel Fortran Compiler for Linux for a more complete solution at significant price savings.

The Standard Edition compiler has the same performance and features as the Professional Edition compiler, but does not provide the multi-threaded libraries.

Cluster OpenMP* for Intel C++ Compiler for Linux is also available to provide a simple means of extending OpenMP parallelism to 64-bit Intel® architecture-based clusters.

Product Brief [PDF 656KB]

back to top

image Features image
Performance

Consider the Intel® C++ Compiler Professional Edition to maximize performance. The built-in optimization technologies and multi-threading support help create code that runs best on the latest multi-core processors.



Advanced Optimization Features
Software compiled using the Intel® C++ Compiler for Linux benefits from advanced optimization features, a few of which are explained briefly here, with links to more complete descriptions:
Multi-Threaded Application Support, including OpenMP* and auto-parallelization for simple and efficient software threading.
Auto-vectorization parallelizes code to utilize the Streaming SIMD Extensions (SSE) instruction set architectures (SSE, SSE2, SSE3, SSSE3, and SSE4) of our latest processors.
High-Performance Parallel Optimizer (HPO) restructures and optimizes loops to ensure that auto-vectorization, OpenMP, or auto-parallelization best utilizes the processor’s capabilities for cache and memory accesses, SIMD instruction sets, and for multiple cores. This revolutionary capability, new in Version 10, combines vectorization, parallelization and loop transformations into a single pass which is faster, more effective and more reliable than prior discrete phases.
Interprocedural Optimization (IPO)dramatically improves performance of small- or medium-sized functions that are used frequently, especially programs that contain calls within loops. The analysis capabilities of this optimizer can also give feedback on vulnerabilities and coding errors, such as uninitialized variables or OpenMP API issues, which cannot be detected as well by compilers which rely strictly on analysis by a compiler front-end.
Profile-guided Optimization (PGO) improves application performance by reducing instruction-cache thrashing, reorganizing code layout, shrinking code size, and reducing branch mispredictions.
Optimized Code Debugging with the Intel® Debugger improves the efficiency of the debugging process on code that has been optimized for Intel® architecture.
Eclipse* IDE Integration
Eclipse* integration, in addition to command line, is available for Linux on Intel® Itanium® processors.
The Intel® C++ Compiler for Linux ships with a copy of the powerful and popular Eclipse open-source IDE.
The compiler can be invoked from within Eclipse, as well as by using the command-line interface.
Eclipse integration augments the use of the Intel C++ Compiler for Linux with other tools you may already use, such as make, Emacs, and gdb.
back to top

image New in This Release image
The Intel C++ Compiler for Linux builds on a winning foundation. Position yourself to create next-generation software for next-generation hardware. The following features are new since Version 9 of the compiler.

What's new Benefit to you

Support for additional Linux distributions including Debian*, Ubuntu* and Fedora* 7

Broaden target market by supporting additional Linux distributions.

Improved Performance and Threading

New Parallel/Loop Optimizer (HPO)
Improved optimization in C++
Exception Handling and Class Hierarchy analysis

Better application performance for computationally intensive applications such as graphics/digital media, financial modeling, and high-performance computing for threaded and non-threaded applications. Our new High Performance Parallel Optimizer, HPO, offers an improved ability to analyze, optimize, and parallelize more loop nests.

We’ve also improved our ability to optimize in the presence of C++ exception handling, and analyzing and optimizing C++ class hierarchies.

Security Checking and Diagnostics

GNU Mudflap
Static Verifier for buffer overflow
OpenMP* API verification
Ability to create code that is less susceptible to security vulnerabilities, such as buffer overflow. The diagnostics are very helpful for novice and expert users for catching common coding errors, from unitialized variables to mismatched dummy and actual arguments to OpenMP API coding issues.
Optimization Reports More detailed optimization diagnostics for users who want to use our advanced optimizations to help the compiler do a better job of tuning their applications.  The new VTune™ Analyzer 9.0 can filter optimization reports to help guide optimization efforts.
Code generation and optimization support for future Intel processors implementing the SSE4 instructions Take advantage of Streaming SIMD Extensions 4 (SSE4) for delivering expanded capabilities, enhanced performance, and greater energy efficiency for many applications.
Options to enable more advanced optimizations for loop unrolling, streaming stores and pointer aliasing Improved application performance.
Option to select alternate algorithms for malloc Increased flexibility when allocating memory.

Support for the Latest Multi-Core Processors
The Intel C++ Compiler provides optimization support for the very latest multi-core processors, including:

Intel® Core™2 Duo processor
Intel® Core™2 Quad processor
Quad-Core Intel® Xeon® processor 5300 series
Dual-Core Intel® Xeon® processor 3000 series
Dual-Core Intel® Xeon® processor 5000 series
Dual-Core Intel® Xeon® processor 7000 series
Dual-Core Intel® Itanium 2 processor
Intel® compilers future-proof your investment with assurance of world-class support for each successive generation of processors. That's a key advantage in a world where new hardware platforms come to market with awesome speed.

Support for auto-parallelization and OpenMP enable you to create optimized, multithreaded applications that take full advantage of multi-core processing features to deliver outstanding performance.
Professional Edition Includes not only the advanced capabilities of the compiler, but also Intel® Threading Building Blocks, Intel® Integrated Performance Primitives, and Intel® Math Kernel Library with highly optimized functions for threading, math processing, and multimedia.
back to top

image Advanced Optimization Features in Depth image
This section gives detailed descriptions of the compiler’s advanced optimization features.
Multi-Threaded Application Support
OpenMP and auto-parallelization help convert serial applications into parallel applications, allowing you to take full advantage of multi-core technology like the Intel® Core™ Duo processor and Dual-Core Intel® Itanium® 2 processor, as well as symmetric multi-processing systems:
OpenMP* is the industry standard for portable multithreaded application development. It is effective at fine-grain (loop-level) and large-grain (function-level) threading.

OpenMP directives are an easy and powerful way to convert serial applications into parallel applications, enabling potentially big performance gains from parallel execution on multi-core and symmetric multiprocessor systems.
Auto Parallelization improves application performance on multiprocessor systems by means of automatic threading of loops. This option detects parallel loops capable of being executed safely in parallel and automatically generates multithreaded code.

Automatic parallelization relieves the user from having to deal with the low-level details of iteration partitioning, data sharing, thread scheduling, and synchronizations. It also provides the performance benefits available from multiprocessor systems and systems that support Hyper-Threading Technology (HT Technology).
For more information on multi-threaded application support, visit Intel's Threading Developer Center.
back to top

High Performance, Parallel Optimizer (HPO)

This revolutionary capability, new in Version 10, combines automatic vectorization, automatic parallelization and loop transformations into a single pass which is faster, more effective and more reliable than prior discrete phases.

HPO optimizes and restructures program loops to ensure that auto-parallelization, OpenMP, and auto-vectorization occur smoothly in conjunction with each other. HPO’s optimization technology utilizes a unique cost-benefit analysis to make the right optimization decisions for the given program and loop structure. It will perform many transformations such as loop unrolling, peeling, interchange, splitting, etc., as well as other optimizations to ensure the processor’s cache architecture, SIMD instruction set, and multiple cores are well utilized.

back to top

Automatic Vectorizer

Vectorization automatically parallelizes code to maximize underlying processor capabilities. This advanced optimization analyzes loops and determines when it is safe and effective to execute several iterations of the loop in parallel by utilizing MMX™, SSE, SSE2, SSE3, SSSE3, and SSE4 instructions. Figure 1 is a graphical representation of a vectorized loop that shows four iterations computed with one SSE2 operation.

Figure 3. The Vectorizer in action
Figure 1. The Vectorizer in action

Use vectorization to optimize your application code and take advantage of these new extensions when running on Intel® processors. Features include support for advanced, dynamic data alignment strategies, including loop peeling to generate aligned loads and loop unrolling to match the prefetch of a full cache line.

back to top

Interprocedural Optimization (IPO)
Interprocedural optimization (IPO) can dramatically improve application performance in programs that contain many small- or medium-sized functions that are frequently used, especially for programs that contain calls within loops. This set of techniques, which can be enabled for automatic operation in the Intel® compilers, uses multiple files or whole programs to detect and perform optimizations, rather than focusing within individual functions.

Figure 1. The interprocedural optimization process
Figure 2. The interprocedural optimization process

The IPO process, shown in Figure 2, first requires that source files are compiled with the IPO option, creating object (.o) files that contain the intermediate language (IL) used by the compiler. Upon linking, the compiler combines all of the IL information and analyzes it for optimization opportunities. Typical optimizations made as part of the IPO process include procedure inlining and re-ordering, eliminating dead (unreachable) code, and constant propagation, or the substitution of known values for constants. IPO enables more aggressive optimization than what is available at the intra-procedural level, since the added context of multiple procedures makes those more-aggressive optimizations safe.

The analysis capabilities of IPO can also give feedback on vulnerabilities and coding errors, such as uninitialized variables, which cannot be detected as well by compilers which rely strictly on analysis by a compiler front-end.
back to top

Profile-Guided Optimization (PGO)
The Profile-guided optimization (PGO) compilation process enables the Intel C++ Compiler to take better advantage of the processor microarchitecture, more effectively use instruction paging and cache memory, and make better branch predictions. It improves application performance by reorganizing code layout to reduce instruction-cache thrashing, shrinking code size, and reducing branch mispredictions.

PGO is a three-stage process, as shown in Figure 3. Those steps include 1) a compile of the application with instrumentation added, 2) a profile-generation phase, where the application is executed and monitored, and 3) a recompile where the data collected during the first run aids optimization. A description of several code size influencing profile-guided optimizations follows:

Basic block and function ordering — Place frequently-executed blocks and functions together to take advantage of instruction-cache locality.
Aid inlining decisions — Inline frequently-executed functions so the increase in code size is paid in areas of highest performance impact.
Aid vectorization decisions — Vectorize high trip count and frequently-executed loops so the increase in code size is mitigated by the increase in performance.
Figure 2. Profile-guided Optimization
Figure 3. Profile-guided optimization
back to top

Optimized Code Debugging with the Intel® Debugger
The Intel® Debugger enables optimized code debugging (i.e., debugging code that has been significantly transformed for optimal execution on a specific hardware architecture). The Intel compilers produce standards-compliant debug information for optimized code debugging that is available to all debuggers that support Intel compilers. The Intel Debugger supports multi-core architectures by enabling debugging of multithreaded applications, providing the following related capabilities:
An all-stop/all-go execution model (i.e., all threads are stopped when one is stopped, and all threads are resumed when one is resumed).
List all created threads.
Switch focus between threads.
Examine detailed thread state.
Set breakpoints (including all stop, trace, and watch variations) and display a back-trace of the stack for all threads or for a subset of threads.
The built-in GUI provides a Thread panel (on the Current Source pane) that activates when a thread is created, and that allows an operator to select thread focus and display related details.
The recently enhanced GNU Project Debugger (GDB debugger) can also be used for parallel applications. For additional information, please refer to the Intel Debugger Technical White Paper (PDF 210KB).
back to top

image Compatibility and Flexibility image
Standards Compliance and Broad Compatibility

The Intel C++ Compiler for Linux is substantially standards compliant. It also provides an upgraded Intel C++ compiler IDE integration that integrates into Eclipse 3.1 and CDT 3.0 on Linux and supports Itanium 2 processors, including Dual-Core Intel® Itanium® 2 processors.

back to top

Winning Performance across Application Domains
The Intel C++ Compiler for Linux delivers exceptional performance, usability, and business advantages to a wide variety of software markets.

Next-generation data-intensive application developers benefit Next-generation data-intensive application developers benefit from dramatic performance optimizations using the Intel compilers to decrease latency and processing times, while also allowing software architects to add additional features without unacceptable impacts to performance.
Digital home, gaming, and entertainment applications Digital home, gaming, and entertainment applications are well served by the Intel C++ Compiler, as parallel processing on multi-core platforms excels at handling downloads, security, and other tasks in the background, without impacting the user experience.
Mobilized software Mobilized software benefits tremendously from the ability of mobile multi-core platforms such as those based on the Intel® Core™ Duo processor to increase performance while also protecting battery life with low power consumption.
back to top

image System Requirements image

This section provides system requirements to develop applications for three different hardware platforms, which are described below.

Processor Terminology
Intel compilers support three platforms: general combinations of processor and operating system type. This section explains the terms that Intel uses to describe the platforms in its documentation, installation procedures, and support site.

IA-32 architecture - IA-32 (Intel Architecture, 32-bit) architecture refers to systems based on 32-bit processors supporting at least the Pentium® II instruction set, (for example, Intel® Core™ processor or Intel® Xeon® processor), or processors from other manufacturers supporting the same instruction set, running a 32-bit operating system ("Linux x86").

Intel® 64 architecture - Intel® 64 architecture (formerly Intel® EM64T) refers to systems based on IA-32 architecture-based processors which have 64-bit architectural extensions, (for example, Intel® Core™2 processor or Intel® Xeon® processor), running a 64-bit operating system ("Linux x86_64"). If the system is running a 32-bit version of the Linux operating system, then IA-32 architecture applies instead. Systems based on the AMD Athlon64* and Opteron* processors running a 64-bit operating system are also supported by Intel compilers for Intel® 64 architecture-based applications.

IA-64 architecture - Refers to systems based on the Intel® Itanium® 2 processor running a 64-bit operating system.

Native and Cross-Platform Development
The term "native" refers to building an application that will run on the same platform that it was built on, for example, building on IA-32 architecture to run on IA-32 architecture. The term "cross-platform" or "cross-compilation" refers to building an application on a platform type different from the one on which it will be run, for example, building on IA-32 architecture to run on IA-64 architecture. Not all combinations of cross-platform development are supported, and some combinations may require installation of optional tools and libraries.

The following list describes the supported combinations of compilation host (system on which you build the application) and application target (system on which the application runs).

Note: Development for a target different from the host may require optional library components to be installed from your Linux Distribution.

Note: Cluster OpenMP* for Intel® Compilers for Linux* is a separately licensed feature and has different system requirements from that of the compilers. Please refer to the product website for further details.


Requirements to develop IA-32 architecture-based applications
Component Minimum Recommended
Processor

A system based on an IA-32 architecture-based processor (minimum 450 MHz Intel Pentium® II processor or greater), Intel® 64 architecture-based processor, or a system based on an AMD* Athlon* or AMD Opteron* processor

Intel Core™2 processor
Pentium® 4 processor
RAM
512 MB
1 GB
Disk Space
100 MB of disk space, plus an additional 200 MB during installation for the download and temporary files
NA
Operating Systems

One of the following Linux distributions (this is the list of distributions tested by Intel; other distributions may or may not work and are not recommended - please contact Intel® Premier Support if you have questions):

Asianux* 3.0
Debian* 4.0
Fedora* 7
Red Hat Enterprise Linux* 3, 4, 5
SUSE LINUX Enterprise Server* 9, 10
TurboLinux* 11
Ubuntu* 7.0.4
NA
Other Software

Linux Developer tools component installed, including gcc 3.2.3, 3.3, 3.4, 4.1, or 4.11, g++ and related tools.

Linux component compat-libstdc++ providing libstdc++.so.5
NA
back to top

Requirements to develop applications for processors that support Intel® 64 architecture or for AMD* Opteron* Processors
Component Minimum Recommended
Processor
A system based on an Intel® 64 architecture-based processor or an AMD Opteron processor.

Intel® Core™2 processor
Intel® Xeon® processor

RAM
512 MB
1 GB
Disk Space
300 MB free hard disk space, plus an additional 300 MB during installation for download and temporary files.

100 MB of hard disk space for the virtual memory paging file. Be sure to use at least the minimum amount of virtual memory recommended for the installed distribution of Linux
NA
Operating Systems

One of the following Linux distributions (this is the list of distributions tested by Intel; other distributions may or may not work and are not recommended - please contact Intel® Premier Support if you have questions):

Asianux* 3.0
Debian* 4.0
Fedora* 7
Red Hat Enterprise Linux* 3, 4, 5
SGI* ProPack* 5
SUSE LINUX Enterprise Server* 9, 10
TurboLinux* 11
Ubuntu* 7.0.4
NA
Other Software

Linux Developer tools component installed, including gcc 3.2.3, 3.3, 3.4, 4.1, or 4.11, g++ and related tools.

Linux component compat-libstdc++ providing libstdc++.so.5
NA
back to top

Requirements to develop IA-64 architecture-based applications
Component Minimum Recommended
Processor
A system based on an Itanium® 2 processor
NA
RAM
512 MB
1 GB
Disk Space
150 MB of disk space, plus an additional 200 MB during installation for the download and temporary files
NA
Operating Systems

One of the following Linux distributions (this is the list of distributions tested by Intel; other distributions may or may not work and are not recommended - please contact Intel® Premier Support if you have questions):

Asianux* 3.0

Debian* 4.0
Red Hat Enterprise Linux* 3, 4, 5
SUSE LINUX Enterprise Server* 9, 10
TurboLinux* 11
NA
Other Software

Linux Developer tools component installed, including gcc 3.2.3, 3.3, 3.4, 4.1, or 4.11, g++ and related tools.

Linux component compat-libstdc++ providing libstdc++.so.5
We recommend using binutils 2.14 or later, especially if using shared libraries as there are known issues with binutils 2.11

Note on gcc Versions

The above lists of processor model names are not exhaustive - other processor models correctly supporting the same instruction set as those listed are expected to work. Please contact Intel® Premier Support if you have questions regarding a specific processor model.

Notes:
The above lists of processor model names are not exhaustive - other processor models correctly supporting the same instruction set as those listed are expected to work. Please contact Intel® Premier Support if you have questions regarding a specific processor model. Some optimization options have restrictions regarding the processor type on which the application is run. Please see the documentation of these options for more information. Compiling very large source files (several thousands of lines) using advanced optimizations such as -O3, -ipo and -openmp, may require substantially larger amounts of RAM.

Additional System Requirements for Eclipse*

Use of the Eclipse* Integrated Development Environment on Red Hat Enterprise Linux AS 2.1 has the following additional requirements:

Red Hat AS 2.1 Update 6

Mozilla* 1.4 Xft or higher or Firefox* 1.0

For users of the GTK* window system: version 2.2.1 of the GTK+ widget toolkit and associated libraries (GLib, Pango) should be installed

On Turbo Linux 10 and Red Hat 9.0 systems, the BEA* JRockit* JRE that is provided with the Intel support for Eclipse may not function properly, aborting unexpectedly. Another standard JRE can be substituted for use on these systems.

On SuSE Linux Enterprise Server 8 for IA-32 architecture systems, the Eclipse Integrated Development Environment support will not function properly if used with the GTK* window system. The Eclipse support requires a later version of GTK , version 2.2.1, than what is installed by default with this operating system. Note that upgrading the system to GTK version 2.2.1 is non-trivial.

Complete, fully functioning browser support in the Eclipse Integrated Development environment requires the installation of one of these browsers:

Mozilla 1.4 Xft or higher or Firefox 1.0

On systems where these browsers are not installed by default or available otherwise, such as on a SGI Propack 4 system, an alternate browser, e.g. Konqueror, can be used in the Eclipse Integrated Development environment. Within Eclipse, set it as the browser to be used by selecting Windows->Preferences->General->Web Browser and entering it as the external Web Browser. Note that such a browser cannot be designated as the internal Web Browser within Eclipse, and thus there will be no support available for internal web browsing with this configuration.

back to top

Intel provides both the tools and support to enhance the performance, functionality, and efficiency of software applications.
Compatible with leading Windows* and Linux* development environments, Intel® Software Development Products are the fastest and easiest way to take advantage of the latest features of Intel processors. Intel Software Development Products are designed for use in the full development cycle, and include Intel® Performance Libraries, Intel® Compilers (C++, Fortran for Windows, Linux, and Mac OS* X), Intel® VTune™ Analyzer, Intel® Threading Tools and Intel® Cluster Tools.
The Intel® Premier Support Web site provides expert technical support for all Intel software products, product updates and related downloads. For additional product information visit: www.intel.com/software/products.
Intel, the Intel logo, and VTune are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other countries.
*Other brands and names may be claimed as the property of others.
Copyright © 2007, Intel Corporation
back to top