http://www.aertia.com

Lahey/Fujitsu Fortran 95 v6.2 for Linux

by Lahey Computer Systems

LF95 for Linux includes Full Fortran 95/90/77, Automatic Parallelization, OpenMP 2.0 compatibility, New global compile-time diagnostics, File I/O speed improvements, Thread-safe BLAS and LAPACK, Improved runtime diagnostics, Winteracter Starter Kit, Thread-safe SSL2 math library, and more!

Speed. Dependability. Great Tech Support. Lahey/Fujitsu Fortran 95. The standard for Fortran programming from the leader in Fortran language systems.

We've combined our 35 years of producing award-winning Fortran language systems with Fujitsu's compiler expertise and high-performance code generator to deliver the most-productive, best-supported Fortran 95 language system for the PC.

Whether you write new Fortran programs or downsize existing applications, you need speed, dependability, and great tech support. You need Lahey/Fujitsu Fortran 95!


LF95 v6.2 Delivers!
Important features include...
  • Automatic Parallelization
  • Pentium 4 and Xeon optimizations with SSE2 instructions
  • OpenMP v2.0 Support
  • Prefetch optimizations for Pentium III and Athlon
  • Unsurpassed global compile-time and runtime diagnostics
  • Allocate arrays up to 2 GB
  • External file size 2**64 bytes
    "Our application is based on the finite-difference time-domain method. Using the auto-parallelization feature of LF95 PRO v6.2, we reduced the execution time by approximately 50 %."

    Masafumi Fujii
    Dept. of Electrical, Electronic and System Engineering
    Toyama University, Toyama, Japan
  • New Wisk, Winteracter Starter Kit
  • Thread-safe BLAS and LAPACK v3.0 (includes SSE2 version)
  • Thread-safe SSL2 math library (includes SSE2 version)
  • Automake, automatic make utility
  • ALLOCATABLE attributes on array components
  • Character variable length limit: 2,147,483,647
  • MPI compatible with MPICH source included
  • FDB debugger
  • Compatible with TotalView parallel debugger

LF95 v6.2 is available in two configurations, Express and PRO:

LF95 Express includes the powerful Lahey/Fujitsu Fortran 95 optimizing compiler, command line debugger, link-compatibility with g77 and egcs, online documentation, and free e-mail support.

LF95 PRO adds auto-parallelization, OpenMP compatibility, the Winteracter Starter Kit, WiSK, for creating Windows GUIs and displaying graphics, thread-safe BLAS and LAPACK, Polyhedron's Automake utility, the Fujitsu SSL2 math library (thread-safe for parallel applications), and free phone support.


LF95 Performance

LF95 6.2 features P4 optimizations with SSE2 instructions. We tested v6.2 optimizations on a 1.8GHz P4 with 512 MB of PC2100 RAM, running SuSE 8.1, using Polyhedron?s (www.polyhedron.com) Fortran benchmarks. Specifying the new switches --tp4, --sse2, --zfm, --o2, and -x, the 90 benchmarks ran an average of 12.4% faster and the 77 benchmarks 7.9% faster than they did when built with LF95 Linux v6.1. Try LF95 v6.2 on your code today!


LF95 Optimizations

Basic Optimization

  • Constant folding
  • Common subexpression elimination
  • Copy propagation
  • Strength Reduction
  • Algebraic simplifications
  • Dead code elimination
  • Peephole optimization
  • Loop invariant code motion
  • Transform array element to simple variable
  • Local Instruction scheduling
  • Address calculation optimization

Program Reconstruction Optimizations

  • Loop unrolling
  • Loop interchange

Procedure Optimization

  • Inlining mathematical functions
  • Stack optimization

Others

  • P4 with SSE2 instructions
  • Prefetch for Pentium III and Athlon processors
  • I486/Pentium/Pentium PRO instruction selection
  • Using fast input/output libraries


Link Fujitsu C, g77, and egcs object files
LF95 supports static linking with Fujitsu C, g77, or egcs. Combine your Fortran and C/C++ code into one executable. For the routines you don't want to develop yourself, you can also link with C/C++ routines from commercially available libraries.


Legacy Fortran Support
LF95 extends its language support in other directions adding many legacy Fortran features, including VAX structures and the various UNIX service routines. These features further facilitate your move to cost/performance efficiency on the PC platform:

  • Unlimited number of continuation lines in free or fixed source form
  • DO UNTIL statement
  • FIND statement
  • STRUCTURE and END STRUCTURE statements
  • UNION and END UNION statements
  • MAP and END MAP statements
  • RECORD statement
  • Non-standard POINTER statement
  • AUTOMATIC statement
  • STATIC statement
  • VALUE statement
  • BYTE statement
  • Hollerith constants
  • Alternative forms of binary, octal, and hexadecimal constants
  • Binary, octal, or hexadecimal constants in a DATA, declaration statement
  • Period structure component separator
  • IMPLICIT UNDEFINED statement
  • Namelist input/output on internal file
  • FORM = 'BINARY'
  • TOTALREC specifier
  • STATUS = 'SHR'
  • Gw, $, \, and R edit descriptors
  • LOC intrinsic function
  • The following service subroutines: ABORT, BEEP, BIC, BIS, CLOCK, CLOCKM, DATE, EXIT, ERRSAV, ERRSTR, ERRSET, ERRTRA, FDATE, FREE,GETARG, GETDAT, GETLOG, GETPARM, GETTIM, GMTIME, IBTOD, IDATE, IETOM, ITIME, IVALUE, LTIME, MTOIE, PERROR, PRNSET, QSORT, SETRCD, SETBIT, SIGNAL, SLEEP
  • The following service functions: ACCESS, ALARM, BIT, CHDIR, CHMOD, CTIME, DRAND, DTIME, ETIME, FGETC, FPUTC, FSEEK, FSTAT, FTELL, GETC, GETCWD, GETFD, GETPID, HOSTNM, IARGC, IERRNO, INMAX, IOINIT, IRAND, JDATE, KILL, LNBLNK, LONG, LSTAT, MALLOC, NARGS, PUTC, RAN, RAND, RENAME, RINDEX, RTC, SECOND, SECNDS, SETDAT, SETTIM, SHORT, STAT, TIME, TIMEF, UNLINK

ANSI/ISO-Compliant Fortran 95
LF95 is a complete implementation of the ANSI/ISO Fortran 95 standard. Fortran 95 offers some small but important improvements over Fortran 90, including the ability to create your own elemental procedures, default initialization for structure components, the NULL intrinsic for initializing pointers, the FORALL construct, and a standard CPU_TIME intrinsic procedure.


Automatic Parallelization
The LF95 compiler automatically parallelizes DO loops and array operations without you having to make modifications to the program. This makes it easy to migrate source programs to other platforms (as long as the program conforms with the Fortran Standard). The effect is to save elapsed execution time by using two or more CPUs simultaneously. For instance, if a DO loop can be executed in parallel by dividing it in half, then, theoretically, the execution time of this DO loop may be cut in half. In practice, improving performance requires some care and some work on the part of the programmer. During compilation, the auto-parallel function will return information regarding which processes were (and which were not) parallelized and why. While certain loops can be analyzed sufficiently to be parallelized by the compiler without input from the programmer, many loops have data dependencies that prevent automatic parallelization because of the potential for incorrect results. For that reason, LF95 PRO also includes optimization control lines (OCLs) that provide information necessary for the compiler to parallelize these otherwise unparallelizable loops. The OCLs are Fortran comments in a particular format, for example:

!OCL PARALLEL

Note that programs with OCLs are standard-conforming and can be compiled with other compilers that do not support OCLs.

Four compiler switches control automatic parallelization: --parallel, --threads, --threadstack, and --ocl. Details of automatic parallelization (loop slicing, interchange, distribution, fusion, and reduction, as well as OCL syntax and specifiers) are documented in the LF95 User's Guide.

OpenMP v2.0 Compatibility
OpenMP specifies a set of compiler directives, library routines, and environment variables for shared-memory parallelism in Fortran and C/C++ programs. LF95 Linux PRO v6.2 supports the OpenMP v2.0 specification for Fortran. Like automatic parallelization, OpenMP directives are used to parallelize a program that runs on a computer with more than one processor. With OpenMP you have more control over how code is parallelized, but also more coding to do.

The LF95 Linux PRO v6.2 CD includes the OpenMP v2.0 Fortran specification in PDF.

Winteracter Starter Kit
Use the Winteracter Starter Kit - WiSK - for creating true X/Windows programs with Fortran. WiSK is a subset of the X/Winteracter Library created by Interactive Software Services, Ltd. (X/Winteracter is available from Lahey.) X/Winteracter is a Fortran 95-callable, 32-bit, X/Windows, user-interface and graphics development kit. Derived from X/Winteracter, WiSK provides a library of subroutines for window management, input handling, dialog management, and high resolution graphics. Motif v2.2 is supported. Linux distributions such as Red Hat and SuSE are now shipping v2.2 as standard.

A new Resource Editor combines and improves the old menu and dialog editors and adds an image editor for icon, cursor,and bitmap-button creation. This substantially streamlines resource file maintenance. The included Winteracter Application Wizard provides a quick and easy method of generating a substantial starter application tailored to your needs.

A new Graphics Text(GT) group supersedes the Graphics Character Output group. The new GT interface is easier to understand and typically requires fewer subroutine calls. New features include the following: software fonts defined within the library rather than in external data files, thus eliminating the need to distribute and locate these files; all graphics text can optionally be plotted on an opaque background; underlining is available; text can be rotated and/or vertical;handling of proportional spacing has been improved, using ?native? font spacing, where available; vector-based software fonts can be drawn in a bold style; and character size is now expressed in a more meaningful manner. Numerous other enhancements: graphics can now be redirected to a dialog field instead of to a window. Windows and dialogs can now be combined. Menus can now be added to child windows in addition to the root window. Status bar help prompts can be attached to menu items. Support for progress bars is added. Routines to change the state of a field and move the cursor to a specific field are added. The mouse cursor is selectable and mouse double-clicks are reported.



BLAS and LAPACK
BLAS is a library for vector and matrix operations. The BLAS thread-safe version is based on BLAS provided on Netlib. Included in LF95 v6.2 is an optimized version for the Pentium 4 with SSE2 instructions. BLAS includes 57 functions. The total number of routines for all precision types amounts to approximately 170.

BLAS thread-safe version provides the following routines:
Level 1 BLAS : Vector operations
Level 2 BLAS : Matrix and vector operations
Level 3 BLAS : Matrix and matrix operations
Sparse-BLAS : Sparse vector operations

The thread-safe implementation of BLAS has exactly the same subroutine names and calling parameters as those of the Netlib baseline version.

Differences include:

  • the thread-safe version can be used in the environment of SMP (Symmetric Multiple Processing)
  • subroutines of the thread-safe version can be called from an OpenMP Fortran program

The purpose of using BLAS thread-safe version is to have a subroutine concurrently perform operations on different sets of data that are independent from each other, and thus reduce the time necessary to finish all the operations.


LAPACK is a library of linear algebra routines. The LAPACK thread-safe version is based on LAPACK 3.0 provided on Netlib. Included in LF95 v6.2 is an optimized version for the Pentium 4 with SSE2 instructions. LAPACK includes approximately 300 functions. The total number of routines for all precision types amounts to approximately 1100.

LAPACK provides the following routines:

  • Linear equations
  • Linear least squares problems
  • Eigenvalue problems
  • Singular value decomposition

The LAPACK thread-safe version, like the BLAS version, can be called from an OpenMP program in the environment of SMP.

Fujitsu Scientific Subroutine Library 2
The Fujitsu Scientific Subroutine Library 2 (SSL2) has been in use for years in Japan on Fujitsu mainframe and workstation hardware. Included in LF95 v6.2 is an optimized version for the Pentium 4 with SSE2 instructions. SSL2 offers over 250 optimized thread-safe routines in the following areas:

Linear Algebra
Matrix Storage Mode Conversion
Matrix Manipulation
Linear Equations and Matrix Inversion (Direct Method)
Least Squares Solution

Eigenvalues and Eigenvectors
Eigenvalues and Eigenvectors of a Real Matrix
Eigenvalues and Eigenvectors of a Complex Matrix
Eigenvalues and Eigenvectors of a Real Symmetric Matrix
Eigenvalues and Eigenvectors of a Hermitian Matrix
Eigenvalues and Eigenvectors of a Real Symmetric Band Matrix
Eigenvalues and Eigenvectors of a Real Symmetric Generalized Eigenproblem
Eigenvalues and Eigenvectors of a Real Symmetric Band Generalized Eigenproblem

Nonlinear Equations
Polynomial Equations
Transcendental Equations
Nonlinear Simultaneous Equations

Extrema
Minimization of Function with a Variable
Unconstrained Minimization of Multivariable Function
Unconstrained Minimization of Sum of Squares of Functions (Nonlinear Least Squares Solution)
Linear Programming
Nonlinear Programming (Constrained Minimization of Multivariable Function)

Interpolation and Approximation
Interpolation
Approximation
Smoothing
Series

Transforms
Discrete Real Fourier Transforms
Discrete Cosine Transforms
Discrete Sine Transforms
Discrete Complex Fourier Transforms
Laplace Transform

Numerical Differentiation and Quadrature

Differential Equations

Special Functions
Elliptic Integrals
Exponential Integral
Sine and Cosine Integrals
Fresnel Integrals
Gamma Functions
Error Functions
Bessel Functions
Normal Distribution Functions

Pseudo Random Numbers
Pseudo Random Generation
Pseudo Random Testing


LF95 System Requirements

  • glibc 2.2.4 and higher (see below for tested distributions).
  • X-Windows to view online pdf manuals.
  • 35MB of available hard disk space for LF95 Express, 70MB for LF95 PRO.
  • 32MB RAM.
  • CD-ROM Drive.
  • 80486DX or Pentium-compatible processor
  • We support LF95 v6.2 on the following distributions:
Distribution Kernel libc
Debian v3.0 2.2.20 2.2.5
Mandrake v8.2 2.4.18-6 2.2.4
Mandrake v9.0 2.4.19 2.2.5
Mandrake v9.1 2.4.21 2.3.1
Red Hat v7.2 2.4.7 2.2.4
Red Hat v7.3 2.4.18 2.2.5
Red Hat v8.0 2.4.18 2.2.93
Red Hat v9.0 2.4.20-6 2.3.2
Slackware v8.1 2.4.18 2.2.5
SuSE v7.2 2.4.4 2.2.2
SuSE v7.3 2.4.10 2.2.4
SuSE v8.0 2.4.18 2.2.5
SuSE v8.1 2.4.19 2.2.5
SuSE v8.2 2.4.20-4 2.3.2


© 2004, Aertia, S.L.