Polyhedron Benchmarks (Intel+Windows)

Win64 on Intel Processor

Absoft
17.0
Absoft(AP)
17.0
FTN95
8.05
Intel
17.0
Intel(AP)
17.0
gfortran
5.4.1 (Lahey-GNU Fortran)
NAG
6.1
PGI
16.9
AC 5.55 5.55 11.35 4.70 4.68 6.48 7.50 8.96
AERMOD 13.40 13.56 26.10 12.16 12.73 26.92 19.55 15.45
AIR 3.72 4.20 9.07 3.02 2.83 5.24 5.45 5.61
CAPACITA 22.49 21.96 45.49 18.77 18.78 20.73 23.54 19.16
CHANNEL2 86.59 68.74 204.20 90.21 55.70 91.74 119.34 90.42
DODUC 20.44 20.70 36.60 15.31 15.40 29.85 24.03 22.31
FATIGUE2 64.56 69.55 295.93 54.56 55.69 38.68 94.16 127.13
GAS_DYN2 66.69 46.05 501.52 68.26 43.33 286.31 181.57 120.41
INDUCT2 85.10 195.89 432.09 71.83 59.54 90.62 130.36 131.77
LINPK 6.14 6.46 6.70 4.98 5.24 5.79 7.42 6.91
MDBX 9.90 10.08 15.03 6.74 5.24 8.22 8.84 9.50
MP_PROP_DESIGN 125.67 34.35 622.82 64.28 13.30 445.16 185.83 97.50
NF 9.11 9.55 17.56 8.37 8.08 8.41 10.13 10.29
PROTEIN 21.81 22.78 40.69 22.12 22.36 21.20 20.24 24.22
RNFLOW 16.04 18.67 42.31 12.69 10.24 14.96 17.84 18.25
TEST_FPU2 65.61 82.17 222.38 46.13 44.34 53.31 86.45 52.02
TFFT2 53.26 53.84 69.01 49.36 49.00 49.75 53.27 49.19
Geometric Mean 24.24 23.78 60.02 20.06 16.64 29.12 31.17 28.20

 

Compiler Switches
Absoft af90 -m64 -O4 -speed_math=10 -fast_math -march=core -xINTEGER -stack:0x30000000
Absoft (autoparallel) -m64 -mavx -O5 -speed_math=10 -fast_math -march=core -xINTEGER -stack:0×30000000
FTN95 ftn95 /p6 /optimize (slink was used to increase the stack size)
Intel ifort /fast /nostandard-realloc-lhs /link /stack:64000000
Intel (autoparallel) ifort /fast /Qparallel /nostandard-realloc-lhs /link /stack:64000000
Lahey-GNU lgf -64 -ofast -unroll -wpo -stack 64000000
NAG nagfor -abi=64 -O4 -s -v -V
PGI pgf90 -V -fastsse -Munroll=n:4 -Mipa=fast,inline

 

Notes
All figures are execution times in seconds – measured on a machine with 2 x Xeon E5-2643 3.30GHz quad core processors, running at stock speed with hyperthreading disabled, 64 GBytes memory, and running 64-bit Windows 7 64-bit. Each figure is the average over at least 10 runs (many more for some). Measurement error is typically <1%. Green cells highlight figures within 10% of the fastest. Red cells indicate figures which are more than 150% of the fastest. So far as possible, we have used the compiler switches which give the best overall results. We have not attempted to tune individual benchmarks, and, in particular cases, different switch settings may give better results. Where possible, compiler switches were set to generate 64 bit executables. The settings used for the Absoft and Intel compilers enable autoparallelization. Autoparallelization settings are not used on any other compilers because we found that they produced no significant performance benefits on this benchmark set.  Thanks are due to Jos Bergervoet for permission to use his CAPACITA benchmark, to Quetzal Associates for permission to use their CHANNEL, FATIGUE, GAS_DYN, INDUCT, PROTEIN and RNFLOW benchmarks, to David Frank for his TEST_FPU benchmark, to Anthony Falzone for the use of MP_PROP_DESIGN, and to Ted Addison of McVehil-Monnett Associates for permission to use AERMOD, an air quality model used by the US Environmental Protection Agency.  All the benchmarks have been modified slightly to fit into our benchmarking harness.

 

Return to Fortran Compiler Comparisons homepage