linpack测试 错误信息 caused collective abort of all ranks
本人想在LINUX环境下测试本机的LINPACK值,但在实验过程中碰到如下问题:
环境: 1 RHEL5.2
2.MPI---> /usr/local/mpi (安装没有问题)
3.atlas库文件来自atlas3.6.0_Linux_P4SSE2.tgz解压文件, atlas库文件放在/usr/local/netlib下
(libatlas.a libcblas.a libf77blas.a liblapack.a)
4 .HPL---> /usr/local/hpl (安装没有问题,在bin/“arch“下生成HPL.dat和xhpl)
运行mpirun -np 4 xhpl 后出现的问题:
================================================================================
HPLinpack 2.0 -- High-Performance Linpack benchmark -- September 10, 2008
Written by A. Petitet and R. Clint Whaley, Innovative Computing Laboratory, UTK
Modified by Piotr Luszczek, Innovative Computing Laboratory, UTK
Modified by Julien Langou, University of Colorado Denver
================================================================================
An explanation of the input/output parameters follows:
T/V : Wall time / encoded variant.
N : The order of the coefficient matrix A.
NB : The partitioning blocking factor.
P : The number of process rows.
Q : The number of process columns.
Time : Time in seconds to solve the linear system.
Gflops : Rate of execution for solving the linear system.
The following parameter values will be used:
N : 29 30 34 35
NB : 1 2 3 4
PMAP : Row-major process mapping
P : 2 1 4
Q : 2 4 1
PFACT : Left Crout Right
NBMIN : 2 4
NDIV : 2
RFACT : Left Crout Right
BCAST : 1ring
DEPTH : 0
SWAP : Mix (threshold = 64)
L1 : transposed form
U : transposed form
EQUIL : yes
ALIGN : 8 double precision words
--------------------------------------------------------------------------------
- The matrix A is randomly generated for each test.
- The following scaled residual check will be computed:
||Ax-b||_oo / ( eps * ( || x ||_oo * || A ||_oo + || b ||_oo ) * N )
- The relative machine precision (eps) is taken to be 1.110223e-16
- Computational tests pass if scaled residuals are less than 16.0
rank 2 in job 2 localhost.localdomain_56503 caused collective abort of all ranks
exit status of rank 2: killed by signal 11
rank 0 in job 2 localhost.localdomain_56503 caused collective abort of all ranks
exit status of rank 0: killed by signal 11
请问上面两行错误是怎么回事?我现在一点眉目都没有。
我在我自己的电脑上面也是按照这样装的,但在我的电脑上面可以(32bit系统)。在服务器上面(64bit系统)就不行。我自己电脑上面就安装了所有的编程软件,服务器上面安装了所有的软件包(以后需要),难道软件环境不同造成的?
/////////////////////////////////////////////////////
Make.Linux_PII_CBLAS
/////////////////////////////////////////////////////
SHELL = /bin/sh
#
CD = cd
CP = cp
LN_S = ln -s
MKDIR = mkdir
RM = /bin/rm -f
TOUCH = touch
# - Platform identifier ------------------------------------------------
ARCH = Linux_PII_CBLAS
# - HPL Directory Structure / HPL library ------------------------------
TOPdir = /usr/local/hpl-2.0
INCdir = $(TOPdir)/include
BINdir = $(TOPdir)/bin/$(ARCH)
LIBdir = $(TOPdir)/lib/$(ARCH)
#
HPLlib = $(LIBdir)/libhpl.a
# - Message Passing library (MPI) --------------------------------------
MPdir = /usr/local/mpi
MPinc = -I$(MPdir)/include
MPlib = $(MPdir)/lib/libmpich.a
# - Linear Algebra library (BLAS or VSIPL) -----------------------------
LAdir = /usr/local/netlib
LAinc =
LAlib = $(LAdir)/libcblas.a $(LAdir)/libatlas.a
# - F77 / C interface --------------------------------------------------
F2CDEFS = -DAdd_ -DF77_INTEGER=int -DStringSunStyle
# - HPL includes / libraries / specifics -------------------------------
HPL_INCLUDES = -I$(INCdir) -I$(INCdir)/$(ARCH) $(LAinc) $(MPinc)
HPL_LIBS = $(HPLlib) $(LAlib) $(MPlib)
# - Compile time options -----------------------------------------------
HPL_OPTS = -DHPL_CALL_CBLAS
HPL_DEFS = $(F2CDEFS) $(HPL_OPTS) $(HPL_INCLUDES)
# - Compilers / linkers - Optimization flags ---------------------------
CC = /usr/bin/gcc
CCNOOPT = $(HPL_DEFS)
CCFLAGS = $(HPL_DEFS) -fomit-frame-pointer -O3 -funroll-loops
#
# On some platforms, it is necessary to use the Fortran linker to find
# the Fortran internals used in the BLAS library.
#
LINKER = /usr/local/mpi/bin/mpif77
LINKFLAGS = $(CCFLAGS)
#
ARCHIVER = ar
ARFLAGS = r
RANLIB = echo
#
# ----------------------------------------------------------------------
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(1)
看一下这个http://www.mdbbs.org/viewthread.php?tid=13775&page=1