【问题标题】:Sporadic failure in GLXGLX 的零星故障
【发布时间】:2020-08-15 05:38:54
【问题描述】:

我正在使用 Xserver 在无头模式下运行 OpenGL,并多次调用此 api:https://github.com/RobotLocomotion/drake/blob/74292cacd1c42d6b3e682dc836254cdb834ea2e6/geometry/render/render_engine_vtk.cc#L311

偶尔但几乎总是有一个

X Error of failed request:  BadValue (integer parameter out of range for operation)
  Major opcode of failed request:  154 (GLX)
  Minor opcode of failed request:  3 (X_GLXCreateContext)
  Value in failed request:  0x0
  Serial number of failed request:  61
  Current serial number in output stream:  62

glxinfo:

glxinfo
name of display: :0
XIO:  fatal IO error 11 (Resource temporarily unavailable) on X server ":0"
      after 50 requests (50 known processed) with 0 events remaining.

/var/log/Xorg.0.log 日志的最后几行:

[ 47757.261] (EE) Backtrace:
[ 47757.261] (EE) 0: /usr/lib/xorg/Xorg (xorg_backtrace+0x4d) [0x557e48dd2acd]
[ 47757.261] (EE) 1: /usr/lib/xorg/Xorg (0x557e48c1a000+0x1bc869) [0x557e48dd6869]
[ 47757.261] (EE) 2: /lib/x86_64-linux-gnu/libpthread.so.0 (0x7f4cbddc7000+0x128a0) [0x7f4cbddd98a0]
[ 47757.261] (EE) 3: /usr/lib/x86_64-linux-gnu/nvidia/xorg/nvidia_drv.so (0x7f4cba768000+0x479100) [0x7f4cbabe1100] 
[ 47757.261] (EE) 
[ 47757.262] (EE) Segmentation fault at address 0x8
[ 47757.262] (EE) 
Fatal server error:
[ 47757.262] (EE) Caught signal 11 (Segmentation fault). Server aborting

机器:18.04.2-Ubuntu

NVIDIA-SMI 440.100 驱动程序版本:440.100 CUDA 版本:10.2

有人可以告诉我接下来要在这里调试什么吗?

【问题讨论】:

    标签: opengl ubuntu-18.04 nvidia xserver drake


    【解决方案1】:

    我在自己的 CI 中也看到了这一点:

    [ 18228.470] (EE) Backtrace:
    [ 18228.470] (EE) 0: /usr/lib/xorg/Xorg (xorg_backtrace+0x4d) [0x55e0ca9fcacd]
    [ 18228.470] (EE) 1: /usr/lib/xorg/Xorg (0x55e0ca844000+0x1bc869) [0x55e0caa00869]
    [ 18228.470] (EE) 2: /lib/x86_64-linux-gnu/libpthread.so.0 (0x7fce3e7d6000+0x128a0) [0x7fce3e7e88a0]
    [ 18228.470] (EE) 3: /usr/lib/x86_64-linux-gnu/nvidia/xorg/nvidia_drv.so (0x7fce3b177000+0x479100) [0x7fce3b5f0100]
    [ 18228.470] (EE) 
    [ 18228.470] (EE) Segmentation fault at address 0x8
    

    不同的 ASLR,但跟踪中的低位字节相同。

    我正在使用 xorg-server 2:1.19.6-1ubuntu4.4。

    [ 17925.887] (II) Module nvidia: vendor="NVIDIA Corporation"
    [ 17925.887]    compiled for 1.6.99.901, module version = 1.0.0
    [ 17925.887]    Module class: X.Org Video Driver
    [ 17925.887] (II) NVIDIA dlloader X Driver  440.100  Fri May 29 08:21:27 UTC 2020
    

    不幸的是,我还不能调试这个。

    我的即兴体验(尚未得到数据证实)是,几个月前 Ubuntu 将所有人从 nvidia 430 强制升级到 nvidia 440 时,这种情况变得更加频繁。

    【讨论】:

      猜你喜欢
      • 1970-01-01
      • 1970-01-01
      • 2011-09-17
      • 2015-12-11
      • 1970-01-01
      • 2020-06-10
      • 1970-01-01
      • 1970-01-01
      • 1970-01-01
      相关资源
      最近更新 更多