PSOD (purple screen of death) on ESXi 6.5 or 6.7 when using NFS 4.1 with error NFS41_IOTaskDetach
search cancel

PSOD (purple screen of death) on ESXi 6.5 or 6.7 when using NFS 4.1 with error NFS41_IOTaskDetach

book

Article ID: 317565

calendar_today

Updated On:

Products

VMware vSphere ESXi

Issue/Introduction

Symptoms:
PSOD (purple screen of death) when using NFS 4.1 with below backtrace

YYYY-MM-DDTXX:XX:XX.347Z cpu1:1001390878)@BlueScreen: Failed at bora/modules/vmkernel/nfs41client/nfs41Task.c:107 -- VMK_ASSERT(!vmk_ListIsUnlinkedElement(&ioTask->fileLink))
YYYY-MM-DDTXX:XX:XX.352Z cpu1:1001390878)Code start: 0x41801e400000 VMK uptime: 0:13:44:32.694
YYYY-MM-DDTXX:XX:XX.356Z cpu1:1001390878)0x451a5411ac80:[0x41801e55a9dd]PanicvPanicInt@vmkernel#nover+0x295 stack: 0x41801e55a9dd
YYYY-MM-DDTXX:XX:XX.361Z cpu1:1001390878)0x451a5411ad30:[0x41801e55b196]Panic_vPanic@vmkernel#nover+0x23 stack: 0x4316c117bd10
YYYY-MM-DDTXX:XX:XX.367Z cpu1:1001390878)0x451a5411ad50:[0x41801e57c87c]vmk_PanicWithModuleID@vmkernel#nover+0x41 stack: 0x451a5411adb0
YYYY-MM-DDTXX:XX:XX.372Z cpu1:1001390878)0x451a5411adb0:[0x41801feb60d1]NFS41_IOTaskDetach@(nfs41client)#<None>+0x172 stack: 0x459acea34940
YYYY-MM-DDTXX:XX:XX.379Z cpu1:1001390878)0x451a5411add0:[0x41801feb649c]NFS41_TaskDone@(nfs41client)#<None>+0x181 stack: 0x4316c1169a40
YYYY-MM-DDTXX:XX:XX.384Z cpu1:1001390878)0x451a5411ae30:[0x41801feaeaa7]NFS41_SchedTaskDone@(nfs41client)#<None>+0x58 stack: 0x459acd456580
YYYY-MM-DDTXX:XX:XX.391Z cpu1:1001390878)0x451a5411ae60:[0x41801fe880b7]NFS41_NFSIODoneNFSCB@(nfs41client)#<None>+0x24 stack: 0x459ad27f4a80
YYYY-MM-DDTXX:XX:XX.397Z cpu1:1001390878)0x451a5411ae80:[0x41801fe971c5]NFS41_SessionPathDone@(nfs41client)#<None>+0x202 stack: 0x1
YYYY-MM-DDTXX:XX:XX.402Z cpu1:1001390878)0x451a5411aee0:[0x41801fde781a][email protected]#1+0x5f stack: 0x451a5411af40
YYYY-MM-DDTXX:XX:XX.408Z cpu1:1001390878)0x451a5411af00:[0x41801fde7ab6][email protected]#1+0x28f stack: 0x4305cc0cf3c0
YYYY-MM-DDTXX:XX:XX.415Z cpu1:1001390878)0x451a5411af90:[0x41801fdf96da][email protected]#1+0x33 stack: 0x451a540a3000
YYYY-MM-DDTXX:XX:XX.420Z cpu1:1001390878)0x451a5411afd0:[0x41801e87ab97]CpuSched_StartWorld@vmkernel#nover+0xf4 stack: 0x0


OR below backtrace

YYYY-MM-DDTXX:XX:XX.710Z cpu22:66856)Code start: 0x418010600000 VMK uptime: 0:01:19:06.007
YYYY-MM-DDTXX:XX:XX.710Z cpu22:66856)0x43916941bdd0:[0x418011cf133d]NFS41_TaskDone@(nfs41client)#<None>+0x89 stack: 0x430f10efce10
YYYY-MM-DDTXX:XX:XX.710Z cpu22:66856)0x43916941be50:[0x418011cec263]NFS41_SchedTaskDone@(nfs41client)#<None>+0x1b stack: 0x430f10efd500
YYYY-MM-DDTXX:XX:XX.711Z cpu22:66856)0x43916941be70:[0x418011cda194]NFS41_NFSIODoneNFSCB@(nfs41client)#<None>+0x24 stack: 0x430f10efd7c0
YYYY-MM-DDTXX:XX:XX.711Z cpu22:66856)0x43916941be90:[0x418011ce040b]NFS41_SessionPathDone@(nfs41client)#<None>+0xcb stack: 0x832eac8c347
YYYY-MM-DDTXX:XX:XX.711Z cpu22:66856)0x43916941bf20:[0x41801159252f][email protected]#1+0x5f stack: 0x4307f787b080
YYYY-MM-DDTXX:XX:XX.712Z cpu22:66856)0x43916941bf40:[0x4180115926a6][email protected]#1+0x16a stack: 0x4307f787b080
YYYY-MM-DDTXX:XX:XX.712Z cpu22:66856)0x43916941bfc0:[0x41801159c0d5][email protected]#1+0x15 stack: 0x439169427100
YYYY-MM-DDTXX:XX:XX.712Z cpu22:66856)0x43916941bfe0:[0x4180108c93c5]CpuSched_StartWorld@vmkernel#nover+0x99 stack: 0x0



there is a NFS41 datastore:
Volume Name Host(s) Share Accessible Mounted Read-Only Security isPE Hardware Acceleration
-----------------------------------------------------------------------------------------------------------
NasBackup 11.44.5.17 /NasBackup true true false AUTH_SYS false Not Supported


Cause

This is caused by a race condition in NFS41_IORequestDoneCB

Resolution


The issue is resolved in the following ESXi updates:

ESXi 6.5 Update 3
ESXi 6.7 Update 2