PSOD on ESXi 6.5 or 6.7 when using NFS 4.1 with error NFS41_IOTaskDetach
search cancel

PSOD on ESXi 6.5 or 6.7 when using NFS 4.1 with error NFS41_IOTaskDetach

book

Article ID: 317565

calendar_today

Updated On:

Products

VMware vSphere ESXi

Issue/Introduction

Symptoms:
PSOD when using NFS 4.1 with below backtrace

2017-12-07T07:35:12.347Z cpu1:1001390878)@BlueScreen: Failed at bora/modules/vmkernel/nfs41client/nfs41Task.c:107 -- VMK_ASSERT(!vmk_ListIsUnlinkedElement(&ioTask->fileLink))
2017-12-07T07:35:12.352Z cpu1:1001390878)Code start: 0x41801e400000 VMK uptime: 0:13:44:32.694
2017-12-07T07:35:12.356Z cpu1:1001390878)0x451a5411ac80:[0x41801e55a9dd]PanicvPanicInt@vmkernel#nover+0x295 stack: 0x41801e55a9dd
2017-12-07T07:35:12.361Z cpu1:1001390878)0x451a5411ad30:[0x41801e55b196]Panic_vPanic@vmkernel#nover+0x23 stack: 0x4316c117bd10
2017-12-07T07:35:12.367Z cpu1:1001390878)0x451a5411ad50:[0x41801e57c87c]vmk_PanicWithModuleID@vmkernel#nover+0x41 stack: 0x451a5411adb0
2017-12-07T07:35:12.372Z cpu1:1001390878)0x451a5411adb0:[0x41801feb60d1]NFS41_IOTaskDetach@(nfs41client)#<None>+0x172 stack: 0x459acea34940
2017-12-07T07:35:12.379Z cpu1:1001390878)0x451a5411add0:[0x41801feb649c]NFS41_TaskDone@(nfs41client)#<None>+0x181 stack: 0x4316c1169a40
2017-12-07T07:35:12.384Z cpu1:1001390878)0x451a5411ae30:[0x41801feaeaa7]NFS41_SchedTaskDone@(nfs41client)#<None>+0x58 stack: 0x459acd456580
2017-12-07T07:35:12.391Z cpu1:1001390878)0x451a5411ae60:[0x41801fe880b7]NFS41_NFSIODoneNFSCB@(nfs41client)#<None>+0x24 stack: 0x459ad27f4a80
2017-12-07T07:35:12.397Z cpu1:1001390878)0x451a5411ae80:[0x41801fe971c5]NFS41_SessionPathDone@(nfs41client)#<None>+0x202 stack: 0x1
2017-12-07T07:35:12.402Z cpu1:1001390878)0x451a5411aee0:[0x41801fde781a][email protected]#1+0x5f stack: 0x451a5411af40
2017-12-07T07:35:12.408Z cpu1:1001390878)0x451a5411af00:[0x41801fde7ab6][email protected]#1+0x28f stack: 0x4305cc0cf3c0
2017-12-07T07:35:12.415Z cpu1:1001390878)0x451a5411af90:[0x41801fdf96da][email protected]#1+0x33 stack: 0x451a540a3000
2017-12-07T07:35:12.420Z cpu1:1001390878)0x451a5411afd0:[0x41801e87ab97]CpuSched_StartWorld@vmkernel#nover+0xf4 stack: 0x0


OR below backtrace

2018-02-01T05:32:11.710Z cpu22:66856)Code start: 0x418010600000 VMK uptime: 0:01:19:06.007
2018-02-01T05:32:11.710Z cpu22:66856)0x43916941bdd0:[0x418011cf133d]NFS41_TaskDone@(nfs41client)#<None>+0x89 stack: 0x430f10efce10
2018-02-01T05:32:11.710Z cpu22:66856)0x43916941be50:[0x418011cec263]NFS41_SchedTaskDone@(nfs41client)#<None>+0x1b stack: 0x430f10efd500
2018-02-01T05:32:11.711Z cpu22:66856)0x43916941be70:[0x418011cda194]NFS41_NFSIODoneNFSCB@(nfs41client)#<None>+0x24 stack: 0x430f10efd7c0
2018-02-01T05:32:11.711Z cpu22:66856)0x43916941be90:[0x418011ce040b]NFS41_SessionPathDone@(nfs41client)#<None>+0xcb stack: 0x832eac8c347
2018-02-01T05:32:11.711Z cpu22:66856)0x43916941bf20:[0x41801159252f][email protected]#1+0x5f stack: 0x4307f787b080
2018-02-01T05:32:11.712Z cpu22:66856)0x43916941bf40:[0x4180115926a6][email protected]#1+0x16a stack: 0x4307f787b080
2018-02-01T05:32:11.712Z cpu22:66856)0x43916941bfc0:[0x41801159c0d5][email protected]#1+0x15 stack: 0x439169427100
2018-02-01T05:32:11.712Z cpu22:66856)0x43916941bfe0:[0x4180108c93c5]CpuSched_StartWorld@vmkernel#nover+0x99 stack: 0x0



there is a NFS41 datastore:
Volume Name Host(s) Share Accessible Mounted Read-Only Security isPE Hardware Acceleration
-----------------------------------------------------------------------------------------------------------
NasBackup 11.44.5.17 /NasBackup true true false AUTH_SYS false Not Supported


Cause

This is caused by a race condition in NFS41_IORequestDoneCB

Resolution


The issue is resolved in the following ESXi updates:

ESXi 6.5 Update 3
ESXi 6.7 Update 2

Both are available at
VMware Downloads.