linux/net/sunrpc/xprtrdma
Chuck Lever ec62f40d35 xprtrdma: Ensure ia->ri_id->qp is not NULL when reconnecting
Devesh Sharma <Devesh.Sharma@Emulex.Com> reports that after a
disconnect, his HCA is failing to create a fresh QP, leaving
ia_ri->ri_id->qp set to NULL. But xprtrdma still allows RPCs to
wake up and post LOCAL_INV as they exit, causing an oops.

rpcrdma_ep_connect() is allowing the wake-up by leaking the QP
creation error code (-EPERM in this case) to the RPC client's
generic layer. xprt_connect_status() does not recognize -EPERM, so
it kills pending RPC tasks immediately rather than retrying the
connect.

Re-arrange the QP creation logic so that when it fails on reconnect,
it leaves ->qp with the old QP rather than NULL.  If pending RPC
tasks wake and exit, LOCAL_INV work requests will flush rather than
oops.

On initial connect, leaving ->qp == NULL is OK, since there are no
pending RPCs that might use ->qp. But be sure not to try to destroy
a NULL QP when rpcrdma_ep_connect() is retried.

Signed-off-by: Chuck Lever <chuck.lever@oracle.com>
Signed-off-by: Anna Schumaker <Anna.Schumaker@Netapp.com>
2014-06-04 08:56:47 -04:00
..
Makefile xprtrdma: add separate Kconfig options for NFSoRDMA client and server support 2014-03-28 18:02:12 -04:00
rpc_rdma.c xprtrdma: Simplify rpcrdma_deregister_external() synopsis 2014-06-04 08:56:40 -04:00
svc_rdma_marshal.c svcrdma: underflow issue in decode_write_list() 2013-07-15 11:46:23 -04:00
svc_rdma_recvfrom.c Fix regression in NFSRDMA server 2014-03-28 18:02:11 -04:00
svc_rdma_sendto.c svcrdma: fix offset calculation for non-page aligned sge entries 2014-03-28 18:02:13 -04:00
svc_rdma_transport.c svcrdma: fix printk when memory allocation fails 2014-03-27 16:31:56 -04:00
svc_rdma.c net: Convert uses of typedef ctl_table to struct ctl_table 2013-06-13 02:36:09 -07:00
transport.c xprtrdma: Make rpcrdma_ep_destroy() return void 2014-06-04 08:56:41 -04:00
verbs.c xprtrdma: Ensure ia->ri_id->qp is not NULL when reconnecting 2014-06-04 08:56:47 -04:00
xprt_rdma.h xprtrdma: Limit work done by completion handler 2014-06-04 08:56:45 -04:00