
Personal/pragyagandhi/az auth none change #555


Open

wants to merge 53 commits into master

Conversation

pragyagandhi

No description provided.

Nagendra Tomar and others added 30 commits August 13, 2024 04:30
These will be updated when we create the RPC request and when we receive
the RPC response bytes, respectively. The req and resp sizes contain the
RPC header, the NFS header and optional data bytes.

Users can query the req and resp sizes using the two newly added APIs:

rpc_pdu_get_req_size()
rpc_pdu_get_resp_size()
…esp_size

Personal/linuxsmiths/add req resp size
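
A rough usage sketch follows; the PR does not show the exact prototypes, so the size_t return type and pdu-handle parameter below are assumptions.

    #include <stddef.h>
    #include <stdio.h>

    struct rpc_pdu;    /* opaque pdu handle */

    /* Assumed prototypes; the exact signatures are not shown in this PR. */
    size_t rpc_pdu_get_req_size(struct rpc_pdu *pdu);
    size_t rpc_pdu_get_resp_size(struct rpc_pdu *pdu);

    static void log_pdu_sizes(struct rpc_pdu *pdu)
    {
        /* Each size covers the RPC header, the NFS header and any data. */
        fprintf(stderr, "pdu: req=%zu bytes, resp=%zu bytes\n",
                rpc_pdu_get_req_size(pdu), rpc_pdu_get_resp_size(pdu));
    }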
rpc_find_pdu() removes the pdu from the waitpdu list and adds it to
rpc->pdu. If a reconnect happens during this time, we were not correctly
re-queuing this pdu to rpc->outqueue, causing the application to wait
forever for the callback.
…_requeue_rpc_pdu_on_reconnect

Personal/linuxsmiths/correctly requeue rpc pdu on reconnect
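
A minimal sketch of the fix, not the verbatim libnfs code: on reconnect, the pdu that rpc_find_pdu() had already detached from the waitpdu list and parked in rpc->pdu must go back on outqueue, or its callback never fires. The struct layout and helper signature below are assumptions.

    struct rpc_pdu;                       /* opaque */

    struct rpc_context {                  /* reduced to what the sketch needs */
        struct rpc_pdu *pdu;              /* pdu currently being processed */
        /* ... */
    };

    /* Assumed helper: puts a pdu back on outqueue for retransmission. */
    void rpc_return_to_outqueue(struct rpc_context *rpc, struct rpc_pdu *pdu);

    static void requeue_inflight_pdu_on_reconnect(struct rpc_context *rpc)
    {
        if (rpc->pdu != NULL) {
            rpc_return_to_outqueue(rpc, rpc->pdu);
            rpc->pdu = NULL;    /* ownership returned to outqueue */
        }
    }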
…s_for_catching_bug_where_libnfs_misses_a_pdu

Added instrumentation to catch missing pdus in libnfs.
These are a result of some bugs found during stress runs.

1. LIBNFS_LIST_REMOVE() was not setting q->tail properly, so it was
   causing some PDUs to get dropped from outqueue. The application would
   never hear about these PDUs and would wait indefinitely.
   Added rpc_remove_pdu_from_queue(), which correctly updates q->tail.
2. rpc_return_to_queue() was adding the pdu to the head of outqueue.
   That is fine when we are resetting the connection, but otherwise it may
   cause incorrect data to be sent out, as the pdu at the head may have
   been half written. One such case is when we bring back a pdu from the
   waitpdu queue to outqueue after a retransmit timeout.
   Renamed rpc_return_to_queue() to rpc_return_to_outqueue() to be more
   explicit; rpc_return_to_outqueue() also adds the pdu after outqueue.head
   and not at the end. This ensures that all callers can safely use it to
   return a pdu to outqueue for retransmit, without worrying about half-sent
   pdus.

Additionally added useful asserts, without which it's very risky to run blind.
…_in_requeuing_pdus

Few fixes for correctly updating the pdu queues.
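
An illustrative sketch of the two fixes above, assuming outqueue is a singly linked list with head and tail pointers; names mirror the commit text, not the verbatim libnfs code.

    struct rpc_pdu {
        struct rpc_pdu *next;
        /* ... */
    };

    struct rpc_queue {
        struct rpc_pdu *head;
        struct rpc_pdu *tail;
    };

    /* Fix 1: removing a pdu must also fix up q->tail, otherwise pdus
     * silently drop off the end of outqueue and their callbacks never
     * fire. */
    static void rpc_remove_pdu_from_queue(struct rpc_queue *q,
                                          struct rpc_pdu *pdu)
    {
        struct rpc_pdu *prev = NULL, *cur = q->head;

        while (cur != NULL && cur != pdu) {
            prev = cur;
            cur = cur->next;
        }
        if (cur == NULL)
            return;                 /* pdu was not on this queue */
        if (prev == NULL)
            q->head = cur->next;
        else
            prev->next = cur->next;
        if (q->tail == cur)         /* the missing update */
            q->tail = prev;
        cur->next = NULL;
    }

    /* Fix 2: a returned pdu goes *after* outqueue.head, since head may be
     * a half-written pdu that must finish before anything else is sent. */
    static void rpc_return_to_outqueue(struct rpc_queue *q,
                                       struct rpc_pdu *pdu)
    {
        if (q->head == NULL) {
            q->head = q->tail = pdu;
            pdu->next = NULL;
            return;
        }
        pdu->next = q->head->next;
        q->head->next = pdu;
        if (q->tail == q->head)
            q->tail = pdu;
    }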
…reconnect)

Requests that time out in outqueue have not been sent to the server, and
hence do not signify any issue with the server. They may indicate a server
issue indirectly, since requests ahead of them did not get responses, but
any corrective action is taken on behalf of the requests that were actually
sent to the server.
Also added a new stats member, num_timedout_in_outqueue, to convey this to
the application.
…int_server_not_responding_for_requests_in_outqueue

Exclude requests timing out in outqueue from causing major recovery (…
…oid_checks

Added paranoid checks to catch irregularities in pdu state/queue main…
…t_major_timeout_less_than_timeout

When retrans is not set, do not set major_timeout less than timeout.
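
The rule boils down to a clamp along these lines; field names are illustrative, not the exact libnfs code.

    static void clamp_major_timeout(int retrans, int timeout,
                                    int *major_timeout)
    {
        /* Without retrans, a major timeout shorter than the regular
         * timeout makes no sense, so raise it to timeout. */
        if (!retrans && *major_timeout < timeout)
            *major_timeout = timeout;
    }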
…ore_iovec_for_write_length

One iovec was left out of the write-length accounting in rpc_nfs3_writev_task().
Currently we go ahead and try to read the data, which causes the read to
fail and initiates a reconnect. The caller never gets a response for such
failed READ pdus.
…ure_in_zero_copy_reads

Handle failed NFS response for zero copy reads
This is to avoid a huge number of WRITE (and READ) PDUs hijacking a connection,
causing commands like ls/stat/find/chmod to appear to hang for a long time,
since WRITEs can take really long to be sent out due to the TCP window getting
full (if the server is unable to process them fast enough).

POSIX has compliance requirements only for completed requests. Since we are
only reordering ongoing requests (not yet completed), it should not cause any
violation. Moreover, I cannot think of any application that would depend on
ordering guarantees between parallel requests.

Let's monitor it, and if it causes any issues, revisit on a case-by-case basis.
…io_requests_to_head_of_outqueue

Add non-IO RPC PDUs to head of outqueue
… outqueue.

We add one more variable, tailp, to rpc_queue. This tracks the last
high-priority pdu, and allows us to maintain high-priority and low-priority
pdus in a single outqueue while maintaining order between pdus of the same
priority.
All requests except WRITEs are treated as high priority, as they don't take
much space in the socket queue and can be dispatched without much delay.
Dispatching them fast lets the server process them sooner; overall we save
time, and it also results in a better user experience. Imagine queueing READ
requests behind WRITEs: the READs get delayed because the WRITEs cannot be
sent while the server's TCP window is full. OTOH, if we send the READs with
priority, they can be serviced by the server while the WRITEs are still
waiting to be sent, improving the overall number of requests we can serve
per unit time.
…_prio_pdus_to_tailp_ahead_of_low_prio_pdus

Added explicit APIs for adding pdus to low priority and high priority…
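
A sketch of the two insert paths, assuming tailp points at the last high-priority pdu in outqueue (NULL when there is none). Names follow the commit text rather than the final libnfs API, and the half-written-head caveat from the earlier commit is omitted for brevity.

    struct rpc_pdu {
        struct rpc_pdu *next;
        /* ... */
    };

    struct rpc_queue {
        struct rpc_pdu *head;
        struct rpc_pdu *tail;    /* last pdu overall */
        struct rpc_pdu *tailp;   /* last high-priority pdu, NULL if none */
    };

    static void rpc_enqueue_high_prio(struct rpc_queue *q,
                                      struct rpc_pdu *pdu)
    {
        if (q->tailp == NULL) {
            /* No high-prio pdu queued yet: go ahead of all low-prio
             * pdus. */
            pdu->next = q->head;
            q->head = pdu;
        } else {
            /* Behind the last high-prio pdu, preserving order among
             * high-prio pdus. */
            pdu->next = q->tailp->next;
            q->tailp->next = pdu;
        }
        q->tailp = pdu;
        if (pdu->next == NULL)
            q->tail = pdu;
    }

    static void rpc_enqueue_low_prio(struct rpc_queue *q,
                                     struct rpc_pdu *pdu)
    {
        /* Low-prio (WRITE) pdus keep going to the plain tail. */
        pdu->next = NULL;
        if (q->tail != NULL)
            q->tail->next = pdu;
        else
            q->head = pdu;
        q->tail = pdu;
    }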
After adding iov_ref I forgot to update rpc_nfs4_readv_task().
Fixing that.
…_rpc_nfs4_readv_task

Add missing initialization in rpc_nfs4_readv_task()
…ervice_thread_start_allows_stack_size_to_be_set

Introduce nfs_mt_service_thread_start_ss() for allowing stack size to…
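
The title is truncated and the new function's signature isn't shown, but with pthreads a stack-size-aware thread start boils down to something like this sketch; the stack_size parameter is an assumption.

    #include <stddef.h>
    #include <pthread.h>

    static int start_service_thread(pthread_t *tid, size_t stack_size,
                                    void *(*fn)(void *), void *arg)
    {
        pthread_attr_t attr;
        int rc;

        pthread_attr_init(&attr);
        /* 0 means "keep the platform default"; otherwise the value must
         * be at least PTHREAD_STACK_MIN. */
        if (stack_size != 0)
            pthread_attr_setstacksize(&attr, stack_size);
        rc = pthread_create(tid, &attr, fn, arg);
        pthread_attr_destroy(&attr);
        return rc;
    }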
…is defined

On platforms that support the atomic_int type (which is what we care about),
define multithreading_enabled as atomic.
This prevents a TSAN warning in nfs_mt_service_thread()/nfs_mt_service_thread_start_ss(),
where a thread sets multithreading_enabled while the main thread waits for it
to be set.
linuxsmiths and others added 23 commits October 11, 2024 17:35
…tithreading_enabled_atomic

Define multithreading_enabled as atomic_int type if HAVE_STDATOMIC_H …
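
The pattern, roughly (illustrative sketch): with C11 _Atomic types, plain assignments and reads are seq_cst atomic operations, which is what turns the flag handoff into a proper synchronization instead of a data race flagged by TSAN.

    #ifdef HAVE_STDATOMIC_H
    #include <stdatomic.h>
    static atomic_int multithreading_enabled;
    #else
    static int multithreading_enabled;   /* best effort on old platforms */
    #endif

    static void service_thread_announce(void)
    {
        /* Plain assignment is a seq_cst atomic store for atomic_int. */
        multithreading_enabled = 1;
    }

    static void main_thread_wait_for_service_thread(void)
    {
        /* Spin until the service thread has started. */
        while (!multithreading_enabled)
            ;
    }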
Don't access rpc->outqueue outside the lock.
…to_rpc_context

Personal/linuxsmiths/add tid to rpc context
…du for writing.

We don't need to depend on the poll() timeout to notice a to-be-written pdu.
…tfd_for_notifying_pollout

Personal/linuxsmiths/use eventfd for notifying pollout
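
The shape of the mechanism, as a sketch (fd names are illustrative, not the libnfs code): the queueing side writes to an eventfd created once with eventfd(0, EFD_CLOEXEC | EFD_NONBLOCK), and the service thread polls it alongside the NFS socket so it wakes immediately instead of waiting out the poll() timeout.

    #include <sys/eventfd.h>
    #include <poll.h>
    #include <stdint.h>
    #include <unistd.h>

    /* Writer side: called whenever a pdu is queued for writing. */
    static void notify_pollout(int efd)
    {
        uint64_t one = 1;
        (void)write(efd, &one, sizeof(one));  /* bump the eventfd counter */
    }

    /* Service thread: poll the NFS socket and the eventfd together. */
    static void service_loop_iteration(int nfs_fd, int efd)
    {
        struct pollfd pfd[2] = {
            { .fd = nfs_fd, .events = POLLIN | POLLOUT },
            { .fd = efd,    .events = POLLIN },
        };

        if (poll(pfd, 2, -1) <= 0)
            return;
        if (pfd[1].revents & POLLIN) {
            uint64_t drained;
            (void)read(efd, &drained, sizeof(drained)); /* reset counter */
            /* ...re-check outqueue and write pending pdus... */
        }
        /* ...handle pfd[0].revents for the NFS socket... */
    }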
…to an empty outqueue/

This takes care of properly notifying service thread even when pdus are requeued.
* Adding azauth RPC

* Updated

* Updated

* Updated

* Updated

* Update nfs.x

* Update libnfs-raw-nfs.h

* Update libnfs-raw-nfs.c

* Update nfs.c

* Update libnfs-raw-nfs.h

* Update libnfs-raw-nfs.c

---------

Co-authored-by: pragyagandhi <pragyagandhi@microsoft.com>
* Added azauth changes

* Should not push

* Updated

* Updated

* Updated

* Audit fixes

* More audit changes.

* Remove AZAUTH RPC from outqueue on reconnect, to avoid dup AZAUTH RPCs sent to server.

* more audit changes

* review changes

* review

* Review changes

* Updated

* Updated

* Temp commit to fix some known issues.

* better comments and logs

* Added some more useful asserts and logs/comments and final audit.

* Added auth_token_cb_res to have auth_data

* Removed setters for auth_cb_res

* review changes

---------

Co-authored-by: Nagendra Tomar <natomar@microsoft.com>
* Changed clientid to 16 bytes and auth_context

* Addressed comments

* Update

* Update
getaddrinfo() can fail with a temporary error when name-resolution calls are
made too many times too fast. Retry with a wait.
Also improved error logging with strerror() for the case of socket open
failure.

Co-authored-by: Nagendra Tomar <natomar@microsoft.com>
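
A sketch of the retry, using EAI_AGAIN as the temporary name-resolution failure; the retry count and delay below are illustrative choices, not the PR's exact values.

    #include <netdb.h>
    #include <unistd.h>

    static int resolve_with_retry(const char *host, const char *port,
                                  const struct addrinfo *hints,
                                  struct addrinfo **res)
    {
        int rc, attempts = 5;

        do {
            rc = getaddrinfo(host, port, hints, res);
            if (rc != EAI_AGAIN)        /* success or a permanent error */
                break;
            usleep(100 * 1000);         /* 100 ms between attempts */
        } while (--attempts > 0);
        return rc;
    }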
…nstead of looping (#27)

Looping is left in for paranoid builds.

Co-authored-by: Nagendra Tomar <natomar@microsoft.com>
Default behavior is the same as before, i.e., reconnect to the same address
that was resolved the very first time we connected to the server, but the
user can choose the "resolve before reconnect" behavior by calling
nfs_set_resolve_on_reconnect(). With that set, every time we have to
reconnect, we first resolve the server address and reconnect to that address
instead of the originally resolved one.
This enables support for NFS server migration, where the NFS server address
can change after mount. libnfs can simply reconnect to the new address and
start exchanging RPCs with it. There is no need to perform the mount again
with the new address.

Co-authored-by: Nagendra Tomar <natomar@microsoft.com>
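
Minimal opt-in sketch, assuming nfs_set_resolve_on_reconnect() just takes the nfs context; its exact signature is not shown in this PR.

    #include <nfsc/libnfs.h>

    int main(void)
    {
        struct nfs_context *nfs = nfs_init_context();

        /* Opt in to re-resolving the server name on every reconnect, so a
         * migrated server (new address, same name) keeps working after
         * mount. Signature assumed; not shown in this PR. */
        nfs_set_resolve_on_reconnect(nfs);

        /* ...nfs_mount() and I/O as usual... */
        nfs_destroy_context(nfs);
        return 0;
    }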
#29)

Was the PDU retransmitted, i.e., did we send it to the server more than
once? The user can use this info to relax some errors; e.g., a REMOVE request
that fails with NFS3ERR_NOENT can be treated as success if it was a
retransmitted request.
Note that most applications will handle an unlink() call succeeding for a
non-existent file better than an unlink() call failing with NOENT for a file
that was actually present.

Co-authored-by: Nagendra Tomar <natomar@microsoft.com>
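
Illustrative only: the query API's name is truncated in the commit title above, so rpc_pdu_was_retransmitted() below is a hypothetical stand-in for it.

    #include <nfsc/libnfs-raw-nfs.h>   /* NFS3ERR_NOENT */

    struct rpc_pdu;

    /* Hypothetical stand-in for the PR's retransmit-query API. */
    int rpc_pdu_was_retransmitted(struct rpc_pdu *pdu);

    static int remove_status(int nfs_status, struct rpc_pdu *pdu)
    {
        if (nfs_status == NFS3ERR_NOENT &&
            rpc_pdu_was_retransmitted(pdu)) {
            /* The first send likely removed the file; the retransmit then
             * found it gone. Treat as success. */
            return 0;
        }
        return nfs_status;
    }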
…ver (#30)

Co-authored-by: Nagendra Tomar <natomar@microsoft.com>
* Macro changes for large RPC in libnfs

* Added a comment

---------

Co-authored-by: Nagendra Tomar <natomar@microsoft.com>
…ing pollout

Looks like Windows TCP is shrinking the window (reneging)
@sahlberg
Owner

This conflicts with current master.
Can you resolve the conflict?

@sahlberg
Owner

sahlberg commented Jun 1, 2025

Also, please rebase this on top of current master.
