NACKs are being sent by a receiver when the sender goes idle #69
-
I have an application that sends a series of messages, goes idle for a period of time, sends another series of messages, goes idle again, and so on in that pattern. The receiving application receives all of the expected messages, but it starts to send NACKs if the idle period is more than about 30 seconds. I turned on trace and debug messages, and I can see from those that the receiver received all of the expected application messages, yet it still starts sending NACKs during the idle period. The sender seems to ignore the NACKs, possibly because it no longer has the data indicated in them; I'm not sure about that. I'm including the log files from the sending and receiving applications to see if you can determine why the NACKs are occurring. In this test run, the sender sends 50 messages and is then idle for 60 seconds, repeating that pattern 3 times (at the end of the third series, the application exits instead of going idle). In my real application, the number of messages in a series and the length of the idle period are both variable. Perhaps there is a configurable value in the API, which I don't know about, that I should be setting to avoid this NACK behavior during idle periods.

There is also another behavior I have noticed that I would like to ask about. When my sender application is idle (from the application's perspective), CMD(CC) messages are still being sent (by the thread(s) created by the NORM API), and the receiver responds with ACK(CC) messages. Even though these messages are being exchanged, the receiver gets "Remote Sender Inactive" events. The "Inactive" event seems to occur just before the CMD(CC) message; after the CMD(CC) message, the receiver gets a "Remote Sender Active" event. I'm only guessing here, but it seems as if those "Inactive" events shouldn't occur if the CMD(CC) messages are being received, so perhaps there is a timing issue. I don't have logs showing the Inactive/Active events in relation to the CMD(CC) messages, but I could reproduce that situation if it would help you check whether there is an issue in the API.

I appreciate any insight you can provide regarding these two scenarios. I haven't attached files here before; I hope I've done it correctly.
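For reference, here is a stripped-down sketch of the send/idle pattern my sender follows. This is not my actual application code; the address, port, buffer sizes, and message contents are placeholders, and the exact signatures should be checked against norm.h.

```cpp
// Simplified sketch of the bursty sender described above: write a series
// of messages to a NORM stream, flush, idle, and repeat.
#include <norm.h>
#include <cstdio>
#include <unistd.h>

int main()
{
    NormInstanceHandle instance = NormCreateInstance();
    NormSessionHandle session = NormCreateSession(instance, "224.1.2.3", 6003, 1);

    // 1 MB tx buffer, 1400-byte segments, 16 data + 4 parity segments per block
    NormStartSender(session, 1, 1024 * 1024, 1400, 16, 4);
    NormObjectHandle stream = NormStreamOpen(session, 1024 * 1024);

    for (int series = 0; series < 3; series++)
    {
        for (int i = 0; i < 50; i++)
        {
            char msg[64];
            int len = snprintf(msg, sizeof(msg), "series %d message %d", series, i);
            // Real code should check the return value in case the stream buffer is full
            NormStreamWrite(stream, msg, (unsigned int)len);
            NormStreamMarkEom(stream);  // mark an application message boundary
        }
        NormStreamFlush(stream, true, NORM_FLUSH_ACTIVE);
        if (series < 2) sleep(60);      // idle period between series
    }

    NormStreamClose(stream, true);      // graceful close
    NormStopSender(session);
    NormDestroySession(session);
    NormDestroyInstance(instance);
    return 0;
}
```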
-
I was able to look at the log files you posted. I think the two things you mention here may be related. By default, a receiver will time out an "inactive" sender based on measured RTT, etc. When the sender has no data to send, it reduces the rate of its NORM_CMD(CC) probes (used for measuring RTT). So I think there could be a sort of race condition here where the receiver declares a sender "inactive" in a shorter time frame than the interval at which those NORM_CMD(CC) probes are sent. That is somewhat benign (but annoying) by itself.

Another default behavior is that the receiver will drop buffering state for a sender that has gone inactive. The idea is that in a group with multiple senders, the receiver only allocates buffer space for the senders that actually need it. What may be happening is that when the sender becomes "active" again, the receiver sends a NACK to "re-sync" to the sender's transmission in case an outage was the cause of the sender's apparent inactivity. Do you happen to be using the NORM_SYNC_ALL or NORM_SYNC_STREAM sync policy?

I mention these as default behaviors since there may be some API usage that can adjust them. I will have to look into it since I can't recall off the top of my head, and an outline of how your sender and receiver use the API would be helpful. I think you are using NORM_OBJECT_STREAM delivery, right?
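For reference, the sync policy is something the receiver side sets up front. A minimal sketch (placeholder address, port, and buffer sizes; check norm.h in your NORM version for the exact signatures):

```cpp
// Minimal receiver-side setup showing where the sync policy is chosen.
// NORM_SYNC_CURRENT (the default) syncs to the sender's current transmit
// position; NORM_SYNC_ALL and NORM_SYNC_STREAM ask for repair of earlier
// content, which affects how the receiver NACKs when it (re)syncs.
#include <norm.h>

NormSessionHandle StartReceiver(NormInstanceHandle instance)
{
    NormSessionHandle session = NormCreateSession(instance, "224.1.2.3", 6003, 2);

    NormSetDefaultSyncPolicy(session, NORM_SYNC_CURRENT);  // or NORM_SYNC_ALL / NORM_SYNC_STREAM

    NormStartReceiver(session, 1024 * 1024);  // per-sender buffer space
    return session;
}
```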
-
I have checked in a change that I think will resolve your extraneous NACKing issue. I updated NormSenderNode::RepairCheck() to use a new "BLIND_CHECK" check level on sender inactivity timeout or reactivation; it inspects the current repair state to determine the right NACK approach. For your case, I think that will yield the desired behavior.

BTW, the reason the sender was not responding to the NACKs was that the receiver was NACKing for a portion of the stream the sender had not yet sent.

Please let me know if this resolves the issue. The "sender inactive / idle" notification at the receiver is just a cue to your application that the remote sender isn't actively sending data. In a multicast group where multiple senders may be sending data, this lets the application free up memory used for senders that aren't actively sending content (i.e., via the NormNodeFreeBuffers() call that can be made at that point).
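For illustration, handling those notifications in the receiver's event loop can be as simple as something like the sketch below. Whether to call NormNodeFreeBuffers() on inactivity is an application choice; for a single-sender case like yours it is fine to just ignore the event.

```cpp
// Sketch of a receiver event loop reacting to the inactive/active cues.
// Freeing buffers on inactivity is optional and mainly useful when many
// senders share the group; otherwise the events can simply be logged.
#include <norm.h>
#include <cstdio>

void RunReceiverLoop(NormInstanceHandle instance)
{
    NormEvent event;
    while (NormGetNextEvent(instance, &event))
    {
        switch (event.type)
        {
            case NORM_REMOTE_SENDER_INACTIVE:
                fprintf(stderr, "sender %u inactive\n", (unsigned int)NormNodeGetId(event.sender));
                // NormNodeFreeBuffers(event.sender);  // optional: reclaim buffer space
                break;
            case NORM_REMOTE_SENDER_ACTIVE:
                fprintf(stderr, "sender %u active again\n", (unsigned int)NormNodeGetId(event.sender));
                break;
            case NORM_RX_OBJECT_UPDATED:
                // NormStreamRead(event.object, ...) to pull received stream data
                break;
            default:
                break;
        }
    }
}
```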