Issue 4 wait 5 sec #40
base: main
Conversation
```
@@ -56,6 +55,11 @@ async def text_handler(update: Update, context: ContextTypes.DEFAULT_TYPE, con:
        (chat_id, user_id, text),
    )
    con.commit()


async def generate_response(update: Update, con: psycopg2.connect):
```
Violation of the interface segregation principle: each component in a program needs access only to the data it needs, and no more. Here, replace `update` with `chat_id`.
An architecturally cleaner solution is to split the function into several smaller ones; we can talk about that later on.
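For illustration, a minimal sketch of the narrowed signature. The table and column names are my guesses, and note that the annotation should be `psycopg2.extensions.connection` rather than `psycopg2.connect` (the latter is the factory function, not the connection type):

```python
import psycopg2

async def generate_response(chat_id: int, con: psycopg2.extensions.connection) -> str:
    # The function receives only the chat_id it actually needs,
    # not the whole Update object.
    with con.cursor() as cur:
        cur.execute(
            "SELECT user_id, text FROM messages WHERE chat_id = %s ORDER BY id",
            (chat_id,),
        )
        history = cur.fetchall()
    # ... build the prompt from `history` and call the LLM here ...
    return "reply"  # placeholder for the actual completion
```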
```
    await store_message(update, con)

    if not delaying.locked():
        _ = asyncio.create_task(delay_then_response(update, con))  # run the delay in background
```
Remember I mentioned that this feature will be tricky to get right?

You seem to have a dangerous race condition here that will cause some messages to be reordered and/or temporarily ignored. Consider the following sequence of events:

1. A message arrives. The delay-then-respond task is started.
2. The delay expires. The LLM is invoked and is being handled in the background.
3. While the LLM call is running, another message arrives (recall that you're running the completion in another task, so the main task can still keep running). The new message is added to the database, but no new task is started because the semaphore (your lock) is taken.
4. Once the LLM call is finished, you append the response to the database.
   ⚠️ ☠️ BUG: the messages are now reordered in the database! The LLM completion comes after a user message that was not used to generate it.
5. You send the response and finish the delay-then-respond task.
   ⚠️ ☠️ BUG: the user's latest message is left without a reply! It is stored in the database, but no task is running to generate a response.

Speaking from experience, the likelihood of race conditions increases quickly as you add new states to the program. Here, the culprit is the semaphore you added. Think of a solution that does not require additional state besides what you already have in the database. Any such solution will be much more robust than what you currently have.
One way to approach it is to always start a new task when a message arrives. The task then waits, and before generating a response it checks whether the last message in the chat comes from the LLM; if so, it exits without doing anything, as that means another task has already taken care of the answer. When inserting the LLM-generated response into the database, you also have to check that no reordering is happening. There are several ways to do that (a sketch of the simplest follows the list):

1. Introduce a new column that specifies the ID of the message the current message is a response to, null by default. Say you call it `response_to_id`; then every message coming from the LLM records the latest user message considered during response generation. You will need to modify the message-fetching query slightly to order by `response_to_id` as well.
2. Order by timestamp, ensuring that the LLM message has the same timestamp as the latest message considered during response generation. This is fragile.
3. The simplest: just discard the LLM response if new messages have appeared, and let the next task generate it.
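Here is a minimal sketch of that approach, using option 3 to guard against reordering. The schema details (a `messages` table with a monotonically increasing `id` and a `sender` column set to 'llm' for bot messages) are my assumptions, and `generate_response` is the `chat_id`-based version sketched above:

```python
import asyncio

RESPONSE_DELAY = 5  # seconds, per this PR

def last_message(chat_id, con):
    # (id, sender) of the newest message in the chat, or None.
    with con.cursor() as cur:
        cur.execute(
            "SELECT id, sender FROM messages"
            " WHERE chat_id = %s ORDER BY id DESC LIMIT 1",
            (chat_id,),
        )
        return cur.fetchone()

async def delay_then_response(chat_id, con):
    # Started unconditionally for every incoming message -- no lock.
    await asyncio.sleep(RESPONSE_DELAY)
    last = last_message(chat_id, con)
    if last is None or last[1] == "llm":
        return  # another task already answered this burst of messages
    anchor_id = last[0]  # the newest message the completion is based on
    response = await generate_response(chat_id, con)
    # Option 3: if new messages arrived while the LLM was running,
    # discard this completion; the task spawned by the newest message
    # will answer instead. There is no await between this check and the
    # INSERT, so no other task in the event loop can interleave here.
    current = last_message(chat_id, con)
    if current is None or current[0] != anchor_id:
        return
    with con.cursor() as cur:
        cur.execute(
            "INSERT INTO messages (chat_id, sender, text)"
            " VALUES (%s, 'llm', %s)",
            (chat_id, response),
        )
    con.commit()
    # ... send `response` to the chat here ...
```

Note that the only state this version relies on is what is already in the database, which is exactly the property we want.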
There are other solutions as well. You might notice that what I just outlined uses the database as a sort of queue. It may be beneficial to make the queuing aspect explicit in your design: send freshly received messages into a delayed queue specific to a given chat, which is read by a worker dedicated to that chat. The messages are then used to generate responses, and the results are pushed into the database only when the corresponding LLM completions are ready.
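A rough sketch of that design, with every name (`queues`, `workers`, `enqueue_message`, `chat_worker`) made up for illustration; the point is that each chat has a single writer, so ordering is guaranteed by construction:

```python
import asyncio
from collections import defaultdict

# One queue and one worker per chat.
queues: dict[int, asyncio.Queue] = defaultdict(asyncio.Queue)
workers: dict[int, asyncio.Task] = {}

async def enqueue_message(chat_id: int, text: str) -> None:
    # Called from the handler instead of spawning delay tasks directly.
    await queues[chat_id].put(text)
    if chat_id not in workers:
        workers[chat_id] = asyncio.create_task(chat_worker(chat_id))

async def chat_worker(chat_id: int) -> None:
    queue = queues[chat_id]
    while True:
        batch = [await queue.get()]           # block until a message arrives
        await asyncio.sleep(5)                # the delay from this PR
        while not queue.empty():              # absorb the rest of the burst
            batch.append(queue.get_nowait())  # that arrived during the delay
        # Generate the completion from the stored history plus `batch`,
        # then write the batch and the response to the database in order,
        # and send the reply. Only this worker writes for this chat, so
        # no reordering is possible.
        ...
```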
Perhaps it is worth visualizing this on the whiteboard.
closes #4