feat(cost-tracking): trace all usage keys #1021

hassiebp · 2024-12-04T10:58:24Z

update fern
add token detail extraction
update openai integration

Important

Enhances cost-tracking by introducing detailed usage and cost tracking fields, updating OpenAI integration, and improving type safety with enum usage.

Behavior:
- Introduces usage_details and cost_details fields in Observation, ObservationsView, and Usage classes for detailed usage and cost tracking.
- Deprecates old usage field in favor of usage_details and cost_details.
- Updates OpenAI integration to handle new usage and cost tracking.
Models:
- Adds OpenAiUsageSchema and UsageDetails for detailed usage tracking.
- Updates MediaContentType to use an enum for better type safety.
Misc:
- Increases comment content limit from 500 to 3000 characters in create_comment_request.py.
- Updates langfuse/client.py and langfuse/decorators/langfuse_decorator.py to support new usage tracking.
- Updates callback handlers in langfuse/callback/langchain.py and langfuse/llama_index/_event_handler.py to utilize new usage details.

^{This description was created by}^{for 54fac8e. It will automatically update as commits are pushed.}

greptile-apps

Disclaimer: Experimental PR review

PR Summary

This PR enhances cost tracking capabilities across the Langfuse Python SDK by introducing more granular usage and cost monitoring features.

Introduces new usage_details and cost_details fields in key classes while deprecating older usage field for more detailed token and cost tracking
Adds OpenAiUsageSchema for structured token usage tracking with prompt/completion token details
Increases comment content character limit from 500 to 3000 in CreateCommentRequest
Updates LangChain and LlamaIndex handlers to support granular token usage tracking from various LLM providers
Maintains backward compatibility for V2 self-hosters while introducing new cost tracking features

_{20 file(s) reviewed, 1 comment(s)}
_{Edit PR Review Bot Settings | Greptile}

greptile-apps · 2024-12-13T08:50:58Z

langfuse/api/resources/media/types/media_content_type.py

+    APPLICATION_XML = "application/xml"
+    APPLICATION_OCTET_STREAM = "application/octet-stream"
+
+    def visit(


logic: visit() method lacks exhaustive check or default case. Consider adding a default handler or raising NotImplementedError for unhandled cases to prevent silent failures if new enum values are added.

greptile-apps

Disclaimer: Experimental PR review

PR Summary

(updates since last review)

This PR continues to enhance the cost tracking implementation with additional changes focused on the OpenAI integration and usage tracking. Here's a summary of the latest changes:

Added new OpenAiUsageSchema class in /ingestion/types/open_ai_usage_schema.py with detailed token tracking fields for prompt and completion tokens
Updated OpenAI integration in openai.py to handle both old and new usage formats for backward compatibility
Enhanced error handling in OpenAI integration to include zero-value cost details for error cases
Added proper type safety with enum usage for MediaContentType instead of string literals
Improved token parsing in LangChain handler to support various LLM provider formats (Anthropic, Bedrock, Vertex AI, IBM)

_{20 file(s) reviewed, 2 comment(s)}
_{Edit PR Review Bot Settings | Greptile}

greptile-apps · 2024-12-16T16:08:28Z

langfuse/api/resources/ingestion/types/open_ai_usage_schema.py

+class OpenAiUsageSchema(pydantic_v1.BaseModel):
+    prompt_tokens: int
+    completion_tokens: int
+    total_tokens: int


logic: total_tokens should validate that it equals prompt_tokens + completion_tokens

greptile-apps · 2024-12-16T16:09:49Z

langfuse/llama_index/_event_handler.py

        self._get_generation_client(event.span_id).update(
-            usage=usage, end_time=_get_timestamp()
+            usage=usage, usage_details=usage, end_time=_get_timestamp()
        )


logic: passing same 'usage' value to both usage and usage_details fields may cause issues with backward compatibility since usage field is deprecated

hassiebp added 4 commits December 3, 2024 13:57

update fern

5dcbfbd

add token detail extraction

a26c2e1

update openai integration

04fc053

backward compat

42b7fe3

hassiebp linked an issue Dec 9, 2024 that may be closed by this pull request

bug: Missing cached token pricing column from models langfuse/langfuse#4590

Closed

hassiebp added 6 commits December 10, 2024 13:49

update fern

808d05a

add langchain support

71a2b37

add llama support

652fd2e

add decorator support

20e86f9

Merge branch 'main' into v3-cost-tracking

fba725c

remove openai parsing logic

54fac8e

hassiebp marked this pull request as ready for review December 13, 2024 08:47

greptile-apps bot reviewed Dec 13, 2024

View reviewed changes

Merge branch 'main' into v3-cost-tracking

1c9e165

greptile-apps bot reviewed Dec 16, 2024

View reviewed changes

hassiebp merged commit cf9ec3e into main Dec 16, 2024
10 of 11 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(cost-tracking): trace all usage keys #1021

feat(cost-tracking): trace all usage keys #1021

hassiebp commented Dec 4, 2024 •

edited by ellipsis-dev bot

Loading

greptile-apps bot left a comment

greptile-apps bot Dec 13, 2024

greptile-apps bot left a comment

greptile-apps bot Dec 16, 2024

greptile-apps bot Dec 16, 2024

feat(cost-tracking): trace all usage keys #1021

feat(cost-tracking): trace all usage keys #1021

Conversation

hassiebp commented Dec 4, 2024 • edited by ellipsis-dev bot Loading

greptile-apps bot left a comment

Choose a reason for hiding this comment

Disclaimer: Experimental PR review

PR Summary

greptile-apps bot Dec 13, 2024

Choose a reason for hiding this comment

greptile-apps bot left a comment

Choose a reason for hiding this comment

Disclaimer: Experimental PR review

PR Summary

greptile-apps bot Dec 16, 2024

Choose a reason for hiding this comment

greptile-apps bot Dec 16, 2024

Choose a reason for hiding this comment

hassiebp commented Dec 4, 2024 •

edited by ellipsis-dev bot

Loading