fix(llmobs): avoid raising errors during llmobs integration span processing #10713

Yun-Kim · 2024-09-19T00:57:46Z

This PR does 2 things:

User facing changes

captures any integration-specific _llmobs_set_tags() method errors and logs the error instead of potentially crashing the user application.

Non-user facing changes

Refactors the BaseLLMIntegration class and child classes to follow a cleaner and shared llmobs_set_tags() method, which internally try/catches an abstract method _llmobs_set_tags() instead (which is implemented by each integration). We also no longer need to check integration.is_pc_sampled_llmobs(span) since we don't currently do any sampling yet and we can handle it in the llmobs_set_tags() method if needed.
tldr: _llmobs_set_tags() is now an abstract method that needs to be implemented by all LLM integrations, and its function signature now takes in the following arguments/keyword arguments (same as llmobs_set_tags()):

span: span to annotate
args: list of args passed to the traced method
kwargs: dict of keyword args passed to the traced method. If any integration requires additional data not contained by either args/kwargs (such as the model instance in Gemini or tool_input dictionary in langchain), we can pass it into the method using the kwarg dict.
response: returned response from llm provider (streamed or non-streamed)
operation: string denoting which LLM operation it is (eg. "completion", "chat", "embedding", "chain", "retrieval")

I did some refactoring to each integration to follow this new signature, which included merging logic for how we handle streamed responses, and additional required args (i.e. model instance, tool inputs).

Previously each integration did its own thing for llmobs_set_tags() with arbitrary args/kwargs, and it was difficult to maintain. Now that we have a strict function signature, future integrations should be simpler to create, and existing integrations should be easier to maintain.

Checklist

PR author has checked that all the criteria below are met
The PR description includes an overview of the change
The PR description articulates the motivation for the change
The change includes tests OR the PR description describes a testing strategy
The PR description notes risks associated with the change, if any
Newly-added code is easy to change
The change follows the library release note guidelines
The change includes or references documentation updates if necessary
Backport labels are set (if applicable)

Reviewer Checklist

Reviewer has checked that all the criteria below are met
Title is accurate
All changes are related to the pull request's stated goal
Avoids breaking API changes
Testing strategy adequately addresses listed risks
Newly-added code is easy to change
Release note makes sense to a user of the library
If necessary, author has acknowledged and discussed the performance implications of this PR as reported in the benchmarks PR comment
Backport labels are set in a manner that is consistent with the release branch maintenance policy

github-actions · 2024-09-19T00:58:19Z

CODEOWNERS have been resolved as:

releasenotes/notes/fix-llmobs-integrations-safe-tagging-5e170868e5758510.yaml  @DataDog/apm-python
ddtrace/_trace/trace_handlers.py                                        @DataDog/apm-sdk-api-python
ddtrace/contrib/internal/anthropic/_streaming.py                        @DataDog/ml-observability
ddtrace/contrib/internal/anthropic/patch.py                             @DataDog/ml-observability
ddtrace/contrib/internal/google_generativeai/_utils.py                  @DataDog/ml-observability
ddtrace/contrib/internal/google_generativeai/patch.py                   @DataDog/ml-observability
ddtrace/contrib/internal/langchain/patch.py                             @DataDog/ml-observability
ddtrace/contrib/internal/openai/_endpoint_hooks.py                      @DataDog/ml-observability
ddtrace/contrib/internal/openai/utils.py                                @DataDog/ml-observability
ddtrace/llmobs/_integrations/anthropic.py                               @DataDog/ml-observability
ddtrace/llmobs/_integrations/base.py                                    @DataDog/ml-observability
ddtrace/llmobs/_integrations/bedrock.py                                 @DataDog/ml-observability
ddtrace/llmobs/_integrations/gemini.py                                  @DataDog/ml-observability
ddtrace/llmobs/_integrations/langchain.py                               @DataDog/ml-observability
ddtrace/llmobs/_integrations/openai.py                                  @DataDog/ml-observability
tests/llmobs/test_llmobs_integrations.py                                @DataDog/ml-observability

ddtrace/llmobs/_integrations/langchain.py