Instrumentation for LangChain4j's OpenAI Chat Model by realark · Pull Request #24 · braintrustdata/braintrust-sdk-java

Andrew Kent (realark) · 2025-12-29T03:40:04Z

No description provided.

Andrew Kent (realark) · 2025-12-29T18:54:19Z

src/test/java/dev/braintrust/instrumentation/langchain/BraintrustLangchainTest.java

+    @SneakyThrows
+    void testSyncChatCompletion() {
+        // Mock the OpenAI API response
+        wireMock.stubFor(


TODO: my next quality-of-life change will be to replace the stubs with VCR-like record-and-replay (WireMock apparently supports this), sometime in the next week or two.

David Elner (delner)

Some questions but nothing blocking!

David Elner (delner) · 2025-12-30T02:17:45Z

src/main/java/dev/braintrust/instrumentation/langchain/BraintrustLangchain.java

+    public record Options(String providerName) {}
+
+    @SuppressWarnings("unchecked")
+    private static <T> T getPrivateField(Object obj, String fieldName)


This question is just for my own education: it looks like we're using reflection to access the private fields in order to instrument them, correct? What are the performance/stability risks associated with reflection? Are there other practical alternatives for instrumentation?

In the Ruby world, we generally would avoid accessing private fields because of the potential for instability (e.g. someone in a patch version changes the API.)

Yes, that's right, we're using reflection. There isn't much risk in this case because we'll just fail to apply instrumentation if something goes wrong.

Performance is pretty good with reflection, but even if it weren't, this is only done once during client build.

There isn't a viable alternative right now, but once we get into auto-instrumentation for Java we'll have more options.
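For readers following along, here is a minimal sketch of the fail-soft reflection pattern described above. It is not the code from this PR: only the `getPrivateField(Object, String)` signature comes from the diff, while `ReflectionSketch`, `tryGetPrivateField`, and `FakeClient` are hypothetical names made up for illustration.

```java
import java.lang.reflect.Field;
import java.util.Optional;

final class ReflectionSketch {
    private ReflectionSketch() {}

    /** Reads a private field by name, walking up the class hierarchy. */
    @SuppressWarnings("unchecked")
    static <T> T getPrivateField(Object obj, String fieldName)
            throws ReflectiveOperationException {
        for (Class<?> c = obj.getClass(); c != null; c = c.getSuperclass()) {
            try {
                Field field = c.getDeclaredField(fieldName);
                field.setAccessible(true); // may throw if the module system forbids access
                return (T) field.get(obj);
            } catch (NoSuchFieldException ignored) {
                // Not declared on this class; try the superclass next.
            }
        }
        throw new NoSuchFieldException(fieldName);
    }

    /**
     * Fail-soft variant (hypothetical helper): if the field was renamed or is
     * inaccessible, return empty so the caller can skip instrumentation
     * instead of breaking the application.
     */
    static <T> Optional<T> tryGetPrivateField(Object obj, String fieldName) {
        try {
            return Optional.ofNullable(ReflectionSketch.<T>getPrivateField(obj, fieldName));
        } catch (ReflectiveOperationException | RuntimeException e) {
            return Optional.empty();
        }
    }

    /** Hypothetical stand-in for a third-party client with a private field. */
    static final class FakeClient {
        private final String baseUrl = "https://example.invalid";
    }

    public static void main(String[] args) {
        FakeClient client = new FakeClient();
        Optional<String> url = tryGetPrivateField(client, "baseUrl");
        System.out.println(url.orElse("<not instrumented>"));

        // Simulates a library upgrade that renamed the field.
        Optional<String> missing = tryGetPrivateField(client, "renamedField");
        System.out.println(missing.isPresent());
    }
}
```

The lookup happens once, when the client is wrapped, so its cost is not paid per request. A renamed field only means the wrapper is not applied.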

David Elner (delner) · 2025-12-30T02:19:34Z

src/main/java/dev/braintrust/instrumentation/langchain/WrappedHttpClient.java

+        Span span = startNewSpan(getSpanName(providerInfo));
+        try (Scope scope = span.makeCurrent()) {
+            tagSpan(span, request, providerInfo);
+            final long startTime = System.nanoTime();


What is nanoTime? Is it wall clock time or is it something else?

It measures elapsed time rather than wall clock time. It's a monotonically increasing nanosecond counter from an arbitrary starting point, so it's only meaningful for computing durations.
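A small sketch of the distinction, using only the JDK. `NanoTimeSketch` and `timeIt` are hypothetical names, not part of this PR: `System.nanoTime()` is subtracted from itself to get a duration, while `Instant.now()` is what you would use for an actual timestamp.

```java
import java.time.Duration;
import java.time.Instant;

final class NanoTimeSketch {
    private NanoTimeSketch() {}

    /** Measures how long the given work takes using the monotonic clock. */
    static Duration timeIt(Runnable work) {
        final long start = System.nanoTime(); // arbitrary origin, not an epoch time
        work.run();
        return Duration.ofNanos(System.nanoTime() - start);
    }

    public static void main(String[] args) {
        Instant startedAt = Instant.now(); // wall clock timestamp
        Duration elapsed = timeIt(() -> {
            try {
                Thread.sleep(20);
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            }
        });
        System.out.println("started at " + startedAt + ", took " + elapsed.toMillis() + " ms");
    }
}
```

Because the counter is unaffected by system clock adjustments, the difference between two readings stays valid even if the wall clock is changed during the request.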

David Elner (delner) · 2025-12-30T02:22:11Z

src/test/java/dev/braintrust/TestHarness.java

+                                        + " after %d attempts",
+                                minSpanCount, spans.size(), attempts));
+            }
+            Thread.sleep(1000);


Why do you need to wait & sleep? To read off the OTel thread? Is there a faster, more direct way to do this synchronously in the test suite?

Usually waiting isn't needed, but some of the streaming tests finish their spans after this method is invoked for the first time.

I feel like there should be a better way to do this. The catch is that I'm using the built-in OTel utils to collect spans, so I'm not sure what hooks I'd have for adding concurrency signaling.

I'm making some other changes to the test harness in another branch. I'll add this to that work. At the very least I can dial down the sleep time (10ms should be plenty)
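One possible shape for the shorter-sleep version, as a sketch only. `AwaitSketch` and `awaitAtLeast` are hypothetical names, and the `Supplier` is a stand-in for whatever span collector the harness actually uses. It polls every few milliseconds against a deadline instead of sleeping a full second per attempt.

```java
import java.time.Duration;
import java.util.List;
import java.util.function.Supplier;

final class AwaitSketch {
    private AwaitSketch() {}

    /**
     * Polls the source until it holds at least minCount items, or fails once
     * the timeout has elapsed.
     */
    static <T> List<T> awaitAtLeast(
            Supplier<List<T>> source, int minCount, Duration timeout, Duration pollInterval) {
        final long deadline = System.nanoTime() + timeout.toNanos();
        while (true) {
            List<T> items = source.get();
            if (items.size() >= minCount) {
                return items;
            }
            // Overflow-safe comparison of nanoTime readings.
            if (System.nanoTime() - deadline >= 0) {
                throw new AssertionError(
                        String.format(
                                "expected at least %d items but found %d after %d ms",
                                minCount, items.size(), timeout.toMillis()));
            }
            try {
                Thread.sleep(pollInterval.toMillis());
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
                throw new AssertionError("interrupted while waiting for items", e);
            }
        }
    }
}
```

With a 10ms poll interval, tests whose spans are already finished return on the first check, and streaming tests wait only slightly longer than the spans take to finish.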

Andrew Kent (realark) added the enhancement New feature or request label Dec 29, 2025

Andrew Kent (realark) force-pushed the ark/langchain4j-instrumentation branch 2 times, most recently from 8db59a5 to 8c94b9c Compare December 29, 2025 18:03

Andrew Kent (realark) marked this pull request as ready for review December 29, 2025 18:34

langchain4j openai instrumentation

f09d73b

Andrew Kent (realark) force-pushed the ark/langchain4j-instrumentation branch from 8c94b9c to f09d73b Compare December 29, 2025 18:51

Andrew Kent (realark) commented Dec 29, 2025


Andrew Kent (realark) requested review from Matt Perpick (clutchski) and David Elner (delner) December 29, 2025 19:02

David Elner (delner) approved these changes Dec 30, 2025


Andrew Kent (realark) merged commit 1b58fef into main Dec 30, 2025
1 check passed

Andrew Kent (realark) deleted the ark/langchain4j-instrumentation branch December 30, 2025 09:30
