Run Ingest Delays in LangSmith
Resolved·Degraded performance

A spike in the number of incoming complex trace payloads between May 1 at 2350 UTC and May 2 at 0030 UTC caused some of our ingest workers to be become CPU starved temporarily which resulted in periods of high latency for some LangSmith tenants. We have adjusted our scaling parameters to reduce the duration of latency in the future and will be implementing optimizations to relieve the CPU starvation in an upcoming release.

Thu, May 2, 2024, 02:56 AM
(1 year ago)
·
Affected components
LangSmith Run Ingestion
Updates

Resolved

A spike in the number of incoming complex trace payloads between May 1 at 2350 UTC and May 2 at 0030 UTC caused some of our ingest workers to be become CPU starved temporarily which resulted in periods of high latency for some LangSmith tenants. We have adjusted our scaling parameters to reduce the duration of latency in the future and will be implementing optimizations to relieve the CPU starvation in an upcoming release.

Thu, May 2, 2024, 02:56 AM

Identified

A spike in the number of incoming complex trace payloads caused some of our ingest workers to be become CPU starved temporarily which resulted in some periods of high latency. This effect was temporary and we are adjusting our scaling parameters to prevent a recurrence.

Thu, May 2, 2024, 01:37 AM(1 hour earlier)

Investigating

We are investigating latency in run ingests with delays of more than 2 minutes for a subset of runs to appear in the LangSmith UI.

Thu, May 2, 2024, 12:03 AM(1 hour earlier)
Powered by

Looking for the EU status page? Find it here: https://eu.status.smith.langchain.com