Observability Platform Engineer

Posted · Add Comment
Career Techniques Inc
Published
May 6, 2026
Location
Dallas, TX - Hybrid - 3 days/week in-office
Category
 
Job Type

Description

You'll help design and scale observability platforms that handle telemetry from industry-leading GPU clusters and large-scale distributed systems. You'll work closely with experienced engineers to develop metrics pipelines, logging systems and tracing solutions that improve reliability and visibility across our services

Must Haves:

  • Experience with modern observability tools and frameworks, such as Prometheus, Grafana or OpenTelemetry (OTEL)
  • Exposure with cloud platforms, such as AWS, Azure, or Google Cloud
  • Familiarity with microservices architectures and containerized environments, such as Kubernetes and Docker
  • Interest in system reliability, performance engineering and platform-scale infrastructure
  • Good communication and collaboration skills

Nice to Haves

  • Exposure to enterprise observability platforms, such as Datadog or Dynatrace
  • Experience working with telemetry data (metrics, logs, traces) in large environments
  • Proficiency in scripting or programming languages (e.g. Python, Go)
  • Familiarity with Infrastructure-as-Code tools or deployment automation
  • Max. file size: 100 MB.
  • Please complete the math question to prove you are human.

Related Jobs

Atlassian Platform Specialist   Dallas, TX - In-office 5 days/week new
May 19, 2026
Security Platform Operations Engineer   Dallas, TX - Hybrid - 3 days/week in-office
May 5, 2026
Network Automation Engineer   Dallas, TX - Hybrid - 3 days/week in-office
May 5, 2026
Cloud Engineer   New York, NY - Hybrid - 4 days/week in-office
April 28, 2026
Technology Talent Acquisition Consultant   New York, NY - Hybrid (3 days/week in-office)
April 22, 2026