⚠️Only available to residents of Guadalajara, Mexico⚠️
Senior Engineer - Observability Platform
Location: Guadalajara, Remote.
Start date – Immediate
Customer Name – Jade/Indeed Inc.
Duration – 6 months with extension available.
Salary to offer: 23 per hour
Type of contract: IC
English required? Yes
Job Description
Primary Focus: Monitoring and troubleshooting issues within the Datadog environment, including its integration with cloud environments. Requires experience scaling solutions and ensuring the platform's observability.
SRE Skills:
Monitoring & Observability: Proficiency with tools like Datadog, Prometheus, and Grafana for tracking system performance and health. Expertise in creating and managing dashboards, configuring alerts, and handling application performance monitoring (APM).
CI/CD Pipelines: Knowledge of integrating Datadog with CI/CD pipelines.
Technical Skills:
Observability and Monitoring: Hands-on experience with Datadog's monitoring, logging
Cloud Platforms: Experience with AWS and integrating cloud services with Datadog for a unified monitoring experience.
Container & Microservices Monitoring: Expertise in monitoring containerized environments using Kubernetes, integrated with Datadog.
Scripting & Automation: Ability to automate monitoring tasks and configure Datadog via scripts using Python.
Operational Skills:
Installation & Configuration: Experience installing Datadog agents, configuring integrations, and managing API keys or tokens for secure access to the platform.
User Management: Familiarity with managing user roles, permissions, and best practices within Datadog.
Migration & Modernization Skills (Nice-to-Have):
OpenTelemetry Adoption: Experience migrating teams from proprietary tracing models (like Datadog's APM) to OpenTelemetry for distributed tracing. Ability to make the platform capable of using OpenTelemetry and guide teams through the transition.
API & Platform Migration: Experience working with teams to migrate and consolidate individual keys to a service account model to ensure security and avoid disruptions when key owners leave.