DevOps Engineer Lead
Job Title: DevOps Engineer - Lead
Job ID: 94330-1, 94329-1 & 94503-1
Ex-Capital One candidates only, C2C
Client: Capital One
Location: 15075 Capital One Drive Richmond, VA 23238 (Hybrid)
Duration: 12+ months with possibility of extension
Key Skills & Tools:
Observability Tools: Proficiency in monitoring, logging, and tracing tools, including Prometheus, Grafana, ELK Stack (Elasticsearch, Logstash, Kibana), Splunk, Datadog, New Relic, and cloud-native solutions like AWS CloudWatch.
Programming Languages: Expertise in languages such as Python and Go for scripting and automation.
Infrastructure & Cloud Platforms: Experience with cloud platforms (AWS, GCP, Azure) and container orchestration systems like Kubernetes.
Infrastructure as Code (IaC): Familiarity with Terraform and Ansible for managing infrastructure and configurations.
CI/CD & Automation: Experience with CI/CD pipelines and automation tools like Jenkins.
System & Software Engineering: A strong background in both system operations and software development.
Ability to optimize cloud agent instrumentation; cloud certifications are a plus.
Datadog Fundamentals, APM & Distributed Tracing Fundamentals, and Datadog Demo certifications (mandatory)
Strong understanding of Observability concepts (Logs, Metrics, Tracing)
Expertise in security & vulnerability management in observability
Possesses 2 years of experience in cloud-based observability solutions, specializing in monitoring, logging, and tracing across AWS, Azure, and GCP environments.
Job Description:
Design & Implement Solutions: Build and maintain comprehensive observability platforms that provide deep insights into complex systems, incorporating logs, metrics, and traces.
System Instrumentation: Instrument applications, infrastructure, and services to collect telemetry data using frameworks like OpenTelemetry.
Data Analysis & Visualization: Develop dashboards, reports, and alerts using tools like Prometheus, Grafana, and Splunk to visualize system performance and detect issues.
Collaboration: Work with development, SRE, and DevOps teams to integrate observability best practices and align monitoring with business and operational goals.
Automation: Develop scripts and use Infrastructure as Code (IaC) tools like Ansible and Terraform to automate monitoring configurations and telemetry collection.
Implement and manage full-stack observability using Datadog, ensuring seamless monitoring across infrastructure, applications, and services.
Instrument agents for on-premises, cloud, and hybrid environments to enable comprehensive monitoring.
Design and deploy key service monitoring, including dashboards, monitor creation, SLA/SLO definitions, and anomaly detection with alert notifications.
Integrate Datadog with third-party services such as ServiceNow and other ITSM tools, and configure SSO.