DevOps Engineer Lead

Tek Spikes
Richmond, VA

Job Title: DevOps Engineer - Lead

Job ID: 94330-1, 94329-1 & 94503-1

Only-EX-Capital one ,C2C

Client: Capital One

Location: 15075 Capital One Drive Richmond, VA 23238 (Hybrid)

Duration: 12+ Months with possible of extension

Key Skills & Tools:

Observability Tools: Proficiency in monitoring, logging, and tracing tools, including Prometheus, Grafana, ELK Stack (Elasticsearch, Logstash, Kibana), Splunk, Datadog, New Relic, and cloud-native solutions like AWS CloudWatch.

Programming Languages: Expertise in languages such as Python and Go for scripting and automation.

Infrastructure & Cloud Platforms: Experience with cloud platforms (AWS, GCP, Azure) and container orchestration systems like Kubernetes.

Infrastructure as Code (IaC): Familiarity with Terraform and Ansible for managing infrastructure and configurations.

CI/CD & Automation: Experience with CI/CD pipelines and automation tools like Jenkins.

System & Software Engineering: A strong background in both system operations and software development.

Optimize cloud agent instrumentation, with cloud certifications being a plus.

Datadog Fundamental, APM and Distributed Tracing Fundamentals & Datadog Demo Certification (Mandatory)

Strong understanding of Observability concepts (Logs, Metrics, Tracing)

Expertise in security & vulnerability management in observability

Possesses 2 years of experience in cloud-based observability solutions, specializing in monitoring, logging, and tracing across AWS, Azure, and GCP environments.

Job Description:

Design & Implement Solutions: Build and maintain comprehensive observability platforms that provide deep insights into complex systems, incorporating logs, metrics, and traces.

System Instrumentation: Instrument applications, infrastructure, and services to collect telemetry data using frameworks like OpenTelemetry.

Data Analysis & Visualization: Develop dashboards, reports, and alerts using tools like Prometheus, Grafana, and Splunk to visualize system performance and detect issues.

Collaboration: Work with development, SRE, and DevOps teams to integrate observability best practices and align monitoring with business and operational goals.

Automation: Develop scripts and use Infrastructure as Code (IaC) tools like Ansible and Terraform to automate monitoring configurations and telemetry collection.

Implement and manage full-stack observability using Datadog, ensuring seamless monitoring across infrastructure, applications, and services.

Instrument agents for on-premise, cloud, and hybrid environments to enable comprehensive monitoring.

Design and deploy key service monitoring, including dashboards, monitor creation, SLA/SLO definitions, and anomaly detection with alert notifications.

Configure and integrate Datadog with third-party services such as ServiceNow, SSO enablement, and other ITSM tools.

Posted 2026-01-14

Recommended Jobs

System Design Engineering Integrated Project Team (IPT) Lead

BWX Technologies
Lynchburg, VA

At BWX Technologies, Inc. (NYSE: BWXT), we are People Strong, Innovation Driven. A U.S.-based company, BWXT is a Fortune 1000 and Defense News Top 100 manufacturing and engineering innovator that pro…

View Details
Posted 2025-12-16

Azure Cloud Engineer with AI

Cloud Analytics Technologies LLC
Mechanicsville, VA

Job Details Azure Cloud Engineer with AI Resources Location: Mechanicsville, VA Hybrid Role Experience providing guidance on the implementation of Al resources in Azure, and experience imple…

View Details
Posted 2025-12-12

Production Associate / Operator

SGS Consulting
Virginia

Job Responsibilities: Moderate level of responsibility, to identify defects, make repairs using tools provided, and make decisions for what can/cannot be repaired. Responsibility to minimize CO…

View Details
Posted 2025-12-11

Systems Engineer (Test Tool Development and Automation)

KBR
Springfield, VA

Title: Systems Engineer (Test Tool Development and Automation) Belong. Connect. Grow. with KBR! KBR's National Security Solutions team provides high-end engineering and advanced technology so…

View Details
Posted 2026-01-09

Term Professor Forensic Science Open-Rank

George Mason University
Manassas, VA

Term Professor Forensic Science Open-Rank Job no: 10003132 Work type: Instructional Faculty Location: Manassas, VA, On Site Required Categories: Default Department: College of Scie…

View Details
Posted 2025-12-17

Manager, Compliance Advisory | Enterprise Services (Brand Marketing, Workplace Solutions, Americans with Disabilities)

Capital One
Richmond, VA

Overview Manager, Compliance Advisory | Enterprise Services (Brand Marketing, Workplace Solutions, Americans with Disabilities) Capital One, a Fortune 500 company and one of the nation’s top 10…

View Details
Posted 2025-12-01

Call Center

HealthWorks for Northern Virginia
Leesburg, VA

HealthWorks for Northern Virginia is a non-profit Federally Qualified Health Center (FQHC) serving the medically underserved and uninsured populations of Northern Virginia. We provide quality medical,…

View Details
Posted 2025-12-29

Journeyman Software Developer

Systems Technology Forum
Dahlgren, VA

** Security Clearance Required Company Overview Systems Technology Forum LTD (STF) is an established industry partner with a passion for exceptional performance and an unwavering commitment to…

View Details
Posted 2026-01-16

Junior Exterior Estimator - Roofing and Exterior Work

Rose Roofing & Restoration
Sterling, VA

Job Title: Junior Estimator Department: Operations Reports to: Project Control Manager FLSA Status: Exempt Summary The Junior Estimator serves as the Operations Department specialist I…

View Details
Posted 2025-12-01

Manager, Counsel: Servicing and Servicing Strategy (Hybrid)

Capital One
McLean, VA

Manager, Counsel: Servicing and Servicing Strategy (Hybrid) Capital One’s growing Legal department will allow you to showcase your talents in a fast paced, fun environment. At Capital One, yo…

View Details
Posted 2025-12-16