SYSTEMS ENGINEER SR PRINCIPAL (HPC/AI System Administrator, Storage Engineer, Monitoring Expert, Solution Architect, Security/Provisioning Engineer, or Multi-discipline Expert)

GDIT
Falls Church, VA

Responsibilities for this Position

Location: Any Location / Remote
Full Part/Time: Full time
Job Req: RQ210987

Type of Requisition:
Regular

Clearance Level Must Currently Possess:
None

Clearance Level Must Be Able to Obtain:
None

Public Trust/Other Required:
None

Job Family:
IT Infrastructure and Operations

Job Qualifications:

Skills:
Complex Systems, High Performance Computing (HPC), Operations Management, System Performance, Systems Management
Certifications:
None
Experience:
10+ years of related experience
US Citizenship Required:
Yes

Job Description:

SYSTEMS ENGINEER SR PRINCIPAL (HPC/AI System Administrator, Storage Engineer, Monitoring Expert, Solution Architect, Security/Provisioning Engineer, or Multi-discipline Expert)

Advance how our customers operate while you advance your career. Join GDIT as a Systems Engineer Sr Principal for High Performance Computing (HPC) and build an impactful career in enterprise IT, collaborating with people who are driven and resourceful like you.

MEANINGFUL WORK AND PERSONAL IMPACT
As a Systems Engineer Sr Principal, the work you'll do at GDIT will be impactful to the mission of the National Oceanic and Atmospheric Administration (NOAA) National Weather Service (NWS). You will play a crucial role in supporting the full lifecycle sustainment and operational availability of leading-edge High Performance Computing (HPC) clusters that are the key elements of the Weather & Climate Operational Supercomputing System (WCOSS) used 24/7 by the National Centers for Environmental Prediction (NCEP) Central Operations (NCO).
Lead/Manage/Support the day-to-day operations, sustainment, HPC services delivery, and incremental enhancements of two geographically separated HPC clusters that are GDIT contractor-owned and contractor-operated (COCO) and used exclusively for WCOSS. This position will be essential in maintaining complex HPC service availability and delivery for intricate customer workload processing and output specifically aligned to forecasting and predictions from the Global Forecast System (GFS) and supporting models.
Collaborate with the GDIT WCOSS team as a senior-level HPC functional expert addressing intricate and multifaceted HPC challenges by providing innovative ideas, solutions, and resolutions for customer requests, issues, and efficiency improvements on a continuous basis.
Drive and prioritize resource utilization towards continuously improving customer satisfaction with GDIT's HPC service delivery and exceeding the contract service level metrics of uptime, availability, performance, stability, and on-time product delivery.
Utilize past experience, team collaboration, system management and troubleshooting applications, and ingenuity to support customer operations while working on systems that range in capacity from 1,000 to 3,000+ nodes and hundreds of PB of storage per system.

WHAT YOU'LL NEED TO SUCCEED
Bring your technology expertise and drive for innovation to GDIT. The Systems Engineer Sr Principal must have:
Education: Bachelor of Arts/Bachelor of Science
Experience: 10+ years of related experience
Technical skills: Highly proficient with Linux (Rocky Linux, SLES, etc.); scripting in Python, Perl, or Bash; networking concepts and technologies such as Ethernet, InfiniBand, and Slingshot, TCP/IP networking, basic routing, and network services; programming in Python, C/C++, or Fortran; administering PBS Pro, Slurm, or other batch systems in an HPC cluster; and system performance monitoring and tuning in an HPC cluster environment (e.g., OpenSearch, Grafana, Prometheus)
Security clearance level: must complete a satisfactory background investigation
US citizenship required
Role requirements: Expected to perform as an individual SME contributor, functional lead, or project/task leader responsible for work product delivery. Extensive experience in troubleshooting, diagnosing, and repairing hardware failures to component level on servers, and in coordinating with vendors to resolve hardware and software problems. Minimal travel required for onsite work, team collaboration, training, and customer interaction.

GDIT IS YOUR PLACE
At GDIT, the mission is our purpose, and our people are at the center of everything we do.
Growth: AI-powered career tool that identifies career steps and learning opportunities
Support: An internal mobility team focused on helping you achieve your career goals
Rewards: Comprehensive benefits and wellness packages, 401K with company match, and competitive pay and paid time off
Flexibility: Full-flex work week to own your priorities at work and at home as part of an onsite and distributed remote team with opportunities for in-person collaboration.
Community: Award-winning culture of innovation and a military-friendly workplace

OWN YOUR OPPORTUNITY
Explore an enterprise IT career at GDIT and you'll find endless opportunities to grow alongside colleagues who share your desire to drive operations forward.

The likely salary range for this position is $140,250 - $189,750. This is not, however, a guarantee of compensation or salary. Rather, salary will be set based on experience, geographic location and possibly contractual requirements and could fall outside of this range.

Scheduled Weekly Hours:
40

Travel Required:
Less than 10%

Telecommuting Options:
Remote

Work Location:
Any Location / Remote

Additional Work Locations:

Total Rewards at GDIT:
Our benefits package for all US-based employees includes a variety of medical plan options, some with Health Savings Accounts, dental plan options, a vision plan, and a 401(k) plan offering the ability to contribute both pre- and post-tax dollars up to the IRS annual limits and receive a company match. To encourage work/life balance, GDIT offers employees full flex work weeks where possible and a variety of paid time off plans, including vacation, sick and personal time, holidays, paid parental, military, bereavement and jury duty leave. GDIT typically provides new employees with 15 days of paid leave per calendar year to be used for vacations, personal business, and illness and an additional 10 paid holidays per year. Paid leave and paid holidays are prorated based on the employee's date of hire. The GDIT Paid Family Leave program provides a total of up to 160 hours of paid leave in a rolling 12-month period for eligible employees. To ensure our employees are able to protect their income, other offerings such as short- and long-term disability benefits, life, accidental death and dismemberment, personal accident, critical illness and business travel and accident insurance are provided or available. We regularly review our Total Rewards package to ensure our offerings are competitive and reflect what our employees have told us they value most.

We are GDIT, a global technology and professional services company that delivers consulting, technology and mission services to every major agency across the U.S. government, defense and intelligence community. Our 30,000 experts extract the power of technology to create immediate value and deliver solutions at the edge of innovation. We operate across 50 countries worldwide, offering leading capabilities in digital modernization, AI/ML, Cloud, Cyber and application development. Together with our clients, we strive to create a safer, smarter world by harnessing the power of deep expertise and advanced technology.

Join our Talent Community to stay up to date on our career opportunities and events at gdit.com/tc.

Equal Opportunity Employer / Individuals with Disabilities / Protected Veterans




Posted 2025-12-05
