Why work at Nebius
Nebius is leading a new era in cloud computing to serve the global AI economy. We create the tools and resources our customers need to solve real-world challenges and transform industries, without massive infrastructure costs or the need to build large in-house AI/ML teams. Our employees work at the cutting edge of AI cloud infrastructure alongside some of the most experienced and innovative leaders and engineers in the field.
Where we work
Headquartered in Amsterdam and listed on Nasdaq, Nebius has a global footprint with R&D hubs across Europe, North America, and Israel. The team of over 1400 employees includes more than 400 highly skilled engineers with deep expertise across hardware and software engineering, as well as an in-house AI R&D team.
The role
As a Senior Technical Program Manager - Data Center Operations, you will be responsible for driving operational performance across data center IT and infrastructure teams through data-driven insights, structured tracking, and cross-team coordination.This role is centered on operational efficiency, performance measurement, and continuous improvement. You will define how success is measured, ensure transparency across operations, and help teams optimize execution at scale.
Travel: This position may require on-site presence at data centers.
Your responsibilities will include:
- Define, implement, and continuously improve KPIs, metrics, and dashboards for data center operations
- Establish tracking frameworks to monitor performance, incidents, and operational efficiency across teams
- Lead operational reviews (weekly/monthly) to identify bottlenecks, inefficiencies, and improvement opportunities
- Ensure visibility and accountability through structured reporting and data-driven insights
- Standardize processes across sites, improving operational consistency, scalability, and efficiency
- Collaborate with stakeholders to align operational performance with business goals and customer SLAs
We expect you to have:
- 5+ years of experience in technical project/program management, preferably in data center, cloud or infrastructure environments
- Experience with project management methodologies and tools
- Strong experience in building metrics, dashboards, and reporting systems for operations
- Strong technical skills (SQL and/or programming languages such as Python or Go)
- Strong analytical skills with a good foundation in math and statistics
- Proven ability to work across multiple teams and drive alignment and execution without direct authority
- Excellent communication and stakeholder management skills
- A structured, proactive, and results-driven mindset
It will be an added bonus if you have:
- Familiarity with ITIL / ITSM processes
- Experience working with GPU clusters, HPC, or cloud infrastructure
- Understanding of data center network traffic patterns (east-west and north-south)
Key Employee Benefits in the US:
- Health Insurance: 100% company-paid medical, dental, and vision coverage for employees and families.
- 401(k) Plan: Up to 4% company match with immediate vesting.
- Parental Leave: 20 weeks paid for primary caregivers, 12 weeks for secondary caregivers.
- Remote Work Reimbursement: Up to $85/month for mobile and internet.
- Disability & Life Insurance: Company-paid short-term, long-term, and life insurance coverage.
Compensation
We offer competitive salaries between 110k - 204k plus quarterly bonuses and equity based on your experience.
Join Nebius Today!
What we offer
- Competitive salary and comprehensive benefits package.
- Opportunities for professional growth within Nebius.
- Flexible working arrangements.
- A dynamic and collaborative work environment that values initiative and innovation.
We’re growing and expanding our products every day. If you’re up to the challenge and are excited about AI and ML as much as we are, join us!