Website remotenow my smart prosnetwfh
job description
CoreWeave is the AI Hyperscaler™, a trailblazing cloud platform recognized as one of the TIME100 most influential companies of 2024. We provide the essential infrastructure that powers the world’s leading AI labs and enterprises, delivering high-performance, resilient solutions for accelerated computing. As we continue to expand our global footprint across the US and Europe, we are looking for resilient, adaptable engineers who are eager to solve complex problems and define their careers by building the future of intelligence-driven innovation.
About the role:
-
Design and implement solutions to large-scale server observability to continually improve the stability of CoreWeave’s global hardware fleet.
-
Adapt, extend, and implement open-source solutions to augment the depth and breadth of our visibility into our operating environment.
-
Generate and maintain custom reports, alarms, and visualizations to help teams understand and respond to our growth and changes.
-
Create test plans, deployment automation, dashboards, alerts, and insights into our fleet operations, as well as participate in the Fleet Engineering Developers’ on-call rotation.
-
Grow, change, invest in your teammates, be invested in, share your ideas, listen to others, be curious, have fun, and, above all, be yourself.
Requirements, Qualifications and Skills
-
You have 2 or more years experience in a software or infrastructure engineering industry.
-
You have experience in the domains of automation and orchestration workflows and are knowledgeable about server hardware, components, and related technologies and strategies for the management of physical infrastructure at scale.
-
You have experience implementing metrics collection and alerting on standard platforms.
-
You believe in the value of automation and will champion practices that drive reliability and prioritize the CoreWeave customer experience.
-
Applicants must have work authorization that does not require sponsorship from the company now or in the future.
Compensation & Benefits
-
Base pay: $160,000-$185,000.
-
Medical, dental, and vision insurance – 100% paid for by CoreWeave.
-
401(k) with a generous employer match.
-
Flexible PTO and hybrid/remote work options.
About the Role
As a member of the Fleet Monitoring & Analysis Team, you will be instrumental in developing the “zero-touch” management engine that governs our massive hardware fleet. This role focuses on elevating observability and data-driven insights, ensuring our global infrastructure remains stable as we scale. You will work in a hybrid environment that values diversified experiences and provides comprehensive benefits—including family-forming support and mental wellness programs—while contributing to a culture focused on innovative disruption and technical excellence.
To apply for Company Website tgmarinejobs.com.