Staff Engineer, Data Federation and Online Archive - San Francisco, California
Company: MongoDB Location: San Francisco, California
Posted On: 05/03/2025
MongoDB's mission is to empower innovators to create, transform, and disrupt industries by unleashing the power of software and data. We enable organizations of all sizes to easily build, scale, and run modern applications by helping them modernize legacy workloads, embrace innovation, and unleash AI. Our industry-leading developer data platform, MongoDB Atlas, is the only globally distributed, multi-cloud database and is available in more than 115 regions across AWS, Google Cloud, and Microsoft Azure. Atlas allows customers to build and run applications anywhere, on premises or across cloud providers. With offices worldwide and over 175,000 new developers signing up to use MongoDB every month, it's no wonder that leading organizations, like Samsung and Toyota, trust MongoDB to build next-generation, AI-powered applications.

About our Products

The Atlas Data Federation service creates virtual databases that enable customers to query, transform, and analyze data across multiple sources (MongoDB clusters and cloud object storage) through a unified MongoDB query interface, without moving or copying the underlying data. We handle hundreds of millions of queries per month, processing exabytes of customer data.

The Atlas Online Archive service provides low-cost, tiered storage for querying infrequently accessed, read-only data. By optimizing the storage layout for customer data during ingestion and rebalancing it as needed, Online Archive delivers great performance and scalability at reasonable cost for Atlas customers. We manage petabytes of customer data and have a steep growth trajectory.

About the Role

As a Staff Engineer on the Atlas Data Federation and Archiving team, you'll lead initiatives to drive technical excellence across our federated query system and data archival systems. You'll balance hands-on development with technical leadership, focusing on operational excellence, system reliability, and mentoring.

This role offers the opportunity to solve complex distributed systems challenges at scale while improving the reliability and performance of business-critical systems. Your work will directly impact thousands of MongoDB Atlas customers who depend on our data federation and archiving capabilities for their critical business operations.

This role can be based out of our Austin, New York City, San Francisco, or Seattle office or remotely in the United States.

Key Responsibilities

System Reliability & Performance
- Lead initiatives to improve system observability, stability, and resource management
- Design and implement advanced autoscaling solutions for query execution and data processing
- Reduce incident rates through holistic improvements to system resilience
- Identify opportunities to reduce operating costs in storage and query systems

Technical Leadership
- Guide architectural decisions and conduct design reviews across two engineering teams
- Mentor senior engineers in distributed systems design and operational excellence
- Collaborate with Product Management on technical roadmap development
- Drive cross-team technical initiatives and standards
- Participate in on-call rotation and provide senior oversight for incident response and postmortem retrospectives

Systems Development
- Design and implement improvements to our distributed query execution engine
- Optimize data archival pipelines for increased throughput, durability, and reliability
- Design and implement solutions for single-tenant isolation requirements

Candidate Profile