Software Engineer, Infrastructure at Mercor

posted 4 days ago
mercor.com Full Time San Francisco, CA 210-405k

Software Engineer, Infrastructure

Salary: $210,000 - $405,000 per year, plus equity
Location: San Francisco, CA or Seattle, WA
Position: Full-time

About the Infrastructure Team

We're seeking Software Engineers to join OpenAI’s Infrastructure team, supporting multiple high-impact groups. You can contribute to areas like Core Distributed Systems, Reliability Engineering, Observability, Developer Productivity, or Cloud Infrastructure, based on your expertise and interests.

About the Role

In this role, you'll collaborate closely with teams building critical infrastructure designed for scalability, reliability, and performance. You'll shape technical strategies, support advanced research, and develop products that bring OpenAI technologies—such as ChatGPT and the OpenAI API—to millions of global users.

Team Focus Areas

  • Distributed Systems: Build highly scalable, available, and reliable distributed systems essential for OpenAI’s technology stack.

  • Systems Engineering: Address core infrastructure challenges and optimize performance and scalability.

  • Reliability Engineering: Develop fault-tolerant systems and manage incident response and resilience initiatives.

  • Observability: Create tools for monitoring metrics, logs, and tracing to ensure system visibility and reliability.

  • Developer Productivity: Enhance tools, workflows, and environments that improve engineer efficiency and software quality.

  • Cloud Infrastructure: Manage and evolve cloud-based compute, networking, and storage infrastructure supporting all services and workloads.

Responsibilities

  • Design, implement, and manage infrastructure systems to ensure reliability and performance.

  • Collaborate with cross-functional teams to understand and fulfill infrastructure needs.

  • Enhance developer experience through improved tooling, automation, and workflows.

  • Participate in incident response, conduct postmortems, and implement best practices for reliability and scalability.

Ideal Candidate

  • Strong software engineering experience, with proficiency in languages such as Python, Go, C++, or Rust.

  • Proven experience building, operating, or scaling distributed systems or developer infrastructure.

  • Proficient with Linux environments and tools such as Kubernetes, Terraform, CI/CD pipelines, and observability stacks.

  • Skilled at debugging complex systems, with meticulous attention to detail.

  • Excellent communication skills and ability to collaborate effectively with cross-functional teams.

Qualifications

  • 4+ years of relevant industry experience, including 2+ years in a leadership or tech lead role managing complex, large-scale projects.

  • Passion for scalable distributed systems, with an emphasis on reliability, security, and continuous improvement.

  • Exceptional communication skills and the ability to build consensus with diverse stakeholders.

About OpenAI

OpenAI is committed to developing and deploying safe, powerful AI technologies for the benefit of all humanity. We prioritize safety, inclusivity, and diverse perspectives. As an equal-opportunity employer, we welcome qualified applicants regardless of protected characteristics.

We adhere strictly to applicable fair hiring practices, including consideration of applicants with criminal histories.

For details, refer to OpenAI’s Affirmative Action and Equal Employment Opportunity Policy Statement.

Applicants requiring accommodations can request assistance via the provided link.

Join us in shaping AI’s future responsibly and inclusively.
