Site Reliability Engineer - Platform Infrastructure
Bluesky Social
Software Engineering, Other Engineering
Remote
Posted on Mar 27, 2025
<div><div> <div> <span>Bluesky's mission is to transition the social web from platforms to protocols. We're building a federated social network where users have more power. Our team has decades of combined experience building distributed applications. </span><span>As a site reliability engineer, you'll operate and shape infrastructure that powers core Bluesky services, and will be a critical part of transforming social media.</span> </div> <div><br></div> <div><span>Working on Bluesky’s cloud and bare-metal systems, you’ll be charting the path for long-term availability and resilience of systems running on dense, latest-generation servers running in our own racks performing millions of queries per second—all as we rapidly scale from tens to hundreds of millions of users.</span></div> <div><br></div> <div><strong>Responsibilities:</strong></div> <ul> <li><span>Plan the growth of Bluesky infrastructure with an eye towards availability and data durability.</span></li> <li><span>Drive on practices that improve operations, especially surrounding observability, documentation, and automation.</span></li> <li><span>Join the infra operational team, participating in our existing frontline on-call rotation.</span></li> <li><span>Work closely with engineers who develop our core software systems.</span></li> </ul> <div><br></div> <div><strong>You might be a good fit if you:</strong></div> <ul> <li><span>Have worked with systems running on self-hosted bare metal.</span></li> <li><span>Have familiarity with the upper limits of operating systems, networks, and storage.</span></li> <li><span>Have familiarity with internet-scale networking protocols like BGP and DNS-level load balancing.</span></li> <li><span>Have significant experience managing database systems (e.g. Postgres, SQLite, Cassandra, ScyllaDB).</span></li> <li><span>Have experience with various tools and platforms such as Ansible, Docker, Kubernetes, MaaS, and CI/CD solutions.</span></li> <li><span>Have software engineering experience, particularly with Go.</span></li> <li><span>Like working on small, fast-moving teams.</span></li> <li><span>Have read the AT Protocol docs and want to contribute.</span></li> </ul> <div><br></div> <div><strong>We're especially excited if you have:</strong></div> <ul><li><span>Previous experience working on infrastructure for social apps at large scale.</span></li></ul> <div><br></div> <div><span>We're a fully remote team, though a significant overlap of working hours with US/Pacific is required. For full-time roles, we offer health, dental, and vision insurance.</span></div> <div><br></div> <div><strong>To learn more about us, check out:</strong></div> <ul> <li><a href="https://bsky.social/about" target="_blank">bsky.social</a></li> <li><a href="https://atproto.com/" target="_blank">atproto.com</a></li> </ul> <div><span>Please attach your resume and links to your GitHub, GitLab, or a portfolio of past work within the same attachment.</span></div> <div><br></div> <div><em>*The application also includes the following questions:* </em></div> <ol> <li><em>Please write a cover letter explaining why you would like to work here at Bluesky.</em></li> <li><em>If you have a Bluesky account, please share your handle here.</em></li> </ol> <div><br></div> </div></div>
Bluesky Social is an equal opportunity employer.