Software Engineer, Traffic and Observability
- New York, NY
The mission of the Traffic and Observability team is to build highly scalable and reliable application-level communication and monitoring infrastructure for thousands of geographically distributed microservices. The software and services provided by the Traffic and Observability Engineering team are the bedrock of Square's reliability.
Develop, operate and maintain a sidecar-based Service Mesh (Envoy) and associated RPC libraries (gRPC) in Java, Go and Ruby.
Develop, operate and maintain highly reliable, mission-critical software-based L4 and L7 edge proxies.
Provide seamless service discovery (ZooKeeper et al.) for every application at Square.
Implement intelligent rate-limiting and circuit-breaking to prevent cascading failures.
Contribute to our distributed tracing (DataDog), exception tracking (Sentry), infrastructure monitoring (SignalFx), and alerting (PagerDuty) integrations.
Manage our on-prem Elasticsearch-based logging cluster.
Participate in a team on-call rotation.
We believe that a well-designed production environment can be simple and beautiful. We are looking for experienced engineers with a passion for building highly reliable infrastructure.
3+ years of software development experience in Java, Go or similar.
Experience building, operating and maintaining low-latency, highly reliable services.
*Open to team members in our New York or San Francisco offices.