Infrastructure Engineer

Remote
Full Time
Engineering
Mid Level

The Role

We are looking for a jack-of-all-trades Infrastructure Engineer responsible for keeping our production systems running and evolving. This role spans the full breadth of infrastructure - networking, storage, web serving, and cloud - primarily GCP, but also AWS, Azure, Cloudflare, and a range of VPS providers. You'll have direct ownership over the reliability and operability of systems that the rest of the company depends on every day.

This position is best suited for experienced infrastructure, SRE, or DevOps engineers looking to solve a wide array of technical challenges as part of a fast and nimble team.

What You’ll Do

  • Keep infrastructure operational and up to date - own the health of production systems across a heterogeneous mix of cloud providers and bare metal, including patching, upgrades, and routine maintenance.
  • Turn requirements into working infrastructure quickly - take technical requirements from engineering and product teams and ship working infrastructure without a lot of back-and-forth or hand-holding.
  • Proactively fix problems before they become incidents - through monitoring, regular audits, and a general habit of leaving things better than you found them.
  • Own incident response - triage and mitigate infrastructure incidents, communicate clearly to the rest of the company, and follow through with root cause analysis and lasting fixes.
  • Manage access to infrastructure and services - set up and maintain IAM roles, policies, and permissions across cloud providers and internal tooling so teams can do their work without access becoming a security liability.
  • Automate repetitive operational work - use Terraform, Ansible, and scripting to eliminate manual toil; if you're doing something by hand for the second time, it should be automated.
  • Operate and improve the monitoring stack - maintain and refine our alerting and observability infrastructure so we have useful signal, not just noise.

What We’re Looking For

Must-have experience:

  • Significant production experience operating large-scale Internet services.
  • Strong Linux sysadmin fundamentals, including debugging at the OS level.
  • Solid working knowledge of Kubernetes in production, including deploying, debugging, and maintaining clusters.
  • Practical infrastructure automation experience with Terraform and Ansible in real production environments.
  • Hands-on experience with at least one major cloud provider at production scale; GCP experience is a strong plus.

Nice to have:

  • Familiarity with web application architectures (databases, web frameworks).
  • Comfort with network debugging tools (e.g. tcpdump, traceroute, mtr) and working through network-layer problems.
  • Experience with profiling tools and methodologies.
  • Experience with Prometheus and Grafana.
  • Proficiency writing scripts and tooling in Python, Bash, or Go.

About IPinfo

IPinfo is a leading provider of IP address data. Our API handles over 100 billion requests a month, and we also license our data for use in many products and services you might have used. We started as a side project back in 2013, offering a free geolocation API. Since then we've bootstrapped ourselves into a profitable business with a global team of over 60 people, and grown our data offerings to include geolocation, IP to company, carrier detection, and VPN detection. Our customers include T-Mobile, TransUnion, Datadog, Demandbase, and many more.

How We Work

We have an ambitious team, spread all over the globe. We sync up on a monthly all-hands Zoom call, and most teams have a call together every two weeks. Everything else happens asynchronously, via Slack, GitHub, Linear, and Notion. That means you can pick the hours that work best for you, so you can be at your most productive.

To thrive in this environment, you'll need high levels of autonomy and ownership. You have to be resourceful and able to work effectively in a remote setup.
