DevOps Engineer

  • Full-time

Company Description

Pet360, Inc. is the largest and fastest growing integrated media & ecommerce company dedicated to the U.S. consumer pet industry. We are headquartered in the Philadelphia area with offices in NYC, Miami, Colorado and Louisville and have distribution centers on both coasts. We are redefining the pet parenting experience by providing pet owners easy access to the tools and resources they need – trusted information from a team of experts, connections to other pet parents, and convenient online ordering solutions for pet food, meds and supplies.

As the company continues to grow, we’re looking to expand the Pet360 Pack by adding talented individuals who enjoy an innovative, fast-paced and entrepreneurial environment where change is encouraged and your impact is instantly visible. The company is backed by leading private equity and venture capital firms and is well capitalized to continue our exceptional growth.

Our family of premium brands includes Pet360.com – the most comprehensive online resource for pet parents; petMD.com – the world’s largest digital resource for pet health information; PetFoodDirect.com the leading online retailer of pet food, medications and supplies; BlogPaws – the largest professional network of pet bloggers and social media enthusiasts; and Only Natural Pet – a complete line of natural pet supplies specifically formulated to be biologically appropriate for cats and dogs. Today, our network reaches more than 12 million pet parents each month!

 

Job Description

We are looking for a DevOps Engineer whose primary responsibility is the development, management and support of all production systems including front-end, middle-tier and application servers. This position will be part of the team that sets policy, monitors, prioritizes, and develops standards, as well as maintains service levels, for the production environment. Controls activities related to day-to-day maintenance such as creating backups, editing indexes and performing installations and upgrades in both a physical and virtual environment. Works on complex problems where analysis of situations or data requires evaluation of intangible variables.

While we're looking for an individual that can handle emergencies, the ideal candidate will have a strong dedication to process, standardization, security, documentation, and minimizing the risk of emergency situations. Such an individual should view producing good documentation and maintaining tight configuration management as mission critical activities. Position needs to exercise independent judgment in developing methods, techniques, and evaluation criterion for obtaining results. This position requires a self starter, willing to take on large and small challenges while operating under minimal supervision.

 

Duties and Responsibilities: 

    • Research, specify, configure and deploy servers and storage with a focus on high performance/availability and scalability.
    • Performance tuning of web, application, DB servers, network and storage devices.
    • Must design, deploy, maintain and troubleshoot infrastructure for multiple production websites.
    • Ensure the development and use of an effective preventive maintenance program suitable to meet the operation objective of "99.9% availability".
    • Develop and deploy monitoring systems and system checks.
    • Handle large projects related to security, audit and compliance including new infrastructure rollouts.
    • Work closely with the DB group and stakeholders to meet their requirements.
    • Prepare periodic reports regarding unit's activities and goals attainment.
    • Proactively suggest alternative solutions and designs, third party tools, etc. to solve the problems at hand.
    • As a member of the team will respond to all system problems on a 24x7x365 basis, will take part in on-call rotation.
    • Frequently called upon to solve problems; has strong problem solving skills and the ability and desire to learn new technologies rapidly.
    • Able to deal effectively with internal and external groups including clients and vendors.
    • Other duties as assigned.


Qualifications

Competencies:

    • Requires 5+ years of systems and network administration experience managing a high-volume, highly-available Linux based environment.
    • High Availability and failover scenarios, load balanced environments in both Co-located datacenters and via Cloud providers
    • Strong working knowledge of storage environments (SAN, NAS).
    • Strong troubleshooting skills (network and server level) are a critical requirement for this role.
    • An expert level understanding of Nginx, PHP, LDAP, BIND.
    • In-depth, hands-on knowledge of TCP/IP, IP security, DNS, SMTP, SNMP, NFS, DHCP, Samba and FTP.
    • Experience with an RDBMS such as MySQL and/or Postgres.
    • Experience with clustering technologies, such as RedHat Cluster Services.
    • Proficiency with BSD, RHEL and/or CentOS, Windows Server.
    • Automation via scripting (Perl, Python, Bash).
    • File Replication, File System Tuning, Upgrading and Configuring.
    • Excellent interpersonal and communication (verbal and written) skills.
    • Documentation of processes and procedures.
    • Daily monitoring and maintenance of servers.
    • Test and apply new software and maintenance patches, including patch management services.
    • Complete security audits on a routine basis.
    • Ability to troubleshoot across all layers of the OSI stack.
    • Ongoing effort to maintain skills and knowledge at the state of the art level.
    • Proven ability to work independently or in a team environment.

Additional Information

This is a full time, salaried position with a full benefits package, including paid time off after 90 days, medical, dental,  and a generous product discount. Lots of other fun company perks too!