Energy Impact Partners
95
companies
1,128
Jobs

Site Reliability Engineer

TRC Companies

TRC Companies

Software Engineering
Clifton, VA, USA · New York, NY, USA · United States
Posted on Dec 17, 2024
About Us

Groundbreaker. Game changer. Pioneer.

TRC has long set the bar for clients who require more than just engineering, combining science with the latest technology to devise innovative solutions that stand the test of time. From pipelines to power plants, roadways to reservoirs, schoolyards to security solutions, clients look to TRC for breakthrough thinking backed by the innovative follow-through of an industry leader.

TRC's professionals work with a broad range of commercial, industrial and government clients and the communities they serve. We deliver breakthrough solutions that address local needs -- so our clients can better succeed in an ever-changing world.

Working at TRC means tackling interesting, meaningful projects. We pride ourselves on our collaborative spirit, entrepreneurial zeal and agile corporate structure. We recognize that the expertise of our staff is our strongest asset, so we generously reward employees for successful performance and invest in their careers through training and the development of new skills and certifications.


Overview

We are looking to hire a Site Reliability Engineer (SRE) to build and maintain software that enables our customers to navigate our site quickly. The ideal candidate will provide skilled problem-solving measures that enable product expansion and improve the customer experience to accomplish company objectives.

The ideal candidate will have a background in the utility industry and possess strong knowledge of agile methodologies, specifically Scrum. The SRE will be responsible for ensuring the reliability, availability, and performance of our software products. This role involves working closely with both onshore and offshore teams to support migrations and product development. Excellent communication skills and the ability to collaborate effectively with diverse teams are essential. To be a successful SRE, you should be meticulous and detail-oriented, with excellent technical and information security skills.

If you are an experienced SRE with unique ideas and enjoy working in a collaborative, cross-functional work environment, we’d love to hear from you.


Responsibilities

Key Responsibilities (Duties):

  • Develop and provide operational support for full-stack software applications.
  • Ensure the reliability, availability, and performance of software products in production.
  • Collaborate with development operations staff to create, monitor, and troubleshoot the entire system including infrastructure.
  • Increase system resilience and serve larger customer volumes with expert-level coding, bulletproof release, and change management skills.
  • Monitor and manage system health, performance, and capacity planning.
  • Automate repetitive tasks to improve system efficiency and reduce manual intervention.
  • Collect operating system data and report performance metrics to stakeholders.
  • Manage cloud and database system maintenance, debugging production issues as they arise.
  • Experience building software and computer systems using a variety of languages (JavaScript, Python, etc.).
  • Comfortable working with cloud-native infrastructure, such as AWS Lambda, and Azure Cloud Services.
  • Impeccable creative and communication skills.
  • Ability to problem solve in a fast-paced, high-stakes environment.
  • Monitoring application and VPC traffic for suspicious behavior.
  • Creating application policies and service level agreements metrics for measuring operations.
  • Develop and maintain incident response plans and conduct post-incident reviews to prevent future occurrences.
  • Participate in on-call rotations to provide timely support for critical systems.
  • Foster a culture of continuous improvement and learning within the team.
  • Consulting with staff, managers, and executives about the best operational practices and providing technical advice.

Qualifications

Minimum Required Qualifications:

Minimum qualifications are those an applicant must possess to be considered a candidate for the position.

  • Bachelor's degree in computer science, cyber security, or a related field.
  • 5+ years experience as a site reliability engineer or similar role.
  • Relevant industry certifications, such as through the Site Reliability Engineering (SRE) Foundation.
  • Strong knowledge of agile methodologies, particularly Scrum.
  • Proficiency in scripting and automation tools (e.g., Python, Bash, Ansible).
  • Experience with monitoring and logging tools (e.g., Prometheus, Grafana, ELK stack).
  • Familiarity with cloud platforms (e.g., AWS, Azure, Google Cloud) preferably AWS.
  • Excellent communication skills, both written and verbal.
  • Proven ability to work effectively with offshore and onshore teams.
  • Strong problem-solving skills and a proactive approach to identifying and resolving issues.
  • 5 + years of cloud infrastructure automation technologies (Terraform, Code Deploy).

    Preferred Qualifications:

    Preferred qualifications are not required for an applicant to be considered a candidate for the position.

    • Master’s degree in Computer Science, or a related field.
    • Certifications in cloud technologies (e.g., AWS Certified Solutions Architect, Google Cloud Professional DevOps Engineer).
    • Experience with containerization and orchestration tools (e.g., Docker, Kubernetes).
    • Knowledge of CI/CD pipelines and tools (e.g., Jenkins, GitLab CI).
    • Experience in performance tuning and optimization of software systems.
    • 6+ years of managing images (e.g., AWS AMIs) including remediation and patching strongly preferred.
    • Familiarity with the Energy Efficiency Domain a plus

Benefits*: TRC offers a competitive benefit package consisting of:

  • Medical, dental, vision, and disability insurance.
  • 401k package that includes both traditional and Roth IRA options and Company match.
  • Paid time off contingent upon full time or part time status and level of seniority (ranging from 15 to 25 days per year).
  • All full-time employees enjoy a minimum of 8 Paid Holidays per year.
  • TRC ensures that all employees, including those that work part-time, receive paid sick, family, and disability leave in accordance with the laws of their state of residence.

*These benefits are applicable to US employees depending on their employment status (full time, part time, or temporary). TRC offers jurisdiction specific benefit packages for International and Puerto Rico employees.

Compensation: In accordance with local law, the below salary range is applicable for the job and location associated with this requisition. Compensation will vary based upon experience, education, skill level, geographical location, and other compensable factors.

Other Compensation: Employee may be eligible for participation in the Company’s annual Discretionary Bonus Program which is contingent on Company, Business Segment, and Individual performance against Key Performance Indicators which vary by year and employee.


Disclaimers

TRC is an Equal Opportunity Employer. We do not discriminate on the basis of race, color, religion, sex, sexual orientation, gender identity, national origin, age, disability, veteran status, or any other characteristic protected by applicable law. All employment decisions are made based on qualifications, merit, and business needs. We celebrate diversity and are committed to creating an inclusive environment for all employees.

The complete job description and application are available on TRC’s career site.

TRC accepts applications for this position on an ongoing, rolling basis and reserves the right to cancel this posting at any time.


Salary Range

USD $120,000.00 - USD $140,000.00 /Yr.