
Site Reliability Engineer II - Data Platforms (Remote)

Remote · USA Full-time New today

This is a fully remote job. The offer is available from: United States

Job Overview:

The Data Platform Reliability Engineer is responsible for ensuring the stability, performance, and operational reliability of UNFI’s cloud-based and legacy data platforms. This role focuses on monitoring, troubleshooting, and automating operational workflows for Databricks, AWS services, and enterprise ingestion tools such as Fivetran/HVR, AWS DMS, DataStage, and Informatica, as well as supporting BI tools (Power BI, Tableau, Alteryx) and governance solutions. The engineer will work closely with external consulting partners and internal teams to maintain uptime, enforce governance standards, and optimize platform performance and cost efficiency.

Job Responsibilities:

Platform Reliability & Monitoring

  • Monitor health and performance of Databricks clusters, jobs, and workflows.
  • Maintain observability dashboards, alerts, and logs for AWS services and ingestion pipelines.
  • Respond to incidents, perform root cause analysis, and implement corrective actions.

Cost and Performance Management

  • Monitor and optimize platform costs across cloud and data services.
  • Implement cost-control measures and provide regular reporting.
  • Implement and maintain cost controls: cluster policies, auto-termination, right-sizing, job scheduling, storage lifecycle policies.
  • Monitor spend and utilization for Databricks, AWS, ingestion, and BI services.
  • Promote performance best practices.

Monitoring and Observability

  • Build and maintain dashboards, alerts, and logs for Databricks, AWS services, ingestion pipelines, and BI refreshes.
  • Continuously tune alert thresholds to reduce noise and improve signal-to-action ratio.
  • Ensure end-to-end lineage/traceability for faster fault isolation across stages.

External Support Team & Vendor Management

  • Coordinate with external support teams for day-to-day operations and issue resolution.
  • Coordinate with vendors for troubleshooting, service improvements, and escalations.
  • Track and report on SLA adherence and vendor performance.
  • Maintain operational runbooks, knowledge base, and handoff procedures between internal teams and external partners.

Continuous Improvement

  • Drive automation and efficiency in operational workflows.
  • Optimize resource utilization and reduce manual intervention.

BI Platform Operations

  • Support Power BI, Tableau, and Alteryx operations (gateway health, dataset refresh schedules, workspace/app permissions, data-source connectivity).
  • Monitor and improve dataset refresh reliability, query performance, and user access hygiene.

Performs other duties as assigned.

Job Requirements:

Education/Certifications:

  • Bachelor’s degree in computer science, data analytics, systems analysis, or a related field

Experience:

  • 3+ years in data platform operations or reliability engineering.
  • Hands-on experience with Databricks and AWS services in production environments.
  • Demonstrated success in maintaining high-impact data platforms, with a strong track record of managing complex environments.
  • Familiarity with ingestion tools (Fivetran, AWS DMS, DataStage, Informatica) and BI platforms (Power BI, Tableau, Alteryx).
  • Experience with SAP, master data management, and cross-functional processes across supply chain, finance, and operations

Knowledge/Skills/Abilities:

  • Strong troubleshooting and incident management skills.
  • Knowledge of governance, security, and RBAC principles.
  • Ability to work independently and collaborate with external partners.
  • Familiarity with Agile practices and DevOps principles.
  • Understanding of governance, security, and privacy.
  • Good judgment is required for this position as there may be times when direct supervision may not be immediately available.

Work Environment:

Remote Role:

  • This position is classified as remote where the associate will perform remote work from their primary residence. Remote associates are welcome to work from the office but are not required to do so. While remote associates are not required to work from an office on a regular basis, they may be required to come to the office or other UNFI locations for necessary business reasons or if directed to do so by their manager.

Physical Environment/Demands:

Office Roles:

  • Most work is performed in a temperature-controlled office environment.
  • Incumbent may sit for long periods of time at a desk or computer terminal.
  • While performing the duties of this job, the employee is regularly required to sit; use hands to finger, handle, or feel; reach with hands and arms; and talk or hear.
  • Incumbent may use calculators, keyboards, telephones, and other office equipment in the course of a normal workday.
  • Stooping, bending, twisting, and reaching may be required in the completion of job duties.

The above statements are intended to describe the general nature of the work performed by the employees assigned to this job. All employees must comply with Company policy and applicable laws. The responsibilities, duties and ski

