Site Reliability Engineer for FIS Data Exchange Reporting and Analytics
Company | Fidelity National Information Services |
---|---|
Location | Jacksonville, FL, USA |
Salary | Not Provided |
Type | Full-Time |
Degrees | Bachelor’s |
Experience Level | Senior |
Requirements
- Conceptually strong in Technology, Infrastructure, and Domain.
- Proficiency in at least one of: Unix/Windows scripting, Python, Java, C++, C#.
- Exceptional and demonstrable web development experience.
- Experience with SQL Server databases – Always On Availability Group, SQL Managed Instance.
- Ability to automate repetitive tasks (scripting, RPA, Power Automate, UiPath, etc.).
- Knowledge of DevOps (CI/CD) and relevant tools (Jenkins, Harness).
- Experience with Docker in a production environment, including container orchestration (e.g., Kubernetes).
- Experience with cloud-based infrastructure, specifically Azure (App Services, Redis, EventHub, Function Apps, SQL MI, SQL VM, APIM, App Gateway, etc.), as well as Snowflake, Astronomer, Airflow, and DBT.
- Experience with infrastructure as code – Terraform (TFE).
- Knowledge of configuration management systems such as Ansible, Chef, or Puppet.
- Knowledge of operating systems, networking, middleware, databases, SSL (Secure Sockets Layer), and load balancers.
- Strong knowledge of tools such as Dynatrace, Azure Monitor, App Insights, Log Analytics, and KQL, and the ability to create dashboards, views, and alerts.
- Strong problem-solving skills: able to troubleshoot and resolve problems quickly.
- Bachelor’s degree in Computer Science or Information Systems, with expertise in platform management.
- Minimum of 5 years of SRE experience.
- Strong understanding of software development life cycle and agile/SAFe methodologies.
- Coding experience beyond simple scripts.
- Investigative and problem-solving skills.
- Analytical and proactive mindset.
- Excellent communication, documentation and presentation skills.
- Self-starter, passionate about continuous improvement, with a deep desire to make a difference.
Responsibilities
- Build solutions and systems to manage platform infrastructure and applications.
- Contribute to system design consulting, platform management, and capacity planning.
- Improve reliability, quality, and time-to-market of our suite of software solutions.
- Build monitoring that alerts on symptoms and incorporates self-healing.
- Run the production environment by monitoring availability and taking a holistic view of system health.
- Measure and optimize system performance, with an eye toward pushing our capabilities forward, getting ahead of customer needs, and innovating to continually improve.
- Provide primary engineering support for multiple large, distributed software applications.
- Gather and analyse metrics from both platform resources and applications to assist in performance tuning and fault finding.
- Create sustainable and scalable systems and services through automation and platform uplifts.
- Balance feature development speed and reliability with well-defined service level objectives.
- Partner with stakeholders to design and deliver a reliable, scalable, secure, and performant platform.
- Stay current on technical trends to suggest innovative tools and approaches to problems.
- Take a proactive approach to spotting problems, areas for improvement, and performance bottlenecks.
- Identify and resolve problems promptly to meet and improve service levels and standards.
Preferred Qualifications
No preferred qualifications provided.