Colorado Technology Jobs

Colorado Jobs

Job Information

Tech Data Corporation SiteOps Reliability Engineer in Denver, Colorado

About this OpportunityThe Site Reliability Engineer role will be responsible for providing technicalsupport for our platform solutions and services. This role is expected to workclosely with the required development, enterprise infrastructure, andinternal business team (or external customers) to resolve and escalateproduction support incidents where necessary.How You Contribute to Our Vision: Key ResponsibilitiesAs a Site Reliability Engineer on our SiteOps team, you'll be part of agroup that's intensely focused on our customers and the health of oursolutions. Whether it's incident management, production support,advanced monitoring, or mentoring, SREs provide the foundation for issuetriage and speedy resolution with a continuous improvement mindset.You will:Serve as a Tier 2 to escalation point to issues that the Tier 1 team cannotresolve.Research problem tickets to address data, setup, and code issues toprovide responsive correction of issues.Assist in prioritization of enhancement and defect resolution.Monitor automated system alerts, log files, and other monitoring tool outputs.Design and develop emergency patches to address critical production issues.Manage third-party components used in the different Digital CommerceSolutions.Perform administrative functions for application software.Assist in providing support by participating in weekly on-call rotation.Interact with internal business customers, operation personnel, anddevelopment groups in troubleshooting and correction of issues.Manage the development, quality assurance, and production applicationenvironments, working closely with operations personnel to honor applicationService Level Agreements (SLA).Perform root cause analysis on issues that lead to the implementation ofprocesses to prevent repetitive problems.Conduct analysis on issues that lead to the implementation of solutions toprevent repetitive problems.Work on projects to better improve the Production Support model and processes.Candidates should have experience with many of the following:You have solved multiple problems by writing and documenting exceptionalscript solutions.You have extensive experience automating solutions to identifiedissues/bugs/anomalies. You have a passion for replacing manual processeswith efficient and concise automated solutions.You have been responsible for running critical services that multiplecustomers depend upon. You understand the importance and impact thatoperational optimization can have on a product and the positive ripple effectsthat it can have across an entire organization.You are empathetic: You take others' opinions into account and clearlycommunicate your thoughts to reach technical solutions quickly.You consider it necessary to understand and appreciate your customers andenjoy seeing your work improve the work of others.Mentorship and a Servant/Leader mentalityExperience in automation, specifically related to deployment, recovery,or other manual processes.Experience using telemetry to understand throughput, limitations, andconstraints in a service.Strong problem-solving skills and passion for solving hard problems as part ofa team and by individual investigation.Experience with REST APIs, JSON, and exposure to container-based technologies.Experience supporting zero fault-tolerant, scalable, and high-volumesystems applications in .NET.Experience in SQL Server 2012, Transact SQL, Stored Procedures.Great analytical skills and ability to think on the feet and work underpressure.Strong Windows/Unix platform skills and understanding of network,storage, tiered application environments, and security.Knowledge of Splunk, Graylog, Dynatrace, Application Insights orequivalent monitoring tools.Experience analyzing .Net thread/heap dumpsFamiliarity with AWS services such as S3, Lambda, SQS, SNS, EC2, EKS,Minimum RequirementsBachelor's degree in computer information systems preferred, but not required.4+ Years Dev Ops Engineering with a focus on problem resolution and platformopt