SRE Engineer Job at ARK Infotech Spectrum, Houston, TX

  • ARK Infotech Spectrum
  • Houston, TX

Job Description

Looking for an SRE engineer with 8+ years of IT and software experience to:

  • Run the production environment by monitoring availability and taking a holistic view of system health
  • Improve the reliability, quality, and time-to-market of a suite of software solutions
  • Use application monitoring tools such as Datadog and Dynatrace
  • Measure and optimize system performance, with an eye toward pushing our capabilities forward, getting ahead of customer needs, and innovating for continual improvement
  • Provide primary operational support and engineering for multiple large-scale distributed software applications

Responsibilities and Objectives

  • Build and deploy infrastructure and software using DevOps and CI/CD
  • Support Kubernetes platforms such as EKS, AKS, and OpenShift
  • Support application teams deployed on cloud platforms
  • Troubleshoot day-to-day issues on the cloud
  • Ensure the safety and soundness of the platform
  • Handle service management, process documentation, and knowledge documentation
  • Collaborate with engineering teams on defects and new features, and operationalize them
  • Respond to issues in a timely and efficient manner, with the assistance of other team members and resources, to minimize the impact on our customers
  • Assist in maturity efforts around application environments to create a more stable and effective solution for all consumers
  • Work with other application support members to create and facilitate a 24x7x365 resolution mechanism
  • Assist in the continual improvement of documentation, processes, governance, customer onboarding, etc. around application environments
  • Gather and analyze metrics from operating systems and applications to assist in performance tuning and fault finding
  • Partner with development teams to improve services through rigorous testing and release procedures
  • Participate in system design consulting, platform management, and capacity planning
  • Create sustainable systems and services through automation and uplifts
  • Balance feature development speed and reliability with well-defined service-level objectives
  • Monitor infrastructure, applications, network components, and DevOps pipelines
  • Review and provide input on the overall design and observability of the platform
  • Provide operational support for the platform and the workloads/products hosted on it
  • Automate manual activities and perform troubleshooting, root cause analysis, and problem management
  • Create observability plans and handle troubleshooting and defect management
  • Manage costs

Required skills and qualifications

  • Cloud experience, AWS preferred
  • Ability to script in one or more languages (Python, Terraform, etc.)
  • Ability to understand structured and object-oriented programming using one or more high-level languages, such as Python, Java, C/C++, Ruby, and JavaScript
  • Experience with distributed computing and storage technologies as well as dynamic resource management frameworks (Apache Mesos, Kubernetes, YARN)
  • Proactive approach to identifying problems, performance bottlenecks, and areas for improvement
  • DevOps and CI/CD experience with GitHub Actions or Jenkins
  • Experience with monitoring software and ability to create dashboards and reports

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

  • Dice Id: 90970970
  • Position Id: 8485175

Job Tags

Contract work, Shift work
