Current Statistics

1,748,663 Total Jobs
393,294 Jobs Today
17,936 Cities
222,695 Job Seekers
146,729 Resumes

 

Site Reliability Engineer - Charlotte North Carolina

Company: Recurring Decimal
Location: Charlotte, North Carolina
Posted On: 05/04/2024

Site Reliability Engineer
Location- Hybrid - Charlotte, NC or Phoenix, AZ

Key Skills:

Experience with one or more Cloud Platforms (Azure, GCP)
Experience with Container technologies: Kubernetes, Docker, PKS, Azure Kubernetes Service (AKS)
5+ years of experience in Site Reliability engineering.
Must have two to three years of experience in Azure.
Experience setting up monitoring in applications and database.
Experience in ServiceNow, Jira, Confluence, Splunk, Azure Monitor, Google Cloud Monitoring.
Experience in third party services and third-party management
Excellent verbal, written, and interpersonal communication skills.



Responsibilities:

This role will be responsible for monitoring the applications and responding to events, incidents and changes originating from internal or vendor applications. Investigate incidents and problems and determine root cause. Will use ServiceNow, Jira, Confluence, Splunk, Azure Monitor, Google Cloud Monitoring.
Troubleshoot and resolve issues in live production environments and implement strategies to eliminate them with minimal support.
Manage applications through automation.
Support and monitor new and existing services, platforms, and application stacks.
Engage in improving the lifecycle of services deployment, operations, and refinement.
Provide technical expertise during service impacting events.
Collaborate with other engineers on code reviews, internal infrastructure improvements and process enhancements.
Use scalability testing to measure, tune and optimize system performance.
Participate in periodic 24x7 on-call duties.
Being accountable for resolving the outage via workaround or permanent fix
Ensuring all administration and reports are maintained and up to date including contacts information technical diagrams post major incident reviews.
Responsible for communicating with various stake holders & shipping IT Communication.
Responsible for the effective implementation of the process Incident, Change and Problem Management and conducts the respective reporting procedure.
Ensure the closure of all resolved and end-user confirmed Incident records.
Establish continuous process improvement cycles where the process performance activities roles and responsibilities policies procedures and supporting technology is reviewed and enhanced where applicable.
Headed Proof-of-Concepts on Splunk implementation, splunk indexing and plugins, mentored and guided other team members on Understanding the use case of Splunk.
Knowledge on Splunk Enterprise Deployments and enable continuous integration as part of configuration using (props.conf, Transforms.conf, Input.conf & Output.conf, Deployment.conf) management.
Knowledge of log parsing, complex Splunk searches, including external table lookups, Splunk data flow, components, features, and product capability.
Knowledge in setting up alerts and Monitoring recipes from the Machine generated data. More...

Send this job to a Friend     


Register an account with us and set up job agents! We'll email you immediately when jobs like this are posted on our site.


Your Account
Email:
Password:
Register a New Account

Can't find what you're looking for? Try searching here:
Google
 
Web www.localjobboard.com

Copyright 2024 LocalJobBoard.com. All Rights Reserved.

RSS Job Feeds

Site Reliability Engineer: Charlotte, North Carolina job search information from LocalJobBoard.com

Recruiter expertise by Recruiter Media Corporation

Job Offers Search Engine

Charlotte North Carolina job: Site Reliability Engineer, Charlotte North Carolina job search