Subject Matter Expert\/Senior Site Reliability Engineer
Reston, Virginia
Employer: Compunnel
Industry:
Salary: Competitive
Job type: Part-Time
Top skills:
Expertise SRE Concepts.
Be able to message correctly with - Enterprise architect, Portfolio managers, Business Leaders.
Deep dive on operations excellence, observability, interoperability, Dynatrace, splunk, OpenTelemetry.
Be able to run running training for large audience.
Level 2 of SRE
Drive chaos engineering - used to use gremlin â now AWS FIS (Fault injection simulator).
Resiliency.
AWS Resiliency Hub.
Education/Experience:
AWS technologies
ECS, EC2, Lambda, Step Functions, EMR, Glue, S3, RDS, DynamoDB
Configuring Health Checks and Implementing Alarms in AWS
Identify CloudWatch metrics for the AWS services - EC2, ECS, EMR, Glue-ETL, S3, RDS, Lambda, DynamoDB, Redshift
Be able to classify the critical metrics and create CW Alarms using the AWS Console and also using Terraform templates.
Splunk, Dynatrace
Create and manipulate dashboards / metrics.
Disaster Recovery / Failover Scenarios.
Understand how to make an environment resilient and highly available.
Influencing:
Candidate should be able to convince internal customers (AppDev teams) benefits of shifting left, leveraging SRE model for performance, availability, resiliency, etc.
Ability to clearly explain these things
Confidence and communication must have skills.
Understand how to implement KPIâs:
MTTR (Mean Time to Resolve) and MTTD (Mean Time to Detect), SLO, SLI, Error Budgets.
Highly desired skills:
Disaster Recovery / Failover Experience â Load Balancing, Resiliency, High Availability.
Education: Bachelors Degree
Additional client information:
Expertise SRE Concepts.
Be able to message correctly with - Enterprise architect, Portfolio managers, Business Leaders.
Deep dive on operations excellence, observability, interoperability, Dynatrace, splunk, OpenTelemetry.
Be able to run running training for large audience.
Level 2 of SRE
Drive chaos engineering - used to use gremlin â now AWS FIS (Fault injection simulator).
Resiliency.
AWS Resiliency Hub.
Education/Experience:
AWS technologies
ECS, EC2, Lambda, Step Functions, EMR, Glue, S3, RDS, DynamoDB
Configuring Health Checks and Implementing Alarms in AWS
Identify CloudWatch metrics for the AWS services - EC2, ECS, EMR, Glue-ETL, S3, RDS, Lambda, DynamoDB, Redshift
Be able to classify the critical metrics and create CW Alarms using the AWS Console and also using Terraform templates.
Splunk, Dynatrace
Create and manipulate dashboards / metrics.
Disaster Recovery / Failover Scenarios.
Understand how to make an environment resilient and highly available.
Influencing:
Candidate should be able to convince internal customers (AppDev teams) benefits of shifting left, leveraging SRE model for performance, availability, resiliency, etc.
Ability to clearly explain these things
Confidence and communication must have skills.
Understand how to implement KPIâs:
MTTR (Mean Time to Resolve) and MTTD (Mean Time to Detect), SLO, SLI, Error Budgets.
Highly desired skills:
Disaster Recovery / Failover Experience â Load Balancing, Resiliency, High Availability.
Education: Bachelors Degree
Additional client information:
Created: 2024-05-14
Reference: PANDC4905646
Country: United States
State: Virginia
City: Reston
Similar jobs:
-
Support Engineer V, ADSP:GAP Reliability Engineering
Amazon in Arlington, Virginia💸 $96800 per year -
Supplier Quality Engineer, Infrastructure Reliability \u0026 Quality
Amazon in Herndon, Virginia -
Site Reliability Engineer (SRE)
SilverEdge in Reston, Virginia -
Maintenance/ Reliability Engineer
Gables Search Group in Charleston, West Virginia -
Reliability Maint Engineer
WestRock in Hopewell, Virginia -
Secret Senior Reliability Engineer
Insight Global in Hampton, Virginia -
Hardware Reliability Engineer, Infrastructure Reliability \u0026 Quality
Amazon in Herndon, Virginia -
Hardware Reliability Engineer, Infrastructure Reliability \u0026 Quality
Amazon in Herndon, Virginia -
Site Reliability Engineer TS/SCI with Polygraph
General Dynamics Corporation in Herndon, Virginia💸 $140250 - $189750. per year -
Supplier Quality Engineer, Infrastructure Reliability \u0026 Quality
Amazon in Herndon, Virginia -
Site Reliability Engineering (SRE) Lead (W2 ONLY)
System One Holdings, LLC in Reston, Virginia -
Sr. Infrastructure Reliability Engineer, Infrastructure Reliability \u0026 Quality
Amazon in Herndon, Virginia -
TypeScript CDK| Remote | Site Reliability Engineer (SRE)
Insight Global in Lorton, Virginia -
Sr. Infrastructure Reliability Engineer, Infrastructure Reliability \u0026 Quality
Amazon in Herndon, Virginia -
Electrical and Instrumentation Reliability Engineer II
Gables Search Group in Hopewell, Virginia -
Software Engineering /Cloud Reliability Engineer (TS-SCI)
L-3 Technologies in Springfield, Virginia -
E&I Reliability Engineer
WestRock in West Point Mill, Virginia -
Site Reliability Engineer TS/SCI with Polygraph
General Dynamics Corporation in Herndon, Virginia💸 $140250 - $189750. per year -
Sr. Infrastructure Reliability Engineer, Infrastructure Reliability \u0026 Quality
Amazon in Herndon, Virginia -
Senior Electrical Maintenance and Reliability Engineer
Jacobs in Hampton, Virginia