Site Reliability Engineer
New York, New York
Employer: Insight Global
Industry: Programmer / Developer
Salary: Competitive
Job type: Part-Time
Day-to-day
Insight Global is looking for a Site Reliability Engineer to sit FULLY REMOTE for a large streaming service. This SRE will work with a team 9 engineers, 3 of them based offshore (Database Reliability Engineers, CI/CD engineers and managers in the larger SRE organization). The SRE will provide hands-on technical skills and be responsible for maintaining the cloud systems utilized to operate Direct-to-Consumer platforms to web and mobile applications. The SRE will provide a software-driven approach to operations, managing infrastructure as code (managing network through readable files instead of hardware), leveraging deployment pipelines, with a focus on automation, observability, and resiliency. This person must have hands-on experience with software development as well as maintaining the network and infrastructure of cloud platforms. This team will be building data pipelines and ensure product launches are running smoothly. An ideal candidate will come from a software engineering role that then progressed to SRE or DevOps.
We are a company committed to creating diverse and inclusive environments where people can bring their full, authentic selves to work every day. We are an equal opportunity/affirmative action employer that believes everyone matters. Qualified candidates will receive consideration for employment regardless of their race, color, ethnicity, religion, sex (including pregnancy), sexual orientation, gender identity and expression, marital status, national origin, ancestry, genetic factors, age, disability, protected veteran status, military or uniformed service member status, or any other status or characteristic protected by applicable laws, regulations, and ordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application or recruiting process, please send a request to HR@insightglobal.com .
To learn more about how we collect, keep, and process your private information, please review Insight Global's Workforce Privacy Policy: https://insightglobal.com/workforce-privacy-policy/ .
Required Skills & Experience
Must Haves
3-5 years of Site Reliability Engineering experience previous experience within software development and/or engineering.
o Infrastructure as code experience (IaC), must have Terraform.
o Knowledge within Kubernetes and Docker for containerization
o Experience building out CI/CD pipelines
Experience with GKE is mandatory
Ability to write helm charts from scratch (not just used existing ones)
Worked with Airflow or Cloud Composer (knowing how to manage as well)
Programming experience GoLang, Python, Java, and/or JS
Cloud experience in AWS
Ability to solve problems independently
Sharp communication skills
Nice to Have Skills & Experience
Plusses
Experience with monitoring tools (preferably Prometheus/Grafana/Alertmanager).
Coding experience (Go, Python, Java)
GCP Cloud Experience knowing how to use internal systems
o Familiar with technologies within GCP data flow, networking, big query, big table, etc.
Basic networking (Load Balancing, Routing, Security Groups, VPC, Subnetting)
Working at well-known tech companies
Worked with Airflow
Benefit packages for this role will start on the 31st day of employment and include medical, dental, and vision insurance, as well as HSA, FSA, and DCFSA account options, and 401k retirement account access with employer matching. Employees in this role are also entitled to paid sick leave and/or other paid time off as provided by applicable law.
Insight Global is looking for a Site Reliability Engineer to sit FULLY REMOTE for a large streaming service. This SRE will work with a team 9 engineers, 3 of them based offshore (Database Reliability Engineers, CI/CD engineers and managers in the larger SRE organization). The SRE will provide hands-on technical skills and be responsible for maintaining the cloud systems utilized to operate Direct-to-Consumer platforms to web and mobile applications. The SRE will provide a software-driven approach to operations, managing infrastructure as code (managing network through readable files instead of hardware), leveraging deployment pipelines, with a focus on automation, observability, and resiliency. This person must have hands-on experience with software development as well as maintaining the network and infrastructure of cloud platforms. This team will be building data pipelines and ensure product launches are running smoothly. An ideal candidate will come from a software engineering role that then progressed to SRE or DevOps.
We are a company committed to creating diverse and inclusive environments where people can bring their full, authentic selves to work every day. We are an equal opportunity/affirmative action employer that believes everyone matters. Qualified candidates will receive consideration for employment regardless of their race, color, ethnicity, religion, sex (including pregnancy), sexual orientation, gender identity and expression, marital status, national origin, ancestry, genetic factors, age, disability, protected veteran status, military or uniformed service member status, or any other status or characteristic protected by applicable laws, regulations, and ordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application or recruiting process, please send a request to HR@insightglobal.com .
To learn more about how we collect, keep, and process your private information, please review Insight Global's Workforce Privacy Policy: https://insightglobal.com/workforce-privacy-policy/ .
Required Skills & Experience
Must Haves
3-5 years of Site Reliability Engineering experience previous experience within software development and/or engineering.
o Infrastructure as code experience (IaC), must have Terraform.
o Knowledge within Kubernetes and Docker for containerization
o Experience building out CI/CD pipelines
Experience with GKE is mandatory
Ability to write helm charts from scratch (not just used existing ones)
Worked with Airflow or Cloud Composer (knowing how to manage as well)
Programming experience GoLang, Python, Java, and/or JS
Cloud experience in AWS
Ability to solve problems independently
Sharp communication skills
Nice to Have Skills & Experience
Plusses
Experience with monitoring tools (preferably Prometheus/Grafana/Alertmanager).
Coding experience (Go, Python, Java)
GCP Cloud Experience knowing how to use internal systems
o Familiar with technologies within GCP data flow, networking, big query, big table, etc.
Basic networking (Load Balancing, Routing, Security Groups, VPC, Subnetting)
Working at well-known tech companies
Worked with Airflow
Benefit packages for this role will start on the 31st day of employment and include medical, dental, and vision insurance, as well as HSA, FSA, and DCFSA account options, and 401k retirement account access with employer matching. Employees in this role are also entitled to paid sick leave and/or other paid time off as provided by applicable law.
Created: 2024-04-27
Reference: 341934
Country: United States
State: New York
City: New York
ZIP: 10036
Similar jobs:
-
Principal Engineer, Site Reliability Engineering, Core
Google in New York, New York -
Engineer II, Quality & Reliability Engineering
Volt in Islandia, New York💸 $54 - $60 per hour -
Site Reliability Engineer - USDS
TikTok in New York, New York -
Senior Site Reliability Engineer - FedRAMP
Tenable Network Security in New York City, New York💸 $128000.00 per year -
Reliability Engineer
Two Sigma Investments, LLC. in New York, New York -
Site Reliability Engineer, Global E-commerce - USDS (Multiple Positions)
TikTok in New York, New York💸 $145000 - $250000 per year -
Site Reliability Engineer - USDS (NY)
TikTok in New York, New York -
Site Reliability Engineer, Recommendation Infrastructure - USDS
TikTok in New York, New York -
Principal Engineer, AI, Trust, Security Site Reliability Engineering
Google in New York, New York -
Quality and Reliability Engineer II
Insight Global in Hauppauge, New York -
Technical Program Manager III, Site Reliability Engineering, Google Cloud
Google in New York, New York -
Associate - Reliability Production Engineer
Morgan Stanley in New York, New York💸 $100000 - $150000 per year -
Director - Site Reliability Engineering
American Express in New York, New York💸 $170000.00 per year -
Site Reliability Engineer (SRE), TikTok Ads Serving- USDS
TikTok in New York, New York -
Quality and Reliability Engineering Engineer II
Compunnel in Hauppauge, New York -
Site Reliability Engineer, Data Platform- USDS
TikTok in New York, New York -
Senior Site Reliability Engineer
MongoDB in New York City, New York -
Infrastructure Site Reliability Engineer (Entry Level) - USDS
TikTok in New York, New York -
Site Reliability Engineering Manager, Apple Services Engineering
Apple in New York City, New York -
Site Reliability Engineer, Data Engineering - USDS
TikTok in New York, New York