System Software Analyst - Los Alamos, NM - DOE Q (1176244)

Los Alamos, New Mexico


Employer: Day & Zimmermann Group
Industry: Information Technology
Salary: Competitive
Job type: Full-Time

Systems Analyst - Los Alamos, NM - DOE Q

As a Systems Analyst, you will provide technology consulting to external customers and internal project teams. You will be responsible for providing technical support and/or leadership in the creation and delivery of technology solutions designed to meet customers' business needs and, consequently, for understanding customers' businesses. As a trusted advisor, create and maintain effective customer relationships to ensure customer satisfaction. Maintain knowledge of leading-edge technologies and of the industry/market domain. Actively contribute to the company's solutions portfolio by providing information ranging from technical knowledge to methodologies based on experience gained from customer projects. Shape technical direction and technical strategies within the organization and for external customers. Accountable for consistent and significant chargeability levels (or expense relief for internal project teams) and for assisting in meeting or exceeding revenue and customer satisfaction goals. Contribute to the organization's profitability by generating and cultivating new business opportunities and by providing technical support for deal proposal development.

Role Requires:
  • Active Department of Energy (DOE) Q Clearance, or one held within the past 3 years; if the clearance is not currently active, the candidate must not foresee a problem with its reinstatement.
  • Role will be on-site at the customer location in Los Alamos, NM

Contributions include applying developed subject matter expertise to solve common and sometimes complex technical problems and recommending alternatives where necessary. Might act as project lead and provide assistance to lower-level professionals. Exercises independent judgment and consults with others to determine best method for accomplishing work and achieving objectives.

Responsibilities:
  • Provide on-site system administration and HPC application consulting services.
  • Address and resolve the current top issues in the HPC environment.
  • Maintain HPC system availability for the customer
  • Monitor system performance and provide recommendations for improvements.
  • Collaborate with team members and stakeholders to deliver high-quality support and solutions.
  • Create and document site procedures, system diagrams, and other configuration or support documents
  • Maintain system software and firmware revisions, including patches, updates, and OS upgrades
  • Solve system hardware, software, and third-party software issues, and provide detailed and thoughtful analysis of the problem and solution
  • Gather data, perform analysis, and escalate problems to higher-level product support groups and appropriate management when necessary to ensure timely resolution of system or customer issues
  • Provide solutions and implement repair or workarounds when possible, fully documenting steps taken when required
  • Manage software issues for both the system and user applications, submitting and tracking bugs as required

Education and Experience Required:
  • Bachelor's degree in Computer Science, Engineering, or related area of study
  • 4+ years HPC-related experience, ideally with large-scale HPC and parallel file system administration and support
  • Without a degree, three additional years of relevant professional experience (7+ years in total)

Knowledge and Skills:
  • Understanding of an HPC Data Center IT Operations environment
  • Expertise in HPC application consulting and support.
  • Strong system administration skills, particularly in HPC environments.
  • Extensive knowledge and experience with Linux operating systems (RHEL or SLES)
  • Experience with job scheduling and resource management tools.
  • Experience with various HPC hardware architectures and software stacks.
  • Knowledge of parallel file systems (e.g., Lustre, GPFS)
  • Familiarity with containerization technologies (e.g., Docker, Singularity).
  • Experience with scripting and automation tools (e.g., Python, Bash, Ansible).
  • Familiarity with cybersecurity best practices in HPC environments.
  • Ability to lead and work effectively in a team environment
  • Direct experience and demonstrated proficiency with multiple programming and scripting languages (e.g., Perl, Python, C, FORTRAN) preferred
  • Ability to maintain system software, utilizing debugging tools for problem isolation; will perform software builds, software upgrades, and patch installation as needed
  • Excellent interpersonal, customer relations and problem management skills, with the ability to stay calm and professional under pressure while working to strict deadlines
  • Experience with project planning and management, process management, and team or project leadership preferred
  • Able to clearly document processes and procedures with a focus on mentoring and knowledge sharing
  • Occasional travel for training is required

Additional Beneficial Experience and Skills:
  • Security+ Certification
  • Familiarity with ticket-tracking software (any ticket-tracking system)


Employment Prerequisites:
The following requirements must be met to be eligible for this position: successful completion of a background investigation and a drug urinalysis.

SOC, a Day & Zimmermann company, is an Equal Opportunity Employer.

Created: 2024-09-06
Reference: 232699
Country: United States
State: New Mexico
City: Los Alamos