HPC SUPPORT ENGINEER
Descripción de la oferta de empleo
As a next generation digital business with worldwide leading positions in digital, cloud, data, advanced computing and security, it brings deep expertise for all industries in more than 47 countries.
By uniting unique high-end technologies across the full digital continuum with world-class talents, Eviden expands the possibilities of data and technology, now and for generations to come.
HPC Support Engineer.
A High-Performance Computing (HPC) support engineer plays a vital role in maintaining and optimizing computing environments, which are used by research institutions, industries, and organizations for tasks that require significant computational power, such as scientific simulations, large-scale data analysis, machine learning, and engineering computations.
Role Expectations.
HPC systems are often clusters of interconnected servers.
The engineer is responsible for the administration of these clusters, which includes installation, configuration, and maintenance of hardware and software.
Linux is the dominant OS in HPC environments.
The engineer ensures that the OS is updated, secure, and optimized for high-performance workloads.
HPC environments use job schedulers (e.
., SLURM, PBS, or LSF) to allocate resources efficiently.
The engineer manages these schedulers to ensure optimal job performance, queue management, and fair distribution of resources among users.
Designing and managing backup solutions for large volumes of data, ensuring minimal data loss in case of hardware failures or other disasters.
Interactions with SMC (Smart Management Center) which is the foundation for hosting infrastructure and application micro-services dedicated in managing a HPC supercomputer.
Support and maintain technology standards, processes, and policies related to on-prem/cloud Infrastructure in scope.
Contribute to international projects by providing consultancy regarding HPC infrastructure architectures (on-premises and cloud).
Suggest system changes in accordance with documented SOPs.
Produce and maintain appropriate documentation and diagrams describing system setups and overall inventory.
Capabilities and Expertise.
System Administration Red Hat expertise.
Networking, expertise in configuring and troubleshooting networking setups within HPC clusters, including understanding low-latency interconnects like InfiniBand or Omni-Path.
Scripting Proficiency, use scripting languages such as Bash, Python, or Perl for automating routine tasks like cluster monitoring, user onboarding, or job submissions.
Configuration Management, familiarity with tools like Ansible, Puppet, or Chef to automate the deployment and configuration of cluster nodes and services.
System Monitoring, Implement and manage monitoring tools (e.
., Prometheus, Grafana) to track system health, detect performance bottlenecks, and identify potential hardware or software failures.
Storage Management, familiarity with large-scale storage systems such as GPFS, Lustre, or NFS, and the ability to troubleshoot file system issues.
Nice to have.
Supercomputers knowledge, and understanding of advanced supercomputing platforms (e.
., Cray, IBM Blue Gene).
Experience with submitting Jobs in Schedulers like LSF, Slurm, GridEngine, etc.
Parallel Computing, experience with parallelism (shared and distributed memory architectures), MPI (Message Passing Interface), and OpenMP.
Why Join Us? Training and Certifications.
Access ongoing training and certifications for current and emerging technologies.
Hybrid Schedule.
Enjoy the flexibility of a hybrid work schedule.
Flexible Benefits.
Receive a range of benefits, including private medical insurance, company-supported CSR, sports and leisure activities, lunch vouchers, mobile phones, and laptops.
Reimbursement.
Get a yearly fixed amount for reimbursement.
Performance Bonus.
Earn an annual performance bonus based on your achievements.
Career Advancement.
Explore numerous opportunities for professional growth and career advancement.
Extra Vacation Days.
Take advantage of additional vacation days to relax and recharge.
Let’s grow together.
Detalles de la oferta
- Sin especificar
- En todo México
- Sin especificar - Sin especificar
- 20/11/2024
- 18/02/2025
Act as the l2/l3 (level-2/level-3) support to solve the customer issue collaborate with cross-group peers both proactively and reactively... please check review the below requirement and reply me back with your updated resume, contact details at the earliest and feel free to call/mail me at *****@*****......
As a full stack engineer assigned to the product/project ensure performance, maintainability, and functional requirements from design, development, testing to rollout and support... responsibilities spend 90% of your time actively designing and coding in support of the immediate team......
Importante empresa multinacional de origen suizo esta en búsqueda de tu talento como: project engineer escolaridad: ingeniero mecánico (titulado)... inglés avanzado (la entrevista es en inglés) edad: 26 a 30 años experiencia laboral de 3 a 4 años como mechanical engineer o en ingeniería de proyectos......
Responsibilities support the entire application lifecycle (concept, design, test, release, and support)... writing clean, readable, and testable code... experience with angular and typescript... research and suggest new mobile products, applications, and protocols... we must operate as one, supporting......
Essential: graduated industrial engineer or related... excellent technical and problem-solving skills... the position requires constant communication with clients abroad... project management and supervision... easy communication with clients... preferably more than one year experience in the automotive......
We are looking for an enthusiastic and motivated “senior system engineer” in our development team... candidateshould have good communication skills, be a team player and be able to lead a team... requisitos del puestogood knowledge of the architecture of the tandem platform good knowledge in handling......
Мы сосредоточены на создании, тестировании, развертывании приложений и инфраструктуры, которые помогут другим командам быстро масштабироваться, взаимодействовать, интегрироваться с данными в реальном времени и включать машинное обучение в свои продукты... — знание spark — знание scala и/или python —......
Administrative department business analyst, payroll manager, marketing specialist, administration supervisor, human resources officer, financial analyst, senior marketing analyst, logistics coordinator / expert, procurement officer, secretary / office assistants / office clerks / front desk clerks, account......
Requisitos del puestoqualifications:m/f,single or married , college level or grad... any course, with or without exp... administrative department business analyst, payroll manager, marketing specialist, administration supervisor, human resources officer, financial analyst, senior marketing analyst, logistics......
Description the newrich network is hiring a customer service agent to handle incoming support tickets for our digital platforms... requisitos del puestorequirements communicates with customers in english only reports to customer success manager; promptly and accurately respond to our customer’s inquiries......