ML Engineer – Distributed Training (FSDP/DDP)

SGS_JOB_2887

Engineering
Remote
Python
PyTorch
C++
Deep Learning
Model Training
Fine Tuning

Contract - 12 Months

The client, an organization focused on making research breakthroughs in AI, is seeking a strong Python/ML Systems Engineer to join its team. Responsibilities include developing and maintaining deep learning libraries that support large-scale distributed training, open-sourcing high-quality code, creating documentation, helping the community onboard and make effective use of the solutions built by the team, and bringing the latest research to products that connect billions of users. The chosen candidate will work with a diverse and highly interdisciplinary team of scientists, engineers, and cross-functional partners, and will have access to cutting-edge technology, resources, and research facilities.

Job Responsibilities:

  • Design, implement, and improve machine learning systems and tools that enable research
  • Apply knowledge of relevant research domains, along with expert coding skills, to platform and framework development projects
  • Write clean and robust machine learning code
  • Create documentation and help users ramp up and become productive with the libraries and tools built by the team

Skills:

  • Experience with machine learning frameworks such as PyTorch or TensorFlow
  • Experience working with large datasets and data pipelines
  • Solid understanding of algorithms, data structures, and software engineering best practices
  • Demonstrated ability to work collaboratively in a fast-paced, team-oriented environment
  • Excellent problem-solving and communication skills

Education/Experience:

  • Degree in Computer Science, Computer Engineering or relevant technical field
  • 2+ years of experience developing machine learning systems in Python or C/C++

Related Jobs

Firmware Software Engineer IV

Engineering
 Washington
12 Months

Location (mandatory): Redmond, WA. The client's research team is looking for an experienced Embedded Software Engineer to develop firmware for a custom SoC. Years of experience: 8 or more.

C++
EMBEDDED C
MCU
RTOS

Firmware Engineer

Engineering
 Washington
12 Months

Location (mandatory): Redmond, WA. We are seeking an Embedded Software Engineer to develop firmware and tools for a variety of AR- and VR-related devices.

STM32
FREERTOS
C++
Python
MCU

Android Engineer

Engineering
Remote
7 Months

Location (mandatory): Remote, USA. The main function of a software engineer is to apply the principles of computer science and mathematical analysis to the design, development, testing, and evaluation of the software and systems that make computers work. A typical software engineer researches, designs, develops, and tests operating-system-level software, compilers, and network distribution software for medical, industrial, military, communications, aerospace, business, scientific, and general computing applications. Years of experience: 4+.

Android Development
Jetpack Compose
UI
iOS

Senior Hardware Engineer – New Technology Investigation

Engineering
 California
80-90/Hr. on W2
12 Months

Location (mandatory): Sunnyvale, CA. The main function of a hardware prototype engineer is to research, design, develop, and test high-density wearable electronics. The candidate will work with emerging technologies in a fast-paced team.

Electrical Engineering
Rigid Flex Boards
USB
MIPI
PCIE
SPI
I2C

Supply Chain Planner

Engineering
 California
58-60
3 Months

Location (mandatory): Sunnyvale, CA. The main function of a Supply Chain Analyst is to define, build, manage, and measure global asset management. A typical Supply Chain Analyst may be responsible for buying goods and services and analyzing the performance of suppliers.

Inventory Management
E2Open
Oracle Fusion

Production Planner

Engineering
 Ohio
6 Months

Location (mandatory): West Chester, OH. The planner will be assigned to a Production/Plant Supervisor and will also lead two production meetings a week. Experience working within an MRP/ERP system.

32-34
Excel
SAP
Production Planning
Materials Management

OS Developer - AOSP

Engineering
 California
90-100/Hr. on W2
12 Months

Location (mandatory): Redmond, WA or Sunnyvale, CA. We are looking for OS developers with strong design and build skills, experience at multiple levels of the OS stack, from drivers to frameworks, and experience building embedded devices. A successful candidate in this role is self-driven, creative, and doesn’t mind delving into different areas of the stack. This person will take initiative and should be willing to execute consistently in an agile, fast-paced environment.

AOSP
C++
JAVA
LINUX
EMBEDDED SYSTEMS

Safety Engineer

Engineering
 Ohio
12 Months

Location (mandatory): Oxford, OH. Leads and promotes a SAFE First Culture. Ensures the facility is in compliance with safety and environmental programs, regulations, and company policy. Performs job hazard analyses to identify the loss potential of systems and processes and recommends appropriate corrective actions. Reviews location standards, policies, and practices as necessary to ensure they are current and consistent with company and/or regulatory requirements. Works with the Safety Manager to revise them as necessary.

EHS
Safety
Training

Firmware Software Engineer V

Engineering
 Washington
12 Months

Location: Redmond, WA. We are looking for a Software Engineer specializing in embedded systems software engineering. The ideal candidate will have hands-on experience in embedded software/firmware development, low-level Android development, and STM32 microcontroller systems. Experience with FPGA platforms (Gowin, Xilinx) is also a benefit. Years of overall experience required: 8-10.

SoC
RTOS
C/C++
STM32 microcontrollers
FPGA platform (Gowin/Xilinx)

At SGS Consulting, we go beyond resume-job matches, creating meaningful connections and pathways for individuals to thrive in defining careers.


2025. All rights reserved.