Senior ML Systems Engineer

SGS_JOB_2874

Engineering
Remote
Python
PyTorch
Distributed ML Training
FSDP / DDP
PyTorch DataLoader
OSS

Contract - 12 Months

ML Systems Engineer III We are seeking a strong ML Systems Engineer to join our Fundamental AI Research team; an organization focused on making research breakthroughs in AI. Responsibilities include developing deep learning libraries that support large-scale distributed training, open sourcing high quality code and reproducible results for the community, and bringing the latest research to client's products for connecting billions of users. The chosen candidate will work with a diverse and highly interdisciplinary team of scientists, engineers, and cross-functional partners, and will have access to cutting edge technology, resources, and research facilities. Years of Experience: 5+ years Degrees Required: Bachelors degree in Computer Science, Computer Engineering or relevant technical field How will performance be measured? - Completion and quality of engineering tasks. Candidate Value Proposition: This is working on cutting-edge Machine Learning training and inference code to create State of the Art research models. It’s working with leading researchers in the field. Cutting edge distributed training for creating state of the art ML models. Candidate Disqualifiers: Pure software engineers will not be a good fit. Experience with Large scale Model training with Pytorch is essential. Difficult Aspects of Job: Strong technical and communication skills will be needed to succeed in a fast-paced and ambiguous environment. Interview Process: 1-2 rounds (1 hour) We’ll setup a pool of interviewers to process the queue quickly. Mostly technical: experience with distributed training. How DDP/FSDP works, what are different parallelism techniques to scale models, what are their tradeoffs, which one would you use in which case, some back of the envelope calculation of memory/throughput requirements, so on.

Job Responsibilities:

  • Engineer, design, implement, and improve highly scalable machine learning systems and tools for enabling research
  • Apply knowledge of relevant research domains, along with expert coding skills, to platform and framework development projects
  • Write clean and robust machine learning code
  • Create documentation and support users in ramping and getting productive with the libraries and tools built by the team.

Skills:

  • 5+ years of Python experience
  • 2+ years of Pytorch experience
  • 0-2 years of Distributed ML Training (FSDP/DDP) experience
  • 5+ years of Dataset / Pytorch DataLoader experience
  • 3+ years of OSS experience
  • Demonstrated software engineering experience via work experience, or widely used contributions in open source repositories (e.g. GitHub)
  • Prior contributions to open-source AI/ML projects

Education/Experience:

  • 2+ years of experience with deep learning
  • 2+ years of experience developing machine learning algorithms in Python or C/C++
  • Experience with machine learning frameworks such as PyTorch or Tensorflow
  • Experience working with large datasets and data pipelines
  • Solid understanding of algorithms, data structures, and software engineering best practices.
  • Demonstrated ability to work collaboratively in a fast-paced, team-oriented environment.
  • Excellent problem-solving and communication skills.

Related Jobs

Audio Software Engineer

Engineering
 Washington
6 Months

Location: Redmond, WA We are seeking a highly skilled and motivated Software Engineer to join our specialized engineering team. This role is centered on the development of sophisticated software for advanced hardware control and lab automation, with a primary focus on aero-acoustic wind tunnel systems. In this role, you will use Python to design, build, and enhance control mechanisms for both a classic recirculating wind tunnel and a novel modular fan-array wind tunnel. This position offers a unique and exciting opportunity to work at the intersection of software development, robotics, acoustics, and aerodynamics.

Python
Audio
Robotics
Data Acquisition
Signal Processing

Software Engineer II

Engineering
 Washington
04 Months

This role sits at the center of our team's expansion and will have a direct impact on documentation, processes, and customer relationships. Work will vary from tool evaluation and integration to day-to-day hands-on lab and infrastructure support. In addition to core engineering responsibilities, this role requires a strong grasp of AI technologies and hands-on proficiency with AI-assisted development tools such as Claude Code. As AI reshapes how engineering work gets done, this engineer will be expected to leverage AI tooling in their own workflows and help drive adoption across the team. This role will operate as a point of contact for assigned projects, coordinate internal resources, and be supported by supervisors and technical resources. This person must be willing to take direction from leadership while also providing guidance and driving task ownership across team members. Our goal is to build and maintain strong, long-lasting customer relationships.

AI Assisted Development Tool – Claude Code
Python
LinuxOS
System Administration

Systems Engineer III

Engineering
Remote
12 Months

The main function of a systems engineer is to apply the principles of computer science and mathematical analysis to the design, development, testing, and evaluation of the software and systems that make computers work. A typical systems engineer analyzes user needs, and then designs, tests, and develops software to meet those needs.

Systems Engineering
Server Lifecycle
Troubleshooting
Test
TCP/IP Network
Linux
Server Management.

Data Engineer IV – Marketing & Ads

Engineering
Remote
08 Months

Location - Remote

SQL
Python
ETL
Marketing
Ads
Large Data Sets

Linguist III - German and Turkish Both

Engineering
Remote
09 Months

Location - Remote

Linguistics
Phonetics
Phonology
Regex
SQL
Python
Native Speaker of German or Turkish

Network Engineer II

Engineering
 California
$67 - $75/Hr. on W2
06 Months

Contract - 06 Months Pay Rate : $67 - $75/Hr. on W2 The Network/System Lab Engineer will support the our team by bringing up and maintaining the network lab environment. Responsibilities include integrating and troubleshooting new network equipment, coordinating delivery and installation of new products, testing switch testbeds for software validation, and maintaining accurate lab inventory records. The ideal candidate will have hands-on experience with network devices, strong problem-solving skills, and the ability to work in a fast-paced lab setting. Must be detail-oriented, organized, and able to communicate effectively with cross-functional team

IP
Cisco IOS
JunOS
Python
Power Supply
Network Device Bring-Up
Lab Work

Application Engineer – Rockwell

Engineering
 South Carolina
6 Months

Location (mandatory): Saint George, SC Seeking a skilled engineer to deliver high-quality designs and testing in accordance with project specifications, standards, budgets, and timelines. This role involves providing centralized software support for critical BMS applications. The successful candidate will collaborate with a team to design, install, commission, and service building automation and facility management systems.

SCADA
PLC
Rockwell
BMS

Analyzing Technician

Engineering
 Illinois
3 Months

Location (mandatory): Elgin, IL We are seeking a highly skilled Analysis Technician to join our dedicated team for the 1st shift. Based at our state-of-the-art facility, you'll be responsible for board-level analysis of failed 2-way subscriber products (both mobile and portable 2-way radios). This is an exciting opportunity to leverage your technical expertise in testing, diagnosing, and repairing electronic assemblies to identify the root causes of failures.

Electrical Engineer
RF

IT Manager

Engineering
British Columbia
4 Months

Location (mandatory): Vancouver, BC Maintain the lab infrastructure, network infrastructure, and asset/equipment inventory. Maintain and troubleshoot hardware, including: Cameras Workstations Servers Switches

System Administration
Camera Installation and Troubleshooting
logo

At SGS Consulting, we go beyond resume-job matches, creating meaningful connections and pathways for individuals to thrive in defining careers.


© 2026 All rights reserved.
logologologologo