AI Data Specialist

SGS_JOB_3380

Engineering
 California
SQl
Python
GENAI
Model Training
STEM Background

Contract - 12 Months

Location (mandatory): Menlo Park, CA • Despite rapid advancements in generative AI, achieving high-quality generation remains a challenge. This is primarily due to the scarcity of high-quality training data and the lack of reliable and robust evaluation metrics that can effectively capture subtle details in model outputs, which can significantly impact user experience. • Our team is dedicated to developing comprehensive data curation and evaluation solutions to enhance our model across various quality dimensions. These include visual quality, prompt adherence, identity preservation, naturalness, and visual text generation, among others. We employ diverse approaches, such as sourcing billions of images and identifying suitable ones through a combination of manual annotations and signals from machine learning models. We also utilize both manual and automated evaluation methods to pinpoint quality gaps and data requirements.

Job Responsibilities:

  • Data Curation: Manage data labeling workflows, including data enqueueing for labeling, UI for labeling, and extracting labels into datasets for the modeling team.
  • Data Engineering (Pipelines): Maintain large-scale, efficient, and reliable data processing pipelines (billions of images). This includes data sourcing, running machine learning models to understand content, and using LLMs to clean data.
  • Data Engineering (Governance): Maintain our portfolio of datasets, ensuring governance of access, retention, and privacy compliance.
  • Annotations
  • Spend time manually annotating training data based on modeling team requirements.
  • Use of LLMs and other models to annotate training data or to evaluate generated content. Then apply auditing to understand this model performance.
  • Analysis: Collaborate with engineers to identify and summarize model gaps based on evaluations. Utilize these findings to identify necessary data, and then mine and prepare that data for subsequent model training iterations.

Skills:

  • Verbal and written communication skills, problem solving skills, and interpersonal skills.
  • Attention to details and an aptitude to experimental investigations
  • Basic ability to work independently and manage one’s time.
  • Basic knowledge of Python, and SQL.
  • Basic knowledge of computer vision and generative models.
  • Basic knowledge with data ETL workflows & pipelines.
  • Usage of LLM for data labeling related work.
  • Verbal and written communication skills, problem solving skills, and interpersonal skills.
  • Basic knowledge of Python, Unix, and SQL.
  • Basic knowledge of computer vision and generative models

Education/Experience:

  • Associate's degree or equivalent training required in Computer Science, Electronic Engineering, Physics, Bioinformatics, or other STEM subjects.
  • Prior industrial experience in software development and testing and / or research experience in human computer interaction are preferred.
  • Worked at MAANG before is preferred
  • 1-2 years experience

Related Jobs

OpenXR Engineer

Engineering
 Washington
12 Months

Location (mandatory): Redmond, WA We’re looking for a Worker to help build and maintain OpenXR-based XR applications. You’ll work closely with engineers and cross-functional partners to prototype, implement, and iterate on immersive experiences that run across OpenXR-capable runtimes/devices.

OpenXR
C++
Unreal Engine

Solution Architect - ARAS

Engineering
Remote
3 Years

Location (mandatory): Remote Designs and defines system architecture and solutions for integrating multiple platforms, operating systems, cloud and applications. Determines systems specifications, input/output processes, and working parameters for hardware/software/cloud compatibility and maintenance of system security. Coordinates design of subsystems and integration of total system. Identifies, analyzes, and resolves program support deficiencies. Develops and documents the framework for integration and implementation for changes to technical standards and overall enterprise architecture. Assists in the development and management of an architecture governance process. Develops and recommends corrective actions and system solutions. Provides technical guidance for database administrators, software developers, and other stakeholders. Understands and determines requirements of infrastructure, network, and security in third party cloud offerings. Translates business requirements into functional and technical architecture. May guide decisions on which technologies to implement, operational planning and useful life of products. The individual selected for this role must have in depth Product Life Cycle Management (PLM), Systems Engineering background, Model Based Systems Engineer experience. Additionally, this individual must have experience developing and integrating ARAS Innovator platform

PLM
ARAS
SQL
Deployment
Integration

AR/VR Systems Engineer - Calibration

Engineering
 Washington
12 Months

Location (mandatory): Redmond, WA The main function of a software engineer is to apply the principles of computer science and mathematical analysis to the design, development, testing, and evaluation of the software and systems that make computers work. A typical software engineer researches, designs, develops and tests operating systems-level software, compilers, and network distribution software for medical, industrial, military, communications, aerospace, business, scientific and general computing applications.

C++
Bash
ROS
Calibration
IMU
Sensors
Data Collection

Maintenance Worker II

Engineering
 Massachusetts
06 months

Location:- Framingham MA 01701 We will be looking for a mixture of Commercial /Industrial plumbing experience and HVAC (AHU’s, exhaust fans, etc.) Commercial Maintenance/ General Maintenance

Operate
troubleshoot
and repair piping systems and equipment

Software Engineer

Engineering
 Massachusetts
3 Years

Location (mandatory): Lexington, MA Network performance data collection and analysis experience Experience with data visualization frameworks (Grafana or similar tools)

TCP/IP
Linux
Python
CI/CD
Devops

BMS Technician

Engineering
 Tennessee
12 Months

Location (mandatory): Franklin, TN Working with customers to provide service and repair solutions. Completing scheduled preventative maintenance and QC. Documentation of recommended repairs, completion of repairs. On call rotation. NW Commissioning, construction PTP, FPT commissioning as needed.

BMS
PLC
BAS

HVAC-Service/Install Technician

Engineering
 Virginia
3 Months

Location (mandatory): Ashburn, VA Area of work: Able to drive within a 60 mile radius servicing sites. Work Environment & Schedule: Field based role with daily travel to customer locations

HVAC
Refrigeration
EPA

Coupa Consultant

Engineering
Remote
6 Months

Client is implementing Coupa for global Indirect Procurement. This role supports and enhances Coupa’s P2P, Supplier Information Management (SIM), Core, and Coupa Risk Assess (CRA) modules across global regions. The Business Systems Analyst works closely with Procurement, Finance, and IT stakeholders to deliver functional enhancements, maintain system stability, and drive process improvements aligned with enterprise Procure to Pay operations. Preferred Experience (Nice-to-Have): Prior experience implementing or supporting global Procure to Pay or Supplier Management processes. Familiarity with Agile delivery frameworks. Experience with Coupa configuration, workflow design, or data analysis.

Coupa
SIM
P2P

Data Analyst

Engineering
Remote
6 Months

JOB DESCRIPTION: As a part of the Data Analytics team within the North America - Project Management Office (NA-PMO), you will be partnering with field organizations, finance, and Information Technology (IT) teams to initiate and support data-informed decision making in the underlying business.

Tableau
SQL
PowerBI
Appscripts
logo

At SGS Consulting, we go beyond resume-job matches, creating meaningful connections and pathways for individuals to thrive in defining careers.


© 2026 All rights reserved.
logologologologo