Indian Linguist III

SGS_JOB_3668

Engineering
Remote
Hindi
Tamil
Malayalam
Marathi
Bengali
Punjabi
Nepali
Kannada
Telugu
Python
Data Analysis
RegEx
Linguistics

Contract - 9 Months

Required YOE: 0-3 years. Must have a graduate degree in Linguistics. Must be a native speaker of a non-English language (preferably Hindi) with a high level of proficiency in another Indo-Aryan or South Dravidian language, plus broad knowledge of other languages in either of those two groups.

Job Responsibilities:

  • Perform linguistic analyses on large datasets.
  • Perform linguistic error analysis of AI model outputs, determining what the most frequent and severe error categories are.
  • Write and revise guidelines for human annotation and other AI projects, including but not limited to translation tasks.
  • Conduct typological and sociolinguistic research on a large number of languages, highlighting their similarities and differences.
  • Perform linguistic analyses for Responsible AI (toxic language, hate speech, gender bias and other cultural biases) in massively multilingual settings.
  • Conduct linguistic literature reviews on various NLP-adjacent topics and summarize findings.
  • Compare the quality of deliveries between vendors, identify error patterns, and provide actionable feedback.
  • Provide information or guidance relative to any aspect of linguistic knowledge (typology, morpho-syntax, sociolinguistics, classification, phonetics/phonology, pragmatics, etc.).
  • Reach out to and collaborate with native speakers in various languages.
  • Communicate results of linguistic analyses to engineers and research scientists.

Skills:

  • Must have strong written and spoken communication skills, especially business and research communication.
  • Must be a native speaker of a non-English language (preferably Hindi) with a high level of proficiency in another Indo-Aryan or South Dravidian language, plus broad knowledge of other languages in either of those two groups.
  • Working knowledge of other languages is a plus. Proficiency in a low-resource language is valued.
  • Must be able to code in Python and query databases using SQL; other coding languages used for data analysis are a plus.
  • Must be able to independently work through complex requests and perform under pressure.
  • Strong ability to work independently, prioritize, plan, and track work, as well as report progress.
  • Education or training in the basics of project management is a plus.
  • Self-motivation is a must.
  • Working knowledge of international language-classification standards is valued.
  • Experience working cross-functionally.
  • Experience collaborating with machine learning, NLP, or software engineers, or data scientists.
  • Experience contributing to research papers.
  • Important: Preferably no known conflicts of interest in the fields of machine translation, ASR, TTS, or LLM research (as candidates will need to contribute to research papers).

Education/Experience:

  • Graduate degree in Linguistics or a related field is a must; a PhD is a plus. A background or specialization in corpus linguistics is a plus. Experience with field work is a plus.
  • A graduate degree in Literature or English is not an appropriate substitution.
  • A degree in Computer Science with a specialization in NLP is not an appropriate substitution.
  • Must have a very firm grasp of the following linguistic fields: language typology, syntax, morphology, sociolinguistics (especially dialectology and discourse analysis), corpus linguistics, writing systems, pragmatics, phonology.
  • Must have some experience with applying basic Natural Language Processing techniques.


At SGS Consulting, we go beyond resume-job matches, creating meaningful connections and pathways for individuals to thrive in defining careers.


© 2026 All rights reserved.