Language Data Scientist - Saskatoon, SK
1 day ago

Job description
Job Title: Language Data Scientist
Location: Remote within Canada (excluding Quebec)
Employment Type: Full-Time (40 hours per week) Fixed-Term
Who we are:
Innodata (NASDAQ: INOD) is a leading data engineering company. With more than 2,000 customers and operations in 13 cities around the world, we are an AI technology solutions provider-of-choice for 4 out of 5 of the world's biggest technology companies, as well as leading companies across financial services, insurance, technology, law, and medicine.
By combining advanced machine learning and artificial intelligence (ML/AI) technologies, a global workforce of subject matter experts, and a high-security infrastructure, we're helping usher in the promise of AI. Innodata offers a powerful combination of both digital data solutions and easy-to-use, high-quality platforms.
Our global workforce includes over 7,000 employees in the United States, Canada, United Kingdom, the Philippines, India, Sri Lanka, Israel and Germany. We're poised for a period of explosive growth over the next few years.
About the Role:
Innodata is building a team of Language Data Scientists and Gen AI experts to help our customers advance GenAI applications. You will work hands-on with multi-modal and multi-lingual datasets and collaborate with cross-functional partners. You will use your experience with human and synthetic data workflows to drive innovation and continuous improvement. The ideal candidate must have the right mix of skills in (computational) linguistics and human evaluation tasks, data science, and data engineering.
Key Responsibilities:
- Design/improve workflows to create data for AI/ML training and evaluation. Includes human annotation and data collection workflows, as well as synthetic ones.
- Dive deep into existing workflows and processes to gather data and insights, make recommendations, and drive improvement through innovation and cross-functional collaboration with customers
- Critically assess annotation tooling and workflows
- Quantitatively analyze large datasets, perform statistical analysis, calculate metrics, and make recommendations to improve accuracy and performance
- Work closely with client stakeholders on understanding goals, gathering requirements, proposing solutions and executing them.
Qualifications:
- Knowledge of how components of GenAI products or services combine to work
- Collaborating with cross-functional teams to define AI project requirements and objectives, ensuring alignment with overall business goals
- MA in (computational) linguistics, data science, computer science (AI / ML / NLU), quantitative social sciences or a related scientific / quantitative field, PhD strongly preferred
- Language and language data expertise: Extensive experience working with human language data and designing human evaluation tasks, including multi-phase and complex workflows.
- Deep understanding of language and its relationship with culture
- Ability to identify ambiguity and subjectivity in language
- Ability to work with multi-lingual and multi-modal projects
- Language and language data expertise:
- Quantitative Analysis Skills: Advanced knowledge of statistics, metrics (e.g. f1 score, inter-rater reliability metrics), and data analysis methods such as sampling.
- Technical skills:
- Experience with Natural Language Processing (NLP) techniques and tools, such as SpaCy, NLTK, or Hugging Face.
- Proficiency in Python to
- handle / transform large datasets (e.g. pre- and postprocessing data, pandas)
- perform quantitative analyses
- visualize data (for example matplotlib, seaborn)
- Data processing:
- Deep understanding of data pipelines to support ML and NLP workflows,
- Knowledge of efficient data collection, transformation, and storage
- Knowledge of data structures, algorithms, and data engineering principles
- Excellent interpersonal skills for effective cross-functional stakeholder engagement
- Excellent problem-solving skills, with the ability to think critically and creatively to develop innovative AI solutions
- Ability to work independently and collaborate as part of a team
- Adaptable to changing technologies and methodologies
- Ability to translate experience, research and development information to understand client products and services.
Preferred Qualifications:
- Conducting research to stay up-to-date with the latest advancements in generative AI, machine learning, and deep learning techniques
- Knowledge of optimizing existing generative AI models for improved performance, scalability, and efficiency
- Experience of developing and maintaining ML/AI pipelines, including data preprocessing, feature extraction, model training, and evaluation
- Model Fine-Tuning: Knowledge of Fine-tuning pre-trained models to adapt them to specific tasks and datasets, improving their performance and relevance
- Developing clear and concise documentation, including technical specifications, user guides, and presentations, to communicate complex AI concepts to both technical and nontechnical stakeholders
- Contributing to establishing best practices and standards for generative AI development with customers and within the organization
- Providing technical mentorship and guidance to junior team members
- Understanding of techniques such as GPT, VAE, and GANs
Salary Range: Up to $120k CAD
Rates at Innodata vary depending on a wide array of factors, which may include but are not limited to the role, skill set, educational background and geographic location.
Similar jobs
Job Title: Language Data Scientist · Location: Remote within Canada (excluding Quebec) · Employment Type: Full-Time (40 hours per week) Fixed-Term · Who We Are · Innodata (NASDAQ: INOD) is a leading data engineering company. With more than 2,000 customers and operations in 13 cit ...
1 day ago
The Decision Scientist will report directly to the Manager, Advanced Analytics. · The Decision Scientist plays a critical role in extracting value from SHA's data assets. · Decision Scientists are accountable to support Advanced Analytics & Modeling projects by providing expertis ...
1 week ago
The Decision Scientist will report directly to the Manager, Advanced Analytics. They play a critical role in extracting value from SHA's data assets. · The Decision Scientist plays a critical role in extracting value from SHA's data assets. Decision Scientists are accountable to ...
3 weeks ago
An opportunity has arisen for a Healthcare Scientist responsible for undertaking translational research and development initiatives for the genetic diagnosis of cancer, · Performing critical evaluation of new technology and subsequent translational implementation into routine dia ...
1 month ago
The Saskatchewan Research Council (SRC) is seeking an Associate Geospatial Scientist to anchor SRC's geomatics and spatial data infrastructure while supporting climate and climate-resilience initiatives. · Degree in Computer Science, Geomatics, Geography, Engineering, Environment ...
1 month ago
Type: · Full-Time · Travel: · Up to 75% during peak season · About aerialPLOT · aerialPLOT is a web-based platform designed for agricultural research and on-farm evaluations. We automate the extraction of data from aerial imagery at operational scale, reducing the reliance on man ...
2 days ago
The Saskatchewan Research Council (SRC) is seeking an Associate Geospatial Scientist to anchor SRC's geomatics and spatial data infrastructure while supporting climate and climate-resilience initiatives. · ...
1 month ago
aerialPLOT is a web-based platform designed for agricultural research and on-farm evaluations. · We automate the extraction of data from aerial imagery at operational scale, reducing the reliance on manual collection and subjectivity. · The platform gives researchers, agronomists ...
3 days ago
+This is an exciting opportunity for a recent graduate with less than 2 years of experience to contribute to projects that restore impacted sites and protect natural ecosystems. · + · Conduct fieldwork including soil and groundwater sampling, environmental monitoring, vegetation ...
1 month ago
We are the home of ambitious, passionate and innovative world shapers. · ...
2 weeks ago
Research Technician, Wheat Genomics and Molecular Breeding, Crop Development Centre
Only for registered members
The Durum Wheat Genetics, Genomics and Breeding team at the University of Saskatchewan is seeking two highly motivated research technicians to work in the molecular breeding lab. · The incumbent will be involved in a range of research projects with plant breeders, geneticists, mo ...
1 month ago
Research Technician, Wheat Genomics and Molecular Breeding, Crop Development Centre
Only for registered members
The Durum Wheat Genetics, Genomics and Breeding team led by Professor Dr. Curtis Pozniak at the University of Saskatchewan (USASK), Crop Development Centre (CDC) is seeking two highly motivated research technicians to work in the molecular breeding lab. · The incumbent will be in ...
1 month ago
We are seeking Snowflake Data Architects for our AI & Data practice team. · ...
3 weeks ago
Northern Nutrients seeks a Quality Control and Research and Development Manager to lead quality control program at fertilizer production plant and manage research projects. · ...
3 weeks ago
We are looking for a Water Quality Scientist to join our team in Saskatoon. As a Water Quality Scientist, you will design and deliver water and sediment quality assessments that safeguard ecosystems and meet regulatory standards. · Degree in Chemistry, Environmental Science, Envi ...
1 month ago
We are the home of ambitious, passionate and innovative world shapers. With an unmatched breadth and depth of engineering expertise our global minds unite to power local solutions. · ...
1 week ago
Postdoctoral Fellow, Researcher in Haskap Genotyping, Plant Sciences
Only for registered members
+The successful candidate will contribute to research focused on characterizing haskap genetic resources maintained within the USask Fruit Program. · + · sampling · conducting ploidy analysis · ...
1 month ago
Postdoctoral Fellow, Researcher in Haskap Genotyping, Plant Sciences
Only for registered members
The successful candidate will contribute to research focused on characterizing haskap (Lonicera caerulea) genetic resources maintained within the USask Fruit Program. · The postdoctoral researcher will play a central role in advancing the research and breeding efforts of the USas ...
1 month ago
We are seeking a Quality Control and Research Development Manager to lead our quality control program at our fertilizer production plant in Saskatoon. The ideal candidate will have a background in chemistry and be able to solve problems related to fertilizer quality and performan ...
3 weeks ago
+ Work on projects related to water and wastewater treatment , industrial and mine sites. · + Conduct engineering design under supervision of experienced engineer · + Collects data under supervision of experienced engineer<+ Uses computer software to solve basic engineering probl ...
1 week ago