Data Engineer Needed – Multimodal Dataset Infrastructure

Canada

2 weeks ago

Job summary

We're building a data infrastructure company focused on producing non-public, high-value multimodal datasets derived from real-world creative workflows.

This project sits at the intersection of:
  • multimodal data engineering
  • dataset productization
  • privacy-aware capture of real-world human behavior
  • infrastructure for advanced AI systems

The ideal candidate will have experience with multimodal data pipelines (video, audio, text, metadata); dataset design for ML or research use; handling large video and audio datasets; clean schema design and documentation; and working closely with research or infrastructure teams.
  • Multimodal Alignment and Segmentation
A key challenge is aligning and structuring data across modalities.
  • Pipeline Design and Implementation
You will help build a repeatable, automatable pipeline to ingest raw recordings at scale; apply preprocessing steps (normalization, compression, format standardization); generate structured outputs according to defined schemas; support dataset versioning, reproducibility, and incremental updates; and export datasets in formats usable by advanced ML teams. Tooling choices are flexible; the focus is on robust design, not a specific stack.

Benefits

This starts as a pilot engagement with clear deliverables and milestones. The role offers high autonomy and trust. Technical depth is valued over speed, and strong communication is essential. If it goes well, this can evolve into a long-term collaboration shaping the core data infrastructure of the company.

About This Project

This project builds on our previous success in creating non-public multimodal datasets that are used today by multiple prominent deep technology companies, including but not limited to Google, Amazon, Meta, and Microsoft. With an experienced Data Engineer / Dataset Engineer who has built scalable architectures before, we aim to create another success story here.


