Back to all roles

Data and Statistics

/

Trusted operator

Synthetic Data Engineer

Data generation

Mid Full-Time Remote / Any Permanent

Compensation

Not Disclosed

Engagement

Full-Time

Permanent role. Full-time commitment. Remote-first, with periodic in-person off-sites.

Scope of role

Drive specific initiatives with minimal supervision. Deepen craft. Begin mentoring others.

01 — The role

Why this role exists at EduRankAI

Build systems to generate high-quality synthetic training data for EduRankAI's AI models.

02 — The work

What you will own

  • 01 Design and implement synthetic data generation pipelines
  • 02 Evaluate synthetic data quality for downstream model training
  • 03 Collaborate with ML teams on data requirements
  • 04 Build tooling for controlled data augmentation

03 — The expertise

What we look for

PythonData GenerationLLM APIsQuality EvaluationStatistics

04 — The bar

Who thrives here

  • 3+ years in data engineering or ML
  • Experience with synthetic data or data augmentation techniques

05 — How we work

The EduRankAI environment

Remote-first, async-first

Work from anywhere. We optimise for deep work, not face time. Periodic in-person off-sites for the full-time team.

High autonomy, high standards

We hire adults and trust them. You will be expected to set your own goals, communicate clearly, and ship.

Builders, not bureaucrats

We optimise for clarity over process. Make the call, ship the work, write up what you learned.

Bharat-built, globally ambitious

We are an Indian frontier AI lab. We build for India first and the world second — in that order.

06 — Hiring process

What to expect after you apply

  1. 01

    Application review

    Every application is read personally within five business days. We respond either way.

  2. 02

    Take-home or live exercise

    Role-specific. Time-boxed. Real problems we are actually working on, not invented puzzles.

  3. 03

    Conversations

    Deep technical and values conversations with the team you would join. No trick questions. No panel ambushes.

  4. 04

    Offer or honest no

    If yes: digital offer letter, signed in-portal, transparent terms. If no: written feedback if you want it.

Ready to apply?

We read every application personally. If you are the right person for this role — regardless of pedigree, background, or where you are based — you will hear back from us within five business days.