
Software Engineer, Data Infrastructure & Acquisition
Speechify
Software Engineer, Data Infrastructure & Acquisition
Speechify is seeking a Software Engineer for Data Infrastructure & Acquisition in Rotterdam to build and manage petabyte-scale data pipelines for AI model training. The role involves operating GCP infrastructure, extending ingestion pipelines, and collaborating with scientists to deliver high-quality datasets. Ideal candidate has 5+ years of software development experience, proficiency in Python, Docker, and Terraform.
Software Engineer, Data Infrastructure & Acquisition
Speechify is seeking a Software Engineer for Data Infrastructure & Acquisition in Rotterdam to build and manage petabyte-scale data pipelines for AI model training. The role involves operating GCP infrastructure, extending ingestion pipelines, and collaborating with scientists to deliver high-quality datasets. Ideal candidate has 5+ years of software development experience, proficiency in Python, Docker, and Terraform.
Salary
Core Qualifications
Technical (Must-have)
Soft Skills
Preferred Qualifications
Technical (Nice-to-have)
Key Responsibilities
- Be scrappy to find new sources of audio data and bring it into our ingestion pipeline
- Operate and extend the cloud infrastructure for our ingestion pipeline, currently running on GCP and managed with Terraform
- Collaborate closely with our Scientists to shift the cost/throughput/quality frontier, delivering richer data at bigger scale and lower cost to power our next-generation models
- Collaborate with others on the AI Team and Speechify Leadership to craft the AI Team’s dataset roadmap to power Speechify’s next-generation consumer and enterprise products