IQVIA Logo

IQVIA

AI Data Engineer

Reposted 4 Days Ago
Be an Early Applicant
Remote
Hiring Remotely in Sofia, Sofia-grad
Mid level
Remote
Hiring Remotely in Sofia, Sofia-grad
Mid level
As a mid-level Data Engineer, you'll build and enhance data infrastructure for AI initiatives, design data pipelines, ensure compliance, and collaborate with engineers to ensure data quality and reliability.
The summary above was generated by AI

Role Description
We are seeking a mid-level Data Engineer to join our AI team. In this role, you will build, operate, and enhance the data infrastructure supporting our Agentic AI initiatives. You will collaborate with ML engineers, AI scientists, and product managers to deliver reliable data pipelines that enable autonomous and semi-autonomous AI agents. As part of the R&DS AI Innovation Program, you will contribute to production-ready, secure, and compliant data solutions while progressively growing toward deeper architectural ownership.

Key Responsibilities

Mandatory

  • Design, develop, and maintain scalable data pipelines and ETL/ELT processes supporting AI research, prototyping, and production use cases.

  • Collaborate with AI scientists and engineers to translate data requirements into ingestion, transformation, and serving solutions.

  • Apply data governance and security controls ensuring compliance, auditability, and protection of sensitive information.

  • Monitor, troubleshoot, and resolve data pipeline failures, performance issues, and schema changes.

  • Continuously improve reliability through testing, observability, documentation, and automation.

  • Design and maintain efficient data models (e.g., star schemas, feature-ready datasets, semantic layers) supporting analytics, ML workflows, and AI agent operations.

  • Implement automated data validation, schema checks, and pipeline testing to ensure high-quality data delivery across systems.

Preferred

  • Contribute to data architectures supporting agent workflows, including training data preparation, retrieval layers, and inference logging.

  • Build and enhance pipelines supporting near real-time agent interactions and feedback signals.

  • Strong SQL skills, with experience designing analytical queries and working with relational and NoSQL databases.

  • Implement and operate vector embedding stores, knowledge graph ingestion pipelines, and retrieval mechanisms.

  • Implement data quality controls suitable for ML/LLM pipelines in regulated environments.

  • Assist with performance tuning to reduce latency in agent-driven workflows.

  • Familiarity with infrastructure-as-code and automated deployment for data pipelines.

Qualifications

Education

  • Bachelor’s or Master’s degree in Computer Science, Data Engineering, or a related field.

Experience

  • Typically 3+ years of professional experience in data engineering, including production-grade pipeline development.

Programming & Technologies
  • Strong proficiency in Python; working experience with Java or Scala.

  • Solid knowledge of SQL and experience with NoSQL databases.

  • Familiarity with data warehousing and lakehouse platforms.

Cloud & Data Platforms
  • Hands-on experience with at least one major cloud platform (AWS, Azure, or GCP).

  • Experience with orchestration frameworks and CI/CD practices for data pipelines.

Preferred Qualifications
  • Familiarity with vector databases and embedding lifecycle management.

  • Experience with containerization and orchestration tools (Docker, Kubernetes).

  • Understanding of RAG data pipelines, LLM fine-tuning datasets, and evaluation signals.

  • Exposure to streaming or event-driven data processing systems.

IQVIA is a leading global provider of clinical research services, commercial insights and healthcare intelligence to the life sciences and healthcare industries. We create intelligent connections to accelerate the development and commercialization of innovative medical treatments to help improve patient outcomes and population health worldwide. Learn more at https://jobs.iqvia.com

IQVIA is committed to integrity in our hiring process and maintains a zero tolerance policy for candidate fraud. All information and credentials submitted in your application must be truthful and complete. Any false statements, misrepresentations, or material omissions during the recruitment process will result in immediate disqualification of your application, or termination of employment if discovered later, in accordance with applicable law. We appreciate your honesty and professionalism.

Top Skills

AWS
Azure
Docker
GCP
Java
Kubernetes
NoSQL
Python
Scala
SQL

Similar Jobs

4 Days Ago
Remote
Senior level
Senior level
Healthtech
Lead the design and optimization of data infrastructure for AI initiatives. Collaborate with teams to develop scalable data pipelines and ensure data quality and governance, while implementing advanced data models and monitoring systems.
Top Skills: AirflowAWSAzureDaskDockerFlinkGCPGoJavaJuliaKafkaKubernetesNoSQLPythonRayRustScalaSparkSQLTerraformVector Databases
9 Days Ago
Remote or Hybrid
Junior
Junior
Information Technology
The AI & Data Engineer will design, implement, and evaluate AI models, preprocess data, and collaborate in EU-funded R&D projects, ensuring compliance with regulations.
Top Skills: AWSAzureDjangoDockerFastapiFlaskGCPKubernetesPythonPyTorchScikit-LearnTensorFlowTransformers
Yesterday
Remote or Hybrid
Junior
Junior
Big Data • Food • Hardware • Machine Learning • Retail • Automation • Manufacturing
Support brand strategy implementation, product development, pricing strategies, and integrated marketing plans, while managing stakeholder relationships and monitoring performance.

What you need to know about the London Tech Scene

London isn't just a hub for established businesses; it's also a nursery for innovation. Boasting one of the most recognized fintech ecosystems in Europe, attracting billions in investments each year, London's success has made it a go-to destination for startups looking to make their mark. Top U.K. companies like Hoptin, Moneybox and Marshmallow have already made the city their base — yet fintech is just the beginning. From healthtech to renewable energy to cybersecurity and beyond, the city's startups are breaking new ground across a range of industries.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account