AI Data Engineer- HealthTech AI Up to $180,000 | U.S (Remote) | Full-Time Deeprec.ai is partnering up with an AI focused HealthTech company centred around early-stage cancer detection.
This is a remote Data Engineering role focused on building and maintaining scalable pipelines that ingest, clean, and structure large, complex healthcare datasets.
What You’ll Do- Work with Data Scientists and ML Engineers to define data needs for LLM and ML models.
- Build and maintain scalable data pipelines for large healthcare datasets.
- Ensure data quality through cleaning, validation, and monitoring.
- Design efficient data structures and schemas for model training and use.
- Source new data while ensuring compliance with healthcare regulations (e.g., HIPAA)
Requirements- Bachelor’s degree in Computer Science, Engineering, or a related field.
- Experience as a Data Engineer working with large-scale or big data systems such as Apache Spark
- Strong programming skills in Python, Scala, or Java.
- Experience with ETL pipelines, data warehousing, and data modelling.
- Familiarity with cloud platforms (AWS, GCP, or Azure) and tools like Apache Spark.
- Strong problem-solving skills
Nice to Have- Master’s degree in Computer Science, Engineering, Data Science, or a related field.
- Experience working with healthcare data and standards such as FHIR or HL7.
- Familiarity with machine learning concepts and LLM fine-tuning workflows.
- Experience using data orchestration tools such as Apache Airflow.
Why Join?- Help shape the future of healthcare by building AI that improves early cancer detection and saves lives.
- Work on high-impact, real-world AI used directly in clinical settings at scale.
- Competitive salary, benefits, and flexible remote/hybrid working options.
- Join a mission-driven, fast-growing team focused on innovation and health equity.
- Continuous learning with exposure to cutting-edge AI, ML, and healthcare technologies.