Patryk S
Data Engineer ML Specialist
Compétences
Voir mes services


Portfolio
Expérience professionnelle
CapsicoHealth, Inc.
6 mos
Data Enginerr
Aug 2025 - Dec 2025 • 4 mos
- Architected complete end-to-end ML prediction pipeline integrating REDCap patient data retrieval, causal inference model execution (320+ models), and automated clinical decision support delivery, allowing to predict patient drug response in seconds. - Coordinated cross-functional collaboration with 4 stakeholders across Denmark, Netherlands, Germany, and US to align technical requirements with clinical workflows and regulatory constraints. - Developed RESTful API infrastructure using Flask and Python to orchestrate data flow between 320+ prediction models (Causal forest, XGBoost), implementing parallel processing for scalable model execution. - Built feature engineering pipeline handling medication normalization, dose standardization, and healthcare-specific data transformations using Python/R integration via secure subprocess management.
Data Engineering Intern
Jun 2025 - Aug 2025 • 2 mos
- Supported developing and validating 8 causal inference models (Causal forest, XGBoost) for treatment effect predictions, establishing foundation for production healthcare ML system. - Engineered data validation framework ensuring input data quality for R-based statistical models, implementing type checking, range validation, and error tracking for patient records. - Created inference execution system integrating R statistical models with Python-based pipeline, automating previously manual model deployment and reducing execution time by implementing subprocess-based model calls. - Collaborated with clinical teams to translate medical domain requirements into technical feature specifications and model metadata schemas.