p
prateek_715

Prateek T

@prateek_715

Data Engineer

Inde
Anglais, Hindi
Certaines informations sont présentées en anglais.
À propos de moi
I am a Data Engineer with hands-on experience in PySpark, Kafka, Python, SQL, and the Hadoop ecosystem. Currently, I build large-scale data pipelines and ETL workflows at Infosys, focusing on medallion architecture and Spark optimization. I have a strong foundation in ML-powered data products and experience taking projects from EDA to deployed APIs.... Plus d’infos

Compétences

p
prateek_715
Prateek T
hors ligne • 
Temps de réponse moyen de 1 heure

Voir mes services

Formules & Macros
I will solve your excel problems

Expérience professionnelle

Infosys

Data Engineer

Infosys • Temps plein

Sep 2025 - Present10 mos

Deployed on Databricks platform; helped build production pipelines processing daily 2–9 GB datasets (7-12 million rows): designed schema transformations for medallion architecture, engineered PySpark optimizations (partition pruning, shuffle hash, broadcast joins), implemented data serialization tuning; optimizations reduced job execution time by upto 20% in some pipelines. Led data quality validation, schema design improvements, and schema evolution to accommodate upstream data changes; worked cross-functionally with team lead and senior engineers on parallelism optimization strategies