Saif Mahin
Vetted Pro
Level 2
Python Developer: AI Data Extraction and Web Scraping
Certified by Fiverr Pro
Saif Mahin was selected by the Fiverr Pro team for his expertise.
Certified for
Data scraping
Skills



Want to work on an hourly basis?
Tell Saif Mahin what you need.
US$20 / hour
Work experience
Python Developer
SupplyCopia • Full-time
Dec 2022 - Present • 3 yrs 5 mos
As a Python Developer at SupplyCopia, I build scalable data pipelines, AI-powered document processing systems, and automation frameworks that handle large-scale unstructured data with high accuracy.

What I've built and delivered:

Document & Invoice Processing
- Designed end-to-end invoice extraction pipelines processing 100K+ documents monthly, transforming unstructured PDFs into clean, structured datasets (Excel, CSV, Parquet).
- Built AI-assisted parsing using OpenAI APIs and LangChain to resolve field ambiguities and boost extraction accuracy.
- Created automated QA frameworks to catch mismatches in amounts, vendors, and invoice numbers at scale.

AI & Intelligent Systems
- Integrated embedding models and re-rankers (BGE) for schema mapping and intelligent column matching.
- Contributed to AI chatbot development, connecting LLMs with structured data and knowledge bases.
- Led automation initiatives using AWS Lambda, reducing manual effort and improving processing speed.

Web Scraping & Automation
- Engineered high-performance scraping systems with concurrency, retry logic, proxy rotation, and anti-bot strategies for large-scale data collection.
- Built and deployed REST APIs using FastAPI and Flask for internal tools and data workflows.
- Designed S3-based orchestration workflows for storing and processing structured outputs.

Data Engineering & Analytics
- Developed Snowflake-based data pipelines with monthly partitioned tables and consolidated reporting layers.
- Built data reconciliation systems using fuzzy matching (RapidFuzz), normalization, and rule-based + AI logic.
- Implemented parallel processing (ThreadPoolExecutor, batching, checkpointing) to handle thousands of vendors efficiently.

I work closely with cross-functional teams to deliver reliable, production-ready solutions that drive data accuracy, automation, and business efficiency.
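
For readers curious what the parallel-processing pattern mentioned above (ThreadPoolExecutor, batching, checkpointing, plus retry logic) can look like, here is a minimal sketch using only the Python standard library. It is an illustration, not the code used at SupplyCopia: the names `with_retry`, `load_checkpoint`, and `process_all`, the JSON checkpoint file, and the default batch and worker sizes are all assumptions made for this example.

```python
import json
import time
from concurrent.futures import ThreadPoolExecutor, as_completed
from pathlib import Path


def with_retry(func, item, attempts=3, base_delay=0.1):
    """Call func(item), retrying with exponential backoff on any exception."""
    for attempt in range(attempts):
        try:
            return func(item)
        except Exception:
            if attempt == attempts - 1:
                raise  # out of attempts: let the caller see the error
            time.sleep(base_delay * 2 ** attempt)


def load_checkpoint(path):
    """Return previously saved results, or an empty dict on a first run."""
    p = Path(path)
    return json.loads(p.read_text()) if p.exists() else {}


def process_all(items, func, checkpoint_path, batch_size=50, max_workers=8):
    """Process string IDs in parallel batches, saving results after each batch.

    Assumes items are strings (hypothetical vendor IDs) and that func
    returns JSON-serializable values, so results can be checkpointed as JSON.
    """
    done = load_checkpoint(checkpoint_path)
    pending = [item for item in items if item not in done]

    for start in range(0, len(pending), batch_size):
        batch = pending[start:start + batch_size]
        with ThreadPoolExecutor(max_workers=max_workers) as pool:
            futures = {pool.submit(with_retry, func, item): item for item in batch}
            for future in as_completed(futures):
                item = futures[future]
                try:
                    done[item] = future.result()
                except Exception as exc:
                    # Not recorded in the checkpoint, so a later run retries it.
                    print(f"failed after retries: {item}: {exc}")
        # Persist progress once per batch so an interrupted run can resume.
        Path(checkpoint_path).write_text(json.dumps(done))

    return done
```

Calling `process_all(vendor_ids, fetch_vendor, "progress.json")` a second time skips every ID already present in the checkpoint file; `vendor_ids` and `fetch_vendor` are placeholders for whatever list and worker function a real job would supply. Threads suit I/O-bound work such as HTTP requests; CPU-bound parsing would more likely call for processes.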
149 Reviews
| Rating | Reviews |
| --- | --- |
| 5 stars | 143 |
| 4 stars | 6 |
| 3 stars | 0 |
| 2 stars | 0 |
| 1 star | 0 |
Rating breakdown
- Seller communication level
- Quality of delivery
- Value of delivery
martijnp17
Repeat client

Netherlands
Happy with the work Saif delivers! We've placed 18 orders at this moment of time.

garricklau

United States
He took the time to understand exactly what I needed and produced documentation that proved his skill.
vindavis1

Australia
Saif was really good. He knows what we are after. Great communication and we got what we were promised.
p_dmdr
Repeat client

Netherlands
Excellent work, just like last year. Will come back next year.
leonardodurso

Azerbaijan

