Data / AI professional based in Paris.
I build production-grade NLP and machine learning systems, and I turn technical work into clear value for business stakeholders.
🔭 Open to Data Analyst, Data Scientist and AI Engineer roles, with a focus on data consulting.
Languages: Python, SQL, DAX
ML / NLP: scikit-learn, Hugging Face Transformers (CamemBERT), time series forecasting (ARIMA, SARIMA, LSTM)
GenAI / LLM: Retrieval-Augmented Generation (RAG), Anthropic Claude API, Gemini API, ChromaDB, Ollama
Data viz: Power BI (star schema modeling)
Cloud / Infra: AWS (Lambda, S3, EventBridge), Docker, FastAPI, Streamlit
MLOps: MLflow, Git, GitHub Actions (CI), pytest, pre-commit, ruff, uv
sherlock: Production-grade French NLP pipeline. Built with uv, Pydantic, Typer CLI, loguru, MLflow, CamemBERT, pytest and GitHub Actions CI.
camembert-discours-politique: CamemBERT fine-tuning for French political discourse classification. 15,000+ texts, macro F1 of 0.63 (thesis graded 87/100).
techradar: Serverless AWS agent (Lambda, S3, EventBridge) that scrapes and summarizes tech news with the Anthropic Claude API, delivered via SendGrid.
askmydocs: RAG assistant for querying your own documents. Built with ChromaDB, the Gemini API and a Streamlit / Docker stack.
HR-Dashboard: HR analytics dashboard built in Python during my Data Analyst internship at Crédit Mutuel Alliance Fédérale (HR Digital Transformation team).
newflights: Airfare optimization app. React / TypeScript / Vite frontend, FastAPI backend.

