Skip to content
View kachiann's full-sized avatar
πŸ’­
Learning new stuff is fun!!
πŸ’­
Learning new stuff is fun!!

Block or report kachiann

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
kachiann/README.md

Hi there, I'm Kachi πŸ‘‹

PhD Mathematician | Data Scientist | Data Engineer | MLOps Engineer

I build end-to-end machine learning systems and cloud data pipelines β€” from raw data ingestion to deployed models β€” with a focus on Data Engineering, MLOps, NLP, and reproducible research. My PhD research was on building Quasi-Monte Carlo Algorithms.

πŸš€ Open to: Data Science Β· Data Engineering Β· ML Engineering Β· MLOps roles
πŸ“ Based in: Rhineland-Palatinate, Germany


πŸŽ† About Me

  • πŸŽ“ PhD in Mathematics β€” dissertation on building Quasi-Monte Carlo Algorithms
  • βš™οΈ Streamlining the ML lifecycle for efficient model deployment and management in MLOps
  • 🌱 PhD focused on constructing new QMC point sets and lattice rules; interested in exploring how these low-discrepancy structures can replace standard Monte Carlo sampling in ML workflows β€” from numerical integration in Bayesian inference to more efficient training data sampling
  • ✍️ I write about data science on Medium
  • πŸ“„ Published research on Google Scholar Β· ORCID

πŸ˜„ How to Reach Me

LinkedIn Medium Google Scholar X


πŸ› οΈ Languages

Python R JavaScript MATLAB HTML SQL

πŸ“Š Data Science

Pandas Scikit-Learn NumPy Tableau Apache Spark

πŸ€– Machine Learning

PyTorch TensorFlow Keras SciPy

βš™οΈ MLOps

MLflow MageAI Databricks

πŸ—οΈ Data Engineering

GCP BigQuery Terraform Streamlit

πŸ—£οΈ Natural Language Processing

SpaCy Gensim


Pinned Loading

  1. maternal-health-ai-assistant maternal-health-ai-assistant Public

    Agentic AI assistant for nursing mothers. RAG over WHO/AAP/CDC sources combined with live USDA nutrition API tool-use. Built with LangChain, GPT-4o, FAISS, and Streamlit.

    Python 1

  2. Citi-Bike-Analytics-Pipeline Citi-Bike-Analytics-Pipeline Public

    Batch data engineering pipeline on GCP (Terraform, GCS, BigQuery) with partitioned warehouse and interactive Streamlit dashboard.

    Python 1

  3. project-mlops project-mlops Public

    Predict bike-sharing demand using machine learning pipeline for MLOps-Zoomcamp project, optimizing bike distribution and availability.

    Jupyter Notebook 2

  4. Hotel_Booking_Cancellations Hotel_Booking_Cancellations Public

    Machine learning model to predict booking cancellations for INN Hotels Group, enabling dynamic pricing, targeted retention, and optimized overbooking to reduce revenue loss

    Jupyter Notebook

  5. Bike_Sharing_Streamlit_App Bike_Sharing_Streamlit_App Public

    A Streamlit web application for analyzing bike sharing data and predicting rental demand using machine learning. Features include data exploration, usage pattern visualization, and demand forecasti…

    Jupyter Notebook