Customer Support Topic Discovery Pipeline

An unsupervised machine learning pipeline that discovers latent customer support topics from 1.22M tweets and automatically generates targeted FAQs.

Features

data_preprocessing.ipynb: Handles raw data ingestion, text cleaning, and formatting.
clustering_pipeline_final.ipynb: Executes the embedding generation, clustering algorithms, UMAP visualization, and LLM-driven FAQ creation.

Python, Sentence Transformers, K-Means, UMAP, LLMs

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
clustering-pipeline-final.ipynb		clustering-pipeline-final.ipynb
data-preprocessing.ipynb		data-preprocessing.ipynb
readme.md		readme.md