Skip to content

twopranav/FAQ-Generator

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 

Repository files navigation

Customer Support Topic Discovery Pipeline

An unsupervised machine learning pipeline that discovers latent customer support topics from 1.22M tweets and automatically generates targeted FAQs.

Features

  • Extracts topics using Sentence Transformers and K-Means clustering.
  • Utilizes silhouette-based dynamic k selection for optimal cluster sizing.
  • Visualizes high-dimensional text data using UMAP.
  • Generates automated FAQ documents for each topic cluster using an LLM.

Project Structure

  • data_preprocessing.ipynb: Handles raw data ingestion, text cleaning, and formatting.
  • clustering_pipeline_final.ipynb: Executes the embedding generation, clustering algorithms, UMAP visualization, and LLM-driven FAQ creation.

Tech Stack

Python, Sentence Transformers, K-Means, UMAP, LLMs

About

An unsupervised ML pipeline that discovers latent customer support topics from 1.22M tweets and automatically generates targeted FAQs using Sentence Transformers, K-Means, and LLMs.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors