RhymeMapper is a Python tool for analyzing rhyme schemes in rap lyrics using phonetic data.
It splits words into syllables, extracts phonetic features (nucleus and coda), and color‑codes rhyming syllables in the terminal output. It also computes statistical metrics (density, diversity, etc.) and generates comparative graphs.
- Phonetic transcription using `g2p_en`
- Syllable splitting with `syllabify` (fallback heuristic for unknown words)
- Rhyme signature based on vowel nucleus and consonant coda (slant rhyme support)
- Terminal output with ANSI colors for rhyming syllables
- Adjustable minimum occurrence threshold for rhyme groups
- Batch analysis from CSV files (`scripts/generate_stats.py`)
- Statistical visualisations (boxplots, scatter plots, similarity heatmaps) via `analysis/`
- `Makefile` to automate common tasks
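
To make the rhyme-signature idea concrete, here is a minimal, standard-library-only sketch. It is not RhymeMapper's actual code: the function names (`rhyme_signature`, `group_rhymes`, `colorize`), the `min_count` parameter, and the color palette are all hypothetical. It assumes each syllable is already available as a list of ARPAbet phonemes (the notation `g2p_en` emits), and it only matches syllables whose nucleus and coda are identical, so it does not show how the tool handles slant rhymes.

```python
from collections import defaultdict

# ARPAbet vowel symbols with stress digits stripped (CMUdict-style phoneme set).
ARPABET_VOWELS = {
    "AA", "AE", "AH", "AO", "AW", "AY", "EH", "ER",
    "EY", "IH", "IY", "OW", "OY", "UH", "UW",
}

# Hypothetical palette: ANSI SGR foreground codes (red, green, yellow, ...).
ANSI_COLORS = [31, 32, 33, 34, 35, 36]


def rhyme_signature(syllable):
    """Return (nucleus, coda) for a syllable given as a list of ARPAbet phonemes."""
    phones = [p.rstrip("012") for p in syllable]  # drop stress markers
    for i, phone in enumerate(phones):
        if phone in ARPABET_VOWELS:
            return phone, tuple(phones[i + 1:])
    return None  # no vowel, so no signature


def group_rhymes(syllables, min_count=2):
    """Map each signature to the syllable indices sharing it, keeping only
    signatures that occur at least min_count times."""
    groups = defaultdict(list)
    for idx, syllable in enumerate(syllables):
        sig = rhyme_signature(syllable)
        if sig is not None:
            groups[sig].append(idx)
    return {sig: idxs for sig, idxs in groups.items() if len(idxs) >= min_count}


def colorize(texts, syllables, min_count=2):
    """Wrap each rhyming syllable's text in the ANSI color of its rhyme group."""
    color_of = {}
    for n, idxs in enumerate(group_rhymes(syllables, min_count).values()):
        for i in idxs:
            color_of[i] = ANSI_COLORS[n % len(ANSI_COLORS)]
    return " ".join(
        f"\033[{color_of[i]}m{text}\033[0m" if i in color_of else text
        for i, text in enumerate(texts)
    )


# Hand-written transcriptions for illustration (one syllable per word).
texts = ["cat", "hat", "dog", "fog", "sun"]
syllables = [
    ["K", "AE1", "T"],
    ["HH", "AE1", "T"],
    ["D", "AO1", "G"],
    ["F", "AO1", "G"],
    ["S", "AH1", "N"],
]
print(colorize(texts, syllables))
```

In this sketch, "cat"/"hat" and "dog"/"fog" each receive a color, while "sun" stays uncolored because its signature occurs only once. Raising `min_count` to 3 would leave every syllable uncolored.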
- `make demo`
  - Note: this takes a bit of time.
- `make stats`
  - You can use this dataset: https://www.kaggle.com/datasets/ceebloop/rap-lyrics-for-nlp?resource=download
- `make plots`
- `make test`
Python 3.10+
g2p_en, syllabify, pandas, matplotlib, seaborn, numpy, scikit-learn



