I'm passionate about Natural Language Processing and Computer Vision, and I'm currently pursuing a Master's degree in CS (AI Track).
- Modular, config-driven pipeline for preprocessing Sign Language datasets with pose and video outputs using MediaPipe, MMPose, and YOLO.
- Tech Stack:
Python, MediaPipe, MMPose, YOLO
- A system integrating VGG-16 (face) and ResCNN (voice) recognition for enhanced biometric security.
- Tech Stack:
TensorFlow, CNN, Signal Processing, Deep Learning
- A lightweight, 6M-parameter scene text recognition (STR) framework that leverages MAE pretraining to achieve high accuracy on the Union14M dataset.
- Tech Stack:
PyTorch, Masked Autoencoding (MAE), Vision Transformer
- A study of how frame rate influences Sign Language Translation, using pose extraction and T5-small.
- Tech Stack:
PyTorch, Google-T5, Vision Transformer
- A collection cataloging all the Italian brainrot.
- Crawls data from the Taiwan stock market and analyzes it with ML models.
- Tech Stack:
Python, Web Scraping, Machine Learning



