GitHub LogoLinkedIn Logo

Saeid Alavi

Machine Learning Research Associate @ UHN

After receiving gold medal in Electrical Engineering from University of Western Ontario, I decided to pursue graduate studies where I completed my MASc in Biomedical Engineering at the University of Toronto under supervision of Dr. Babak Taati, Vector Affiliated Scientist. I am Currently Working as a Machine Learning Research Associate at University Health Network (UHN) in Toronto, Canada. My research interests include various deep learning sub-fields including Natural Language Processing (NLP), Computer Vision, and Speech Processing. I am also interested in applying machine learning techniques to solve real-world problems in healthcare and medicine.

Me

Projects

a selection - other projects can be found on myGitHub Profile

  • NLP
  • Unsupervised Learning
  • Contextual Embedding
  • Prompt Engineering
  • Utilized web scraping solutions to gather data from diverse sources using Python and web scraping frameworks such as beautifulSoup, Scrapy, and Selenium (for JavaScript-heavy websites).
  • Developed a comprehensive dataset for Natural Language Processing (NLP) tasks, which was successfully added to the Hugging Face Model Hub.
  • Collaborated with a team of NLP experts affiliated with Vector Institute to define annotation guidelines for the dataset, ensuring high-quality annotations and facilitating unsupervised learning tasks.
  • Extensively worked with a wide range of NLP models, from traditional approaches like Word2Vec and GloVe to state-of-the-art models such as BERT, and LLMs such as GPT-4 and LLAMA.
  • Performed prompt engineering: conducted in-depth research and analysis of prompts for LLMs such as GPT-4 and LLaMa, understanding the impact of prompting on model behaviour and output quality.
  • Applied unsupervised learning techniques to the dataset, including constrained clustering, to group the wall clues into meaningful clusters based on contextual embeddings.
  • Paper: Saeid Alavi Naeini, Raeid Saqur, Mozhgan Saeidi, John Giorgi, Babak Taati. "Large Language Models are Fixated by Red Herrings: Exploring Creative Problem Solving and Einstellung Effect using the Only Connect Wall Dataset."Advances in Neural Information Processing Systems (2023).
  • Automatic Speech Recognition
  • Dynamic Time Warping
  • Neurological Disorders
  • Data Augmentation
  • Curated and collected a large-scale dataset of TTS speech samples covering a wide range of accents, dialects, and speaking rates, ensuring representation of various linguistic characteristics.
  • Developed fine-tuning pipelines and strategies to adapt pre-trained ASR models to the specific attributes of TTS speech data, addressing challenges such as accent variations, speaking rate differences, and dysarthric disorders.
  • conducted extensive data processing, including utilization of Dynamic Time Warping for temporal segmentation of speech samples.
  • Conducted thorough evaluations and benchmarking of the fine-tuned ASR models, comparing their performance with the baseline models on standard evaluation metrics such as Word Error Rate (WER) and accuracy.
  • Collaborated with domain experts and linguists to validate the improvements achieved by proposed models, ensuring the enhanced models accurately captured dysarthric speech.
  • Implemented MLOps practices by developing a web application that analyzes dysarthric speech and performs automatic temporal segmentation. The web app was developed using Docker and Kubernetes to ensure efficient deployment and scalability of machine learning models.
  • Paper: SA Naeini, L Simmatis, D Jafari, Y Yunusova, B Taati. "Improving Dysarthric Speech Segmentation with Emulated and Synthetic Augmentation." IEEE Journal of Biomedical and Health Informatics (JBHI). IEEE, 2023 (submitted).
  • Paper: SA Naeini, et al. "Concurrent Validity of Automatic Speech and Pause Measures During Passage Reading in ALS" 2022 IEEE-EMBS International Conference on Biomedical and Health Informatics (BHI). IEEE, 2022.
  • Computer Vision
  • Deep Learning
  • Neurological Disorders
  • Applied transformer-based computer vision techniques to perform repetition counting in videos, achieving accurate and efficient analysis of repetitive actions or behaviors.
  • Fine-tuned pre-trained transformer models on the repetition counting task, leveraging transfer learning to capture visual patterns and repetitive dynamics.
  • Leveraged automatic repetition counting techniques to perform temporal segmentation and parsing of neurological assessment video data, enabling accurate analysis and understanding of repetitive actions in the context of neurological assessments.
  • Paper: SA Naeini, et al. "Automated Temporal Segmentation of Orofacial Assessment Videos." 2022 IEEE-EMBS International Conference on Biomedical and Health Informatics (BHI). IEEE, 2022.
  • Web Development
  • AWS
  • React
  • flask
  • UX/UI Design
  • Developed a web application for remote data collection during the COVID-19 pandemic, enabling efficient and secure data gathering from remote locations.
  • Designed and implemented the user interface (UI) and user experience (UX) of the web app, focusing on simplicity, intuitiveness, and accessibility to accommodate users with varying technical expertise.
  • Developed secure user authentication and authorization mechanisms to protect sensitive data and ensure proper access control for different user roles.
  • Provided ongoing maintenance, bug fixes, and feature enhancements to optimize the web app's performance and address evolving user requirements.
  • Utilized AWS cloud services such as EC2 to store and process web application’s content.
  • Integrated Dropbox API to store data on HIPPA compliant platform.
  • Paper: Simmatis, L., Alavi Naeini, S., et al. (2023). "Analytical Validation of a Webcam-Based Assessment of Speech Kinematics: Digital Biomarker Evaluation following the V3 Framework." Digital Biomarkers, 7(1), 7-17.
© Saeid Alavi Naeini