AsiyahSpeight/README.md

👋🏾 Hi there, I'm Asiyah Speight!

I'm a recent Data Science graduate from Chapman University with published research in machine learning and NLP. I build end-to-end data solutions, from web scraping and data pipelines to predictive models and interactive dashboards, with a focus on making technology accessible and impactful.


📄 Published Research

Chapman University Digital Commons | December 2025

Evaluated neural machine translation and semantic similarity detection for Arabic–English hadiths using the full Sahih Bukhari corpus (7,550 hadiths). A MarianMT transformer model fine-tuned on the full dataset improved BLEU scores by 49.6% over the baseline. Ten Siamese architectures were tested for semantic similarity and reached ~50% accuracy, demonstrating the importance of large, domain-specific corpora for translation and analysis.

🎤 Presented at Chapman University Student Research Day:

  • Oral Presentation
  • Poster Presentation

🔬 Research Highlights:

  • Fine-tuned MarianMT transformer on 7,550 Arabic hadiths
  • 49.6% BLEU score improvement over baseline
  • Tested 10 Siamese network architectures (LSTM, BiLSTM, GRU, Transformer)
  • Developed Arabic-specific representations for semantic similarity
  • Built web scraping pipeline for data collection and preprocessing

💻 Tech Stack: Python, PyTorch, Transformers (MarianMT, Hugging Face), BeautifulSoup, NLP, Deep Learning, LSTM, BiLSTM, GRU

📖 Read Full Paper →


🚀 What I Do:

  • Machine Learning & NLP: Build neural networks, transformers, and similarity detection models
  • Data Engineering: Design databases, optimize SQL queries, and create data pipelines
  • Data Visualization: Create interactive dashboards using Tableau, Power BI, and Python libraries
  • Web Scraping & Automation: Extract and process data from web sources with BeautifulSoup
  • Statistical Analysis: Apply predictive modeling and feature engineering to complex datasets

💼 Featured Projects:

Neural machine translation system using MarianMT transformers to translate 7,550 Arabic hadiths, achieving a 49.6% BLEU score improvement over baseline. Built 10 deep learning models (LSTM, BiLSTM, GRU) for semantic similarity detection. Research published in Chapman University Digital Commons.

Full-stack platform with Streamlit frontend and MySQL backend, featuring normalized database design, role-based access control, and interactive dashboards for shelter operations.

  • Tech: Python, Streamlit, MySQL, SQL, Database Design

Analyzed 500K+ luxury resale product listings using Tableau, Power BI, and Alteryx to identify pricing inefficiencies and optimize revenue strategies.

  • Tech: Tableau, Power BI, Alteryx, Python, Data Visualization

Built and optimized a Random Forest classifier with hyperparameter tuning to predict heart disease risk from medical datasets.

  • Tech: Python, scikit-learn, Pandas, Statistical Modeling

🎯 I'm currently:

  • 🔍 Seeking Data Analyst or Data Scientist roles where I can apply ML and analytics to drive insights
  • 🌱 Expanding my cloud computing skills (AWS, Azure)
  • 📚 Reading research papers on transformer architectures and NLP applications

💡 I'm interested in:

  • Data science & machine learning for social impact
  • Natural language processing and multilingual AI
  • Healthcare analytics and predictive modeling
  • Ethical AI, fairness, and algorithmic bias
  • Interactive data storytelling and visualization

💖 I'm looking to collaborate on:

  • Open-source data science and ML projects
  • NLP applications for underrepresented languages
  • Data visualizations that tell compelling stories
  • Projects with social impact and real-world applications

📫 How to reach me:


🛠️ Technical Skills

Languages & Tools:

Python, SQL, R, Java, JavaScript

Data Science & ML:

TensorFlow, PyTorch, scikit-learn, Pandas, NumPy

Visualization & BI:

Tableau, Power BI, Streamlit, Alteryx

Database & DevOps:

MySQL, Git, Linux


📊 GitHub Stats

[Images: Asiyah's GitHub stats; Top Languages]


😄 Pronouns: She/Her


⚡ Fun Facts:

  • 📚 I love manga and fantasy novels
  • 🧕🏾 I enjoy modest fashion and expressing creativity through style
  • 🌍 I'm passionate about creating inclusive spaces in tech. You belong here!
  • 🗣️ Currently learning Arabic and Thai

💬 Open to collaborations, opportunities, and conversations about data science, ML, and tech for good!

Popular repositories

  1. AsiyahSpeight (Public)

    Config files for my GitHub profile.

  2. CPSC-350-Data-Structures (Public)

    Mastery projects completed in CPSC 350: Data Structures and Algorithms at Chapman University.

    C++

  3. CPSC-231-Mastery-Projects (Public)

    Mastery projects completed in CPSC 231: Computer Science II (Java Programming) at Chapman University.

    Java

  4. computer-networking-top-down-approach-notes (Public)

    Forked from VasanthVanan/computer-networking-top-down-approach-notes

    Notes from "Computer Networking: A Top-down Approach"

  5. asiyah-portfolio (Public)

    Personal portfolio website

    HTML

  6. 408-petadoption (Public)

    Repository for the CPSC 408 final project, used to share code and other relevant files.

    Python