I'm a recent Data Science graduate from Chapman University with published research in machine learning and NLP. I build end-to-end data solutionsβfrom web scraping and data pipelines to predictive models and interactive dashboardsβwith a focus on making technology accessible and impactful.
Bridging Machine Learning and Islamic Scholarship: A Study in Hadith Translation and Similarity Analysis
Chapman University Digital Commons | December 2025
Evaluated neural machine translation and semantic similarity detection for ArabicβEnglish hadiths using the full Sahih Bukhari corpus (7,550 hadiths). MarianMT transformer model fine-tuned on the full dataset improved BLEU scores by 49.6% compared to baseline. Ten Siamese architectures tested for semantic similarity, achieving ~50% accuracy, demonstrating the importance of large, domain-specific corpora for translation and analysis.
π€ Presented at Chapman University Student Research Day:
- Oral Presentation
- Poster Presentation
π¬ Research Highlights:
- Fine-tuned MarianMT transformer on 7,550 Arabic hadiths
- 49.6% BLEU score improvement over baseline
- Tested 10 Siamese network architectures (LSTM, BiLSTM, GRU, Transformer)
- Developed Arabic-specific representations for semantic similarity
- Built web scraping pipeline for data collection and preprocessing
π» Tech Stack: Python, PyTorch, Transformers (MarianMT, Hugging Face), BeautifulSoup, NLP, Deep Learning, LSTM, BiLSTM, GRU
- Machine Learning & NLP: Build neural networks, transformers, and similarity detection models
- Data Engineering: Design databases, optimize SQL queries, and create data pipelines
- Data Visualization: Create interactive dashboards using Tableau, Power BI, and Python libraries
- Web Scraping & Automation: Extract and process data from web sources with BeautifulSoup
- Statistical Analysis: Apply predictive modeling and feature engineering to complex datasets
Neural machine translation system using MarianMT transformers to translate 7,550 Arabic hadiths, achieving 49.6% BLEU score improvement. Built 10 deep learning models (LSTM, BiLSTM, GRU) for semantic similarity detection. Published research in Chapman University Digital Commons.
- Tech: Python, PyTorch, Transformers, BeautifulSoup, NLP
- Publication: https://digitalcommons.chapman.edu/cusrd_abstracts/779/
Full-stack platform with Streamlit frontend and MySQL backend, featuring normalized database design, role-based access control, and interactive dashboards for shelter operations.
- Tech: Python, Streamlit, MySQL, SQL, Database Design
Analyzed 500K+ luxury resale product listings using Tableau, Power BI, and Alteryx to identify pricing inefficiencies and optimize revenue strategies.
- Tech: Tableau, Power BI, Alteryx, Python, Data Visualization
Built and optimized Random Forest classifier with hyperparameter tuning to predict heart disease risk from medical datasets.
- Tech: Python, scikit-learn, Pandas, Statistical Modeling
- π Seeking Data Analyst or Data Scientist roles where I can apply ML and analytics to drive insights
- π± Expanding my cloud computing skills (AWS, Azure)
- π Reading research papers on transformer architectures and NLP applications
- Data science & machine learning for social impact
- Natural language processing and multilingual AI
- Healthcare analytics and predictive modeling
- Ethical AI, fairness, and algorithmic bias
- Interactive data storytelling and visualization
- Open-source data science and ML projects
- NLP applications for underrepresented languages
- Data visualizations that tell compelling stories
- Projects with social impact and real-world applications
- π§ Email: speightasiyah@gmail.com
- πΌ LinkedIn: linkedin.com/in/asiyahspeight
- π Portfolio: AsiyahSpeight.github.io/asiyah-portfolio
- π Research: Published Paper on ML & NLP
Languages & Tools:
Data Science & ML:
Visualization & BI:
Database & DevOps:
- π I love manga and fantasy novels
- π§πΎ I enjoy modest fashion and expressing creativity through style
- π I'm passionate about creating inclusive spaces in techβyou belong here!
- π£οΈ Currently learning Arabic and Thai
π¬ Open to collaborations, opportunities, and conversations about data science, ML, and tech for good!