Skip to content
View ferjozsot23's full-sized avatar

Block or report ferjozsot23

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
ferjozsot23/README.md

Fernando Soto / ML·AI Engineer

Software Engineer at Thoughtworks · MSc Artificial Intelligence, USFQ (2026) · Quito, Ecuador

I work on NLP applied to social media auditing and economic forecasting. My current research combines news text with macroeconomic data to predict country risk in Ecuador, and audits TikTok's recommendation algorithm to measure psychological risk in content served to young users.


Research

USFQ Data Hub — Country Risk Perception
NLP on Ecuadorian news media combined with macroeconomic time series (EMBI, CPI, employment, trade flows) to predict country risk perception using ML. In progress.

TikTok Algorithmic Auditing — MSc thesis
Sock puppet methodology with custom metrics (SFH, CWR, PSI, AV) to audit TikTok's FYP and assess psychological risk of recommended content for Spanish-speaking youth. 2 papers in preparation.


Projects

Project What it does Stack
tiktok-fyp-auditor FYP data collection via sock puppet accounts. Playwright automation + Whisper large-v3 transcription. 470 videos collected. Python · Playwright · Whisper
tiktok-psych-risk-classifier Binary risk classifier on video transcripts. BETO embeddings, RepeatedStratifiedKFold 5×5, Wilcoxon testing. SVM F1 = 0.764. Python · BETO · scikit-learn
agentic-graphrag Three-agent GraphRAG pipeline with ChromaDB + NetworkX and custom LLM-as-judge evaluation. Python · OpenAI · ChromaDB
merchant-churn-predictor XGBoost churn model with SHAP explainability and React dashboard. Built in 48h at Interact2Hack 2026. Python · XGBoost · React

nandojozesoto@gmail.com · Open to remote ML/AI roles

Pinned Loading

  1. agentic-graphrag agentic-graphrag Public

    Three-agent GraphRAG pipeline (Retrieval/Answer/Graph) combining ChromaDB vector search with NetworkX knowledge graph for document QA. With custom LLM-as-judge evaluation framework.

    HTML

  2. merchant-churn-predictor merchant-churn-predictor Public

    XGBoost merchant churn model with temporal feature engineering, SHAP explainability, and React dashboard. Built end-to-end in 48h for Interact2Hack 2026.

    JavaScript

  3. tiktok-fyp-auditor tiktok-fyp-auditor Public

    Data collection pipeline for algorithmic auditing of TikTok's For You Feed using sock puppet accounts. Playwright + Whisper large-v3 transcription. Phase 1 of an MSc thesis (USFQ, 2026).

    Python

  4. tiktok-psych-risk-classifier tiktok-psych-risk-classifier Public

    Spanish BERT (BETO) binary classifier for psychologically risky TikTok content. RepeatedStratifiedKFold 5x5 + Wilcoxon test. Phase 2 of an MSc thesis pipeline (USFQ, 2026). Dataset anonymized for p…

    Jupyter Notebook