📈 Technical Indicators • 💬 Reddit Sentiment • 📰 Financial News • 🌍 Macroeconomics • 🧠 Sequential Deep Learning
Alpha Engine is a research-grade multimodal financial forecasting framework designed to predict directional movement of Australian Securities Exchange (ASX) stocks using deep learning and heterogeneous financial modalities.
The framework integrates:
- 📈 Historical stock market data
- 📊 Technical indicators
- 💬 Reddit financial sentiment
- 📰 Financial news sentiment
- 🌍 Macroeconomic market-regime indicators
- 🧠 Sequential deep learning architectures
The project investigates whether multimodal temporal fusion architectures can outperform traditional standalone machine learning approaches for ASX stock forecasting.
✅ Real-time ASX prediction interface
✅ Dynamic model switching
✅ Probability confidence visualization
✅ Deep learning inference API
✅ Multi-model forecasting engine
✅ Modern financial dashboard UI
✅ Production-ready FastAPI backend
✅ Cloud-hosted multimodal AI system
| Model | Configuration | Test AUC |
|---|---|---|
| 🥇 Best Model | LSTM + Transformer + News FinBERT | 0.6449 |
| 🥈 Second Model | LSTM + Transformer + Macro + Reddit + News | 0.6237 |
| RQ | Research Question |
|---|---|
| RQ1 | How does integrating sentiment data affect ASX stock prediction performance? |
| RQ2 | Do macroeconomic and geopolitical variables improve forecasting capability? |
| RQ3 | Do multimodal sequential architectures outperform traditional ML models? |
| RQ4 | How effectively can deep sequential models learn temporal financial dependencies? |
┌────────────────────┐
│ Yahoo Finance API │
└─────────┬──────────┘
│
Historical OHLCV Data
│
▼
┌────────────────────┐
│ Technical Features │
└─────────┬──────────┘
│
┌─────────────────────┼─────────────────────┐
│ │ │
▼ ▼ ▼
┌────────────────┐ ┌────────────────┐ ┌──────────────────┐
│ Reddit Dataset │ │ News Headlines │ │ Macro Indicators │
└───────┬────────┘ └────────┬───────┘ └────────┬─────────┘
│ │ │
▼ ▼ ▼
┌────────────────┐ ┌────────────────┐ ┌──────────────────┐
│ FinBERT NLP │ │ FinBERT / │ │ Market Regime │
│ Sentiment │ │ VADER Pipeline │ │ Features │
└───────┬────────┘ └────────┬───────┘ └────────┬─────────┘
│ │ │
└────────────┬───────┴────────────┬──────┘
▼ ▼
┌─────────────────────────────┐
│ Multimodal Feature Fusion │
└─────────────┬──────────────┘
▼
┌──────────────────────────────────┐
│ Sequential Deep Learning Models │
│ LSTM / Transformer / Hybrid │
└──────────────────────────────────┘
┌──────────────────────────┐
│ Next.js Frontend │
│ (Vercel) │
└────────────┬─────────────┘
│
HTTPS REST API Calls
│
▼
┌──────────────────────────┐
│ FastAPI Backend │
│ (Render) │
└────────────┬─────────────┘
│
┌────────────────┼────────────────┐
│ │ │
▼ ▼ ▼
┌────────────────┐ ┌────────────────┐ ┌────────────────┐
│ PyTorch Models │ │ Scalers (.pkl) │ │ Metadata JSON │
└────────────────┘ └────────────────┘ └────────────────┘
│
▼
┌─────────────────────────┐
│ Multimodal Inference AI │
└─────────────────────────┘
│
▼
ASX Directional Prediction
| Layer | Technology |
|---|---|
| Frontend | Next.js 16 + TailwindCSS |
| Backend API | FastAPI |
| Deep Learning | PyTorch |
| NLP | HuggingFace Transformers |
| Hosting | Vercel + Render |
| Model Serving | Artifact-based inference |
| Version Control | GitHub |
| Attribute | Details |
|---|---|
| Primary Ticker | CBA.AX |
| Market | Australian Securities Exchange |
| Source | Yahoo Finance (yfinance) |
| Period | 2015–2024 |
| Frequency | Daily |
| Problem Type | Binary Classification |
POST /predict{
"model": "best",
"ticker": "CBA.AX"
}{
"ticker": "CBA.AX",
"prediction": "UP",
"confidence": 0.7421,
"model": "LSTM + Transformer",
"target": "Target_t7"
}| Endpoint | Description |
|---|---|
/predict |
Run stock direction inference |
/models |
Return available deployed models |
/health |
API health check |
📊 Technical Indicators & Feature Engineering
- Daily Returns
- Log Returns
- Lag Returns (
t-1,t-2,t-3,t-5)
- Rolling Moving Averages
- MACD
- MACD Signal
- Ichimoku Indicators
- Bollinger Bands
- Rolling Volatility
- Rolling Standard Deviation
- Volume Change
- Volume Moving Average
- RSI (Relative Strength Index)
| Component | Details |
|---|---|
| Source | r/AusFinance, r/ASX_Bets |
| NLP Model | FinBERT |
| Purpose | Retail investor sentiment modelling |
| Aggregation | Daily grouped aggregation |
[
"sentiment_mean",
"sentiment_std",
"positive_ratio",
"negative_ratio",
"post_volume"
]| Component | Details |
|---|---|
| Source | The Guardian Open Platform API |
| Initial Model | VADER |
| Final Model | FinBERT |
| Observation | FinBERT > VADER for financial forecasting |
[
"news_sentiment_mean",
"news_sentiment_std",
"news_positive_ratio",
"news_negative_ratio",
"news_headline_volume"
]📉 Market Regime Indicators
| Variable | Description |
|---|---|
^VIX |
Global volatility / fear index |
^GSPC |
S&P500 spillover effects |
AUDUSD=X |
Currency sentiment |
CL=F |
Oil futures |
GC=F |
Gold futures |
[
"vix_return",
"sp500_return",
"audusd_return",
"oil_return",
"gold_return"
]| Target | Description |
|---|---|
Target_t1 |
Next-day movement |
Target_t7 |
7-day future movement |
✅ Target_t7 consistently outperformed Target_t1 across nearly all architectures and feature groups.
This suggests:
- medium-horizon forecasting contains stronger learnable temporal structure,
- while next-day prediction behaves closer to Efficient Market Hypothesis assumptions.
| Model | Purpose |
|---|---|
| Logistic Regression | Linear interpretable baseline |
| XGBoost | Nonlinear ensemble baseline |
AUC ≈ 0.50–0.53
Traditional tabular methods struggled to capture temporal dependencies and multimodal interactions.
| Architecture | Description | Performance |
|---|---|---|
| Improved LSTM | Multi-layer temporal sequence model | Strong |
| LSTM + CNN | Local temporal motif extraction | Moderate |
| LSTM + Transformer | Sequential memory + temporal attention | Best Overall |
| LSTM + Informer | Efficient long-sequence attention | Weak Generalization |
| Configuration | Value |
|---|---|
| Target | Target_t7 |
| Features | Stock + News FinBERT |
| Sequence Length | 60 |
| Hidden Size | 96 |
| Optimizer | AdamW |
| Epochs | 150 |
| Metric | Score |
|---|---|
| Validation AUC | 0.7075 |
| Test AUC | 0.6449 |
evaluation_metrics = [
"ROC-AUC",
"F1-Score",
"Accuracy",
"Confusion Matrix",
"Threshold Analysis",
"Probability Distribution Analysis"
]Models demonstrated stronger ranking capability than calibrated binary directional classification.
Therefore:
- ROC-AUC became the primary evaluation metric,
- instead of raw accuracy alone.
🔒 Research-Grade Temporal Integrity
✅ Chronological splitting only
✅ No random shuffling
✅ Sentiment shifted by +1 day
✅ Scaling fitted only on training data
✅ Sequence generation after preprocessing
✅ No future information leakage
sentiment(t-1) ---> predicts ---> stock movement(t)This preserves realistic financial forecasting conditions.
[
"Stock Only",
"Stock + Reddit FinBERT",
"Stock + News FinBERT",
"Stock + Macro",
"Stock + Reddit + News",
"Stock + Macro + News",
"Stock + Macro + Reddit + News"
]ASX-Stock-Prediction/
│
├── Backend/
│ ├── app/
│ │ ├── main.py
│ │ ├── inference.py
│ │ ├── model.py
│ │ └── config.py
│ │
│ ├── artifacts/
│ │ ├── best_lstm_transformer_news_finbert/
│ │ └── second_lstm_transformer_finbert_macro/
│ │
│ ├── requirements.txt
│ └── runtime.txt
│
├── frontend/
│ ├── app/
│ ├── public/
│ ├── package.json
│ └── .env.local
│
├── notebooks/
│ └── alphaengine.ipynb
│
├── data/
│ └── multimodal datasets
│
├── README.md
└── LICENSE| Category | Technologies |
|---|---|
| Deep Learning | PyTorch |
| NLP | HuggingFace Transformers |
| Financial NLP | FinBERT |
| ML | Scikit-learn, XGBoost |
| Data Processing | Pandas, NumPy |
| Visualization | Matplotlib, Seaborn |
| Financial Data | yfinance |
✅ ASX-specific multimodal forecasting framework
✅ Temporal multimodal fusion methodology
✅ Medium-horizon forecasting superiority discovery
✅ FinBERT > VADER financial NLP finding
✅ Sequential models > tabular models finding
✅ Transformer-enhanced temporal forecasting improvements
✅ Research-grade leakage-safe methodology
- Explainable AI (SHAP/LIME)
- Cross-stock generalization
- Portfolio optimization
- Event-aware transformers
- Attention visualization
- Regime-switching architectures
- Reinforcement learning integration
git clone https://github.com/yourusername/ASX-Stock-Prediction.git
cd ASX-Stock-Predictioncd Backend
python -m venv venv
source venv/bin/activatepip install -r requirements.txtuvicorn app.main:app --reloadBackend runs on:
http://127.0.0.1:8000
cd frontend
npm install
npm run devFrontend runs on:
http://localhost:3000
NEXT_PUBLIC_API_URL=https://your-render-backend-url.onrender.comMODEL_DIR=artifacts/
DEVICE=cpuEach deployed model contains:
model.pth
scaler.pkl
feature_columns.json
metadata.json
| File | Purpose |
|---|---|
model.pth |
PyTorch trained weights |
scaler.pkl |
StandardScaler object |
feature_columns.json |
Ordered feature schema |
metadata.json |
Model configuration |
This architecture enables:
✅ Multi-model deployment
✅ Dynamic model switching
✅ Production-safe inference
✅ Modular AI deployment
✅ Scalable experimentation
Research Notebook
↓
Model Training
↓
Best Model Selection
↓
Artifact Serialization
↓
FastAPI Inference Engine
↓
Render Cloud Deployment
↓
Next.js Financial Dashboard
↓
Vercel Production Hosting
| Service | Purpose |
|---|---|
| Vercel | Frontend Hosting |
| Render | Backend API Hosting |
| GitHub | Version Control |
| HuggingFace | NLP Models |
| yfinance | Financial Data |
Alpha Engine bridges the gap between:
| Academic Research | Production AI Systems |
|---|---|
| Financial forecasting research | Cloud AI deployment |
| Multimodal deep learning | Real-time inference |
| Financial NLP experimentation | Production APIs |
| Sequential temporal modelling | Full-stack AI engineering |
This project evolved from:
🎓 Research Thesis
➡️ into
🚀 Deployable Financial AI Platform
Alpha Engine is actively evolving from a research-grade forecasting framework into a scalable real-world financial AI platform.
The following production upgrades and research extensions are planned for future releases.
Current deployment focuses on:
CBA.AX
Future versions will support:
- Commonwealth Bank (
CBA.AX) - BHP Group (
BHP.AX) - CSL Limited (
CSL.AX) - NAB (
NAB.AX) - ANZ (
ANZ.AX) - Westpac (
WBC.AX) - ASX200 constituent forecasting
Goal:
- generalized cross-stock inference,
- portfolio-scale forecasting,
- sector-aware modelling.
Future research models include:
- Temporal Fusion Transformer (TFT)
- PatchTST
- TimeGPT-style architectures
- Cross-attention multimodal transformers
- Event-aware transformers
- Hierarchical attention systems
These architectures aim to improve:
- long-term temporal understanding,
- event sensitivity,
- market regime adaptation.
Planned upgrades:
✅ Live market data streaming
✅ Real-time news ingestion
✅ Streaming Reddit sentiment analysis
✅ Intraday prediction support
✅ Continuous model updates
✅ Automated retraining pipelines
Future architecture:
Live APIs
↓
Streaming NLP Pipelines
↓
Real-Time Feature Fusion
↓
Continuous Inference Engine
Future deployment versions will include:
- SHAP explainability
- Attention heatmaps
- Feature importance dashboards
- Temporal contribution visualization
- Prediction reasoning interfaces
Goal:
- improve interpretability,
- enhance financial trustworthiness,
- support human-AI collaboration.
Upcoming UI improvements:
- Interactive financial charts
- Historical prediction explorer
- Confidence trend visualization
- Portfolio dashboard
- Mobile-responsive analytics
- AI-generated market summaries
Future research directions include:
- Cross-market forecasting
- Crypto-financial multimodal fusion
- Reinforcement learning trading agents
- Regime-switching neural systems
- Event-driven forecasting
- Financial graph neural networks
- LLM-enhanced market reasoning
Alpha Engine aims to evolve into:
A fully scalable multimodal financial intelligence platform capable of combining quantitative finance, financial NLP, deep temporal learning, and cloud AI deployment into a unified real-time forecasting ecosystem.
| Phase | Status |
|---|---|
| Research Framework | ✅ Completed |
| Deep Learning Experiments | ✅ Completed |
| Multimodal Fusion System | ✅ Completed |
| Production API Deployment | ✅ Completed |
| Cloud Frontend Deployment | ✅ Completed |
| Real-Time Streaming AI | 🚧 In Progress |
| Explainable AI Dashboard | 🔜 Planned |
| Portfolio Intelligence System | 🔜 Planned |
| Large-Scale Multi-Stock Engine | 🔜 Planned |
🎓 Software Engineering — Artificial Intelligence
📍 Melbourne, Australia
🧠 AI Researcher • Full-Stack Developer • Financial AI Enthusiast
A Research-to-Production Financial AI System