Skip to content
/ nlp Public

Natural Language Processing service for Readmigo

License

Notifications You must be signed in to change notification settings

readmigo/nlp

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Readmigo NLP

CI

Natural Language Processing service for Readmigo.

Tech Stack

  • Language: Python 3.11+
  • Framework: FastAPI
  • NLP Libraries: spaCy, NLTK, jieba

Features

  • Text tokenization and analysis
  • Sentence boundary detection
  • Word difficulty assessment
  • Chinese text segmentation
  • Bilingual text alignment

Project Structure

├── app/
│   ├── main.py          # FastAPI application
│   ├── routers/         # API routes
│   ├── services/        # NLP services
│   └── models/          # Data models
├── scripts/             # Utility scripts
└── tests/               # Test cases

Development

# Create virtual environment
python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate

# Install dependencies
pip install -r requirements.txt

# Download spaCy models
python -m spacy download en_core_web_sm

# Start development server
uvicorn app.main:app --reload

API Endpoints

Endpoint Description
POST /tokenize Tokenize text into words
POST /sentences Split text into sentences
POST /difficulty Assess word difficulty level
POST /align Align bilingual paragraphs

About

Natural Language Processing service for Readmigo

Topics

Resources

License

Contributing

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors