torch-vision

Here are 2 public repositories matching this topic...

Rehman110-F / image_caption_model

An image captioning model using ResNet-50 encoder + Transformer decoder trained on MS COCO. Served via FastAPI with a drag-and-drop frontend and Docker support for CPU deployment.

python docker natural-language-processing computer-vision deep-learning transformers cnn pytorch image-captioning resnet ms-coco uvicorn torchvision fastapi torch-vision encoder-decod

Updated Mar 13, 2026
Jupyter Notebook

rehmanashraf0314 / image_caption_model

Star

An image captioning model using ResNet-50 encoder + Transformer decoder trained on MS COCO. Served via FastAPI with a drag-and-drop frontend and Docker support for CPU deployment.

python docker natural-language-processing computer-vision deep-learning transformers pytorch image-captioning resnet encoder-decoder-model uvicorn torch-vision

Updated Apr 27, 2026
Jupyter Notebook

Improve this page

Add a description, image, and links to the torch-vision topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the torch-vision topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly