DocEx

A privacy-first document image extractor that runs entirely in your browser. Extract images from documents and eBooks without uploading files to any server.

Features

Privacy by design — All processing happens in your browser.
Wide format support — DOCX, PPTX, XLSX, Keynote, Pages, Numbers, EPUB, MOBI, AZW3.
Lossless extraction — Images extracted directly from document structures.
Smart filtering — Automatically removes icons and thumbnails under 10KB.
Batch download — Export all images as a single ZIP file.

Limitations: Older Office formats (.doc, .ppt, .xls) and DRM-protected eBooks are not supported.

Getting Started

Prerequisites

Node.js 18 or higher
npm, yarn, or pnpm

Installation

Clone the repository and install dependencies:

git clone https://github.com/Eyozy/docex.git
cd docex
npm install

Development

Start the development server:

npm run dev

Open your browser and navigate to http://localhost:5173

Production Build

Build for production:

npm run build

Preview the production build:

npm run preview

The build output will be in the dist directory, ready for deployment to any static hosting service.

Project Structure

docex/
├── src/
│   ├── components/          # UI components
│   ├── composables/         # Composition functions
│   ├── workers/             # Background processing
│   ├── utils/               # Helper functions
│   └── i18n/                # Translations
├── public/
└── vite.config.ts

Development

Adding New Format Support

Add file signature detection in src/workers/extractor.worker.ts
Implement the extraction logic in the appropriate parser
Update UI translations in src/i18n/en-US.ts and zh-CN.ts
Add the format extension to accepted types in src/components/DropZone.vue

Contributing

Contributions are welcome. Please feel free to submit a pull request.

Fork the repository
Create a feature branch (git checkout -b feature/amazing-feature)
Commit your changes
Push to the branch (git push origin feature/amazing-feature)
Open a pull request

License

MIT License — see the LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
public		public
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
index.html		index.html
package-lock.json		package-lock.json
package.json		package.json
postcss.config.js		postcss.config.js
tailwind.config.js		tailwind.config.js
tsconfig.json		tsconfig.json
tsconfig.node.json		tsconfig.node.json
vite.config.ts		vite.config.ts

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

DocEx

Features

Getting Started

Prerequisites

Installation

Development

Production Build

Project Structure

Development

Adding New Format Support

Contributing

License

About

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

DocEx

Features

Getting Started

Prerequisites

Installation

Development

Production Build

Project Structure

Development

Adding New Format Support

Contributing

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Contributors

Uh oh!

Languages