Releases: eellak/glossAPI

Glossapi 0.1

30 Nov 08:33

Features

  • Multi-GPU processing
    distributes work across several GPUs for higher throughput.

  • Text extraction with Docling
    from PDF and other file types.

  • Greek OCR support
    recognizes Greek text in images and PDFs using DeepSeek OCR or RapidOCR.

  • Formula recognition
    outputs formulas as LaTeX using Docling's math enhancement model or DeepSeek OCR.

  • Fast CPU-only text extraction
    with self.batch_policy = "safe", using the pypdfium backend.

glossapi v0.0.5

12 Mar 10:02

Pre-release

Publishing through workflow

glossapi pipeline

11 Mar 09:30

Pre-release

Pipeline that extracts text from textbooks or academic papers (PDF only for now), splits it into sections, annotates the sections (table of contents, bibliography, etc.), and stores the result in a Parquet file.
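
As a rough illustration of the "section it, annotate sections" idea, here is a minimal sketch in plain Python. It is not glossAPI's implementation or API: the `Section` class, the `split_sections` function, the `LABELS` keyword table, and the assumption that headings are marked with `#` are all hypothetical, chosen only to make the idea concrete.

```python
from __future__ import annotations

import re
from dataclasses import dataclass

# Hypothetical heading-to-label table; not taken from glossAPI.
LABELS = {
    "table of contents": "toc",
    "contents": "toc",
    "bibliography": "bibliography",
    "references": "bibliography",
}


@dataclass
class Section:
    title: str
    body: str
    label: str


def _make_section(title: str, lines: list[str]) -> Section:
    # Label by exact heading match; anything unknown is plain text.
    label = LABELS.get(title.lower(), "text")
    return Section(title, "\n".join(lines).strip(), label)


def split_sections(text: str) -> list[Section]:
    """Split text on '#'-style headings and label each section."""
    sections: list[Section] = []
    title, lines = "", []
    for line in text.splitlines():
        match = re.match(r"^#+\s+(.*)$", line)
        if match:
            # A new heading closes the section collected so far.
            if title or lines:
                sections.append(_make_section(title, lines))
            title, lines = match.group(1).strip(), []
        else:
            lines.append(line)
    if title or lines:
        sections.append(_make_section(title, lines))
    return sections
```

In a sketch like this, each `Section` would correspond to one row of the output table (title, body, label), which could then be written to Parquet with a library of your choice.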

glossapi pipeline

11 Mar 09:27

Pre-release

A pipeline for text extraction and annotation.

glossapi pipeline

11 Mar 09:20

Pre-release

Pipeline that extracts text from textbooks or academic papers (PDF only for now), splits it into sections, annotates the sections (table of contents, bibliography, etc.), and stores the result in a Parquet file.

v0.0.3.5.2-alpha

11 Mar 08:32

Pre-release
Update GitHub workflow to publish to TestPyPI