a2ps |
Any to PostScript filter |
agrep |
A tool for the fast searching of text allowing for errors in the search pattern |
aiksaurus |
A thesaurus lib, tool and database |
an |
Very fast anagram generator with dictionary lookup |
ansifilter |
Handles text files containing ANSI terminal escape codes |
antiword |
free MS Word reader |
antixls |
Print out an XLS file with minimal formatting, or extract the data into CSV |
apvlv |
Alf's PDF Viewer Like Vim |
asa |
ASA Carriage control conversion for ouput by Fortran programs |
asciidoc |
A plain text human readable/writable document format |
aspell |
A spell checker replacement for ispell |
atril |
Atril document viewer for MATE |
bact |
Boosting Algorithm for Classification of Trees |
barcode |
barcode generator |
bdf2psf |
Converter to generate console fonts from BDF source fonts |
bibclean |
BibTeX bibliography prettyprinter and syntax checker |
bibletime |
Qt Bible-study application using the SWORD library |
bibutils |
Interconverts between various bibliography formats using common XML intermediate |
binfind |
Search files for a byte sequence specified on the command line |
blahtexml |
TeX-to-MathML converter |
blogc |
A blog compiler |
bogosort |
A file sorting program which uses the bogosort algorithm |
build-docbook-catalog |
DocBook XML catalog auto-updater |
c2ps |
Generates a beautified ps document from a source file (c/c++) |
calibre |
Ebook management application |
capyt |
A python3 CLI utility to interface with cpy.pt paste service |
catdoc |
Converter for Microsoft Word, Excel, PowerPoint and RTF files to text |
cb2bib |
Tool for extracting unformatted bibliographic references |
cedilla |
UTF-8 to postscript converter |
chasen |
Japanese Morphological Analysis System, ChaSen |
cherrytree |
A hierarchical note taking application (C++ version) |
cmark |
CommonMark parsing and rendering library and program in C |
cmigemo |
Migemo library implementation in C |
code2html |
Converts source files to colored HTML output |
convertlit |
CLit converts MS ebook .lit files to .opf (xml+html+png+jpg) |
convmv |
convert filenames to utf8 or any other charset |
coolreader |
CoolReader - reader of eBook files (fb2,epub,htm,rtf,txt) |
cpdf |
A command line tool for manipulating PDF files |
crf++ |
Yet Another CRF toolkit for segmenting/labelling sequential data |
crm114 |
A powerful text processing tool, mainly used for spam filtering |
cuneiform |
An enterprise quality OCR engine by Cognitive Technologies |
cwtext |
Text to Morse Code converter |
dbacl |
Digramic Bayesian text classifier |
dblatex |
Transform DocBook using TeX macros |
delta |
Heuristically minimizes interesting files |
dictd |
Dictionary Client/Server for the DICT protocol |
diction |
Diction and style checkers for english and german texts |
diff-pdf |
A simple tool for visually comparing two PDF files |
diffpdf |
Program that textually or visually compares two PDF files |
ding |
Tk based dictionary (German-English) (incl. dictionary itself) |
discount |
A Markdown-to HTML translator written in C |
djview |
Portable DjVu viewer using Qt |
djvu |
DjVu viewers, encoders and utilities |
docbook2X |
Tools to convert docbook to man and info |
docbook-dsssl-stylesheets |
DSSSL Stylesheets for DocBook |
docbook-sgml-dtd |
Docbook SGML DTD 4.2 |
docbook-sgml-utils |
Shell scripts to manage DocBook documents |
docbook-xml-dtd |
Docbook DTD for XML |
docbook-xml-simple-dtd |
Simplified Docbook DTD for XML |
docbook-xsl-ns-stylesheets |
XSL Stylesheets for Docbook |
docbook-xsl-stylesheets |
XSL Stylesheets for Docbook |
docx2txt |
Convert MS Office docx files to plain text |
dos2unix |
Convert DOS or MAC text files to UNIX format or vice versa |
dvipng |
Translate DVI files into PNG or GIF graphics |
dvipsk |
DVI-to-PostScript translator |
dvisvgm |
Converts DVI files to SVG |
ebook-tools |
Tools for accessing and converting various ebook file formats |
editorconfig-core-c |
EditorConfig core library written in C |
enchant |
Spellchecker wrapping library |
enscript |
Powerful text-to-postscript converter |
epspdf |
GUI and command-line converter for [e]ps and pdf |
epstool |
Creates or extracts preview images in EPS files, fixes bounding boxes |
evince |
Simple document viewer for GNOME |
expander |
Expander is a utility that acts as a filter for text editors |
extract_url |
extracts URLs from correctly-encoded MIME email messages or plain text |
fb2edit |
Create and edit fb2 books |
fblog |
Small command-line JSON Log viewer |
fbpdf |
framebuffer pdf and djvu viewer |
fbreader |
E-Book Reader. Supports many e-book formats |
fictionup |
A command-line markdown to fb2 convertor |
flpsed |
Pseudo PostScript editor |
foliate |
gtk ebook reader built with gjs |
ghostscript-gpl |
Interpreter for the PostScript language and PDF |
gnome-doc-utils |
A collection of documentation utilities for the Gnome project |
gocr |
An OCR (Optical Character Recognition) reader |
grip |
Preview GitHub Markdown files like Readme locally before committing them |
groonga |
An Embeddable Fulltext Search Engine |
groonga-normalizer-mysql |
Groonga plugin that provides MySQL compatible normalizers |
grutatxt |
A converter from plain text to HTML and other markup languages |
gspell |
Spell check library for GTK+ applications |
gtkspell |
Spell checking widget for GTK |
gtranslator |
GNOME Translation Editor |
gv |
Viewer for PostScript and PDF documents using Ghostscript |
hd2u |
Dos2Unix like text file converter |
highlight |
Converts source code to formatted text (HTML, LaTeX, etc.) with syntax highlight |
hnb |
A program to organize many kinds of data in one place |
htag |
random signature maker |
html2text |
HTML to text converter |
html401 |
DTDs for the HyperText Markup Language 4.01 |
htmlc |
HTML template files expander |
htmldoc |
Convert HTML pages into a PDF document |
htmlinc |
HTML Include System by Ulli Meybohm |
htmlmin |
A configurable HTML Minifier with safety features |
htmlrecode |
Recodes HTML file using a new character set |
htmltidy |
Tidy the layout and correct errors in HTML and XML documents |
html-xml-utils |
A number of simple utilities for manipulating HTML and XML files |
htp |
An HTML preprocessor |
hunspell |
Spell checker, morphological analyzer library and command-line tool |
hyperestraier |
a full-text search system for communities |
iso-codes |
ISO language, territory, currency, script codes and their translations |
itex2mml |
A LaTeX into XHTML/MathML converter |
jabref |
Java GUI for managing BibTeX and other bibliographies |
jabref-bin |
Java GUI for managing BibTeX and other bibliographies |
jo |
JSON output from a shell |
kbibtex |
BibTeX editor to edit bibliographies used with LaTeX |
kjots |
Note taking utility by KDE |
krop |
A tool to crop PDF files |
languagetool |
A proof-reading tool for many languages |
lcdf-typetools |
Font utilities for eg manipulating OTF |
lesspipe |
a preprocessor for less |
letterize |
Generate English-plausible alphabetic mnemonics for a phone number |
libabw |
Library parsing abiword documents |
libebook |
Library parsing various ebook formats |
libepubgen |
EPUB generator for librevenge |
libetonyek |
Library parsing Apple Keynote presentations |
libexttextcat |
Library implementing N-gram-based text categorization |
libgepub |
GObject based library for handling and rendering epub documents |
libgxps |
Library for handling and rendering XPS documents |
liblangtag |
An interface library to access tags for identifying languages |
libmspub |
Library parsing Microsoft Publisher documents |
libmwaw |
Library parsing many pre-OSX MAC text formats |
libnumbertext |
Number to number name and money text conversion libraries |
libodfgen |
Library to generate ODF documents from libwpd and libwpg |
libpaper |
Library for handling paper characteristics |
libqxp |
Library parsing QuarkXpress documents |
libspectre |
Library for rendering Postscript documents |
libstaroffice |
Import filter for old StarOffice documents |
libwpd |
WordPerfect Document import/export library |
libwpg |
C++ library to read and parse graphics in WPG |
libwps |
Microsoft Works file word processor format import filter library |
libxmlpatch |
A set of tools to create and apply patch to XML files using XPath |
linuxdoc-tools |
A toolset for processing LinuxDoc DTD SGML files |
llpp |
graphical PDF viewer which aims to superficially resemble less(1) |
logmerge |
Small and powerful script to merge two or more logfiles |
lout |
High-level language for document formatting |
lv |
Powerful Multilingual File Viewer |
mandoc |
Suite of tools compiling mdoc and man |
manpager |
Enable colorization of man pages |
master-pdf-editor |
A complete solution for viewing and editing PDF files |
mathtex |
Lets you easily embed LaTeX math in your own html pages, blogs, wikis, etc |
mecab |
Yet Another Part-of-Speech and Morphological Analyzer |
mpage |
Many to one page printing utility |
msort |
A program for sorting files in sophisticated ways |
multitail |
Tail with multiple windows |
mupdf |
A lightweight PDF viewer and toolkit written in portable C |
mythes |
A simple thesaurus for Libreoffice |
namazu |
Namazu is a full-text search engine |
nfoview |
Simple viewer for NFO files, which are ASCII art in the CP437 codepage |
nuspell |
Spell checker library and CLI for complex natural languages |
o3read |
Converts OpenOffice formats to text or HTML |
ocrad |
GNU Ocrad is an OCR (Optical Character Recognition) program |
odt2txt |
A simple converter from OpenDocument Text to plain text |
openjade |
Jade is an implementation of DSSSL for formatting SGML and XML documents |
openpaperwork-core |
Core part of Paperwork (plugin management) |
openpaperwork-gtk |
Paperwork plugins |
opensp |
A free, object-oriented toolkit for SGML parsing and entity management |
pandoc |
Conversion between markup formats |
paperwork |
a personal document manager for scanned documents (and PDFs) |
paperwork-backend |
Backend part of Paperwork (Python API, no UI) |
paps |
Unicode-aware text to PostScript converter |
par |
a paragraph reformatter, vaguely similar to fmt, but better |
pastebinit |
A software that lets you send anything you want directly to a pastebin |
pdf2html |
Converts pdf files to html files |
pdf2oo |
Converts pdf files to odf |
pdfarranger |
Merge or split pdfs; rearrange, rotate, crop pages. |
pdfgrep |
A tool similar to grep which searches text in PDFs |
pdfjam |
pdfnup, pdfjoin and pdf90 |
pdfminer |
Python tool for extracting information from PDF documents |
pdfsandwich |
generator of sandwich OCR pdf files |
pdftk |
gcj-free version of pdftk written in Java |
pelican |
A tool to generate a static blog, with restructured text or markdown input files |
pep |
General purpose filter and file cleaning program |
pinfo |
Hypertext info and man viewer based on (n)curses |
po4a |
Tools to ease the translation of documentation |
podofo |
PoDoFo is a C++ library to work with the PDF file format |
poppler |
PDF rendering library based on the xpdf-3.0 code base |
poppler-data |
Data files for poppler to support uncommon encodings without xpdfrc |
ps2eps |
Generate Encapsulated Postscript Format files from one-page Postscript documents |
ps2pkm |
Tool that converts a PostScript type1 font into a corresponding TeX PK font |
psiconv |
An interpreter for Psion 5(MX) file formats |
pspdftool |
Tool for prepress preparation of PDF and PostScript documents |
pspresent |
A tool to display full-screen PostScript presentations |
pstotext |
Extract ASCII text from a PostScript or PDF file |
psutils |
PostScript Utilities |
pytextile |
A Python port of Textile, A humane web text generator |
qpdf |
Command-line tool for structural, content-preserving transformation of PDF files |
qpdfview |
A tabbed document viewer |
q-text-as-data |
A CLI tool that allows direct execution of SQL-like queries on text |
rarian |
A documentation metadata library |
recode |
Convert files between various character sets |
reed |
This is a text pager (text file viewer), used to display etexts |
refbase |
Web-based solution for managing scientific literature, references and citations |
restview |
reStructuredText viewer |
rman |
PolyGlotMan man page translator AKA RosettaMan |
rnc2rng |
RELAX NG Compact to regular syntax conversion library |
rnv |
A lightweight Relax NG Compact Syntax validator |
robodoc |
Automating Software Documentation |
ronn |
Converts simple, human readable textfiles to roff for terminal display, and HTML |
rpl |
Intelligent recursive search/replace utility |
rtf2html |
RTF to HTML converter |
sablotron |
An XSLT Parser in C++ |
scdoc |
Standalone tool for generating man pages with a simple syntax |
scrollkeeper-dtd |
DTD from the Scrollkeeper package |
sdcv |
Console version of Stardict program |
sgml-common |
Base ISO character entities and utilities for SGML |
sgrep |
Use structural criteria to grep and index text, SGML, XML and HTML and filter |
sigil |
Multi-platform WYSIWYG ebook editor for ePub format |
simple-fb2-reader |
A simple gtk3 reader for fb2 ebooks |
sloccount |
Tools for counting Source Lines of Code (SLOC) for a large number of languages |
spellutils |
spellutils includes 'newsbody' (useful for spellchecking in mails, etc.) |
stardict |
A international dictionary supporting fuzzy and glob style matching |
sword |
Library for Bible reading software |
sword-modules |
All the unlocked modules for app-text/sword, grouped by language |
t1utils |
Type 1 Font utilities |
tabler |
A utility to create text art tables from delimited input |
talkfilters |
Convert ordinary English text into text that mimics a stereotyped dialect |
teckit |
Text Encoding Conversion toolkit |
teseq |
A tool for analyzing files that contain control characters and sequences |
tessdata_best |
Most accurate trained models for app-text/tesseract |
tessdata_fast |
Fast integer versions of trained models for app-text/tesseract |
tessdata_legacy |
Trained models for app-text/tesseract compatible with the legacy engine |
tesseract |
An OCR Engine, originally developed at HP, now open source. |
texi2html |
Perl script that converts Texinfo to HTML |
texlive |
A complete TeX distribution |
texlive-core |
A complete TeX distribution |
tidy-html5 |
Tidy the layout and correct errors in HTML, HTML5 and XML documents |
tkinfo |
Info Browser in TK |
tkman |
TkMan man and info page browser |
tofrodos |
Utility that converts ASCII files between the MSDOS and the Unix format |
tokyodystopia |
A fulltext search engine for Tokyo Cabinet |
trang |
Multi-format schema converter based on RELAX NG |
tree |
Lists directories recursively, and produces an indented listing of files |
ttf2pk2 |
Freetype 2 based TrueType font to TeX's PK format converter |
ttf2pt1 |
True Type Font to Postscript Type 1 Converter |
txt2man |
Scripts to convert regular ASCII text to man pages |
txt2pdbdoc |
Text/HTML to Doc file converter for the Palm Pilot |
txt2tags |
Generate marked up documents (HTML, etc.)from a plain text file with markup |
u2ps |
A text to PostScript converter like a2ps, but supports UTF-8 |
unac |
Library and command-line tool for removing accents from characters |
unpaper |
Post-processor for scanned and photocopied book pages |
unrtf |
Converts RTF files to various formats |
uudeview |
uu, xx, base64, binhex decoder |
vgrep |
A pager for grep, git-grep and similar grep implementations |
vilistextum |
HTML to ASCII converter programmed to handle incorrect html |
wdiff |
Create a diff disregarding formatting |
webgen |
A template-based static website generator |
wgetpaste |
Command-line interface to various pastebins |
wiki2beamer |
Tool to produce LaTeX Beamer code from wiki-like input |
writerperfect |
Various formats to Open document format converter |
wscr |
A Lightweight and Fast Anagram Solver |
wv |
Tool for conversion of MSWord doc and rtf files to something readable |
wv2 |
Excellent MS Word filter lib, used in most Office suites |
xapers |
Personal document indexing system |
xapian-omega |
An application built on Xapian, consisting of indexers and a CGI search frontend |
xchm |
Utility for viewing Compiled HTML Help (CHM) files |
xdvik |
DVI previewer for X Window System |
xhtml1 |
DTDs for the eXtensible HyperText Markup Language 1.0 |
xhtml11 |
DTDs for the eXtensible HyperText Markup Language 1.0 |
xiphos |
A Gtk+-based Bible-study frontend for SWORD |
xlhtml |
Convert MS Excel and Powerpoint files to HTML |
xlsx2csv |
Convert MS Office xlsx files to CSV |
xml2 |
These tools are used to convert XML and HTML to and from a line-oriented format |
xml2doc |
Tool to convert simple XML to a variety of formats (pdf, html, txt, manpage) |
xmldiff |
A tool that figures out the differences between two similar XML files |
xmlformat |
Reformat XML documents to your custom style |
XML-Schema-learner |
Algorithmic inferencing of XML schema definitions and DTDs |
xmlstarlet |
A set of tools to transform, query, validate, and edit XML documents |
xmlto |
script for converting XML and DocBook documents to a variety of output formats |
xournal |
An application for notetaking, sketching, and keeping a journal using a stylus |
xournalpp |
Handwriting notetaking software with PDF annotation support |
xpdf |
The PDF viewer and tools |
yelp-tools |
Collection of tools for building and converting documentation |
yodl |
Your Own Document Language: a pre-document language and tools to process it |
zathura |
A highly customizable and functional document viewer |
zathura-cb |
Comic book plug-in for zathura with 7zip, rar, tar and zip support |
zathura-djvu |
DjVu plug-in for zathura |
zathura-meta |
Meta package for app-text/zathura plugins |
zathura-pdf-mupdf |
PDF plug-in for zathura |
zathura-pdf-poppler |
PDF plug-in for zathura |
zathura-ps |
PostScript plug-in for zathura |