-
regex
regular expressions for Rust. This implementation uses finite automata and guarantees linear time matching on all inputs.
-
unicode-width
Determine displayed width of char and str types according to Unicode Standard Annex #11 rules
-
textwrap
word wrapping, indenting, and dedenting strings. Has optional support for Unicode and emojis as well as machine hyphenation.
-
tokenizers
today's most used tokenizers, with a focus on performance and versatility
-
ropey
A fast and robust text rope for Rust
-
encoding_rs
A Gecko-oriented implementation of the Encoding Standard
-
similar
A diff library for Rust
-
convert_case
Convert strings into any case
-
fancy-regex
regexes, supporting a relatively rich set of features, including backreferences and look-around
-
const_format
Compile-time string formatting
-
lopdf
PDF document manipulation
-
tabled
An easy to use library for pretty-printing tables of Rust structs and enums
-
heck
case conversion library
-
pulldown-cmark
A pull parser for CommonMark
-
unicode-normalization
functions for normalization of Unicode strings, including Canonical and Compatible Decomposition and Recomposition, as described in Unicode Standard Annex #15
-
lazy-regex
lazy static regular expressions checked at compile time
-
deunicode
Convert Unicode strings to pure ASCII by intelligently transliterating them. Supports Emoji and Chinese.
-
sqlformat
Formats whitespace in a SQL string to make it easier to read
-
unicode-segmentation
Grapheme Cluster, Word and Sentence boundaries according to Unicode Standard Annex #29 rules
-
rustybuzz
A complete harfbuzz shaping algorithm port to Rust
-
ansi-to-tui
Convert ANSI color and style codes into Ratatui Text
-
html2text
Render HTML as plain text
-
mime_guess
detection of a file's MIME type by its extension
-
termimad
Markdown Renderer for the Terminal
-
prometheus-client
Open Metrics client library allowing users to natively instrument applications
-
widestring
wide string Rust library for converting to and from wide strings, such as those often used in Windows API or other FFI libraries. Both u16 and u32 string types are provided, including support for UTF-16 and UTF-32…
-
nucleo-matcher
plug and play high performance fuzzy matcher
-
printpdf
reading and writing PDF files
-
const-str
compile-time string operations
-
fuzzy-matcher
Fuzzy Matching Library
-
unicase
A case-insensitive wrapper around strings
-
indenter
A formatter wrapper that indents the text, designed for error display impls
-
strip-ansi-escapes
Strip ANSI escape sequences from byte streams
-
diff
An LCS based slice and string diffing implementation
-
lindera
A morphological analysis library
-
text-splitter
Split text into semantic chunks, up to a desired chunk size. Supports calculating length by characters and tokens, and is callable from Rust and Python.
-
grok
popular Java & Ruby grok library which allows easy text and log file processing with composable patterns
-
lngcnv
linguistics: display pronunciation, translate between dialects, convert between orthographies; support for multiple languages: English, Latin, Polish, Quechua, Spanish, Tikuna
-
onig
Rust-Onig is a set of Rust bindings for the Oniguruma regular expression library. Oniguruma is a modern regex library with support for multiple character encodings and regex syntaxes.
-
unicode-truncate
Unicode-aware algorithm to pad or truncate str in terms of displayed width
-
regress
A regular expression engine targeting EcmaScript syntax
-
pulldown-cmark-to-cmark
Convert pulldown-cmark Events back to the string they were parsed from
-
roff
ROFF (man page format) generation library
-
emojis
✨ Lookup emoji in *O(1)* time, access metadata and GitHub shortcodes, iterate over all emoji, and more!
-
finl_unicode
handling Unicode functionality for finl (categories and grapheme segmentation)
-
garde
Validation library
-
chardetng
A character encoding detector for legacy Web content
-
jieba-rs
The Jieba Chinese Word Segmentation Implemented in Rust
-
protobuf
Protocol Buffers - Google's data interchange format
-
charabia
detect the language, tokenize the text and normalize the tokens
-
text-size
Newtypes for text offsets
-
titlecase
Capitalize text according to a style defined by John Gruber for Daring Fireball
-
unicode-script
exposes the Unicode Script and Script_Extension properties from UAX #24
-
diffy
Tools for finding and manipulating differences between files
-
synoptic
low-level, syntax highlighting library with unicode support
-
os_display
Display strings in a safe platform-appropriate way
-
focaccia
no_std implementation of Unicode case folding comparisons
-
unescaper
Unescape strings with escape sequences written out as literal characters
-
line-index
Maps flat TextSize offsets to/from (line, column) representation
-
lipsum
lorem ipsum text generation library. It generates pseudo-random Latin text. Use this if you need filler or dummy text for your application. The text is generated using a simple Markov chain…
-
unicode-general-category
Fast lookup of the Unicode General Category property for char
-
unicode-xid
Determine whether characters have the XID_Start or XID_Continue properties according to Unicode Standard Annex #31
-
rphonetic
Rust port of phonetic Apache commons-codec algorithms
-
cow-utils
Copy-on-write string utilities for Rust
-
pdf-extract
extract content from pdfs
-
regex-syntax
A regular expression parser
-
unicode-reverse
Unicode-aware in-place string reversal
-
rxing
port of the zxing barcode library
-
heatseeker
A fast, robust, and portable fuzzy finder
-
harfrust
A complete HarfBuzz shaping algorithm port to Rust
-
typstyle
The CLI for Typstyle
-
ferritin
Human-friendly CLI for browsing Rust documentation
-
Inflector
Adds String based inflections for Rust. Snake, kebab, camel, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize…
-
prettydiff
Side-by-side diff for two files
-
stfu8
Sorta Text Format in UTF-8
-
stringzilla
Search, hash, sort, fingerprint, and fuzzy-match strings faster via SWAR, SIMD, and GPGPU
-
mdbook-katex
mdBook preprocessor rendering LaTeX equations to HTML
-
rumdl
A fast Markdown linter written in Rust (Ru(st) MarkDown Linter)
-
regex-cursor
regex fork that can search discontiguous haystacks
-
emojic
Emoji constants
-
any_ascii
Unicode to ASCII transliteration
-
edit
Open a file in the default text editor
-
zawk
An efficient Awk-like language implementation in Rust with stdlib
-
spellbook
A spellchecking library compatible with Hunspell dictionaries
-
html2md
binary to convert simple html documents into markdown
-
stop-words
Common stop words in many languages
-
marmite
easiest static site generator
-
omekasy
Decorate alphanumeric characters in your input with various fonts; special characters in Unicode
-
textsurf
Webservice for efficiently serving multiple plain text documents or excerpts thereof (by unicode character offset), without loading everything into memory
-
unicode-properties
Query character Unicode properties according to UAX #44 and UTR #51
-
uncased
Case-preserving, ASCII case-insensitive, no_std string types
-
crop
A pretty fast text rope
-
unicode_names2
Map characters to and from their name given in the Unicode standard. This goes to great lengths to be as efficient as possible in both time and space, with the full bidirectional tables weighing barely 500 KB…
-
dwrote
Lightweight binding to DirectWrite
-
boreal
evaluate YARA rules, used to scan bytes for textual and binary pattern
-
html-to-markdown-cli
Command-line interface for html-to-markdown - high-performance HTML to Markdown converter
-
mdbook-pdf
A backend for mdBook written in Rust for generating PDF based on headless chrome and Chrome DevTools Protocol
-
decancer
that removes common unicode confusables/homoglyphs from strings
-
igrepper
The interactive grepper
-
epub-builder
generating EPUB files
-
unicode-joining-type
Fast lookup of the Unicode Joining Type and Joining Group properties
-
stringcase
Converts string cases between camelCase, COBOL-CASE, kebab-case, and so on
-
icu_pattern
ICU pattern utilities
-
str_indices
Count and convert between indexing schemes on string slices
-
olpc-cjson
serde_json Formatter to serialize as OLPC-style canonical JSON
-
hypher
separates words into syllables
-
fax
Decoder and Encoder for CCITT Group 3 and 4 bi-level image encodings used by fax machines TIFF and PDF
-
font-types
Scalar types used in fonts
-
markdown_timesheet
processing markdown files to extract and format timesheet data
-
htop
HTML to PDF converter
-
mdbook-epub
An EPUB renderer for mdbook
-
giallo
A code highlighter giving the same output as VSCode
-
htmd
A turndown.js inspired HTML to Markdown converter
-
diff-match-patch-rs
The fastest implementation of Myers' diff algorithm to perform the operations required for synchronizing plain text
-
hgrep
grep tool with human-friendly search output. This is similar to the -C option of the grep command, but its output is enhanced with syntax highlighting focusing on human readable outputs.
-
kak-lsp
Kakoune Language Server Protocol Client
-
hck
A sharp cut(1) clone
-
usage-lib
working with usage specs
-
lsp-textdocument
A LSP text documents manager that map of text document
-
sanitizer
A collection of methods and macros to sanitize struct fields
-
minimizer
Minimize files to find minimal test case
-
xan
The CSV magician
-
sdml-cli
Rust CLI for Simple Domain Modeling Language (SDML)
-
chewing
(酷音) intelligent Zhuyin input method
-
entities
raw data needed to convert to and from HTML entities
-
aptu-cli
CLI for Aptu - Gamified OSS issue triage with AI assistance
-
simdnbt
an unnecessarily fast nbt decoder
-
kbremap
Custom keyboard layouts for windows
-
arrow-string
String kernels for arrow arrays
-
mdbook-admonish
A preprocessor for mdbook to add Material Design admonishments
-
mupdf
Safe Rust wrapper to MuPDF
-
uwc
Counts things in unicode text files
-
lindera-tantivy
Lindera Tokenizer for Tantivy
-
idna
IDNA (Internationalizing Domain Names in Applications) and Punycode
-
glob-match
An extremely fast glob matcher
-
mdbook-yapp
mdBook preprocessor for simple text replacements
-
languagetool-rust
LanguageTool API bindings in Rust
-
mime-infer
detection of a file's MIME type by its extension
-
difflib
Port of Python's difflib library to Rust
-
arborium-cli
Command-line syntax highlighter powered by arborium
-
unicode-blocks
contains a list of all unicode blocks and provides some functions to search across them
-
deno_media_type
Media type used in Deno
-
serpl
terminal UI for search and replace, ala VS Code
-
repgrep
An interactive command line replacer for ripgrep
-
readable-name-generator
Generate a readable name for throwaway infrastructure
-
matchers
Regex matching on character and byte streams
-
tiefdownconverter
A CLI tool to manage and convert Markdown-based projects
-
sliceslice
A fast implementation of single-pattern substring search using SIMD acceleration
-
inlinable_string
inlinable_string crate provides the InlinableString type – an owned, grow-able UTF-8 string that stores small strings inline and avoids heap-allocation – and the StringExt trait…
-
mktoc
Generate Table of Contents from Markdown files
-
esed
Easy sed
-
wana_kana
checking and converting between Japanese characters - Kanji, Hiragana, Katakana - and Romaji
-
hyperlink
Very fast link checker for CI
-
in_definite
Get the indefinite article ('a' or 'an') to match the given word. For example: an umbrella, a user.
-
at-commands
AT Commands builder and parser for Rust #![no_std]
-
textdistance
Lots of algorithms to compare how similar two sequences are
-
rst_parser
a reStructuredText parser
-
qpdf
Rust bindings to QPDF C++ library
-
line-ending
Detect, normalize, and convert line endings across platforms, including support for character streams. Ensures consistent handling of LF, CRLF, and CR line endings in text processing.
-
unicode_titlecase
add Unicode titlecase and Turkish and Azeri locale upper/lowercase utilities to chars and strings
-
savvy
R extension interface
-
hongdown
A Markdown formatter that enforces Hong Minhee's Markdown style conventions
-
terraphim_automata
Automata for searching and processing knowledge graphs
-
codebook
A code-aware spell checker library (dependency for codebook-lsp)
-
rschess
chess library with the aim to be as feature-rich as possible
-
markdown-tool
A CLI utility for converting Markdown into AST and vice versa
-
boxen
creating styled terminal boxes around text with performance optimizations
-
red-sed
An experimental drop-in replacement for GNU sed, written in Rust
-
scrape-cli
Command-line HTML extraction tool powered by scrape-rs
-
collclean
Clean up collaboration commands in LaTeX files
-
iepub
Read and write EPUB and MOBI e-books
-
gitfluff
Commit message linting tool with presets, custom formats, and cleanup automation
-
cruet
Adds String based inflections for Rust. Snake, kebab, camel, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize…
-
hyphenation
Knuth-Liang hyphenation for a variety of languages
-
unicode-id
Determine whether characters have the ID_Start or ID_Continue properties according to Unicode Standard Annex #31
-
base-d
Universal base encoder: Encode binary data to 33+ dictionaries including RFC standards, hieroglyphs, emoji, and more
-
sapling-streampager
streampager is a pager for command output or large files
-
nuch
A CLI to manage Markdown content and images for Nuxt Content sites
-
jx
An interactive JSON explorer for the command line
-
simple-string-patterns
Makes it easier to match, split and extract strings in Rust without regular expressions. The parallel string-patterns crate provides extensions to work with regular expressions via the Regex library
-
bashtestmd
Compiles shell commands in .md files into Bash scripts for testing
-
buup
Core transformation library with zero dependencies
-
decasify
A CLI utility and library to cast strings to title-case according to locale specific style guides including Turkish support
-
zhconv
Traditional, Simplified and regional Chinese variants converter powered by MediaWiki & OpenCC rulesets and the Aho-Corasick algorithm 中文简繁及地區詞轉換
-
mdbook-indexing
mdbook preprocessor for index generation
-
treegrep
regex pattern matcher that displays results in a tree structure with an interface to jump to matched text
-
yggdrasil-cli
Yggdrasil is a project flattener and diff engine that turns any subset of your codebase into a single AI-ready codex (index + contents), or compares snapshots with annotated diffs
-
dptran
run DeepL translations on the command line, written in Rust
-
mdbook-kroki-preprocessor
render kroki diagrams from files or code blocks in mdbook
-
todo_lib
Collection of utilities for todo.txt format
-
kazoe
Fast wc replacement
-
kas-text
Text layout and font management
-
unicode-case-mapping
Fast lowercase, uppercase, and titlecase mapping for characters
-
mlc
The markup link checker (mlc) checks for broken links in markup files
-
ai-translator
AI-based multilingual text translation tool with support for custom prompts
-
harper-core
The language checker for developers
-
type-safe-id
A type-safe, K-sortable, globally unique identifier
-
awabi
A morphological analyzer using mecab dictionary
-
termdiff
Write a diff with color codes to a string
-
deno_doc
doc generation for deno
-
sara-cli
CLI for Sara - Requirements Knowledge Graph
-
srgn
A grep-like tool which understands source code syntax and allows for manipulation in addition to search
-
matcher_rs
A high-performance matcher designed to solve LOGICAL and TEXT VARIATIONS problems in word matching, implemented in Rust
-
md-tui
A terminal markdown viewer
-
mintyml
A minimalist alternative syntax to HTML
-
ferris-says
flavored replacement for the classic cowsay
-
presenterm
A terminal slideshow presentation tool
-
near-facsimile
Find similar or identical text files in a directory
-
rdfless
A colorful pretty printer for RDF (Turtle/TriG/N-Triples/N-Quads/PROV-N) with ANSI colors
-
typst-kit
Common utilities for Typst tooling
-
quixote
Quizzes and tests in Markdown
-
mq-lang
Core language implementation for mq query language
-
uncomment
A CLI tool to remove comments from code using tree-sitter for accurate parsing
-
stylin
Convert markdown to pandoc markdown with custom styles
-
mad-useful
A multi-tool utility for file operations and text processing
-
turbovault-parser
Obsidian Flavored Markdown (OFM) parser
-
bloodtree
A hierarchical note-taking system with focus on relationships between nodes
-
trueno-rag
Pure-Rust Retrieval-Augmented Generation pipeline built on Trueno
-
graphannis
new backend implementation of the ANNIS linguistic search and visualization system
-
oranda
🎁 generate beautiful landing pages for your projects
-
inlyne
Introducing Inlyne, a GPU powered yet browserless tool to help you quickly view markdown files in the blink of an eye
-
str-utils
some traits to extend [u8], str and Cow<str>
-
mdbook-plantuml
A preprocessor for mdbook which will convert plantuml code blocks into inline SVG diagrams
-
arf-strings
Encoding and decoding for ARF strings
-
wit_owo
interacting with the Wit.ai API
-
madato
command line tool for reading and writing tabular data (XLS, ODS, CSV, YAML), and Markdown
-
markdown2pdf
Create PDF with Markdown files (a md to pdf transpiler)
-
unicode_categories
Query Unicode category membership for chars
-
markdown-peek
Markdown previewer in browser and terminal
-
patchkit
parsing and manipulating patch files
-
aki-xcat
concatenate files that are plain, gzip, xz and zstd
-
ngrammatic
Character-oriented ngram generator and fuzzy matching library
-
zalgo-codec
Convert an ASCII text string into a single unicode grapheme cluster and back. Provides a macro for embedding Rust source code that has been encoded in this way.
-
treelog
A highly customizable, optimized, and modular tree rendering library
-
datafusion-functions
Function packages for the DataFusion query engine
-
wordcut-engine
Word segmentation/breaking library
-
ascii
ASCII-only equivalents to char, str and String
-
ul/kak-lsp
Kakoune Language Server Protocol Client
-
unicode-casing
Titlecase helper function on characters
-
norad
Read and write Unified Font Object files
-
blockwatch
Language agnostic linter that keeps your code and documentation in sync and valid
-
chat-gpt-lib-rs
interacting with OpenAI's ChatGPT API, providing a simple interface to make API requests and handle responses
-
hermes-tool
CLI tools for Hermes - index management, simhash, sorting, and data processing
-
rapidfuzz
rapid fuzzy string matching library
-
koji
An interactive CLI for creating conventional commits
-
ctj
A command-line tool to convert CSV to JSON written in Rust
-
normy
Ultra-fast, zero-copy text normalization for Rust NLP pipelines & tokenizers
-
stringdex
A suffixtree search system for static sites
-
inflection-rs
Inflection is a string transformation library. It singularizes and pluralizes English words, and transforms strings from CamelCase to underscored string.
-
lindera-ko-dic-builder
A Korean morphological dictionary builder for ko-dic
-
reword
some utility functions for human-readable formatting of words
-
mime_guess2
detection of a file's MIME type by its extension
-
icy_sauce
handling SAUCE – Standard Architecture for Universal Comment Extensions
-
rake
Rapid Automatic Keyword Extraction (RAKE) algorithm
-
quickmark-cli
Lightning-fast Markdown/CommonMark linter CLI tool with tree-sitter based parsing
-
rds2rust
A pure Rust library for reading and writing R's RDS (R Data Serialization) files without requiring an R runtime
-
dvd-term
A bouncing ASCII art DVD logo (or custom text) for the terminal
-
tossicat
A library that converts an accompanying Korean particle (tosi/josa) into the form appropriate for the input word
-
writ
A hybrid markdown editor combining raw text editing with live inline rendering
-
ncount
A word count tool intended to derive useful stats from markdown
-
phs
Runtime for Phlow Script, PHS
-
vaporetto
pointwise prediction based tokenizer
-
llmvm-core
The core application for llmvm
-
indefinite
Prefix a noun with an indefinite article - a or an - based on whether it begins with a vowel
-
awful_dataset_builder
Build LLM-ready Q/A datasets from reference text-to-question mappings produced by Awful Knowledge Synthesizer
-
memory-indexer
An in-memory full-text fuzzy search indexer
-
linkify
Finds URLs and email addresses in plain text. Takes care to get the boundaries right with surrounding punctuation like parentheses.
-
apisnip
A terminal user interface (TUI) tool for trimming OpenAPI specifications down to size ✂️
-
whitespace-sifter
Sift duplicate whitespaces away!
-
yake-rust
Yake (Yet Another Keyword Extractor) in Rust
-
lgtmeow
🐾 "This cat thinks it looks pretty good~"
-
epub
support the reading of epub files
-
textra
type less, write more
-
rsxxd
hex-dump-type utility rsxxd
-
levenshtein_automata
Creates Levenshtein Automata in an efficient manner
-
cirru_parser
Parser for Cirru text syntax
-
roman-numerals-rs
Manipulate well-formed Roman numerals
-
askalono
detect the contents of license files
-
tantivy-common
common traits and utility functions used by multiple tantivy subcrates
-
twilight-mention
working with mentions in the Twilight ecosystem
-
asimov-prompt
ASIMOV Software Development Kit (SDK) for Rust
-
retrofont-cli
CLI for retrofont: Render and convert retro ASCII/ANSI art fonts
-
bulletty
a pretty TUI feed reader (RSS+ATOM) that stores articles locally as Markdown files
-
rhai-autodocs
Custom documentation generator for the Rhai scripting language
-
skyspell
Fast and handy spell checker for the command line
-
obsidian-logging
A journaling/logging CLI that stores logs in Obsidian markdown files
-
rich_rust
port of Python's Rich library for beautiful terminal output
-
crowbook
Render a Markdown book in HTML, PDF or Epub
-
tabprinter
creating and printing formatted tables in the terminal. It supports various table styles and offers both color and non-color output options.
-
uroman
A self-contained Rust reimplementation of the uroman universal romanizer
-
asimov-cli
ASIMOV Command-Line Interface (CLI)
-
mdbook-mermaid
mdbook preprocessor to add mermaid support
-
litho-book
Litho Book is a modern web documentation reader specifically designed for the Litho (deepwiki-rs) documentation generation engine. It provides…
-
wildcard
matching
-
pomsky
A new regular expression language
-
cpf_cnpj
CPF and CNPJ validator for Rust
-
svgdx-pandoc
pandoc filter for svgdx codeblocks in Markdown
-
flickzeug
A fork of diffy: diff, patch, and merge library featuring Myers' algorithm, unified diff format parsing, fuzzy patch application, and three-way merge with conflict detection
-
mdcat
cat for markdown: Show markdown documents in terminals
-
aico-cli
Scriptable control over LLMs from the terminal
-
swappy
An anagram generator
-
semtools
Semantic search and document parsing tools for the command line
-
unindent
Remove a column of leading whitespace from a string
-
sentencex
Sentence segmentation library with wide language support optimized for speed and utility
-
inflections
High performance inflection transformation library for changing properties of words like the case
-
makepad-widgets
Makepad widgets
-
wayland-clipboard-listener
impl wlr-data-control-unstable-v1, listen for clipboard
-
aasvg
Convert ASCII art diagrams to SVG with automatic light/dark mode support
-
markdown-harvest
designed to extract, clean, and convert web content from URLs found in text messages into clean Markdown format. Originally created as an auxiliary component for Retrieval-Augmented Generation (RAG)…
-
seeyou-cub
reading and writing the SeeYou CUB binary file format, which stores airspace data for flight navigation software
-
slugomatic
🐌 A simple CLI tool to slugify and unslugify text, perfect for branch names and URLs
-
cedarwood
efficiently-updatable double-array trie in Rust (ported from cedar)
-
anda_db_hnsw
A high-performance vector search library in Rust
-
fontconfig
Safe, higher-level wrapper around the Fontconfig library
-
agentroot
Fast local semantic search for codebases and knowledge bases with AI-powered features
-
jdpub
Annotate source documents with Japanese readings and definitions
-
wsl-clip
High-performance clipboard bridge for WSL2
-
ttypr
terminal typing practice
-
mdbook-preprocessor
assist implementing an mdBook preprocessor
-
regex-literal
delimited regular expression literals
-
measured
A better way to measure your application statistics
-
dicexp
A Dice Expression Interpreter program and library for parsing (and rolling) role-playing game style dice notations (e.g. "2d8+5")
-
terraphim-cli
CLI tool for semantic knowledge graph search with JSON output for automation
-
cskk
A kana-kanji conversion library using the SKK (Simple Kana Kanji henkan) method, intended for use through a C ABI
-
galm
pattern matching library
-
rustdoc-stripper
manipulate rustdoc comments
-
text2num
Parse and convert numbers written in English, Dutch, Spanish, Portuguese, German, Italian or French into their digit representation
-
yangon
A high-performance, stack-allocated string type for Rust with fixed capacity and zero heap allocations
-
dom-content-extraction
Content extraction via text density paper
-
braillify
Rust-based cross-platform Korean braille translation library
-
cmx
Rust Spectral Color Management Library
-
ib-matcher
A multilingual, flexible and fast string, glob and regex matcher. Supports 拼音匹配 (Chinese pinyin match) and ローマ字検索 (Japanese romaji match).
-
cicero-sophia
High-performance NLU (natural language understanding) engine built in Rust for speed, accuracy, and privacy
-
zepub
Read and write EPUB and MOBI e-books
-
codex
Human-friendly notation for Unicode symbols
-
dug
An exhaustive name resolution aggregator
-
basalt-tui
Basalt TUI application for Obsidian notes
-
merge-code
A CLI tool for merging multiple code files into a single file
-
unicode-security
Detect possible security problems with Unicode usage according to Unicode Technical Standard #39 rules
-
typst-batch
Typst unofficial library for batch processing — friendly API, shared resources, virtual files, customizable diagnostics
-
mdbook-typst-math
An mdbook preprocessor to use typst to render math
-
okh-tool
A CLI tool to deal with Open Know-How (OKH) data files. Its main functionalities are: validation of and conversion between the different formats
-
rsrpp
project for research paper pdf
-
gen-mdbook-summary
generate SUMMARY.md for mdbook project
-
mdbook-shiftinclude
mdbook preprocessor for file inclusion with shift
-
word-tally
Output a tally of the number of times unique words appear in source input
-
txt_to_md
Command converting from a txt file to a markdown file
-
sbnf
A BNF-style language for writing sublime-syntax files
-
mdbook-pandoc
A pandoc-powered mdbook backend
-
cmark-writer
A CommonMark writer implementation in Rust for serializing AST nodes to CommonMark format
-
xrusty
Parse documents and transform using χrust
-
unreal-doc
generating documentation from Unreal C++ sources
-
text_io
really simple to use panicking input functions
-
isbn
handling ISBNs
-
howmany
A blazingly fast, intelligent code analysis tool with parallel processing, caching, and beautiful visualizations
-
src2md
Turn source code into a Markdown document with syntax highlighting, or extract it back
-
blurhash
A pure Rust implementation of blurhash
-
mdbook-quiz-schema
Schema for quizzes used in mdbook-quiz
-
topiary-queries
tree-sitter query files compatible with Topiary
-
sd
An intuitive find & replace CLI
-
alfrusco
building Alfred workflows with Rust
-
sile
Simon’s Improved Layout Engine
-
jetscii
A tiny library to efficiently search strings and byte slices for sets of ASCII characters or bytes
-
unidown
Convert Markdown to Unicode
-
opentalk-types-common-identifiers
Common identifier types for OpenTalk crates
-
pdflens-mcp
An MCP server for reading PDFs, coded by human, designed for AI
-
acdc-parser
AsciiDoc parser using PEG grammars
-
unbom
Remove UTF-8 BOM from files
-
swc_ecma_transformer
Compatibility layer for the ECMAScript standard
-
slice-command
slice is a command-line tool that allows you to slice the contents of a file using syntax similar to Python's slice notation
-
codeprism-core
Core incremental polyglot parser and graph builder for code analysis
-
gulagcleaner_rs
Ad removal tool for PDFs
-
zspell
Native Rust library for spellchecking
-
drova_plugins
Main plugins for drova
-
patto
🪽 Yet another plain text format for quick note taking and task management
-
panfix
parsing: linear time parsing of multifix operators
-
zettel-cli
cli app for Luhmann-style Zettelkasten management
-
regexr
A high-performance regex engine built from scratch with JIT compilation and SIMD acceleration
-
ascii-canvas
canvas for drawing lines and styled text and emitting to the terminal
-
mdbook-codeblocks
A mdbook preprocessor to prepend customizable vignette to code blocks
-
oyster-md
Static site generator for Markdown with bidirectional links and HTML rendering
-
asciidork-backend
Asciidork backend
-
lazy-grep
A high-performance, line-oriented command-line tool for searching text with regular expressions
-
asposepdf
Aspose.PDF for Rust via C++ is a powerful toolkit that allows developers to manipulate PDF files directly and helps do various tasks for PDF. Contains unique features for converting PDF to other formats.
-
opserve
Opinionated page server
-
mdbook-typst
An mdBook backend to output Typst markup, pdf, png, or svg
-
mdbook-svgbob
SvgBob mdbook preprocessor which swaps code-blocks with neat SVG
-
tagged-urn
Tagged URN - Flat tag-based identifier system
-
tokstream-cli
CLI token stream simulator using Hugging Face tokenizers
-
htmlsnob_rules
HTML validator, formatter and autofixer
-
constellate
Rust-powered CLI + live editor for curated markdown workspaces (requirements, docs, ADRs, audits, support, status-driven tasks) that build/serve/CRUD a themeable knowledge portal from a single binary
-
lindera-wasm
A morphological analysis library for WebAssembly
-
stam-python
STAM is a library for dealing with standoff annotations on text, this is the python binding
-
obsidian-export
associated CLI program to export an Obsidian vault to regular Markdown
-
esri_ascii_grid
reading ESRI Ascii Grid .asc files
-
uast
Unicode Aware Saṃskṛta Transliteration in Rust 🦀
-
minspan
a package for determining the minimum span of one vector within another
-
eloran
Comics and Ebook web library written in rust, with reading, search, reading status, bookmarks
-
annatto
Converts linguistic data formats based on the graphANNIS data model as intermediate representation and can apply consistency tests
-
koicore
core KoiLang module
-
bareun_rs
an unofficial Rust library for Bareun, a Korean morphological analyzer
-
mdvault
CLI tool for managing markdown vaults with structured notes, validation, and search
-
datafusion-spark
DataFusion expressions that emulate Apache Spark's behavior
-
pulumi_gestalt_core
Core Pulumi Gestalt implementation
-
instant-segment
Fast English word segmentation
-
bmfont_rs
Load/ save/ manipulate BMFont files
-
vi
An input method library for vietnamese IME
-
svgbob
Transform your ascii diagrams into happy little SVG
-
lipilekhika
A transliteration library for Indian Brahmic scripts
-
fuzzt
Implementations of string similarity metrics. Includes Hamming, Levenshtein, OSA, Damerau-Levenshtein, Jaro, Jaro-Winkler, and Sørensen-Dice.
-
asciisavers
A small collection of ascii screensavers
-
ripgrep_all
rga: ripgrep, but also search in PDFs, E-Books, Office documents, zip, tar.gz, etc
-
jsongrep
A JSONPath-inspired query language for JSON documents
-
linestats
Group similar text lines and compute numeric statistics
-
semantic-edit-mcp
MCP server for semantic code editing with tree-sitter
-
greppy-cli
Sub-millisecond semantic code search and trace with AI reranking (Claude/Gemini/Ollama)
-
precis-tools
Tools and parsers to generate PRECIS tables from the Unicode Character Database (UCD)
-
arborium-c
C grammar for arborium (tree-sitter bindings)
-
kanpyo
Japanese Morphological Analyzer
-
syslog_fmt
A formatter for the 5425 syslog protocol
-
simsearch
lightweight fuzzy search engine that works in memory, searching for similar strings (a pun here)
-
string_wizard
manipulate string like a wizard
-
monster-regex
A custom regex spec
-
embeddenator-workspace
Workspace management utilities for embeddenator development
-
dmos
Djot HTML renderer with advanced features
-
shaperglot
Test font files for OpenType language support
-
libdav
CalDAV and CardDAV client implementations
-
oxford_join
Join string slices with Oxford Commas!
-
gatekpr-patterns
Regex pattern registry and pre-built pattern sets for Shopify validation
-
hawkeye-fmt
The formatter library for hawkeye cli
-
derivre
A derivative-based regular expression engine
-
reconcile-text
Intelligent 3-way text merging with automated conflict resolution
-
shibuichi
zsh prompt preprocessor to add git integration
-
smart-patcher
Patcher based on rules
-
codefmt
a markdown code block formatter
-
latkerlo-jvotci
Tools for creating and decomposing Lojban lujvo
-
pipefog
Stream-structured data obfuscator for JSON/YAML/CSS
-
whatwg_streams
whatwg_streams for rust
-
latexmk-diff-head
LaTeX compilation tool that generates diff PDFs against Git commits
-
mandown
Markdown to groff (man page) converter
-
commitbot
A CLI assistant that generates commit and PR messages from your diffs using LLMs
-
no-crlf
A CLI tool to convert CRLF line endings to LF in text files
-
quickmd
Quickly preview a markdown file
-
richrs
port of the Rich Python library for beautiful terminal output
-
udataframe_rs
A pure Rust library for data frame operations, particularly useful for processing data extracted from PDF files or OCR recognition
-
runiq
An efficient way to filter duplicate lines from input, à la uniq
-
unicode-bidi-mirroring
Unicode Bidi Mirroring property detection
-
ferrous-opencc
A pure Rust implementation of Open Chinese Convert (OpenCC), for fast and reliable conversion between Traditional and Simplified Chinese
-
marqant
Quantum-compressed markdown format for AI consumption with 90% token reduction
-
ocr-rs
A lightweight and efficient OCR library based on PaddleOCR models, using the MNN inference framework for high-performance text detection and recognition
-
mdbook-catppuccin
🎊 Soothing pastel theme for mdBook
-
ultra-nlp
An NLP library
-
fuzzy-aho-corasick
Aho–Corasick automaton with fuzzy matching
-
ascii-fmt
CLI tool to fix and align ASCII diagrams generated by AI agents
-
Inscribe
A markdown preprocessor that executes code fences and embeds their output
-
dirgrab
CLI tool to concatenate file contents from directories, respecting Git context
-
zhconv-cli
Convert Traditional/Simplified Chinese and regional words of Taiwan/Hong Kong/mainland China/Singapore based on Wikipedia and OpenCC rulesets
-
noil
file explorer using text buffers
-
roe
Unicode case conversion
-
nu_plugin_emoji
a nushell plugin called emoji
-
erg_common
A common components library of Erg
-
lo_
A modern Rust utility library delivering modularity, performance & extras ported from JavaScript Lodash
-
fits-io
A pure-Rust FITS file handling library inspired by CFITSIO, focused on safety, clarity, and performance
-
ndg-commonmark
Flavored CommonMark processor for Nix-related projects, with support for CommonMark, GFM, and Nixpkgs extensions
-
pathmut
Command line utility for manipulating path strings
-
steer-tui
Terminal UI library for Steer
-
bear-query
A read-only Rust library for querying the Bear note-taking app's SQLite database with minimal interference
-
ADA_Standards
help you handle checks on your ADA projects, especially good to build scripts to check coding standards conformity
-
mdvalidate
Markdown schema validation engine
-
kmdbook
Creates a book from markdown files
-
hanja_hangul
that converts Chinese characters to Korean characters. That is, convert hanja to hangul
-
markdown-org-extract
CLI utility for extracting tasks from markdown files with Emacs Org-mode support
-
brk_string_wizard
manipulate string like a wizard
-
omry-search
Search-related abstractions for the Omry project
-
search-text
A fast and flexible command-line tool to recursively search for text or regex patterns in files under a directory
-
kreuzberg-cli
Command-line interface for Kreuzberg document intelligence
-
unidoc
Unite all Markdown
-
sara-core
Core library for Sara - Requirements Knowledge Graph CLI
-
crabular
A high-performance ASCII table library for Rust
-
etradeTaxReturnHelper
Parses etrade and revolut financial documents for transaction details (income, tax paid, cost basis) and compute total income and total tax paid according to chosen tax residency (currency)
-
ruma-events
Serializable types for the events in the Matrix specification
-
regect
A cli tool to quickly test regular expressions
-
zalo
A code highlighter giving the same output as VSCode
-
blitztext
fast keyword extraction and replacement in strings
-
yaml-include
A lib and a CLI for recursively parsing "!include" data in yaml files
-
turndown-cdp
Convert CDP-style DOM nodes to Markdown
-
simstring_rust
A native Rust implementation of the SimString algorithm
-
mdbook-linkcheck2
A backend for mdbook which will check your links for you
-
gibberish-or-not
Figure out if text is gibberish or not
-
ripsecrets
A command-line tool to prevent committing secret keys into your source code
-
asimov-imap-module
ASIMOV module for IMAP email import
-
substring
method for string types
-
escaping
configurable string escaping and unescaping
-
tu
CLI tool to convert a natural language date/time string to UTC
-
quagga
CLI tool that combines multiple text files into a single prompt suitable for Large Language Models
-
nanofts
High-performance full-text search engine in Rust
-
deeprl
DeepL client library with all the things (blocking)
-
zp
Copy the contents of the source file or the standard output buffer to the clipboard, with support for maintaining a history of copied content, allowing users to easily paste into another file or program
-
typwriter
compiling, formatting, and watching Typst documents, with PDF metadata and permission management
-
fdn
uniformly change file or directory names, with support for rolling back these operations
-
mdbook-angular
mdbook renderer to run angular code samples
-
pure-tui
A modern terminal-based word processor for Markdown and other structured text documents
-
repose-text
Text handling (wrappers around cosmic-text apis)
-
secular
No Diacr!
-
pdf_tables
Scrape text from tables in PDF files
-
gazenot
Gaze Not Into The Abyss, Lest You Become A Release Engineer
-
bbd
Binary Braille Dump
-
y-octo
High-performance and thread-safe CRDT implementation compatible with Yjs
-
go22dos
go to todos
-
strs_tools
Tools to manipulate strings
-
archive-pdf-urls
Extract all links from a PDF and archive the URLs in the Internet Archive's Wayback Machine
-
claude-kb-cli
CLI for generating, validating, and searching Markdown knowledge bases under a .claude/ hierarchy
-
recase
Changes the convention case of input text
-
quickctx
A bidirectional file content aggregator and extractor that converts between files and markdown-formatted representations
-
gllm
Pure Rust library for local embeddings, reranking, and text generation with MoE-optimized inference and aggressive performance tuning
-
wrap-ansi
A high-performance, Unicode-aware Rust library for intelligently wrapping text while preserving ANSI escape sequences, colors, styles, and hyperlinks
-
mdbook-pagetoc
A mdbook plugin that provides a table of contents for each page
-
mdbook-aquascope
Interactive Aquascope editor for your mdBook
-
vlazba
Lojban words generator and analyzer
-
string_pipeline
A flexible, template-driven string transformation pipeline for Rust
-
leptos-sync-components
Leptos components for synchronization UI
-
zeitgrep
Find frecent results in git repositories using regular expressions
-
chatpack-cli
CLI tool for parsing and converting chat exports into LLM-friendly formats
-
mdbook-theme
A preprocessor and a backend to configure themes for mdbook, especially creating a pagetoc on the right and setting full color themes from the official ace editor
-
rust_metrics
Incremental evaluation metrics for various machine learning pipelines
-
rsword_chirho
Core SWORD module library in pure Rust
-
ascii-img2-cli
ASCII image generation CLI
-
xi-unicode
Unicode utilities useful for text editing, including a line breaking iterator
-
furigana
Map furigana to a word given its reading
-
swc-plugin-inferno
SWC plugin for InfernoJS
-
capns
Core cap URN and definition system for FGND plugins
-
rwkv-tokenizer
A fast RWKV Tokenizer
-
vibequest
A vibe-coded scripting language focused on developer happiness with a REPL implementation in Rust
-
ox_content_search
Full-text search engine for Ox Content
-
forbidden-bands
8-bit string handling library
-
chinese_dictionary
A searchable Chinese / English dictionary with helpful utilities
-
twars-url2md
A powerful CLI tool that fetches web pages and converts them to clean Markdown format using Monolith for content extraction and htmd for conversion
-
ssfmt
Excel-compatible ECMA-376 number format codes
-
giallo-kak
Kakoune syntax highlighter using TextMate grammars
-
xim-ctext
compound text en/decoder
-
mdpdf
A fast CLI tool to convert Markdown files to PDF
-
rst
a reStructuredText parser and renderer for the command line
-
tsuki
Lua 5.4 ported to Rust
-
my-notes
note taking app for taking hierarchical notes in markdown
-
wikipedia_prosesize
Count Wikipedia prose size
-
twas
A text substitution application for using random look-up tables to generate text in a manner similar to the Mad Libs game
-
searchfox-lib
searchfox.org API access
-
textpod
Local, web-based notetaking app inspired by 'One Big Text File' idea
-
worf-launcher
Wayland application launcher inspired by wofi, rofi, and walker. Written in Rust with GTK4, supporting multiple modes (math, drun, file, ssh, run, emoji, search, auto), modern theming, and high performance.
-
kfst-rs
Fast and portable HFST-compatible finite-state transducers
-
mdbook-cmdrun
mdbook preprocessor to run arbitrary commands
-
shiba-preview
markdown previewer
-
mdbook-toc
mdbook preprocessor to add Table of Contents
-
kataru
YAML-based dialogue engine for game development
-
aki-gsub
substitute text command, replace via regex
-
linkup
Automatically add links to Markdown files
-
fonts
High-performance font parsing and analysis library for Grida Canvas
-
ipset_lookup
ipset is a command-line tool that takes networks or IPs and searches through a lot of different threat feeds quickly. It can also download the feed data necessary to perform the queries…
-
booky
analyze English text
-
rustkorean
processing Korean characters. It provides functionalities to check if a character is Korean, classify Korean characters, verify if a character is a leading consonant (choseong), a medial vowel (jungseong)…
-
newdoc
Generate pre-populated module files formatted with AsciiDoc that are used in Red Hat and Fedora documentation
-
artificial
Typed, provider-agnostic prompt-engineering SDK for Rust
-
safe-string
safe interface for interacting with multi-byte strings in Rust, namely IndexedStr, IndexedString, and IndexedSlice
-
casile
The command line interface to the CaSILE toolkit, a book publishing workflow employing SILE and other wizardry
-
mdbook-combiner
combine mdbook summaries from multiple source into one mdbook
-
float-pretty-print
Format f64 for showing to user, not for serialisation
-
tendril
Compact buffer/string type for zero-copy parsing
-
blocklet
A cross-platform CLI tool that generates ASCII art using Unicode block characters, similar to figlet but with beautiful solid Unicode blocks instead of outlines or hash symbols
-
ragit
git-like rag pipeline
-
dictator-frontmatter
Markdown frontmatter decree for Dictator structural linter
-
airs-memspec
Streamline AI-assisted development with Multi-Project Memory Bank management and GitHub Copilot integration
-
xml-3dm-cli
3DM XML Tree Differencing and Merging Tool CLI
-
facet-singularize
Fast, no-regex English singularization for the facet ecosystem
-
skimtoken
Fast token count estimation library
-
probly-search
A lightweight full-text search engine with a fully customizable scoring function
-
utf8_iter
Iterator by char over potentially-invalid UTF-8 in &[u8]
-
engish
A language utility for sampling and building words
-
aurora-semantic
Local embedded semantic search engine for source code, designed for IDE integration
-
easymark
Lightweight Markdown rendering utility that just works
-
shifter
A CLI tool to rotate lines from stdin by n positions
-
spellcode-cli
Minimal CLI frontend for codebook
-
ntcip
National Transportation Communications for ITS Protocol
-
browsing
Autonomous web browsing for AI agents - Rust implementation with MCP support
-
latex-thebib
Clean and sort legacy TeX bibliographies written using ‘thebibliography’ via the refactor sub-command. Compile BibTeX files to legacy thebibliography TeX code using the compile sub-command…
-
mdbook-callouts
mdBook preprocessor to add Obsidian Flavored Markdown's Callouts to your book
-
pretty-console
A fluent, zero-cost API for styling terminal text with colors and attributes
-
docbox-storage
Docbox storage layer abstraction
-
wchar
Procedural macros for compile time UTF-16 and UTF-32 wide strings
-
docx_mcp_rust
A Rust-based MCP (Model Context Protocol) server for creating and manipulating DOCX files
-
quant1x
Cross-language standard library for quantitative trading
-
mdbook-variables
mdBook preprocessor to resolve variables configured in book.toml
-
ggemtext
Glib-oriented Gemtext API
-
markex
Fast, non-validating markup element extractor (Tag Element, MdRef, MdCodeBlock, MDSection)
-
kdl-xml
XML<->KDL conversion
-
spanned
string processing with file/line/col information and the regular rust str API
-
capitalize
Change first character to upper case and the rest to lower case, and other common alternatives
-
readability
Port of arc90's readability project to rust
-
picodiff
Tiny GUI app to compare text easily
-
mdbook-wordcount
Word count for mdbook, inspired by the mdbook tutorial
-
flatpage
file system based markdown flat page
-
mdbook-d2
D2 diagram generator plugin for MdBook
-
bangumi-api
An api implementation for Bangumi website
-
dash-em
Enterprise-Grade Em-Dash Removal Library — SIMD-Accelerated String Processing
-
connected-papers
client for Connected Papers integrated with Semantic Scholar utilities
-
mdbook-pikchr
A mdbook preprocessor to render pikchr code blocks as images in your book
-
opusmeta
reading and writing metadata to opus files
-
minimo
terminal ui library combining a lot of things from here and there and making it slightly easier to play with
-
document_tree
reStructuredText’s DocumentTree representation
-
tphrase
A translatable phrase generator
-
dspy-rs
A DSPy rewrite (not port) to Rust
-
indent
Functions for indenting multiline strings
-
fast_symspell
Spelling correction & Fuzzy search
-
plsfix
Text cleaner upper
-
fontcull
Pure Rust font subsetting library
-
normalized-line-endings
Line endings normalizer
-
bibleref
Structures and functions for managing Bible references
-
threeway_merge
Git-style 3-way string merging using proven algorithms from libgit2/xdiff. Statically links xdiff (LGPL-2.1+).
-
async-utf8-decoder
Convert AsyncRead to incremental UTF8 string stream
-
mdriver
Streaming markdown printer for the terminal with syntax highlighting
-
mdbook-alerts
mdBook preprocessor to add GitHub Flavored Markdown's Alerts to your book
-
yosina
Japanese text transliteration library
-
distrs
PDF, CDF, and percent-point/quantile functions for the normal and Student’s t distributions
-
streplace
A tiny library for matching and replacing in strings and slices with user-defined functions
-
csvpretty
A command-line tool that formats CSV input into tables with Unicode box-drawing characters
-
hexout
A compact and dependency-free, flexible and customizable hex dump library for Rust that provides beautiful, configurable binary data visualization
-
fkys-rs
F*cking Kill Yourself lang interpreter written in Rust
-
mago-casing
converting strings between various common casing styles
-
mq-markdown
Markdown parsing and manipulation utilities for mq
-
ucd
Extends the char type to provide access to most fields of the UCD, Unicode Character Database, as of version 9.0.0. It aims to be compact, fast, and use minimal dependencies (only rust's core crate)…
-
fm
Non-backtracking fuzzy text matcher
-
easy_reader
easily navigating forward, backward or randomly through the lines of huge files
-
mdbook-exercises
An mdBook preprocessor for interactive exercises with hints, solutions, and test execution
-
rob_test_sagebox_integration_001
Internal test crate for validating Sagebox packaging and README rendering. Not intended for public use.
-
cron_clock
A cron expression parser and schedule explorer. Rich documentation and case studies and related upper-level libraries are available.
-
iregex
Intermediate representation for Regular Expressions
-
mantra-miner
your software recite mantras while it runs
-
mask-pii
A lightweight library to mask PII (Personally Identifiable Information) like emails and phone numbers
-
copyit
A cross-platform clipboard tool similar to pbcopy/pbpaste
-
mdless
A terminal-based markdown file viewer
-
zipcodes
Query US zipcodes without SQLite
-
srt2txt
Convert SRT subtitle files into clean plain text (strip timestamps, tags, merge lines)
-
atog
ascii to greek - prints greek letters given latin alphabets as input
-
tre-regex
Rust safe bindings to the TRE regex module
-
nu_plugin_regex
nu plugin to search text with regex
-
yore-cli
Fast document indexer for finding duplicates and searching content
-
mdbook-yml-header
mdBook preprocessor for removing yml header
-
nu-command
Nushell's built-in commands
-
runmunch
hunspell's unmunch tool for expanding dictionary words using affix files
-
unimorph
Command-line interface for UniMorph morphological data
-
mdbook-qr
An mdBook preprocessor that generates a QR code using fast_qr
-
twincat
ADS tools
-
mdbook-mermaid-ssr
mdbook preprocessor to add mermaid support with server-side rendering
-
opml
OPML library for Rust
-
runefix-core
Unicode character display width engine supporting CJK, emoji, and grapheme clusters
-
cosmic-text-tessera-fork
Pure Rust multi-line text handling
-
smt-str
working with SMT-LIB strings in Rust
-
syllabize-es
Syllabize Spanish text, and much more
-
vmks-exam-generator
CLI program for pseudo-randomly generating different variants of an embedded programming exam
-
md-ulb-pwrap
Markdown paragraph wrapper using Unicode Line Breaking Algorithm
-
rsonpath-lib
Blazing fast JSONPath query engine powered by SIMD. Core library of rsonpath.
-
creature_feature
Composable n-gram combinators that are ergonomic and bare-metal fast
-
mdbook-embedify
based mdbook preprocessor plugin that allows you to embed apps to your book, like youtube, codepen, giscus and many other apps
-
rmeow
A command line tool that aims to be a replacement for cat/bat with better highlighting
-
hemoglobin
Bloodless
-
newline_normalizer
Zero-copy newline normalization to \n or \r\n with SIMD acceleration
-
redactor
Secure PDF redaction library with Type3 font support using MuPDF
-
url_encor
A lightweight library to encode and decode special characters in urls
-
asimov-core
ASIMOV Software Development Kit (SDK) for Rust
-
mdbook-numeq
An mdbook preprocessor for automatically numbering centered equations
-
krilla-rxing
Render barcodes (QR Codes, Aztec, Data Matrix, etc) using rxing into a krilla Surface (PDF)
-
md-scatter
split up and reassemble markdown files
-
markon
A lightweight Markdown renderer with GitHub styling and Medium-like annotations
-
oak-pretty-print
Syntax highlighter supporting multiple programming languages
-
rustpress
A purely static blog program in Rust with incremental compilation, reverse pagination, and no backend
-
spacemod
An easy-to-understand and powerful text search-and-replace tool
-
mdbook-chess
An mdbook preprocessing plugin to generate chess boards
-
searcher_txt
A copy of grep that I made to show that im bad at rust
-
legalis-eu
European Union jurisdiction support for Legalis-RS (GDPR, Consumer Rights, Competition, Treaties)
-
phd
an esoteric gopher server
-
vidyut-prakriya
A Sanskrit word generator
-
autosurgeon
working with data in automerge documents
-
mdbook-curly-quotes
mdBook preprocessor that replaces straight quotes with curly quotes, except within code blocks or code spans
-
hanconv
Convert between Chinese character variants
-
zh_num
Convert ASCII numbers and zh words
-
lindera-unidic-builder
A Japanese morphological dictionary builder for UniDic
-
typst-count
Count words and characters in Typst documents
-
r2md
Entire codebase to single markdown or pdf file
-
htmd-cli
The command line tool for htmd
-
liblevenshtein
Levenshtein/Universal Automata for approximate string matching using various dictionary backends
-
hyli-registry
Hyli Registry - Upload and download ELF binaries
-
text-to-ascii-art
program to convert text to ASCII art
-
mdbook-footnote
mdbook preprocessor for footnotes
-
unicode-ccc
Unicode Canonical Combining Class detection
-
armnod
random string generator
-
genedex
A small and fast FM-Index implementation
-
teip
Masking tape to help commands "do one thing well"
-
cleanpy
Python 3.12+ best-practices knowledge base - MCP server
-
palpad
A really simple static site generator
-
nobom
remove UTF-8 BOM (Byte Order Mark) from stdin and write to stdout
-
byteutils
that provides a collection of frequently used utility functions for working with bytes, strings, and vectors. It includes common tasks such as converting between strings and byte arrays…
-
autotex
Continuously compile TeX and LaTeX
-
kashida
Insert Kashidas/Tatweel into Arabic text, e.g. for justification purposes.
-
lethe-core-rust
High-performance hybrid retrieval engine combining BM25 lexical search with vector similarity using z-score fusion. Features hero configuration for optimal parity with splade baseline…
-
case
A set of letter case string helpers
-
tesseract-rs
Rust bindings for Tesseract OCR with optional built-in compilation
-
repvar
A tiny CLI tool that replaces variables of the style ${KEY} in text with their respective value. It can also be used as a rust library
-
streamdown
A streaming markdown renderer for modern terminals (Rust port of Streamdown)
-
string-auto-indent
Normalizes multi-line string indentation while preserving platform-specific line endings
-
comically
fast manga & comic optimizer for e-readers
-
dnd-character
A Dungeons and Dragons character generator
-
glyphana
Quickly find, inspect & collect Unicode glyphs
-
rustdoc-md
Convert Rust documentation JSON into clean, organized Markdown files
-
lumin
searching and displaying local files
-
mono
Mono repository automation toolkit
-
mds
A skim-based *.md explore and surf note-taking tool
-
mdbook-replace
mdBook preprocessor that simply replaces text
-
large-text-core
Core library for handling large text files search and replace efficiently
-
hebrew_unicode_script
A low-level library designed to ascertain whether a character belongs to the Hebrew Unicode script. It supports checks for individual characters as well as for membership within collections
-
xhtml_parser
Non-validating XHTML Tree-based parser
-
unicode-matching
match Unicode open/close brackets
-
mdbook-git
Insert git commit files and diffs into mdbook
-
fasttext
binding
-
ttlint
Small, fast utility to lint text
-
simplematch
Fast wildcard pattern matching for strings and bytes with a simple api
-
indent_write
Write adapters to add line indentation
-
lingua-english-language-model
The English language model for Lingua, an accurate natural language detection library
-
latinga
High-performance, Zero-Copy Uzbek Cyrillic-Latin transliterator
-
kaff_sso
Small-buffer-optimized generic buffer and UTF-8 string type
-
slugrs
A fast, locale-aware slugify library for Rust
-
rust-port-kill
A cross-platform command-line tool to manage listening ports and processes
-
model2vec-rs
Official Rust Implementation of Model2Vec
-
rsxiv
Tools for working with arXiv and the arXiv API
-
recursive-file-loader
recursively load files via references in the files
-
oak-markdown
High-performance incremental Markdown parser for the oak ecosystem with flexible configuration, optimized for documentation and content creation
-
dprint-development
Helper functions for testing dprint plugins
-
tars-bin
A small, fast, static site generator
-
emoji-remover
A fast command-line tool to remove emojis from source code files
-
mdbook-numbering
A mdBook preprocessor that adds numbers to headings and code block lines (for mdbook 0.5.0 and above)
-
mdka
HTML to Markdown converter
-
zet
zet finds the union, intersection, set difference, etc of files considered as sets of lines
-
alphabet_detector
Natural language alphabet detection library
-
memchr-rs
Fast memchr and memchr2 implementations in Rust
-
anaso_site_api_models
API models for Ana.so
-
kaiba
domain library - Core types and interfaces for AI persona system
-
mdbook_fork4ls
Fork of mdBook for mdBook_LS
-
fast-slice-utils
Highly optimized slice utilities using SIMD instructions when available
-
deencode
Reverse engineer encoding errors
-
clima
A minimal Markdown reader in the terminal
-
ankit-builder
TOML-based Anki deck builder with .apkg generation and AnkiConnect import
-
neo4j_cypher
A flexible and intuitive query builder for Neo4j and Cypher
-
COXave
Instruments for codings
-
mdv
Terminal Markdown Viewer
-
jpreprocess
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
waterui-str
String utilities for WaterUI
-
madoru
markdown task runner
-
torudo
A terminal-based todo.txt viewer and manager with TUI interface
-
trpl
A support crate for The Rust Programming Language book
-
patiencediff
algorithm
-
genpdf
User-friendly PDF generator written in pure Rust
-
codesearch
A fast, intelligent CLI tool with multiple search modes (regex, fuzzy, semantic), code analysis, and dead code detection for popular programming languages
-
lil-tabby
A macro-based library for creating visually appealing tables with automatic column spanning
-
redact-ner
Named Entity Recognition for PII detection using ONNX Runtime
-
mini-calc
A Fully-Featured Configurable (mini) Rust Calculator
-
text-similarity-metrics
A high-performance Rust library for computing text similarity using multiple algorithms
-
xid
Globally unique sortable id generator. A Rust port of https://github.com/rs/xid.
-
dodo-zh
operate on pinyin and cedict.u8 file
-
darksouls3
Raw structures and bindings for From Software's title Dark Souls III
-
lister-cli
Lister: Navigate Markdown Lists
-
sk-skimmer
Fuzzy Finder in rust!
-
postcode_extractor
extract and identify postcodes
-
mdbook-findrep
mdBook find / replace preprocessor
-
codebook-lsp
A code-aware spell checker with language server implementation, installable via cargo install
-
steelkilt
Draft 0.4 RPG rule set
-
sketchir
Sketching primitives for IR: minhash/simhash/LSH-style signatures
-
agentai
designed to simplify the creation of AI agents
-
tibco_ems
A high level API for the Tibco EMS
-
langram
Natural language detection library
-
difference-rs
text diffing and assertion library
-
prompt-input
lightweight library for user input prompts in Rust, designed to make input handling straightforward
-
repr
The regular-expression-as-linear-logic interpretation and its implementation
-
zen-rs
generating non-interactive content like cards or files
-
json_to_table
pretty print JSON as a table
-
smoltok-core
Byte-Pair Encoding tokenizer implementation in Rust
-
mdbook-private
An mdbook preprocessor that controls visibility of private chapters and sections within them
-
caseless
Unicode caseless matching
-
datadriven
Rewritable table-driven testing
-
basic-text
Basic Text strings and I/O streams
-
extract-strings
Extract ascii strings from files
-
neofile
linux file command for windows. PATCH 4.
-
loc
Count lines of code (cloc) fast
-
printwell-cli
Command-line tool for HTML to PDF conversion
-
fontheight
Find out the vertical extents your font reaches on shaped words
-
fencecat
Walkdir cat with markdown fenced code output
-
matcher_py
A high-performance matcher designed to solve LOGICAL and TEXT VARIATIONS problems in word matching, implemented in Rust
-
mdbook-quiz
Interactive quizzes for your mdBook
-
crowbook-text-processing
some utility functions for escaping text (HTML/LaTeX) and formatting it according to typographic rules (smart quotes, ellipsis, French typographic rules)
-
ere
A compile-time alternative for POSIX extended regular expressions
-
mdbook-open-on-gh
mdbook preprocessor to add an open-on-github link on every page
-
lcat
lolcat in rust! Full unicode support, escapes for ANSI escape sequences, hue shift in Cubehelix
-
rust-regex-dsl
Regular expression DSL
-
anthropic-text-editor
A micro-CLI to apply tool calls from Anthropic for their text_editor_20250124 built-in computer use tool
-
md-formatter
A fast, opinionated Markdown formatter
-
pad
padding strings at runtime
-
transportations_library
A comprehensive Rust-based library implementing transportation engineering methodologies (e.g. the Highway Capacity Manual (HCM)) with Python bindings.
-
textcon
Template text files with file/directory references for AI/LLM consumption
-
rascii_art
Advanced ASCII Art Generator
-
awk-rs
A 100% POSIX-compatible AWK implementation in Rust
-
moonwave
generating documentation from comments in Lua source code
-
mdbook-bibtex
Adds support for bibliographies to mdbook. Backed by hayagriva, supporting all CSL citation styles.
-
aki-xtee
copy standard input to each files and standard output
-
tarzi
Rust-native lite search for AI applications
-
agentroot-mcp
Model Context Protocol server for agentroot - AI assistant integration
-
syntaxfmt
A derive macro-based library for flexible syntax tree formatting with pretty printing support
-
pinyin-parser
Parses a string of pinyin syllables. Covers marginal cases such as ẑ, ŋ and ê.
-
atomic-plus
type extensions for the atomic standard library
-
mdbook-driver
High-level library for running mdBook
-
terraphim_hooks
Unified hooks infrastructure for Terraphim AI - knowledge graph-based text replacement and validation
-
mdsh
Markdown shell pre-processor
-
matchr
A fast fuzzy matcher library written in Rust for use in CLI tools and TUI apps
-
pdf2pwg
Single purpose A4 page renderer rendering PDF using pdfium to PWG/URF
-
rust_string_utils
String utilities for rust based on org.apache.commons.lang3
-
adobe-cmap-parser
parse Adobe CMap files
-
tortilla
Somewhat syntax-aware text wrapping for source code and plain text documents
-
autofoam
related tools
-
caco3
common lib
-
yeslogic-fontconfig-sys
Raw bindings to Fontconfig without a vendored C library
-
wcount
CLI word counting tool
-
krafna
terminal-based alternative to Obsidian's Dataview plugin, allowing you to query your Markdown files using standard SQL syntax
-
dwg-core
Deterministic Writing Guard core analysis engine for spotting AI-styled prose
-
CompactPrefixRadix
a minimalistic but efficient radix tree implementation with extra prefix support
-
ucp-cli
Command-line interface for Unified Content Protocol
-
p2panda-discovery
Confidential topic and node discovery protocol
-
hemoglobin-search
Hemoglobin search utilities
-
hy-mt
A lightweight machine translation inference library for Tencent Hunyuan MT models
-
kiru
Fast text chunking for Rust
-
orgflow
managing documents with support for tasks and notes
-
pangu
Paranoid text spacing for good readability, to automatically insert whitespace between CJK (Chinese, Japanese, Korean) and half-width characters (alphabetical letters, numerical digits and symbols)
-
opentalk-types-signaling-legal-vote
Signaling types for the OpenTalk legal vote module
-
vibrato-rkyv
Vibrato: viterbi-based accelerated tokenizer with rkyv support for fast dictionary loading
-
jsrmx
command-line tool to manipulate JSON files. It can split large single-object JSON files into many files; merge multiple JSON files into one large JSON file; bundle multiple JSON files into one NDJSON file…
-
erebus
A CLI message generation library
-
iati-types
Core data types for IATI Activity v2.03: Activity, Transaction, Money, TxType, etc
-
gxter
A parsing library for creating and reading GTA 3/VC/SA GXT (text string) files
-
markdown-translator
A translation library with DeepLX API integration, rate limiting, and smart text chunking
-
text_trees
textual output for tree-like structures
-
bogrep
Full-text search for bookmarks from multiple browsers
-
mdbook-preprocessor-boilerplate
Boilerplate code for mdbook preprocessors
-
ansic
does ansi parsing in a dynamic DSL and at compile time for efficient and zero cost ansi styling
-
text_utils_s
Edit arrays, for example delete duplicates in an array. Clear strings
-
arxiv-cli
CLI to download papers from arXiv
-
sff
SemanticFileFinder (sff): Fast semantic file finder using sentence embeddings. Searches .txt, .md, .mdx files.
-
mdbook-tocjs
A mdbook preprocessor which adds extra js and css file for ToC hydration
-
ucfirst
Uppercase the first letter of a string
-
human_regex
A regex library for humans
-
docloom
Programmatically compose documents and render them to Markdown or styled terminal output
-
mdbook-termlink
mdBook preprocessor that auto-links glossary terms throughout documentation
-
tetratto-shared
Shared stuff for Tetratto
-
crabular-cli
A CLI tool for generating ASCII tables
-
crlf-to-lf-inplace
Fast in-place CRLF to LF line ending conversion for Rust strings. Uses memchr for good performance without custom SIMD.
-
rexpaint
functionality for reading and writing .xp files of the Grid Sage Games REXPaint ASCII art editor
-
likewise
A diff library for Rust (fork of similar)
-
unicode-display-width
Unicode 15.1.0 compliant utility for determining the number of columns required to display an arbitrary string
-
opencc-fmmseg
High-performance OpenCC-based Chinese conversion using FMM (Forward Maximum Matching) segmentation
-
encoding-next
Character encoding support for Rust
-
omnix-common
Common functionality for omnix frontends
-
blogr-cli
A CLI static site generator for blogs
-
diagnostic
Pretty diagnostic report
-
grapheme_machine
Grapheme cluster text segmentation (UAX #29) state machine for streaming input
-
mdbookkit
Support library for mdBook preprocessors in the mdbookkit project
-
tbll
tbll outputs data in tabular format
-
serpscraper
A CLI tool to fetch and convert search results into Markdown
-
xifetch
neofetch alternative written in Rust
-
mdref
Markdown Reference finding and migration tool
-
tiny_pretty
Tiny implementation of Wadler-style pretty printer
-
luciferous-case-converter
A CLI tool to convert text between different cases
-
saytify
greeting and farewell messages
-
ncp-matcher
plug and play high performance fuzzy matcher
-
seshat-unicode
A Unicode Library for Rust. Unicode 16.0.0 ready. XID_Start and XID_Continue are also available.
-
utf64
encode utf-8 strings into utf-64, and decode them back
-
unicode-language
detect language coverage given a list of codepoints
-
pups_text
Pretty Understandable Parsers
-
string-patterns
Makes it easier to work with common string patterns and regular expressions in Rust, adding convenient regex match and replace methods (pattern_match and pattern_replace) to the standard…
-
memvid-ask-model
LLM inference module for Memvid Q&A with local and cloud model support
-
hangeul_jamo_rs
A high-performance Korean Hangul syllable and jamo manipulation library. Includes Python bindings.
-
caseify
A CLI tool to convert strings between different cases
-
moobi
A random password generation CLI tool
-
goose-eggs
in writing Goose load tests
-
sleipnir
Memory safe font operations for Google Fonts
-
scrunch
full-text-searching compression
-
mdsql
SQL queries for markdown tables
-
spel-right
A fast and lightweight spell checker and suggester
-
rascii_art_img
Advanced ASCII Art Generator. Fork for imgii.
-
hayro-syntax
A low-level crate for reading PDF files
-
typed-oid
Typed Object IDs
-
tree-sitter-stack-graphs-typescript
Stack graphs definition for TypeScript & TSX using tree-sitter-typescript
-
wdl-lint
Lint rules for Workflow Description Language (WDL) documents
-
ttf_word_wrap
Wraps text based on character width
-
cesu8
Convert to and from CESU-8 encoding (similar to UTF-8)
-
mdtablefix
mdtablefix unb0rks and reflows Markdown tables so that each column has a uniform width. When the --wrap option is used, it also wraps paragraphs and list items to 80 columns.
-
ucp-llm
LLM-focused utilities for the Unified Content Protocol
-
ungoliant
The pipeline for the OSCAR corpus
-
doxx
Terminal document viewer for .docx files
-
markov_strings
A simplistic Markov chain text generator
-
uv-requirements-txt
internal component crate of uv
-
ttf2woff2
A Pure Rust library and CLI for compressing TTF fonts to WOFF2 format
-
maybe-regex
Wrapper for strings that may be either a regex or a plain-text string
-
pdfgen
PDF rendering library
-
cai
User friendly CLI tool for AI tasks
-
llm-utl
Convert code repositories into LLM-friendly prompts with smart chunking and filtering
-
stenotype
Machine stenography primitives
-
spyne
All-in-one toolkit
-
filenamify
Convert a string to a valid filename
-
zeroten-denote
Handle denote name scheme
-
harfbuzz_rs
A high-level interface to HarfBuzz, exposing its most important functionality in a safe manner using Rust
-
vectradb-chunkers
Chunking utilities for VectraDB in Rust
-
gazetta-render-ext
A static site generator framework. Extra render code.
-
anycase
a case conversion library for Rust
-
colonnade
format tabular data for display
-
mdlynx
Small, fast utility to find broken file links in Markdown documents
-
bwrap
A fast, lightweight, embedded systems-friendly library for wrapping text
-
spellchk
A blazingly fast spellchecker CLI for any text file
-
parserst
A recursive-descent reST parser and renderer
-
constr
Constant string generics
-
csep
Cosine Similarity Embeddings Print
-
e9571_str_lib
string processing, sorting, counting, and hashing
-
adc-lang
Array-oriented reimagining of dc, a terse RPN esolang
-
pandoc_types
Rust port of pandoc-types
-
chord3
Create pdf songbooks from chopro source
-
onig-regset
Rust-Onig is a set of Rust bindings for the Oniguruma regular expression library. Oniguruma is a modern regex library with support for multiple character encodings and regex syntaxes.
-
lexicmp
comparing and sorting strings lexicographically and naturally
-
shoco
port to Rust, a fast compressor for short strings
-
pragmatic-segmenter
Rust port of pySBD v3.1.0
-
rewrite
Safely rewrite file contents from stdin, even when file is open as an input
-
rust-ai
A collection of 3rd-party AI APIs for Rust
-
mdbook-tabs
mdBook plugin for rendering content in tabs
-
mudssky_utils
A comprehensive Rust utility library providing common functionality for everyday programming tasks
-
seam
Symbolic Expressions As Markup
-
sedregex
Sed-like regex library
-
mdbook-environment
A preprocessor for MdBook for working with environment variables
-
doryen-rs
Pure rust OpenGL accelerated roguelike console API with native/wasm support
-
polished_scancodes
handling and mapping keyboard scancodes in Rust
-
himmelblau_red_asn1
A little library to encode/decode ASN1 DER
-
rust-persian-tools
Official Rust implementation of Persian Tools
-
madskills
The toolchain for madskilling: lint, format, and wrangle Agent Skills like you mean it
-
koto_test_utils
Testing utilities for the Koto programming language
-
mut-str
A toolkit for working with mutable string slices (&mut str)
-
git2prompt
command-line tool that takes a GitHub repository URL, downloads its contents, and generates a single text file optimized for use as input to AI tools
-
sourcefile
Retain mapping information when concatenating source files, to make error messages more useful
-
soon
Predict your next shell command based on history — like shell autocomplete, but MORE stupid
-
mecrab
A high-performance, thread-safe morphological analyzer compatible with MeCab, written in pure Rust
-
bullet_stream
Bulletproof printing for bullet point text
-
emojicon
Find emoji by using emoticons and GitHub's and Bengali emoji names
-
tergo-formatter
Formatter for tergo
-
slack-blocks-render
Slack blocks render is a Rust library to render Slack blocks as Markdown
-
uresamp
HIFI uresamp delivers ultrasonic-fidelity text resampling via adaptive 64-bit floating-point spectral mapping, preserving Unicode 32-bit codepoint integrity with zero-phase distortion
-
codebase-to-prompt
bundling text files like code to single file
-
duca
Search and read Dante's Divine Comedy from your terminal
-
ponsic-winsafe
The dependency of the ponsic crate
-
mdbook_header_footer
mdBook preprocessor to prepend header and append footer to certain chapters
-
soft-ascii-string
char/str/string wrappers which add a "is-ascii" soft constraint
-
ticker-sniffer
extracting multiple stock ticker symbols from a text document
-
colourss
parsing CSS color strings into RGB values
-
ystd
An opinionated and batteries-included std mirror for convenient, correct code and pleasant error messages
-
paperless-api-client
Paperless-ngx API client
-
invisible-characters
A list of invisible characters
-
google-book-scraper
downloading the contents of books hosted on books.google.com for offline viewing
-
langextract-rust
extracting structured and grounded information from text using LLMs
-
rosetta-aisp
Bidirectional prose ↔ AISP symbolic notation conversion based on the Rosetta Stone mappings
-
illuminate-string
A comprehensive Rust library for advanced string manipulation and processing
-
java_string
Java strings, tolerant of invalid UTF-16 encoding
-
izihawa-tantivy-bitpacker
Tantivy-sub crate: bitpacking
-
laser-pdf
programmatic PDF generation with precise, predictable layout control
-
qmd
Lightweight SOTA local search engine for AI agents in Rust
-
metatron
core library
-
stringzz
strings and opcodes extraction from various file formats
-
bm25-vectorizer
A minimal Rust library for creating sparse vector representations (embeddings) using the BM25 algorithm for information retrieval
-
blame-rs
Line-by-line authorship tracking for revisioned text
-
yamake
yet another make tool
-
monochora
gif to ascii art converter written in rust
-
nanohtml2text
A zero-dependency library to convert HTML to plain text
-
mupdf-basic-text-extractor
Basic structured text extraction using mupdf-rs
-
string-offsets
Converts string offsets between UTF-8 bytes, UTF-16 code units, Unicode code points, and lines
-
pukram2html
converting Pukram-formatted text to HTML
-
cin
that simplifies command-line input in Rust, especially mimicking C++-style input
-
gitbook2text
A CLI tool to download GitBook pages and convert them to markdown and text
-
glu
Unpacks a Markdown document full of code snippets into a temp directory
-
viddy
A modern watch command
-
proper-sort
Small crate for natural sorting of strings that include number and size data
-
stringmatch
Allow the use of regular expressions or strings wherever you need string comparison
-
latentdirichletallocation
Latent Dirichlet Allocation (LDA) using collapsed Gibbs sampling for topic modeling
-
css_lexer
A spec-compliant CSS tokenizer with zero-copy cursors and optional feature gates
-
folderwalk
Folder walking tool
-
quot
A fast and flexible command-line tool that converts text input into escaped string literals
-
mdtrans
Markdown parser and transformer using pest.rs, focused on flexibility to a project’s needs
-
grapheme-cluster-utils
Unicode-safe String manipulation utilities by extended grapheme cluster (UAX #29)
-
ansimake
Quickly convert pixel images of ANSI art created with AI to actual ANSI art
-
mq-hir
High-level Internal Representation (HIR) for mq query language
-
linkcheck2
extracting and validating links
-
markdown-readtime
estimate reading time for Markdown content
-
inflector-plus
Adds String based inflections for Rust. Snake, kebab, camel, word, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize…
-
ohos-ime
Bindings to the inputmethod API of OpenHarmony
-
pandoc
API that wraps calls to the pandoc 2.x executable
-
array_tool
Helper methods for processing collections
-
sigye-background
Background animations for sigye terminal clock
-
wkhtmlapp
Convert html to pdf or image
-
tantivy-stemmers
A collection of Tantivy stemmer tokenizers
-
lib_gerber_edit
A library for manipulating extended Gerber data (RS-274X)
-
dm2xcod
DOCX to Markdown converter written in Rust
-
anda_nexus
Anda -- an AI Nexus powered by the Knowledge Interaction Protocol (KIP)
-
gemini-map
A command-line tool to run files in parallel through Google Gemini
-
nerdfetch-rs
A minimal, fast system fetch tool - rewritten in Rust for speed, clarity, and configurability
-
unicode-writing-script
The package provides detection of writing scripts from Unicode codepoints
-
bump-bin
Increments version with semver specification
-
ansi-align
Text alignment library with ANSI escape sequence and Unicode support
-
sigrs
Interactive grep (for streaming)
-
ps-str
String transcoding library
-
drova_sdk
Sdk for absolute converter of formats for dalet
-
mdbook-cat-prep
a preprocessor for mdbook which provides teacher, subject, material and tag functionality
-
egui_text_animation
text animation using egui
-
codebase-to-markdown
convert codebase to markdown format
-
zen-expression
Zen Expression Language
-
gh-emoji
Convert :emoji: to Unicode using GitHub’s emoji names
-
unicode-intervals
Search for Unicode code points intervals by including/excluding categories, ranges, and custom characters sets
-
compression-prompt
Fast statistical compression for LLM prompts - 50% token reduction with 91% quality retention
-
good-pc
Just a silly little tool to pet your pc and display some ASCII art
-
nonsense
Lorem ipsum placeholder text generator with clipboard integration
-
wikidot-normalize
provide Wikidot-compatible string normalization
-
loki_text
advanced string manipulation with pattern searching and replacement capabilities
-
rlex
A cursor-based, utf-8 Vec<char> lexer
-
inline_flexstr
copy/clone-efficient inline string type for Rust
-
md_check
Check markdown link validity
-
unsafe-tools-mimic
Size and alignment matched opaque types
-
rspack_plugin_copy
rspack copy plugin
-
xerg
Ultra-fast grep implementation in Rust - built for maximum speed with direct output and parallel processing
-
unicount
Alphabetic counter supporting unicode
-
zz-data
Data structures for Zanzarah apis
-
catfx
Apply effects to a text stream
-
gremlh
A CLI tool to find and fix invisible 'gremlin' characters (homoglyphs, zero-width spaces, Bidi overrides) in source code
-
sbom-diff
diff engine and cli for sbom comparison
-
ya
YAML parser in pure Rust
-
ident_case
applying case rules to Rust identifiers
-
case_insensitive_hashmap
A HashMap that uses case-insensitive strings as keys
-
mdbook-selfpath
A preprocessor for mdbook that allows self-referencing paths
-
regex-specificity
A heuristic-based crate to calculate the specificity of a regular expression pattern against a specific string
-
pspp
Statistical analysis software
-
husk-lexer
Lexer for the Husk programming language
-
trump
Spawn a background thread that prints a Donald Trump quote every 10 seconds
-
snips
Keep code snippets in markdown files in sync
-
nova-cite
Smart citation management with CrossRef/Zotero integration
-
n_gram
training n-gram language models
-
deucalion
High-performance Windows library for capturing decoded FFXIV packets
-
cosmic-text
Pure Rust multi-line text handling
-
yy1
Tiny utility to convert KiCad centroid files into Neoden YY1 pick and place machine format
-
csml_interpreter
The CSML Interpreter is the official interpreter for the CSML programming language, a DSL designed to make it extremely easy to create rich and powerful chatbots
-
pager2
pipe your output through an external pager
-
custard
A frontmatter-querying server
-
acroform
High-level PDF form manipulation library using lopdf
-
analiticcl
approximate string matching or fuzzy-matching system that can be used to find variants for spelling correction or text normalisation
-
pink_accents
Replacement of patterns in string to simulate speech accents
-
regextrie
High-performance Rust library designed for matching a large number of regular expressions against a given input string
-
rfgrep
Advanced recursive file grep utility with comprehensive file type classification - search, list, and analyze 153+ file formats with intelligent filtering and safety policies
-
mdbook-langtabs
An mdbook preprocessor that adds language tabs for code blocks
-
mdbook-blox
A preprocessor for mdbook to add figures and admonishment-like blox
-
none-shall-pass
Artifact for GitHub Action to validate hyperlinks in all markdown files
-
gspell
Rust bindings for gspell
-
bfom-lib
Brendan's Flavor of Markdown: I'll build my own markdown format, what could go wrong?
-
antex
Styled text and tree in terminal
-
readability-js
wrapper for Mozilla's Readability.js library
-
plot3d
reading, writing, and manipulating NASA PLOT3D structured grids
-
nib
static site generator
-
devek
CLI for copying HTML to clipboard
-
yara-x-fmt
A code-formatting library for YARA rules
-
luau-lexer
A lexer for the luau language
-
mdbook-last-changed
mdbook preprocessor to add the last modification date per page
-
herring-automata
Automata construction for Herring
-
streampager
pager for command output or large files
-
input_py
you can use input like python3
-
frawk
an efficient Awk-like language
-
undoc-cli
CLI for undoc - Microsoft Office document extraction
-
stam-tools
Command-line tools for working with stand-off annotations on text (STAM)
-
regex-charclass
Manipulate and convert regex character classes
-
memkb
A local AI in memory RAG for markdown files in a folder exposed as MCP
-
maddi-recipe
parsing and scaling markdown recipes
-
zepub-mini
Minimal crate for writing epubs (in-memory)
-
docbox-secrets
Docbox secret management abstraction
-
ucm-engine
Transformation engine for the Unified Content Model
-
chonkie
🦛 Chonkie, now in Rust 🦀: No-nonsense, ultra-fast, ultra-light chunking library
-
like
A SQL like style pattern matching
-
case_clause
case clause macro for rust
-
kathoey
text feminization using open corpus linguistics data
-
md2pdf-rs
A CLI tool to convert Markdown to PDF using Typst
-
asimov-brightdata-module
ASIMOV module for data import powered by the Bright Data web data platform
-
rheo
A typesetting and static site engine based on Typst
-
mdbook
Creates a book from markdown files
-
tdoc
assorted CLI tools for working with FTML (Formatted Text Markup Language) documents
-
slugi
slugify text (e.g., 'Café au lait' → 'cafe_au_lait')
-
kiroku-tui
terminal-based personal journaling and note-taking tool
-
awful_news_vibes
Daily news meta-analysis pipeline with AI-powered clustering and D3 visualizations
-
matchy-paraglob
Glob pattern matching with Aho-Corasick for matchy (internal)
-
charx
A replacement for char::is_ascii*
-
rtl_isolate_fix
Plain-text RTL/LTR technical-token isolation using Unicode bidi isolates (LRI/PDI), with URL/email exclusions
-
ruchydbg
ML-powered debugger for Ruchy with SBFL fault localization
-
lformat
Clone of Lua string.format in Rust based on C s(n)printf
-
learner
learning stuff
-
semantic-query
AI-powered schema validation with automatic JSON generation for type-safe responses
-
libappindicator-zbus
zbus implementation for libappindicator
-
ezemoji
Categorized emojis
-
edit-without-waiting
Open a file in the default text editor
-
utf16_iter
Iterator by char over potentially-invalid UTF-16 in &[u16]
-
kd-rust
A crystal clear command-line dictionary
-
text-editing
string with utilities for editing
-
mdbook-numthm
An mdbook preprocessor for automatically numbering theorems, lemmas, etc
-
broken-md-links
A command-line tool and library to detect broken links in Markdown files
-
opentalk-types-signaling-meeting-notes
Signaling types for the OpenTalk meeting-notes module
-
retrogress
Progress bars with a thin API around complex features
-
chunk
The fastest semantic text chunking library — up to 1TB/s chunking throughput
-
okh-scraper
A scraper of Open Source Hardware (OSH) projects, based on the Open Know-How (OKH) standard
-
xarray
version of the XArray with copy-on-write capabilities
-
topiary-cli
CLI app for Topiary, the universal code formatter
-
koruma-collection
A collection of common validators using koruma
-
hangeul
Korean alphabet manipulation library
-
prettyless
Wadler-style pretty-printing combinators in Rust with more features
-
domrs
Document builder and serializer
-
tracery
Text-expansion library
-
bpetok
CLI for tokenizing text input using Byte Pair Encoding (BPE)
-
companies-house-api
consuming various APIs from Companies House
-
flxy
Full-text searching and scoring of strings
-
flo_rope
An attributed and streaming implementation of the rope data structure
-
ascii_table_rs
Elegant ASCII table renderer for Rust CLI and terminal apps
-
legalis-in
India jurisdiction support for Legalis-RS - comprehensive modeling of Indian law
-
inlet_manifold
A general purpose highlighting library
-
eddie
Fast and well-tested implementations of edit distance/string similarity metrics: Levenshtein, Damerau-Levenshtein, Hamming, Jaro, and Jaro-Winkler
-
shift_or_euc
Detects among the Japanese legacy encodings
-
pdfvec
High-performance PDF text extraction library for vectorization pipelines
-
ik-rs
Chinese word segmentation, ik-analyzer for Rust
-
lumis
Syntax highlighter powered by Tree-sitter and Neovim themes
-
skyspell_kak
skyspell - kakoune integration
-
mdbook-godbolt
A preprocessor for mdbook to add runnable code snippets via Godbolt
-
hyphertool
Hypertool is a command-line tool for syllabification and hyphenisation
-
lemmeknow
Identify any mysterious text or analyze strings from a file
-
cli-boxes
Unicode box drawing characters for creating beautiful CLI interfaces
-
sansaccent
Converts French strings into URL-friendly slugs by removing accents and special characters
-
vds
Visibly distinguishable string types for identifiers and codes
-
santoka
Translations of 668 of Taneda Santoka's free-verse haiku
-
nsys-curses-utils
Rust *curses utilities
-
repoyank
Interactively traverse your repository, select files/directories, and quickly prepare structured snippets for LLM interactions
-
yuuang-test-napi
N-API bindings
-
jawk
JSON AWK
-
utf58
High-tech encoding of the Unicode space in one quibble and up to 3 bytes
-
pomsky-bin
Compile pomsky expressions, a new regular expression language
-
rsticle
Treat source files as articles / narrative documentation
-
unescape
Unescapes strings with escape sequences written out as literal characters
-
univiz
A command-line tool for analyzing Unicode strings, providing detailed information about graphemes, code points, and UTF-8 byte sequences
-
speedreader
A command-line speed reading tool
-
crate2bib
Create BibLaTeX entries for crates hosted on crates.io
-
mdbook-hints
mdBook preprocessor to add hover hints to your book
-
fast-str
A flexible, easy-to-use, immutable, efficient String replacement for Rust
-
enum-ts
TypeScript Enum pattern matcher codegen
-
vibrato
viterbi-based accelerated tokenizer
-
glk
Bindings for the Glk I/O interface for hosting interactive fiction interpreters
-
utf16_lit
macro_rules to make utf-16 literals
-
ferret
A trigram-based tool for detecting similarity in groups of text documents or program code
-
reason-shell
Reason: A Shell for Research Papers
-
twitter_text_config
Configuration for twitter-text in Rust
-
typing_engine
A typing game engine for Japanese and English
-
snailquote
Escape and unescape strings with shell-inspired quoting
-
trailfix
Trim trailing whitespace and ensure single newline at EOF
-
brk_rolldown_plugin_vite_html
brk fork of rolldown_plugin_vite_html
-
gaze
small utility library with the goal of making it easier to scan/lex text and collections
-
aozora2
Aozora Bunko format converter CLI
-
matrix-ui-serializable
Opinionated abstraction of the matrix-sdk crate with serializable structs
-
ebg
Eric's Blog Generator, a simple static site generator
-
hexxy
all over the place!
-
rzozowski
A regex crate using Brzozowski derivatives
-
numeronym-cli
A CLI tool that generates numeronym abbreviations from text
-
mdbook-jupyter
enables using jupyter notebooks in mdbook
-
shell2batch
Converts simple basic shell scripts to Windows batch scripts
-
wetext-rs
Text normalization library for TTS, Rust implementation of WeText
-
modeling
tools to analyze different languages using Ctags
-
xgrammar-rs
Efficient, Flexible and Portable Structured Generation for Rust - Rust bindings for XGrammar
-
cn-font-split
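A groundbreaking font-splitting tool for CJK and any other characters! Supports multithreaded splitting of otf, ttf, and woff2 fonts, with fine-grained control over bundle size. A revolutionary font subsetter that supports CJK and any characters…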
-
sphinx-rustdocgen
Executable to extract rustdoc comments for Sphinx
-
ucf
A universal code formatter
-
rivus-core
rivus tools
-
colored_text
adding colors and styles to terminal text
-
files-to-text-cli
Files to text cli
-
sortuniq
Find or count unique values in an input stream
-
mdbook-typst-pdf
mdbook typst pdf backend
-
rawgrep
Grep at the speed of raw disk
-
streamdown-plugin
Plugin system for streamdown extensibility
-
upid
Universally Unique Prefixed Lexicographically Sortable Identifier
-
libphext
A rust-native implementation of phext
-
purlu
A full-text search engine
-
logappend
Execute child process, read from stdin and stderr, emit into files, truncate at given total content sizes
-
unicode-charname
functions for retrieving Unicode character name properties as described in Unicode Standard Annex #44
-
pretty_regex
🧶 Elegant and readable way of writing regular expressions
-
tag_index
searchable index over key-value tags
-
mind-tree
Organize your thoughts in a tree-like structure
-
beautiful-md
A CLI tool to format and beautify Markdown files
-
journey-cli
A CLI-based journal application with automatic timestamping, vault management, and Obsidian integration
-
bitutils2
A package of tools for bit manipulations, including bit indexing, bitfields, and a variation of regular expressions for binary data
-
minigrep_pack
A small clip clone of minigrep
-
kiters
timestamps, request IDs, and external IDs
-
fmd
Find Markdown files by metadata - Search by tags, frontmatter, and custom fields
-
enma
serving anime and manga information 📦
-
fast-cat
An ergonomic macro for efficient string concatenation with a single memory allocation
-
mongodb-gridfs
Mongo GridFS
-
easy-regex
Make long regular expressions read like pseudocode
-
cadi-scraper
CADI Scraper/Chunker utility for converting source code repos and file data into reusable CADI chunks
-
yara-x-parser
A parsing library for YARA rules
-
legalis-fr
French jurisdiction support for Legalis-RS (Code civil, Code de commerce, Code du travail)
-
qrcode2pdf
Render barcodes (QR Codes, Aztec, Data Matrix, etc) using rxing into a krilla Surface (PDF)
-
json-carver
Digital forensics tool that reads (carves) JSON strings from a dump. Think of it as a more accurate and faster replacement for the strings(1) utility.
-
eco
reasoning about breaking changes in Rust ecosystems
-
genpdfi
User-friendly PDF generator written in pure Rust
-
minigrep_wise
minigrep project from the rust book project
-
ps-hash
Generates 64-byte ascii hashes with 256 bits of security
-
markdown2json
Reads a markdown file or directory of markdown documents and emits a structured JSON
-
qmd-cli
CLI for qmd - lightweight SOTA local search engine for AI agents
-
afrim-translator
Manage the predication system of the afrim input method
-
diacritics
Remove diacritics from letters, for example when standardizing input for a search
-
pangu2
Paranoid text spacing for good readability, to automatically insert whitespace between CJK (Chinese, Japanese, Korean) and half-width characters (alphabetical letters, numerical digits and symbols)
-
harfshapedfa
Glue functionality between skrifa & harfrust
-
riimut
Transform latin letters to runes & vice versa
-
blinc_layout
Blinc layout engine - Flexbox layout powered by Taffy
-
scraps_libs
Scraps is a portable CLI knowledge hub for managing interconnected Markdown documentation with Wiki-link notation
-
sniffer-rs
that simplifies fuzzy string matching in rust
-
include-preprocessor
Tooling for C preprocessor style include directives
-
doxygen-bindgen
Converts Doxygen comments into Rustdoc markdown
-
rustkernel-orderbook
RustKernels Orderbook domain kernels
-
diffy-imara
Tools for finding and manipulating differences between files
-
linurgy
Manipulate the output of multiple newlines. Replace/Insert/Append newlines with text. Input and output from stdio/files/buffers
-
feedr
feature-rich terminal-based RSS/Atom feed reader written in Rust
-
renderdag
An ASCII or Unicode renderer for directed acyclic graphs
-
hexdump
Easy hexdump to stdout or as an iterator
-
clipboard-stream
Async stream of clipboard change events
-
emoji
Every emoji, their metadata, and localized annotations
-
vidyut-lipi
A Sanskrit transliterator
-
mdbook-glossary
mdBook preprocessor to generate glossary tables
-
typoglycemia
A function to convert text to typoglycemic format
-
block-list
A minimalist hosts-based tool for managing block lists and ad-blocking
-
convert_case_extras
Extra features for convert_case
-
csvpp
Compile csv++ source code to a target spreadsheet format
-
terraphim-repl
Offline-capable REPL for semantic knowledge graph search
-
ezstr
A String wrapper supporting negative indexing with grapheme indexing for slices and Regex::find_iter and Regex::find
-
mdbook-inline-highlighting
mdBook preprocessor that enables support for inline highlighting
-
colored-hexdump
Create beautifully colored hexdumps
-
cfd16-assembler
An assembler backend for the CFD-16 ISA
-
fontcull-font-types
Scalar types used in fonts. (Vendored fork for fontcull)
-
tengwar
Transliterate text into J.R.R. Tolkien's Tengwar.
-
aho-corasick
Fast multiple substring searching
-
vader_sentiment
Bindings for Rust from the original Python VaderSentiment analysis tool
-
finding
command line finding tool
-
zed-collections
Standard collection type re-exports used by Zed and GPUI
-
hlight
dedicated to delivering exceptional syntax highlighting capabilities
-
ripdoc
Query Rust docs and crate API from the command line
-
linebreak
breaking a given text into lines within a specified width
-
timeblok
A language for event scheduling in plain text
-
charname
Incredibly simple library that just gives you the Unicode name for a character
-
thediff
Difference between 2 files in percentages
-
ised
An interactive tool for find-and-replace across many files
-
mdbook-fs-summary
Summary generator for mdbook
-
tcalc-rustyline
A fork of Rustyline for use specifically with tcalc
-
basen
Convert binary data to ASCII with a variety of supported bases
-
bmo-to-md
Dump a Bugzilla bug to Markdown, with attachments unzipped
-
yeslogic-ucd-generate
A program for generating packed representations of the Unicode character database that can be efficiently searched with support for additional tables
-
subslay
Text → emoji 💅🏻 Powered by Rust
-
sonai_metrics
Text metrics for sonai
-
kreuzberg-tesseract
Rust bindings for Tesseract OCR with cross-compilation, C++17, and caching improvements
-
yozuk
Chatbot for Programmers
-
alass-util
convenience API for subtitle synchronization with alass-core
-
quillmark-typst
Typst backend for Quillmark
-
wrapr
wrap your code for ai
-
ox_content_parser
High-performance Markdown parser for Ox Content
-
sakurs-core
High-performance sentence boundary detection using Delta-Stack Monoid algorithm
-
cloc
Count, or compute differences of, lines of source code and comments
-
minigrep_xyz1
For finding strings in text
-
chaf
A CLI filter tool that removes lines based on logical conditions using AND, OR, NOT, and parentheses
-
mdi
markdown include
-
strval
Parse strings into values
-
bash-builtins
implement loadable builtins for bash
-
eml2md
Convert EML files to Markdown
-
yekdast
normalizing and cleaning up Persian (Farsi) text
-
create_broken_files
Create broken files from other ones
-
css_recess_order
Recess-based sort order for CSS properties
-
quranize
Encoding transliterations into Quran forms
-
csmlinterpreter
The CSML (Conversational Standard Meta Language) is a Domain-Specific Language developed for creating conversational experiences easily
-
thesaurus
An offline thesaurus library for Rust
-
json_keyquotes_convert
convert JSON from and to JSON without key-quotes
-
string-replace-all
String replacement utility inspired by JavaScript, allowing pattern-based substitutions with support for both exact matches and regex patterns
-
marisa-rs
Safe Rust wrapper for the marisa-trie C++ library
-
miku_best_friend_printer
Contains a MikuPrinter that can produce an ASCII art of Miku - your best friend - from the music video 'Monitoring (Best Friend Remix)' by DECO*27
-
retrofont
Retro terminal font toolkit: TDF/FIGlet parsing, rendering, conversion
-
expression_parser
minimal Rust parser and evaluator for arithmetic expressions with variables, built using the pest parsing library. It converts a text formula into an abstract syntax tree (AST) and…
-
docki
cli for building and publishing documentation using asciidoctor
-
markov
A generic markov chain implementation in Rust
-
notidium
Developer-focused, local-first note-taking with semantic search and MCP integration
-
asciidork-eval
Asciidork eval
-
ojcmp
online judge comparer
-
ucd-general-category-ranges
Unicode character ranges by general category
-
vectorscan-rs
Ergonomic bindings to the Vectorscan high-performance regex library
-
ersa_lsp_core
LSP core for the GPC Scripting language. Intended to be used as a library.
-
oak-regex
High-performance incremental regular expression parser for the oak ecosystem with flexible configuration, optimized for pattern matching and text processing
-
img2epub
Convert images to EPUB
-
dedoc
Terminal-based viewer for DevDocs documentation
-
mkweb
A markdown to html converter and server
-
diffutils
A CLI app for generating diff files
-
term_grid
formatting strings into a grid layout
-
mdbook-tiny
Use mdbook to generate tiny and fast static sites
-
sesters
💱 Fast, offline currency converter 💴 💷 💶 💵
-
md-to-incodoc
Convert markdown to incodoc
-
dala
Dalia is a lightweight formula language
-
substring-replace
developer-friendly methods to manipulate strings with character indices
-
unimorph-cli
Command-line interface for UniMorph morphological data
-
ilyvion-util
Collection of utility functions and types for use in my personal projects
-
aki-mcolor
mark up text with color
-
hebrew_unicode_utils
Some functions for processing Hebrew unicode characters
-
tmpltr
Template-based document generation CLI
-
typos-cli
Source Code Spelling Correction
-
tantivy-tokenizer-api
Tokenizer API of tantivy
-
flowmark
Fast, modern Markdown formatter with smart typography and paragraph wrapping
-
markdown-toc
Markdown Table of Contents generator
-
filecheck
writing tests for utilities that read text files and produce text output
-
markdown-extract
Extract sections of a markdown file
-
bpe-match
A pattern matching library for BPE tokenization, intended to replace regex-based approaches
-
rtranslate
dependency-free Rust wrapper for Google Translate public web API
-
lzy-codec
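A variable-length text encoding and decoding scheme that supports encoding and decoding Unicode. Its encoding/decoding efficiency and storage footprint outperform UTF-8 across the board, and it will replace UTF-8 in the future as the new universal encoding standard.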
-
pii
PII detection and anonymization with deterministic, capability-aware NLP pipelines
-
lex-analysis
Semantic analysis for the lex format
-
leptos_forge_ui_components
Components you can use in your leptos_forge application
-
srx
A mostly compliant Rust implementation of the Segmentation Rules eXchange (SRX) 2.0 standard for text segmentation
-
text-document
Text document structure and management
-
regexml
XPath compatible regex engine
-
heavy-pdf-parser
Extract text from PDF files with support for multiple output formats
-
askalono-cli
detect the contents of license files
-
fortune-rs
classic BSD fortune program
-
pdfcat
Concatenate PDF files into a single document
-
keep-a-changelog
generating and manipulating CHANGELOG.md files that use the Keep A Changelog format
-
rustpython-parser-vendored
RustPython parser vendored third-party crates
-
go-brrr
Token-efficient code analysis for LLMs - Rust implementation
-
penmanship
A Unicode character lookup library for converting text patterns to Unicode characters
-
lindera-ko-dic
A Korean morphological dictionary for Ko-Dic
-
ada-idna
Fast IDNA (Internationalized Domain Names in Applications) implementation in Rust
-
text-scatters
A cut-up technique generator from text and ebook files in the terminal
-
simple-xlsx-writer
memory efficient XLSX writer
-
ngram_rs
Facilitate creating ngrams in Rust to be used in the polars plugin
-
diamond-types
The world's fastest text CRDT
-
ipa-translate
translating between IPA and ASCII text
-
utf-64
The next-generation text encoding standard using 64 bits per character
-
mdbook-ocirun
mdbook preprocessor to run arbitrary commands and code snippets inside containers
-
spongebob
convert text to spongebob case a.k.a tHe MoCkInG sPoNgEbOb MeMe
-
tectonic
A modernized, complete, embeddable TeX/LaTeX engine. Tectonic is forked from the XeTeX extension to the classic “Web2C” implementation of TeX and uses the TeXLive distribution of support files.
-
mdbook-linkcheck
A backend for mdbook which will check your links for you
-
chatter
Translate a human-readable dialogue script into Rust
-
tergo-tokenizer
R language tokenizer
-
gpui_rn_util
structs and functions (fork for React Native GPUI)
-
turndown-core
Core Markdown AST and serialization for turndown
-
directwrite
A safe abstraction for interacting with DirectWrite, intended initially to be used with direct2d for easy text rendering
-
nstd_core
NSTD core crate
-
tectonic_bridge_core
Exposing core backend APIs to the Tectonic C/C++ code
-
tuicam
Terminal-based camera with switchable modes
-
iregex-automata
Finite automata definitions for the iregex crate
-
biometrics
provide the vitals of a process in the form of counters, gauges, moments, and T-digests
-
qpprint
console printing/formatting
-
magic-regexp
creating regular expressions with ease
-
filename-refactor
Command to refactor file names
-
lindera-ipadic-builder
A Japanese morphological dictionary builder for IPADIC
-
dingtalk
Robot util: send text/markdown/link messages using a DingTalk robot
-
regex_generate
Use regular expressions to generate text
-
htmls
parsing HTML and extracting HTML elements or text
-
mpdf
writing PDF files
-
toolstr
formatting data into Strings
-
nlpo3
Thai natural language processing library, with Python and Node bindings
-
symscan
Fast discovery of similar strings in bulk
-
skill-tree
generate graphviz files to show roadmaps
-
cgp-dispatch
Extensible data type dispatchers for CGP handlers
-
codump
A straightforward and flexible code/comment dump tool
-
re_view_text_log
A view that shows text entries in a table and scrolls with the active time
-
furze
finite state transducers (fst) written in Rust
-
byte_string
Wrapper types for outputting byte strings (b"Hello") using the Debug ({:?}) format
-
convert_string
A trait to convert Strings to safe non-keywords and/or convert a String's case (snake_case, PascalCase, ...)
-
pulldown-cmark-toc
Generate a table of contents from a Markdown document
-
mdplayscript
An extension of Markdown for play scripts
-
magic_string
magic string
-
mdbook-org
A preprocessor plugin for mdBook that enables org-mode file support
-
merge3
merge tool for three-way merges
-
is_printable
Determine whether a given text-based value is printable
-
hunspell-rs
Rust bindings to the Hunspell library
-
unicode-width-16
Determine displayed width of char and str types according to Unicode Standard Annex #11 rules
-
animated-emojis-rs
Noto Animated emojis
-
mdbook-lint-rulesets
Modular rulesets for mdbook-lint - standard and mdBook-specific linting rules
-
cro_stem
A lightning-fast, zero-dependency Croatian stemming library written in Rust
-
gigagei
random quote fetching console utility
-
human-names
A random generator for human first and last names
-
fx-mistral
leverage the Mistral API for OCR and data extraction from PDFs
-
grapheme
Abstractions for working with extended Unicode grapheme clusters
-
mailidator
A lightweight Rust library for checking email address misspellings
-
string-overlap
A helper crate for "layering" ASCII art
-
advent-ocr
Converts ASCII-art representations of letters generated by Advent of Code puzzles into a String containing those letters
-
cistring
A string type that preserves case but compares case-insensitively
-
loe
Very fast and yet another line ending (CRLF <-> LF) converter written in Rust
-
vtashkov-bf
Brainfuck interpreter
-
static-lang-word-lists
Runtime decompressed statically-included word lists
-
lindera-cc-cedict-builder
A Chinese morphological dictionary builder for CC-CEDICT
-
copier
A bidirectional file content aggregator and extractor that converts between files and markdown-formatted representations
-
subtitles
A command line programme to generate SRT format subtitles for use in video players
-
textwrap-macros
procedural macros to use textwrap utilities at compile time
-
human-time-cli
A command-line tool for converting time durations to human-readable formats, built using the human-time crate
-
mdbook-markdown
Markdown processing used in mdBook
-
prometheus-32bitfix
Prometheus instrumentation library for Rust applications
-
nlprule
A fast, low-resource Natural Language Processing and Error Correction library
-
text-parsing
Hierarchical text processing preserving char position info
-
boreal-cli
CLI utility to run boreal, a YARA rules engine
-
justcsv
CSV-file reader/writer
-
yara-x-cli
A command-line interface for YARA-X
-
random-zh
generating random Chinese characters
-
slugify-core
Fast, Unicode-aware slug generation library with multi-language bindings
-
mdbook-presentation-preprocessor
A preprocessor for utilizing an MDBook as slides for a presentation
-
opentalk-report-generation
OpenTalk report generation functionality
-
glow_glyph
A fast text renderer for glow, powered by glyph_brush
-
r-matrix
Rust port of cmatrix
-
webgrep
grep the web: a full-browser-spec search-focused ultra-simple way to read the web without having to leave the terminal
-
dirdocs
Generate terse, one-sentence documentation for files; plus Nushell-style directory viewers (dls, dtree) that surface .dirdocs.nu descriptions
-
gtk4cmark
A GTK4 widget for rendering CommonMark content
-
sayit
String replacements using regex
-
markdowndown
acquiring markdown from URLs with smart handling
-
see-cat
A cute cat(1)
-
did-toolkit
spec-compliant implementation of did-core, W3C's 'Decentralized Identity Documents'
-
readable-regex
Regex made for humans. Wrapper to build regexes in a verbose style.
-
gdnative-doc
Documentation tool for gdnative
-
matchy-literal-hash
O(1) exact string matching via memory-mapped hash tables (internal)
-
webdog
static site generator fit for a dog
-
scanlex
lexical scanner for parsing text into tokens
-
tagalyzer
A CLI tool to gather statistics on collections of plaintext-adjacent files
-
artificial-prompt
Fluent builders and helpers for composing markdown prompt fragments
-
bstr
A string type that is not required to be valid UTF-8
-
clarifai_grpc
The official Clarifai gRPC Rust client
-
savagestr
SAVAGE string encoder/decoder. If it can encode or decode using the specified code page or encoding name, it does so; otherwise it falls back to the savage way of encoding/decoding, using String::from_utf8_lossy()
-
duvet
A requirements traceability tool
-
fsays
flavored replacement for the classic cowsay
-
rjot
A minimalist, command-line jotting utility that's fast, private, and git-friendly
-
kotoba-formatter
Kotoba Code Formatter - Formats .kotoba files with consistent style
-
popgetter-py
Python bindings for popgetter
-
cli_boilerplate_automation
CLI Boilerplate Automation: Opinionated CLI Utilities for Basic Tasks
-
tiktokenx
A high-performance Rust implementation of OpenAI's tiktoken library
-
lightgrep
A fast, ergonomic grep-like tool in Rust
-
scan-rules
some macros for quickly parsing values out of text. Roughly speaking, it does the inverse of the print!/format! macros; or, in other words, a similar job to scanf from C.
-
pgf2json
Application Programming Interface to load and interpret grammars compiled in Portable Grammar Format (PGF). The PGF format is produced as a final output from the GF compiler. The library…
-
cglue-bindgen
cleanup cbindgen headers for CGlue
-
xi-rope
A generic rope data structure built on top of B-Trees
-
latex
An ergonomic library for programmatically generating LaTeX documents and reports
-
tpt
Pure Rust implementation of the Unix concatenate (cat), word-count (wc) and echo command
-
rustruut
Text-to-IPA converter and phonetic translator for Rust, powered by the Goruut phonemization engine
-
basic64
A straightforward implementation of base64
-
streaming_serde_json
streaming alternative to serde_json for processing JSON data sources that are too large to fit in memory
-
codetypo-dict
Source Code Spelling Correction
-
stego_wps
text-based steganography using the Words Per Sentence (WPS) method. It allows encoding and decoding hidden messages within the structure of a given text.
-
arabic_pdf_to_text
A CLI tool to convert Arabic PDFs to text using Google's Gemini API
-
uwurs
UwUify your strings with uwurs!
-
opstr
‘Operate on strings’ command line utility
-
supermarkdown
High-performance HTML to Markdown conversion for LLMs
-
recvmsg
Traits for receiving datagrams reliably, without truncation
-
sublime_fuzzy
Fuzzy matching algorithm based on Sublime Text's string search
-
mdbook-renderer
assist implementing an mdBook renderer
-
seams
High-throughput sentence extractor for Project Gutenberg texts with dialog-aware detection
-
fuse-rust
Fuse is a super lightweight library which provides a simple way to do fuzzy searching. Fuse-Rust is a port of Fuse-Swift, written purely in rust
-
pithy
Ultra-fast, spookily accurate text summarizer that works on any language
-
getch-rs
getch for Windows and Unix
-
ferritin-common
rustdoc navigation and search
-
lsp-ty
type definitions for LSP
-
next-plaid-cli
Semantic code search powered by ColBERT
-
rust_iso15924
ISO 15924, Codes for the representation of names of scripts, is an international standard defining codes for writing systems or scripts (a "set of graphic characters used for the written form of one or more languages")…
-
swift-check
High-performance, robust, and expressive searching and validation (uses SIMD on x86_64, aarch64, and WASM)
-
jira-mcp-server
AI-friendly JIRA MCP server with semantic search capabilities
-
viks
vim-like key crate
-
futf
Handling fragments of UTF-8
-
u8lit
Custom literal to convert strings to UTF-8 bytes
-
tectonic_engine_bibtex
The bibtex program as a reusable crate
-
ranked-searcher
Search inside text files using tf-idf formula, showing the most relevant search at the top
-
asimov-sdk
ASIMOV Software Development Kit (SDK) for Rust
-
quake_text
Utils for Quake strings and characters
-
neo-mime
Strongly Typed Mimes
-
unicode-vo
Unicode vertical orientation detection
-
lre
Rust bindings for the regular expression library from QuickJS
-
pandoc-ac
pandoc filter for converting acronym codes to LaTeX
-
poriborton
Interconversion between Unicode and various Bengali ANSI encodings
-
epub2mdbook
convert EPUB files to MDBook format
-
uapi-version
Compare versions according to the UAPI Version Format Specification
-
rsmorphy
Morphological analyzer / inflection engine for Russian and Ukrainian (soon) languages (WIP)
-
mdbook-translator
A translation preprocessor plugin for mdBook that automatically translates documents using the DeepSeek API
-
swc-formatjs-visitor
formatjs custom transform visitor for SWC
-
oyster-lib
Markdown processing library with querying, bidirectional links, and HTML rendering
-
mdbook-auto-gen-summary
A preprocessor and cli tool for mdbook to auto generate summary
-
vestalia
Vestaboard API wrapper
-
litsea
extremely compact word segmentation and model training tool implemented in Rust
-
vn-settings
Various settings intended to simulate visual novels
-
jira-clean
clean up Jira task description that is an output of jira-cli tool
-
findcar
An extensible CLI tool that filters, sorts, and aggregates search results from car websites. Because of course you want to search for cars from the command line.
-
rustmax-doctest
Doctest runner for rustmax crate examples
-
bilingual
A cmdline tool used for markdown translation via calling Chinese translation api cloud services
-
artificial-openai
OpenAI backend adapter for the Artificial prompt-engineering SDK
-
phaier_markdown
A markdown parser and renderer
-
sik
A fast and concurrent command-line tool for searching patterns in files
-
clipcount
Counting words from the clipboard content
-
rumage
framework for making simple markdown sites
-
justify
plaintext while handling Unicode gracefully
-
utf8-fix
Fix invalid UTF-8 sequences in-place while preserving buffer size - useful for fuzzing and mutation testing
-
taboc
A table of contents generator for markdown documents
-
mutf8
Basic support for mutf8 strings
-
safe-regex-compiler
Regex compiler for the safe-regex crate
-
onig_sys
The onig_sys crate contains raw Rust bindings to the Oniguruma library. This crate exposes a set of unsafe functions which can then be used by other crates to create safe wrappers around Oniguruma…
-
asoiaf-api
Rust wrapper for the api of ice and fire
-
ib-unicode
Unicode utils
-
choco
markup language for dialogue systems
-
doc_loader
A comprehensive toolkit for extracting and processing documentation from multiple file formats (PDF, TXT, JSON, CSV, DOCX) with Python bindings
-
tidyvcf
command-line tool to convert VCF files to tab/comma separated tables
-
pullup
Convert between markup formats
-
imperative
Check for imperative mood in text
-
grapheme-utils
Handy utils for working with utf-8 [unicode] Extended Grapheme Clusters
-
shellvetica
Bring your shell output to the web
-
docstr
Ergonomic multi-line string literals
-
pretty-xmlish
Pretty print XML-ish data with unicode art
-
minigrep_mm
minigrep app from rust-book with public lib methods
-
unicode_reader
Adaptors which wrap byte-oriented readers and yield the UTF-8 data as Unicode code points or grapheme clusters
-
coverio
Better code coverage reporting for Rust crates
-
ricecoder-research
Research and analysis utilities for code understanding
-
textgridde-rs
dealing with Praat TextGrid files. MIT licensed.
-
letter-sequence
A method to create sequence displayed as uppercase or lower letters, or digits
-
regex_ansi
ANSI escape sequence regular expression (Rust port of ansi-regex behavior)
-
bk-tree
A Rust BK-tree implementation
-
strloin
copy on write slices of a string
-
puppet-fmt
Automatic code formatter for puppet manifests
-
checkstream-policy
YAML-based policy engine for LLM safety rules, triggers, and actions
-
rexify
trait based matcher alternative to Regex
-
re2
Wrapper for the re2 C++ regex library
-
mullama
Comprehensive Rust bindings for llama.cpp with memory-safe API and advanced features
-
fusefiles
Concatenate a directory full of files into a single prompt for use with LLMs
-
pulldown-cmark-mdcat
Render pulldown-cmark events to TTY
-
base64-lib
A base64 encoder / decoder
-
text-tokenizer
Custom text tokenizer
-
niho
A command-line tool for converting romanized Japanese text to Japanese characters
-
ttpkit-utils
Various utilities for ttpkit
-
char-ranges
Iterate chars and their start and end byte positions
-
libharu
Rust binding for libharu(http://libharu.org/) PDF library
-
unveil-rs
Unveil Rs is a tool to create presentations from markdown files
-
maytrix-value
Core value types for Maytrix
-
emoji-search
Fast fuzzy emoji searcher and picker for the terminal
-
msbwt2
multi-string BWT query library
-
eggbug
Bot library for cohost.org
-
asciidoctor-client
A kludge to improve the performance of static site generators that use asciidoc through its cli
-
kl-hyphenate
Knuth-Liang hyphenation for a variety of languages
-
cargo-markdown
Local crates.io readme development server with ultra-fast hot reloading goodness
-
pinyin2ch
converting Chinese Pinyin to Chinese characters with various levels of detail
-
mdbook-scientific
Enables inline equations for mdbook to set by $..$ signs and $$..$$
-
icondata_zi
Zi icon set for the icondata family of crates
-
ib-pinyin
A high-performance pinyin search and matching library
-
wildcard-trie
A space-efficient router / radix-trie that supports wildcards
-
substr-iterator
Substring extractor based on characters without allocation
-
libretranslate
A wrapper for the LibreTranslate web API
-
mdloc
command-line tool for processing image links in Markdown files. Download remote images and convert them to Base64 embedded format or local file references.
-
japanese-codepoints
A high-performance Rust library for Japanese character validation and code point handling based on JIS standards
-
indentsort
Structure-preserving sorting of arbitrary indented text
-
html-index
Generate an HTML index
-
ari
ari. the extras library.
-
ricat
A Rust-based implementation of the classic UNIX cat command
-
csv-groupby
execute a sql-like group-by on arbitrary text or csv files
-
codebook_downloader
Dictionary downloading utility for the Codebook spell checker
-
mat-o-viewer
A modern terminal file viewer combining cat, less, and grep with syntax highlighting and markdown rendering
-
cwc
A word counter utility that properly handles CJK and Unicode text
-
livre
type-safe PDF parser
-
hmd
Custom Markdown Engine for my personal blog
-
natural-sort-rs
sorting strings based on Natural sort order
-
terminal_tools
Power-Terminal TUI - fuzzy file and text finder, process manager, git browser, and more
-
screenplay-doc-parser-rs
Tools to parse Screenplay-formatted documents into semantically-typed structs
-
prettychars
Unicode text styling and named glyph lookup with zero runtime overhead
-
mdbook-plugin-utils
mdBook plugins
-
assemblage_db
Distributed Document/Graph DB for Connected and Overlapping Pages
-
stylish-style
Internal implementation details of stylish-core
-
waterui-text
Text and typography components for WaterUI
-
chroma-types
Chroma-provided crate for internal types used in the Chroma API
-
ob
A Blog and RSS system written in Rust
-
artbox
Render FIGlet text into a bounded rectangle with colors and gradients
-
synthphonia-rs
DryadSynth solver for string transformation programs
-
zine
opinionated tool to build your own magazine
-
slicedisplay
Simplistic Display implementation for Vecs and slices
-
carlotk
The main library for Carlo, a simple interpreted programming language
-
winsfs-cli
Site frequency spectrum estimation based on window expectation-maximisation algorithm
-
ik-mini-epub
Minimal async IK to EPUB downloader | Extremely minimal
-
catalog-of-markdown
Generate the catalog of markdown file
-
rustyink
Blazing fast static site generator
-
nano_banana_pro_prompt
High-quality integration for https://supermaker.ai/blog/nano-banana-pro-prompt-use-cases-ready-to-copy-paste/
-
dmos-cli
Djot HTML renderer with advanced features - CLI
-
noctisroll
Text-based TRPG dice rolling system
-
ironstorm_lookup
Lightning fast lookup table for auto completion, type ahead, suggestion engines
-
geoipsed
Inline decoration of IPv4 and IPv6 address geolocations
-
marktask
A CLI tool for parsing and manipulating Markdown tasks
-
axonml-text
Text processing utilities for the Axonml ML framework
-
holy-carpet
customizable blog creator
-
spcfile
.spc file format reading and writing
-
vndb_tags_get
convert VNDB tag list (JSON to markdown)
-
serenity_utils
provide additional utilities for Discord bots created with serenity
-
topfew
CLI to find high frequency occurrences in structured text files
-
ix-embeddings
Shared embedding infrastructure for Ixchel with pluggable providers
-
mcat
a powerful extended cat command, to cat all the things you couldn't before
-
aneurysm
Brainf**k interpreter
-
sre-engine
A low-level implementation of Python's SRE regex engine
-
udpipe-rs
Rust bindings for UDPipe - a trainable pipeline for tokenization, tagging, lemmatization and dependency parsing of CoNLL-U files
-
md2adf
Markdown to Atlassian Document Format converter
-
jpreprocess-njd
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
typship
A cli for typst packages
-
wimbd
A CLI for inspecting and analyzing large text datasets
-
truncate_string_at_whitespace
Truncate a &str at the closest whitespace to a specified length with unicode safety
-
convert-to-spaces
Convert tabs to spaces in a string
-
clip-sanitize
Meta-library for robust text sanitization, repair, and normalization
-
rust-regex-dsl-creator
Regular expression DSL derive macros
-
plagiarismbasic_lib
Basic plagiarism checker written in Rust
-
pdfcr
render a codebase to a pdf
-
ftb
A fast CLI tool to format and align Markdown tables
-
bbcode-tagger
BBCode tree parser and tagger
-
unicode-canonical-combining-class
Fast lookup of the Canonical Combining Class property
-
man
Generate structured man pages
-
hns
Human numeric sorting program: does what sort -h is supposed to do!
-
binyl
A bitwise UTF-8 string inspection tool
-
unix_str
Unix-compatible strings regardless of platform
-
textnonce
Text based random nonce generator
-
pinzh
converting Pinyin to Zhuyin (Bopomofo)
-
count-md
configurable command-line tool and Rust library for Unicode-aware, Markdown-aware, HTML-aware word counting in Markdown documents
-
htmlescape
HTML entity encoding and decoding
-
file-editor
Clean, elegant API for creating and editing text files
-
unicode-rs
A comprehensive Unicode character library for Rust applications with theme support
-
path2regex
Express style path to RegExp utility
-
what-rs
Identify what something is! A pyWhat reimplementation in Rust
-
foundation_nostd
Foundational nostd core implementation for the ewe_platform project
-
veltox
A fast, themeable static site generator written in Rust — ideal for documentation and blogs
-
redact-core
Core PII detection and anonymization engine - Presidio replacement
-
subject-classifier
classifying a commit by its subject
-
idna-cli
Encode/decode Unicode domain names to/from IDNA ASCII
-
artificial-types
Reusable prompt fragments and helper types for the Artificial prompt-engineering SDK
-
lowcharts
draw low-resolution graphs in terminal
-
lookbook
Component preview framework for Dioxus
-
bbx
A robust, performant BBCode pull parser
-
ethan-rs-wc
ethan-rs-wc (erwc) counts words, lines, characters, and bytes. Like the wc command, but more accurate and faster. Text can also be read from standard input for statistics.
-
pi_ucd
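Unicode character functions: look up the language range a character belongs to and, as needed for text layout, determine whether a character is a single-character word or a letter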
-
semire_core
An extension to my former semire_read crate now with more functionality
-
wildcard_ex
extended wildcards that allows VB-like specifications
-
hebrew_accents
finding, filtering, and displaying Hebrew accents, specifically focusing on the Tiberian accent system as documented by the Masoretes
-
arabic-script
An expressive API for the characters of the Arabic script
-
weasel-gen
Random ascii generation animation until target string is met
-
ripmors
encoding and decoding international Morse code and several variants
-
gpui_util
A collection of utility structs and functions used by Zed and GPUI
-
opendal-util
OpenDAL
-
agnostic-levenshtein
Levenshtein distance for ASCII or Unicode strings
-
bat
A cat(1) clone with wings
-
harfbuzz
Rust bindings to the HarfBuzz text shaping engine
-
uniquewords-rs
Count the frequencies of words in text file(s) or stdin
-
typope
Pedantic source code checker for orthotypography mistakes and other typographical errors
-
sms_splitter
An SMS message splitter and part calculator with support for GSM and Unicode
-
BBSGlib11
num BBSGlib11
-
simple_peg
A command-line PEG parser implemented in Rust
-
dprint-plugin-markdown
Markdown formatter for dprint
-
rustme
Automatically generate Rusty READMEs
-
vregex
Regular Expression engine
-
dictutils
Dictionary utilities for Mdict and other formats
-
lexical-sort
Sort Unicode strings lexically
-
esl01-renderdag
Render a graph into ASCII or Unicode text
-
quoted-string-parser
Quoted string parser for grammar defined in RFC3261
-
text-search
easy to use plug and play wrapper around tantivy for simple search scenarios
-
markdown-code-runner
Automatically update Markdown files with code block output
-
natord-plus-plus
Natural ordering for Rust
-
lasr
Live Action Search and Replace
-
unobtanium-segmenter
A text segmentation toolbox for search applications inspired by charabia and tantivy
-
kvu
The simplest command line tool to manage key-value pair lines
-
unified-diff
GNU unified diff format
-
izihawa-tantivy-stacker
term hashmap used for indexing
-
rspack_plugin_case_sensitive
rspack case sensitive plugin
-
trees-rs
tree-rs is a drop-in replacement for the tree utility (ASCII tree view of folder directory), but written in Rust. The primary goal of this project is to provide a non-bloated and more…
-
line-numbers
Find line numbers in strings by byte offsets, quickly
-
wordcutw
A C-interface wrapper for Wordcut - a Lao/Thai word segmentation/breaking library
-
kspconfigtool
KSP1 ConfigNode parser and block removal tool
-
mdzk
Plain text Zettelkasten based on mdBook
-
markdown-live-preview
A live Markdown preview server with WebSocket and HTML rendering
-
reedy
A terminal-based RSS reader with a clean TUI interface
-
enso-lazy-reader
An efficient buffered reader
-
hina
:]
-
east-asian-width
Determine the display width of Unicode characters in East Asian contexts
-
meztide
SDK for lotide
-
ranting
Linguistic formatting placeholder extensions for rust
-
block-id
generating opaque, unique, and short string values from (unsigned) integers
-
el_roi
simplify reading user input
-
raw-string
RawStr/RawString are like str/String, but without any guaranteed encoding
-
tkrar
Count frequency of words in a file or a directory
-
ncase
Enforce a case style
-
perm-text
curling straight/dumb quotation marks ("") and apostrophes (') into their curly/smart (“”’) equivalents
-
akiaki
A good old fashioned wiki engine with a flat-file database
-
swot
community-driven or crowdsourced library for verifying that domain names and email addresses are tied to a legitimate university or college
-
rulet
figlet implementation
-
pgn-traits
Traits for chess variations that can be annotated with Portable Game Notation
-
diff-man
diff utility lib
-
squishyid
Shorten and obfuscate IDs
-
xml1
sane, non compliant xml parser without allocations
-
dossiers
home for your specs, policies, and process docs
-
mecab
Safe Rust wrapper for MeCab, a Japanese-language part-of-speech and morphological analyzer library
-
stam
powerful library for dealing with stand-off annotations on text. This is the Rust library.
-
overlap-chunk
splitting text into chunks of specified size with adjustable overlap percentage
-
interslavic
in rust
-
xee-ir
Xee intermediate representation and compilation to bytecode
-
asimov-serpapi-module
ASIMOV module for data import powered by the SerpApi search data platform
-
ccase
Command line interface to convert strings into any case
-
mdbook-open-git-repo
mdbook preprocessor to add an open-on-git-repo link on every page
-
detect-newline-style
Determine a string's preferred newline character
-
detone
Decompose Vietnamese tone marks
-
tet_rs
A third-party implementation of Text Entry Throughput (ref. https://doi.org/10.1145/3290605.3300866) for Rust
-
iconv-native
A lightweight text encoding converter based on platform native API or libiconv
-
kana-converter
converter for half-width/full-width Japanese language characters (katakana, hiragana, and ASCII)
-
rust-tfidf
calculate TF-IDF (Term Frequency - Inverse Document Frequency) for generic documents
-
atrium-api
API library for AT Protocol (Bluesky)
-
streamdown-config
Configuration loading and management for streamdown
-
scanix
search for text or a pattern in files. A fast and lightweight text tool.
-
mask-text
mask text with multiple masking options
-
highly_sendable
Containers for sending Strings and results within your process and beyond
-
parlib
A barebones parser combinator library, inspired by Haskell's Parsec
-
lex_sleuther
Sleuth out script languages by lexing
-
term-gpt
A fast, colorful ChatGPT CLI for your terminal!
-
mdbook-external-links
Open external links inside your mdBooks in a different tab
-
mq-run
Command-line interface for mq Markdown processing tool
-
quillmark-cli
Minimal CLI for testing Quillmark packages locally
-
utf8proc-sys
Unsafe rust bindings to the utf8proc library
-
markitdown
designed to facilitate the conversion of various document formats into markdown text
-
aimds-detection
Fast-path detection layer for AIMDS with pattern matching and anomaly detection
-
real_time_note_taker
A terminal UI tool to take time stamped notes in real time
-
rust_tokenizers
High performance tokenizers for Rust
-
kyte
Extensible mechanics for operational transformation in Rust that are generic with respect to their value (not constrained to text), wire-compatible with Quill and fully fuzzed
-
opentalk-types-signaling-meeting-report
Signaling types for the OpenTalk meeting-report module
-
aki-mcycle
mark up text with cycling color
-
quickner-core
A fast and simple NER tool
-
nib-cli
A CLI for yet another static site generator, Nib
-
transmutation
High-performance document conversion engine for AI/LLM embeddings - 27 formats supported
-
bottomify
Fantastic (maybe) CLI for translating between bottom and human-readable text
-
dbxcase
Dropbox-compatible case-folding algorithm
-
asciifolding
ascii folding library
-
xj_scanf
Safe reimplementation of scanf()
-
rtl_typepad
A minimal RTL typepad GUI for Persian/Arabic text with automatic Unicode bidi isolate injection for technical tokens
-
truecase
Restore correct letter casings in arbitrary text using a statistical model
-
ucp-translator-markdown
Markdown translator for UCP
-
pulldown_typst
A pull parser for Typst markup
-
legalis-us
United States jurisdiction support for Legalis-RS (Common Law)
-
rosetta-aisp-llm
LLM fallback for AISP conversion using Claude SDK - extends rosetta-aisp with AI-powered conversion
-
slugify
Macro for flexible slug generation
-
p4d-mdproof
Markdown to PDF converter
-
mdbook-metadata
mdBook preprocessor to parse markdown metadata
-
tg_bot_models
Telegram Bot API types (for API 4.3)
-
string-width
Accurate Unicode string width calculation for terminal applications, handling emoji, East Asian characters, combining marks, and ANSI escape sequences
-
microslop
Turn your text into beautifully chaotic, glitchy, Wandoze-level slop
-
grammateus
facilitate working with Ancient Greek words
-
character-set
High performance set.contains(char)
-
kmdbook-driver
High-level library for running mdBook
-
regex-chunker
Iterate over the data in a Read type in a regular-expression-delimited way
-
easy_io
Fast and dead-simple IO for competitive programming in Rust
-
surt-rs
Sort-friendly URI Reordering Transform (SURT)
-
midstring
Create a string between two other strings, that is lexicographically halfway between them
-
kind-tree
Syntactic trees for Kind compiler
-
tiefdownlib
manage and convert TiefDown projects
-
gregex
Regex solver utilizing NFA
-
mdbook-checklist
An mdBook preprocessor for generating checklists and indexes
-
mupdf-sys
Rust FFI binding to MuPDF
-
superfold
A multilingual Rust library and CLI to process UTF-8 strings to exclude diacritics and fold non-phonetic graphemes into their phonetic ASCII representation
-
busperf_gui
GUI library for Busperf
-
film_parser
application for parsing detailed film data from raw text, including information such as title, release year, director, screenwriter, cast, genre, and runtime, into a structured Film…
-
vesti
A preprocessor that compiles into LaTeX
-
liwe
IWE core library
-
dynfractal
A program to render any equation (in terms of z and c) into an ASCII fractal
-
url-cleaner-engine
The engine behind URL Cleaner
-
jq-rs
Run jq programs to extract data from json strings
-
markup_engine
Zero-dependency, trait-based, ultra-lightweight markup renderer (Markdown + HTML + HTML + pluggable). Born from LeadSheetML, useful anywhere.
-
neopdf_capi
C language interface to NeoPDF
-
shimmytok
Pure Rust tokenizer for GGUF models with llama.cpp compatibility (SentencePiece + BPE + WPM + UGM + RWKV)
-
skimple
interface for the skim fuzzy-matcher
-
ansi-escape-sequences
High-performance Rust library for detecting, matching, and processing ANSI escape sequences in terminal text with zero-allocation static regex patterns
-
translitrs
Transliteration utility for Serbian language
-
cursive_hexview
hexviewer for cursive
-
ptero-cli
A text steganography CLI tool for Social Media
-
strange
A static website generator
-
controlled-option
Custom Option type with explicit control over niches and memory layout
-
udp-logger-rs
Log macro for log's kv-unstable backend and a UDP socket logger
-
rtss
A command-line tool to annotate stdout/stderr with elapsed times
-
markov_str
Markov Chain implementation optimized for text generation
-
badascii
Backend rendering library for BadASCII diagrams. Block diagrams in ASCII.
-
guarding
guardians for code, architecture, and layering. Guarding provides an architecture guard DSL based on ArchUnit.
-
lindera-dictionary
A morphological dictionary library
-
prototty_common
Definition of common ui elements and views for use with prototty
-
context-notation
Featherweight semantic notation for text
-
cnpj
Brazilian CNPJ parsing, validating and formatting library
-
rslug
fast, and configurable library to create URL-friendly slugs from strings
-
char_reader
Safely read wild streams as chars or lines
-
rawk-core
Core library for the AWK interpreter
-
wtf8-rs
WTF-8 encoding
-
mdbook-svgdx
mdbook preprocessor to convert svgdx fenced code blocks into inline SVG images
-
hitori
Generic compile-time regular expressions
-
qmd-mcp
MCP server for qmd - lightweight SOTA local search engine for AI agents
-
textwrap-cli
Command line interface for textwrap
-
inkline
Display colorized ascii art to the terminal
-
etch
Not just a text formatter, don't mark it down, etch it
-
html-linter
An HTML linting library for checking HTML structure and semantics
-
anagrambot
find anagrams of words
-
u8char
An alternative to the char type that uses UTF-8 as its in-memory representation
-
Contact_Book
contact book
-
two_five_six
Two-five-six provides high-entropy, human-friendly identifiers
-
bible-io
working with Bible text data structures
-
zenpatch
A robust library for applying text-based patches, designed for AI coding agents with backtracking algorithm
-
linkcheck
extracting and validating links
-
text_distance
A collection of approximate string matching algorithms
-
language-tokenizer
Text tokenizer for linguistic purposes, such as text matching. Supports more than 40 languages, including English, French, Russian, Japanese, Thai etc.
-
kvarn-chute
A Markdown converter designed to use the Kvarn templating engine
-
good-morning
A welcome message generator
-
rubidium
Rb37
-
ascii_tree
generates ascii trees
-
minigrep_tuhin
A mini grep tool implemented in Rust
-
turbo-json-checker
A pushdown automaton low memory JSON bytes stream checker returning the JSON root-type followed by its start and end index in the Reader
-
mors
converting Morse code to text and back
-
mdlib
A beautiful markdown note-taking application
-
bitflip
functions to generate bitflips of binary and UTF-8 strings
-
hubble
Official Hubble plugin SDK for Rust
-
cgp-monad
Monadic constructs for CGP computation
-
clafrica
This application allows you to type most of the characters in the African alphabet in any text field
-
mdbook-treesitter
mdBook preprocessor for html adding tree-sitter highlighting support
-
rustrings
Strings manipulation for Rust
-
bashdoc
generating documentation/help menu for user defined bash functions
-
awful_knowledge_synthesizer
Generate LLM-powered exam questions from YAML books, manpages, mdbooks, tealdeer pages, and code
-
rialo-s-spl-memo
SPL Memo v3 (Rialo adaptation)
-
libruskel
Generates skeletonized outlines of Rust crates
-
git-blamediff
A program to automatically annotate changes to a file in git(1)
-
rst_renderer
a reStructuredText renderer
-
text_searcher
text searcher tool
-
turndown
An opinionated Rust port of Turndown.js
-
owned_chars
Owned iterators with the same output as Chars and CharIndices
-
rust-functions
A collection of Rust utility functions (starting with format_number)
-
yinzhe9
喵喵隐者9
-
valid_rust_char
A tiny library to check if a char is valid in a Rust file
-
pg_str
Adds str functions to Postgresql via an extension
-
zettel-core
Core library for Luhmann-style Zettelkasten management
-
swimos_model
SwimOS Generic Representation of the SwimOS Serialization Model
-
mecrab-word2vec
High-performance Word2Vec implementation with Hogwild! parallelization for MeCrab
-
case-conv
Faster case conversion crate
-
gecliht
A disparate collection of text manipulation and formatting algorithms
-
moodle_api
Moodle Webservice Client in Rust
-
quickmark-core
Lightning-fast Markdown/CommonMark linter core library with tree-sitter based parsing
-
chardet
rust version of chardet
-
ragtime
Easy Retrieval Augmented Generation
-
fifthtry-mdbook
fork of mdbook, only for ft-cli
-
sqdj
sqdj shortens delimited data
-
tiny-clean
A lightweight, high-performance string sanitizer with configurable rules
-
code-to-pdf
Generates a syntax-highlighted PDF of your source code
-
skills-ref-rs
agentskills library for validating, parsing, and managing Agent Skills
-
text-transliterate
transliterate texts using the SO iconv from POSIX
-
text_lines
Information about lines of text in a string
-
ferrissay
cowsay
-
rusty_regex
A minimalistic regex engine using the pipeline: Regex -> AST -> NFA -> DFA -> Match(String)
-
mdbook-ai-pocket-reference
mdbook preprocessor for the ai-pocket-reference project
-
base16-rs
in Rust offers capabilities for encoding and decoding data in Base16 format. By utilizing the hex library, you can transform data into its hexadecimal representation and also decode…
-
wagyan
CLI tool to convert text into extruded ASCII STL meshes (TTF/OTF supported)
-
base256emoji
A multibase base256emoji implementation
-
fontheight-cli
Find out the vertical extents your font reaches on shaped words
-
attuned-infer
Fast, transparent inference of human state axes from natural language
-
dtools
A collection of tools for working with Commodore disk images
-
mtf
Markdown Table Formatter
-
bardic
A performance-focused toolkit for procedural narrative generation
-
swc_ecma_regexp_ast
AST definitions of ECMAScript regular expressions
-
iso15924
ISO 15924 data
-
mdxjs
Compile MDX to JavaScript in Rust
-
rust_file_encode_mode_convert
A Rust library for detecting a file's encoding. Supports GBK, GBK2312, UTF8, UTF16LE, UTF16BE, UTF8+BOM, UTF32, and other encodings.
-
mdvalidate-utils
functions for mdvalidate
-
ogam
A markup language for story writers
-
snakecase
general purpose snakecase implementation supporting both ascii and unicode
-
help_crafter
help message generator without hassle
-
scrambler
command line tool to scramble letters
-
suffixsort
suffix (inverse lexicographic) sorting
-
deindent
A command line utility and Rust library to format overly-indented text
-
pillar
small tool to format lines into columns
-
somedoc
A very simple document model and markup generator
-
usage-cli
CLI for working with usage-based CLIs
-
string-box
Create Rust string from UTF-8 string, byte string or wide string
-
delay_writer
Wraps a writer and delays its output after each newline
-
qsv_vader_sentiment_analysis
Bindings for Rust from the original Python VaderSentiment analysis tool. Forked for use with qsv.
-
record-query
doing record analysis and transformation
-
string_search
searching strings using substrings
-
encode_rs_fs
Read and Write files with encoding_rs
-
cp437_constants
Defined constants covering the entirety of ASCII code page 437 (https://en.wikipedia.org/wiki/Code_page_437)
-
asciimath-unicode
Convert asciimath to unicode
-
ucd-raw
Uninterpreted access to the unicode UCD
-
fuzzy-datetime
Detects, completes and normalises fuzzy date and date-time strings for interoperability with chrono or direct output as ISO-8601-compatible strings
-
nfa_regex
NFA regex engine for text processing
-
johalun/module
FreeBSD kernel module in Rust
-
pdf_composer_definitions
PDF Composer definitions crate
-
orly
Download O'Reilly books as EPUB
-
opentalk-roomserver-report-generation
OpenTalk RoomServer report generation
-
pest3_core
Early WIP prototype version of pest 3.0
-
corpus-preproc
A preprocessor for text and HTML corpora
-
fast_trie
A memory efficient trie library
-
langsan
sanitizing language model input and output
-
datatroll
a robust and user-friendly Rust library for efficiently loading, manipulating, and exporting data stored in CSV files
-
wdl-doc
Documentation generator for Workflow Description Language (WDL) documents
-
svgbob_cli
Transform your ascii diagrams into happy little SVG
-
zrbecker-minigrep
a mini version of the grep tool
-
hanconv-cli
Convert between Chinese characters variants
-
sixbit
Small packed strings
-
hello_lib
Demonstrate Generics Function
-
reddish
A Rust utility library
-
wikidump
parsing Mediawiki XML dumps
-
arborium-theme
Theme support for arborium syntax highlighting
-
hunspell-sys
Bindings to the hunspell C API
-
replaxe
A command-line tool to replace text in files with easy patterns
-
simplecc
Chinese Convert library (partially) compatible with OpenCC's dictionaries
-
yagenerator
Application that uses tinytemplate engine to generate text files. If you have a set of structured data and need to generate a bunch of arbitrary types of files from it, this tool can help you save some time.
-
pinyin-sort
sort pinyin
-
rfc9839-rs
RFC9839 to test for problematic Unicode code points
-
rakugaki
rendering TTF/OTF font characters as ASCII art in the terminal
-
rust_stringz
string manipulation library for Rust that you probably don't need. It's just for learning purposes.
-
rusttyper
Basic text layout, using rusttype
-
omgwtf8
Optimized-Matching-Generalized Wobbly Transformation Format — 8-bit
-
strcursor
string cursor type for seeking through a string whilst respecting grapheme cluster and code point boundaries
-
kbnf-regex-automata
A forked version of regex-automata for kbnf
-
alpha-counter
Alphabetic counter
-
gbx
GBX (Grundbuch-Exchange) Dateiformat
-
portmanteau
create portmanteaux
-
pdf_form
programmatically filling out pdf forms
-
kanuni
AI-powered legal intelligence CLI - The Ottoman Edition
-
ascii_help
help you quickly convert ASCII codes
-
r4d
Text oriented macro processor
-
sluggify
slug or clean URL generator for Rust. With default settings, you will get a hyphenated, lowercase, alphanumeric version of any string you please, with any diacritics removed, whitespace and dashes collapsed…
-
arinamcnulty-markdown-parser
Markdown parser - university project
-
naming_utils
generating naming conventions, pluralizing words, and rest api paths in Rust
-
casespaetzle
This package introduces common identifier case traits and a macro for case trait generation
-
regexy
lightweight Rust library for working with regular expressions. The regexy crate provides an easy-to-use interface for matching patterns in strings using regex
-
whitespace-conf
Key-value configuration file delimited with whitespaces
-
mdbook-compress
Compress an mdBook project into a single PDF file
-
committed
Nitpicking commit history since beabf39
-
minigrep_sensitive
search for a query, case-sensitive or not, in a file
-
ansi-width
Calculate the width of a string when printed to the terminal
-
minify_selectors_utils
minify-selectors
-
porter-stemmer
Flexible and unicode friendly, Porter stemmer implementation
-
reggy
friendly, resumable regular expressions for text analytics
-
promptify
A plaintext directory formatting tool for interacting with LLMs on the command line
-
cliche
Dead simple static site generator
-
tgrep
Toy grep that honors .gitignore
-
stringprep
algorithm
-
libcommand
creating ascii command interpreters with builtin argument-splitting
-
spandex-hyphenation
Knuth-Liang hyphenation for a variety of languages
-
asaru
Asana Tasks Search CLI
-
bocu1
BOCU-1 compressed unicode encoding
-
num2en
For converting integer and decimal numbers into English cardinal or ordinal number words
-
diaryx_core
Core library for Diaryx - a tool to manage markdown files with YAML frontmatter
-
strip-prefix-suffix-sane
A small utility crate providing ‘sane’ strip_prefix and strip_suffix methods for str that return self instead of an Option when the prefix/suffix is not found
-
c6o-obsidian-export
associated CLI program to export an Obsidian vault to regular Markdown
-
legalis-au
Australia jurisdiction support for Legalis-RS (Commonwealth Constitution, ACL, Fair Work, Mabo)
-
tabulate-rs
Pretty-print tabular data. Port of python-tabulate to Rust.
-
ramp-text
Pure Rust
-
rgon
A command-line tool written in Rust that searches for a query string within a file
-
thoth-note
note-taking app written in Rust
-
trevordmiller
Personal CLI
-
owlz
"Owlz" ascii emojis, created randomly or by design
-
lipgloss-tree
A tree component for terminal user interfaces, styled with Lip Gloss
-
m_lexer
extensible regular expressions based lexer
-
fmtt
A diff-friendly text formatter that breaks lines on sensible punctuations and words to fit a line width
-
fetch_mediawiki_configuration
Generate parse_wiki_text configuration code for a MediaWiki wiki
-
fastn-builtins
fastn: Full-stack Web Development Made Easy
-
to_markdown_table
An easy way to format any data structure into a Markdown table
-
cuteness
Cute static site (+ server) generator with a bunch of plugins :3
-
mintyml-cli
Creates HTML from MinTyML, a minimalist alternative syntax to HTML
-
nxfetch
A minimal, fast and batteries included fetcher!
-
text_layout
Text layout algorithms
-
homoglyph-core
Domain layer for the homoglyph cli and service. Compute homoglyphs for an input sentence.
-
mdbook-quiz-validate
Input validation for quizzes used in mdbook-quiz
-
bin-rs
Binary Reader from multi source
-
aki-unbody
output the first or last n lines, like the Linux head and tail commands
-
correct_word
A No brainer 'did you mean' library for Rust
-
lindera-sqlite
Lindera tokenizer for SQLite FTS5 extension
-
asciis
ASCII based on RFC20. Just ord() and chr()
-
prefix_writer
writer that can prefix text that contains multiple lines or incomplete lines
-
icemelter
minimize files that trigger internal compiler errors (ICEs)
-
semchunk-rs
A fast and lightweight Rust library for splitting text into semantically meaningful chunks
-
deepl-openapi
The DeepL API provides programmatic access to DeepL’s machine translation technology
-
mdbook-morsels
Morsels plugin for Mdbook
-
mdbook-dtmo
Creates a book from markdown files with added plugins
-
mdbook-gitbook
mdBook preprocessor to properly render GitBook specific syntax
-
flatlake
In the works
-
unindenter
unindent text
-
gstring
String with support for Unicode graphemes
-
rckive-genpdf
User-friendly PDF generator written in pure Rust
-
workstyle
Workspaces with style! This program will dynamically rename your workspaces to indicate which programs are running in each workspace. It uses the i3 ipc protocol, which makes it compatible with sway and i3…
-
aprilasr
High-level wrapper for the april-asr C api (libaprilasr) using aprilasr-sys
-
timug
Created for personal blog creation purposes. Timus has its limits, but it fulfills the purposes for which it was created.
-
fst-subseq-ascii-caseless
An automaton that matches if the input contains a specific subsequence ignoring ASCII case to be used with fst
-
clparse
A command line tool for parsing CHANGELOG.md files that use the Keep A Changelog format
-
upodesh
generating Bengali word suggestions based on a given input string
-
blackboxmc_java
BlackboxMC bindings for java.util
-
ascii_converter
converting between different ascii representations
-
zw
encoding and decoding text using zero-width characters
-
glyph-names
Mapping of characters to glyph names according to the Adobe Glyph List Specification
-
sevenmark-utils
Shared utilities for SevenMark crates
-
merge-whitespace
Procedural macros for merging whitespace in const contexts
-
barbara
A command-line tool for instantly surfacing matching MDN articles by keyword or partial phrase
-
biodiff-wfa2-sys
Rust bindings for the WFA2 library
-
xml-string
String types for XML
-
inkjet
A batteries-included syntax highlighting library for Rust, based on tree-sitter
-
rfc9839
RFC 9839 specification
-
mdxbook
Fork of mdBook, with more customizations and flexibility for programmers
-
fmt-extra
Extra formatters not shipped in the standard library
-
regex-split
split_inclusive for the regex crate
-
comment-strip
Remove comments out of text files
-
repub-rs
binary for converting mhtml webpages into remarkable-style summarized epubs
-
rupantor
A Bengali Phonetic Parser which is very flexible and supports Avro Phonetic
-
g2-unicode-jp
convert Japanese Half-width-kana[半角カナ] and Wide-alphanumeric[全角英数] into normal ones
-
character_text_splitter
splitting text into chunks with overlap, designed for handling large amounts of text efficiently. Implementation is identical to langchain's CharacterTextSplitter
-
boykisser
A collection of compile time ASCII art and images, especially boykisser
-
crawdad-rkyv
Crawdad: ChaRActer-Wise Double-Array Dictionary with rkyv support
-
chromalog
A customizable logger with dynamic color coding and file logging
-
ctrl-z
A composable reader to treat 0x1A as an end-of-file marker
-
bpack
Fast, simple and easy to use data compression library meant to work with byte slices of String and str types
-
rew
A text processing CLI tool that rewrites FS paths according to a pattern
-
font-map
Macros and utilities for parsing font files
-
escrit
learning languages by reading texts
-
matcher_c
A high-performance matcher designed to solve LOGICAL and TEXT VARIATIONS problems in word matching, implemented in Rust
-
caser
Change text between PascalCase, camelCase, and snake_case
-
anslatortray
translate from English to Pig Latin!
-
probe-code
AI-friendly, fully local, semantic code search tool for large codebases
-
kodegen_native_notify
KODEGEN.ᴀɪ: Memory-efficient, Blazing-Fast, MCP tools for code generation agents
-
codes-iana-charset
This package contains an implementation of the IANA Character Set registry
-
spellabet
Convert characters into spelling alphabet code words
-
ruSTLa
A reStructuredText → LarST ⊂ LaTeX transpiler
-
ripgrep
line-oriented search tool that recursively searches the current directory for a regex pattern while respecting gitignore rules. ripgrep has first class support on Windows, macOS and Linux.
-
padzapp
An ergonomic, context-aware scratch pad library with plain text storage
-
scfmt
formatting code with structured comments
-
noise_search
Nested Object Inverted Search Engine
-
lucide-dioxus
Dioxus port of Lucide
-
latinrs
Rust package that transliterates text and characters into their Latin form
-
goofy-animals
Generate a name in adjective-adjective-animal form
-
bcdown
Bilibili manga downloader, written in Rust, with support for EPUB, PDF, and ZIP formats
-
jfmt
command-line tool for formatting json files in both readable and compact formats. It supports stdin/stdout shell usage, as well as working on files directly.
-
deliminator
Universal code documentation generator
-
kicad-text-injector
A tiny CLI tool that replaces variables of the style ${KEY} within KiCad PCB (pcbnew) files
-
playbill
ASCII art title generator with random gradient effects
-
awful_book_sanitizer
CLI to clean up OCR-mangled book excerpts into readable text using OpenAI-compatible APIs
-
ctf-brute
Brute-force utilities for Rust
-
tweet-memo
A CLI tool to record Twitter-style short memos in Markdown files
-
crustword
Crusty Crosswords
-
roan-engine
The core engine for the Roan project
-
fuzzywuzzy
A pure-Rust clone of the incredibly useful fuzzy string matching python package, FuzzyWuzzy
-
yhy-email-encoding
Low level email encoding RFCs implementations
-
notegraf
Core library for building a graph-oriented notebook
-
xenon-lexer
The Xenon compiler's lexer
-
thank
Shows info about crates used in your projects so you know who to thank for them and where to do the thanking
-
tagsearch
Filter plaintext files based on @keyword tags
-
slicestring
slicing Strings
-
jtool-grep
notebook-specific grep tool for jtool
-
rustwemoji
Twemoji for Rust
-
ftrace
trace files and paths
-
wtf8
WTF-8 encoding. https://simonsapin.github.io/wtf-8/
-
md_ol_util
Transform markdown unordered lists to ordered lists with current position expressions
-
elicitor-doc-latex
Backend for elicitor: generates fillable LaTeX forms from SurveyDefinition
-
book2pdf
CLI utility to turn published code documentation into PDFs
-
lindera-unidic
A Japanese morphological dictionary for UniDic
-
wcrs
GNU wc in Rust
-
bard
Creates PDF and HTML songbooks out of easy-to-write Markdown sources
-
fast_whitespace_collapse
Collapse consecutive spaces and tabs into a single space using SIMD
-
makepad-rustybuzz
A complete harfbuzz shaping algorithm port to Rust
-
csvsc
Build processing chains for CSV files
-
yar_markdown
Markdown handling for yar
-
encoding-index-singlebyte
Index tables for various single-byte character encodings
-
fancy-regex-fork-pb
A custom fork of the fancy-regex crate. You probably don't want to use this.
-
curtana
Simplified zero-cost wrapper over llama.cpp powered by llama-cpp-2
-
rpdf
PDF command-line utils written in Rust
-
s3grep
A fast, parallel grep tool for searching logs and unstructured content in AWS S3 buckets
-
charclass
define and modify unicode character classes
-
rustdoc-markdown
convert Rust documentation to Markdown, for use with LLMs
-
utf8-bytes
bytes::Bytes, but UTF-8
-
wordshk_tools
A combination of parsers and other tools for words.hk (粵典)
-
chargrid_decorator
chargrid views for decorating other views
-
yuto51942-servant
cli
-
bloom-html
HtmlNode type for bloom-client and bloom-server
-
createqrcode
A CLI tool to convert data into QR codes
-
matchy-match-mode
Shared MatchMode enum for matchy workspace (internal)
-
docket
markdown to HTML documentation rendering
-
uclanr
A random word picker that gives you actually useful words
-
libabbs
aosc-os-abbs maintenance
-
remake
writing maintainable regex and managing symbol soup
-
crawdad
ChaRActer-Wise Double-Array Dictionary
-
codetypo
Source Code Spelling Correction
-
memchunk
The fastest semantic text chunking library — up to 1TB/s chunking throughput
-
translation-api-cn
Some useful structs for calling Chinese translation API cloud services. A helper tool for the bilingual cmdline tool.
-
gchemol-parser
Text parsing made simple
-
clippers
Cross-platform clipboard management library
-
gh_page_tool
A github gh-pages tool for static blog site
-
mle
The markup link extractor (mle) extracts links from markup files (Markdown and HTML)
-
lindera-ipadic-neologd-builder
A Japanese morphological dictionary builder for IPADIC NEologd
-
legalis-ca
Canada jurisdiction support for Legalis-RS (Charter of Rights, Federal/Provincial Law, Quebec Civil Law)
-
tree-sitter-stack-graphs-python
Stack graphs definition for Python using tree-sitter-python
-
trie-match
Fast match macro
-
pprint
Flexible and lightweight pretty printing library for Rust
-
str_overlap
Methods for finding the overlap between two string slices
-
solr_post
command line tool and library to post files in a directory to a Solr server
-
matchpick
Find and replace multi-lines using a match-case
-
dr
Command-line data file processing in Rust
-
strip-codeblocks
strip markdown code blocks from text, preserving only the inner content
-
encoding_rs_transcode
Transcode text within writers using encoding_rs
-
bgmtv
An API wrapper for bangumi.tv based on reqwest
-
mdbook-svgbob2
Alternative mdbook preprocessor for svgbob
-
cl
A command line tool for recording changes to be collected for use in a Keep A Changelog formatted CHANGELOG.md
-
sakurs-cli
Command-line interface for Sakurs sentence boundary detection
-
mdbook-typstpdf
An mdBook backend that generates PDF output using Typst
-
flashtext2
The FlashText algorithm implemented in Rust
-
todo-to-issue
CLI tool that converts forgotten TODO comments into actionable GitHub issues
-
zenbase-llml
Lightweight Language Markup Language - converts data structures to XML-like markup
-
ngram
Iterator adaptors for n-grams and k-skip-n-grams
-
small-bwt
BWT construction in small space
-
treebender
An HDPSG inspired symbolic NLP library for Rust
-
unicode-normalization-alignments
functions for normalization of Unicode strings, including Canonical and Compatible Decomposition and Recomposition, as described in Unicode Standard Annex #15
-
dec_from_char
Small library for converting unicode decimal into numbers
-
rspy
Pythonic interface for Rust
-
jpreprocess-jpcommon
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
alass-ffi
FFI bindings for subtitle synchronization with alass
-
ascii-hangman-backend
customizable Hangman game with ASCII-art rewarding for children (backend)
-
synox
Program synthesis of string transformations from input-output examples
-
legalis-th
Thailand jurisdiction support for Legalis-RS - Thai legal system with Buddhist Era calendar, FBA, BOI, PDPA, Labor law
-
nile-library
supporting nile
-
uwl
A management stream for bytes and characters
-
indeed
Append lines to a file with no shell bullshit
-
scribe-patterns
Advanced pattern matching and search algorithms for Scribe
-
saville_encode
Encode UTF-8 text into Peter Saville's colour code
-
analyse-json
CLI tool for inspecting (Newline Delimited) NDJSON or JSON to understand the contents
-
align
aligning text
-
tantivy-object-store
A tantivy Directory implementation against object stores (S3, GCS, etc.)
-
flux-tui
Fast and lightweight Terminal UI drawing library
-
charmap
one-to-(none/one/many) character mapping
-
serde_ssml
A robust Rust library for parsing, manipulating, and generating Speech Synthesis Markup Language (SSML) documents
-
mdast_util_to_markdown
Markdown to AST
-
giff
Visualizes the differences between the current HEAD and a specified branch in a git repository using a formatted table output in your terminal. The differences are displayed with color-coded…
-
macro_colors
colorful printing macros
-
lazy-string-replace
A lazy version of String::replace, so that it can be formatted or recursively replaced without intermediate allocations
-
lingua-french-language-model
The French language model for Lingua, an accurate natural language detection library
-
hydroper_source_text
Source text containing line locations
-
esc
Escape characters in strings
-
minigrep_101
Minigrep implementation of the Rust Book
-
asimov-openai-module
ASIMOV OpenAI module
-
slugmin
Slug variant that allows spaces and capital letters
-
esperanto-text
Convert Esperanto text between UTF-8, x-system and h-system transliterations
-
character_converter
Turn Traditional Chinese script to Simplified Chinese script and vice-versa and tokenize
-
sauron-markdown
parsing markdown into sauron node
-
inslice
A command-line utility for filtering text input by columns and rows
-
cur
that will hunt for your regular expression
-
utf8-io
Traits and types for UTF-8 I/O
-
assemblage_view
Linearized View Model and Bindings for AssemblageDB
-
vidyut-kosha
A Sanskrit key-value store
-
pulldown-cmark-fork
A pull parser for CommonMark
-
replacer-cli
A flexible cli to replace strings in files or a directory
-
runestr-pancjkv
rune-based Pan-CJKV support
-
text_manipulation_rs
generating random placeholder text in different languages
-
meaningsearch
package that helps you find meaningful lines of any given input. Especially useful in CTFs.
-
uwu-rs
uwuifying library
-
pandoc_ast
deserializes and serializes the markdown ast for writing pandoc filters
-
tectonic_xetex_format
Tectonic/XeTeX engine data structures and their expression in TeX "format" files
-
string-simple
containing some simple string utilities that I use in my other projects
-
arborium-highlight
Unified syntax highlighting for arborium - works with both static Rust grammars and WASM plugins
-
token-dict
basic dictionary based tokenization
-
glifnames
Mapping of characters to glyph names according to the Adobe Glyph List Specification
-
jmemo
CUI tool for maintaining memos
-
matchable
Check if text is matching against string or regex in one API
-
shift_or_euc_c
C API for shift_or_euc
-
find_unicode
Find Unicode characters, the easy way!
-
ucd-util
A small utility library for working with the Unicode character database
-
streamparser
Adapt simple stream parsers for different types of streams
-
bubble-bath
Small and quick HTML sanitizer
-
rex-regex
A pedestrian library for regular expressions: Slow, but easy to understand
-
dioxus-typst
Typst component for Dioxus
-
ean-rs
generating and validating EAN barcodes
-
mdbook-trunk
mdBook plugin which bundles packages using Trunk and includes them as iframes
-
trie-alg
Trie implementation
-
rjoin
joining CSV data on command line
-
english-core
English language inflector
-
syntaxdot-encoders
Encoders for linguistic features
-
atlas-memo
Atlas Program Library Memo
-
unicode-character-database
Unicode character database tables (Unicode Standard Annex #44) generated using ucd-generate
-
encoding-next-index-simpchinese
Index tables for simplified Chinese character encodings
-
divvunspell-bin
Spellchecker for ZHFST/BHFST spellers, with case handling and tokenization support
-
japanese-text
Japanese text normalization library: full-width/half-width conversion, hiragana/katakana conversion
-
slugify-rs
generate slugs from strings
-
trustformers-tokenizers
Tokenizers for TrustformeRS
-
chonkier
🦛 Chonkie, now in Rust 🦀: No-nonsense, ultra-fast, ultra-light chunking library
-
lk-inside
A terminal user interface (TUI) application for interactive data analysis
-
indoc
Indented document literals
-
naming_clt
Extract and convert the naming format (case|notation) of identifiers from files or stdin. Use this tool to prepare identifier name strings for further operations (matching, replacing...) on related files
-
wdpe
WebDynpro Parse Engine
-
asciirend
no_std compatible ascii rendering core
-
hoedown
bindings for the Hoedown markdown processor
-
ottotom-testsupport
Test helpers used for the ottotom crate tests
-
pulldown_mdbook
A pull parser for mdBook
-
textcat
detect text categories. It can be used to detect the language of a given text
-
key-lang
To be the prettiest and simplest script
-
aozora2text
Convert Aozora Bunko format to plain text
-
unicode_names2_generator
Generates the perfect-hash function used by unicode_names2
-
code_generator
A code generator (Currently only targets C)
-
lithe
A Slim template engine by using Pest
-
decondenser
Pretty-print any text based on brackets nesting
-
hayro-font
A parser for CFF and Type1 fonts
-
fea-rs
Tools for working with Adobe OpenType Feature files
-
rustfmt_emitter
Rustfmt emitter library
-
niwa-generator
LLM-powered Expertise generation for NIWA
-
dmntk-gendoc
dmntk | Documentation generator
-
sourceannot
render snippets of source code with annotations
-
pukram-formatting
A type to represent the formatting of the pukram markup language
-
renamer-rs
process and rename files or text
-
salvation-cosmic-text
Pure Rust multi-line text handling
-
syllable
counter for use with reading level calculations
-
dedent
Procedural macro for stripping indentation from multi-line string literals
-
persian-tools-cli
cli for rust-persian-tools crate
-
afrim-memory
handle of sequential codes easier for an input method
-
chinese-telegraph
unicode to chinese telegraph code conversion
-
mq-repl
Read-Eval-Print Loop (REPL) for mq query language
-
mdbook-infisearch
InfiSearch plugin for Mdbook
-
notmecab
tokenizing text with mecab dictionaries. Not a mecab wrapper.
-
levenshtein_lite
No-frills implementation of a Levenshtein Automata and the Levenshtein Distance function
-
squidge
shortens delimited data
-
cli-colors
A CLI tool for outputting text in ANSI format with features like colors, underlining, boldening, and italicizing
-
grader
Stream-based CLI for binary sorting text files via a given shell command
-
veloci_levenshtein_automata
Creates Levenshtein Automata in an efficient manner
-
rreplace
designed to streamline string replacements. It can handle multiple unique replacements and iterates the string only once.
-
floating-ui-utils
Rust port of Floating UI. Utilities for Floating UI.
-
mdbook-embed
A preprocessor that simplifies embedded URL
-
rs-tool
A command-line tool to perform reservoir sampling on a file or a stream
-
unicode_types
A mapping of all the unicode characters into convenience types (one enum per block of characters with one variant per character)
-
rk-utils
A collection of utility functions and data structures for rust
-
invisible-sun
program for generating svg "suns"
-
ut1_blocklist
UT1 blocklist URL/domain filters
-
literumilo
A spell checker and morphological analyzer for Esperanto
-
waken_snowball
Snowball stemming algorithms for 33 languages
-
naromat
Convert text to narou novel format
-
kanjiwrapper
Wrapper for kanjiapi.dev
-
commit_crafter
AI powered tool for Git commit message generator
-
anon-csv-cli
anonymise CSV files, providing various options to substitute real data with plausible fake data
-
csvpeek-rs
A CLI tool to quickly peek into, list, and filter CSV data
-
ox_content_docs
Source code documentation generator for Ox Content (like cargo docs)
-
orgflow-tui
A terminal user interface for orgflow - manage notes and tasks with a smooth workflow
-
unicode-box-drawing
Unicode box-drawing characters
-
rulebook_query
Rulebook discovery and querying engine for searching, filtering, and extracting coding standards
-
csv_to_table
pretty print CSV as a table
-
stringsext
find multi-byte-encoded strings in binary data
-
rehuman
Unicode-safe text cleaning & typographic normalization for Rust
-
mention
code highlighter
-
techlead
CLI is a command-line interface that enables developers to chat with an AI assistant powered by the OpenAI GPT language model, designed specifically to help with your Rust project
-
pdfrust
PDF parser
-
latin
A standard library with a focus on common operations
-
snekdown
A parser for the custom snekdown markdown syntax
-
print-positions
providing string segmentation on grapheme clusters and ANSI escape sequences for accurate length arithmetic based on visible print positions
-
ogrep
searching in indentation-structured texts
-
computergeneration
compgen but all wrong
-
md-kroki
Render kroki diagrams inside markdown
-
tectonic_bridge_harfbuzz
Expose the Harfbuzz C/C++ APIs to Rust/Cargo
-
best_utils_lib_ever
Best utility library/crate ever made!
-
unidecode
pure ASCII transliterations of Unicode strings
-
verba
working with Latin words
-
old_icelandic_zoega
Old Icelandic dictionary for Rust. From "A Concise Dictionary of Old Icelandic" by Geir Zoëga
-
glowpub
A glowfic to epub converter
-
lesspub
CLI tool for downloading Sequences from LessWrong and exporting them as EPUB format ebooks
-
mdopen
Preview markdown files in a browser
-
jxoesneon-tectonic-io_base
Internal crate for the Tectonic project: tectonic_io_base
-
mdbook-permalinks
Generate permalinks in mdBook using paths
-
turn-uppercase
Small command to uppercase text in command line and copy to clipboard
-
hex-utilities
working with hexadecimal numbers
-
llmtext
Turns any website into a single LLM-ready markdown file
-
ltxcut
formats a table-like stream into a LaTeX-table
-
campfire
A tiny static site generator, greatly inspired by Zola
-
py-regex
A small wrapper around the Python regex module via PyO3
-
codive-tools
Tool implementations for Codive
-
runefix-cli
Unified CLI frontend for runefix tools: width detection, markdown formatter, and terminal testing
-
mediawiki_parser
A strict parser for MediaWiki markdown
-
gimme
Pull useful data out of your clipboard
-
checkstream-classifiers
ML and pattern-based classifiers for toxicity, PII, and prompt injection detection
-
minigrep_necimye
Functions required to find the lines in a file that contain the query. The query and file path should be entered through the command line, preceded by two dashes. Ex: cargo run -- body filename.txt
-
asimov-module-cli
ASIMOV Module Command-Line Interface (CLI)
-
rcut
replacement for GNU cut that supports UTF-8
-
docer
Document generation toolkit for creating diagrams, PDFs, and structured documentation artifacts
-
toonconv
CLI tool for converting JSON to TOON (Token-Oriented Object Notation) format
-
markdown-table
Creating markdown tables with Rust!
-
enc-check
inspect utf-8 and utf-16 character encodings
-
incredimo
just another font for your terminal
-
libxdiff
Rust bindings for the libxdiff C library
-
xee-xpath
XPath 3.1 library API
-
dcsv
Dynamic CSV reader, writer, editor
-
validated-slice
Helper macros to implement std traits for custom validated slice types
-
humnum
Human numeric sorting program that does what sort -h is supposed to do!
-
rustfits
A light-weight FITS file reader in Rust
-
grammalecte_client
Grammalecte HTTP client
-
streamdown-syntax
Syntax highlighting for streamdown via syntect
-
aki-txpr-macro
the more easy to use libaki-*
-
dokkoo
Mokk (Macro Output Key Kit) implementation written in Rust
-
mdbook-grammar
An mdBook preprocessor to help with the Rust specification
-
tessera-embeddings
Multi-paradigm embedding library: ColBERT, dense, sparse, vision-language, and time series models
-
mq-crawler
Directory crawler for batch Markdown file processing
-
amdm
Rust client for amdm.ru with Russian lyrics stress marking and meter analysis
-
meet
building website
-
whisperer
Encode text as short Chinese characters to evade censorship
-
delim
parsing values from delimited strings
-
seven_seg
Seven-segment digital display for terminal
-
mantra-lang-tracing
Contains functionality needed to collect requirement traces from code or plain text files for the mantra framework
-
unicode-ellipsis
truncate Unicode strings to a certain width, automatically adding an ellipsis if the string is too long
-
uniart
A CLI tool to convert images and gifs to terminal characters
-
slabs
Text chunking for RAG: fixed, sentence, recursive, and semantic strategies
-
opencc-jieba-rs
High-performance Chinese text conversion and segmentation using Jieba and OpenCC-style dictionaries
-
twitter-text
in Rust
-
wildflower
Wildcard matching against strings
-
tiny-grep
grep-like text search utility written in Rust
-
nlf
A CLI to append newline characters (LF) at the end of text file
-
gematria_rs
Gematria, a traditional Hebrew numerology system
-
voirs-g2p
Grapheme-to-Phoneme conversion for VoiRS speech synthesis
-
asimov-chromium-module
ASIMOV module for Chromium (and Brave, Google Chrome) bookmark import
-
mdbook-skill-tree
mdbook plugin to show roadmaps
-
minigrepbk
A collection of utilities for searching text in a given document
-
pattern-3
Needle API (née Pattern API 3.0), generalization of std::str::pattern
-
fcowsay
working with cowsay
-
lsp-document
Helpers to convert between LSP documents and Rust strings
-
mdbook-keeper
An improved testing experience for mdbook
-
char-positions
Iterate chars and their positions, i.e. line, column, and byte ranges
-
byteforge
A next-generation byte-level transformer with multi-signal patching and SIMD optimization
-
vew
Visualize lsof output
-
unicode-jp
convert Japanese Half-width-kana[半角カナ] and Wide-alphanumeric[全角英数] into normal ones
-
jmdict-fast
Blazing-fast Japanese dictionary engine with FST-based indexing
-
localization
t! macro, the easiest way
-
LitePhoton
A blazingly fast text file/csv file/etc scanner
-
fmtm
A diff-friendly Markdown formatter that breaks lines on sensible punctuations and words to fit a line width
-
august
& program for converting HTML to plain text
-
litsea-cli
Litsea is an extremely compact word segmentation and model training tool implemented in Rust
-
mdbook-summary
Summary parser for mdBook
-
yeslogic-unicode-script
Fast lookup of the Unicode Script property
-
zspell-cli
Command line interface for the ZSpell spellchecking library
-
html-auto-p
function like wpautop in WordPress. It uses a group of regex replacements to identify text formatted with newlines and replace double line breaks with HTML paragraph tags.
-
zed-util
A collection of utility structs and functions used by Zed and GPUI
-
excavate
A command line tool to excavate fields from multi-column texts
-
minigrep_jeck
minigrep is a grep clone that takes a query and searches for it in the file, with added support for regex
-
cfasttext-sys
fastText ffi binding
-
hangul
manipulate Hangul Syllables
-
chars_data
Build-dependency for chars, the unicode character information CLI
-
tree-sitter-stack-graphs-java
Stack graphs for the Java programming language
-
lang-unicodes
Effortlessly convert and validate text across various Unicode ranges, supporting multiple languages for seamless internationalization in your applications
-
simple-xml-builder
XML builder/writer
-
jlu
Command-line utilities for on-the-fly investigation of JSON Lines
-
rosie
Interface for the Rosie Pattern Language, for efficient and maintainable text pattern matching and search
-
typos-dict
Source Code Spelling Correction
-
aklat
create books from markdown files (like Gitbook)
-
libopenlipc-sys
Wrapper around liblipc to interact with Kindle dbus-based LIPC events
-
hayro-write
rewriting pages of a PDF file
-
vndb_rs
Bare-bones VNDB api
-
rust_lemmatizer
A lemmatizing package for use with a .csv dictionary of lemmas and their corresponding words
-
vec-string-to-static-str
providing utilities for converting vectors of Strings into vectors of &'static str
-
rmw-utf8
Short text compression algorithm for UTF-8 (optimized for Chinese, developed in the Rust programming language)
-
crate2bib-cli
A CLI tool for the crate2bib crate
-
limace
Slugify some strings
-
abbreviation_extractor
extracting abbreviations from text
-
notedown_ast
Notedown Abstract Syntax Tree
-
code-agent
AI-Native Code Assistant Library
-
shallow
long text
-
typst-ts-core
Core function of Typst.ts
-
sparklet
small flashcards library
-
alemat
type-safe building of MathML
-
lexikit
CLI that packs an entire project into a single LLM-ready document (Markdown or //-comment style) with UTF-8 detection, binary handling, glob include/exclude filters, and .gitignore awareness
-
auto-regex
Automagically finds a regex that best matches an example and a sample list
-
text-utils
Text utils for unescaping and alignment
-
const_format_proc_macros
detail of the const_format crate
-
grep-regex
Use Rust's regex library with the 'grep' crate
-
benda
Use Bend language functions and capabilities in Python!
-
codetypo-vars
Source Code Spelling Correction
-
miette-arborium
Arborium-powered syntax highlighter for miette diagnostics
-
meme_generator_utils
Meme generator utils
-
fmty
Composable core::fmt utilities
-
molten_ember
Render Markdown beautifully in the terminal 🔥
-
ucp-translator-html
HTML to UCM document translator
-
j4l_minigrep
learning Project for test
-
arbitrator
Format text based on a set of rules and regexes
-
code-splitter
Split code into semantic chunks using tree-sitter
-
dimgrep
command-line text search tool written in Rust, based on the minigrep example from The Rust Programming Language
-
utf8-supported
Determine the UTF-8 support of the current locale
-
cabocha
Safe Rust wrapper for CaboCha, a Japanese language dependency structure analyzer library
-
lingua-german-language-model
The German language model for Lingua, an accurate natural language detection library
-
story-dl
Story web scraping
-
ufofmt
A fast, flexible UFO source file formatter based on the Norad library
-
highlights
Convert kindle highlights into different formats
-
mdbook-nice
A mdbook plugin to add nice css to your book
-
opengrep
Advanced AST-aware code search tool with tree-sitter parsing and AI integration capabilities
-
uiua-doc-gen
Documentation generator for Uiua libraries
-
assert-text
the testing macro tools
-
badascii-mdbook
Embed badascii diagrams in your mdbook. See badascii.me for the editor.
-
ewts-cli
Converter from EWTS (Extended Wylie Transliteration Scheme) to Tibetan Unicode symbols (cli)
-
node-emoji
Convert :emoji: to Unicode using GitHub’s and EmojiDB’s emoji names
-
az_minigrep
minigrep CLI program to search a query string from a file and print out the matched lines
-
untex
Understand and manipulate TeX files with ease
-
chinese_detection
Classify a string as either English, Chinese, or Pinyin
-
termwrap
Wrap Unicode text with ANSI color codes
-
sqlify
CLI tool for formatting SQL queries
-
steve
Search Technical Evidence Very Easily
-
lindera-cc-cedict
A Chinese morphological dictionary for CC-CEDICT
-
scribe-webservice
Web service interface for Scribe repository analysis
-
reflexo-vec2svg
Export a frame of Typst document into svg element
-
fasta-cleaner
Transform fasta files by upper-casing all sequence characters and removing non-ACGT sequence characters
-
rescue-blanket
Escape values while they are being formatted
-
e2m
A lightweight CLI tool that transforms English words into emojis using only Rust’s standard library
-
aho-corasick-unsafe
Fast multiple substring searching
-
table_to_html
interface to convert a tabled::Table into an HTML table (<table>)
-
mdbook-mermaid-animate
mdbook preprocessor to add mermaid animation support, before mdbook-mermaid preprocessor
-
ispell
Easily run ispell, hunspell or aspell from Rust
-
to_fraktur
Function that converts any string to fraktur font
-
vl-convert-pdf
convert SVG to PDF with embedded text
-
sttx
belt for transforming speech-to-text data
-
ssort
CLI tool for suffix (inverse lexicographic) sorting
-
markdown_converter
html to markdown converter and flavored markdown to discord markdown converter
-
mdbook-latex
An mdbook backend for generating LaTeX and PDF documents
-
homoglyph-service
homoglyph-cli
-
regex-automata
Automata construction and matching using regular expressions
-
token-parser
parsing texts into data structures
-
html-compare
compare html files
-
clippy_lints
A bunch of helpful lints to avoid common pitfalls in Rust
-
selmr
Package to create and use Simple Explainable Language Multiset Representations
-
three-dcf-core
Document-to-dataset encoding library for LLM training data preparation. Converts PDFs, Markdown, HTML into structured formats optimized for machine learning.
-
tmenu
TUI fuzzy finder
-
words-count
Count the words and characters, with or without whitespaces
-
kitoken
Fast and versatile tokenizer for language models, supporting BPE, Unigram and WordPiece tokenization
-
dequote
Remove nested quotes around text
-
goodname
assist you with cool naming of your methods and software
-
simple-markdown-parser
handling markdown. May include extra utilities
-
ai-context-gen
A context generator for Rust repositories that creates structured markdown files with relevant information for LLMs and AI agents
-
daft-functions-utf8
UTF-8 functions for the Daft project
-
minigrep-cli-tool
cli tool to search upon a text file for lines containing specified string
-
raw_str
type which contains a byte slice that is believed to contain UTF-8 string data, but which has not been validated
-
file-search
File indexing and search
-
mq-task
A task runner using Markdown
-
untanglr
Probabilistically split concatenated words using NLP based on English Wikipedia unigram frequencies
-
escape-bytes
Escapes bytes that are not printable ASCII characters
-
rsticle-cli
Command line tool to convert source files into narratives/articles
-
mitex-lexer
Lexer for MiTeX
-
fozzie
A quick fuzzy finder for the terminal
-
lindera-ipadic
A Japanese morphological dictionary for IPADIC
-
emoji-printer
Replace emoji shortcodes in string with emoji unicode (":sushi:" -> 🍣)
-
chinese-ner
A CRF based Chinese Named-entity Recognition Library written in Rust
-
search-in-terminal
A terminal-based search tool
-
syntax-html
cli application for generating syntax-highlighted HTML of code files
-
m2h
Convert Markdown to HTML with syntax highlighting
-
twitter_text_parser
Parser for twitter-text in Rust
-
libanubhav
management system written in Rust
-
minigrep_elijahkx
MiniGrep is a Rust-based command-line tool, with a (current) size of 588KB that lets users search files for a given query string and shows matching lines with their line numbers
-
lucide-yew
Yew port of Lucide
-
static_table
creates pretty tables at compile time
-
scripter
A screenplay compiler
-
sejong
Buffer is a buffer that can receive ASCII bytes different from keyboard and send out UTF-32 Hangul string. This buffer allows deletion by Jamo.
-
venus-sync
Sync engine for Venus - converts .rs notebooks to .ipynb
-
skribo
low-level text layout
-
rawstring
A string type that can hold data which may or may not be valid UTF-8
-
mdtranslation
prepare multi-lingual Markdown documents
-
make_it_braille
A CLI tool and library to manipulate unicode braille images
-
regex-lite
A lightweight regex engine that optimizes for binary size and compilation time
-
opentalk-roomserver-types-legal-vote
OpenTalk RoomServer Types Legal Vote
-
eytan-minigrep
minigrep from "the book"
-
tiny-ver
tiny version parser
-
ascii-hangman-webapp
customizable Hangman game with ASCII-art rewarding for children (webapp version)
-
indent_tokenizer
Generate tokens based on indentation
-
jpreprocess-window
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
xpath-cli
Evaluate XPath selectors on XML or HTML documents
-
markdown-formatter
Flavored Markdown (ZH) content formatter
-
aki-mline
match line, regex text filter like the Linux grep command
-
xhtmlchardet
Character set detection for XML and HTML
-
kindle_epub_converter
sanitize epubs for kindles
-
textshrink
A minimalistic lib that reduces text size by replacing specific character sequences with single Unicode characters (e.g., 'mm' becomes '㎜'). On average, this compresses text by ~5%, with variation also ~5%.
-
text-fx
A collection of text processing utilities for Rust
-
tantivy-czech-stemmer
Czech stemmer as Tantivy tokenizer
-
ryaspeller
lib for searching typos in text, files and websites
-
no-comment
Remove rust-style line and block comments from a char iterator
-
regex_static
Compile-time validated regex, with convenience functions for lazy and static regexes
-
unaccent
remove accents from strings, inspired by PostgreSQL's unaccent extension
-
noodler
A port of the python-ngram project that provides fuzzy search using N-gram
-
pulldown-cmark-escape
An escape library for HTML created in the pulldown-cmark project
-
mdbook-pagebreaks
A mdbook preprocessor to insert page breaks when rendering to HTML
-
jxoesneon-tectonic-engine_bibtex
Internal crate for the Tectonic project: tectonic_engine_bibtex
-
llm-tui
A Terminal User Interface (TUI) for interacting with Large Language Models (LLMs) using llm-cli
-
acorns
Generate an AsciiDoc release notes document from tracking tickets
-
lindera-decompress
A morphological analysis library
-
grep-search
search query in contents lib
-
sauron-md
parsing markdown into sauron node
-
aki-stats
output the statistics of text, like the Linux wc command
-
fontcull-klippa
Subsetting a font file according to provided input. (Vendored fork for fontcull)
-
mdbook-rustdoc-links
Link to Rust API docs by name in mdBook
-
zummi
fun lib that produces spoonerisms
-
kpathsea
Rust interface to the kpathsea TeX file management library
-
transition-table
transition table utilities for keyword parser
-
poly_doctest
Polyglot documentation snippet test generator - extract code snippets from docs and generate tests
-
rust-md2html
CLI to convert Markdown to HTML with file watching
-
alpino-tokenizer
Wrapper around the Alpino tokenizer for Dutch
-
md-include
include any file in markdown files
-
unicodeit-cli
The command line interface to unicodeit
-
llm-shield-scanners
Security scanners for LLM Shield toolkit
-
wz
Count words, fast
-
asimov-ollama-module
ASIMOV Ollama module
-
cesu8-str
CESU-8 and Java CESU-8 string validation and manipulation
-
pinot
Fast, high-fidelity OpenType parser
-
igo-rs
Pure Rust port of Igo, a POS (part-of-speech) tagger for Japanese (Japanese morphological analysis)
-
toshi
A full text search engine based on Tantivy
-
google-fonts
Download and cache TTF fonts from Google
-
uniwhat
Display the unicode characters text
-
bbd-lib
Binary Braille Dump
-
wordbreaker
A Unicode-aware no_std crate (requires alloc) that rapidly finds all sequences of dictionary words that concatenate to a given string
-
smartcat
Putting a brain behind cat. CLI interface to bring language models into the Unix ecosystem 🐈⬛
-
dianac
An emulator, compiler, and interpreter for the Diana Compiled Language
-
csv-sanity
Sanitize and transform large CSVs with millions of records quickly and efficiently
-
marker
finding issues in CommonMark documents
-
cmark-syntax
Syntax highlighting for pulldown-cmark
-
groop
Outline-aware grep with syntax highlighting
-
staple
powerful static site generator
-
mdja
Markdown parser optimized for Japanese: CommonMark + GFM support, table-of-contents generation, reading-time calculation
-
shopping-parser
A Rust-based parser for parsing structured product information and shopping lists, supporting multiple currencies and units
-
tree-formatter-rs
format hierarchical structures as text-based trees
-
cargo-cargofmt
Cargo file formatter
-
tokengrams
Compute n-gram statistics and model language over pre-tokenized text corpora used to train large language models
-
word_filter
A Word Filter for filtering text
-
beard
Handy macro to generate formatted text in rust
-
mandate
Convert Markdown or YAML manuals into roff manpages
-
asciimath-text-renderer
Render asciimath in terminal
-
kelp
A convert tool for Japanese
-
nlprule-build
Build tools for a fast, low-resource Natural Language Processing and Error Correction library
-
write16
A UTF-16 analog of the Write trait
-
catatau
A TUI EPUB reader with fuzzy search and navigation
-
shaygrep
Minigrep from the Rust Book, for my practice
-
indentation
Formatter
-
rustex
auto-generated LaTeX files in Rust
-
markovish
Markov chain implementation for text generation
-
mdbook-spec
An mdBook preprocessor to help with the Rust specification
-
notion2md
converting Notion pages to Markdown
-
legalis-vn
Vietnam jurisdiction support for Legalis-RS - Vietnamese legal system with socialist market economy, Labor Code, Enterprise, Investment
-
unidok
A powerful, readable, easy-to-learn markup language
-
yassgy
A static site generator
-
kizame
(刻め!) - CLI for MeCrab morphological analyzer and data pipeline
-
wkhtmltopdf
High-level bindings to wkhtmltopdf
-
markdown-gen
generating Markdown files
-
unicode_categories_zheyuye
Query Unicode category membership for chars
-
kth-lines
Command line tool for filtering stdin lines that just works
-
doccy
brace based markup language
-
jxoesneon-tectonic-xdv
Internal crate for the Tectonic project: tectonic_xdv
-
cindex
CSV indexing library
-
toresy
term rewriting system based on tokenization
-
toml_to_table
pretty print TOML as a table
-
grepox
Minimalist's grep written in Rust
-
wordnik_list
Quick, local library to check valid words within the Wordnik word list
-
aki-resort
sort lines of text. You can use regex to specify the KEY.
-
ligotab
Format delimited data with lightweight markup
-
tabwriter
Elastic tabstops
-
rexis-rag
Rexis RAG - High-performance Retrieval-Augmented Generation framework with memory-first agents, vector search, and LLM integration
-
json2markdown
convert JSON data into well-structured Markdown format
-
regexgrep
ripgrep tool that supports regular expressions
-
vizaminigrep
search text
-
eversal-esi
ESI Library for the Eversal project
-
fiberplane-markdown
convert Fiberplane Notebooks to and from Markdown
-
moenarchbook
Creates a book from markdown files
-
krilla-svg
Converting SVG files to PDF
-
termio
styling terminal output with CSS-like syntax
-
moenster
mønster (n) - pattern. simple glob-style pattern matching for strings
-
wikitext_table_parser
implementation for parsing wikitext tables
-
almanaculum
Core types and traits for analysis
-
ultron-syntaxes-themes
Syntaxes and themes dump for ultron
-
kindleclip
CLI tool to export a file per book in kindle 'My Clippings.txt'
-
lingua-latvian-language-model
The Latvian language model for Lingua, an accurate natural language detection library
-
segtok
Sentence segmentation and word tokenization tools
-
is_utf8
functions to determine if a sequence of bytes is valid utf-8
-
scribe-graph
Graph-based code representation and analysis for Scribe
-
unicode_converter
CLI tool to convert data between various Unicode encodings
-
formatjson
Formats JSON files
-
html_to_epub
A command-line tool that converts .html files to .epub files
-
bazaar
formats and protocols
-
tectonic_engine_xetex
The XeTeX engine as a reusable crate
-
grep-printer
grep crate's Sink trait that provides standard printing of search results, similar to grep itself
-
tui-syntax
Tree-sitter based syntax highlighting for TUI applications
-
hashmoji_generator
Code generation tool for hashmoji
-
lindera-ipadic-neologd
A Japanese morphological dictionary for IPADIC NEologd
-
harfbuzz-sys
Rust bindings to the HarfBuzz text shaping engine
-
lingua-russian-language-model
The Russian language model for Lingua, an accurate natural language detection library
-
iata_bcbp
IATA BCBP Parser in Rust Based on Resolution 792
-
Ds_minigrep
grep implementation from The Rust Programming Book
-
md-parser-kma
parser that converts basic Markdown syntax (headers, bold, italic, lists) into corresponding HTML tags
-
md-inc
Include files in Markdown docs
-
darn
Extras for the Book on Data Analysis with Rust Notebooks
-
character_frequency
counting character frequencies in a string concurrently
-
promptune
A tiny tool to auto-augment prompts programmatically
-
scie
research on how to build a simple code identification engine for different languages
-
sm-search
way of searching through text - for people who are too lazy to use Regex
-
mark-html
efficient Markdown to HTML parser written in Rust
-
pdf-create
low-level, strongly-typed PDF creation library
-
varcon
Source Code Spelling Correction
-
terraphim-markdown-parser
Terraphim Markdown Parser
-
aki-json-pick
The json pick out command
-
encoding_c
C API for encoding_rs
-
hashtag-regex
regex matching hashtags according to the Unicode spec: http://unicode.org/reports/tr31/#hashtag_identifiers
-
k-edit-distance
A basic edit distance implementation for korean characters
-
endf_parser
parsing ENDF-6 format nuclear data
-
rigrep
grep from Rust Book
-
tokenizers-enfer
today's most used tokenizers, with a focus on performance and versatility
-
render_readme
Render Markdown or reStructuredText with syntax highlighting and image filtering similar to GitHub's
-
mdbook-iced
An mdBook preprocessor to turn iced code blocks into interactive examples
-
vader-sentimental
A faster Rust version of the original Python VaderSentiment analysis tool
-
md-designer
A CLI tool for creating design docs in Markdown
-
exit-with-code
A CLI which returns the exit code of your choice
-
grep-searcher
Fast line oriented regex searching as a library
-
seq2xypic
Turn a text sequence diagram into a LaTeX xypic diagram
-
text-tables
A terminal/text table prettifier with no dependencies
-
gret
command line tool to search for patterns and show matches in a tree structure
-
kradical_static
Ready-to-use EDRDG radical decompositions
-
deck
A command line tool to generate HTML presentations from Markdown documents
-
skyspell_core
skyspell core library
-
regex-cli
A command line tool for debugging, ad hoc benchmarking and generating regular expressions
-
mdvault-core
Core library for mdvault - markdown vault management
-
rustsay
CLI tool in Rust that mimics the classic cowsay program, allowing a cow to speak your text in the terminal
-
unflow
DSL to convert design to code
-
ahtml-from-markdown
Convert Markdown to ahtml HTML element trees
-
soup
Inspired by the python library BeautifulSoup, this is a layer on top of html5ever that adds a different API for querying and manipulating HTML
-
regex_quote_fixer
Rewrites grep regular expressions for use in the regex crate
-
dictcc
Rust API for reading and querying the dict.cc offline translation database
-
bukvalno
A cli tool for converting images to ascii art
-
spezilinter
spezifisch's linter for different file formats, linting for weirdly specific stuff
-
autoruby-cli
CLI to easily generate furigana for various document formats
-
prettythanks
frontend to dtolnay/prettyplease library
-
file-size
a function formatting file sizes in 4 chars
-
ucd-generate
A program for generating packed representations of the Unicode character database that can be efficiently searched
-
bpmf_py
A Bopomofo and Pinyin library
-
textr
TeX-inspired plug-n-play interface for converting JSON documents into PDFs
-
findtext_textfile
Search text in text file
-
asimov-x-module
ASIMOV module
-
typos
Source Code Spelling Correction
-
grepmini
grep-like command line tool written in Rust to search for text patterns in files. Built following the Rust book's example.
-
svgbobdoc
Renders ASCII diagrams in doc comments as SVG images
-
oeb-rs
OEB e-book parsing library
-
korrektor
work with Uzbek language text processing
-
rrename
Opinionated tool to rename files in batch. Match regular expression, replace some characters I consider noise to kebab case
-
custard_lib
A core library for Custard
-
ox_content_ssg
Static Site Generation for Ox Content documentation
-
boringascii
Strings that can only be constructed to contain non-whitespace, non-control ASCII characters
-
telegram-escape
Escape text for Telegram's MarkdownV2 format
-
mdbook-tectonic
An mdbook backend for generating LaTeX and PDF documents
-
mini-openai
An OpenAI API client with minimal dependencies
-
szovegertesimutato-score
Calculate szovegertesimutato score for a given text and language
-
caribon
A repetition detector program and library
-
repub
convert markdown documents to epub
-
stringsort
Pathological sorting of string characters
-
chisel-lexers
Chisel backend lexers/scanners
-
unicode_skeleton
detects unicode strings that look nearly identical once rendered, but do not compare as equal. It defines "confusable" and "skeleton" based on Unicode Standard Annex #39
-
pcre2
High level wrapper library for PCRE2
-
rcut-lib
rcut is a Rust replacement for GNU cut that supports UTF-8
-
saurus
Markdown to LaTeX Transpiler
-
hayro-svg
Converting PDF pages into SVG files
-
textframe
query plain text documents by unicode offset without loading them all into memory
-
mq-lsp
Language Server Protocol implementation for mq query language
-
syntaxdot-tokenizers
Subword tokenizers
-
writings
The Bahá’í Sacred Writings for use in Rust projects and APIs
-
warrah
command-line utility and Rust library that sloppily removes code comments from a text file, supporting 60+ programming languages
-
aghpb
Anime girls holding programming books API wrapper for 🦀Rust
-
mitex-parser
Parser for MiTeX
-
widest-line
Find the widest line in a string with proper Unicode and ANSI escape code support
-
llmvm-codeassist
A LLM-powered code assistant that automatically retrieves context (i.e. type definitions) from a Language Server Protocol server.
-
k-lighter
syntax highlighter for K language
-
strfry
randomize a string
-
emojito
Find all the Emoji in a string. Supports composed emoji.
-
utf-8
Incremental, zero-copy UTF-8 decoding with error handling
-
aglow
Render Markdown beautifully in the terminal 🔥
-
utf8-command
UTF-8 encoded std::process::Command output
-
font-awesome
A typed manifest of constants of Font Awesome unicode code points
-
jpreprocess-naist-jdic
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
forming
lightweight architecture as code language. Architecture description language
-
quillmark-core
Core types and functionality for Quillmark
-
milligrep
Custom simplified implementation of grep
-
kaomoji-ru
The full http://kaomoji.ru database, encoded and escaped as byte literals
-
dtxt-detect
dangerous text detection, optimized for high speeds
-
sudachiclone
sudachiclone-rs is a Rust version of Sudachi, a Japanese morphological analyzer
-
encoded-words
Encoded Words for usage in MIME headers
-
mdbook-preprocessor-utils
writing mdBook preprocessors
-
actiondb
A safe and efficient unstructured text (log) parsing library
-
grep-matcher
A trait for regular expressions, with a focus on line oriented search
-
rep-grep
wgrep/write-grep CLI
-
jg
Jeff Goldblum (jg) is a command-line JSON processor. jg searches for structural patterns in json input and prints each json object that matches the pattern.
-
paltoquet
rule-based general-purpose tokenizers
-
yeslogic-unicode-blocks
Functions to access and search Unicode blocks
-
samvadsetu
LLM API for commonly used LLM services including Gemini, ChatGPT, and Ollama. The name implies a bridge for dialogue since the library facilitates communication and interaction between…
-
supercat
A syntax highlighting alternative to cat
-
yeslogic-ucd-parse
parsing data files in the Unicode character database
-
blz-cli
CLI for blz – fast local llms.txt search
-
ende
encoding/decoding unicode/utf-8/utf-16(ucs-2) code points
-
rins_markdown_parser
markdown parser written in Rust
-
sesdiff
Generates a shortest edit script (Myers' diff algorithm) to indicate how to get from the strings in column A to the strings in column B. Also provides the edit distance (levenshtein).
-
chunk_norris
splitting large text into smaller batches for LLM input
-
unicode_names
Map characters to and from their name given in the Unicode standard. This goes to great lengths to be as efficient as possible in both time and space, with the full bidirectional tables weighing barely 500 KB…
-
math-text-transform
Transform greek letters, latin letters, or decimal digits into certain variants from the mathematical alphanumeric symbols Unicode block (U+1D400–U+1D7FF). For example to bold, italic, script or double-struck.
-
top-english-words
retrieve top words from the English language
-
tform
format plain text into well-structured Markdown or HTML
-
rust_nickname_generater
that generates user/nick names based on the rust language
-
roxy_cli
A command-line static site generator
-
stylish-stringlike
API for string-like objects that have styles applied
-
latin1str
Windows-1252 string types
-
fontconfig-rs
Safe, higher-level wrapper around the fontconfig library
-
ngrams
Generate n-grams from sequences
-
wordfreq
port of wordfreq for looking up the frequencies of words in many languages
-
kanji_hanzi_converter
Translate Japanese Kanji to Chinese Hanzi
-
tectonic_xetex_layout
XeTeX's font loading and layout interface encapsulation, as a crate
-
presciidoc
Preprocessing AsciiDoc for other tools
-
lingua-portuguese-language-model
The Portuguese language model for Lingua, an accurate natural language detection library
-
codetypo-cli
Source Code Spelling Correction
-
ghimlink
Generates a Github-Flavoured-Markdown formatted image link to an image in the same repository
-
gen-epub-book
Generate an ePub book from a simple plaintext descriptor
-
contractions
expand contractions in English
-
markdown-tables
generating markdown-formatted tables
-
text2art
translate text to ascii art
-
heyiwei
What does it mean
-
detect-indent
Detect the indentation of code
-
lithe-cli
A cli of lithe
-
marcus
An experimental Markdown parser written in Rust
-
cli_app_capo
CLI application with Unix-like tools
-
asimov-dataset-cli
ASIMOV Dataset Command-Line Interface (CLI)
-
unicodeit
Converts LaTeX to Unicode (rust port)
-
LatexImgGen-lib
generating Latex images of equations with an edition number to avoid cache flushing and timestamp attributes
-
morc
Dead simple, minimal markdown generator library written in Rust
-
bigstr
A command-line tool to make string BIG
-
kakasi
Romanize hiragana, katakana and kanji (Japanese text)
-
lindera-python
Morphological analysis libraries and command line interface
-
xssan
that sanitizes HTML to prevent XSS vulnerabilities
-
lex-babel
Format conversion library for the lex format
-
word-star-puzzle-solver
A CLI tool for solving VGs word star puzzle
-
recode_rs
Command-line tool for converting between the character encodings defined in the Encoding Standard
-
dekor
styling and character repository in Rust
-
kwindex
based "keyword index" library crate that maintains an index of words from texts
-
advanced_string_generator
A command-line tool for generating strings based on customizable regex patterns
-
stylometry-analyzer
Minimal CLI tool that combines one or more .txt files, extracts user-authored text, and enforces a minimum size. Hash-embeds text chunks and queries a local vector DB to classify writing style…
-
strizer
minimal and fast library for text tokenization
-
izihawa-tantivy-tokenizer-api
Tokenizer API of tantivy
-
darts
A double array trie, A Forward Maximum Matching Searcher
-
japhonex
Japanese phone number checker for Rust
-
hexify
Format hex
-
ox_content_og_image
OG image generation for Ox Content
-
repa
Peak Performance Pattern Seeker
-
flerp
CLI tool that does XYZ
-
mdoc
Modern PDF creation through Markdown and LaTeX
-
crossandra
A straightforward tokenization library for seamless text processing
-
flw
Process text via configurable tasks
-
uo_rst_parser
fork of rst_parser with fixes for upstream-ontologist
-
unitoken
Fast BPE tokenizer/trainer with a Rust core and Python bindings
-
bible-reference-rs
Extract Bible references from plain text
-
arabic_text_utils
Arabic text processing and manipulation
-
addbib
An app to add linked bibliographies to markdown files
-
parattice
Recursive paraphrase lattice generator
-
jxoesneon-tectonic-xetex_format
Internal crate for the Tectonic project: tectonic_xetex_format
-
letrs-fonts
A collection of FIGlet fonts
-
screen_printer
displaying rectangular blocks of text to a terminal
-
nerd_font
Enumerated contents of the JetBrainsMono Nerd Font
-
wg-ragsmith
Semantic chunking and RAG utilities for document processing and retrieval-augmented generation
-
lingua-arabic-language-model
The Arabic language model for Lingua, an accurate natural language detection library
-
markdown-to-html
Markdown parser that runs at hyper speeds!
-
tinytoken
tokenizing text into words, numbers, symbols, and more, with customizable parsing options
-
charwise
This lightweight, dependency-free rust library provides a convenient way to read characters from different resources
-
rtlicious
A nom-based parser for Yosys RTLIL files
-
tattoy-wezterm-char-props
Unicode and other character properties
-
pray
A tui tool for preparing a prompt to the llms
-
lindera-filter
Character and token filters for Lindera
-
utf8toipv4
Convert UTF-8 to ipv4 addresses and vice versa
-
braille_pics
producing text-art pictures using Braille characters
-
p101_enc
convert Olivetti P101 program to and from different encodings
-
bookkeeping
Minimal, safe and flexible in-memory bookkeeping API
-
ipynb-to-md
Convert Jupyter Notebooks to Markdown files
-
mdbookshelf
Create epubs from a list of mdbook repositories
-
spongedown
Converts markdown to html with svgbob support
-
mdbook-core
The base support library for mdbook, intended for internal use only
-
findtext_pdf
Search text in PDF
-
ftd-rt
ftd
-
ngram-search
Ngram-based indexing of strings into a binary file
-
hulk
An ultra simple no hassle static site generator
-
cw
Count Words, a fast wc clone
-
betacode
conversion
-
xxxxx_rust_sts
A collection of useful string and file utilities for Rust
-
agentroot-core
Core library for agentroot - semantic search engine with AST-aware chunking and hybrid search
-
grepster
command-line tool for searching text in files
-
ron_to_table
pretty print RON as a table
-
asimov-readwise-module
ASIMOV module
-
castle_tokenizer
Castle Tokenizer: tokenizer
-
pulldown-html-ext
Extended HTML rendering capabilities for pulldown-cmark
-
rizzer
Fuzzy matching tool to find string similarity
-
mini__grep
grep tool
-
ssml-parser
parsing speech synthesis markup language
-
ed_join
Implementation of Ed-Join Algorithm for string similarity join
-
fuzzy-string-distance
Fuzzy string distance comparisons
-
write-html
writing HTML in Rust
-
lexmatch
lexicon matching tool that, given a lexicon of words or phrases, identifies all matches in a given target text. Uses suffix arrays.
-
outerspace
Methods for prefixing and suffixing the non-whitespace characters in a string
-
mdbook-diataxis
A preprocessor for mdBook which helps apply the Diátaxis documentation framework
-
term-rustdoc
A TUI for Rust docs
-
litegrep
A basic tool for searching in files for lines of text, based on a query
-
encoding-index-japanese
Index tables for Japanese character encodings
-
goya
morphological analyzer for Rust and WebAssembly
-
is-vowel
Heuristically test whether a character is a vowel letter
-
quicksilverx
easy to use grep clone
-
khat
A cat clone, nothing more nothing less
-
stfu
Shut The Ferris Up - profanity filtering for Rust
-
mitex-spec
Specification Library for MiTeX
-
argot
Parse documentation from codebases into Markdown for easy doc creation
-
mq-view
CLI tool for rendering Markdown documents with syntax highlighting and rich text formatting
-
varcon-core
Varcon-relevant data structures
-
utf8_slice
Lightweight UTF8 Slice Utilities
-
stringutils
A collection of various and (hopefully) useful String utility functions
-
lingua-japanese-language-model
The Japanese language model for Lingua, an accurate natural language detection library
-
hsk
Return HSK Level for Simplified Chinese Characters
-
kytea-tokenizer
Wrapper of tokenization by KyTea
-
squ
command-line utility for converting quotation marks in plaintext files to "smart quotes"
-
termbook
behind the termbook-cli
-
tex
The νTeX typesetting engine
-
dd
a clone of the unix coreutil dd
-
mdtranslation-cli
Command-line tools for using mdTranslation, which can be used to prepare multi-lingual Markdown documents
-
gmi2html
Convert text/gemini into HTML
-
imgr
image to ascii converter
-
jxoesneon-tectonic-cfg_support
Internal crate for the Tectonic project: tectonic_cfg_support
-
borderrs
Add stylish borders around your text and data structures
-
mq-dap
Debug Adapter Protocol implementation for mq
-
chinese_segmenter
Tokenize Chinese sentences using a dictionary-driven largest first matching approach
-
lindera-cli
A morphological analysis CLI
-
quewuigrep
grep-like tool written in Rust
-
harfbuzz-traits
Rust Traits for the HarfBuzz text shaping engine
-
cn-font-utils
utils of cn-font-split
-
rusty-grep
RustyGrep is a command-line tool written in Rust that allows users to search for patterns in files
-
utf8_reader
A UTF-8 reader that reads UTF-8 characters from objects that implement the Read trait
-
jp_utils
Utils for working with Japanese text
-
jpreprocess-dictionary
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
unic-emoji-char
UNIC — Unicode Emoji — Emoji Character Properties
-
indexable_str
creating indexable strings
-
cli_app_capo15
CLI application with Unix-like tools
-
jxoesneon-tectonic-bridge_harfbuzz
Internal crate for the Tectonic project: tectonic_bridge_harfbuzz
-
uniaxe
replace Unicode letters with Ascii equivalents
-
ox_content_vite
Vite Environment API integration for Ox Content
-
lindera-compress
A morphological analysis library
-
bqrs
apply boolean query to text
-
strip-tags
Strip HTML and PHP tags from strings
-
rsonpath-test-codegen
Blazing fast JSONPath query engine powered by SIMD. TOML-based test codegen for rsonpath-lib.
-
lingua-chinese-language-model
The Chinese language model for Lingua, an accurate natural language detection library
-
unicode_escape
decoding escape sequences in strings
-
atm_parser_helper_common_syntax
Parsing utilities for the common syntax specification
-
iterlower
Final-sigma-correct lowercasing iterator adapter with option for Turkish/Azeri I behavior
-
pygmentize
wrapper for syntax highlighting
-
jpreprocess-dictionary-builder
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
latex-to-html
Latex to html converter
-
chisel-parsers
Chisel parser front ends
-
rsnltk
Rust-based Natural Language Toolkit
-
allsorts-subset-browser
Temp fork of allsorts 0.15 - includes patch for subsetting fonts for browsers
-
common-words-all
Most common words sorted by ngram frequency
-
const-utf16
Utf8 to utf16 conversion functions for use in const contexts
-
mdbook-translation
prepare multi-lingual mdBook books
-
chanoma
Characters Normalization library. A library for string normalization processing.
-
tectonic_pdf_io
Xdvipdfmx’s PDF, XDV, and image I/O APIs in C, as a Rust crate
-
yamc
Markdown Converter - A fast and feature-rich markdown to HTML and PDF converter
-
catmark
Console printer for CommonMark
-
text-sanitizer
convert text to plain ASCII text
-
looneygrep
A powerful command-line search tool with web page support and replace functionality
-
asimov-anthropic-module
ASIMOV Anthropic module
-
autoruby
Easily generate furigana for various document formats
-
uscan
A universal source code scanner
-
lignin-html
HTML renderer for lignin VDOM Nodes. This crate is primarily for static and server-side rendering. For client-side use, see lignin-dom.
-
xsv
A high performance CSV command line toolkit
-
didaskalia
over-complicated language to store text entries and sequences across different languages
-
sentencepiece
Binding for the sentencepiece tokenizer
-
jxoesneon-tectonic-dep_support
Internal crate for the Tectonic project: tectonic_dep_support
-
rbatis_xml_parser
Push based streaming parser for xml
-
ced
Dead easy csv editor
-
suffix
arrays
-
asciidork-opts
Asciidork opts
-
tectonic_bridge_graphite2
Expose the graphite2 library to Rust/Cargo
-
truncrate
truncate strings economically without splitting your characters, symbols, emojis etc
-
vaporetto_tantivy
Vaporetto Tokenizer for Tantivy
-
htmlfixinator
A composable HTML transformation library with filters for cleaning, modifying, and standardizing HTML content
-
html2runes
An HTML to Text converter
-
char_stream
Unified character reading interface to str, String, bytes, File and Stdin for Rust language
-
kmdbook-html
mdBook HTML renderer
-
mq-formatter
Code formatter for mq query language
-
findtext_sheet
Search text in SpreadSheet
-
rust-cedar
efficiently-updatable double-array trie in Rust (ported from cedar)
-
cutters
Rule based sentence segmentation library
-
poetry-book
Create a poetry book in latex, starting from plain text
-
vaporetto_rules
Rule-based filters for Vaporetto
-
terminal-supports-emoji
Check whether the current terminal supports emoji
-
tiniestsegmenter
Compact Japanese segmenter
-
base_emoji
Convert everything to Emojis (and back)! - 🐫🔦🔥🎲🐬🍀🍟🔋🐬🎲🐬🍀🎁🍟💧👂🔥🚪🔋🍟🔦🔋🚗👣🔦🌍👂🍤🐗
-
env2toml
Convert env vars to toml text
-
jxoesneon-tectonic-bridge_freetype2
Internal crate for the Tectonic project: tectonic_bridge_freetype2
-
unicode-bidi
Unicode Bidirectional Algorithm
-
tectonic_bridge_icu
Expose a subset of the ICU Unicode APIs to Rust/Cargo
-
tnil
Parsing, glossing, and generating utilities for New Ithkuil
-
rustextile
Textile markup language parser for Rust
-
xmldecl
Extracts an encoding from an ASCII-based bogo-XML declaration in text/html in a Web-compatible way
-
rmbs
Remove any fluff, corporate speak, or other bullshit from input text and print the TL;DR essence of what's being said, using the www.bullshitremover.com public LLM API
-
lingua-korean-language-model
The Korean language model for Lingua, an accurate natural language detection library
-
inclip
terminal utility for clipboard interaction and convenience
-
sbert
Sentence Bert (SBert)
-
rure
A C API for Rust's regular expression library
-
rust_readability
A package to assess the complexity of texts using a variety of readability formulas
-
pretok
A string pre-tokenizer for C-like syntaxes
-
namesake
A CLI tool and library to generate names by combining words
-
folia
High-performance library for handling the FoLiA XML format (Format for Linguistic Annotation)
-
schmfy
Schmfication library