-
unicode-width
Determine displayed width of
charandstrtypes according to Unicode Standard Annex #11 rules -
unicode-normalization
functions for normalization of Unicode strings, including Canonical and Compatible Decomposition and Recomposition, as described in Unicode Standard Annex #15
-
ropey
A fast and robust text rope for Rust
-
unicode-segmentation
Grapheme Cluster, Word and Sentence boundaries according to Unicode Standard Annex #29 rules
-
unicode-xid
Determine whether characters have the XID_Start or XID_Continue properties according to Unicode Standard Annex #31
-
icu_segmenter
Unicode line breaking and text segmentation algorithms for text boundaries analysis
-
fuzzy-matcher
Fuzzy Matching Library
-
unicode-script
exposes the Unicode
ScriptandScript_Extensionproperties from UAX #24 -
unicode-truncate
Unicode-aware algorithm to pad or truncate
strin terms of displayed width -
unicode-bidi
Unicode Bidirectional Algorithm
-
unicode-properties
Query character Unicode properties according to UAX #44 and UTR #51
-
text-size
Newtypes for text offsets
-
os_display
Display strings in a safe platform-appropriate way
-
cow-utils
Copy-on-write string utilities for Rust
-
sscanf
(inverse of format!()) Macro based on Regex
-
pdf-extract
extract content from pdfs
-
bstr
A string type that is not required to be valid UTF-8
-
str_indices
Count and convert between indexing schemes on string slices
-
unicode_names2
Map characters to and from their name given in the Unicode standard. This goes to great lengths to be as efficient as possible in both time and space, with the full bidirectional tables weighing barely 500 KB…
-
prettydiff
Side-by-side diff for two files
-
unicode-security
Detect possible security problems with Unicode usage according to Unicode Technical Standard #39 rules
-
stfu8
Sorta Text Format in UTF-8
-
embedded-text
TextBox for embedded-graphics
-
unicode-id
Determine whether characters have the ID_Start or ID_Continue properties according to Unicode Standard Annex #31
-
textsurf
Webservice for efficiently serving multiple plain text documents or excerpts thereof (by unicode character offset), without everything into memory
-
daachorse
Double-Array Aho-Corasick
-
sprintf
Clone of C s(n)printf in Rust
-
strck
Checked owned and borrowed strings
-
hck
A sharp cut(1) clone
-
meow-cli
Print ASCII cats to your terminal
-
difflib
Port of Python's difflib library to Rust
-
sliceslice
A fast implementation of single-pattern substring search using SIMD acceleration
-
byteyarn
hyper-compact strings
-
hyphenation
Knuth-Liang hyphenation for a variety of languages
-
in_definite
Get the indefinite article ('a' or 'an') to match the given word. For example: an umbrella, a user.
-
resb
reading and writing ICU resource bundle files
-
quick-quiz-generator
generate interactive quizzes blazingly fast, right in your terminal
-
glyph_brush_layout
Text layout for ab_glyph
-
aki-xcat
concatenate files that are plain, gzip, xz and zstd
-
linkify
Finds URLs and email addresses in plain text. Takes care to get the boundaries right with surrounding punctuation like parentheses.
-
wordcut-engine
Word segmentation/breaking library
-
whitespace-sifter
Sift duplicate whitespaces away!
-
hi-doc-jumprope
fast rope (fancy string) library built on top of Skiplists - hi-doc fork
-
aki-gsub
substitude text command, replace via regex
-
aki-mcolor
mark up text with color
-
ncount
A word count tool intended to derive useful stats from markdown
-
scrolling_window_pattern_matcher
A unified pattern matching library with optional context support and extractor-driven architecture for streaming data processing
-
aki-xtee
copy standard input to each files and standard output
-
colx
Extract the specified columns from FILES or stdin
-
aki-mcycle
mark up text with cycling color
-
swappy
An anagram generator
-
perg
A fast, feature-rich text search tool similar to grep, written in Rust
-
slugomatic
🐌 A simple CLI tool to slugify and unslugify text, perfect for branch names and URLs
-
aki-unbody
output first or last n lines, like a head and tail of linux command
-
biip
Redact sensitive information from text!
-
lexxor
A fast, extensible, greedy, single-pass text tokenizer for Rust
-
matcher_py
A high-performance matcher designed to solve LOGICAL and TEXT VARIATIONS problems in word matching, implemented in Rust
-
aki-mline
match line, regex text filter like a grep of linux command
-
nu_plugin_emoji
a nushell plugin called emoji
-
aki-stats
output the statistics of text, like a wc of linux command
-
fuzzy-muff
Fuzzy Matching Library
-
content_inspector
Fast inspection of binary buffers to guess/determine the encoding
-
aki-resort
sort lines of text. You can use regex to specify the KEY.
-
svgbob
Transform your ascii diagrams into happy little SVG
-
herolib-core
Core utilities including text processing, networking, and HeroScript configuration language
-
slice-command
slice is a command-line tool that allows you to slice the contents of a file using syntax similar to Python's slice notation
-
quagga
CLI tool that combines multiple text files into a single prompt suitable for Large Language Models
-
wgpu_glyph
A fast text renderer for wgpu, powered by glyph_brush
-
zaz
cross-platform textual UI toolkit with bindings for Rust, C++, Zig and etc
-
kiru
Fast text chunking for Rust
-
unicode-charname
functions for retrieving Unicode character name properties as described in Unicode Standard Annex #44
-
fontique
Font enumeration and fallback
-
osclip
small utility to copy text from terminal to system clipboard
-
kugiri
Marker-based block editing CLI
-
niblits
Token-aware, multi-format text chunking library with language-aware semantic splitting
-
xim-ctext
compound text en/decoder
-
threeway_merge
Git-style 3-way string merging using proven algorithms from libgit2/xdiff. Statically links xdiff (LGPL-2.1+).
-
recase
Changes the convention case of input text
-
pdf_tables
Scrape text from tables in PDF files
-
cql2
Parse, validate, and convert Common Query Language (CQL2) text and JSON
-
topiary-cli
CLI app for Topiary, the universal code formatter
-
nanohtml2text
A zero-dependency library to convert HTML to plain text
-
pastebinit-rs
Just Paste It! A simple CLI tool to paste text to various pastebin services
-
mago-text-edit
A text editing library for Mago
-
whitespace-rs
Tools for fixing whitespace problems in text and source code files
-
jumprope
fast rope (fancy string) library built on top of Skiplists
-
twas
A text substitution application for using random look-up tables to generate text in a manner similar to the Mad Libs game
-
inboxbot
A telegram bot to save messages to a file
-
xa
Execute Anything via LLM - A CLI tool for arbitrary text processing using LLMs
-
assert-text
the testing macro tools
-
booky
analyze English text
-
simplematch
Fast wildcard pattern matching for strings and bytes with a simple api
-
utf64
encode utf-8 strings into utf-64, and decode them back
-
ttlint
Small, fast utility to lint text
-
sapling-renderdag
Render a DAG into ASCII or Unicode text
-
shifter
A CLI tool to rotate lines from stdin by n positions
-
difference-rs
text diffing and assertion library
-
matcher_c
A high-performance matcher designed to solve LOGICAL and TEXT VARIATIONS problems in word matching, implemented in Rust
-
plsfix
Text cleaner upper
-
dec-sixbit
DEC SIXBIT encoding
-
ansi-align
Text alignment library with ANSI escape sequence and Unicode support
-
bevy_text_animation
text animation library for Bevy
-
niho
A command-line tool for converting romanized Japanese text to Japanese characters
-
ephemeral_hub
Ephemeral Hub is a temporary, no-login-required hub for text, files, and collaborative brainstorming. It provides a digital equivalent of a scrap piece of paper or a temporary whiteboard…
-
fasttext
binding
-
text-editing
string with utilities for editing
-
sublime_fuzzy
Fuzzy matching algorithm based on Sublime Text's string search
-
ucd
Extends the char type to provide access to most fields of the UCD, Unicode Character Database, as of version 9.0.0. It aims to be compact, fast, and use minimal dependencies (only rust's core crate)…
-
subslay
Text → emoji 💅🏻 Powered by Rust
-
srt2txt
Convert SRT subtitle files into clean plain text (strip timestamps, tags, merge lines)
-
xquo
Quote null splited lines for Bash command line
-
thai-number-text
Convert numbers to Thai text with Baht currency, accounting style, and scientific notation support
-
yeslogic-unicode-script
Fast lookup of the Unicode Script property
-
line_cutter
quickly derive structs that de/serialize positionally encoded text data
-
stylish
implementing colorized text
-
syllabize-es
Syllabize Spanish text, and much more
-
tui-dialog
A widget for entering a single line of text in a dialog for Ratatui
-
newline_normalizer
Zero-copy newline normalization to \n or \r\n with SIMD acceleration
-
chamkho
Khmer, Lao, Myanmar, and Thai word segmentation/breaking library and command line
-
language-tokenizer
Text tokenizer for linguistic purposes, such as text matching. Supports more than 40 languages, including English, French, Russian, Japanese, Thai etc.
-
awful_book_sanitizer
CLI to clean up OCR-mangled book excerpts into readable text using OpenAI-compatible APIs
-
penmanship
A Unicode character lookup library for converting text patterns to Unicode characters
-
text-scatters
A cut-up technique generator from text and ebook files in the terminal
-
text-tokenizer
Custom text tokenizer
-
french-numbers
Represent numbers in French language
-
allegro_font
Allegro 5 font addon Rust wrapper
-
slugrs
A fast, locale-aware slugify library for Rust
-
protextinator
Text management, made simple
-
kashida
Insert Kashidas/Tatweel into Arabic text, e.g. for justification purposes.
-
freya-edit
Text Editing APIs for Freya
-
askalono-cli
detect the contents of license files
-
txlib
text based epub library
-
plainllm
A plain & simple LLM client
-
grapheme
Abstractions for working with extended Unicode grapheme clusters
-
topiary-config
Configuration parser and defaults for Topiary
-
mdbook-replace
mdBook preprocessor that simply replaces text
-
eunicode
A text processing CLI and library that helps sanitize text by removing the naughty bits to make strings good and safe
-
maybe_xml
XML processing library
-
topiary-queries
tree-sitter query files compatible with Topiary
-
skimple
interface for the skim fuzzy-matcher
-
aneubeck-daachorse
Daachorse: Double-Array Aho-Corasick
-
aozora2
Aozora Bunko format converter CLI
-
wordcloud-rs
A word-cloud image generation crate
-
waterui-str
String utilities for WaterUI
-
rehuman
Unicode-safe text cleaning & typographic normalization for Rust
-
chonkie
🦛 Chonkie, now in Rust 🦀: No-nonsense, ultra-fast, ultra-light chunking library
-
trxx
pack and unpack text files
-
clip-sanitize
Meta-library for robust text sanitization, repair, and normalization
-
tokenizations
alignments library
-
aozora2text
Convert Aozora Bunko format to plain text
-
read_input
Ask for user input until the data inputted is valid
-
nixel
Parser for the Nix Expressions Language
-
prose
Reformat text pleasantly
-
nib
static site generator
-
clipcount
Counting words from the clipboard content
-
japanese-text
日本語テキスト正規化ライブラリ - 全角半角変換、ひらがなカタカナ変換
-
rs-tail1
Prints the last line of the input
-
interpolator
runtime format strings, fully compatible with std's macros
-
lowcharts
draw low-resolution graphs in terminal
-
text-similarity-metrics
A high-performance Rust library for computing text similarity using multiple algorithms
-
extractous
fast and efficient way to extract content from all kind of file formats including PDF, Word, Excel CSV, Email etc... Internally it uses a natively compiled Apache Tika for formats are not supported natively by the Rust…
-
argsplitter
A light-weight command line argument parser
-
microslop
Turn your text into beautifully chaotic, glitchy, Wandoze-level slop
-
rslug
fast, and configurable library to create URL-friendly slugs from strings
-
mini_markdown
Small markdown renderer
-
patcher
generating and applying Git-style unified diff patches
-
sixbit
Small packed strings
-
grapheme_machine
Grapheme cluster text segmentation (UAX #29) state machine for streaming input
-
recoco-splitters
Text splitters/parsers for ReCoco, an all-Rust fork of CocoIndex with greater flexibility
-
project-finder
Fast project finder for developers
-
up2code
check code listings in Markdown files against a GitHub repository
-
alphabetic
featuring a struct that represents a letter in Latin-script alphabet
-
nib-cli
A cli for a yet another static site generator Nib
-
egui_keyboard
virtual keyboard widget for egui
-
wordnet-db
Memory-mapped reader for prebuilt WordNet database files
-
clipin
get text from clipboard or stdin
-
vectorscan-rs
Ergonomic bindings to the Vectorscan high-performance regex library
-
fpas
command line software helps you create password from some text
-
editdistancek
Fast algorithm for computing edit distance
-
spyne
All-in-one toolkit
-
unicode-ellipsis
truncate Unicode strings to a certain width, automatically adding an ellipsis if the string is too long
-
clearhead_cli
A CLI tool and library for working with the clearhead framework
-
flat
Project multi-dimensional data onto the flat textual plane
-
reda-geometry
geometry library
-
printerator
simply print iterators without collecting them
-
md-parser-kma
parser that converts basic Markdown syntax (headers, bold, italic, lists) into corresponding HTML tags
-
image-anonymizer
A command-line tool to detect and mask sensitive content in images
-
bevy_ecs_typewriter
ECS typewriter system for Bevy engine
-
bevy_pretty_text
Text effect library for Bevy
-
simple-summarize
Fast extractive text summarizer in Rust (with 30-70% compression)
-
lavendeux-parser
Extensible parsing engine for expression evaluation
-
matchy-extractor
Fast extraction of IPs, domains, emails, hashes from text (internal)
-
thediff
Difference between 2 files in percentages
-
rust-logger
logger
-
dec_from_char
Small library for converting unicode decimal into numbers
-
tiny-grep
grep-like text search utility written in Rust
-
rustdoc-text
A lightweight library and CLI tool to view Rust documentation as plain text
-
wordnet-types
Shared types and part-of-speech enums for WordNet data
-
string-view
Work with views into string slices. Safely extend, reduce without losing parent string size. Modify str slices in-place to avoid allocations.
-
git-blamediff
A program to automatically annotate changes to a file in git(1)
-
grink
find and check web links in text files
-
quot
A fast and flexible command-line tool that converts text input into escaped string literals
-
print-positions
providing string segmentation on grapheme clusters and ANSI escape sequences for accurate length arithmetic based on visible print positions
-
intuicio-framework-text
Text types framework module for Intuicio scripting platform
-
file-action
file operations including reading/writing text, CSV, and JSON files
-
fexpr
filter query language parser so that you can build SQL, Elasticsearch, etc. queries safely from user input.
-
santiago
A lexing and parsing toolkit for Rust
-
eddie
Fast and well-tested implementations of edit distance/string similarity metrics: Levenshtein, Damerau-Levenshtein, Hamming, Jaro, and Jaro-Winkler
-
diacritics
Remove diacritics from letters, for example when standardizing input for a search
-
utf8-io
Traits and types for UTF-8 I/O
-
egui_text_animation
text animation using egui
-
text-span
Struct covering areas of text
-
easy-sgr
An easy to use Select Graphics Rendition(SGR) library
-
syllarust
quickly counting syllables
-
genpdfi
User-friendly PDF generator written in pure Rust
-
colored-print
Concise syntax for colors and styles in a format string literal
-
corn
socket server free to chat & a web server display files
-
crawdad-rkyv
Crawdad: ChaRActer-Wise Double-Array Dictionary with rkyv support
-
suff_collections
Fast realization of suffix array and suffix tree
-
extract-frontmatter
that allows a user to extract an arbitrary number of lines of 'front-matter' from the start of any string
-
slugi
slugify text (e.g., 'Café au lait' → 'cafe_au_lait')
-
mi_primera_lib
Basic math and text utilities for Rust
-
warrah
command-line utility and Rust library that sloppily removes code comments from a text file, supporting 60+ programming languages
-
duat-jump-list
A jump list for Duat, meant to be used by modes
-
advancedresearch-translate
translation or reading ancient texts in their original language
-
pink_accents
Replacement of patterns in string to simulate speech accents
-
btt
Binary to text encodings
-
line_diff
compare single lines by tokenizing them into chunks
-
unicode-normalization-alignments
functions for normalization of Unicode strings, including Canonical and Compatible Decomposition and Recomposition, as described in Unicode Standard Annex #15
-
translitrs
Transliteration utility for Serbian language
-
suffix
arrays
-
sayit
String replacements using regex
-
zen-colour
The most minimal terminal colour and style crate
-
chronofold
A conflict-free replicated data structure (a.k.a CRDT) for versioned text
-
psource
CLI tool to pretty print source code to stdout or directly to the clipboard
-
varion
텍스트 기반 DSL로 구성된 대화 시스템 파싱 라이브러리
-
text-grid
create formatted plain-text tables
-
braid-text
Simpleton merge-type and text diffing for Braid-HTTP
-
crawdad
ChaRActer-Wise Double-Array Dictionary
-
sula-alus
A CLI tool to encode strings and files using Sula-Alus encoding
-
dsmsg
Randomly generate online messages from Dark Souls, Demon's Souls, Bloodborne, Sekiro, and Elden Ring
-
lformat
Clone of Lua
string.formatin Rust based on Cs(n)printf -
gotpl
providing full Go template (text/template and html/template) support via FFI
-
e2m
A lightweight CLI tool that transforms English words into emojis using only Rust’s standard library
-
human-names
A random generator for human first and last names
-
tracery
Text-expansion library
Try searching with DuckDuckGo.