#corpus

  1. ziggy

    A multi-fuzzer management utility for all of your Rust fuzzing needs 🧑‍🎤

    v1.5.0 450 #fuzzer #multi-fuzzer #corpus #honggfuzz #utility #afl
  2. graphannis

    new backend implementation of the ANNIS linguistic search and visualization system

    v4.1.1 160 #query-language #linguistics #visualization #graph #corpus #graph-search #aql #annis #text-search #search-and-visualization
  3. stackpack

    A compressor-agnostic compression pipeline

    v1.0.1 #pipeline #compression #compressor-agnostic #decode #encode #corpus #compressors
  4. tooltest

    CLI conformance testing for MCP servers

    v0.3.0 #mcp-server #testing #testing-server #conformance #corpus #json-output #state-machine #exit-code #repeatable
  5. cctr

    CLI Corpus Test Runner

    v0.27.0 #test-runner #testing #json-output #corpus #env-var #test-suite #multi-line #command-output #setup-teardown #test-output
  6. cargo-test-fuzz

    v7.2.5 #cargo-subcommand #testing #test-harness #afl-fuzz #testing-fuzzing #corpus #test-cargo #serialization
  7. artemisia

    annotation editor is intended to be used for the RIDGES corpus. It is based on [graphANNIS]…

    v0.6.0 #annotations #data-model #corpus #editor #graph-annis #principles #import-export
  8. ecfuzz

    Evolutionary Coverage-guided Fuzzing engine

    v0.2.4 #coverage-guided #corpus #engine #genetic-algorithm #evolutionary-algorithms #logging #tree-based #llvm #mutating #grammar
  9. annatomic

    annotation editor is intended to be used for the RIDGES corpus. It is based on [graphANNIS]…

    v0.4.0 #annotations #data-model #corpus #editor #graph-annis #principles #import-export
  10. ungoliant

    The pipeline for the OSCAR corpus

    v2.0.0 #corpus #common-crawl #oscar #pipeline #web-crawler #fasttext #gz #packaging
  11. rc-zip-corpus

    A collection of zip files for testing

    v0.1.3 #testing #corpus #rc-zip #collection
  12. depyler-corpus

    Deterministic scientific corpus analysis for Python-to-Rust transpilation quality measurement

    v3.25.0 #corpus #scientific #transpiler #metrics #analysis
  13. cctr-expr

    internal component crate of cctr

    v0.27.0 #cctr #array-object #testing #math #test-runner #regex #corpus #json-output #expression-language #forall
  14. cctr-corpus

    internal component crate of cctr

    v0.27.0 #testing #cctr #corpus #parser #test-runner #multi-line #test-files #file-level #json-output #test-suite
  15. profuzz_core

    profuzz is a generic approach to easily create a fast and easy-to use network protocol fuzzer for custom targets

    v0.1.0 #fuzzer #health-check #generic #create #network-protocol #corpus #embedded #tui #binary-protocol #network-stack
  16. graphannis-cli

    command-line interface to the new backend implementation of the ANNIS linguistic search and visualization system

    v4.1.1 #command-line-interface #linguistics #back-end #visualization #search-and-visualization #annis #backend-of-annis #corpora #corpus
  17. graphannis-capi

    C-API to the ANNIS linguistic search and visualization system

    v4.1.1 #corpus #graph-annis #c-api #linguistics #visualization #search-and-visualization #backend-of-annis
  18. wimbd

    A CLI for inspecting and analyzing large text datasets

    v0.3.0 320 #statistics #dataset #big-data #ngrams #search #counting-bloom-filter #corpus #text-data
  19. graphannis-webservice

    web service to the new backend implementation of the ANNIS linguistic search and visualization system

    v4.1.1 #web-services #graph-annis #linguistics #visualization #back-end #search-and-visualization #backend-of-annis #corpora #corpus
  20. kathoey

    text feminization using open corpus linguistics data

    v1.1.5 #russian #binary-encoding #corpus
  21. graphannis-malloc_size_of

    fork of the malloc_size_of crate, which is part of the Servo codebase, to make it available to the graphANNIS corpus search library as dependency

    v2.0.0 240 #malloc-size-of #servo #graph-annis #corpus #memory-size
  22. chess_compression

    A chess compression library

    v0.5.0 #chess #chess-moves #compression #testing #java #lichess #straight #tweak #scala #corpus
  23. oscar-io

    Readers/Writers for OSCAR Corpora

    v0.4.0 100 #reader-writer #oscar #corpus #corpora
  24. rusty-dawg

    building and querying Directed Acyclic Word Graphs (DAWGs) and Compacted DAWGs (CDAWGs) for efficient string indexing and searching

    v0.2.2 #search-indexing #python-bindings #dawg #cdawg #graphs #ram #compacted #corpus #corpora #web-server
  25. corpus

    Centrally Organized, Relative Path Uniqueness Strategy

    v0.2.1 #relative-path #central #path
  26. annis-web

    experimental version of ANNIS corpus search frontend

    v0.2.0 #experimental #web-frontend #corpus #front-end #annis #csv #web-search #linguistics #corpora
  27. corpus-preproc

    A preprocessor for text and HTML corpora

    v0.1.0 #pre-processor #corpus #text #cli
  28. graphannis-core

    supports graph representation and generic query-functionality

    v4.1.1 160 #graph #system #linguistics #back-end #visualization #annis #corpus #backend-of-annis #search-and-visualization
  29. canfuzz

    A coverage-guided fuzzing framework for Internet Computer canisters, built on libafl and pocket-ic

    v0.5.0 #fuzzer #canister #instrumentation #libafl #coverage-guided #wasm-module #pocket-ic #test-cases #corpus #code-coverage
  30. Try searching with DuckDuckGo.

  31. angr

    analyse ngrams in text files

    v0.1.0 #ngrams #text #optimization #tool #nlp #keyboard-layout #corpus #sed #text-file
  32. memas-sdk

    Control Plane APIs for MeMaS (Memory Management Service)

    v0.1.0 #control-plane #api-client #create-user #corpus #memory-management #cp #open-api-specification
  33. corpus-count

    Util to count words and character ngrams in a corpus

    v0.1.1 #ngrams #corpus #count
  34. opus_tools

    Miscellaneous tools for working with the OPUS parallel text corpus

    v0.1.3 #opus #sentence #parallel #tar-gz #corpus
  35. oscar-tools

    Tools for processing OSCAR Corpora

    v0.4.0 #oscar #corpus #document-oriented #json-lines #version #corpora
  36. ptb-reader

    parsing of the merged Penn Treebank format

    v0.9.1 #corpus #parser #treebank #ptb
  37. tanaka

    interface the Tanaka Corpus of parallel Japanese-English sentences

    v0.1.0 #corpus #japanese #dictionary
  38. opus-parse

    parse OPUS

    v0.0.3 #opus #parser #corpus #monolingual #xml
  39. graphannis-malloc_size_of_derive

    fork of the malloc_size_of_derive crate, which is part of the Servo codebase, to make it available to the graphANNIS corpus search library as dependency

    v2.0.0 #servo #corpus #fork #graph-annis #codebase #part-of-servo #field-attributes
  40. solana_libra_fuzzer

    Solana Libra fuzzer

    v0.0.0 #fuzzer #libra #solana #target #corpus