#html-parser

  1. html5ever

    High-performance browser-grade HTML5 parser

    v0.38.0 2.2M #html-parser #html5 #html
  2. html2text

    Render HTML as plain text

    v0.16.7 187K #html-parser #convert-html #html-text
  3. markup5ever

    Common code for xml5ever and html5ever

    v0.38.0 2.2M #xml #html5ever #xml-parser #html-parser #whatwg #xml-document #html5 #tree-builder #xml5ever #forms
  4. lol_html

    Streaming HTML rewriter/parser with CSS selector-based API

    v2.7.1 153K #html-parser #css-parser #rewriter #html-rewriter #css-selectors
  5. libxml

    Wrapper for libxml2, the XML C parser and toolkit developed for the GNOME project

    v0.3.8 52K #xml #xml-parser #html-parser #xpath
  6. tree-sitter-html

    HTML grammar for tree-sitter

    v0.23.2 123K #tree-sitter #html-parser #html
  7. html5gum

    A WHATWG-compliant HTML5 tokenizer and tag soup parser

    v0.8.3 37K #html-parser #tokenize #whatwg #html5 #html #tokenizer
  8. tl

    Fast HTML parser written in pure Rust

    v0.7.8 115K #html-parser #dom #html
  9. ftml

    Foundation Text Markup Language - a library to render Wikidot text as HTML

    v1.37.1 110 #ast #wikidot #html-parser #parser #wikijump
  10. dom_query

    HTML querying and manipulation with CSS selectors

    v0.24.0 1.8K #css-selectors #html-parser #css #css-parser #scraping
  11. scrape-cli

    Command-line HTML extraction tool powered by scrape-rs

    v0.2.2 #css-selectors #html-parser #simd-accelerated #extract #nodejs #html5 #wasm #command-line-tool #batch-processing #web-scraping
  12. markup5ever_rcdom

    Basic, unsupported DOM structure for use by tests in html5ever/xml5ever

    v0.36.0+unofficial 350K #html5ever #dom-tree #html-parser #node #serialization #text-node #xml5ever #automated-tests #document-tree #unsupported
  13. swc_html_parser

    HTML parser

    v18.0.0 5.1K #swc #html-parser #babel #typescript-compiler #node #javascript #parser-compiler
  14. astral-tl

    Fast HTML parser written in pure Rust

    v0.7.11 51K #html-parser #html #parser
  15. dompa

    A lightweight, zero-dependency HTML5 document parser

    v1.1.2 320 #html-parser #html5 #parser #dom #serializer
  16. swc_html_ast

    AST definitions for HTML

    v18.0.0 5.4K #swc #ast #babel #html-parser #typescript-compiler #javascript
  17. legible

    port of Mozilla's Readability.js for extracting readable content from web pages

    v0.4.0 #html-parser #article #readability #extract
  18. web_atoms

    Atoms for xml5ever and html5ever

    v0.2.3 661K #html5ever #xml-parser #string-optimization #atom #specification #html5 #xml5ever #html-parser #serialization
  19. rphtml

    An HTML parser written in Rust

    v0.5.12 550 #html-parser #minify-html #html
  20. html_parser

    General-purpose HTML/XHTML parser

    v0.7.0 80K #dom #pest-parser #pest #html
  21. blitz-html

    Blitz HTML parser

    v0.2.0 1.7K #html-parser #blitz #rendering-engine #css #web-page
  22. blitz-dom

    Blitz DOM implementation

    v0.2.4 5.8K #dom #blitz #rendering-engine #html-parser #html-rendering #css-parser #markdown-rendering #virtual-dom #web-page
  23. oak-html

    HTML markup language parser with support for web content and document structure processing

    v0.0.2 #html-parser #parser #markup #web-html
  24. mdka

    HTML to Markdown converter

    v1.6.5 1.8K #html-markdown-converter #html-parser #markdown-parser
  25. htmlite

    An HTML manipulation toolkit

    v0.18.0 #html-parser #toolkit #html #parser
  26. editorjs2html

    converts Editor.js output into clean HTML, supporting multiple block types efficiently

    v0.1.12 1.1K #html-parser #editor-js #editorjs-to-html #editorjs
  27. html-filter

    parse, filter, search and edit an HTML file

    v0.2.1 #html-parser #scraping #html
  28. dom_finder

    HTML parsing with CSS selectors

    v0.5.0 650 #css-selectors #css #scraping #html-parser #selectors
  29. scrape-core

    High-performance HTML parsing library core

    v0.2.2 #css-selectors #html-parser #scraping #dom
  30. meta_oxide

    Universal metadata extraction library supporting 13 formats (HTML Meta, Open Graph, Twitter Cards, JSON-LD, Microdata, Microformats, RDFa, Dublin Core, Web App Manifest, oEmbed, rel-links…

    v0.1.1 #html-parser #extract-metadata #extract #web #web-extract #metadata-parser
  31. parserst

    A recursive-descent reST parser and renderer

    v0.1.1 #restructuredtext #render-markdown #html-parser #ast #static-site-generator #markup-parser #doc-string #recursive-descent #convert-html #parser-and-renderer
  32. escaper

    HTML entity encoding and decoding

    v0.1.1 3.8K #html-parser #xml #parser
  33. readability-js

    wrapper for Mozilla's Readability.js library

    v0.1.5 110 #readability #html-parser #parser #wrapper
  34. skyscraper

    XPath for HTML web scraping

    v0.7.0-beta.2 1.0K #html-parser #xpath #web-scraping #html-text #text-document #parse-error
  35. asciidork-backend-html5s

    Asciidork Semantic HTML backend, based on jirutka/asciidoctor-html5s

    v0.33.0 #asciidork #html #asciidoc #parser #semantic #html-parser
  36. readability-rust

    port of Mozilla's Readability library for extracting article content from web pages

    v0.1.0 6.7K #html-parser #article #content-extraction #parser
  37. html_transpose

    HTML table transpose library

    v0.1.1 #html-table #table-cell #transpose #html-escaping #merged #transposing #convert-html #web-scraping #2d-grid #html-parser
  38. hash-tag

    Markdown to HTML parser

    v0.1.16 600 #render-markdown #html-parser #markdown-parser
  39. cari

    popular HTML parsing utility pup

    v1.0.0 #css-selectors #html-parser #command-line #scraping
  40. trek-rs

    A web content extraction library that removes clutter from web pages

    v0.2.1 550 #html-parser #readability #extract #parser #wasm
  41. brik

    HTML tree manipulation library - a building block for HTML parsing and manipulation

    v0.10.0 #html-parser #css-selectors #namespaces #building-block #dom #html5ever #siblings #ancestor #svg #safe-mode
  42. rieltor_parser

    A parser for extracting detailed apartment information from the rieltor.ua website's HTML

    v0.1.4 200 #html-parser #apartment #ua-parser #information #price #room #currency #house-numbers #grammar #characteristics
  43. node-html-parser

    Fast HTML parser for Rust & WASM producing a lightweight DOM with CSS selector querying

    v0.1.0 #html-parser #css #dom #wasm
  44. fast_html5ever

    High-performance browser-grade HTML5 parser

    v0.26.6 1.6K #html-parser #html5ever #whatwg #html5 #serialization #tree-builder #browser-grade #utf-8 #forms #xml-parser
  45. scrapr-core

    web scraping library for Python

    v0.1.1 #web-scraping #html-parser #web #html
  46. html-query

    jq, but for HTML

    v1.2.2 600 #css-selectors #jq #html-parser #extract #web-page #convert-json
  47. html5tokenizer

    An HTML5 tokenizer with code span support

    v0.5.2 180 #html-parser #html5 #whatwg #tokenizer
  48. ruma-html

    Opinionated HTML parsing and manipulating

    v0.6.0 4.6K #html-parser #ruma #matrix-ruma
  49. tagparser

    A lightweight Rust library for parsing HTML tags with powerful filtering capabilities

    v0.6.0 390 #html-parser #web-scraping #html #web
  50. lithtml

    A lightweight and fast HTML parser for Rust, designed to handle both full HTML documents and fragments efficiently

    v0.8.0 230 #html-parser #dom #html5 #lite
  51. pochoir-extra

    Extra utilities for the pochoir template engine

    v0.15.0 #css #pochoir #component-system #scoped-css #checker #accessibility #debug-mode #debugging #html-parser #real-time
  52. unobtanium-text-pile

    Turns HTML into externally annotated plain text that is optimized for being serialized to the postcard format

    v0.2.0 #text-format #html-text #serialization #language-text #postcard #text-spans #pile #marker #html-parser #unobtanium
  53. mark-html

    efficient Markdown to HTML parser written in Rust

    v0.2.0 #html-parser #markdown #html #parser
  54. readability-js-cli

    Command-line interface for readability-js

    v0.1.5 160 #html-parser #readability #wrapper
  55. designtime-jsx

    Lightweight Rust parser for JSX-style HTML and custom components - built for the DesignTime language

    v1.0.5 190 #html-parser #design-time #component #jsx
  56. html_editor

    Pure and simple HTML parser and editor

    v0.7.0 1.6K #html-parser #dom #editor
  57. parse-html

    project to parse HTML

    v0.4.1 #html-parser #ast #lexer #dom-tree #tags
  58. toks

    Efficient tokens for html5ever::rcdom::RcDom Handle parsing aiming for O(1) HTML DOM walking & efficiency

    v1.4.0 600 #html-parser #html
  59. prejsx

    A JSX-to-HTML transpiler written in Rust using pest and meval

    v0.1.0 #transpiler #html-parser #rust #jsx #html
  60. sauron-html-parser

    Parsing HTML dynamically at runtime

    v0.70.0 950 #html-parser #web #html
  61. html2pango

    convert html to pango

    v0.6.0 1.4K #convert-html #html-parser #pango
  62. parsed-html

    Parsing HTML documents, with support for reading them in an event-based fashion

    v0.1.0 #html #event-based #document #events #fashion #text-content #html-parser
  63. facet-html

    HTML parsing for facet using the format architecture with html5gum

    v0.42.0 #html-parser #streaming-parser #html5 #parser #facet
  64. nanoneo

    Lisp-like DSL which "compiles" into HTML

    v0.6.1 #html #dsl #lisp-like #document #html-parser
  65. zbuf

    “Zero-copy” string and bytes buffers

    v0.1.2 #byte-buffer #zero-copy #utf-8 #input #performance-optimization #html5 #xml-parser #whatwg #html-parser #html5ever
  66. sxd_html

    Add HTML parsing support to sxd_document. This makes it possible to evaluate XPath expressions on HTML documents.

    v0.1.2 110 #html-parser #sxd-xpath #sxd-document #html5ever
  67. domparser

    A super-fast HTML parser and manipulator written in Rust

    v0.0.7 #html-parser #manipulator #dom #super #node #serialization #html-string #napi #css #css-selectors
  68. html_simple_parser

    Parser for HTML files that extracts tags, child tags, attributes, etc.

    v0.1.1 #html-parser #tags #validation #extract #child #grammar #credits #dom #file-structure
  69. bobo_html_parser

    parser of html markdown

    v0.1.1 #html-parser #pest-parser #pest
  70. scraprr

    web scraping library for Python

    v0.1.3 #web-scraping #html-parser #web
  71. capricorn

    Parse HTML according to configuration

    v0.1.93 #html-parser #query #config #node #attr #parser-config
  72. antwerp

    An open-source framework ported from JavaScript to Rust for GitHub Pages and built with the Marcus HTML to Markdown parser

    v0.3.3 #render-markdown #markdown-parser #github-pages #javascript #javascript-parser #html-parser #markdown-template #web-framework #github-page #html-template
  73. reget

    Recipe parser for HTML and JSON-LD with optional Markdown support

    v0.2.3 #markdown #recipe #html-parser #json-ld #document
  74. fast_markup5ever

    Common code for xml5ever and html5ever

    v0.11.1 2.0K #xml-parser #html-parser #serialization #html5ever #whatwg #tree-builder #html5 #forms #performance-optimization #document-parser
  75. h2s

    A declarative HTML parser, which works like a deserializer from HTML to struct

    v0.18.0 #html-parser #dom #scraping
  76. scrapr-bindings

    web scraping library for Python

    v0.1.1 #web-scraping #html-parser #web
  77. sauron-parse

    parsing html syntax

    v0.40.0 110 #svg-parser #html-parser #svg
  78. halldyll-parser

    HTML/CSS parsing and content extraction for halldyll scraper

    v0.1.0 #html-parser #css-parser #css #extract #selectors
  79. sitescraper

    Scraping Websites in Rust!

    v0.2.1 #html-parser #scraping-tool #webscrape
  80. rust-pickaxe

    HTML data extraction library

    v0.5.5 170 #html #xpath #html-parser #extract #css-selectors #python-packages
  82. victoria-dom

    Minimalistic HTML parser with CSS selectors

    v0.1.2 #css-parser #html-parser #css
  83. unhtml

    A magic html parser

    v0.8.0 900 #html-parser #html #parser
  84. html5ever_macros

    High-performance browser-grade HTML5 parser - compiler plugins

    v0.2.7 290 #html5ever #browser-grade #html-parser #compiler-plugin #html5 #parser-compiler #xml-parser
  85. makepad-html

    Makepad html parser

    v1.0.0 310 #html-parser #makepad #makepad-html-parser
  86. wappu

    Fast and flexible web scraping library for Rust, designed to efficiently navigate and extract data from websites. Perfect for data mining, content aggregation, and web automation tasks.

    v0.3.0 490 #web-scraping #html-parser #web-content #web-crawler #extract #data-mining #web-page #web-data #fetch-and-parse #navigate
  87. spider_scraper

    A CSS scraper using html5ever

    v0.1.2 1.4K #web-scraping #css-selectors #html-parser #serialization #web-crawler
  88. pochoir-parser

    HTML parser for the pochoir template engine

    v0.12.2 100 #html-parser #expression #pochoir #tree #html-template #templating #event-handling
  89. rohanasantml

    A better way to write your messy HTML code

    v0.0.2 #interpreter #html-parser #compiler #parser-compiler
  90. sauron-syntax

    Parsing HTML syntax and converting it into a sauron view

    v0.1.4 #svg-parser #html-parser #svg
  91. html_forge

    A robust and efficient HTML parsing library for Rust

    v0.1.0 110 #html-parser #dom #parser #html
  92. html5ever-atoms

    Static strings for html5ever

    v0.3.0 1.7K #html5ever #html-parser #specification #html5 #string #xml-parser #serialization #whatwg #ucs-2 #utf-8
  93. htmlstream

    Lightweight HTML parser for Rust

    v0.1.3 190 #html-parser #document #github #io
  94. silkenweb-parse

    Parse HTML into Silkenweb data

    v0.10.0 170 #html-parser #silkenweb #reactive
  95. html_parser_tarasenko

    Basic HTML parser in Rust using Pest

    v0.1.2 #html-parser #pest-parser #tarasenko #викори
  96. parsex

    Simplistically, quickly and efficiently parse and modify HTML documents

    v0.1.1 #html-parser #html #parser
  97. hyperparse

    A HyperText Markup Language (HTML) parser written in Rust. (WIP)

    v0.1.2 #ast #html-parser #markup-language #token-tree #text-content
  98. html5ever_dom_sink

    Basic DOM implementation for html5ever

    v0.2.0 #html5ever #html-parser #document #dom #html5 #whatwg #serialization #xml-parser
  99. ahref

    Extract 'a' tags from an HTML page

    v0.3.0 130 #html-parser #cli-parser #web #cli
  100. smoldown

    Native Rust library for parsing Markdown

    v0.1.0 #markdown-parser #html-parser #md
  101. lightml

    Parser for XML and HTML

    v0.0.2 #css-parser #xml-parser #html-parser #parser-selector #selectors
  102. html-query-ast

    Expression parser for hq: jq, but for HTML

    v0.2.2 550 #css-selectors #html-parser #expression-parser #jq #hq
  103. wax-cli

    An extension of HTML written in Rust

    v0.2.1 #html-parser #cli-parser #html
  104. htmldom_read

    HTML reader that parses the code into an easy-to-use tree

    v0.5.0 #html-parser #node-tree #node #tree #parser
  105. html_parse

    HTML parser, a wrapper around html5ever

    v1.1.2 #html-parser #html5ever #parser #html
  106. de_hypertext

    serde_json ergonomics for parsing html

    v0.1.4 250 #html-parser #serde-json #ergonomics
  107. nom_html_parser

    A parser to convert HTML string to HTML tree structure written with Nom

    v0.1.1 #html-parser #nom
  108. match_token

    Procedural macro for html5ever

    v0.35.0 1.2M #html5ever #html5 #proc-macro #syntax #tree-builder #html-parser #xml-parser #whatwg #serialization #ucs-2
  109. markdown_to_html_parser

    parses Markdown syntax into HTML

    v0.1.0 #markdown-parser #html-parser #render-markdown #convert #grammar
  110. graburl

    Get all URLs from a website

    v0.1.8 #cli-parser #html-parser #web #cli #parser
  111. cda-dl

    Minimal async library for extracting video stream URLs from cda.pl

    v0.1.0 #async-stream #video-stream #url #http-request #extract #cda #html-parser
  112. html_parser_rscx

    General-purpose HTML/XHTML parser

    v0.7.1 #html-parser #dom #pest #html
  113. microformats-cli

    A command line tool for parsing HTML as Microformats

    v0.9.0 110 #html-parser #command-line
  114. eml2html

    Converts EML files to HTML

    v0.1.0 #eml #html-parser #cli-parser #utility #html
  115. rusthtml

    An HTML parser written in Rust

    v0.2.4 #html-parser #html #parser
  116. tag_parser

    just parse tags like html

    v0.1.2 #html-parser #tags #tag-name
  117. loa

    HTML parser written in pure Rust, no-std

    v0.1.8 #html-parser #pure-html-parser #html
  118. rs_html_parser_tokenizer

    Rs Html Parser Tokenizer

    v0.0.10 #html-parser #tokenize #browser #handle #tags #parser-error #processing-instructions #closing #case-insensitive #notes
  119. rs_html_parser

    Rs Html Parser

    v0.0.10 #html-parser #tokenize #browser #tags #processing-instructions
  120. h2s_core

    A core part of h2s

    v0.18.0 #html-parser #part-of-h2s #struct #extract #from-html #debugging #partial-eq