Convert HTML to clean Markdown instantly. Full reference guide for developers and content creators migrating to Markdown-based platforms.
๐ Open HTML to Markdown ConverterFree ยท No signup ยท 100% client-side ยท Paste from Google Docs
Markdown is a lightweight markup language created by John Gruber in 2004. It lets you write formatted documents using plain text โ asterisks for bold, hashes for headings, backticks for code โ that can be converted to HTML automatically. The design goal was that Markdown source should be readable as-is, without requiring a renderer.
Today, Markdown is the lingua franca of technical writing. It powers GitHub READMEs and documentation, Notion pages, Obsidian notes, Hugo and Jekyll blogs, Stack Overflow questions, and Reddit posts. Any developer or content creator who works with code or structured content will eventually need to work with Markdown.
GitHub
READMEs, issues, wikis, documentation
Notion
Pages, databases, project docs
Obsidian
Personal knowledge base, notes
Hugo / Jekyll
Static site generator content
Jupyter
Notebook documentation cells
Slack / Discord
Basic message formatting
Content Migration (WordPress โ Static Sites)
Migrating a WordPress blog to Hugo, Jekyll, or Gatsby requires converting HTML post content to Markdown. The ToolForge converter handles the most common HTML structures that WordPress generates โ paragraphs, images, lists, code blocks, and shortcodes.
Documentation (HTML Docs โ GitHub README)
Existing HTML documentation pages often need to become GitHub-flavored Markdown for repository wikis and READMEs. Converting manually is error-prone โ this tool handles all heading levels, code blocks, and tables automatically.
Simplification
HTML generated by rich text editors (WordPress, Drupal, legacy CMS systems) is often bloated with inline styles, empty spans, and non-semantic markup. Converting to Markdown strips all of that, leaving clean, minimal content that is easier to maintain.
Portability
Markdown is platform-agnostic plain text. It works in any editor, renders in any modern tool, and produces readable diffs in git. HTML is tightly coupled to browsers and rendering engines.
Version Control
Markdown diffs are human-readable. When you change a paragraph, git shows exactly which words changed. HTML diffs from WYSIWYG editors often include changes to attributes, classes, and structure that obscure the actual content change.
Complete mapping of every HTML element to its Markdown equivalent. Elements marked GFM require GitHub Flavored Markdown support.
| HTML Element | Markdown Output | Notes |
|---|---|---|
| <h1> โ <h6> | # through ###### | ATX or Setext style |
| <p> | Plain paragraph text | Blank line = new paragraph |
| <strong>, <b> | **bold** | Or __bold__ (configurable) |
| <em>, <i> | *italic* | Or _italic_ (configurable) |
| <a href="url">text</a> | [text](url) | Or referenced style: [text][1] |
| <img src="s" alt="a"> |  | Title attribute becomes tooltip |
| <ul> / <li> | - item | Also *, + (configurable) |
| <ol> / <li> | 1. item | Auto-numbered in output |
| <code> | `inline code` | Single backtick |
| <pre><code> | ```\ncode block\n``` | Fenced or 4-space indented |
| <blockquote> | > quoted text | Nested blockquotes supported |
| <table> | | col | col |\n|---|---| | GitHub Flavored Markdown tables |
| <hr> | --- | Horizontal rule |
| <br> | Two trailing spaces | Or blank line for paragraph break |
| <del>, <s> | ~~strikethrough~~ | GitHub Flavored Markdown only |
Export WordPress posts
Go to Tools โ Export โ All Content. WordPress generates an XML file (.wxr) containing all post HTML.
Extract post HTML
Open the .wxr file and find <content:encoded> blocks. Each block contains the full HTML of a post. Copy the HTML content.
Paste into the converter
Paste each post's HTML into the ToolForge HTML to Markdown converter. Adjust options (heading style, code blocks) to match your target platform.
Review and fix images
Image src paths will still point to your WordPress install. Update them to your new hosting location or CDN. Alt text and captions are preserved automatically.
Save as .md files
Download each converted post as a .md file. Name files according to your static site generator's naming convention (e.g., 2026-06-07-post-title.md for Jekyll).
The easiest method is the rich-text paste feature. When you copy from Google Docs, the clipboard includes the underlying HTML representation alongside the plain text. The ToolForge converter can capture this HTML directly:
Open your Google Doc and select all content (Ctrl+A / Cmd+A)
Copy to clipboard (Ctrl+C / Cmd+C)
In the converter, paste into the "Rich Text Input" area (not the HTML textarea)
The tool captures the Google Docs HTML from your clipboard
Click Convert โ headings, bold, italic, links, and lists are preserved
Download the .md file and add front matter (title, date) for your platform
CommonMark
StandardThe standardized Markdown specification (commonmark.org). Defines precise, unambiguous parsing rules. Supported by Pandoc, GitLab (partially), and most strict parsers. Does not include tables or task lists.
GitHub Flavored Markdown (GFM)
DefaultGitHub's superset of CommonMark that adds tables, task lists (- [x]), strikethrough (~~text~~), fenced code blocks with language hints, and URL auto-linking. The de facto standard for most developers. This converter outputs GFM by default.
MultiMarkdown
ExtendedExtended Markdown with footnotes, definition lists, metadata, and more. Used by some academic writing tools. Less common in web development contexts.
Pandoc Markdown
AdvancedThe richest Markdown variant, used by the Pandoc document conversion tool. Supports footnotes, citations, math (LaTeX), and dozens of output formats (PDF, DOCX, EPUB). For academic and technical writing.
Two-panel layout โ HTML input on left, Markdown output on right
Powered by Turndown โ gold standard HTMLโMarkdown library
GitHub Flavored Markdown (GFM) output โ tables, strikethrough, task lists
Live preview tab โ rendered Markdown as it will appear
Rich text paste โ captures HTML from Google Docs clipboard
HTML file upload โ drag-and-drop .html or .htm files
Configurable options โ heading style, bullet marker, code blocks, links
One-click copy to clipboard
Download as .md file
Sample HTML pre-loaded for instant demo
100% client-side โ content never sent to server
Works offline after first page load
Standard Markdown (CommonMark) does not include tables, but GitHub Flavored Markdown (GFM) does. The HTML to Markdown converter outputs GFM pipe tables by default. These are supported on GitHub, GitLab, Notion, Obsidian, and most modern Markdown renderers. If your platform uses strict CommonMark, table HTML will be preserved as raw HTML.
HTML entities (&, <, ", {, etc.) are decoded during conversion and output as their literal characters in Markdown. For example, & becomes &, and ' becomes '. Special Markdown characters that could be misinterpreted (like * or _) are automatically escaped with a backslash.
HTML <img> tags are converted to Markdown image syntax: . The alt attribute becomes the alt text, the src becomes the URL, and the title attribute (if present) becomes the image tooltip. Note that relative image paths may need to be updated manually after converting, since the images will be referenced from a different location.
Markdown is a human-readable plain text format โ easy to write, diff in git, and read without rendering. HTML is more powerful (supports complex layouts, inline styles, custom elements) but is harder to read as source and creates noisier git diffs. For documentation in code repositories, wikis, and static site generators, Markdown is almost universally preferred.
Yes. This is the reverse operation โ all Markdown parsers output HTML. Use our Markdown Editor which includes a live preview (HTML rendering of Markdown) and a copy-as-HTML button. Most static site generators (Hugo, Jekyll, Gatsby) also output HTML from Markdown at build time.
The converter preserves all formatting that Markdown supports: headings, bold, italic, strikethrough, links, images, lists, code, blockquotes, tables, and horizontal rules. Formatting that has no Markdown equivalent โ inline styles (font-size, color), divs, spans, custom classes โ is stripped unless you toggle "Preserve unsupported HTML" in the options.
GitHub Flavored Markdown is GitHub's extension of the CommonMark spec that adds tables, task lists (- [x] checked), strikethrough (~~text~~), and fenced code blocks with syntax highlighting. It has become the de facto standard and is supported by GitHub, GitLab, Notion, Obsidian, and most modern Markdown tools. This converter outputs GFM by default.
Not directly with this browser-based tool, but you can convert individual pages. Copy the page source HTML and paste it into the converter. For bulk conversion of websites or content management systems, command-line tools like Pandoc or the turndown Node.js package are better suited. They can process hundreds of files in batch with a single script.
Paste HTML, configure options, and download clean Markdown in seconds. Works with Google Docs, WordPress, and any HTML source.