What a keyword density analyzer does
A keyword density analyzer counts every word and phrase in your text, ranks them by frequency, and reports their density as a percentage of total words. It breaks results into three tables: 1-grams (single words), 2-grams (two-word phrases), and 3-grams (three-word phrases). Each entry shows raw count, percentage density, and TF-IDF weight, which measures how unique the term is compared to generic usage.
Unlike a keyword density checker that tracks terms you specify, this tool is exploratory. You paste content and it tells you what's already there. If you're auditing a competitor's page, reverse-engineering your own post, or trying to figure out why a page ranks for an unexpected term, this tool surfaces the answer. High-density terms are what search engines see as the page's topic.
The TF-IDF weight helps separate signal from noise. "The" appears 100 times in a 2,000-word article, so its raw density is high. But TF-IDF is near zero because "the" appears in every article. A term like "semantic clustering" might appear only eight times, but if it's rare across the web, its TF-IDF weight is high. That's the term doing the heavy lifting for topical relevance.
Stopword filtering is essential for clean results. Without it, the top 20 entries in your 1-gram table are function words: "the," "and," "of," "to," "in," "a," "is," "that," "for," "it." You learn nothing. Toggle stopwords off and the table shows your actual topic terms: the nouns, verbs, and adjectives carrying meaning. We support seven languages, each with its own stopword list, so filtering works correctly whether you're analyzing English, Spanish, or Dutch content.
How to use this keyword density analyzer
- Paste your content into Your text, or drop in a URL and we fetch the page with nav and footer stripped.
- Toggle Ignore stopwords to hide common words like "the," "and," "of," and "is." Leave it on unless you're diagnosing why a filler word has high density.
- Pick your Language so stopword filtering works correctly. We support English, Spanish, French, German, Italian, Portuguese, and Dutch.
- Hit Analyze density. You get three tables: 1-grams, 2-grams, and 3-grams, each sorted by density. Terms flagged as over-optimized (above safe thresholds) are highlighted.
- Scan the top five entries in each table. Those terms are what the page signals to search engines. If a term you didn't intend is in the top five, you're over-indexing on it.
- Export the tables as CSV if you need to compare density across multiple pages or track changes over time.
Try pasting a competitor's page that ranks for a keyword you're targeting. The 2-gram and 3-gram tables show you exactly which long-tail phrases they're optimizing for. You can match or beat those densities in your own content. If you want a faster check for specific keywords you already know, use the keyword density checker instead.
Why n-gram analysis matters
Single-word density tells you almost nothing. A page about "content marketing strategy" might have "content" at 3%, "marketing" at 2.5%, and "strategy" at 2%. You could conclude the page is well-optimized. But if the 2-gram table shows "content strategy" at 0.4% and "marketing strategy" at 0.3%, the page isn't optimized for the full phrase. It's optimized for the component words, which is weaker.
Search engines understand phrases, not just individual words. Google's BERT update in 2019 prioritized multi-word phrase comprehension. Pages that use the exact target phrase in natural contexts outrank pages that scatter the words. N-gram analysis shows you whether your content clusters terms into phrases or treats them as isolated words. Once you know which phrases to target, tools like our content brief generator help you build outlines that naturally incorporate them.
A study from Moz in 2022 found that pages ranking in the top three for competitive keywords had 2-gram density 1.8x higher than pages ranking 4-10, even when 1-gram density was identical. The difference was phrase coherence. Analyzing n-grams surfaces that gap before you publish.
The practical consequence is that you can't optimize by single words alone. If your target keyword is "email marketing automation," checking that "email," "marketing," and "automation" each appear at 2% tells you nothing about whether "email marketing automation" appears as a complete phrase. The 3-gram table answers that question directly. If the full phrase is missing from the top ten 3-grams, your page isn't optimized for the term you think it is.
Common mistakes
- Only looking at 1-grams. Single-word density is almost useless for modern SEO. Scan 2-grams and 3-grams first, then check 1-grams to spot filler overuse.
- Ignoring TF-IDF weight. A term with 2% density and low TF-IDF is generic. A term with 1% density and high TF-IDF is your topical anchor. Sort by TF-IDF, not density, to find the terms carrying your SEO weight.
- Treating high density as bad. High density is only a problem if it sounds unnatural or exceeds 3.5% for multi-word phrases. A niche term at 4% in a technical article is fine. "Best free AI tools" at 5% in a listicle is spam.
- Forgetting to filter stopwords. If you leave stopwords on, the top 1-grams are "the," "and," "of," "to," "a." You learn nothing. Toggle them off unless you're debugging a specific filler word problem.
- Skipping the export. If you're tracking density over time or comparing multiple pages, copy-pasting tables by hand is slow and error-prone. Export as CSV and load it into a spreadsheet where you can sort, filter, and diff at scale.
Advanced tips
- Run this tool on your top-ranking page and your newest draft side by side. Export both as CSV and compare n-gram density line by line. Match the densities within 0.5 percentage points to replicate what's already working.
- Use the 3-gram table to find accidental keyword variations. If "project management software," "project management tool," and "project management platform" all appear with similar density, you're splitting focus. Pick one primary phrase and consolidate.
- Sort the 2-gram table by TF-IDF and look at the top ten. Those terms are your semantic core. If none of them match your target keyword, the page is off-topic.
- Combine this tool with our lsi keyword generator. Generate a list of related keywords, paste your draft into the analyzer, and check whether your LSI terms show up in the 2-gram and 3-gram tables. If not, you're missing topical coverage.
- Run the analyzer on your meta description and title separately. A meta title at 8% density for your primary keyword looks spammy in the SERP even if your body content is clean. Keep title and description under 5% density each.
- Use the 1-gram table to catch filler overuse. If "really," "very," or "actually" appears in the top 20 single words, you're leaning on weak modifiers. Cut them and the writing tightens automatically.
Once you've analyzed density and fixed over-optimization, the next step is readability and structure. Run the cleaned draft through our reading level checker to confirm it's accessible, then use the word counter to check average sentence length and identify overused words the analyzer might have missed. If you're working on a page that targets a specific keyword and you want faster checking, our keyword density checker lets you input target terms up front and skips the exploratory tables. When you're building content from scratch and need related keywords to track, the lsi keyword generator produces a list of semantic cousins you can paste into the analyzer to verify coverage.