Running a technical SEO audit without a checklist is like debugging without logs — you'll miss things. Here's the checklist we use internally at CrawlX, updated for 2026.
1. Crawlability & Indexing
The foundation of technical SEO. If search engines can't crawl and index your pages, nothing else matters.
- robots.txt — Verify it's not blocking important pages or resources. Check for accidental disallow rules that crept in during development.
- XML Sitemap — Present, valid, and submitted to Search Console. Should only include indexable, canonical URLs. No 404s, no redirects, no noindexed pages.
- Canonical Tags — Every page has a self-referencing canonical unless it's intentionally canonicalized to another URL. Watch for conflicting signals (canonical says X, but sitemap includes Y).
- Meta Robots — No accidental noindex tags on important pages. Common culprit: staging environment meta tags that survive deployment.
- Crawl Budget — For large sites (50K+ pages), monitor crawl stats in Search Console. Reduce waste by blocking faceted navigation, internal search results, and parameter variations.
2. Site Architecture
How your pages are organized and linked determines how search engines understand your content hierarchy.
- URL Structure — Clean, descriptive, hierarchical. Avoid parameter-heavy URLs when possible.
- Internal Linking — Every important page should be reachable within 3 clicks from the homepage. Use descriptive anchor text.
- Breadcrumbs — Implement with BreadcrumbList structured data. Helps both users and search engines understand page hierarchy.
- Pagination — Use rel="next"/"prev" or implement infinite scroll with crawlable fallback URLs.
3. Core Web Vitals
Google's page experience signals remain a ranking factor. Focus on the three metrics:
- LCP (Largest Contentful Paint) — Target under 2.5 seconds. Optimize images, preload critical resources, use a CDN.
- INP (Interaction to Next Paint) — Target under 200ms. Minimize main thread blocking, defer non-critical JavaScript.
- CLS (Cumulative Layout Shift) — Target under 0.1. Set explicit dimensions on images/ads, avoid injecting content above the fold after load.
4. Structured Data
Structured data helps search engines understand your content and can earn rich results in SERPs.
- Relevant Schema Types — Match schema to page type: Article, Product, FAQ, HowTo, LocalBusiness, etc.
- Validation — Test all structured data with Google's Rich Results Test. Fix errors and warnings.
- Completeness — Include all recommended properties, not just required ones. More complete markup has a better chance of earning rich results.
5. HTTPS & Security
- Full HTTPS — No mixed content warnings. All resources (images, scripts, fonts) loaded over HTTPS.
- HSTS Header — Implement HTTP Strict Transport Security to prevent downgrade attacks.
- Security Headers — CSP, X-Frame-Options, X-Content-Type-Options as appropriate.
6. International SEO (If Applicable)
- Hreflang Tags — Correctly implemented with reciprocal tags. Every language/region variant references all others.
- Language Detection — Don't redirect users based on IP. Let search engines crawl all language versions.
7. Mobile
- Mobile-First Indexing — Ensure mobile version has the same content, structured data, and meta tags as desktop.
- Viewport Configuration — Proper viewport meta tag. No horizontal scroll on mobile.
- Tap Targets — Interactive elements at least 48px with adequate spacing.
Using CrawlX for Your Audit
CrawlX checks all of the above automatically. Run a crawl, and you'll get a prioritized list of issues with AI-generated fix suggestions for each one. Start with the free tier — 100 pages is enough to validate your most important templates.