AI-powered evaluation using the Model Context Optimization BS Detection Framework, based solely on publicly available website content.
Based on 796 businesses audited.
Project Gutenberg has 26.7 points less BS than the average for Media, News & Publishing.
Media, News & Publishing BS: Project Gutenberg (gutenberg.org)
Project Gutenberg is a rare example of a 0% bullshit entity. It functions as a pure utility, delivering exactly what it claims with no marketing overhead or semantic drift.
Implement Organization and Person schema to technically link the foundation and its late founder to established archive registries. Add a formal Editorial Standards or Ethics policy page to satisfy the proof expectations for the publishing industry. Link the social media masthead directly to verified foundation accounts to increase technical proof_links_count.
The website exhibits an extremely high substance ratio, lead by an H1 that provides a specific, quantifiable claim: Project Gutenberg is a library of over 75,000 free eBooks. Fluff-saturated headings are non-existent; instead, headings like Audio Books and Top 100 EBooks last 7 days lead directly to data. Body text is composed of specific nouns (Moby Dick, Jane Austen) and technical specifics (EPUB, Kindle, public domain). Concept repetition is minimal, restricted to necessary restatements of its free status.
AI does not consolidate duplicates — it embeds whatever it crawls. Generate your URL & Canonical Hygiene Audit to quantify the identity conflicts that break your semantic cohesion.
There is absolute alignment between the homepage signal and the sub-page substance. The H1 promise of free eBooks is meticulously fulfilled on the Top 100 page, which displays real-time download statistics (e.g., 37,880 for Moby Dick), and the Categories page, which provides a granular taxonomy of the library. No contradictions exist between the core identity of a volunteer-based archive and the functional pages for donating or help.
Identify the current state and friction diagnosis of your specific business model. Generate your Executive SEO Strategy to quantify the financial or conversion cost of strategic misalignment.
The site avoids all forms of trust theatre, opting for radical transparency instead. The review_count is 0 because the site relies on the inherent validity of the public domain and its 50-year history. The Donate page provides forensic-level detail on funding, including a specific fee breakdown for PayPal (1.99% plus 49 cents), which validates the claim of being a minimalist, staff-thin operation.
Proof density is exceptionally high, with thousands of specific entities mentioned across the four pages. The site provides specific counts for the Open Audiobook Collection (5,000 titles) and names the exact volunteer communities (LibriVox, Distributed Proofreaders) that maintain the service. The ratio of verifiable names and numbers to vague marketing assertions is approximately 50:1.
To review a full competitive diagnostic applied to an enterprise level technical SEO agency, including a direct comparison against Dejan, examine the complete executive audit. View the iPullRank Executive SEO Strategy Dashboard for a practical example of how perception gaps, value prop drift, and audience misalignment are surfaced in real audits.
The value proposition is entirely unique and cannot be replicated by competitors. The site identifies itself as pioneering free eBooks since 1971, a claim supported by its massive, hand-curated archives. There are zero matches for industry clichés like journalism reimagined or award-winning newsroom; the language is purely functional and utilitarian, making it immune to commodity marketing patterns.
The only measurable gap is technical: schema_json is null across all crawled pages. While the site references authoritative figures like founder Michael Hart and late CEO Dr. Greg Newby, it lacks the structured data (Person schema) to technically codify this authority. However, the site’s historical footprint and partnerships with MIT and Microsoft provide high offline authority that isn’t yet reflected in its metadata.
There is no disconnect between claims and reality; the site claims to offer 75,000 books and proceeds to list them by popularity, category, and recent release. Download counts (e.g., 71,344,118 in the last 30 days) are presented as raw statistics rather than marketing assertions. The presence of technical specifics regarding computer-generated audiobooks via a collaboration with MIT further reinforces credibility.
Media, News & Publishing BS: Project Gutenberg (gutenberg.org)
Project Gutenberg is categorized under Media, News & Publishing, but it operates as a non-profit digital library and archive. The content confirms this by providing access to over 75,000 eBooks rather than contemporary news reporting, though it adheres to high standards of source verification and source metadata.
Before embeddings, before entities, before retrieval — the crawler must reach the text. Open the Crawlability & Indexation Guide to learn how access failures erase meaning long before interpretation begins.
“The score of 7 is almost entirely driven by technical metadata gaps (missing schema) and minor repetition of the core value proposition. The site lacks the fluff, cliches, and drift that typically drive high BS scores.”
