Project Gutenberg: BS evaluation as part of Media, News & Publishing

AI-powered evaluation using the Model Context Optimization BS Detection Framework, based solely on publicly available website content.

← Back to Media, News & Publishing BS Detector

BS Level

Media, News & Publishing

33.7 Avg BS

Based on 796 businesses audited.

✓ Less BS than average

Project Gutenberg has 26.7 points less BS than the average for Media, News & Publishing.

BS Detector

Media, News & Publishing BS: Project Gutenberg (gutenberg.org)

https://gutenberg.org 📍 Industry: Media, News & Publishing

7 BS / 100

Expert Verdict

Project Gutenberg is a rare example of a 0% bullshit entity. It functions as a pure utility, delivering exactly what it claims with no marketing overhead or semantic drift.

Info Density Power-words vs. Substance ratio.

7% BS

Semantic Coherence Homepage promise vs. Sub-page reality.

0% BS

Trust & Proof Verifiable evidence vs. Trust Theatre.

0% BS

Commodity Fingerprint Detection of industry clichés/templates.

0% BS

Identity & Authority Expert verifiability & Schema depth.

33% BS

BS Reduction Prescription

Implement Organization and Person schema to technically link the foundation and its late founder to established archive registries. Add a formal Editorial Standards or Ethics policy page to satisfy the proof expectations for the publishing industry. Link the social media masthead directly to verified foundation accounts to increase technical proof_links_count.

Pillar: Information Density Analysis Diagnosis: Substance vs. Fluff Ratio

Info Density Power-words vs. Substance ratio.

2 Impact Weight: 30 / 100

7% BS

The website exhibits an extremely high substance ratio, lead by an H1 that provides a specific, quantifiable claim: Project Gutenberg is a library of over 75,000 free eBooks. Fluff-saturated headings are non-existent; instead, headings like Audio Books and Top 100 EBooks last 7 days lead directly to data. Body text is composed of specific nouns (Moby Dick, Jane Austen) and technical specifics (EPUB, Kindle, public domain). Concept repetition is minimal, restricted to necessary restatements of its free status.

AI does not consolidate duplicates — it embeds whatever it crawls. Generate your URL & Canonical Hygiene Audit to quantify the identity conflicts that break your semantic cohesion.

Pillar: Semantic Coherence Analysis Diagnosis: Semantic Drift Check

Semantic Coherence Homepage promise vs. Sub-page reality.

0 Impact Weight: 20 / 100

0% BS

There is absolute alignment between the homepage signal and the sub-page substance. The H1 promise of free eBooks is meticulously fulfilled on the Top 100 page, which displays real-time download statistics (e.g., 37,880 for Moby Dick), and the Categories page, which provides a granular taxonomy of the library. No contradictions exist between the core identity of a volunteer-based archive and the functional pages for donating or help.

Identify the current state and friction diagnosis of your specific business model. Generate your Executive SEO Strategy to quantify the financial or conversion cost of strategic misalignment.

Pillar: Trust & Proof Analysis

Trust & Proof Verifiable evidence vs. Trust Theatre.

0 Impact Weight: 20 / 100

0% BS

Diagnosis: Trust Theatre

The site avoids all forms of trust theatre, opting for radical transparency instead. The review_count is 0 because the site relies on the inherent validity of the public domain and its 50-year history. The Donate page provides forensic-level detail on funding, including a specific fee breakdown for PayPal (1.99% plus 49 cents), which validates the claim of being a minimalist, staff-thin operation.

Evidence: Proof Density

Proof density is exceptionally high, with thousands of specific entities mentioned across the four pages. The site provides specific counts for the Open Audiobook Collection (5,000 titles) and names the exact volunteer communities (LibriVox, Distributed Proofreaders) that maintain the service. The ratio of verifiable names and numbers to vague marketing assertions is approximately 50:1.

To review a full competitive diagnostic applied to an enterprise level technical SEO agency, including a direct comparison against Dejan, examine the complete executive audit. View the iPullRank Executive SEO Strategy Dashboard for a practical example of how perception gaps, value prop drift, and audience misalignment are surfaced in real audits.

Pillar: Commodity Fingerprint Analysis Diagnosis: Industry Cliché & Template Detection

Commodity Fingerprint Detection of industry clichés/templates.

0 Impact Weight: 15 / 100

0% BS

The value proposition is entirely unique and cannot be replicated by competitors. The site identifies itself as pioneering free eBooks since 1971, a claim supported by its massive, hand-curated archives. There are zero matches for industry clichés like journalism reimagined or award-winning newsroom; the language is purely functional and utilitarian, making it immune to commodity marketing patterns.

Pillar: Identity & Authority Analysis

Identity & Authority Expert verifiability & Schema depth.

5 Impact Weight: 15 / 100

33% BS

Diagnosis: Authority Gaps

The only measurable gap is technical: schema_json is null across all crawled pages. While the site references authoritative figures like founder Michael Hart and late CEO Dr. Greg Newby, it lacks the structured data (Person schema) to technically codify this authority. However, the site’s historical footprint and partnerships with MIT and Microsoft provide high offline authority that isn’t yet reflected in its metadata.

Evidence: Performance vs. Claims

There is no disconnect between claims and reality; the site claims to offer 75,000 books and proceeds to list them by popularity, category, and recent release. Download counts (e.g., 71,344,118 in the last 30 days) are presented as raw statistics rather than marketing assertions. The presence of technical specifics regarding computer-generated audiobooks via a collaboration with MIT further reinforces credibility.

Industry Match & Score Summary

Media, News & Publishing BS: Project Gutenberg (gutenberg.org)

https://gutenberg.org

BS: 7/ 100

Industry Classification

Project Gutenberg is categorized under Media, News & Publishing, but it operates as a non-profit digital library and archive. The content confirms this by providing access to over 75,000 eBooks rather than contemporary news reporting, though it adheres to high standards of source verification and source metadata.

Before embeddings, before entities, before retrieval — the crawler must reach the text. Open the Crawlability & Indexation Guide to learn how access failures erase meaning long before interpretation begins.

“The score of 7 is almost entirely driven by technical metadata gaps (missing schema) and minor repetition of the core value proposition. The site lacks the fluff, cliches, and drift that typically drive high BS scores.”

To understand and learn thinking like AI, visit our educational environment (Project Gutenberg example) that uses the same data this audit was generated from, and try it yourself.

Get a Strategic Holistic View

Media, News & Publishing BS: Project Gutenberg (gutenberg.org)

Media, News & Publishing BS: Project Gutenberg (gutenberg.org)

Free. No Signup Required.

Business Intelligence Engine

Machine-Readability Framework

BS Identity and Score for Project Gutenberg

Media, News & Publishing BS: Project Gutenberg (gutenberg.org)

Media, News & Publishing BS: Project Gutenberg (gutenberg.org)

Free. No Signup Required.

Business Intelligence Engine

Machine-Readability Framework