Open Dataset · SARI v0.2
SARI Publisher Audit Dataset (Wave 1)
Per-site agent-readability scores for 103 publisher websites audited against the Scaletific Agent-Readability Index. Sortable in-page, downloadable as CSV, citable under CC-BY-4.0. Sourced from the same audit pipeline that produced the companion deep dive and visual findings.
What's in the dataset
One row per publisher site, 25 columns per row. Scores are integers (Discovery, AI Bot Policy) or one-decimal floats (the three averaged categories). Boolean signals are true / false. Per-bot directives are true when the directive is present in the parsed robots.txt; the policy_class derived field summarises the per-bot pattern.
Identity
site · apex domain
name · publisher name
cohort · editorial cohort label
tier · 1=top, 2=mid, 3=long-tail/niche
confidence · high / medium / low
Scores
score_total · 0–100
score_discovery · 0–25
score_article_structure · 0–30
score_identity_attribution · 0–20
score_content_addressability · 0–15
score_ai_bot_policy · 0–10
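The category maxima listed above add up to 100, and the leaderboard rows on this page appear consistent with score_total being the sum of the five category scores. The sketch below illustrates that arithmetic on the Pocket-lint row; the additive rule is an inference from the published numbers, not a documented formula, and the helper names are hypothetical.

```python
# Category maxima as listed in the Scores fields.
CATEGORY_MAX = {
    "score_discovery": 25,
    "score_article_structure": 30,
    "score_identity_attribution": 20,
    "score_content_addressability": 15,
    "score_ai_bot_policy": 10,
}


def total_from_categories(row: dict) -> float:
    """Sum the five category scores, rounded to one decimal place.

    Assumption: score_total is a plain sum of the categories.
    """
    return round(sum(row[name] for name in CATEGORY_MAX), 1)


def within_bounds(row: dict) -> bool:
    """Check that every category score lies inside its listed range."""
    return all(0 <= row[name] <= cap for name, cap in CATEGORY_MAX.items())


# Category scores copied from the Pocket-lint leaderboard row.
pocket_lint = {
    "score_discovery": 20,
    "score_article_structure": 30.0,
    "score_identity_attribution": 15.0,
    "score_content_addressability": 11.7,
    "score_ai_bot_policy": 3,
}

print(sum(CATEGORY_MAX.values()))          # 100
print(total_from_categories(pocket_lint))  # 79.7
print(within_bounds(pocket_lint))          # True
```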
Discovery
has_llms_txt · boolean
has_sitemap · boolean
has_mcp_well_known · boolean
ai_bot_directive_count · integer
Per-bot
blocks_GPTBot · boolean
allows_GPTBot · boolean
blocks_ClaudeBot · boolean
blocks_Google_Extended · boolean
blocks_PerplexityBot · boolean
addresses_OAI_SearchBot · boolean
allows_OAI_SearchBot · boolean
policy_class · derived enum
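As a rough illustration of the column types described above, the following sketch converts CSV text into typed Python values. It assumes the CSV header uses the field names listed here and that booleans are serialised as the strings true / false. The inline sample is a hypothetical subset of columns reconstructed from The Verge's leaderboard row, not an excerpt of the real file. Treating score_total as a float and tier as an integer is also an assumption.

```python
import csv
import io

# Integer-valued columns (Discovery and AI Bot Policy scores, plus counts).
INT_COLUMNS = {
    "tier",
    "score_discovery",
    "score_ai_bot_policy",
    "ai_bot_directive_count",
}
# One-decimal float columns (the three averaged categories and the total).
FLOAT_COLUMNS = {
    "score_total",
    "score_article_structure",
    "score_identity_attribution",
    "score_content_addressability",
}
# Boolean signals share these name prefixes in the field list.
BOOL_PREFIXES = ("has_", "blocks_", "allows_", "addresses_")


def parse_bool(text: str) -> bool:
    """Convert the strings 'true' / 'false' to a bool, rejecting anything else."""
    value = text.strip().lower()
    if value not in ("true", "false"):
        raise ValueError(f"expected true/false, got {text!r}")
    return value == "true"


def parse_row(raw: dict) -> dict:
    """Return a copy of one CSV row with numeric and boolean columns typed."""
    parsed = {}
    for column, text in raw.items():
        if column in INT_COLUMNS:
            parsed[column] = int(text)
        elif column in FLOAT_COLUMNS:
            parsed[column] = float(text)
        elif column.startswith(BOOL_PREFIXES):
            parsed[column] = parse_bool(text)
        else:
            parsed[column] = text  # site, name, cohort, confidence, policy_class
    return parsed


# Hypothetical sample: a subset of columns for one leaderboard row.
SAMPLE_CSV = (
    "site,name,cohort,confidence,score_total,score_discovery,"
    "score_article_structure,score_identity_attribution,"
    "score_content_addressability,score_ai_bot_policy,"
    "has_llms_txt,has_mcp_well_known\n"
    "theverge.com,The Verge,tech,high,75.0,10,30.0,15.0,10.0,10,false,false\n"
)

rows = [parse_row(raw) for raw in csv.DictReader(io.StringIO(SAMPLE_CSV))]
verge = rows[0]
print(verge["score_total"], verge["score_discovery"], verge["has_llms_txt"])
```

To read the downloaded file instead, the `io.StringIO(SAMPLE_CSV)` argument can be swapped for an open file handle.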
Full leaderboard (103 rows)
Sorted by total SARI score, descending. Each row has a stable anchor link (e.g. #wired-com) so individual records can be referenced in citations or shared in research. Rows tagged low confidence were excluded from cohort medians because fewer than three articles could be sampled.
Column key: D = Discovery, A = Article structure, I = Identity and attribution, C = Content addressability, P = AI bot policy; llms = has_llms_txt, mcp = has_mcp_well_known.

| # | Site | Cohort | Total | D | A | I | C | P | llms | mcp | AI policy | Conf. |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | Polygon · polygon.com | culture-entertainment | 81.0 | 20 | 30.0 | 15.0 | 10.0 | 6 | ✓ | | Blanket | high |
| 2 | Pocket-lint · pocket-lint.com | reviews-service | 79.7 | 20 | 30.0 | 15.0 | 11.7 | 3 | ✓ | | Partial | high |
| 3 | Seeking Alpha · seekingalpha.com | business-finance | 75.0 | 15 | 29.0 | 15.0 | 10.0 | 6 | ✓ | | Blanket | high |
| 4 | The Verge · theverge.com | tech | 75.0 | 10 | 30.0 | 15.0 | 10.0 | 10 | | | Differentiated | high |
| 5 | Eater · eater.com | vertical-food | 74.0 | 10 | 29.0 | 15.0 | 10.0 | 10 | | | Differentiated | high |
| 6 | Bloomberg · bloomberg.com | top-tier-news | 73.0 | 15 | 27.0 | 10.0 | 15.0 | 6 | ✓ | | Blanket | high |
| 7 | Vox · vox.com | top-tier-news | 73.0 | 10 | 28.0 | 15.0 | 10.0 | 10 | | | Differentiated | high |
| 8 | Marketing Brew · marketingbrew.com | vertical-marketing | 71.0 | 20 | 30.0 | 5.0 | 10.0 | 6 | ✓ | | Blanket | high |
| 9 | Morning Brew · morningbrew.com | newsletter-hybrid | 71.0 | 20 | 30.0 | 5.0 | 10.0 | 6 | ✓ | | Blanket | high |
| 10 | ZDNet · zdnet.com | tech | 71.0 | 10 | 30.0 | 15.0 | 10.0 | 6 | | | Blanket | high |
| 11 | CNET · cnet.com | tech | 70.0 | 10 | 29.0 | 15.0 | 10.0 | 6 | | | Blanket | high |
| 12 | Vulture · vulture.com | culture-entertainment | 70.0 | 10 | 30.0 | 10.0 | 10.0 | 10 | | | Differentiated | high |
| 13 | CNBC · cnbc.com | business-finance | 69.0 | 20 | 23.0 | 10.0 | 10.0 | 6 | ✓ | | Blanket | high |
| 14 | Travel + Leisure · travelandleisure.com | vertical-travel | 69.0 | 10 | 28.0 | 15.0 | 10.0 | 6 | | | Blanket | high |
| 15 | Variety · variety.com | culture-entertainment | 69.0 | 10 | 28.0 | 10.0 | 15.0 | 6 | | | Blanket | high |
| 16 | NPR · npr.org | top-tier-news | 68.5 | 20 | 27.5 | 5.0 | 10.0 | 6 | ✓ | | Blanket | medium |
| 17 | Rest of World · restofworld.org | indie-longform | 68.3 | 10 | 24.0 | 15.0 | 13.3 | 6 | | | Blanket | high |
| 18 | The Register · theregister.com | tech | 68.3 | 10 | 28.3 | 10.0 | 10.0 | 10 | | | Differentiated | high |
| 19 | 404 Media · 404media.co | indie-longform | 67.0 | 10 | 29.0 | 15.0 | 10.0 | 3 | | | Partial | high |
| 20 | Lonely Planet · lonelyplanet.com | vertical-travel | 67.0 | 10 | 29.0 | 15.0 | 10.0 | 3 | | | Partial | high |
| 21 | Scientific American · scientificamerican.com | vertical-science | 67.0 | 10 | 26.0 | 15.0 | 10.0 | 6 | | | Blanket | high |
| 22 | The Guardian · theguardian.com | top-tier-news | 66.0 | 10 | 30.0 | 10.0 | 10.0 | 6 | | | Blanket | high |
| 23 | VentureBeat · venturebeat.com | tech | 64.0 | 10 | 29.0 | 5.0 | 10.0 | 10 | | | Differentiated | high |
| 24 | Adweek · adweek.com | industry-trade | 63.0 | 10 | 30.0 | 10.0 | 10.0 | 3 | | | Partial | high |
| 25 | Associated Press · apnews.com | top-tier-news | 62.7 | 10 | 30.0 | 6.7 | 10.0 | 6 | | | Blanket | high |
| 26 | Tom's Guide · tomsguide.com | reviews-service | 62.3 | 5 | 29.0 | 15.0 | 13.3 | 0 | | | Silent | high |
| 27 | Inc. · inc.com | business-finance | 62.0 | 10 | 24.0 | 15.0 | 10.0 | 3 | | | Partial | high |
| 28 | Modern Healthcare · modernhealthcare.com | industry-trade | 62.0 | 10 | 26.0 | 5.0 | 15.0 | 6 | | | Blanket | high |
| 29 | The New York Times · nytimes.com | top-tier-news | 62.0 | 10 | 26.0 | 10.0 | 10.0 | 6 | | | Blanket | medium |
| 30 | Semafor · semafor.com | top-tier-news | 62.0 | 10 | 24.0 | 15.0 | 10.0 | 3 | | | Partial | high |
| 31 | The Athletic · theathletic.com | vertical-sports | 62.0 | 10 | 26.0 | 10.0 | 10.0 | 6 | | | Blanket | medium |
| 32 | Bon Appétit · bonappetit.com | vertical-food | 61.0 | 10 | 30.0 | 5.0 | 10.0 | 6 | | | Blanket | high |
| 33 | Condé Nast Traveler · cntraveler.com | vertical-travel | 61.0 | 10 | 30.0 | 5.0 | 10.0 | 6 | | | Blanket | high |
| 34 | MedPage Today · medpagetoday.com | vertical-health | 61.0 | 10 | 30.0 | 5.0 | 10.0 | 6 | | | Blanket | high |
| 35 | The New Yorker · newyorker.com | top-tier-news | 61.0 | 10 | 30.0 | 5.0 | 10.0 | 6 | | | Blanket | high |
| 36 | Pitchfork · pitchfork.com | culture-entertainment | 61.0 | 10 | 30.0 | 5.0 | 10.0 | 6 | | | Blanket | high |
| 37 | MIT Technology Review · technologyreview.com | tech | 61.0 | 10 | 30.0 | 5.0 | 10.0 | 6 | | | Blanket | high |
| 38 | Wired · wired.com | tech | 61.0 | 10 | 30.0 | 5.0 | 10.0 | 6 | | | Blanket | high |
| 39 | Financial Times · ft.com | top-tier-news | 60.7 | 10 | 24.7 | 10.0 | 10.0 | 6 | | | Blanket | high |
| 40 | The Washington Post · washingtonpost.com | top-tier-news | 60.5 | 10 | 24.5 | 10.0 | 10.0 | 6 | | | Blanket | medium |
| 41 | Fast Company · fastcompany.com | business-finance | 60.3 | 10 | 22.3 | 15.0 | 10.0 | 3 | | | Partial | high |
| 42 | DEV Community · dev.to | tech | 60.0 | 10 | 30.0 | 11.7 | 8.3 | 0 | ✓ | | Silent | high |
| 43 | Engadget · engadget.com | tech | 60.0 | 5 | 30.0 | 15.0 | 10.0 | 0 | | | Silent | high |
| 44 | The Hollywood Reporter · hollywoodreporter.com | culture-entertainment | 60.0 | 10 | 24.0 | 5.0 | 15.0 | 6 | | | Blanket | high |
| 45 | FiveThirtyEight · fivethirtyeight.com | indie-longform | 58.0 | 10 | 30.0 | 5.0 | 10.0 | 3 | | | Partial | high |
| 46 | Business Insider · businessinsider.com | business-finance | 57.3 | 10 | 26.0 | 5.0 | 13.3 | 3 | | | Partial | high |
| 47 | Politico · politico.com | top-tier-news | 57.0 | 10 | 26.0 | 5.0 | 10.0 | 6 | | | Blanket | high |
| 48 | Tom's Hardware · tomshardware.com | reviews-service | 57.0 | 5 | 27.0 | 15.0 | 10.0 | 0 | | | Silent | high |
| 49 | Fortune · fortune.com | business-finance | 56.0 | 10 | 23.0 | 10.0 | 10.0 | 3 | | | Partial | high |
| 50 | InfoQ · infoq.com | tech | 55.0 | 5 | 30.0 | 10.0 | 10.0 | 0 | | | Silent | high |
| 51 | Moz Blog · moz.com | vertical-marketing | 54.0 | 10 | 26.0 | 5.0 | 10.0 | 3 | | | Partial | high |
| 52 | Wirecutter · wirecutter.com | reviews-service | 54.0 | 5 | 29.0 | 10.0 | 10.0 | 0 | | | Silent | high |
| 53 | BBC News · bbc.com | top-tier-news | 52.0 | 10 | 21.0 | 5.0 | 10.0 | 6 | | | Blanket | high |
| 54 | The Information · theinformation.com | tech | 51.9 | 15 | 17.3 | 8.3 | 8.3 | 3 | ✓ | | Partial | high |
| 55 | Forbes · forbes.com | business-finance | 50.7 | 10 | 19.0 | 5.0 | 6.7 | 10 | | | Differentiated | high |
| 56 | CNN · cnn.com | top-tier-news | 50.6 | 10 | 16.3 | 8.3 | 10.0 | 6 | | | Blanket | high |
| 57 | STAT News · statnews.com | vertical-health | 47.3 | 10 | 19.3 | 5.0 | 10.0 | 3 | | | Partial | high |
| 58 | Reuters · reuters.com | top-tier-news | 46.6 | 10 | 17.3 | 3.3 | 10.0 | 6 | | | Blanket | high |
| 59 | MarketWatch · marketwatch.com | business-finance | 46.0 | 5 | 26.0 | 10.0 | 5.0 | 0 | | | Silent | high |
| 60 | Puck News · puck.news | top-tier-news | 41.0 | 5 | 21.0 | 5.0 | 10.0 | 0 | | | Silent | high |
| 61 | Search Engine Journal · searchenginejournal.com | vertical-marketing | 41.0 | 5 | 21.0 | 5.0 | 10.0 | 0 | | | Silent | high |
| 62 | National Geographic · nationalgeographic.com | vertical-science | 40.0 | 5 | 15.0 | 5.0 | 15.0 | 0 | | | Silent | high |
| 63 | Trusted Reviews · trustedreviews.com | reviews-service | 40.0 | 10 | 10.7 | 3.3 | 10.0 | 6 | | | Blanket | high |
| 64 | Epicurious · epicurious.com | vertical-food | 39.3 | 10 | 10.0 | 3.3 | 10.0 | 6 | | | Blanket | high |
| 65 | MarTech · martech.org | vertical-marketing | 37.7 | 5 | 17.7 | 5.0 | 10.0 | 0 | | | Silent | high |
| 66 | Serious Eats · seriouseats.com | vertical-food | 36.0 | 10 | 0.0 | 5.0 | 15.0 | 6 | | | Blanket | medium |
| 67 | 9to5Mac · 9to5mac.com | tech | 35.0 | 5 | 15.0 | 5.0 | 10.0 | 0 | | | Silent | high |
| 68 | Nature · nature.com | vertical-science | 32.7 | 10 | 0.0 | 5.0 | 11.7 | 6 | | | Blanket | high |
| 69 | Ars Technica · arstechnica.com | tech | 31.0 | 10 | 0.0 | 5.0 | 10.0 | 6 | | | Blanket | high |
| 70 | The Economist · economist.com | top-tier-news | 31.0 | 10 | 0.0 | 5.0 | 10.0 | 6 | | | Blanket | high |
| 71 | The Conversation · theconversation.com | indie-longform | 31.0 | 10 | 0.0 | 5.0 | 10.0 | 6 | | | Blanket | high |
| 72 | Kottke · kottke.org | indie-longform | 30.0 | 10 | 0.0 | 5.0 | 5.0 | 10 | | | Differentiated | high |
| 73 | The Atlantic · theatlantic.com | top-tier-news | 30.0 | 10 | 0.0 | 0.0 | 10.0 | 10 | | | Blanket | medium |
| 74 | Medium (platform) · medium.com | platform | 29.7 | 10 | 0.0 | 5.0 | 11.7 | 3 | | | Partial | high |
| 75 | Aeon · aeon.co | indie-longform | 28.0 | 10 | 0.0 | 5.0 | 10.0 | 3 | | | Partial | high |
| 76 | Consumer Reports · consumerreports.org | reviews-service | 28.0 | 10 | 0.0 | 5.0 | 10.0 | 3 | | | Partial | high |
| 77 | Healthline · healthline.com | vertical-health | 28.0 | 15 | 0.0 | 0.0 | 10.0 | 3 | ✓ | | Partial | high |
| 78 | TechCrunch · techcrunch.com | tech | 28.0 | 10 | 0.0 | 1.7 | 13.3 | 3 | | | Partial | high |
| 79 | PCMag · pcmag.com | reviews-service | 27.7 | 10 | 0.0 | 1.7 | 10.0 | 6 | | | Blanket | high |
| 80 | Digital Trends · digitaltrends.com | reviews-service | 26.0 | 10 | 0.0 | 0.0 | 10.0 | 6 | | | Blanket | high |
| 81 | Axios · axios.com | top-tier-news | 23.0 | 10 | 0.0 | 0.0 | 10.0 | 3 | | | Partial | high |
| 82 | Food52 · food52.com | vertical-food | 20.0 | 5 | 0.0 | 5.0 | 10.0 | 0 | | | Silent | high |
| 83 | Ghost (platform) · ghost.org | platform | 20.0 | 5 | 0.0 | 5.0 | 10.0 | 0 | | | Silent | high |
| 84 | Mayo Clinic · mayoclinic.org | vertical-health | 20.0 | 5 | 0.0 | 5.0 | 10.0 | 0 | | | Silent | high |
| 85 | The Pudding · pudding.cool | indie-longform | 20.0 | 5 | 0.0 | 5.0 | 10.0 | 0 | | | Silent | high |
| 86 | Quanta Magazine · quantamagazine.org | vertical-science | 20.0 | 5 | 0.0 | 5.0 | 10.0 | 0 | | | Silent | high |
| 87 | The Wall Street Journal · wsj.com | top-tier-news | 20.0 | 5 | 0.0 | 5.0 | 10.0 | 0 | | | Silent | high |
| 88 | The Hustle · thehustle.co | newsletter-hybrid | 18.7 | 5 | 7.0 | 0.0 | 6.7 | 0 | | | Silent | high |
| 89 | Longreads · longreads.com | indie-longform | 18.0 | 10 | 0.0 | 0.0 | 5.0 | 3 | | | Partial | high |
| 90 | Substack (platform) · substack.com | platform | 16.7 | 5 | 0.0 | 5.0 | 6.7 | 0 | | | Silent | high |
| 91 | Quartz · qz.com | business-finance | 15.0 | 5 | 0.0 | 5.0 | 5.0 | 0 | | | Silent | high |
| 92 | Rtings · rtings.com | reviews-service | 15.0 | 5 | 0.0 | 5.0 | 5.0 | 0 | | | Silent | high |
| 93 | Smithsonian Magazine · smithsonianmag.com | vertical-science | 15.0 | 5 | 0.0 | 5.0 | 5.0 | 0 | | | Silent | high |
| 94 | ESPN · espn.com | vertical-sports | 13.0 | 10 | 0.0 | 0.0 | 0.0 | 3 | | | Partial | low |
| 95 | Gizmodo · gizmodo.com | tech | 13.0 | 10 | 0.0 | 0.0 | 0.0 | 3 | | | Partial | low |
| 96 | Daring Fireball · daringfireball.net | indie-longform | 8.0 | 5 | 0.0 | 0.0 | 0.0 | 3 | | | Partial | low |
| 97 | Harvard Business Review · hbr.org | business-finance | 8.0 | 5 | 0.0 | 0.0 | 0.0 | 3 | | | Partial | low |
| 98 | WebMD · webmd.com | vertical-health | 8.0 | 5 | 0.0 | 0.0 | 0.0 | 3 | | | Partial | low |
| 99 | Defector · defector.com | culture-entertainment | 5.0 | 5 | 0.0 | 0.0 | 0.0 | 0 | | | Silent | low |
| 100 | Notebookcheck · notebookcheck.net | reviews-service | 5.0 | 5 | 0.0 | 0.0 | 0.0 | 0 | | | Silent | low |
| 101 | Search Engine Land · searchengineland.com | vertical-marketing | 5.0 | 5 | 0.0 | 0.0 | 0.0 | 0 | | | Silent | low |
| 102 | The Ringer · theringer.com | vertical-sports | 5.0 | 5 | 0.0 | 0.0 | 0.0 | 0 | | | Silent | low |
| 103 | Time · time.com | top-tier-news | 5.0 | 5 | 0.0 | 0.0 | 0.0 | 0 | | | Silent | low |
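The exclusion of low-confidence rows from cohort medians, mentioned above the leaderboard, can be sketched as follows. This is a minimal illustration: it assumes the medians are taken over score_total, which this page does not state, and `cohort_medians` is a hypothetical helper rather than part of the audit pipeline. The sample rows are the vertical-sports and vertical-food entries from the table.

```python
from collections import defaultdict
from statistics import median


def cohort_medians(rows, excluded_confidence=("low",)):
    """Median score_total per cohort, skipping rows with an excluded confidence."""
    by_cohort = defaultdict(list)
    for row in rows:
        if row["confidence"] in excluded_confidence:
            continue  # low-confidence rows do not contribute to the median
        by_cohort[row["cohort"]].append(row["score_total"])
    return {cohort: median(scores) for cohort, scores in by_cohort.items()}


# Totals and confidence labels copied from the leaderboard.
rows = [
    {"site": "theathletic.com", "cohort": "vertical-sports", "score_total": 62.0, "confidence": "medium"},
    {"site": "espn.com", "cohort": "vertical-sports", "score_total": 13.0, "confidence": "low"},
    {"site": "theringer.com", "cohort": "vertical-sports", "score_total": 5.0, "confidence": "low"},
    {"site": "eater.com", "cohort": "vertical-food", "score_total": 74.0, "confidence": "high"},
    {"site": "bonappetit.com", "cohort": "vertical-food", "score_total": 61.0, "confidence": "high"},
    {"site": "epicurious.com", "cohort": "vertical-food", "score_total": 39.3, "confidence": "high"},
    {"site": "seriouseats.com", "cohort": "vertical-food", "score_total": 36.0, "confidence": "medium"},
    {"site": "food52.com", "cohort": "vertical-food", "score_total": 20.0, "confidence": "high"},
]

print(cohort_medians(rows))
# {'vertical-sports': 62.0, 'vertical-food': 39.3}

# Including the low-confidence rows pulls the sports median down sharply.
print(cohort_medians(rows, excluded_confidence=())["vertical-sports"])
# 13.0
```

The vertical-sports cohort shows why the filter matters: two of its three sites are tagged low confidence, so the median moves from 13.0 to 62.0 once they are dropped.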
Reuse and citation
Released under CC-BY-4.0. Reuse with attribution. Suggested citation:
Nouriel, M. (2026). SARI Publisher Audit Dataset (Wave 1) [Data set]. Automation Switch. https://automationswitch.com/research/agent-legibility-audit/dataset
For the methodology, the audit script, and the editorial framing, see the companion deep dive. For visual summaries, see the infographic. For other waves and future research, see the research hub.