Wikipedia News Desk
Wikipedia News Desk

Tag: AI evaluation

26 Feb
Benchmarking LLMs With Wikipedia Tasks: Retrieval and Summarization
Leona Whitcombe

Benchmarking LLMs With Wikipedia Tasks: Retrieval and Summarization

Wikipedia tasks are becoming the gold standard for evaluating LLMs. Testing retrieval and summarization on real encyclopedia articles reveals how well AI models handle messy, real-world knowledge-not just clean test data.

View More 0

recent posts

Safety Journalism: How to Cover Wikipedia Harassment and Moderation
Safety Journalism: How to Cover Wikipedia Harassment and Moderation

Published on: 12 May

Foundation Research Highlights: How New Studies Shape Wikipedia Strategy
Foundation Research Highlights: How New Studies Shape Wikipedia Strategy

Published on: 9 May

Case Study: How Japanese Wikipedia’s Community Norms Shape Coverage
Case Study: How Japanese Wikipedia’s Community Norms Shape Coverage

Published on: 27 May

How Wikinews Uses Wikidata and Commons for Multimedia Reporting
How Wikinews Uses Wikidata and Commons for Multimedia Reporting

Published on: 27 May

1Lib1Ref Campaign Updates and Upcoming Wikipedia Events: A Guide for 2026
1Lib1Ref Campaign Updates and Upcoming Wikipedia Events: A Guide for 2026

Published on: 5 May

categories

  • Online Encyclopedias
  • Journalism
  • Technology

archives

  • May 2026
  • April 2026
  • March 2026
  • February 2026
  • January 2026
  • December 2025
  • November 2025
  • October 2025
  • September 2025
  • August 2025

tags

  • Wikimedia Foundation
  • Wikipedia editing
  • reliable sources
  • Wikipedia
  • Wikipedia governance
  • Wikipedia community
  • Wikipedia reliability
  • Wikipedia policy
  • The Signpost
  • open knowledge
  • Wikipedia policies
  • multilingual Wikipedia
  • Wikipedia moderation
  • Wikipedia editors
  • Wikipedia sources
  • edit wars
  • neutral point of view
  • community governance
  • Wikipedia guidelines
  • fact-checking
Wikipedia News Desk

latest posts

How Reader Engagement Works on The Signpost: Surveys, Comments, and Feedback Loops
How Reader Engagement Works on The Signpost: Surveys, Comments, and Feedback Loops

Published ON: 22 Jan

Using Wikipedia as a Starting Point for Academic Research
Using Wikipedia as a Starting Point for Academic Research

Published ON: 15 Dec

© 2026. All rights reserved.