Citizendium Forums
February 09, 2010, 05:39:56 PM *
Topic: List of Wikipedia's "bad pages" categories - for purposes of filtering
Jesse Weinstein
Forum Participant
**
Posts: 42


« on: November 17, 2006, 04:48:18 PM »

Wikipedia, while hosting vast piles of steaming c**p, and while currently too unbalanced between new content coming in and people reviewing it, has made a lot of effort to at least mark much of the less-than-good content that it still hosts.  Below, I list many of the Categories (present on CZ, too) that classify this mess.

My proposal is that, at least for many of these categories, we make a single decision on whether to keep or delete everything in the category, then implement this decision in the refresh software, so it simply won't refresh pages marked with those categories.  That will help us a lot in filtering out much of the bad stuff currently in the dumps.  If people are interested, we can copy this list to the pilot wiki and discuss it there.  I'll make regular edits to this post to reflect any changes.

*[[Category:Wikipedia_cleanup_categories]] - a good overview of these categories.

*[[Category:Possible copyright violations]] - this should probably be *delete in full* - it's not worth our time to go through this.
*[[Category:Suspected hoax articles]] - also, probably delete; not worth our time
*[[Category:Articles which may be unencyclopedic]] - if a Wikipedian thinks it's unencyclopedic, I doubt we want it; in any case it's a pretty small category (64 when I checked just now)
*[[Category:Biographies without real biographical information]] - small, but probably deletable
*[[Category:Uncategorised films]] - depending on our film standards, we could probably toss most of this category.

*[[Category:Articles which may contain original research]] - we'll have to look at this one.  It's big, so if we can get rid of all of them, that may be good...
*[[Category:Wikipedia articles incorporating text from the 1911 Encyclopædia Britannica]] - the ones from 1911; sadly, we do need to look through these, because many have been fixed, but many have not.
*[[Category:Wikipedia articles needing factual verification]] - this may be hard for us to deal with without the necessary experts...

There are lots more, but here's a sampling... Let's get decisions made on these, and get the things removed.
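
As a rough illustration of the proposal, here is a minimal Python sketch of a per-page check the refresh software could run. It is only a sketch under stated assumptions: the policy table, the function names, and the three outcomes ("delete", "review", "refresh") are all hypothetical, and the decisions simply mirror the tentative suggestions in the list above rather than any agreed policy. It also assumes the category appears as a literal link in the page's wikitext; categories applied through templates would not be caught this way, so a real filter would probably need to consult the category data in the dump instead.

```python
import re

# Hypothetical policy table: category name -> decision.
# The names come from the list above; the decisions are the tentative
# suggestions from this post, not settled policy.
CATEGORY_POLICY = {
    "Possible copyright violations": "delete",
    "Suspected hoax articles": "delete",
    "Articles which may be unencyclopedic": "delete",
    "Biographies without real biographical information": "delete",
    "Articles which may contain original research": "review",
    "Wikipedia articles needing factual verification": "review",
}

# Matches [[Category:Name]] and [[Category:Name|sort key]].
CATEGORY_LINK = re.compile(
    r"\[\[\s*Category\s*:\s*([^\]|]+?)\s*(?:\|[^\]]*)?\]\]",
    re.IGNORECASE,
)


def normalize(name):
    """Treat underscores as spaces and collapse runs of whitespace."""
    return " ".join(name.replace("_", " ").split())


def categories_of(wikitext):
    """Return the set of category names linked literally in the wikitext."""
    return {normalize(name) for name in CATEGORY_LINK.findall(wikitext)}


def decide(wikitext, policy=CATEGORY_POLICY):
    """Return 'delete', 'review', or 'refresh' for one page's wikitext.

    'delete' wins over 'review'; pages in no listed category are refreshed.
    """
    decisions = {policy[c] for c in categories_of(wikitext) if c in policy}
    if "delete" in decisions:
        return "delete"
    if "review" in decisions:
        return "review"
    return "refresh"
```

For example, decide("Some text [[Category:Suspected hoax articles]]") gives "delete", while a page tagged only with [[Category:Physics]] gives "refresh". Keeping the decisions in one table means a change of mind about a category is a one-line edit.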
Logged

Zachary Pruckowski
Technical Liaison/Executive Committee
Administrator
Forum Regular
****
Posts: 933


« Reply #1 on: November 17, 2006, 10:56:10 PM »

Thanks Jesse, that's a major help to us.

As far as the 1911 (and most other PD-sourced) articles go, I'd almost want to aim to redo a lot of them straight from the source, unless the WP improvements are very good.
Logged

niais
New Arrival
*
Posts: 3


« Reply #2 on: November 19, 2006, 11:21:44 AM »

Can I make a suggestion? I think we should put the above list out on the CZ-l list, maybe not for a vote, but for comments. I think it's about time that we at least start trimming the things which are really just filler. Otherwise, when we go "public" we'll have some very small number of CZ-improved articles lost in the sea of WP copy. That, at least if I pretend to be an outsider, just seems silly. If the comments are supportive, then I think we should incorporate it into policy and go ahead. Of course, it isn't obvious how things get incorporated into policy, so maybe we should just give it a try: comments on CZ-l, then a posting on the notice board about what the consensus seems to be. What do you guys think?

Sarah Tuttle
Logged
Oliver Hauss
New Arrival
*
Posts: 37


« Reply #3 on: November 19, 2006, 11:46:45 AM »

I'd warn against being too quick to delete suspected hoaxes. A lot of them probably are hoaxes, but I've seen plenty of articles on German Wikipedia submitted to AfD as suspected fakes solely because the submitter had no idea of the article's topic. Sometimes it was a drug with one of those ridiculous-sounding names; sometimes it was something like theoretical physics, which to the layperson can sound like Star Trek technobabble posing as something real.
Logged

Zachary Pruckowski
Technical Liaison/Executive Committee
Administrator
Forum Regular
****
Posts: 933


« Reply #4 on: November 19, 2006, 01:56:39 PM »

Quote from: Oliver Hauss on November 19, 2006, 11:46:45 AM
I'd warn against being too quick to delete suspected hoaxes. A lot of them probably are hoaxes, but I've seen plenty of articles on German Wikipedia submitted to AfD as suspected fakes solely because the submitter had no idea of the article's topic. Sometimes it was a drug with one of those ridiculous-sounding names; sometimes it was something like theoretical physics, which to the layperson can sound like Star Trek technobabble posing as something real.

This is why we have experts.  I'd hope a physics expert would be good at telling hand-waving from actual science.  Either they will have heard of it, or they can make sense of it after casting an augury (a Google search).
Logged

Oliver Hauss
New Arrival
*
Posts: 37


« Reply #5 on: November 19, 2006, 02:04:02 PM »

Quote from: Zachary Pruckowski on November 19, 2006, 01:56:39 PM
This is why we have experts.  I'd hope a physics expert would be good at telling hand-waving from actual science.  Either they will have heard of it, or they can make sense of it after casting an augury (a Google search).

Yes, but Jesse suggested they are not worth our time and should be dealt with summarily. In fact, I've been arguing on de:Wikipedia for quite some time that things like AfDs should be handed over to people with at least some interest in the topic, such as WikiProjects.
Logged

David Goodman
Forum Communicator
***
Posts: 247


« Reply #6 on: February 10, 2007, 04:01:06 PM »

Anyone can put anything in a Wikipedia category. I could put [[Category:Articles which may be unencyclopedic]] on about half of Wikipedia, including many articles we might want to use as a basis for something real. Some copyright tags are added by a bot, and I would not base anything on a decision by a WP bot. The 1911 EB, the old Catholic Encyc., etc. have generally been partially modified, and are, as you say, non-trivial.

If we decided not to cover some subjects, however, then we could certainly go by categories.
Logged
