Wikipedia:Bots/Requests for approval/NrhpBot
- The following discussion is an archived debate. Please do not modify it. Subsequent comments should be made in a new section. The result of the discussion was Approved.
Operator: Paultyng
Automatic or Manually Assisted: Automatic article creation, manual review of potential existing articles.
Programming Language(s): C# (hosted on Google)
Function Summary: This bot is used for NRHP data mining from the NPS databases for article stubbing and cleaning. Initial use will be bulk stubbing.
Edit period(s) (e.g. Continuous, daily, one time run): Continuous: initial stubbing, then minor edits as the database changes over time
Edit rate requested: 1 per minute is probably fine, can be scaled back after stubbing.
Already has a bot flag (Y/N):
Function Details: This bot will initially be used to bulk-stub articles from the National Register of Historic Places database. After stub creation, it will be used to keep Wikipedia in sync with the database as places are listed, delisted, etc.
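The keep-in-sync step described above can be pictured as a diff between two snapshots of the register. What follows is a minimal sketch, not the bot's actual code (which the request says is written in C#). The function name `diff_snapshots`, the record layout, and the sample reference numbers and names are all hypothetical.

```python
def diff_snapshots(previous, current):
    """Compare two snapshots keyed by (hypothetical) NRHP reference number.

    Returns (listed, delisted): reference numbers newly present in
    `current`, and those present in `previous` but missing from `current`.
    """
    prev_ids = set(previous)
    curr_ids = set(current)
    listed = sorted(curr_ids - prev_ids)
    delisted = sorted(prev_ids - curr_ids)
    return listed, delisted


# Made-up sample data, for illustration only.
last_week = {"70000001": "Example House", "70000002": "Example Mill"}
this_week = {"70000002": "Example Mill", "70000003": "Example Bridge"}

listed, delisted = diff_snapshots(last_week, this_week)
print("newly listed:", listed)
print("delisted:", delisted)
```

Newly listed entries would be candidates for stubbing, while delisted ones would presumably need a minor edit to the existing article rather than deletion.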
Discussion
Looks pretty good to me. Will it update immediately following the change in database, or the next time you run it? ~ Wikihermit 01:40, 5 July 2007 (UTC)
- Looks good to me. Can you please go into more detail about the NPS database and where it will retrieve the data? Thanks! E talk bots 01:44, 5 July 2007 (UTC)
- The database can be found here. Their main site, though, is here. I think they put out a weekly list of changes, so the updates could run maybe once a week or once a month. The biggest run, though, would be the initial stubbing. That would be the most controversial, so I'm curious what your thoughts on it are. It's a pretty big list; here is one county in Ohio, for example: List of Registered Historic Places in Hamilton County, Ohio. pw 01:51, 5 July 2007 (UTC)
- It won't be running off the live database; I downloaded it and had to massage it to clean up a lot of the names and other potential issues. It may be possible to run updates off the live database, though; I could maybe talk to them about that. Right now I'm also having it query for existing articles so I don't stub anything that already exists, and googling to see if the NRHP number is already in any articles on WP. pw 01:53, 5 July 2007 (UTC)
- This is what I was envisioning as a stub template: User:NrhpBot/StubTemplate. It's a work in progress, but it has the basics. I still need to put some HTML comments in there regarding the automated creation and so forth. pw 01:54, 5 July 2007 (UTC)
- Any thoughts from the BAG? Here is a sample page it output I was using for testing: User:Paultyng/Sandbox pw 14:23, 6 July 2007 (UTC)
- Approved for trial. Please provide a link to the relevant contributions and/or diffs when the trial is complete. Go ahead and tag about 50 pages. --ST47Talk 13:34, 8 July 2007 (UTC)
- I ran a few debugs and then ran 10 (overall it was close to maybe 30 edits). On the contributions page, everything past 12:07, July 10, 2007 is the run of 10. It took a while, so I think if possible I will need to bump up the edit rate. 16:13, 10 July 2007 (UTC)
- Ran about 10 more, so almost up to 50 edits, which should be enough to evaluate. I made some template adjustments based on feedback from others as well. Let me know what you would like me to do. pw 12:28, 11 July 2007 (UTC)
- I think it's probably going to need 4-5 edits a minute, if you guys are OK with that, to be able to finish a county or state in a reasonable amount of time while I can watch it. pw 12:05, 12 July 2007 (UTC)
- Edit rate up to 10 per minute approved, if needed. --ST47Talk 12:11, 12 July 2007 (UTC)
- Approved. --ST47Talk 12:11, 12 July 2007 (UTC)
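The pre-stub check mentioned in the discussion (skip anything that already has an article, or whose NRHP number already appears in an article, and leave those for manual review) might look roughly like the sketch below. It is an illustration under stated assumptions, not the operator's implementation: the function name `split_for_stubbing` and the record fields are hypothetical, and the lookups the bot performs against Wikipedia and a web search are replaced here by two in-memory sets.

```python
def split_for_stubbing(records, existing_titles, known_refnums):
    """Split records into (to_create, needs_review).

    A record goes to manual review if its title matches an existing
    article (case-insensitively) or its reference number is already
    known to appear in some article. Everything else can be stubbed.
    """
    titles = {t.casefold() for t in existing_titles}
    to_create, needs_review = [], []
    for rec in records:
        if rec["title"].casefold() in titles or rec["refnum"] in known_refnums:
            needs_review.append(rec)
        else:
            to_create.append(rec)
    return to_create, needs_review


# Made-up sample data, for illustration only.
records = [
    {"refnum": "70000001", "title": "Example House"},
    {"refnum": "70000002", "title": "Example Mill"},
    {"refnum": "70000003", "title": "Example Bridge"},
]
to_create, needs_review = split_for_stubbing(
    records,
    existing_titles={"example house"},
    known_refnums={"70000003"},
)
print("stub automatically:", [r["title"] for r in to_create])
print("review by hand:", [r["title"] for r in needs_review])
```

Routing matches to a review list rather than dropping them mirrors the request's "manual review of potential existing articles".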
- The above discussion is preserved as an archive of the debate. Please do not modify it. Subsequent comments should be made in a new section.