Wikipedia talk:AutoWikiBrowser/Dev/Archive 3
This is an archive of past discussions about Wikipedia:AutoWikiBrowser. Do not edit the contents of this page. If you wish to start a new discussion or revive an old one, please do so on the current talk page. |
Archive 1 | Archive 2 | Archive 3 | Archive 4 |
Sourceforge problem
Anybody knows what's wrong with Sourceforge? I often can't preview sources like here and the diff shows nothing like here. Jogers (talk) 14:57, 14 October 2007 (UTC)
- The binary diff, ive seen it a few time before. But never been bothered by it. It just seems ot be random files now and again.
- For the other, its viewVC or python errror. Its not just you Jogers :). /me thinks SF should update them Reedy Boy 15:09, 14 October 2007 (UTC)
- Reported - [1] Reedy Boy 15:23, 14 October 2007 (UTC)
- Its been replied to and closed.. See the link above. I suggest we normalise the text in the files... Reedy Boy 17:50, 25 October 2007 (UTC)
- I've converted Tools.cs to UTF-8 and it can be correctly displayed now but we still can't preview previous revisions and the diffs. I wonder if there is any way to update them? Jogers (talk) 12:31, 26 October 2007 (UTC)
- Get ViewVC to update the software to fix it.. And then get SF to install the update...? Reedy Boy 18:40, 27 October 2007 (UTC)
- Never mind, I can always check the diffs in TortoiseSVN. Jogers (talk) 12:42, 31 October 2007 (UTC)
Feature Requests
What a backlog!!
Reedy Boy 14:01, 20 October 2007 (UTC)
- Probably, some of them should be declined as too vague, needlessly specialized or not in line with AWB's ideology. MaxSem 09:54, 21 October 2007 (UTC)
Some dev should be constantly monitoring the Feature requests page as it looks really dead.--Kozuch (talk) 21:28, 14 April 2008 (UTC)
Legal fun
Eek! I've found a good C# diff library, http://razor.occams.info/code/diff/ . Then the copy(left|right|up|down|<censored>) fun begins:
- I't based upon Perl's Algorithm::Diff, and therefore is licensed under the Artistic License, but not clear which version.
- The version matters because according to Stallman, first version, unlike 2.0, is incompatible with GPL.
- Algorithm::Diff itself says 'you can redistribute it and/or modify it under the same terms as Perl itself'. Does it mean that it's v. 2.0 and we can use the C# diff?
- Finally, how can we implement that license compatibility per 4(c)(ii)? MaxSem(Han shot first!) 15:17, 14 November 2007 (UTC)
- Is there anyone alive? I'll just add it a a binary (which is obviously allowed) and start working on rendering. MaxSem(Han shot first!) 11:08, 19 November 2007 (UTC)
- Yeah, i am... Just wanted to get time to go over the licensing and see if you had missed anything, or anything that may help us —Reedy Boy 12:33, 19 November 2007 (UTC)
- Is there anyone alive? I'll just add it a a binary (which is obviously allowed) and start working on rendering. MaxSem(Han shot first!) 11:08, 19 November 2007 (UTC)
- Looking good Max. Also, seems quicker...? —Reedy Boy 19:13, 24 November 2007 (UTC)
Typo statistics
I have an idea, but it needs much polishing before we start implementing it. So: AWB should log all typos to some kind of log. Two things should be logged: typos found and false positives (when a regex matches, but proposed correction matches the word found). Several issues:
- If we log it to a file, how should we prevent it from growing forever? Spamming users with requests to submit their data to us once their logs reach 50(?)kb? Automatically uploading them (bad idea)?
- Where should they be submitted? (E.g. to subpage of RETF on every wiki?) What about smaller projects that use typos from en? (Are there such projects?)
- How should be process the logs?
- Is it really worth it?
MaxSem(Han shot first!) 21:14, 20 November 2007 (UTC)
- It'd be a bit difficult, i suppose to do it. If it was done to a subpage, AWB would have to parse it.. and we'll get edit conflicts and alike.
- The simplest way, would be some form of submit to a Database somewhere, and have a webscript to make some statistics... But what would they submit? Original word, replaced word? As the regex's will change..
- The "Is it really worth it", is the thing. Yes, its doable, and would be a major new feature. But what actual gain is going to come from it...? —Reedy Boy 21:32, 20 November 2007 (UTC)
- It's needed for us to know which regexes are problematic and either should be fixed not to cause false positives or removed if it's too infrequent. MaxSem(Han shot first!) 21:48, 20 November 2007 (UTC)
Next Release...
4.2 i suppose... New stuff, changes, and the changed Diff library..
—Reedy Boy 15:46, 25 November 2007 (UTC)
- Let's create a checklist. MaxSem(Han shot first!) 13:42, 4 December 2007 (UTC)
- New functionality
- Enhanced diffs
- Seems pretty good. Need to try actually doing some normal AWB work and check it out better. —Reedy Boy 15:49, 4 December 2007 (UTC)
- Some speed-related improvements, but nothing drastical
- This i think, has just become a typical ongoing improvement for AWB —Reedy Boy 15:49, 4 December 2007 (UTC)
- Startup has been seriously sped up, do I've marked it with a
bugbig check sign. MaxSem(Han shot first!) 14:42, 15 December 2007 (UTC)- Yup - Hugely faster! =) —Reedy Boy 21:56, 16 December 2007 (UTC)
- Startup has been seriously sped up, do I've marked it with a
- This i think, has just become a typical ongoing improvement for AWB —Reedy Boy 15:49, 4 December 2007 (UTC)
- I've added some code to load site matrix, but it isn't used anywhere so far. Possible uses include site selection dialog and more sophisticated metadata sorter (no speed gain for WP but it's quite possible for other WMF projects with less languages available). MaxSem(Han shot first!)
- Would be a good idea, then languages dont actually have to be added manually. I know we dont get many requests (well, not recently at least)... Maybe loading the Namespaces via the API/whatever for the wiki upon use would also be better than them all hard coded...? —Reedy Boy 15:49, 4 December 2007 (UTC)
- Enhanced diffs
- Bug fixes
- As always, lots of them
- Yup, list is generally empty.. —Reedy Boy 15:49, 4 December 2007 (UTC)
- Emboldening code isn't fixed so far
- We've gotta do something aboug {{reflist}} - related complaints
- If there are cases it should stay (ie certain pages), and no actual consensus to replace it.. Shouldnt we really disable thiks feature? —Reedy Boy 15:49, 4 December 2007 (UTC)
- rev 1875 - Disabled by MaxSem - We proceeding with it disabled then? —Reedy Boy 14:04, 7 December 2007 (UTC)
- I see no other way, too many complaints. MaxSem(Han shot first!) 14:58, 7 December 2007 (UTC)
- As always, lots of them
- Testing - have anyone made a good run with latest version?
- No, going to try now.. —Reedy Boy 15:49, 4 December 2007 (UTC)
- I think now it's tested enough. MaxSem(Han shot first!) 15:39, 21 December 2007 (UTC)
- Just tried it myself. Ran through my watchlist and did some typo fixing. Just that stupidly long inital processing time in my windows version that is a problem for me! —Reedy Boy 15:49, 21 December 2007 (UTC)
- I think now it's tested enough. MaxSem(Han shot first!) 15:39, 21 December 2007 (UTC)
- No, going to try now.. —Reedy Boy 15:49, 4 December 2007 (UTC)
- Just to add to the list - Vista/64bit stupid typo first processing time. Pretty sure i wont be the only person come release who has this problem...—Reedy Boy 16:59, 5 December 2007 (UTC)
- Vista sucks. MaxSem(Han shot first!) 20:45, 5 December 2007 (UTC)
- Haha —Reedy Boy 23:04, 5 December 2007 (UTC)
- Fixed as per bugs :) —Reedy Boy 22:03, 29 December 2007 (UTC)
- Haha —Reedy Boy 23:04, 5 December 2007 (UTC)
- Vista sucks. MaxSem(Han shot first!) 20:45, 5 December 2007 (UTC)
Pre Christmas
Are we going to try and get the next version (4.2?) out before christmas? ie This weekend?
—Reedy Boy 16:04, 21 December 2007 (UTC)
Help for a script
moved to WT:AWB
AWBPackager
Now included in the Extras folder
Simple program, run it, follow the basic prompts, and it collects all the files from the various release build folders, and puts them into a temporary directory in the running directory, then zips it up based on the name given
Useful for releases, but also so we can quickly and easily build SVN versions and package them for users to try!
—Reedy Boy 12:12, 31 December 2007 (UTC)
AWB SVN Snapshots
Avaliable here
For users who are experienced with AWB, and want to use a more up to date version, but are unable to download from SVN themselves and build
May try and make the AWBPackager FTP upload to there...
Not very pretty, but it serves its purpose - Simple webpage to list all the FTP files in the folder :)
—Reedy Boy 12:15, 31 December 2007 (UTC)
GPL 3
Why is a copy of the GPL version 3 shipping with AWB? I don't recall any dicussions about this or giving my permission for my work to be relicenced (which I presume is needed although I stand corrected if not :) ) --kingboyk (talk) 20:25, 1 January 2008 (UTC)
Diffs
I'm finding the default diff in the current AWB (green) to be nowhere near as clear or obvious as Mediawiki (red). Would it be possible/desirable to make the AWB default diffs (CSS?) look exactly like Wikipedia's? --kingboyk (talk) 00:52, 2 January 2008 (UTC)
One other thing. I don't like the new buttons - why "Start processing" and "Stop all tasks"? "Stop" and "Start" are so much clearer... --kingboyk (talk) 01:03, 2 January 2008 (UTC)
Translations of AWB
Will you plan integration of translations into different languages ? For example, by moving character strings in parameters files. Thus, we could contribute to translation into different languages. You can contact me (email) for French translation. --Shawn86 (talk) 09:15, 3 January 2008 (UTC)
- It could be done. Im not sure the best way to do it really, as there would have to be a lot of strings loaded and applied. Of course, as we can be used on multiple wiki's, we are open to suggestions as to how best to do this, and therefore implement it if we see it feasible! —Reedy Boy 09:36, 3 January 2008 (UTC)
- Resource files isn't it? (Caution: I know nothing abt this topic). --kingboyk (talk) 14:54, 4 January 2008 (UTC)
Wiktionary & mandatory updates
Can the "mandatory updates" thing be restricted to use on en.wikipedia please? That was a fairly rude update for me.
For Wiktionary, I'm having trouble declaring myself as a bot. wikt:WT:AWB. Am I setting this up in the wrong place? The 4.1.4.2 AWB version throws a fit, maxing out the CPU when I try to log in now.
Can some of the warnings be customized? For Wiktionary, entries that do not start with a language heading should be marked as an error (for one small example.) This version, being heavily Wikipedia oriented, makes the opposite recommendation/warning.
Thanks in advance, --Connel MacKenzie - wikt 02:11, 14 January 2008 (UTC)
That's not a typo...
There are a few isolated cases where a word that's flagged for AWB typo correction might actually be correct in certain circumstances. The example I came across recently was that a user did a major AWB change from "cliche" to "cliché" on articles pertaining to a Canadian politician whose name was Robert Cliche, with no accent because it's a French language surname, pronounced "Kleesh", which was not meant to convey that Mr. Cliche was particularly trite or clichéd. Is there any way to flag certain specific articles for an automatic exemption from such spelling "corrections" which would actually be incorrect? Bearcat (talk) 11:10, 15 January 2008 (UTC)
- The ruling with the typo list is, either, the page can be excluded completely (only really for certain pages), or, if they produce ANY false positives, they should be removed from the typo list. —Reedy Boy 11:57, 15 January 2008 (UTC)
- A possible workaround is to ignore the words in title of the page. — Dispenser 20:26, 9 March 2008 (UTC)
WebControl.Wait()
MaxSem identified an issue in Wikipedia_talk:AutoWikiBrowser/Bugs#Logged_in_user_at_bottom_right_still_not_updating, and User:Skenmy asked me if i could make it autoprotect for Wikinews.
Bit of fiddling, and the code would work.. But the Wait is releasing it too early.. And its going on to load the edit page, causing it not to fully protect
Blasted thing! Any suggestions? Must be another way round it...?
—Reedy Boy 23:04, 24 January 2008 (UTC)
AWB almost functional in Wine!
See last two comments on http://bugs.winehq.org/show_bug.cgi?id=8499 - a workaround in AWB may be possible, though the ideal solution would of course be patches to Wine ;-) In any case, it's now working well enough to spot bugs in! - David Gerard (talk) 13:56, 17 February 2008 (UTC)
I want added to the Project so i can use it my name is Milece and this is the only talk page i found ive clicked through it numerous times, so please add me and i apologize if this is the wrong page. —Preceding unsigned comment added by Milece (talk • contribs) 09:45, 6 March 2008 (UTC)
Could you guys...
add a list reduction mode? So user can create a big list, let AWB eliminate links that don't need attention. After that it would be possible to work only with list that do require user attention for sure, at convenient time. And second - can articles specific spellchecker exceptions be created? I mean let user mark some words at exception belonging to article. And let AWB upload them to some page. That page could be read at start of AWB, and later would let to avoid multiple human intervention and associated mistakes... TestPilot 21:59, 11 March 2008 (UTC)
- Feature requests are this way. MaxSem(Han shot first!) 06:11, 12 March 2008 (UTC)
Addition to Custom Project Drop-Down
Hey guys. Recently downloaded the BOT and found that there is no wiki option. I work on http://www.wiki.tribalwars.net currently there are issues with BOT Api and SVN but if there was a wiki option it would save alot of hassle in the future :) --Oscardog1991 (talk) —Preceding comment was added at 13:05, 12 March 2008 (UTC)
- Im not sure what your asking for... —Reedy Boy 13:11, 12 March 2008 (UTC)
- I don't think that your wiki is large enough to warrant its own variant in project settings. You will need to input your URL only once because AWB saves such settings. MaxSem(Han shot first!) 13:24, 12 March 2008 (UTC)
- Yeh but as you know it current doesn't work. Can you reply on the Discussion page of the AWB Bot so i can find out where to update my SVN and BOT Api or explain how to update it etc :) Thanks :) --Oscardog1991 (talk) —Preceding comment was added at 14:46, 12 March 2008 (UTC)
- Try the "custom" option, and then inputting www.wiki.tribalwars.net. --76.105.227.156 (talk) 04:07, 14 March 2008 (UTC)
AWB's home
AWB isn't really a Wikipedia EN project any more, although all the devs are of course active here. I've no objection to staying here, but might it not send out a good message if we were to move the AWB infrastructure to a "neutral" wiki, be it meta, mediawiki.org or our own wiki? --kingboyk (talk) 21:19, 9 May 2008 (UTC)
- Own wiki isn't as reliable as Wikimedia cluster, and can be easily slashdotted. AWB is offtopic on mw.org, and of all us devs only I have +sysop on Meta, and we often need to tweak a protected page with settings. Yes, "hosting" here is a bit biased, but is more convenient. MaxSem(Han shot first!) 16:53, 18 May 2008 (UTC)
Release next version please
A couple of weeks ago I suggested that it was appropriate to release a new version of AWB, particularly due to the typo list loading bug Wikipedia_talk:AutoWikiBrowser/Archive_18#Version_bump_after_typo_list_access_fix.3F. A user is having problems because of this bug - [2]. I really think we need a new (optional update?) release to roll out this fix. Opinions? Thanks Rjwilmsi (talk)
- As the user mentioned above, I concur - "forname" is a common misspelling, and being unable to use the new typo list is a pain (it currently corrects it to "oref") - I've already had at least one bad edit slip through because of it. — iridescent 22:08, 17 May 2008 (UTC)
- Agreed. If there will be no objections from other devs, I'll release it in a few hours. MaxSem(Han shot first!) 16:55, 18 May 2008 (UTC)
- Released, old version disabled. MaxSem(Han shot first!) 10:25, 19 May 2008 (UTC)
- Agreed. If there will be no objections from other devs, I'll release it in a few hours. MaxSem(Han shot first!) 16:55, 18 May 2008 (UTC)
Kingbotk plugin violates GPL
According to GPL, everything linking to GPL'd software must be licensed uder the GPL or compatible licenses. And it's currently not the case - there is no indication that it's GPL'ed, and there are no freely accessible sources. That needs to be fixed. MaxSem(Han shot first!) 20:28, 18 May 2008 (UTC)
- Will discuss with Steve when he's back from his holiday. —Reedy 11:12, 22 May 2008 (UTC)
- No, I don't think that's the case when all I'm doing is referencing the WikiFunctions.dll. What are you basing your interpretation on? (URL?) And, even if you are correct, what are "you" gonna do, sue me?! :)
- I'm not much interested in releasing my source code because it's demanded of me, but I have in the past wondered whether to put it into the AWB repository and close my private repository down. If that would be agreeable to other users of the AWB repository I'll think about it. No promises but we'll see. --kingboyk (talk) 11:46, 26 May 2008 (UTC)
- Links? Here they are: GPL#The GPL in court, http://www.gnu.org/licenses/why-not-lgpl.html. I didn't wanted to be annoying, but I thought we should follow the basics at least. Anyway, I released the latest version with your plugin right after starting this discussion even though I believed that it's teh copyvio. And releasing the source could make AWB development easier, because when I make a breaking change to AWB I'd like to check if everything works after that. MaxSem(Han shot first!) 12:51, 26 May 2008 (UTC)
- Makes sense. At least, you know if the plugin compiles that everything should be fine (AFAIK all the "hacky" access methods have now been removed...) —Reedy 13:27, 26 May 2008 (UTC)
- Wikipedia isn't a source. That doesn't prove anything. AFAIC, linking is not making a derivative work. My plugin contains zero code from AWB and AFAIC is entirely my work and property (except for the bits Sam wrote lol).
- However, independent of this discussion Sam and I have agreed to release the code under the GPL, which I've just done. --kingboyk (talk) 16:52, 29 May 2008 (UTC)
- Links? Here they are: GPL#The GPL in court, http://www.gnu.org/licenses/why-not-lgpl.html. I didn't wanted to be annoying, but I thought we should follow the basics at least. Anyway, I released the latest version with your plugin right after starting this discussion even though I believed that it's teh copyvio. And releasing the source could make AWB development easier, because when I make a breaking change to AWB I'd like to check if everything works after that. MaxSem(Han shot first!) 12:51, 26 May 2008 (UTC)
Max, Just FYI - We werent expecting a release so soon. I was aware of the problems that existed.
I was going to fix the problems caused, some minor refactoring regressions, but having to return the ram on my PC, i have been without my development (and more-a-less any sort of pc) pc
Should be back online tommorow (if it arrives back at my dads work)
Chers!
—Reedy 22:14, 19 May 2008 (UTC)
- I'll try and sort this soon now i've got my PC back. —Reedy 11:11, 22 May 2008 (UTC)
- rev 2464 - Reverted 2459 and added migration code to fix the regressions that could be caused by moving the registry keys. Migrated the Recent setting code in same way, and changed code to save/load to there. That probably needs migrating to the newer registry code, but works fine as it is. —Reedy 15:50, 22 May 2008 (UTC)
- Yes, I should have mentioned this in my commit summary. I was going away and ran out of time; Sam was fully aware of the situation and was going to finish the work for me (and now has - thanks Sam). --kingboyk (talk) 11:48, 26 May 2008 (UTC)
AWB Updater
Devs: apparently, we need to tweak the updater to delete everything executable from the AWB directory and its subdirectories. We update our plugins in their dirs, but those people who actually use them move them to the parent directory, where AWB catches them. And updater does nothing about such plugins. MaxSem(Han shot first!) 16:39, 19 May 2008 (UTC)
- Hmm.. We do need to overhaul the updater again i think... Wonder if we can have it load a file list (from the internet) on the fly, so we dont have to release new versions to add extra new files etc... Wonder if we can get it to download a CS file and compile it like the custom module does... —Reedy 22:00, 19 May 2008 (UTC)
Feature Request ?? Multiple Categories entry
Currently only one category can be supplied to AWB to make article list at a time ( I am _NOT_ taking about recursive categories here ). My need is to provide mutiple categories seperated by a "," or "| " in the input box or from a text file , AWB to take one by one categories , make a list of articles from all of them .The reason is for a BOT run, I may need to supply multiple categories ,it is tedious to wait for each category to make article list and then supply next category..Thoughts ?? -- TinuCherian (Wanna Talk?) - 14:40, 13 June 2008 (UTC)
- It does. Make list from Category. In the text box "Category1|Category2"... —Reedy 14:57, 13 June 2008 (UTC)
Module
Can somebody help me making a module which adds text before the text on an article? Like an InfoBox? Thank you in advance! --Firilăcroco mesage 16:27, 15 June 2008 (UTC)
- Wikipedia:AutoWikiBrowser/Custom_Modules#Mass_AfD_Tagging - If you change what its appending, it will work fine. You could just use the "prepend text" feature.. —Reedy 19:13, 15 June 2008 (UTC)
- If you have the data for the infobox in a CSV (comma-seperated-values) file, you can use this plug-in. Regards, Ganeshk (talk) 21:49, 15 June 2008 (UTC)
- The CVSLoader looks fine, but how can I set which is the CSV file?--Firilăcroco mesage 08:37, 16 June 2008 (UTC)
- You would use the "text file" option under "Make from" (on AWB) to load the CSV file. (Step 4 on the instructions). Regards, Ganeshk (talk) 12:27, 16 June 2008 (UTC)
- Thank you a lot! Now I understand what I've done wrong. --Firilăcroco mesage 16:50, 16 June 2008 (UTC)
- I have fixed a bug with saving preferences. Please download the newer version (1.0.0.2) of the plugin. Thanks, Ganeshk (talk) 10:41, 17 June 2008 (UTC)
- Thank you a lot! Now I understand what I've done wrong. --Firilăcroco mesage 16:50, 16 June 2008 (UTC)
- You would use the "text file" option under "Make from" (on AWB) to load the CSV file. (Step 4 on the instructions). Regards, Ganeshk (talk) 12:27, 16 June 2008 (UTC)
- The CVSLoader looks fine, but how can I set which is the CSV file?--Firilăcroco mesage 08:37, 16 June 2008 (UTC)
InterWikiLinks Regex
/// <summary>
/// Matches interwiki links
/// </summary>
public static readonly Regex InterWikiLinks = new Regex(@"\[\[(nds-nl|rmy|lij|bat-smg|map-bms|ksh|pdc|vls|nrm|frp|zh-yue|tet|xal|pap|tokipona|minnan|aa|af|ak|als|am|ang|ab|ar|an|arc|roa-rup|as|ast|gn|av|ay|az|bm|bn|zh-min-nan|ba|be|be-x-old|bh|bi|bo|bs|br|bg|ca|cv|ceb|cs|ch|cu|ny|sn|tum|cho|co|za|cy|da|de|dv|nv|dz|mh|et|el|en|es|eo|eu|ee|fa|fo|fr|fy|ff|fur|ga|gv|gd|gl|ki|gu|got|ko|ha|haw|hy|hi|ho|hr|io|ig|ilo|id|ia|ie|iu|ik|os|xh|zu|is|it|he|jv|kl|kn|kr|ka|ks|csb|kk|kw|rw|ky|rn|sw|kv|kg|ht|kj|ku|lo|lad|la|lv|lb|lt|li|ln|jbo|lg|lmo|hu|mk|mg|ml|mt|mi|mr|ms|mo|mn|mus|my|nah|na|nb|fj|nl|cr|ne|ja|nap|ce|pih|nb|no|nn|oc|or|om|ng|hz|ug|pa|pi|pam|ps|km|nds|pl|pms|pt|ty|ro|rm|qu|ru|war|se|sm|sa|sg|sc|sco|st|tn|sq|scn|si|simple|sd|ss|sk|sl|so|sr|sh|su|fi|sv|tl|ta|tt|te|th|vi|ti|tg|tpi|to|chr|chy|ve|tr|tk|tw|udm|bug|uk|ur|uz|vec|vo|fiu-vro|wa|wo|ts|ii|yi|yo|zh|zh-tw|zh-cn|wuu|mzn|new|lbe|eml|bxr|hsb|nov|pag|bar|bpy|diq|zea|roa-tara|cbk-zam|zh-classical|cu|ru-sib|glk|cdo):.*?\]\]", RegexOptions.Compiled);
Is it me, or shouldnt that be build dynamically probably...? Using the Wikipedia:AutoWikiBrowser/IW page? Just so it is up to date?
Use one of the strings, replace ", " with | and build into regex?
—Reedy 21:03, 30 June 2008 (UTC)
- rev 3020 - Was some ancient stuff in that regex —Reedy 21:23, 30 June 2008 (UTC)
- I optimised it in rev 3023, removing that regex completely. If some plugin writer needs it, they may build it themselves:
"(" + string.Join("|", SiteMatrix.Languages.ToArray()) + ")"
. MaxSem(Han shot first!) 16:51, 1 July 2008 (UTC)- Cool, looks better Max. And a bug fixed at the same time —Reedy 18:15, 1 July 2008 (UTC)
- I optimised it in rev 3023, removing that regex completely. If some plugin writer needs it, they may build it themselves:
IRCM
As per MaxSem's comment about using Huggle instead of IRCM, i think this is right.
TBH, i think we should stop shipping it with AWB... And then probably remove it from the SVN - can leave a link somewhere to the last revision with it if people need/want it
Whats anyone else think?
—Reedy 10:04, 28 July 2008 (UTC)
- I didn't read MaxSem's comment, but anyways, I think IRCM is a good part of AWB; Minor fixes (i.e to make it usable with all users, Without checking page) will make it more popular, if this is the problem.--OsamaK 12:23, 28 July 2008 (UTC)
New release
Long time passed since 4.3.2 was released, lots of bugs were fixed, so now it's time to release. I propose to stop working on new features in 24h from now, to concentrate on bugfixes and code polishing. A few more things to do or to decide:
- What will be its version number? 4.3.3 or 4.4?
- Few annoying problems with <br> and <p> remain, probably worth trimming all such code if we can't keep it problem-less.
- Will we include IRCM or consider it dead? (Even if thrown away, we'll have to add an empty exe file to avoid updater errors.
- Is TypoScan mature enough to include, or we should release it separately? I propose not to include it into main distribution until we make it cross-wiki or at least make the plugin and server side to degrade gracefully on other projects.
MaxSem(Han shot first!) 09:41, 5 August 2008 (UTC)
- Definately 4.4, its a major overhauled (improved) version! (or 4.5, to mark the major changeness of it)
- Hmm
- I propose this being the last version we include it, the updated updater will then be in action for the next release, and it will work fine. For the long term, its pretty much abandoned, other than keeping it compiling against the current versions of WF with breaking changes. I suppose, we should try and find out what the usage of it is... ie whether its worth continuing to develop it... Huggle is a "reasonable" replacement
- It would be good to include it, definately. Its on my todo list to make it more multiproject, but if we can put it so it gets wider distribution, that would be good, but like you say, something needs doing about making it degrade gracefully for other projects
—Reedy 15:17, 5 August 2008 (UTC)
Making a final run through bug reports, will support releasing after some testing. MaxSem(Han shot first!) 11:14, 7 September 2008 (UTC)
- Cool. Im back from London now, so from tommorow should be able to start doing some more AWB stuff. —Reedy 22:11, 7 September 2008 (UTC)
Variables.cs
Isnt all the language stuff now redundant?
Cause namespaces are loaded on the fly (tested against the german wp)
Deleting most of switch language code would knock off nearly 40kb off the size of variables.cs
—Reedy 19:28, 8 September 2008 (UTC)
Synchronized fixing projects
The feature request to enable easy communal of regular expression pattern has been duplicated a number of times. I’m posting this here to hopefully move towards implementing this. Active requests for this feature directly:
- WP:AWB/FR#Support for typo-fixing like projects - 3 January 2008
- WP:AWB/FR#Options list (implied in interface) - 26 February 2008
- WP:AWB/FR#External link cleanup (dupe) - 23 June 2008
- And few more duplicates in the archive
The following feature requests, which probably should be listed under the general fixes section, could be completed with such a feature:
- WP:AWB/FR#Auto Convert - 25 August 2008
- WP:AWB/FR#Math - 21 July 2008
- WP:AWB/FR#Convert hatnotes to templates - 25 May 2008
- WP:AWB/FR#Hyphen to en dash replacement - 7 May 2008 – disputed typo fixing
- WP:AWB/FR#Removing ordinal indicators from dates - 3 May 2008
- WP:AWB/FR#Fix accessdate tags - 30 April 2008
- WP:AWB/FR#Date links that break autoformatting - 8 December 2007
- WP:AWB/FR#Non breaking spaces - 8 October 2007
— Dispenser 06:45, 15 September 2008 (UTC)
- Its things like this having a way to change edit requirements to sysadmin, and having a user whitelist would be good. If anyone is allowed access, the chances of things going wrong are greatly increased... —Reedy 08:59, 15 September 2008 (UTC)
5000 GetList Limits
I know why, and agree about the implementation, just we've had quite a few comments/"complaints" regarding it...
Obviously, we want to prevent the servers being raped, but we also need to allow users to get what they want
I've seen it before, that we shouldnt really have to worry about the server load and such (thats more end users)
Think its worth speaking to brion/tim and such about their opinion on it?
—Reedy 14:52, 22 September 2008 (UTC)
Image to File
Wikipedia:Bot_owners'_noticeboard#Canonical_namespace_change_coming:_Image:_-.3E_File:
Suppose we're going to have to get AWB to deal with both, and get out a compulsory upgrade...
—Reedy 06:17, 7 October 2008 (UTC)
Weird Regex
Category = new Regex(@"\[\[" + Variables.NamespacesCaseInsensitive[14] + @"(.*?)\]\]|<[Gg]allery\b([^>]*?)>[\s\S]*?</ ?[Gg]allery>", RegexOptions.Compiled);
Why are we doing stuff with gallery in a category regex...??? Presumably that needs removing as it was copied from the image one?
—Reedy 09:55, 8 October 2008 (UTC)
- Removed in rev 3439, cant see why it would be there —Reedy 12:51, 9 October 2008 (UTC)
Unit Tests
Would it be worth switching to use the testing suite built into VS2008?
—Reedy 19:52, 11 November 2008 (UTC)
- Not everyone has a non-free VS, while VS Express doesn't seem to have buil-in support. We're a free software project, after all. MaxSem(Han shot first!) 19:56, 11 November 2008 (UTC)
Namespaces/Alias'
Max, Was wondering if it'd make sense to use a Dictionary<int, List<string>> for the whole thing? Rather then having to check "Dictionary<int, string> Namespaces" and "Dictionary<int, List<string>> NamespaceAliases"?
Also, shouldnt we cater for these Aliases in "Dictionary<int, string> NamespacesCaseInsensitive"?
Seems to make sense having that as "Dictionary<int, List<string>>" too, and then we can build that up by iterating through the Dictionary, and then into the List value?
Might have a go at setting this up tommorow....
—Reedy 00:22, 30 November 2008 (UTC)
- Changing the type of a well-known variable will break existing plugins/modules. NamespacesCaseInsensitive already takes aliases into account (though we don't support enwiki's custom aliases such as WP->Wikipedia yet). MaxSem(Han shot first!) 07:52, 30 November 2008 (UTC)
- Duh. God knows why i didnt think of that! —Reedy 09:55, 30 November 2008 (UTC)
Lowercase and Underscore
Needs dealing with in AWB
—Reedy 13:49, 30 June 2008 (UTC)
- rev 3020 (Code commented out for time being). [3] removes it from the checkpage, preventing users AWB copies from trying to load it —Reedy 21:25, 30 June 2008 (UTC)
If User is Admin || Bot
Pull API query results in larger numbers. For most avaliable queries, admins and/or bots are allowed to get larger numbers of results in one go
—Reedy 14:46, 30 June 2008 (UTC)
- Redundant now, using max for amount to pull —Reedy 16:04, 27 January 2009 (UTC)
Usage stats - unique users per site
I'm looking at [4] and wondering if it would be possible to have the number of unique users listed per site under the Sessions & saves per site section. Penguin (talk) 09:31, 6 August 2008 (UTC)
- Yeah, should be. Will have a look at doing so later on. —Reedy 18:33, 6 August 2008 (UTC)
- Well overdue, rev 4058/rev 4059. Probably needs the cols changing round... —Reedy 23:15, 10 March 2009 (UTC)
Migrate to API?
I spoke to VasilievVV, he said that soon Bot API will be enhanced with action=render
, and thus we will be able to do everything using the API, needing not to parse HTML at all. Sure, it will take several months to make sure that all these new API features work satisfiably, and bug 11604 needs to be fixed too. MaxSem(Han shot first!) 07:55, 14 November 2007 (UTC)
- Well, that will certainly help speed up AWB operations! And with the internal Diff's... It would only be like previews that would need actually doing via the servers... Interesting... —Reedy Boy 15:59, 14 November 2007 (UTC)
- If that means ditching IE, it may also bring AWB a step closer to being portable to mono? --kingboyk (talk) 22:08, 29 December 2007 (UTC)
- We'd need a control for the Diff browser... And the History.. And then only the edit one if they want to see the formatted page.. —Reedy Boy 23:21, 29 December 2007 (UTC)
- Sam have already seen it, but others may not: Wikipedia:AutoWikiBrowser/API migration, that's the masterplan. Please participate. MaxSem(Han shot first!) 15:01, 8 June 2008 (UTC)
AWB v5
- Wikipedia:AutoWikiBrowser/API migration
- Migrate to .NET FW 3/3.5 (? would possibly cause problems towards mono integration)
—Reedy 14:22, 9 January 2009 (UTC)
task list
- Switch to editing via the API
- Add all currently failing unit tests into a separate unit test category so that actual regressions due to future code changes can be identified more easily Done rev 3842
- Include some of the feature requests around changes to the tag a module (particularly combining article issues tags, sorting tags appropriately and adding dates to tags were needed)
rev 3846 splits up the Tag Adding/Removing in the code, doesn't do anything gui wise though —Reedy 14:25, 27 January 2009 (UTC)Reverted- rev 3848, add and remove now booleanised in the same function. Tags added if Auto Tag enabled, removed if general fixes enabled. Code still needs overhauling —Reedy 16:31, 27 January 2009 (UTC)
- Include the feature request to differentiate between trivial and visible general fixes Done rev 3857
- Expand the range of general fixes to include outstanding feature requests etc.
- (Now that it is possible to hide all the quotes) include general fixes to remove ordinals from dates (per feature request) and "of" between months and years Done rev 3874
recent change has broken my access to unit tests project
A recent change under rev 4057 has broken my ability to use the unittests project as VS 2008 express claims not to support 'solution folders'. My access is restored locally by reverting the change to the unit tests solution file. Rjwilmsi 21:36, 13 March 2009 (UTC)
2005/2008?
I'm a bit confused if 2005 or 2008 is required. I just grabbed the source code yesterday with the goal of starting a plug-in. I have VS 2005 Pro on my main machine, but can install 2008 if necessary. I was able to load the project, but am getting build errors on 'wikifunctions' - is this another issue, or is 2008 required? Cander0000 (talk) 03:36, 15 June 2009 (UTC)
- Just VS being noddy. If you close and restart VS those errors normally go away. Rjwilmsi 11:10, 15 June 2009 (UTC)
- 2010 beta is out.. Hopefully that'll fix some of the stupid errors!! —Reedy 11:41, 15 June 2009 (UTC)
- Yeah. Also, try loading some Wikifunctions stuff before attempting Main.cs - worked for me I think. (I had a similar error in 2008) - Jarry1250 (t, c) 11:46, 15 June 2009 (UTC)
- 2010 beta is out.. Hopefully that'll fix some of the stupid errors!! —Reedy 11:41, 15 June 2009 (UTC)
Create AWB in Portuguese
Hi!
I would like to participate in the development of the AWB with you but in order to develop a version in Portuguese. There is possibility to be able to help?
Thank you!
Miguel Couto (talk) 20:16, 21 August 2009 (UTC)
- First of all, thanks for the offer! We could really use some help with localisation. Second, our code is pretty unprepared for that. It would take a huge effort to make it localisable. Can you program in C#? Do you know how to use Subversion? 62.140.253.6 (talk) 19:35, 23 August 2009 (UTC)
- Yeah, are you offering to help make the code localisable? Or just provide a Portuguese translation? —Reedy 22:20, 23 August 2009 (UTC)
Also, could you elaborate the reasons for your current one-week block on pt:? I see that it's due to some kind of offensive rant in an AWB-related discussion, but couldn't figure out what exactly it was about. 213.33.172.214 (talk) 07:12, 24 August 2009 (UTC)
5.0 release checklist
- Fix all showstopper bugs
- Error handling: I think it's better to use the restart timer if we get a null return from API / underlying connection closed etc. rather than a stop message to user. Also, better use of the bottom left progress meter/messages. Rjwilmsi 09:42, 8 October 2009 (UTC)
MaxSem
Has Max really retired? If so, considering he did a lot of work on AWB I think he should be listed in a "Retired" section under current devs (unless he doesn't want to be, of course). --kingboyk (talk) 18:46, 3 October 2009 (UTC)
What to do until 5.0.1.0
I think it´s almost time to release the new version. We have already fixed 2 main bugs and introduced some new feautures (updated auto-tagger, advanced html scraper, etc.). IMO the only things we have to do before releasing are:
- BUG FIX: Wikipedia_talk:AutoWikiBrowser/Bugs#ArgumentOutOfRangeException_in_AsyncApiEdit.CallEvent. We located the source of the bug.
MINOR: Add tooltips to advanced html scaper (Reedy)- BUG FIX: Shortcut keys not working
Releasing the new version will help us clean the orphans in en.wiki a lot.
--Magioladitis (talk) 10:10, 22 February 2010 (UTC)
Changed my mind: Let's release it as is. :) -- Magioladitis (talk) 22:06, 7 March 2010 (UTC)
User manual update
The user manual could do with a reasonable update. In particular the list of general fixes. I think we need to name each general fix function called within Article.PerformGeneralFixes, give a one line summary of what it does and then list its functionality beneath (with the actual parser function name in brackets). Volunteer to set up that list, which we could all then expand. Rjwilmsi 19:58, 13 January 2010 (UTC)
Profiling returns the following (ignore the numbers):
Initial skip checks: 0 Custom module: 0 External Program: 0 Plugins: 0 HideMoreText: 7 Unicodify: 19 UnHideMoreText: 0 F&R: 0 Categories: 0 Auto-tagger: 12 FixDates: 19 HideText: 0 ArticleIssues: 2 FixHeaderErrors: 61 FixPeopleCategories: 135 SetDefaultSort: 249 FixCategories: 0 FixImages: 0 FixLinkWhitespace: 0 FixSyntax: 54 FixCitationTemplates: 26 FixTemperatures: 8 FixNonBreakingSpaces: 3 FixMainArticle: 0 FixReferenceListTags: 0 FixEmptyLinksAndTemplates: 0 FixReferenceTags: 3 DuplicateUnnamedReferences: 0 DuplicateNamedReferences: 0 SameRefDifferentName: 0 ReorderReferences: 0 FixEmptyReferences: 0 AddMissingReflist: 0 Mdashes: 4 CiteTemplateDates: 0 BulletExternalLinks: 0 RemoveEmptyComments: 0 FixDateOrdinalsAndOf: 4 Links: 0 Metadata: 3 FixLinks: 26 End of general fixes: 0 Images: 0 Make Edit summary: 462 Alerts: 0
-- Magioladitis (talk) 22:28, 14 January 2010 (UTC)
- Make edit summary is rather sloooow —Reedy 07:25, 15 January 2010 (UTC)
I updated the manual a lot. I ll do some more tomorrow. -- Magioladitis (talk) 01:09, 3 February 2010 (UTC)
Merging "plugin" tab from kingbot with start tab
I think we should redesign the tabs a bit. We could merge the plugin tab with the start tab by making different buttons appear depending on whether editor is an admin, bot or none. For example, right now we have page statistics in both tabs and buttons in the plugin tab are a bit larger than the ones in the start tab. -- Magioladitis (talk) 20:16, 25 January 2010 (UTC)
There is a discussion about AWB and DEFAULTSORT
Wikipedia:Village_pump_(policy)#DEFAULTSORT. Better take a look. -- Magioladitis (talk) 11:37, 3 March 2010 (UTC)
Feature request cease-fire
OK. Let's organise a little bit for the new release. Here is my proposal:
- No new feature requests added till new release
- We try to fix as many bugs as possible
- We finish the KingbotK transformation to a new plugin. Main priority is to fix some small bugs that occurred after Reedy backend stuff.
That's all! -- Magioladitis (talk) 03:15, 12 September 2010 (UTC)
- 5.1.0.0 is out. -- Magioladitis (talk) 23:45, 14 October 2010 (UTC)
New AWB tagging bot discussed
Wikipedia:Village pump (proposals)#Bot tagging of new articles. -- Magioladitis (talk) 19:32, 23 October 2010 (UTC)