Jump to content

Talk:Assembly language/Archive 2

Page contents not supported in other languages.
From Wikipedia, the free encyclopedia
Archive 1Archive 2Archive 3Archive 4

In "See also" section the point "List of assemblers" linked to Comparison of assemblers! And where the link to MACRO-11 should be added? --Tim32 (talk) 19:20, 22 June 2008 (UTC)

Syntax section

Since this section has to deal mostly with x86 assembly language, I suggest that it be moved there. ( rCX (talk) 03:59, 4 July 2008 (UTC) )

I moved it. rCX (talk) 00:57, 23 July 2008 (UTC)

FAQ

The representation in here does it refer to the hexadecimal address that machines use, such as when Blue Screen of Death error, they consists of a particulara hardware address in hexadecimal form. (I have very little macros programming foundation.)

This representation is usually defined by the hardware manufacturer, and is based on 
abbreviations (called mnemonics) that help the programmer remember individual instructions, 
registers, etc

By the way some interpreter use to translate XML documents according to XML schema, are they consider assemblers?

--Ramu50 (talk) 03:24, 13 July 2008 (UTC)

This is mostly OT, but I can't stand to let a question on BSODs go unanswered. :) In most blue screen messages, one or more of the hex numbers you see are virtual addresses, and often one of them will be the virtual address of an instruction. If you could look in memory at that address you would find the numeric (usually displayed in hex) coding of the actual instruction (or "opcode"). The blue screen however never displays the instructions themselves, only their addresses. The documentation that comes with the Windows Debugging Tools will tell you all about how to interpret blue screen data. The debugger can of course display the hex coding for the instructions and can also "disassemble" them, turning them back into assembly language mnemonics. There is a decent quick tutorial on using the debugger here: [1]. If you want to know more than most of us need to know about how Intel x86 instructions are encoded, download the three volumes of "IA32 Architecture Software Developer's Manual" from the Intel site, particularly volumes 1 and 2. You don't need to know the instruction coding in depth unless you're writing a compiler, an assembler, a debugger, or the microcode for the CPU itself, but it helps if you understand the general principles and if you can recognize a few common patterns. I can also highly recommend Matt Pietrek's articles from MSJ, "Just Enough Assembly to Get By", for a look at how the x86 instruction set is actually used by everyday code. There's nothing there on how the instructions are coded into hex, though, only on the assembler mnemonics.

I never heard of an XML interpreter called an assembler - they're usually called interpreters, or some more specialized name depending on their function. Jeh (talk) 08:02, 16 July 2008 (UTC)


-- Fasmlib keeps cropping up throughout wikipedia Assembler articles ---

Seems to me the author of Fasmlib is quietly seeking to advertise his product through wikipedia. Fasmlib isnt even a working product as far as I can tell, nor is it unique in what it attempts to deliver ( Randolph Hyde has done a similar, yet complete work in the past ). —Preceding unsigned comment added by 217.42.215.104 (talk) 21:45, 16 November 2008 (UTC)

Hi MrOllie. I'm publishing (with permission) Doug Dingus' comprehensive guide to Assembly Language for the Absolute Begineer. Not trying to sell anything, but I think it would be a valuable external resource that goes into detail beyond what would be appropriate for an article. This is inline with the External Link guidlines. —Preceding unsigned comment added by Nmcclana (talkcontribs) 02:07, 18 February 2009 (UTC)

You're publishing? Please do not add links to your own site. See WP:EL and especially WP:COI. - MrOllie (talk) 15:42, 18 February 2009 (UTC)
This EL doesn't promote anything other than a better understanding of assembly language. And linking to an article you've reprinted doesn't qualify for automatic deletion. MrOllie thinks an EL to 'Assembly Language for the Absolute Beginner' is spam, a COI, and requires deletion without discussion. I think it is a useful guide for someone who would read about assembly language and want to learn more. Any other opinions? --Nmcclana (talk) 19:16, 18 February 2009 (UTC)
Quoting WP:EL - 'In line with Wikipedia policies, you should avoid linking to a site that you own, maintain, or represent — even if WP guidelines seem to imply that it may otherwise be linked.' - MrOllie (talk) 19:26, 18 February 2009 (UTC)

separate Manuals / Tutorials

I think this article would be of more benefit to the public if there was a section for manuals, and a section for tutorials. A manual would be any assembly manual put out by the company that made that particular computer (IBM/Unix/etc.). Online manuals are really of great benefit to anyone interested or working in the field. Tutorials would be those tutorials written by experienced people/teachers/universities/etc. Right now.. it is all jumbled together. Not very user friendly. --stmrlbs|talk 23:10, 9 June 2009 (UTC)

See my update above. I think we should just link to Open Directory. —Preceding unsigned comment added by Yworo (talkcontribs) 23:10, 9 June 2009 (UTC)
Or rather, we should link only to pages/sites about assembly language in general and then provide a link to OD for those wanting to find manuals/tutorials for specific architectures. —Preceding unsigned comment added by Yworo (talkcontribs) 23:13, 9 June 2009 (UTC)
Yworo, I took a look at the Open directory, and it is a nice idea, but I couldn't find some basic things for IBM like the online IBM manuals for assembly language. I am not against putting a link to there, it is a good source - but I don't think it is a complete source (wikipedia isn't either). As for putting the assembler manuals on the 370 system architechure article, then maybe we could put a link to it from here (and the same with other architechures). But I still think that if a company that manufactures the hardware has online manuals which programmers use as reference to program that machine, then perhaps a link to all the manuals from that particular architecture would be great.
--stmrlbs|talk 23:32, 9 June 2009 (UTC)
I don't believe we should be linking to the IBM manuals from this article. I think they should be link from the article about the machine itself. This article is not about specific assembly languages. Maybe there should be subarticles about specific assembly languages, the links would be appropriate there.... —Preceding unsigned comment added by Yworo (talkcontribs) 23:36, 9 June 2009 (UTC)
Then how about something like, There are assembly languages for these architectures: then list the architectures, with wiki links to the articles on the different architectures. Then have a link to the online manuals from the different architectures. --stmrlbs|talk 23:56, 9 June 2009 (UTC)
also, Yworo, if you follow your posts with --~~~~ , wiki will automatically sign the post, so that people will know that you wrote the post. --stmrlbs|talk 23:58, 9 June 2009 (UTC)
Yah, I've just been informed on my talk page about the tildes. Thanks. You may have something there about how to do this. The current list is just so disorganized. Maybe some sort of a "list" subarticle would be better. And what about obsolete architectures, do we list them too? I took a machine language/assembly language class many years ago which used Burroughs, PDP-11, and CDC Cyber assembly languages as examples. PDP-11 assembly may well still be an excellent example of an orthogonally designed assembly language, but the other too are certainly obsolete. Do we link to info about say 8008 assembly? I still think it might be better to put the links on the pages about the machines, but perhaps we could create a see also section which mentions that that is where to find the assembly language reference links? Or? There just has to be a better solution than the current one... Yworo (talk) 13:13, 10 June 2009 (UTC)
Yworo, I think we should just start with what we have, rather than worrying about getting everything in at once. If we set up the structure, then I think people will add in the right place if they can see it easily. How about if I set up something tonight, then you can give me your opinion, or change/tweak it, then we can go from there. Ok? --stmrlbs|talk 18:27, 10 June 2009 (UTC)
Sure thing, go ahead. I'm the newcomer here... :-) Yworo (talk) 19:07, 10 June 2009 (UTC)
Yworo, I took a look around, and I see that this has already been created: List_of_assemblers. At the bottom of the Assembler article, it has this mixed in with other links here: Assembly_language#See_also. I think we should the link to a section by itself with a little paragraph explaining that this is a link of the different assemblers for the different machines. Then, maybe organize the rest of the links so they make better sense. What do you think? --stmrlbs|talk 09:11, 11 June 2009 (UTC)

Yworo, I created a new section: Assemblers#List_of_assemblers_for_different_computer_architectures which just has an introductory sentence explaining that a list of assemblers and architectures is on an associated page. Now, I think we can go through the links, and add those links to different assemblers into the table on the other page (if they are not already there), and take them off this page. The table is a more user friendly presentation of the information, than a bunch of links. You are right in that all the jumbled links with no organization is not good. If you need any help figuring out how to put something in the table, let me know. I will be glad to help --stmrlbs|talk 19:29, 11 June 2009 (UTC)

OK, let's make sure I have this straight, most of the links, the ones specific to architechtures, will go into the table on the subpage? And we'll have only a few general links on this one? I think that solves it quite nicely and is much more navigable and user-friendly. Good work. Yworo (talk) 13:12, 12 June 2009 (UTC)
yes, the table give like a crossroads link page between the different articles - between the assembly language, and the architecture. For each specific language, the table points to a wikipedia article on that language. For each architecture, there is a wikipedia article on that architecture.
So, I was thinking of going back to what you originally suggested, with a variation. For each wiki article on a language, add a subsection: manuals - then add the links to the assembly manuals there. And for each architecture, add a section for manuals. This will streamline this article, which, like you said, shouldn't have every assembler manual referenced in this article, and it will provide a way for a person to see all assemblers and their associated architecture, and point them to the correct articles for more information, and the manuals. If someone adds a manual here, we just move it and explain on the talk page and edit summary - that way the editor is still encouraged to contribute to the article.
I think this will make the wikipedia articles on this subject much more user friendly, and provide a nice resource for the public. I will start with the IBM Z/Architecture Assembler (HLASM) that is my ballpark. --stmrlbs|talk 18:14, 12 June 2009 (UTC)

Modulo

I made a long research to use the Modulo operator in Assembly language and the closest I found was the DIV operator however it's not available on the simple educational Assembler PEP8 [2] (French operators instruction Chapitre 7 in http://www.er.uqam.ca/nobel/k20250/Notes_cours.html).

Is there a simpler way of doing a modulo ? Perhaps doing bitwise operations ? --DynV (talk) 19:29, 12 June 2009 (UTC)

Only thing I can think of is performing the DIV operation to return an integral value and multiplying it with the divisor then subtracting it from the dividend. This is probably how it's done physically in architectures. ChazZeromus (talk) 18:05, 13 August 2009 (UTC)

Vic-20 assembler

The main page for this article includes a reference to an assembler named "French Silk" and touts it as being the smallest assembler ever written. In fact, the Instant Editor Assembler (known more by its acronym, IEA) was the smallest assembler around. I am not sure who wrote it, but remember it was marketed through Randy Chase's Commodore Users' Group Newsletter. I am not a fan of the IEA, as I was one of many dissatisfied customers who bought a copy of it, and tried to use it. French Silk, on the other hand, had a much better reputation for speed of execution and assembling. Dexter Nextnumber (talk) 08:22, 2 January 2010 (UTC)

No size given, but it's mentioned here Tedickey (talk) 14:10, 2 January 2010 (UTC)

Pagetable.com was removed from the external links/software section - pagetable.com is a unique resource for assembly language analysis and history/evolution of the language across various architectures and compilers; I believe it comes into category 3, Wikipedia:EL#What_should_be_linked. One of the admins should examine the content of pagetable.com and other links and not just delete en masse because they don't have either the expertise or time to identify useful resources 121.45.167.176 (talk) 20:29, 19 March 2010 (UTC)

Manuals, etc.

The list of manuals, tutorials, etc. seems too long. I would argue that only links to sites/pages which are about assembly language in general should go here. Links for specific assembly languages for specific machines or chips belong on the article about that machine or chip. —Preceding unsigned comment added by Yworo (talkcontribs) 23:06, 9 June 2009 (UTC)

I'd also propose that we link to the Open Directory page at http://www.dmoz.org/Computers/Programming/Languages/Assembly/ Wikipedia can never be a complete directory to pages about every assembly language and we should not try to be. A link to Open Directory will give the read a much better overview and categorization of assembly languages.

I believe that references for specific assemblers belong here and that they do not belong in articles on hardware.
OTOH, some of the references are about architecture and hardware and have nothing to do with assembler languages; I believe that those should be moved. Shmuel (Seymour J.) Metz Username:Chatul (talk) 12:27, 10 August 2010 (UTC)

Automatic optimization

Some assemblers do limited Peephole optimization, e.g., converting long branches to short branches. I believe that there should be a discussion of this in the article, preferably with some examples. Shmuel (Seymour J.) Metz Username:Chatul (talk) 12:39, 10 August 2010 (UTC)

Neutral nomenclature

Some terms have different definitions in different assemblers. In particular, some assemblers use the term pseudo-instruction to refer to assembler directives. The article should avoid conveying an impression that the nomenclature used is universal. Shmuel (Seymour J.) Metz Username:Chatul (talk) 13:22, 10 August 2010 (UTC)

There's no neutrality/POV issue here, so I'm removing the systematic bias template. If reliable sources give distinctly different definitions of a term, and I think this is the case, the article should reflect this. That much is very straight forward. ButOnMethItIs (talk) 08:47, 6 September 2010 (UTC)
The NPOV issue is the 'removal of neutral language, as discussed in #A pseudo-opcode is a directive. Shmuel (Seymour J.) Metz Username:Chatul (talk) 13:35, 6 September 2010 (UTC)
I think you misunderstand the meaning of neutrality. Your language may have been more accurate, but it was not more neutral. Again, I see no NPOV issue. ButOnMethItIs (talk) 15:48, 6 September 2010 (UTC)

Removal of Systemic bias template

I did in fact initiate a discussion prior to adding a {{Systemic bias}}; the proper course for someone who disagreed would have been to discuss the reasons instead of removing the template. Shmuel (Seymour J.) Metz Username:Chatul (talk) 15:09, 7 September 2010 (UTC)

There was no on-going discussion or clear claim of bias. If there were, I would have started there. ButOnMethItIs (talk) 15:54, 7 September 2010 (UTC)
There was, and the template linked to it. Shmuel (Seymour J.) Metz Username:Chatul (talk) 18:00, 7 September 2010 (UTC)

Clarification of extended mnemonics?

There is a comment suggesting that the following text is unclear

Pseudo-opcodes are often used within the instruction set to support alternative mnemonics for instructions that the CPU designer did not specifically include. For example, many older CPUs do not have a true nop (no operation) instruction. But often there is another instruction that can be used instead with the same effect as a nop. In 8086 CPUs the instruction xchg ax,ax is used for nop. With nop being a pseudo-opcode to encode the instruction xchg ax,ax. Some disassemblers recognize this and will decode the xchg ax,ax instruction as nop.

I agree that it is misleading, and suggest the following

extended mnemonics are often used to support specialized uses of instructions, often for purposes not obvious from the instruction name. For example, many CPU's do not have an explicit NOP instruction, but do have instructions that can be used for the purpose. In 8086 CPUs the instruction xchg ax,ax is used for nop, with nop being a pseudo-opcode to encode the instruction xchg ax,ax. Some disassemblers recognize this and will decode the xchg ax,ax instruction as nop. Similarly, IBM assemblers for System/360 and System/360 use the extended mnemonics NOP and NOPR for BC and BCR with zero masks.

Shmuel (Seymour J.) Metz Username:Chatul (talk) 15:22, 8 September 2010 (UTC)

Meta: Quotes and italics

There is a general convention to use italics to represent quoted material that is not surrounded by quotaion marks or apostrophes. User:Nigelj recently added quotation marks around material that already was framed by double apostrophes, the Wiki markup for italics. E.g.,

"floating point partial arctangent"

Shouldn't it be one or the other but not both? Shmuel (Seymour J.) Metz Username:Chatul (talk) 14:48, 19 November 2010 (UTC)

Yes, I added the double quotes after having added the wikilinks inside the italic phrases.diff I wasn't sure, but I felt that the blue then black type broke up the phrases making it difficult to see what was going on. I was in two minds about the double quotes, but was just trying to make the meaning easy to see. I'm not attached to them and they could be removed without any problem if people think that is best. --Nigelj (talk) 22:58, 19 November 2010 (UTC)

Prevalence of name spaces

Are there any data to support the claim that most assemblers have symbol management, e.g., name spaces? There are certainly many assemblers that don't. Shmuel (Seymour J.) Metz Username:Chatul (talk) 18:13, 23 August 2010 (UTC)

I seriously doubt it. I'm sure most assemblers are extremely spartan in their feature set. ButOnMethItIs (talk) 01:00, 7 September 2010 (UTC)
And for that reason, I have replaced "most" with "some." Although I don't have any reference to support even "some" I assume that the person who put the statement there in the first place would have first-hand experience. Most of the assemblers that I have used have been, as noted above, quite spartan in the features they supported. An old 6502 assembler wouldn't even calculate forward branches for you. Dead Horsey (talk) 06:48, 21 December 2010 (UTC)

Add references or TMI?

I corrected the dates for some milestones in the evolution of assembler languages, and in the process I wondered whether it would be appropriate to add references, e.g., 705 Autocoder, 709 FAP, for the approximate dates. Would that be helpful, or would it be TMI? Shmuel (Seymour J.) Metz Username:Chatul (talk) 21:12, 18 January 2011 (UTC)

Why use assembly language?

Hi all,

I stumbled across this page via Random Article, it brought back happy memories of writing TSRs and trying to understand PC BIOS routines (ahem). I feel that the first main section (Why use assembly language?) dives a little deep into the nitty-gritty for an opener. But the first para of the Historical perspective section is just what the article needs at the beginning (from a layman's point a view), and I was wondering about incorporating it at the top of the page. MinorProphet (talk) 23:03, 28 April 2011 (UTC)

Macros

The section on macros has a lot of good information, but it seems to get very off-topic. It starts with a good discussion of the use of macros, and then diverges into specific uses of macros in legacy systems, using the macro assembler as a code generator for COBOL, history of the C preprocessor, and then discussions of the underlying structure of Prolog, LISP, and Forth. I'd like to trim this section down dramatically, and move the more advanced content to another article. Any comments? Dead Horsey (talk) 06:55, 21 December 2010 (UTC)

I agree. It's legitimate to mention that macro assemblers inspired macros in other language, but not to go into detail about macros in languages other than assemblers. Separate articles about macros in document formatting languages, e.g., SCRIPT, general programming languages, e.g., PL/I, and shell script languages, e.g., CLIST, would be helpful. Shmuel (Seymour J.) Metz Username:Chatul (talk) 11:20, 21 December 2010 (UTC)
I will try to make some time in the next week to make these changes. I need to find out if there are other articles where the content can move. Someone put a lot of effort into writing all that information, and if it can be moved somewhere else that makes sense, I'd rather do that instead of deleting it. Dead Horsey (talk) 03:52, 22 December 2010 (UTC)
Discussion of using the assembler macro facilities to generate code in other languages definitely belongs in this article. As I noted last year, discussion of features in other languages inspired by the macro languages of assemblers probably does not belong here. Shmuel (Seymour J.) Metz Username:Chatul (talk) 18:47, 11 December 2011 (UTC)

Why Assembly?

The "Why Assembly Language" section focussed on the question why to use assembly language rather than to code mahine instruction dirctly. I think it gives a helpful explanation of the functionality provided by an assembler. But no sensible person will ever code machine instructions directly (e.g. with a binary editor), so this alternative is merely academic. or educational, if you like.

In contrast, the question why to write assembly language rather than a high(er) level language is a serious practical question, and I started reading the section with that question in mind. Because this aspect was not covered, I added some text. Because I am not a professional programmer, I don't know to what extent assembly language is still used today. Perhaps others can add some comments to that extent. Anyway, I am old enough to remember the days when even administrative programs were occasionally written in assembly language. Which is not as bad as it may seem if appropriate macro's and subroutines are used. I guess that the introduction oof C in the 1970's has greatly reduced the need to revert to assembly language. Older languages like COBOL, FORTRAN, ALGOL or even Pascal are less suited to write operating system functions, not to speak about the terrible "esperanto" developed by IBM in the 1960's called PL/I. Rbakels (talk) 10:35, 9 December 2011 (UTC)

Actually, people did program in machines language. The three situations I'm aware of are
  • Pedagogical. Force students to write in machine language to give them an understanding of why they will be using assembler for the rest of the semester.
  • Binary patches[NB 1]. This was more common when assemblies ran for a long time, but still is done when the source code is not available.
  • During the development of a new machine.[NB 2]
I would argue that PL/I, used in Multics and PRIMOS, is a much better choice than C for writing an operating system. Further, the experience of Burroughs[NB 3] writing MCP suggests that Extended ALGOL[NB 4] is also a better choice. Shmuel (Seymour J.) Metz Username:Chatul (talk) 18:39, 11 December 2011 (UTC)
  1. ^ Sometimes automated with “zapmaker” programs that post-process assembler output
  2. ^ These days it is more common to write a cross assembler.
  3. ^ Now Unisys
  4. ^ including ESPOL
Chatul - I'd agree about PL/I, but arguing over choice of a favored language is about as useful as arguing whose wife is better. Peter Flass (talk) 13:37, 16 January 2012 (UTC)

Help request: Decoding starting opcode 69h at ofs 000h in FAT formatted floppy boot sector

I am currently working on a significant overhaul of the File Allocation Table article for accuracy and completeness, and I have run into a question I was unable to answer myself so far. Perhaps someone of you can help out and provide an answer.

Primer: All FAT formatted volumes since DOS 2.0 contain a BIOS Parameter Block (BPB) in the boot sector describing the volume, wheras DOS 1.x volumes did not. In their course to determine the actual medium format, among other methods MS-DOS, PC DOS and DR-DOS check various byte patterns at offset 000h in a boot sector in order to find out, if a given sector might contain some form of BPB or not. Volumes containing a BPB typically start with a jump instruction at offset 000h to skip over the BPB. Patterns tested for by DOS include a short jump sequence "JMPS ??, NOP" (EBh ??h 90h, as seen since DOS 3.0) or a near jump (E9h ??h ??h, as seen on DOS 2.x formatted disks). On harddisks, DR-DOS (but not MS-DOS/PC DOS) additionally checks for a swapped sequence 90h EBh ??h. (To be precise, these tests alone are not enough to be sure a BPB is present since some DOS 1.1 disks contain EBh ??h 90h as well, but still have no BPB; but I won't go into further details here, as it doesn't matter in regard to my question below.)

On floppies, all these operating systems (MS-DOS, PC DOS and DR-DOS) also check for a byte pattern 69h ??h ??h at offset 000h in the boot sector. This is documented for MS-DOS/PC DOS in at least one book, and it can be found in the OpenDOS source code as well, but without further explanation. Stepping through MS-DOS in a debugger it can be verified that the test actually exists in MS-DOS as well, however, in none of the books I have checked so far, 69h is a valid x86 opcode. So, I wonder what it is. Perhaps a jump instruction for a non-x86 processor? Mind, that the FAT file system was also used on Ataris (Motorola 680x0) as well as on some very late CP/M variants (8080) and all MSX-DOS machines (Z80), but a starting opcode of 69h does not seem to make sense on these platforms as well. The IBM PC RT was built around the IBM ROMP processor, a RISC processor -- certainly not x86-compatible, but unfortunately I cannot find any opcode maps for this specific processor. What about the extra opcodes supported by the NEC V20/V30? Or some undocumented opcode? Was there any other platform important enough to have Microsoft or IBM add this special test into the volume mount code of MS-DOS/PC DOS? Windows NT? Any ideas? --Matthiaspaul (talk) 01:41, 21 January 2012 (UTC)

General cleanup

I did some more general cleanup on this article, but it is still disorganized and full of redundancies. In particular the sections "current usage" and "typical applications" should be combined, but there are lots of other instances of duplication.Peter Flass (talk) 13:39, 9 March 2012 (UTC)

Two of these things are not like the others...

Please stop adding these external links:

In the first place, if you look you will notice that the "external links" here are pointers to information, not pointers to specific assemblers - the page List of assemblers is intended for that. In the second place, BBC Basic is not an assembler, even if it contains one. By that criterion, almost all C compilers would be assemblers because they allow imbedded assembly code. Peter Flass (talk) 23:34, 7 June 2012 (UTC)

Locked

As a new editor I cannot yet edit this article. Is there a way to get it unlocked? I only want to make some minor edits to make it read more smoothly. — Preceding unsigned comment added by Geau (talkcontribs) 12:09, 26 August 2012 (UTC)

See this: http://wiki.riteme.site/wiki/Wikipedia:Protection_policy#semi HumphreyW (talk) 12:33, 26 August 2012 (UTC)

Thank you. I can see why it was protected, from looking at the history. I suppose I can wait until I am validated. --Geau (talk) 15:41, 26 August 2012 (UTC)

Immediate encodings

I thought I'd put a little section about the ModR/M operand specifier byte underneath the opcode section. Please edit if their are any erroneous typings in my words lol. ChazZeromus (talk) 18:11, 13 August 2009 (UTC)

I reverted this change. You are describing the underlying machine language of x86, not assembly language. The ModR/M byte details, etc., aren't part of and are not reflected in the assembly language. You could however see if the x86 architecture article could use this material. Jeh (talk) 18:14, 13 August 2009 (UTC)
Hm, maybe there should be a section approaching the architecture of the assembler itself? This article describes the different language, and so I thought I'd put an example, but I forgot about the assembler part! Alright, I'll see what I can plug around in the x86 article. 72.91.245.68 (talk) 21:45, 13 August 2009 (UTC)
Some assemblers are remarkably flexible when it comes to typing in immediate code. I can't speak for the x86 series of microprocessors as I have no real experience with that microprocessor, but there are assemblers around for 6502s that let you type the following kind of stuff in:
hex 41424344 45464748  ; lowercase PETASCII clumped together into 4 byte series
hex 41 42 43 44 45 46 47 48  ; lowercase PETASCII with a space between the arguments for easier reading
hex c1 c2 c3 c4 c5 c6 c7 c8  ; uppercase PETASCII same thing as above, but it's standard uppercase format
bin 1010 0000 1011 1111  ; series of 4 bit nybbles with zeroes on the upper nybbles
scr "abcdefgh"  ; VIC-II compatible screencodes
scr "ABCDEFGH"  ; VIC-II compatible screen codes
and so on. Naturally, good assemblers also let you decide for yourself whether the bytes are being generated in reverse order (so, in effect, you could spell strings backward). Assemblers usually (or at least often) try to avoid byte-framing errors by keeping the object code on an even boundary, unless the programmer actually wanted things to end up on odd boundaries for copy protection purposes. Similarly, some assemblers are designed in such a way that you can generate and deposit object directly in memory, or in specific tracks and sectors. An essential feature for programmers who prefer to write their own Disk Operating Systems. Dexter Nextnumber (talk) 22:29, 9 December 2009 (UTC)


As far as i know, *all* assemblers have directives for entering Hex and other literal values. There is nothing special about what you are describing. Good little assemblers everywhere, let you enter the byte codes "in the same sequence" that you want them to appear in memory, the only issue being endieness. It would be a very unusual assembler indeed that would reverse the order of an arbitrary sequence of bytes for you. Maybe as a Unicode function, but not otherwise, I don't recall ever encountering such a directive. Assemblers only align the bytes on memory boundaries when and where you tell them to by using the appropriate directives they have no crystal ball to allow them to predict where the alignment should be placed. And unless you are dealing with FORTH and writing to BLOCKS which are obsolete, then assemblers do NOT "deposit object directly ... in specific tracks and sectors" assemblers use ordinary files same as any other program and they generate executable machine code not objects. Well, perhaps this is something unique to the VIC20? does the VIC not have a file system? I've never used a VIC, maybe it does use tracks and sectors, some early mainframes did use tracks and sectors because that was before the development of file systems. OldCodger2 (talk) 22:02, 15 September 2012 (UTC)

/* Macros */ removed a bogus claim, about built-in operations for gaming and data managment

Removed: "Many assemblers have built-in (or predefined) macros for system calls and other special code sequences, such as the generation and storage of data realized through advanced bitwise and boolean operations used in gaming, software security, data management, and cryptography."

The above is just total nonsense. First off, hardly any assembler has anything like this. Second off, if it's a macro library, then that has nothing at all to do with the assembler itself. Third, I know of nothing that is specific to "gaming" that is implemented in any cpu architecture unless it is a custom designed cpu. Intel has announced/documented built-in instructions for AES, but guess what, they have yet to ship any actual x86 cpus which implement these instructions -- see Intel's manual for details. By advanced bitwise operations, I guess you mean the usual shifts and rotates? nothing special about that. me? I've programmed so many different computers using so many different assemblers that I've lost count. OldCodger2 (talk) 22:13, 15 September 2012 (UTC)

I looked at this claim crosswise too. I have a vague recollection that some assemblers may have had predefined macros, maybe SDS Metasymbol for example, but not for anything mentioned. In any case I think it would need a citation. Peter Flass (talk) 23:05, 15 September 2012 (UTC)
While we're looking at macros, the article says this "Note that this definition of "macro" is slightly different from the use of the term in other contexts, like the C programming language. C macro's created through the #define directive typically are just one line, or a few lines at most. Assembler macro instructions can be lengthy "programs" by themselves, executed by interpretation by the assembler during assembly." While this may be true of C other high-level languages have macro facilities that can be and sometimes are used to write lengthy programs, PL/I, for example. Peter Flass (talk) 23:08, 15 September 2012 (UTC)
Also, both assembler and PL/I macros have been written with the function of generating text unrelated to the native language, e.g., Stage 1 of the OS/360 systems generation (SysGen) process.Shmuel (Seymour J.) Metz Username:Chatul (talk) 12:00, 19 September 2012 (UTC)
In the early days it was common for assemblers to have built-in macros but not macro libraries. However, I'm not aware of any that did so in the time-frame of the game consoles; the cases I'm aware of were much earlier. Shmuel (Seymour J.) Metz Username:Chatul (talk) 12:00, 19 September 2012 (UTC)

I disagree with your revert -- Data Sections

EvilKeyboardCat (talk · contribs) left the following comment on my talk page. I'm moving it here, as it would be better if other editors of this page joined in.

"I don't agree with either of these changes: 'sections' are not instructions; hardware architecture can easily be a c..."
I simply provided context and added a link, what is there not to agree with?
If you don't agree please fix rather than revert.
I'm an experienced assembly programmer for Hobby operating systems and as far as I'm aware data sections are produced by pseudo-ops, which are instructions.
Please discuss.

My edit summary was truncated by Twinkle, but I went on to say something like "hardware architecture can easily be a constraint on the use of high-level languages." Taking the two parts of the edit in turn, the edit changed a sentence to begin "Data sections are instructions used to define data elements..." My point is simply that data sections are sections of assembly programs. They contain instructions, and these instructions may well be pseudo-ops, but sections are not instructions. They are sections of the programm, or sections of memory - however you look at it - that contain instructions, or data. I cannot see how more simply to state that. Adding this minor confusion of terminology is not an improvement. The second part of the edit I reverted changed a sentence to specifically say that "constraints or peculiarities in the target operating system's architecture" may prevent the effective use of higher-level languages (The edit also spoiled the number agreement in the sentence by changing prevent to prevents, even though constraints or peculiarities were still plural). My point was that although the operating system may reflect and model the constraints or peculiarities of the underlying hardware, there is no benefit in specifying that it is not the hardware but the OS that has the constraints or peculiarities. Indeed, very often it is the OS that is being written in Assembler, so that the point as changed becomes rather tautologous. As to my admonishment to "please fix rather than revert", that is what I did: the phraseology was fine beforehand, the edit did not improve it, so I fixed it by putting it back exactly as it was, since there was nothing that I can see that was wrong with those passages in the first place. --Nigelj (talk) 16:52, 2 November 2012 (UTC)

Okay, I agree with you on the point that data pseudo-ops are sections not instruction but would it be better for the start of the text on data sections to read "Data section are" not "These are"? Perhaps it could be reworded? Also I think my link to the variables page was justified. EvilKeyboardCat (talk) 00:09, 3 November 2012 (UTC)

Data Sections is still wonky... I think the term is being misapplied. Normally the SECTIONS of a program are used to define the memory space in which it will reside. The typical divisions are between code and data spaces with some operating systems providing Read Only Protection for the code space. Other section properties are for Read Only Data and memory address alignment. Sections can be used to group related parts of data together. What is currently described by this article does not fit any of the above but instead appears to be talking about DIRECTIVES that can be used to define DATA TYPES. A TYPE is not a SECTION, the concepts are completly different. Also DIRECTIVES are sometimes called PSUDO-OPS but they are never (as far as I know) called INSTRUCTIONS. Please clarify your intent... Tautologies aside, how much assembly programming have you people actually done? OldCodger2 (talk) 00:23, 13 December 2012 (UTC)

I regularly work on a 16-bit x86 hobby operating system and the operating system kernel is written in assembly, which I have worked on many times. The operating system compiles using NASM and section 3.2 of the NASM manual references data and macro statements as Pseudo-instructions. This is what I though they were called, i.e. there are real and pseudo instructions. Here is a quote from the manual: Pseudo-instructions are things which, though not real x86 machine instructions, are used in the instruction field anyway because that's the most convenient place to put them. The current pseudo-instructions are DB, DW, DD,DQ, DT, DO and DY; their uninitialized counterparts RESB, RESW, RESD, RESQ, REST, RESO and RESY; the INCBIN command, the EQU command, and the TIMES prefix.. NASM calls them pseudo instruction but different assemblers (like FASM or MASM) may call them by different names. We should put the most use name first then a (also known as foobar) afterwards. EvilKeyboardCat (talk) 08:46, 13 December 2012 (UTC)
Nomenclature is not consistent between systems, and sometimes not even consistent between assemblers on the same system. My background is heavilly S/360, through z, although I've been exposed to a dozen or so assemblers on other systems.
A section is a block of storage[a]that a loader can treat as a unit; it can contain code, data or a mixture, and can be r/o, writable or copy on write. A section usually has a name, although do to FORTRAN there is supprot for unnamed common sections. The details depend on the program structure of the OS[b], not on the hardware architecture.
I wouldn't mention this had you not asked, but I've been writing in assembly languages since 1960; I feel confident that there are others here of the same vintage. Shmuel (Seymour J.) Metz Username:Chatul (talk) 17:26, 18 December 2012 (UTC)

Notes

  1. ^ It may be delimited by a pseudo-op, but it is not itself a pseudo-op.
  2. ^ A single assembler may support more than one object format.



I still disagree with the wording of sections. Perhaps it's because my experience has mostly all been with microprocessors and yours is mostly mainframes??? If you look at the NASM manual and how it describes sections for x86... which is also very similar to how sections are used for z80 and other processors I've programmed. Here is what the manual has to say. And I cannot reconcile what the manual says with what this article trys to say.... OldCodger2 (talk) 10:45, 29 January 2013 (UTC)

 6.3 SECTION or SEGMENT: Changing and Defining Sections
  
 The SECTION directive (SEGMENT is an exactly equivalent synonym) changes which section of the output file the code you write will be assembled into. 
 In some object file formats, the number and names of sections are fixed; in others, the user may make up as many as they wish. Hence SECTION may 
 sometimes give an error message, or may define a new section, if you try to switch to a section that does not (yet) exist.
 
 The Unix object formats, and the bin object format (but see section 7.1.3, all support the standardized section names .text, .data and .bss 
 for the code, data and uninitialized-data sections.
 

A few points.
The nomenclature among assemblers is inconsistent, and any of the terms directive, pseudo-instruction or pseudo-operation may be used with the same meaning. As noted, a section is not a pseudo-op, but the selection of what code goes into what section is controlled by pseudo-ops.
The details of defining sections have almost nothing to do with the hardware's architecture and are strongly tied to the operating system. Historically sections were influenced by the need to support named common in FORTRAN. The NASM paragraph 6.3 quote is fully consistent with that. Shmuel (Seymour J.) Metz Username:Chatul (talk) 22:58, 31 January 2013 (UTC)

Compiler optimization versus hand optimization

One fact to to keep in mind is that code rearrangements that improve performance on one processor may degrade it on another. A compiler (including an assembler) that has an option to do optimization for a specific processor may give result better than an assembler program hand optimized for one processor but run on another. Of course, configuration management will be more complicated if you have to distribute more versions of the object code. Shmuel (Seymour J.) Metz Username:Chatul (talk) 18:40, 22 January 2014 (UTC)