Talk:Reed–Solomon error correction/Archive 3
This is an archive of past discussions about Reed–Solomon error correction. Do not edit the contents of this page. If you wish to start a new discussion or revive an old one, please do so on the current talk page. |
Archive 1 | Archive 2 | Archive 3 |
Other algorithms used for RS codes.
The RS article leaves out some other algorithms, some of which would take a while to explain. Here's a list:
- erasure and error handling - The algorithm used to modify syndromes given a set of know erasure (error) locations. The result is yet another set of linear equations that can be solved with various algorithms. The modified syndromes are then used to calculate any unknown error locations.
- I think the term Forney syndrome is used for this method of erasure processing. It would be good to work this information into the article. I have a page I've been working on that shows some examples: v:Reed–Solomon codes for coders. It's pretty rough, but there might be something useful in there. Bobmath (talk) 04:20, 26 October 2011 (UTC)
- No actually there is no clear term, but usually the most common I've seen are errors-and-erasures decoder, or errata decoder. And in fact the Forney syndrome is not necessary for errata decoding, one can just adapt the Berlekamp-Massey/Euclidian/any other decoder to compute the errata locator polynomial given the erasure locator polynomial. See Figure 7.10 in "Algebraic codes for data transmission", Blahut, Richard E., 2003, Cambridge university press. Lrq3000 (talk) 00:35, 24 June 2015 (UTC)
- I think the term Forney syndrome is used for this method of erasure processing. It would be good to work this information into the article. I have a page I've been working on that shows some examples: v:Reed–Solomon codes for coders. It's pretty rough, but there might be something useful in there. Bobmath (talk) 04:20, 26 October 2011 (UTC)
- hardware invesion of a finite field binary polynomial based number using a sub-field. - For example, an 8 bit field is normally implemented as finite field math modulo some 9 bit primitive polynomial. A compatable 8 bit field can also be implemented as finite field math modulo a 3 nibble (4 bit) polynomial 1 x2 + a x + b. The coefficients of the 3 nibble polynomial are finite fields modulo via one of three 5 bit primitive polynomials (hex 13, 19, or 1f). For each 5 bit polynomial there will be 4 polynomials 1 x2 + a x + b where the fields are compatable, where addition (exclusive or), multiplication, and division of numbers with the same logarithm produce answers of numbers with the same logarithm. Any compatable nibble based polynomial can be used. The process involves mapping the 8 bit number to the nibble based 8 bit number, doing the math to invert it, then mapping back. A 1/x table is still needed, but it only has 16 four bit entries. The point of all of this is that for the 8 bit case, this procedure ends up at around 400 gates with a single propagation delay through the stages of gates, versus the number of gates required for a 256 byte lookup table. Which sub-field is choosen can slightly affect the number of gates required. Note the lookup table takes more gates, but there are fewer gate stages and the propagation time is less than the sub-field method.
- multi-layer type RS code algorithms - Used in cd-roms, dvds, magnetic tape. Treated as a matrix, where the innner layer rows typically have weak RS code, while the outer layer columns have a strong RS code. The outer layer can be optimized by handling corrections mostly as erasures (each row would be tagged as erasure location for the columns).
- parities to syndromes, syndromes to parities - Parities or re-encoded partities (received message re-encoded) can be converted into sydromes via a matrix multiply. I'm not sure this saves much in hardware or software, just that it is possible. Syndromes to parities: the original message has zeroes appended to it, then those zeroes are considered erasures, and the message is corrected to add the parities. This eliminates the need for the generator polynomial in hardware or software.
Rcgldr (talk) 01:20, 21 October 2011 (UTC)
- Your other points are interesting, but some of your terminology seems strange. Compatible doesn't seem to be a standard term for what you're doing, for example, and I thought the term nibble died 20 years ago. Also, I don't think the article needs to include every possible implementation method, because that will start to get confusing and unwieldy. Bobmath (talk) 04:20, 26 October 2011 (UTC)
- @Bobmath: - The proper term is isomorphic, and all fields of the same size are isomorphic, but require mapping of elements from one field to another in order for the fields to be isomorphic: map(a+b) = map(a) + map(b), map(a b) = map(a) map(b). Rcgldr (talk) 01:22, 5 August 2020 (UTC)
- nibble is still used when describing things such as Binary-coded_decimal. Instead of compatible, I've also seen equivalent used. I'm not sure if there is a standard term for this, as I haven't seen it in any text books, although most experts in RS coding are aware of this method for inverting numbers. The two most common mappings would be GF(28) to GF(162) or GF(210) to GF(322). As a bit of trivia, for a GF(28) primitive polynomial: b8 X8 + b7 X7 + ... + b1 X + 1, then the product of of the 4 compatible (out of 64 possible) GF(162) (4 bit coefficient) primitive polynomials (1 X^2 + c1 X + c2) ... (1 X^2 + c7 X + c8) = b8 X8 + b7 X7 + ... + b1 X + 1, except that the b's are now 4 bit coefficients, but will only have the values 0000 or 0001. This method is used in some computer peripherals (hard drives, tape drives, ...). Rcgldr (talk) 05:33, 26 October 2011 (UTC)
- BCD is another thing that I thought had died 20 years ago. (They're no longer supporting it in the x86-64 instruction set, it seems.) It sounds like what you're describing is a field isomorphism (a subfield is something completely different). My abstract algebra isn't terribly strong -- does an isomorphic field with the right properties always exist, or only in certain cases? Bobmath (talk) 06:13, 26 October 2011 (UTC)
- Bobmath - It is my understanding that an isomorphic field always exists based on the statement that all fields of the same size are isomorphic, but for large fields, such as GF(2^1024), finding a way to map to an alternate field is difficult (and is part of the reason some encryption methods are based on large fields). For smaller fields such as GF(2^32), a somewhat optimized brute force search to find a proper mapping takes a few minutes on a typical PC. For something like GF(2^64), a better approach would be needed, but I haven't found an article on how GF(2^64) could be mapped. In some cases, a composite field like GF((2^16)^4) is used instead of GF(2^64), knowing that it could be mapped from or back to GF(2^64), but never actually doing the mapping. Rcgldr (talk) 14:37, 5 August 2020 (UTC)
- Off topic, but BCD is still around, mostly because the accounting and banking industry use math based on BCD to avoid any issues related to binary :: decimal conversion issues (required by laws and/or industry standards). BCD is how Cobol's packed decimal would be stored on most computers. Rcgldr (talk) 08:22, 26 October 2011 (UTC)
- The idea Rcgldr is trying to describe is constructing the finite field GF(p^m) not as a polynomial ring over GF(p) but over GF(p^l) for some l|k. (Inversions in GF(p^l) are precomputed and stored in lookup tables, but polynomials are significantly shorter, reducing computation efforts.)
- Re isomorphism: All finite fields of a given size (order) are isomorphic to each other. That's why there is this simple notation of GF(size).
- Re nibble: Aside from assembler programmers, network folks use it, too, at least occasionally. You will still find it in a few more recent IETF documents. Nageh (talk) 07:11, 26 October 2011 (UTC)
- All finite fields of a given size (order) are isomorphic to each other. - This is why I didn't use the term isomorphic. The conditions required to take advantage of mapping for inversion require that all math operations produce the same results when expressed as powers of each fields primitive element α. Multiplication and division aren't an issue since αi * αj = αi+j, and αi / αj = αi-j will hold true for any field with a primitive element α. The issue is addition and subtraction (or exclusive or for binary based fields), where αi + αj = αk and αk - αj = αi hold true for the same i, j, and k, in all compatible or equivalent fields. For each of the 3 possible GF(24) defining polynomials, hex 13, 19, or 1F, there are 64 possible fields that could be used to map a specific GF(28) to a specific GF(162), but only 4 of those fields will meet the addition and subtraction (in this case exclusive or) requirements. Again, the point of this is that it takes a fraction of the gates (although a longer propagation delay) to implement inversion via this type of mapping. Rcgldr (talk) 08:33, 26 October 2011 (UTC)
- So you require that the binary representation of all field members is (essentially) the same for both fields? (It could be bit-inverted, or... what are the other two cases, most-significant bit flipped I guess.) I don't think there is a name for what you describe, or at least I'm not aware of it. Nageh (talk) 09:47, 26 October 2011 (UTC)
- All finite fields of a given size (order) are isomorphic to each other. - This is why I didn't use the term isomorphic. The conditions required to take advantage of mapping for inversion require that all math operations produce the same results when expressed as powers of each fields primitive element α. Multiplication and division aren't an issue since αi * αj = αi+j, and αi / αj = αi-j will hold true for any field with a primitive element α. The issue is addition and subtraction (or exclusive or for binary based fields), where αi + αj = αk and αk - αj = αi hold true for the same i, j, and k, in all compatible or equivalent fields. For each of the 3 possible GF(24) defining polynomials, hex 13, 19, or 1F, there are 64 possible fields that could be used to map a specific GF(28) to a specific GF(162), but only 4 of those fields will meet the addition and subtraction (in this case exclusive or) requirements. Again, the point of this is that it takes a fraction of the gates (although a longer propagation delay) to implement inversion via this type of mapping. Rcgldr (talk) 08:33, 26 October 2011 (UTC)
- The binary representations won't be the same, but if you generated sets of 256 by 256 byte lookup matrices for each compatible field, each with four matrices: addition, subtraction, multiplication, and division, with all entries in each matrix being the logarithm base α for each field (except for log(0), although this could be set to hex ff, since it's not used elsewhere), those logarithm matrices would be identical. I added a section to your talk page. Can I post a direct link to a document from my web site here as an example? Rcgldr (talk) 10:06, 26 October 2011 (UTC)
- There is no problem linking external content on a talk page if it fits within the context of a discussion. I have to think it through which mapping is required to fulfill the particular requirements you specified, but gotta leave now. Nageh (talk) 10:18, 26 October 2011 (UTC)
- Just to make sure I get you right... you do require that α has the same binary representation in the isomorphism, right? Nageh (talk) 10:36, 26 October 2011 (UTC)
- No, just that math based on powers of α is the same for both fields. For a GF(28) field based on the 9 bit polymomial 11D hex, α is 1X + 0 = hex 2. I could choose a GF(162) field where α = 1X + 1 = hex 11, as long as it met the requirements that given i and j, that k is the same for both fields for addition (exclusive or): αi + αj = αk (multiplication and division will always be compatible), but choosing a field with α = 1X + 1 would consume more hardware than choosing a field where α = 1X + 0 = hex 10. Rcgldr (talk) 10:58, 26 October 2011 (UTC)
- The binary representations won't be the same, but if you generated sets of 256 by 256 byte lookup matrices for each compatible field, each with four matrices: addition, subtraction, multiplication, and division, with all entries in each matrix being the logarithm base α for each field (except for log(0), although this could be set to hex ff, since it's not used elsewhere), those logarithm matrices would be identical. I added a section to your talk page. Can I post a direct link to a document from my web site here as an example? Rcgldr (talk) 10:06, 26 October 2011 (UTC)
All finite fields of a given size (order) are isomorphic to each other. A follow up on this, all fields of a given size are isomorphic, but require the mapping of elements to / from one field to another for the mapped elements in the "other" field to be isomorphic, such that map(a+b) = map(a) + map(b) and map(a b) = map(a) map(b). Typically the mapping is done by generating a mapping matrix to map (matrix multiply) elements from one field to another and the inverse of that matrix to map from the other field back to the first. Doing a web search for "sub-field mapping", "composite field", "field extension", ... , will find articles that include examples of mapping matrices and their inverses, but I have yet to find an article that explains how those matrices are generated. I created an explanation for a specific case of the inversion step used for AES encryption: Composite Field Mapping Example pdf . I would assume there is some text book or classroom notes that explain the process, but I have yet to find one. Rcgldr (talk) 17:24, 4 August 2020 (UTC)
Other algorithms used for RS codes - hardware inversion
- Here's an example table in powers of α order for two compatible fields.
- GF(28) is modulo hex 11D, GF(162) modulo hex 142 (1 X2 + 4 X + 2), GF(16) = GF(24) modulo hex 13:
i GF(2^8) αi GF(16^2) αi 0 01 01 1 02 10 2 04 42 3 08 18 4 10 c2 8 1d 99 19 03 11 28 18 da 68 0d 5b 253 47 1d 254 8e 92
- As an example of compatible, note that in both fields, α0 + α1 = α19, α3 + α4 = α28 and α4 + α8 = α68
- An old Word 6.0 / 95 document that explains this with a couple of examples: m8to44.doc . All of this can be implemented in hardware, including circuits for the 8 bit mapping, the GF(24) math: a + b, a × b, 1 / a, a2, and mapping back to the 8 bit field, as a series of stages of about 340 gates (what I was told by the resident RS expert at the time), with a single progation delay. Rcgldr (talk) 13:30, 26 October 2011 (UTC)
- Ok, I got utterly confused for a moment, so let's get things straight. I was initially assuming that α would be the same for both fields, but obviously the alpha in the other group (let's call it β) is a different generator. Let m be the isomorphism that maps elements from one field to the other such that m(α) = β. You want the following to hold:
- If αj * αk = αl then βj * βk = βl, as well as
- If αj + αk = αl then βj + βk = βl.
- Let's focus on the second line. By the isomorphism, βj + βk = m(α)j + m(α)k = m(αj) + m(αk) = m(αj + αk) = m(αl) = m(α)l = βl. So this is all about finding the right generator β. Can you give a polynomial for which you think this isomorphism does not hold? I suspect your other polynomials might not be irreducible. Nageh (talk) 20:27, 26 October 2011 (UTC)
- Well, there is another explanation, which is that addition in GF(162) does not match that of GF(28) (i.e., xor). While it must hold that βj + βj = 0 for any j I'd be interested to see such an addition table. So coming back to what you were trying to describe, you are looking for is constructing the finite field GF(p^m) not as a polynomial ring over GF(p) but over GF(p^l) for some l|k, and where addition can be carried out modulo 2 (xor). Nageh (talk) 20:47, 26 October 2011 (UTC)
- For GF(28) modulo hex 11d, GF(16) modulo hex 13, try GF(162) modulo hex 119 (1 X2 + 1 X + 9):
- List of the 60 non-compatible modulo values with α = 1 x + 0:
- 119 11b 11d 11e 122 129 12d 12e 133 139
- 13b 13e 144 14b 14d 14e 155 159 15b 15d
- 162 163 164 165 16b 16d 172 173 174 175
- 179 17e 183 184 18d 192 193 19b 19e 1a2
- 1a4 1a9 1b4 1b5 1bd 1be 1c3 1c5 1ce 1d4
- 1d5 1d9 1db 1e2 1e3 1e9 1ed 1f2 1f5 1fb
- List of the 4 compatible module values with α = 1 x + 0: 125 134 142 153
- Ok, I got utterly confused for a moment, so let's get things straight. I was initially assuming that α would be the same for both fields, but obviously the alpha in the other group (let's call it β) is a different generator. Let m be the isomorphism that maps elements from one field to the other such that m(α) = β. You want the following to hold:
- Getting back to that bit of trivia using GF(28) modulo hex 11d, GF(28) = 1 x8 + 1 x4 + 1 x3 + 1 x2 + 1
- In GF(16) modulo hex 13, the product of the 4 compatible polynomials
- (1 x2 + 2 x + 5) (1 x2 + 3 x + 4) (1 x2 + 4 x + 2) (1 x2 + 5 x + 3) = 1 x8 + 1 x4 + 1 x3 + 1 x2 + 1
- isomorphism with addition (exclusive or) - At least part of the reason for this requirement is that the mapping from GF(28) to GF(162) and back is to be done with 8 bit by 8 bit matrices (otherwise there's no point if mapping uses a 256 byte lookup table). This means that every symbol in each field can be generated by summing some combination of the symbols in the 8 bit by 8 bit mapping matrices. For this strategy to work, if αz = αa + αb + ... + αh in GF(28), then βz = βa + βb + ... + βh in GF(162).
- Note that there are 4 other compatible fields with α = 1 x + 1 : modulo hex 126, 136, 147, 157, but these would involve more hardware. Generally the goal is to minimize the number of non-zero bits in the mapping polynomials. Also that trivia bit doesn't apply in this case, the product of those 4 poynomials translates into hex 11b, not 11d. Rcgldr (talk) 14:04, 27 October 2011 (UTC)
- All of this and what you say in your document makes sense. But I'm stumbling over the isomorphism that you describe. How do you find it? By trial and error? Or do you compute the modulo polynomial for GF(16^2) somehow? And for those non-compatible isomorphisms, is it because the generator does not match x (i.e., hex 10) or because addition in those isomorphisms does not match xor? Nageh (talk) 21:14, 27 October 2011 (UTC)
- by trial and error. I select a GF(16) (GF(24)) field, usually modulo hex 13. Then I start with an arrary of 256 bytes, hex 00 to hex ff and consider each byte to be a pair of nibbles a and b, each representing a potential GF(162) polynomial 1x2 + ax + b . I then eliminate all non-prime pairs by calculating (x-c)(x-d) for all 256 combinations of nibbles c and d, removing the entries matching (x-c)(x-d) from the array. Next I set β to 1x + 0 (hex 10), and test each entry to see if it takes 255 cycles to: set e = 1; loop e = e*β modulo GF(162); while e ≠ 1. If it doesn't take 255 cycles, I try the next entry in the array. If it does take 255 cycles, then 1x2 + ax + b is primitive with β = 1x + 0 (hex 10). I then test mapping by creating exponential tables for GF(28) (this is only done once) and GF(162), then creating the mapping matrix from the first 8 entries in GF(162) exponential table. I then iterate e = hex 08 to hex ff, f = αe in GF(28); map f from GF(28) to g in GF(162); then check if g = βe in GF(162). If this holds true for all e hex 08 to hex ff, then then that instance of 1x2 + ax + b is mappable or compatible with the GF(28) polynomial I'm trying to map. If it fails at this point, it's probably because of setting β = 1x+0 or xor, I'm not sure if there are other mapping issues. The last step is to actually test the invesion algorithm as explained in m8to44.doc to confirm it works.
- Using the trivia trick, in the case of GF(28) = 1 x8 + 1 x4 + 1 x3 + 1 x2 + 1, I could try doing a polynomial divide of GF(16) 1 x8 + 1 x4 + 1 x3 + 1 x2 + 1 by a potential GF(16) 1x2 + ax + b to see if it's one of the 4 roots with α = 1x + 0 (hex 10). The same method should work with any GF(28) with α = 1x + 0 (hex 2) (I've confirmed this for GF(28) hex 11d and 187). Actually testing the inversion algorithm would still be used to confirm it works. Rcgldr (talk) 00:41, 28 October 2011 (UTC)
- I updated my test program to test all 3 GF(16) ((GF(24) hex 13:α=2, 19:α=2, 1f:α=3) with all 16 GF(28) with α = 1x+0 (hex 2), producing 4 GF(162) with β = 1x+0 (hex 10) for each of the 48 combinations, and the trivia rule held: the product of the 4 compatable GF(162)'s ends up with the same polynomial coefficients as GF(28). It seems that none of the 14 GF(28) with α ≠ 1x+0 are mappable, even for β = or ≠ 1x+0 (unless my program has a bug). I used a second program to verify the inversion algorithm worked for all 192 (48*4) cases. Rcgldr (talk) 12:55, 29 October 2011 (UTC)
- Using the trivia trick, in the case of GF(28) = 1 x8 + 1 x4 + 1 x3 + 1 x2 + 1, I could try doing a polynomial divide of GF(16) 1 x8 + 1 x4 + 1 x3 + 1 x2 + 1 by a potential GF(16) 1x2 + ax + b to see if it's one of the 4 roots with α = 1x + 0 (hex 10). The same method should work with any GF(28) with α = 1x + 0 (hex 2) (I've confirmed this for GF(28) hex 11d and 187). Actually testing the inversion algorithm would still be used to confirm it works. Rcgldr (talk) 00:41, 28 October 2011 (UTC)
- There must be a more elegant approach than trial-and-error, but I'll think about it when I find enough time. If you cannot find any compatible setting with β ≠ 1x+0 (and assuming the modulo polynomial is irreducible) this means that addition simply does not match exclusive-or, and indeed what you're looking for is an isomorphism where addition can be carried out modulo 2. Minor detail, you do not need to loop 255 times to test for a generator, it suffices to test up to α17 ≠ 1. So, to conclude this obviously interesting discussion, am I right to assume that this is some insider trick that you came up with? In that case our sourcing policies obviously do not permit adding such material to the article. Or is it referencable to secondary sources? In that case we might think about adding a section on implementation aspects, though honestly I think that they are too specific for this otherwise quite general article (which is still lacking in much more basic aspects). Nageh (talk) 19:47, 29 October 2011 (UTC)
- insider trick? - No, I've seen this method used at multiple companies, hard drive (GF(210) -> GF(322)), tape drive (GF(28) -> GF(162)), chip set makers (both versions), ... . This method dates back to at least 1995 (used in some DAT (tape) drives). I don't know who the inventor is. If there was a common source that distributed information about this and other methods in my local area, it might have been CMRR (UC San Diego - there was a professor Jack Keil Wolf (current status unknown)), and/or hawaii.edu (University of Hawaii - there was a professor E J Weldon Jr (retired)). I assume other sources are aware of this and other methods, but most of my external sources were people from CMRR or U of Hawaii.
- If you cannot find any compatible setting with β ≠ 1x+0. The issue is any of the 14 GF(28) where α ≠ 1x+0, none of them are mappable. All 16 GF(28) with α = 1x+0 can be mapped with β = 1x+0, and 10 of them can also be mapped with β ≠ 1x+0, but I see no reason to use β ≠ 1x+0. Every binary (xor) type RS implementation I've seen only uses GF()s where α = 1x+0. Rcgldr (talk) 22:19, 29 October 2011 (UTC)
- The issue is any of the 14 GF(28) where α ≠ 1x+0, none of them are mappable - These GF()'s are mappable. For each of the 14 GF(28) where α ≠ 1x+0, there are 8 out of 128 α (generators) that will map to any specific 16 GF(162) with β = 1x+0. Rcgldr (talk) 17:42, 4 November 2011 (UTC)
- unindent - This method of using sub-fields for inversion is apparently well known by the experts working with an encryption scheme known as AES that includes inversion of symbols in GF(28) = 1 x8 + 1 x4 + 1 x3 + 1 x + 1 (hex 11b) with α = 1x+1. One implementation of AES GF() inversion combined with the encryption mapping is known as the greedy algorithm. A nested sub-field scheme is used going down to 2 bit symbols: using my notation from above, GF(28) is mapped to GF(162), each GF(16) 4 bit symbol is mapped to GF(42), (2 bit symbols) where 1/a = a2. I found a web site with a pdf file by David Canright that optimizes the greedy algorithm by grinding through a large number of possible mapping implementations to find minimal gate count A_Very_CompactS-box_for_AES.pdf. My program failed to map any GF(28) with α ≠ 1x+0. The apparent issue may be that my program restricts the mapping matrix to the first few powers of β or perhaps it's just a bug. If and when I get the time, I'll fix my program. It seems there's already been a lot of research into optimized inversion for AES, so I'll search for an answer rather than try to re-invent a solution on my own. Rcgldr (talk) 10:18, 30 October 2011 (UTC)
- I also did a web search for Galois field inversion, and was suprised by multiple patents that seem to be based on the obvious method of exponentiating a number a in GF(2n), 1/a = am where m = 2n-2. In the case of n=8, m=254 and 1/a = a2 a4 a8 a16 a32 a64 a128. The wiki article mentions this method as well as a Euclidean method: Finite_field_arithmetic. Rcgldr (talk) 09:07, 30 October 2011 (UTC)
- Check chapter 6 of this thesis. I am no more surprised about the incredible number of utterly trivial patents. It just shows how broken the patent system is. Nageh (talk) 12:14, 30 October 2011 (UTC)
- all fields of the same size are isomorphic - the issue is determining how to map elements between two fields so they are isomorphic: map(a+b) = map(a) + map(b) and map(a b) = map(a) map(b). The mapping is done by a matrix that is generated based on the primitive elements of the two fields. Generally the parameters including the primitive element of the field being mapped to are chosen for efficiency, and some type of brute force search is done for any primitive element of the field being mapped from that will result in an isomorphic mapping matrix. Rcgldr (talk) 18:47, 4 August 2020 (UTC)
Other algorithms used for RS codes - hardware inversion - GF(28) where α ≠ 1x+0
From what I've read, finding a sub-field and mapping matrix for GF(28) where α ≠ 1x+0, involves more trial and error. For example, there are 128 possible α (generators) for GF(28) = 1 x8 + 1 x4 + 1 x3 + 1 x + 1 (hex 11b). One document I read chose GF(162) = 1 x2 + 1x + 9, with β = 1x+0; Then in what is described as brute force search through the 128 α for GF(28), one is searched for so that the mapping function M(a) + M(b) = M(a + b), where + is xor. For each trial case of α and β, two matrices are created, one is the powers 0 to 7 of α the other is powers 0 to 7 of β, with the symbols stored as columns in the two matrices left to right. Probably any set of 8 (sequential?) powers would work, I also tested reversing the matrices and using powers 1 to 8, and both variations resulted in an identical mapping matrix. The matrix mapping equation to be solved is
|map| |powers of α 0 to 7| = |powers of β 0 to 7|
|map| = |powers of β 0 to 7| |powers of α 0 to 7|-1.
As mentioned, each of the 128 α are tried until M(a) + M(b) = M(a + b), where + is xor. In the document I read success was found with α = 1 x7 + 1 x6 + 1 x5 + 1 x4 + 1 x3 + 1 x2 + 1 x + 1 (hex ff). I confirmed that this mapping matrix worked for inversion. I'm not sure how GF(162) or β was chosen. Rcgldr (talk) 07:17, 31 October 2011 (UTC)
- I'm not sure how GF(162) or β was chosen. - The choice is arbitrary. Turns out that for any GF(28) where α ≠ 1x+0, there are 8 out of 128 α (generators), each of which will be compatible with a specific GF(162) with β = 1x+0, so the choice for α, GF(16), and GF(162) that reduces the amount of hardware is chosen. Rcgldr (talk) 17:44, 4 November 2011 (UTC)
In the cases where α = 1x+0, the matrices can be reversed so the columns of GF(28) are powers 7 to 0 of α which is the identity matrix. So the columns of the mapping matrix for α = 1x+0 will always be powers 7 to 0 of β Rcgldr (talk) 12:07, 31 October 2011 (UTC)
- Examples of the effort put into sub-field mapping for inversion (1/x) as used in AES S-box. This time mapping an 8 bit field into two 4 bit fields then in into four 2 bit fields:
- A Very Compact S-box for AES
- A Very Compact S-box for AES - Power Point
- Rcgldr (talk) 21:36, 4 September 2015 (UTC)
- Correction about mapping matrices. This is easier to explain for binary fields, such as mapping from GF(2^8) to GF((2^4)^2) or to GF(((2^2)^2)^2). The mapping matrix column indexes correspond to the values hex {80 40 20 10 08 04 02 01}. Let α(x) represent a primitive element of GF(2^8), and β(x) represent the primitive element chosen for the field to be mapped to, then the column values of the mapping matrix are β^logα(x){80 40 20 10 08 04 02 01} . I created an explanation of this in composite field mapping example pdf . Rcgldr (talk) 17:50, 4 August 2020 (UTC)
Add a section for Raid 6 and Jerasure.
The current article mentions Raid 6 just once via a link. There is mention of erasure (only) codes, but no example, such as Jerasure. Jerasure appears to be used for some forms of data storage such as cloud base sites. Doing a web search, typical implementations involve a matrix similar to a systematic original view encoding matrix: original view systematic encoding, in this case, using a modified Vandermonde matrix. The actual encoding matrix depends on the implementation. Decoding of data for a RS(n,k) code takes k rows of the encoding matrix corresponding to the first k non-erased rows of data and parity, inverts it, then uses the first e rows of the inverted matrix to do a e by k matrix multiply to regenerate the erased data, where e is the number of erasures in data. Any erasures in parity rows are regenerated by re-encoding once any erased data has been regenerated. If the encoding matrix is Raid 6 or an extension of Raid 6, such as 3 parity Raid 6, then XOR can be used for single data erasure instead of k by k matrix inversion followed by matrix multiply of 1 by k matrix.). Note that Raid 6 encoding generates syndromes instead of BCH view parities, so Raid 6 encoded columns are not true codewords, but instead data followed by syndromes. Rcgldr (talk) 21:15, 31 August 2020 (UTC)
Movement of original view decoders to after BCH view decoders
I don't mind this move, since BCH view is much more common, and practical original view decoders were developed long after practical BCH view decoders, however, @Leijurv: edit comment for the move is "the PGZ decoder is much more approachable and better explained than the original view, which is just equations with no derivation, logic, or commentary", which conflicts with the fact that Berlekamp–Welch_algorithm has it's own article and is reasonably explained (some of this via references), and the Gao decoder [Gao_RS.pdf] algorithm is well explained although both BCH view and original view Euclid decoders do not include a derivation for why they work. These are explained in text books. I'll see if I can find any online references for these. Rcgldr (talk) 02:47, 31 August 2020 (UTC)
- Regardless of what's linked, the BW and Gao decoders in this article present nothing but a numerically worked example. That's just what I was going off of. It's just night and day compared to the PGZ decoder's coverage in this article. Do you disagree? I think the PGZ decoder is explained very well in here and I can follow it from beginning to end, not just how it works but why it works. It's a great example of WP:ONEDOWN. I don't really think the BW article is comparable. I think having a numeric example in the article, and an explanation of what it's doing and why in a reference, is putting the cart before the horse. Leijurv (talk) 07:11, 31 August 2020 (UTC)
- The BCH view Berlekamp Massey is mostly a worked example with a link to Berlekamp–Massey_algorithm. Note that BCH view Berlekamp Massey, BCH view Sugiyama extended Euclid algorithm, original view Berlekamp Welch algorithm, and original view Gao algorithm are all based on some key equation. The issue is that including what's involved for these algorithms to the same level as what is shown for PGZ decoder seems like it would be excessive and redundant, since there are links that explain these decoders. Rcgldr (talk) 14:22, 31 August 2020 (UTC)
- Right, that's my point. The PGZ decoder is explained, here, well. The others are just a numerically worked example. I think it makes sense to put the well-explained one first in this article. I'm afraid I quite don't understand if/where we disagree regarding this...? Leijurv (talk) 22:17, 31 August 2020 (UTC)
- The original view theoretical decoder is also explained well, but the explanation just happens to be very simple: generate comb(n,k) potential polynomials using Lagrange interpolation from the comb(n,k) sets of k elements of the received message, and choose the most popular one. An argument for the move is that BCH view is much more common. An argument against the move is that the order of decoders is reversed from the order of encoders. I would like to get opinions from others on the move. Rcgldr (talk) 01:35, 1 September 2020 (UTC)
a method like Lagrange interpolation on various subsets of n codeword values taken k at a time to repeatedly produce potential polynomials, until a sufficient number of matching polynomials are produced to reasonably eliminate any errors in the received codeword.
I don't really understand this. The wording of "subsets of n values taken k at a time" is ambiguous. I might read "subset of 5 values" to mean a subset of cardinality 5 takes from a larger set, or it could be read as a subset of undeclared size taken from a set of size 5. I don't know what it means to have a polynomial match another. All coefficients equal? Or only some? What is a sufficient number? Is this a probabilistic or deterministic algorithm?errors in the codeword can be corrected, by recalculating the corresponding codeword values
They can be corrected by correcting them? How exactly?- After mulling it over for a while, my best guess is that it repeatedly takes subsets of size K from the codeword, interpolates a polynomial through those values, and marks down the resulting polynomial. After some time, the resulting polynomial that comes up most commonly will probably be correct, so you evaluate that polynomial at the original points and you get back your message. But that's very unclear and I still don't know what is "sufficient" and if this can be made deterministic. Leijurv (talk) 02:00, 1 September 2020 (UTC)
All coefficients equal?
Yes, all equal.what is sufficient?
The choice is arbitrary, but generating polynomials for all combinations of n elements taken k at a time, then choosing the most popular one would be "sufficient" (unless the number of errors >= Hamming distance).If this can be made deterministic
- The practical decoders developed later for original view are deterministic. As for the original "original view" decoder, after choosing a polynomial, the polynomial could be used to regenerate the message and verify that the number of mismatches, which would be errors, is not >= Hamming distance. This isn't practical if the number of combinations of n elements taken k at a time is large. I don't know if this was ever used outside of an educational environment. By 1963 or sooner, Reed Solomon was switched to using BCH view with the practical PGZ decoder used for BCH (originally bit oriented) code, with time complexity O((n-k)^3). Later improved BCH view decoders were developed: 1969 (Berlekamp Massey), and 1975 (Sugiyama extended Euclid). It wasn't until 1986 when Berlekamp Welch was developed as a practical decoder for the original view, and 2002 when Gao's extended Euclid decoder for original view was developed. So called list decoders that are essentially an optimization of the original "original view" decoder are used to "guess" the polynomial when the number of errors >= Hamming distance. Rcgldr (talk) 04:17, 1 September 2020 (UTC)- That makes sense. The history is cool too. However, I wasn't really asking for my own enlightenment :) I was sort of able to piece it together. My questions were more rhetorical - shouldn't the article answer these questions? Maybe they're dumb things, but idk: WP:ONEDOWN. Leijurv (talk) 08:37, 1 September 2020 (UTC)
I would like to get opinions from others on the move.
I mean if you don't like it you can absolutely put it back :) Leijurv (talk) 02:00, 1 September 2020 (UTC)- I see one pro and one con for the move, so I don't have an issue with either the prior or current ordering. Others might have an opinion on this. Rcgldr (talk) 04:28, 1 September 2020 (UTC)
- The original view theoretical decoder is also explained well, but the explanation just happens to be very simple: generate comb(n,k) potential polynomials using Lagrange interpolation from the comb(n,k) sets of k elements of the received message, and choose the most popular one. An argument for the move is that BCH view is much more common. An argument against the move is that the order of decoders is reversed from the order of encoders. I would like to get opinions from others on the move. Rcgldr (talk) 01:35, 1 September 2020 (UTC)
symbols are used before they are defined
In the second paragraph of the page, one can read that it uses "t = n -k". In this paragraph, neither n nor k are defined. the same paragraph introduces symbol "b" with no reference or explanation and "Lt / 2" where "L" is some unexplained symbol.
Therefore, readers must be experts to already know what theses symbols usually refer to.
Shouldn't there be at least a link to a page, or a direct explanation for each of these symbols ?
86.253.66.57 (talk) 06:18, 20 November 2020 (UTC)
modern
It is stated that Reed-Solomon is being replaced by more "MODERN" LDPC codes... Both were developed in 1960. LDPC is not more modern... From wikipedia on LDPC... LDPC codes are also known as Gallager codes, in honor of Robert G. Gallager, who developed the LDPC concept in his doctoral dissertation at the Massachusetts Institute of Technology in 1960. — Preceding unsigned comment added by 24.141.52.159 (talk) 14:57, 31 December 2021 (UTC)
Isn't this a very badly written article?
The article is written is of a highly technical nature and shows no empathy for a reader of an cyclopedia. — Preceding unsigned comment added by 130.76.96.111 (talk) 23:10, 1 March 2013 (UTC)
- Perhaps there should be a comment that one of the external links is for a tutorial, Tutorial on Reed–Solomon Error Correction Coding, which is currently the last external link on the article. Rcgldr (talk) 23:34, 1 March 2013 (UTC)
- It would be helpful to point to specific parts of the article, argue why you think they are badly written, and, if you can, suggest improvements. Thanks! ylloh (talk) 18:13, 6 March 2013 (UTC)
- I agree that the article is poorly written. This is an article written for a specialist, not for an encyclopedia. An article should be understandable by people who don't already know the subject. But after reading this article, I don't have the faintest idea how this works.12.166.73.35 (talk) 11:23, 12 May 2015 (UTC)
- Five years later it's still far too complicated for anyone without degree-level mathematics. I suggest starting with the lead and rewriting it so that it does not use any mathematical symbols. 83.104.249.240 (talk) 03:02, 29 April 2018 (UTC)
- Its fairly complicated and technical, and that's unfortunate - but it seems comparable "math" or "FEC algorithms" articles aren't much better in this regard. Its probably due to quite complicated and unusual nature of underlying math, etc. CS papers describing ReedSolomon aren't very easy to read either and assume reader got background enough to understand some relatively advanced/unusual math and so on. — Preceding unsigned comment added by 195.140.194.170 (talk) 02:33, 25 May 2022 (UTC)
Syndrome decoding change
@Rcgldr: I am not sure that this edit Special:Diff/1081403675 made the equation better. The idea of "evaluate the polynomial at " is very simple and intuitive. Specifically, the text says The decoder starts by evaluating the polynomial as received at points [...]. We call the results of that evaluation the "syndromes"
. This is directly equivalent to , which is what it used to say, I think it was fine that way. Please correct me if I am missing something, but I don't think it's as obvious that this is equal to , and I'm not sure why it is helpful to say so. May I revert? Leijurv (talk) 00:25, 30 June 2022 (UTC)
- @Leijurv: - Note the prior statement: "As a result of the Reed-Solomon encoding procedure, s(x) is divisible by the generator polynomial g(x): "
- Since s(x) is divisible by the generator polynomial g(x), then it's also divisible by each of the factors : , and the remainders of of r(x) divided by each of the factors of g(x): are the syndromes. If you do a few steps via long hand division, you'll see that , an optimization. Without this explanation, it's not clear how the fact that s(x) is divisible by the generator polynomial g(x) translates in to generating syndromes using . So the issue here is a way to explain that . I'll look at my old RS code books to see if they offer a simple explanation for this. Rcgldr (talk) 21:31, 2 July 2022 (UTC)
- Right, I see that bit where g is defined in terms of that . However, the "upshot" is that g has roots at . See the following statement in the article:
Since s(x) is a multiple of the generator g(x), it follows that it "inherits" all its roots
. - Basically, when we say
The decoder starts by evaluating the polynomial as received at points . We call the results of that evaluation the "syndromes"
, we should follow it up with an equation describing what happens when we do that, when we evaluate . It's a definition: is how we define the computation of . Putting the polynomial mod in between there is jumping the gun, it belongs later. Specifically, it explains the equality of . So,I've made this edit to explain where that fits in, what do you think? Leijurv (talk) 05:39, 3 July 2022 (UTC)- Actually, on second thought, I also think that the introduction of "x" is confusing. We're evaluating a polynomial at a specific location, doing it in terms of "r(x) mod" is confusing. I'm not sure what you mean when you just said "using "... that's not what's happening in this specific decoder procedure. The syndromes are numeric values (elements of the field under which we are operating). They aren't polynomials themselves, there is no x term? There's no need to introduce x in order to explain the idea of "if is a root of a polynomial, then evaluating that polynomial at comes out to zero". In other words, there's better ways to explain this than introducing the mod. I think that the sentence in the previous section,
Since s(x) is a multiple of the generator g(x), it follows that it "inherits" all its roots
, is much better. That's what really matters here. I've made another edit here, I think this is even better because this "mod" business is correctly placed in the logical flow, showing the result that it's meant to. Leijurv (talk) 05:43, 3 July 2022 (UTC)- The change you made is OK. I corrected my prior comments which should have been I'm only aware of one person at some forum asking why evaluating syndromes via works. Also some textbooks introduce codes based on remainder theorem, noting that BCH type (not original view) Reed Solomon is one such code. As far as syndromes not being polynomials, any finite field polynomial mod $$ won't have an ``x`` coefficient, and more generally, the coefficients of any finite field polynomial are numeric values. It's also possible to decode using , and doing a matrix multiply by a constant matrix to map the re-encoded parities into syndromes (I've seen this used in a few cases back in the 1980's, but not recently). I assume a reference is not needed to explain that if a polynomial has a factor , then is a root of that polynomial. Rcgldr (talk) 06:31, 3 July 2022 (UTC)
- Ahhhh, I see. Because it's "mod" a degree-1 polynomial, the remainder ought to be degree-0, which will be a constant with no x term. Got it, thanks! Leijurv (talk) 06:52, 3 July 2022 (UTC)
- The change you made is OK. I corrected my prior comments which should have been I'm only aware of one person at some forum asking why evaluating syndromes via works. Also some textbooks introduce codes based on remainder theorem, noting that BCH type (not original view) Reed Solomon is one such code. As far as syndromes not being polynomials, any finite field polynomial mod $$ won't have an ``x`` coefficient, and more generally, the coefficients of any finite field polynomial are numeric values. It's also possible to decode using , and doing a matrix multiply by a constant matrix to map the re-encoded parities into syndromes (I've seen this used in a few cases back in the 1980's, but not recently). I assume a reference is not needed to explain that if a polynomial has a factor , then is a root of that polynomial. Rcgldr (talk) 06:31, 3 July 2022 (UTC)
- Actually, on second thought, I also think that the introduction of "x" is confusing. We're evaluating a polynomial at a specific location, doing it in terms of "r(x) mod" is confusing. I'm not sure what you mean when you just said "using "... that's not what's happening in this specific decoder procedure. The syndromes are numeric values (elements of the field under which we are operating). They aren't polynomials themselves, there is no x term? There's no need to introduce x in order to explain the idea of "if is a root of a polynomial, then evaluating that polynomial at comes out to zero". In other words, there's better ways to explain this than introducing the mod. I think that the sentence in the previous section,
- Right, I see that bit where g is defined in terms of that . However, the "upshot" is that g has roots at . See the following statement in the article:
Hack face book
Jane Kelly tisoy daan — Preceding unsigned comment added by 183.171.127.231 (talk) 12:36, 24 September 2022 (UTC)
Nomenclature never described (x, y)
Lots of use of (a, b) shorthand to describe some parameters of the code, but the meaning of this shorthand is never explained (and is the one thing I came here to find out.) What is RS(x, y)? 149.32.224.51 (talk) 19:46, 9 August 2023 (UTC)
Alphabet size (q) vs block size (n)
the side bar says q>=n but shouldn't it be q>n? I'm not an expert so I won't change the page, but I don't see how you can make a rs code where q=n. — Preceding unsigned comment added by 205.175.106.101 (talk) 01:07, 13 December 2023 (UTC)