Module talk:Lang/data
![]() | Module:Lang/data is permanently protected from editing because it is a heavily used or highly visible module. Substantial changes should first be proposed and discussed here on this page. If the proposal is uncontroversial or has been discussed and is supported by consensus, editors may use {{edit template-protected}} to notify an administrator or template editor to make the requested edit.
|
This is the talk page for discussing improvements to the Lang/data module. |
|
Archives: 1Auto-archiving period: 3 months ![]() |
![]() | This module does not require a rating on Wikipedia's content assessment scale. It is of interest to the following WikiProjects: | |||||||
|
Edit request 8 January 2025
[edit]![]() | This edit request has been answered. Set the |answered= or |ans= parameter to no to reactivate your request. |
Description of suggested change:
Diff:
− | ["fr-ca"] = | + | ["fr-ca"] = "Canadian French", |
Northern Moonlight 00:22, 8 January 2025 (UTC)
- Why?
{{lang-fra-que}}
was a wrapper template around this:{{Language with name|fr-CA|Quebec French|{{{1}}}}}
- Replacing
{{{1}}}
withtext
:{{Language with name|fr-CA|Quebec French|text}}
- Quebec French: text
- When that template was deleted in favor of
{{langx}}
,["fr-ca"] = "Quebec French"
was added to Module:Lang/data so that there would be no visual changes to articles:{{langx|fr-CA|text}}
- Quebec French: text
- Is there discussion somewhere that has achieved consensus to make this change? Link?
- —Trappist the monk (talk) 00:43, 8 January 2025 (UTC)
Edit request 24 March 2025
[edit]![]() | This edit request has been answered. Set the |answered= or |ans= parameter to no to reactivate your request. |
Description of suggested change: Add support for additional proto-languages, under their family's ISO 639-5 codes:
- Proto-Kartvelian:
ccs
- Proto-Uralic:
urj
I ran into the need to tag these languages while performing language cleanup in Laryngeal theory. I'm certain their articles would benefit from proper tagging, as well.
Diff:
− | ["ca-x-old"] = "Old Catalan",
["cel-x-combrit"] = "Common Brittonic", -- cel in IANA is Celtic languages | + | ["ca-x-old"] = "Old Catalan",
["ccs-x-proto"] = "Proto-Kartvelian", -- ccs in IANA is Kartvelian languages
["cel-x-combrit"] = "Common Brittonic", -- cel in IANA is Celtic languages |
− | ["sla-x-proto"] = "Proto-Slavic", -- sla in IANA is Slavic languages
["yuf-x-hav"] = "Havasupai", -- IANA name for these three is Havasupai-Walapai-Yavapai | + | ["sla-x-proto"] = "Proto-Slavic", -- sla in IANA is Slavic languages
["urj-x-proto"] = "Proto-Uralic", -- urj in IANA is Uralic languages
["yuf-x-hav"] = "Havasupai", -- IANA name for these three is Havasupai-Walapai-Yavapai |
EnronEvolvedMy Talk Page 22:32, 24 March 2025 (UTC)
{{lang|fn=name_from_tag|link=yes|ccs-x-proto}}
→ Proto-Kartvelian{{lang|fn=name_from_tag|link=yes|urj-x-proto}}
→ Proto-Uralic- —Trappist the monk (talk) 22:57, 24 March 2025 (UTC)
Edit request 24 March 2025
[edit]![]() | This edit request has been answered. Set the |answered= or |ans= parameter to no to reactivate your request. |
Description of suggested change: Add a language code for a couple more proto-languages, also using their groups' ISO codes:
- Proto-Finno-Ugric:
fiu
- Proto-Samic:
smi
I hear Proto-Finno-Ugric a debatable proto-language these days, but I'm running into the need to tag it in Laryngeal theory.
Diff:
− | ["egy-x-old"] = "Old Egyptian",
["gem-x-proto"] = "Proto-Germanic", -- gem in IANA is Germanic languages | + | ["egy-x-old"] = "Old Egyptian",
["fiu-x-proto"] = "Proto-Finno-Ugric", -- fiu in IANA is Finno-Ugric languages
["gem-x-proto"] = "Proto-Germanic", -- gem in IANA is Germanic languages |
− | ["sem-x-taymanit"] = "Taymanitic",
["sla-x-proto"] = "Proto-Slavic", -- sla in IANA is Slavic languages | + | ["sem-x-taymanit"] = "Taymanitic",
["smi-x-proto"] = "Proto-Samic", -- smi in IANA is Samic languages
["sla-x-proto"] = "Proto-Slavic", -- sla in IANA is Slavic languages |
EnronEvolvedMy Talk Page 23:32, 24 March 2025 (UTC)
{{lang|fn=name_from_tag|link=yes|fiu-x-proto}}
→ Proto-Finno-Ugric{{lang|fn=name_from_tag|link=yes|smi-x-proto}}
→ Proto-Samic- —Trappist the monk (talk) 00:15, 25 March 2025 (UTC)
@Trappist the monk: I am curious what you think of the Belarusian Latin alphabet AKA "łacinka". The IANA language-subtag-registry for BCP47 does not seem to say much in this regard. For "be", I could only find variants "be-1959acad" and "be-tarask" and that "Cyrl" script should be suppressed with "be" (but not "Latn"). Since some Belarusian seems to actually be/have been originally written in "łacinka" (vs. transliterated for readers of Latn scripted languages) is this better as a variant via something like "be-łacinka" (I am not sure that technically qualifies due to the "ł") or a romanization via something like "be-Latn-łacinka"? And should "łacinka" be added here as a transliteration addition to translit_title_table
? What is the best way to markup such text: with a {{lang|be-Latn-łacinka|...}}
or {{translit|be|łacinka|...}}
or something else? Thank you, —Uzume (talk) 18:23, 31 March 2025 (UTC)
- From the point of view Module:Lang, latn script is latn script regardless of alphabet so the general case is
{{lang|be-latn|łacinka text}}
or{{langx|be-latn|łacinka text}}
. When the text is a łacinka-alphabetic romanization of Cyrillic Belarusian, you can use{{transl|be|łacinka}}
. So far as I know, łacinka is not a 'romanization standard' so is not supported by{{transl}}
. - We do not create variants like
1959acad
andtarask
because they must first be registered with IANA (there is no external standard from which variant subtags are derived). - If it is important to do so, you might consider creating a separate template like
{{lang-sr-Latn}}
which hard-codes the language label to link as[[Gaj's Latin alphabet|Serbian]]
. I don't think that easter-egging the language label is a good idea so the practice should be discouraged. - Łacinka is a latn script so should be simply marked up as a latn script.
- Did I answer your question?
- —Trappist the monk (talk) 22:33, 31 March 2025 (UTC)
- @Trappist the monk: Yes, pretty much. You seem to be advocating for
{{lang|be-Latn|łacinka text}}
and{{langx|be-Latn|łacinka text}}
and perhaps something likebe-Latn-latsinka
(wherelatsinka
is BGN/PCGN for лацінка or łacinka) if and when such a beast gets registered with IANA in much the same way aszh-Latn-pinyin
is althoughpinyin
seems to also be a romanization here as well. The only downside I see if that there is no real way to differentiate between{{langx|be|лацінка}}
(Belarusian: лацінка) and{{langx|be-Latn|łacinka}}
(Belarusian: łacinka) except for the fact that the latter is Latin script and thus gets automatically italicized. —Uzume (talk) 03:12, 1 April 2025 (UTC)
- @Trappist the monk: Yes, pretty much. You seem to be advocating for