Wikipedia talk:Manual of Style/Dates and numbers
Please stay calm and civil while commenting or presenting evidence, and do not make personal attacks. Be patient when approaching solutions to any issues. If consensus is not reached, other solutions exist to draw attention and ensure that more editors mediate or comment on the dispute. |
This project page does not require a rating on Wikipedia's content assessment scale. It is of interest to the following WikiProjects: | ||||||||||||||
|
It has been 165 days since the outbreak of the latest dispute over date formats. |
Recent edits
[edit]A string of edits by Jc3s5h and JMF. introducing and removing changes to Wikipedia:Manual of Style/Dates and numbers § Common mathematical symbols, raise issues that I believe should be discussed.
- The most recent change, permalink/1247903136, has the comment
This page does not cover matrix operations.
, however, I do not see anything in the article to support a restriction to numerical operations. - The most recent change reinstates the link to dot product, despite the comment.
- There seems to be disagreement on the division sign.
The questions that I wish to raise are
- Should that section mention {{tmath}} or
<math>...</math>
? - Are vector operations within the scope of the article? Regardless of the answer, the dot and cross products should be treated consistently.
- Should there be two new rows for dot and cross product?
- Should there be a row for tensor product?
- Is obelus unhelpful since it has three forms?
- Should the Division sign (U+00F7 ÷ DIVISION SIGN) be deprecated in favor of Slash (U+002F / SOLIDUS)?
- Should U+2215 ∕ DIVISION SLASH be explicitly deprecated in favor of Slash?
- Should the use of "x" and "*" as multiplication signs be explicitly deprecated in favor of U+00D7 × MULTIPLICATION SIGN?
- Should that section show the LaTeX markup for characters in addition to the HTML character entity references?
-- Shmuel (Seymour J.) Metz Username:Chatul (talk) 10:52, 27 September 2024 (UTC)
-
- I think the page should be devoted to general articles, and <math> should be reserved for advanced math and science articles.
- Vector operations are not currently in the scope of the project page, and I'm not thrilled about adding them.
- Dot product and cross product should certainly not be addressed in the same row as any scalar operation. The multiplication dot should certainly not be linked to the "Dot product" article nor should the multiplication cross be linked to the "Cross product" article.
- Tensor products should not be covered in this project page because they're too advanced.
- I'm not willing to spend 5 or minutes figuring out what this line means.
- The asterisk as a multiplication sign should be limited to articles about computer languages that use it as such.
- LATEX should not be mentioned, since we don't use it in Wikipedia. This isn't a style manual for writing outside of Wikipedia.
- Tbh, I wondered what this extensive list is doing in the MOS in the first place. Glossary of mathematical symbols does it better. It really needs to be reduced to cover only those symbols that have a styling issue: scalar division and multiplication.
- The grade-school division sign should be formally deprecated, for reasons explained at division sign.
- The 'ordinary' slash (002F) should be preferred over 2215, same logic as straight quotes and curly quotes.
- I prefer U+00D7 × MULTIPLICATION SIGN over x, for biology as well as math but maybe that needs debate.
- 𝕁𝕄𝔽 (talk) 20:04, 27 September 2024 (UTC)
- Comments:
- I see no good reason to prohibit using a division sign to express division. That seems absolutely fine. The division sign article seems to say it might be confusing in Italian, Russian, Polish, Danish, Norwegian, or Swedish, but this is the English Wikipedia. We use points as decimal separators also, and we use commas as a thousands separator too, although that might be confusing in other languages.
- I also see no good reason to prohibit using an asterisk for multiplication; it seems well-understood, easy to type, unambiguous, and common in practice. I agree with not using "x" for multiplication, although I think it's OK to express "by" relationships for 2x4 lumber, 4x8 sheets of plywood, and 4x4 trucks.
- <math>x</math> (i.e., ) looks different from ''x'' (i.e., x), and those look different from {{math|''x''}} (i.e., x), at least on my screen, and seeing mixtures of those in the same article can be a bit annoying (especially if they are near each other).
- — BarrelProof (talk) 21:46, 7 October 2024 (UTC)
- Asterisk means convolution (which is somewhat related to the idea of "multiplication" but should not be confused with the usual multiplication). Its use as a substitution for "×" or "⋅" is a bad habit from the old days of poor technology (but it was never used as such in professional typesetting) and has no excuse nowadays. — Mikhail Ryazanov (talk) 22:12, 7 October 2024 (UTC)
- Convolution would only be a matter to consider in very mathematically sophisticated specialized contexts. It's not something most people have ever encountered. Even for those who use it, it would often be expressed using summation or integration instead. — BarrelProof (talk) 22:21, 7 October 2024 (UTC)
- I don't think that this is a good reason to make exceptions to tolerate/promote sloppy typography (moreover, in some computer fonts the ASCII asterisk looks more like a superscript than a binary operator consistent with +, −, = and so on).
- I don't think we should feel responsible for how Wikipedia is rendered in all possible fonts. We should remember that everyone is supposed to be able to edit Wikipedia articles. In an article that isn't about mathematics, or at least isn't using it beyond the 10th grade level, f = 1.8 * c + 32 seems basically OK to describe conversion from degrees C to degrees F. It's tricky enough that we tell people to pay attention to the difference between "-", "–", "—", and "−", and to not use italics for the numbers in that formula, although I support those instructions. — BarrelProof (talk) 03:37, 8 October 2024 (UTC)
- Nobody should complain about otherwise good edits that include "lazy" typography. Those edits are 100% OK and a net improvement to Wikipedia. Other editors who care about typography and MoS can clean up the markup and character choices later. Wikipedia is a collaborative project. Indefatigable (talk) 15:46, 8 October 2024 (UTC)
- I don't think that this is a good reason to make exceptions to tolerate/promote sloppy typography (moreover, in some computer fonts the ASCII asterisk looks more like a superscript than a binary operator consistent with +, −, = and so on).
- Convolution would only be a matter to consider in very mathematically sophisticated specialized contexts. It's not something most people have ever encountered. Even for those who use it, it would often be expressed using summation or integration instead. — BarrelProof (talk) 22:21, 7 October 2024 (UTC)
- Using an asterisk to represent multiplication is programming language syntax; I don't think this is common or even well-known among non-programmers. isaacl (talk) 01:47, 20 October 2024 (UTC)
- I agree we should discourage use of "*" as a multiplication symbol. I agree it's easy to type, so if one editor writes "y = m*x + c" in an otherwise correct edit, the response should not be to revert that edit, but to replace it with "y = mx + c" or other approved alternative. Dondervogel 2 (talk) 10:40, 20 October 2024 (UTC)
- Using an asterisk for multiplication is absolutely known to non-programmers because that's what is used on the number pad on most keyboards in the US. --User:Khajidha (talk) (contributions) 14:28, 12 November 2024 (UTC)
- Ah, but which came first - the * key, or its use in mathematical expressions? Forty-some years ago, I was taught that in computer code, the
*
character was chosen to avoid confusion with the letterx
, since the×
did not exist in either of the character sets that were in use at the time - ASCII and EBCDIC. It's the same with/
vs.÷
and indeed-
vs.−
. --Redrose64 🌹 (talk) 18:15, 12 November 2024 (UTC)- * appeared on many (but not all) early typewriters. When not present it was often replaced by a fraction key (1/2, 1/4, etc) Practically every computer terminal from the 1970s onward has a * key - but that's probably due to it being used by Fortran (1957). Early teletype keyboards typically used Baudot code encoding and did not have * - but these were more for telecommunications rather than programming. Fortran was invented at IBM and used punch cards/tape using IBM's BCDIC. The early variations of BCDIC had *, - and / but not +. + was added soon after. My take is that BCDIC tried to encode whatever was commonly used on typewriters - subject to the limitation of using only 64 characters. Fortran then assigned functionality to whatever was in that set. * looked the most like x without being a letter, so it got the job. Stepho talk 23:56, 12 November 2024 (UTC)
- It would really behoove participants here, instead of just speculating from the armchair, to take the radical step of doing some research to actually find out the answer. * has been used, in math, to mean multiplication for three hundred years. See the bottom of p. 66 of [1]. EEng 07:15, 13 November 2024 (UTC)
- I didn't mention that paper, because I'm not in the habit of searching through 100-year-old academic journals. Now, 100-year-old magazines is a different matter, witness my stacks of boxes of The Railway Magazine back to 1902 (gaps between 1902 and 1939, complete from 1940 onward). --Redrose64 🌹 (talk) 12:02, 13 November 2024 (UTC)
- It would really behoove participants here, instead of just speculating from the armchair, to take the radical step of doing some research to actually find out the answer. * has been used, in math, to mean multiplication for three hundred years. See the bottom of p. 66 of [1]. EEng 07:15, 13 November 2024 (UTC)
- FORTRAN was a decade earlier than ASCII and EBCDIC. What the first FORTRAN compiler used was the scientific BCD character set of the IBM 704, which replaced the older Percent (%) and Lozenge (U+2311 ⌑ SQUARE LOZENGE) with parentheses. -- Shmuel (Seymour J.) Metz Username:Chatul (talk) 14:35, 13 November 2024 (UTC)
- * appeared on many (but not all) early typewriters. When not present it was often replaced by a fraction key (1/2, 1/4, etc) Practically every computer terminal from the 1970s onward has a * key - but that's probably due to it being used by Fortran (1957). Early teletype keyboards typically used Baudot code encoding and did not have * - but these were more for telecommunications rather than programming. Fortran was invented at IBM and used punch cards/tape using IBM's BCDIC. The early variations of BCDIC had *, - and / but not +. + was added soon after. My take is that BCDIC tried to encode whatever was commonly used on typewriters - subject to the limitation of using only 64 characters. Fortran then assigned functionality to whatever was in that set. * looked the most like x without being a letter, so it got the job. Stepho talk 23:56, 12 November 2024 (UTC)
- Ah, but which came first - the * key, or its use in mathematical expressions? Forty-some years ago, I was taught that in computer code, the
- Asterisk means convolution (which is somewhat related to the idea of "multiplication" but should not be confused with the usual multiplication). Its use as a substitution for "×" or "⋅" is a bad habit from the old days of poor technology (but it was never used as such in professional typesetting) and has no excuse nowadays. — Mikhail Ryazanov (talk) 22:12, 7 October 2024 (UTC)
Misleading shortcut
[edit]Wikipedia:Manual of Style/Dates and numbers#Common mathematical symbols indicates that its shortcut is "MOS:COMMONMATH", but in fact MOS:COMMONMATH links to Wikipedia:Manual of Style#Common mathematical symbols (a different section on a different page, although partially covering the same topic), which also indicates "MOS:COMMONMATH" as its shortcut. Perhaps one of them must be renamed. — Mikhail Ryazanov (talk) 00:47, 7 October 2024 (UTC)
- @Mikhail Ryazanov: I have traced it to this edit nearly two years ago by SMcCandlish (talk · contribs), which I have reverted. The two redirects MOS:COMMONMATH and WP:COMMONMATH were created on the same day in January 2014 (although about twenty hours apart), the first by BarrelProof (talk · contribs) and the second by Wavelength (talk · contribs) following this discussion. It seems that they were intentionally different - and have remained so ever since. If one of them should be repurposed to match the other after ten years, we would need a WP:RFD. --Redrose64 🌹 (talk) 09:00, 7 October 2024 (UTC)
- There must've been something that happened to instigate creation of those on the same day, but I have no recollection of it. — BarrelProof (talk) 09:17, 7 October 2024 (UTC)
- You'd observed that there are two MOS sections on the symbols and suggested merging them, Wavelength responded that both locations are appropriate and we could have two shortcuts instead, and no-one else said anything. NebY (talk) 11:26, 7 October 2024 (UTC)
- Thanks for the refresher. I think the two sections ought to at least mention each other in hatnotes, if not be merged. I just added the mentions. It is confusing that both of them are part of the MOS and both of them are sections of the MOS with the same heading: "Common mathematical symbols". Maybe they should become MOS:COMMONMATH1 and MOS:COMMONMATH2?? Is there some way to express the difference between the purposes of those two? I notice that one of those is part of Wikipedia:Manual of Style/Dates and numbers but says nothing at all about dates and numbers, so I suggest that it be merged into the other one. Mathematics is not synonymous with numbers. That section is about expressing operations and relationships and formatting variable names, not numbers. — BarrelProof (talk) 17:50, 7 October 2024 (UTC)
- Better with hatnotes, yes. Though mathematics != numbers, MOSNUM seems the natural place where readers might look for guidance on the symbols; after all, the less mathematically sophisticated we are, the more likely we are to think of the operators as things we use with numbers. I'd expected that MOSNUM would be more detailed but there's extra content in MOS too, so that's not a useful distinction. The chatty one and the formal one? NebY (talk) 20:10, 7 October 2024 (UTC)
- Thanks for the refresher. I think the two sections ought to at least mention each other in hatnotes, if not be merged. I just added the mentions. It is confusing that both of them are part of the MOS and both of them are sections of the MOS with the same heading: "Common mathematical symbols". Maybe they should become MOS:COMMONMATH1 and MOS:COMMONMATH2?? Is there some way to express the difference between the purposes of those two? I notice that one of those is part of Wikipedia:Manual of Style/Dates and numbers but says nothing at all about dates and numbers, so I suggest that it be merged into the other one. Mathematics is not synonymous with numbers. That section is about expressing operations and relationships and formatting variable names, not numbers. — BarrelProof (talk) 17:50, 7 October 2024 (UTC)
- You'd observed that there are two MOS sections on the symbols and suggested merging them, Wavelength responded that both locations are appropriate and we could have two shortcuts instead, and no-one else said anything. NebY (talk) 11:26, 7 October 2024 (UTC)
- There must've been something that happened to instigate creation of those on the same day, but I have no recollection of it. — BarrelProof (talk) 09:17, 7 October 2024 (UTC)
- Usually having "MOS:FOO" and "WP:FOO" go to two different places is fine; the very reason we have the "MOS:" namespace (formerly pseudo-namespace) for MoS shortcuts is that MoS pages were sucking up too many of the mnemonically meanful shortcut strings in which "WP:FOO" would for more editors bring to mind some non-MoS "WP:"-namespace material. Yes, use a disambiguation hatnote as needed; we have those for a reason. However, in this case, both targets are MoS sections, so both shortcuts should go to the same place, presumably the more detailed material. If the stuff at Wikipedia:Manual of Style#Common mathematical symbols is simply a nutshell summary of Wikipedia:Manual of Style/Dates and numbers#Common mathematical symbols (which is probably the case and should be the case) then the former needs no shortcut at all. — SMcCandlish ☏ ¢ 😼 06:04, 8 October 2024 (UTC)
Discourage postfix plus?
[edit](motivated by the previous section) If there's any work to be done with combining/rearranging MOS:COMMONMATH and WP:COMMONMATH, can we please also add that "over N" and "at least N" should use the standard notation >N
and ≥N
respectively (as, for example, the CMOS tells in 3.83 and 12.16) instead of a postfix plus (N+, which is ambiguous, inconsistent with other cases like <N
and ~N
, and doesn't seem to conform to any reputable style guide)? — Mikhail Ryazanov (talk) Mikhail Ryazanov (talk) 21:29, 7 October 2024 (UTC)
- Has this issue come up a lot? EEng 22:12, 7 October 2024 (UTC)
- A while ago I've got a revert with a suggestive edit summary (that particular article has changed a lot since then, but I still stumble upon similar examples from time to time – if needed, I can put some effort to find specific examples). Also, a simple search for insource:/[0-9]\+ / prefix:: yields thousands of results (before timing out), only a small fraction of which are legitimate uses (or poorly formatted binary operations). — Mikhail Ryazanov (talk) 22:54, 7 October 2024 (UTC)
- Then the next question is: has time been wasted debating this question on multiple articles, or can they just be fixed on sight without fuss? If the latter, then no new MOS provision is needed, and therefore it is needful that there not be one. EEng 23:10, 7 October 2024 (UTC)
- A while ago I've got a revert with a suggestive edit summary (that particular article has changed a lot since then, but I still stumble upon similar examples from time to time – if needed, I can put some effort to find specific examples). Also, a simple search for insource:/[0-9]\+ / prefix:: yields thousands of results (before timing out), only a small fraction of which are legitimate uses (or poorly formatted binary operations). — Mikhail Ryazanov (talk) 22:54, 7 October 2024 (UTC)
Numerals in a sequence
[edit]'Phase 1' or Phase one'? This appears to be a case that's not explicitly covered.
The AP Stylebook recommends using figures for sequences in its section on "Numbers": "Also use figures in all tabular matter, and in statistical and sequential forms", from which I infer that for sequences, such as 'phase 1', figures should be used for clarity and consistency.
Similarly, chapter 9 of The Chicago Manual of Style advises using figures when referring to a sequence.
I propose adding similar explicit advice to this section of the MOS.
-- Jmc (talk) 20:10, 19 October 2024 (UTC)
- As usual, what's needed before something's added to MOS is examples of this being an issue on multiple articles -- see WP:MOSBLOAT. Are editors not able to work this out for themselves on individual articles? Anyway, why does the word "Phase" need this in particular? Why not "Section" and "Part" and any other words like that? The advice from APA and CMS are great if you're making up a new sequence for your thesis, but that's not us. It's hard to imagine an article using a phrase like "Phase 1" or "Phase One" on its own -- that is, other than in imitation of the phrasing of sources. So follow the sources; for example, Economic Stabilization Act of 1970 refers to Phase I and Phase II and Phase III., because that's the form the Act uses. We're not going to override that in the name of consistency with other, unrelated articles. EEng 22:00, 19 October 2024 (UTC)
- To clarify: I'm using 'Phase' purely as an example. The issue of using figures for sequences applies to any sequence. including 'Section' and 'Part' - and other examples: "Game 3", of a sequence of nine; 'Chapter 9' of a sequence of 24; 'Week 4' of a limitless sequence.
- I raise this issue in the context of differing editorial practices in the British Post Office scandal article, where both figures and words have been used to reference the same phases and weeks of the inquiry. I sought guidance from the MOS and found none.
- I'd be content to follow the sources, without adding bloat to the MOS, if I could be confident that that's an accepted stylistic convention in this instance. -- Jmc (talk) 22:27, 19 October 2024 (UTC)
- Such names are very often established by authoritative sources and constitute proper names; we should follow the sources rather than renaming them. Per EEng, we only need a MOS guideline if our sources don't provide clear names and either there is dissent among editors or consistency across articles would be of significant benefit. In the Post Office case, I see the phases have been titled Phase 1, Phase 2 etc by the inquiry[2] so unless the inquiry's inconsistent, we can follow that source. Still, I see that this is a live issue at that British Post Office scandal article, so it would be wrong to establish a new guideline or issue some sort of MOS talk-page ruling without the knowledge of the other editor; pinging MapReader. NebY (talk) 14:56, 20 October 2024 (UTC)
- Between May 1966 and December 1989, multi-episode Doctor Who stories could have titles in any of the four combinations of (i) "Episode ..." or "Part ..."; (ii) numbers as figures or as words. The decision as to which format to use was probably in the hands of the series producer, but in our articles about each story, we give the actual title shown on screen - except that where the on-screen title is all-capitals, we reduce it to title case. Certain Doctor Who reference books do the same, so we're following the sources. --Redrose64 🌹 (talk) 18:18, 20 October 2024 (UTC)
- The question raised was "differing editorial practices in the British Post Office scandal article". Sounds like a matter of internal consistency, which is different. For all manner of things -- this being one IMO -- we might not need consistency among articles, but it does look bad within articles. Surely we already have a rule addressing that general issue tho? Herostratus (talk) 13:24, 21 October 2024 (UTC)
- I think we don't. In articles on TV series it's common to have expressions like "season 3" and "episode 7", which seem to go against our current wording (use words for numbers below 10). Gawaon (talk) 16:37, 21 October 2024 (UTC)
- It is indeed a matter of internal consistency and it does look bad, as Herostratus says. Within the one article (British Post Office scandal), we have (e.g.) both "Phase 3 hearings" and "Phases five and six". Is there in fact a rule addressing this general issue? -- Jmc (talk) 18:47, 21 October 2024 (UTC)
- From Wikipedia:Manual of Style/Dates and numbers#Numbers as figures or words: "Comparable values nearby one another should be all spelled out or all in figures, even if one of the numbers would normally be written differently." Unless you are dealing only with series with fewer than 10 seasons each with fewer than 10 episodes, it is more in line with MOS to give all season and episode numbers in digits rather than words. --User:Khajidha (talk) (contributions) 13:15, 22 October 2024 (UTC)
- True, but series with less than ten seasons aren't all that rare, and there are also miniseries with less than ten episodes. Gawaon (talk) 16:39, 22 October 2024 (UTC)
- Whether or not it's in line with MOSNUM, we frequently – I suspect in the vast majority of cases – give series/season and episode numbers in digits. I've been dipping into Wikipedia:Good articles/Media and drama#Television. Articles on individual episodes do routinely begin e.g. " the ninth and final episode of the first season" but with digits in the infobox. Articles on a season/series list episodes using digits, and articles on a show list series/seasons and episodes with digits, regardless of whether there are more or less than ten, in keeping with the examples in Wikipedia:Manual of Style/Television#Episode listing. Articles are often titled <show> season <n> where n is a digit, never a word, in accordance with Wikipedia:Naming conventions (television)#Season articles. Sampling our WP:Featured articles#Media, I see the same treatment in titles, infoboxes, and listings.I very much doubt that editors would accept changes to those FAs and GAs to bring them into line with MOS:NUMERAL, that FA and GA assessors will start to apply MOS:NUMERAL in such cases, that any move requests would succeed, or that MOS:TV and WP:TVSEASON will be brought into line with the current MOS:NUMERAL. Changing MOS:NUMERAL might be easier. NebY (talk) 08:20, 23 October 2024 (UTC)
- I agree, a small addition to MOS:NUMERAL might be a good thing. Gawaon (talk) 17:00, 23 October 2024 (UTC)
- Your final sentence doesn't follow from your statement. It would be more in keeping with the MOS to give all in words. MapReader (talk) 11:16, 23 October 2024 (UTC)
- I think we don't. In articles on TV series it's common to have expressions like "season 3" and "episode 7", which seem to go against our current wording (use words for numbers below 10). Gawaon (talk) 16:37, 21 October 2024 (UTC)
- The question raised was "differing editorial practices in the British Post Office scandal article". Sounds like a matter of internal consistency, which is different. For all manner of things -- this being one IMO -- we might not need consistency among articles, but it does look bad within articles. Surely we already have a rule addressing that general issue tho? Herostratus (talk) 13:24, 21 October 2024 (UTC)
- Between May 1966 and December 1989, multi-episode Doctor Who stories could have titles in any of the four combinations of (i) "Episode ..." or "Part ..."; (ii) numbers as figures or as words. The decision as to which format to use was probably in the hands of the series producer, but in our articles about each story, we give the actual title shown on screen - except that where the on-screen title is all-capitals, we reduce it to title case. Certain Doctor Who reference books do the same, so we're following the sources. --Redrose64 🌹 (talk) 18:18, 20 October 2024 (UTC)
- Such names are very often established by authoritative sources and constitute proper names; we should follow the sources rather than renaming them. Per EEng, we only need a MOS guideline if our sources don't provide clear names and either there is dissent among editors or consistency across articles would be of significant benefit. In the Post Office case, I see the phases have been titled Phase 1, Phase 2 etc by the inquiry[2] so unless the inquiry's inconsistent, we can follow that source. Still, I see that this is a live issue at that British Post Office scandal article, so it would be wrong to establish a new guideline or issue some sort of MOS talk-page ruling without the knowledge of the other editor; pinging MapReader. NebY (talk) 14:56, 20 October 2024 (UTC)
- Generally concur with EEng and NebY. It's clear that certain conventions adhere strongly to certain things, and these conventions will be readily apparent from the source material about those things. WP is not in a position to impose an artificial WP-invented consistency on them that makes no sense for those familiar with the subject (e.g. referring to "issue number seven" of a comic book or "the three ball" in a game of pool). Where nothing like a consistent convention can be observed for the topic at hand, then MOSNUM already provides us with a default to fall back to: use "one" through "nine", then "10" onward. This is the case with centuries, for example. There is no overwhelming source preference for either "third century BC[E]" or "3rd century BC[E]" in reliable sources. (Books tend to prefer the former, journals use the latter more than books do because journal publishers are more interested in compression/expediency. Scroll through first 10 pages of GScholar resuls here and see how much variance there is, and how frequent the numeral style is compared to "traditional" spelling-out. That said, GScholar searches do include some books as well as journals.) Following our default system, we naturally end up with "third century BC" and "12th century BC". (Of course, our material doesn't perfectly follow this; our editors are human, not robots. Well, mostly.) — SMcCandlish ☏ ¢ 😼 15:04, 24 November 2024 (UTC)
μs vs us
[edit]Which style I should use for micro seconds? Does μs relative to "Do not use precomposed unit symbol characters"? DungeonLords (talk) 04:44, 30 October 2024 (UTC)
- The 2 characters "μ" and "s" are just fine. The precomposed symbols advice is to guard against particular fonts that combine them into a single character because many software readers for the sight impaired do not know all of these symbols. Stepho talk 04:53, 30 October 2024 (UTC)
- But do use μ, not "u". The latter was something of an early-Internet halfassed approach, but we have Unicode now. — SMcCandlish ☏ ¢ 😼 15:09, 24 November 2024 (UTC)
Day, date month format
[edit]Greetings and felicitations. I assume that such constructions as "Wednesday, 24 February" are discouraged, but I can't find it in the text or the this page's archives. (The comma seems unnecessary to me.) May I please get confirmation or refutation? —DocWatson42 (talk) 04:28, 4 November 2024 (UTC)
- MOS:DATEFORMAT and MOS:BADDATE cover the allowed and disallowed formats. Unless the day of the week is vitally important then we leave it out. Stepho talk 06:16, 4 November 2024 (UTC)
- This specifically regards the "Hadaka Matsuri" article, and its Konomiya Hadaka Matsuri infobox, which includes the days of the week. —DocWatson42 (talk) 07:40, 4 November 2024 (UTC)
- Ah, the mysterious East. EEng 08:06, 4 November 2024 (UTC)
- This specifically regards the "Hadaka Matsuri" article, and its Konomiya Hadaka Matsuri infobox, which includes the days of the week. —DocWatson42 (talk) 07:40, 4 November 2024 (UTC)
- Salutations and hugs and kisses to you too.
- If your question is whether day-of-week should be gratuitously included with dates for no particular reason, the answer is No. That is, if the day-of-week is somehow relevant to the narrative, sure, include it, but otherwise no.
- Assuming we're in some situation where (per the preceding) inclusion of day-of-week is indeed justified, maybe your question is how to append the D.O.W.
- If the date is February 24 or February 24, 2024, then without doubt the right format is Wednesday, February 24 or Wednesday, February 24, 2024.
- According to "Elite editing" [3] (whoever they may be -- search the text "inverted style" on that page), the corresponding answers for 24 February and 24 February 2024 are Wednesday, 24 February and Wednesday, 24 February 2024. To me that does seem right -- Wednesday 24 February 2024 (all run together, no commas at all) seems intolerable.
- The question naturally arises as to whether MOS should offer advice on all the above. My answer, as usual, is provisionally No, per WP:MOSBLOAT. EEng 08:02, 4 November 2024 (UTC)
- Looking at the article, the date is the 12th day of the Chinese year and the day of the week has no significance. I would remove the day of the week from all those dates in the infobox. For what it's worth, I spent most of the 1990s in Hong Kong/China. Major holidays based on the Chinese calendar treat the day of the week in the same way that we treat the day that Christmas falls on. Stepho talk 09:18, 4 November 2024 (UTC)
- Okay—will do. Thank you both. ^_^ —DocWatson42 (talk) 09:21, 4 November 2024 (UTC)
- Looking at the article, the date is the 12th day of the Chinese year and the day of the week has no significance. I would remove the day of the week from all those dates in the infobox. For what it's worth, I spent most of the 1990s in Hong Kong/China. Major holidays based on the Chinese calendar treat the day of the week in the same way that we treat the day that Christmas falls on. Stepho talk 09:18, 4 November 2024 (UTC)
- The new 18th edition of The Chicago Manual of Style gives advice about commas in dates in ¶ 6.14. When giving examples they mostly give examples with words after the end of the date so the punctuation at the end of the date is illustrated. Some examples:
- The hearing was scheduled for 2:30 p.m. on Friday, August 9, 2024.
- Monday, May 5, was a holiday; Tuesday the 6th was not.
- Jc3s5h (talk) 16:56, 4 November 2024 (UTC)
- Concur with EEng on avoiding adding a rule about this, as more WP:MOSBLOAT. It's just a matter of basic writing sense, basic comma usage in competent English. Our MoS's purpose is not that of CMoS or Fowler's, trying to answer every imaginable usage question. Just those that have an impact on reader comprehensibility and/or recurrent editorial strife. — SMcCandlish ☏ ¢ 😼 15:18, 24 November 2024 (UTC)
Spacing with percentage points
[edit]A question regarding spacing of percentage point (pp) usage. I have always assumed there is no space between the number and pp (e.g. 5.5pp not 5.5 pp), on the basis that you wouldn't put a space between a number and a percentage sign (5% not 5 %). There is no reference to this in the MOS, but the percentage point article uses it unspaced. It might be good to have it clarified in the MOS as I see regular changes adding spacing, which I am not sure is correct. Cheers, Number 57 23:49, 5 November 2024 (UTC)
- MOS:PERCENT says "omit space". Stepho talk 23:54, 5 November 2024 (UTC)
- Perhaps I am missing something, but as far as I can see, it says to omit space when using the percentage symbol (%) but nothing about when using pp? Number 57 00:21, 6 November 2024 (UTC)
- Apologies, I missed the "point" word in your question. Stepho talk 01:49, 6 November 2024 (UTC)
- Perhaps I am missing something, but as far as I can see, it says to omit space when using the percentage symbol (%) but nothing about when using pp? Number 57 00:21, 6 November 2024 (UTC)
- % is essentially a constant factor (.01), but pp is more like a unit so my intuition says it should be spaced. I note that the basis point article uses a space before bp (mostly, anyway). I'll be interested to hear what others think. EEng 18:23, 6 November 2024 (UTC)
- You've got this back to front. Percent (%) is a standard unit symbol and should be spaced, whereas pp is a made up abbreviation, meaning you can put it anywhere you want, space or unspaced. I know MOSNUM says otherwise, which is WP's prerogative. In other words, if we need a rule, let's make one up and apply it, but there's no logic involved. Dondervogel 2 (talk) 21:06, 6 November 2024 (UTC)
- Dondervogel, "Percent (%) is a standard unit symbol and should be spaced". Huh? It's not an ISO unit symbol, is it. No spacing in English, unlike French. On pp, I agree with EEng: space it. Tony (talk) 11:10, 8 November 2024 (UTC)
- Absolutely. When it comes to peepee, always space it [4]. EEng 21:36, 8 November 2024 (UTC)
- Yes, "%" is an ISO standard unit symbol. Dondervogel 2 (talk) 12:45, 8 November 2024 (UTC)
- What is it the unit of? Gawaon (talk) 13:14, 8 November 2024 (UTC)
- Nothing. It's a dimensionless quantity. To the original q: I don't see "pp" used often, in fact rarely. It's probably better written out in full on first use, and if there are subsequent uses, follow the guidance at MOS:ACRO1STUSE. --Redrose64 🌹 (talk) 19:58, 8 November 2024 (UTC)
- It's used widely in election infoboxes where there isn't space to write it out. Number 57 22:25, 8 November 2024 (UTC)
- I will answer Gawaon's valid question in two parts. The first part is a quotation from ISO 80000-1:2009 (emphasis added)
- In some cases, per cent, symbol %, where 1 % := 0,01, is used as a submultiple of the coherent unit one.
- EXAMPLE 4
- reflection factor, r = 83 % = 0,83
- Also, per mil (or per mille), symbol ‰, where 1 ‰ := 0,001, is used as a submultiple of the coherent unit one.Since the units “per cent” and “per mil” are numbers, it is meaningless to speak about, for example, percentage by mass or percentage by volume. Additional information, such as % (m/m) or % (V/V) shall therefore not be attached to the unit symbol %. See also 7.2. The preferred way of expressing, for example, a mass fraction is “the mass fraction of B is w B = 0,78” or “the mass fraction of B is wB = 78 %”. Furthermore, the term “percentage” shall not be used in a quantity name, because it is misleading. If a mass fraction is 0,78 = 78 %, is the percentage then 78 or 78 % = 0,78? Instead, the unambiguous term “fraction” shall be used. Mass and volume fractions can also be expressed in units such as µg/g = 10-6 or ml/m3 = 10-9.
- Notice the deliberate space between numerical value (e.g., 83) and unit symbol (%). Dondervogel 2 (talk) 22:10, 8 November 2024 (UTC)
- The second part is a partial retraction, quoting from ISO 80000-1:2022, which supersedes the 2009 document:
- If the quantity to be expressed is a sum or a difference of quantities, then either parentheses shall be used to combine the numerical values, placing the common unit symbol after the complete numerical value, or the expression shall be written as the sum or difference of expressions for the quantities.
- EXAMPLE 1
- l = 12 m - 7 m = (12 - 7) m = 5 m, not 12 - 7 m
- U = 230 ⋅ (1 + 5 %) V = 230 ⋅ 1,05 V ≈ 242 V, not U = 230 V + 5 %
- The space is still there between numerical value (5) and percentage symbol (%), but I could not find an explicit reference to "%" as a unit symbol. I'm unsure how to interpret that change, but I'll report back here if I find further clarification. Dondervogel 2 (talk) 22:16, 8 November 2024 (UTC)
- I found this in NIST Special Publication 811
- In keeping with Ref. [4: ISO 31-0], this Guide takes the position that it is acceptable to use the internationally recognized symbol % (percent) for the number 0.01 with the SI and thus to express the values of quantities of dimension one (see Sec. 7.14) with its aid. When it is used, a space is left between the symbol % and the number by which it is multiplied [4: ISO 31-0]. Further, in keeping with Sec. 7.6, the symbol % should be used, not the name "percent."
- Example: xB = 0.0025 = 0.25 % but not: xB = 0.0025 = 0.25% or xB = 0.25 percent
- Note: xB is the quantity symbol for amount-of-substance fraction of B (see Sec. 8.6.2).
- Because the symbol % represents simply a number, it is not meaningful to attach information to it (see Sec. 7.4). One must therefore avoid using phrases such as "percentage by weight," "percentage by mass," "percentage by volume," or "percentage by amount of substance." Similarly, one must avoid writing, for example, "% (m/m)," "% (by weight)," "% (V/V)," "% (by volume)," or "% (mol/mol)." The preferred forms are "the mass fraction is 0.10," or "the mass fraction is 10 %," or "wB = 0.10," or "wB =10 %" (wB is the quantity symbol for mass fraction of B—see Sec. 8.6.10); "the volume fraction is 0.35," or "the volume fraction is 35 %," or " φB = 0.35," or "φB = 35 %" (φB is the quantity symbol for volume fraction of B—see Sec. 8.6.6); and "the amount-of-substance fraction is 0.15," or "the amount-of-substance fraction is 15 %," or "xB = 0.15," or "xB = 15 %." Mass fraction, volume fraction, and amount-of-substance fraction of B may also be expressed as in the following examples: wB = 3 g/kg; φB = 6.7 mL/L; xB = 185 mmol/mol. Such forms are highly recommended (see also Sec. 7.10.3).
- In the same vein, because the symbol % represents simply the number 0.01, it is incorrect to write, for example, "where the resistances R1 and R2 differ by 0.05 %," or "where the resistance R1 exceeds the resistance R2 by 0.05 %." Instead, one should write, for example, "where R1 = R2 (1 + 0.05 %)," or define a quantity Δ via the relation Δ = (R1 - R2) / R2 and write "where Δ = 0.05 %." Alternatively, in certain cases,the word "fractional" or "relative" can be used. For example, it would be acceptable to write "the fractional increase in the resistance of the 10 kΩ reference standard in 2006 was 0.002 %."
- As with ISO 80000-1:2022, there is always a space between numerical value (e.g., 35) and the percentage symbol (%), but no mention of % as a unit symbol. Dondervogel 2 (talk) 22:38, 8 November 2024 (UTC)
there is always a space between numerical value (e.g., 35) and the percentage symbol (%)
– Maybe in NIST-world, but not here on Wikipedia (see MOS:PERCENT), so I don't see how any of that helps us with the issue at hand. EEng 23:29, 8 November 2024 (UTC)- I was correcting a misconception that % is not a unit symbol when it is. At least it was until 2022. I find it best not to leave incorrect statements unchallenged or they take on a life of their own. Dondervogel 2 (talk) 00:24, 9 November 2024 (UTC)
- Um, OK, but you do realize that WP does not follow NIST's advice about spacing it, yes? EEng 00:44, 9 November 2024 (UTC)
- Yep, and I wasn't trying to change that. My contributions have been to
- correct a factual error (yours)
- respond to questions from Tony and Gawaon
- I have not weighed in on the main thread regarding percentage points because I don't expect my opinion (based not on NIST's utterings but on the ISO standards on which they are based) to be taken seriously, so why would I waste my e-breath? Dondervogel 2 (talk) 09:41, 9 November 2024 (UTC)
- Yep, and I wasn't trying to change that. My contributions have been to
- Um, OK, but you do realize that WP does not follow NIST's advice about spacing it, yes? EEng 00:44, 9 November 2024 (UTC)
- I was correcting a misconception that % is not a unit symbol when it is. At least it was until 2022. I find it best not to leave incorrect statements unchallenged or they take on a life of their own. Dondervogel 2 (talk) 00:24, 9 November 2024 (UTC)
- Nothing. It's a dimensionless quantity. To the original q: I don't see "pp" used often, in fact rarely. It's probably better written out in full on first use, and if there are subsequent uses, follow the guidance at MOS:ACRO1STUSE. --Redrose64 🌹 (talk) 19:58, 8 November 2024 (UTC)
- What is it the unit of? Gawaon (talk) 13:14, 8 November 2024 (UTC)
- It is not conventional to space "%" in English. Nearly no publishers do this, and our MoS doesn't say to do this or incidentally illustrating doing this, so don't do this. "pp" here is a unit abbreviation for percentage point ("the unit for the arithmetic difference between two percentages)", so space it. % is not a unit abbreviation/symbol, but a quantity symbol, so it's in a different class. It's more like the ~ in "~5 ml". That the spelled-out equivalent "approximately", like the spelled out "per[]cent", is spaced apart from the numeral is irrelevant. — SMcCandlish ☏ ¢ 😼 15:24, 24 November 2024 (UTC)
UNITSYMBOLS (1 × 3 × 6 m): “each number should be followed by a unit name or symbol”
[edit]MOS:UNITSYMBOLS currently requires a unit symbol after each value when listing dimensions separated by × (“1 m × 3 m × 6 m, not 1 × 3 × 6 m”). Could we have a carveout from this rule, and allow editors to use only a final unit when writing for infoboxes, and perhaps other places where space is limited?
Context: {{Infobox mobile phone}} currently has a preference for listing the dimensions of the product each on a separate line. This, and other parameters, can make the infobox very long. This is especially problematic for pages that cover multiple products or versions of a product; see dimensions in Samsung Galaxy S21 infobox. In order to cut down these infoboxes, we could be using a single line for all three dimensions, but the unit after each value feels unnecessary, and can cause line overflow.
Prior discussion: Wikipedia talk:Manual of Style/Dates and numbers/Archive 145#Repeating units in ranges and dimensions, where the potential for confusion with actually multiplying values was pointed out. I think this is a minor concern in general, but worth considering in prose, or in contexts where the values could be ambiguous. — HTGS (talk) 04:17, 11 November 2024 (UTC)
- Where space is limited, it makes sense to present a single compound unit, equal to the product of the separate units. For the example given, the compound unit symbol would be m3. Dondervogel 2 (talk) 12:13, 11 November 2024 (UTC)
- Who ever heard of a phone advertised as 5 cc ? People are more interested in it being wide and tall but very thin. This necessitates stating each individual dimension. Stepho talk 22:40, 11 November 2024 (UTC)
- No, what Dvogel means is you'd write that a certain phone measures
146 x 71.5 x 7.65 mm3
. Having clarified that, I'm bound to say that that would, of course, confuse 99% of our readers. EEng 22:47, 11 November 2024 (UTC)- Gotcha. As well as confusing most readers, it would also be different to
1 by 3 by 6 m
, which is allowed. Stepho talk 23:30, 11 November 2024 (UTC)- To be clear for those playing along at home, while the canonical formuations are
1 m by 3 m by 6 m
and1 m x 3 m x 6 m
, MOS currently makes an exception allowing1 by 3 by 6 m
(specifically in the case where all the quantities are in the same unit -- in this case metres), but no corresponding exception allowing1 x 3 x 6 m
. While it may offend purists, I really don't see why the exception shouldn't be extended to that last case as well. Thoughts? EEng 23:39, 11 November 2024 (UTC)
- To be clear for those playing along at home, while the canonical formuations are
- Thank you for clarifying my intent. And for making me chuckle. LoL
- For a 3 dimensional object, one can write either 146 mm x 71.5 mm x 7.65 mm or 146 x 71.5 x 7.65 mm3. I agree the former is clearer, but the latter uses less space, which can be a consideration. There is no difference in meaning.
- I guess one could also write 146 x 71.5 x 7.65 mm, but then we have a length, not a volume. It would be clearer to write that length as 79.86 m. Dondervogel 2 (talk) 23:42, 11 November 2024 (UTC)
one could also write 146 x 71.5 x 7.65 mm, but then we have a length, not a volume
– Formally perhaps, but you could say the pretty much the same about 146 by 71.5 by 7.65 mm, and yet we allow it. No one will think that 146 x 71.5 x 7.65 mm means the length 79.86 m (i.e. 79860 mm). In context readers will understand it for what it is. I'd like to hear what others think about my proposal. EEng 23:56, 11 November 2024 (UTC)- Seconded EEng's proposal - simple and clear. Mr.choppers | ✎ 04:36, 14 November 2024 (UTC)
- EEng is, of course, correct. At {{convert}} we sometimes are asked how the duplicate mm units can be removed to save space (the trick is to use
xx
in convert) and we tell them that omitting repeated units is ok if space is limited. May as well make it official. Johnuniq (talk) 05:51, 14 November 2024 (UTC)EEng is, of course, correct.
– Of course -- even Dondervogel says so. EEng 06:37, 14 November 2024 (UTC)
- I also support the proposal. Stepho talk 05:53, 14 November 2024 (UTC)
- Gotcha. As well as confusing most readers, it would also be different to
- No, what Dvogel means is you'd write that a certain phone measures
- I thought this was a joke and burst out laughing on a train, which got me a weird look from a fellow passenger. Anyhow, I too support allowing the single unit after x symbols per EEng and John. Toadspike [Talk] 17:31, 18 November 2024 (UTC)
- Who ever heard of a phone advertised as 5 cc ? People are more interested in it being wide and tall but very thin. This necessitates stating each individual dimension. Stepho talk 22:40, 11 November 2024 (UTC)
- It's tiresome to have to write (and read) units multiple times when multiplication signs are used. Tony (talk) 09:47, 14 November 2024 (UTC)
- As the person who proposed this in the first place, I too support EEng’s proposal. I will carry on working on the infobox, and leave the written MOS to others. I imagine the purists might be happy if we left some comment or endnote about making sure the measurements are not potentially ambiguous though?
- And, for anyone who cares, there are already pages where this is in sensible use: List of photographic film formats. — HTGS (talk) 23:34, 18 November 2024 (UTC)
Do we have to convert inches for wheels?
[edit]I see people adding conversions to mentions of screen sizes and wheel dimensions - is this really necessary? Even in Germany or New Zealand, automobile and bike wheels are universally referred to by inches; rim diameters are expressly defined in inches in the EU regulations. To me, adding conversions for these types of dimensions adds unnecessary clutter, harming readability for no return whatsoever. I haven't read the entire MOS today, apologies if I missed a mention of these situations. Mr.choppers | ✎ 17:24, 13 November 2024 (UTC)
- It looks like sizing bike wheels in inches is not universal. I see many charts in the I-net such as this that use both metric and imperial/American units for bike wheels and tires. Whether the convert template handles them correctly is another issue. Donald Albury 17:43, 13 November 2024 (UTC)
- On the matter of wheel sizes, not all are inches. See this post and my reply. Even for a conventional non-Denovo wheel, the dimensions are a bastard mixture: "195/65 R 15" means a tyre that is 195 mm wide on a 15-inch rim. --Redrose64 🌹 (talk) 19:10, 13 November 2024 (UTC)
- Yes, there is the Michelin TRX and the Denovo. Just as we wouldn't convert the "195" when we write 195/60 R15, I don't think we ought to convert the diameter either. I would treat all of these tire dimensions as one would nominal measurements, rather than inserting unnecessary templates. Bicycle tires, meanwhile, proved more varied than I was aware of. Mr.choppers | ✎ 04:33, 14 November 2024 (UTC)
- I agree with Mr.Choppers on this subject. I think wheels sizes on cars are a compromise between the USA and the rest of the world. There are metric rims on older vehicles but pretty rare on new vehicles. Avi8tor (talk) 11:40, 14 November 2024 (UTC)
- @Avi8tor: - I was actually triggered by you converting screen dimensions, but five minutes online showed me that the modern world has indeed begun dropping the use of inches for screens. My gut was wrong. Mr.choppers | ✎ 13:36, 14 November 2024 (UTC)
- Many people around the planet know only millimetres, so it makes sense to have both. I notice in France the data information on television screen size have it in both inches and millimetres. Avi8tor (talk) 17:57, 16 November 2024 (UTC)
- @Avi8tor: - I was actually triggered by you converting screen dimensions, but five minutes online showed me that the modern world has indeed begun dropping the use of inches for screens. My gut was wrong. Mr.choppers | ✎ 13:36, 14 November 2024 (UTC)
- I agree with Mr.Choppers on this subject. I think wheels sizes on cars are a compromise between the USA and the rest of the world. There are metric rims on older vehicles but pretty rare on new vehicles. Avi8tor (talk) 11:40, 14 November 2024 (UTC)
- Yes, there is the Michelin TRX and the Denovo. Just as we wouldn't convert the "195" when we write 195/60 R15, I don't think we ought to convert the diameter either. I would treat all of these tire dimensions as one would nominal measurements, rather than inserting unnecessary templates. Bicycle tires, meanwhile, proved more varied than I was aware of. Mr.choppers | ✎ 04:33, 14 November 2024 (UTC)
- I agree with Aviator, who didn't mention that aviation uses "feet" for altitude—needs conversion in my view. Tony (talk) 07:30, 22 November 2024 (UTC)
RfC Indian numbering conventions
[edit]
|
I am revisiting an issue that was last brought up 6 years ago here and settled without a strong consensus.
I think we should avoid using Indian numbering conventions unless it is needed for context. For instance, if we want to list the box office take of an Indian movie, don't use "crore", use "millions". This isn't about disrespecting a culture, it's about using internationally favored notation and unit conventions. We should use "millions" instead of "crore" for the same reason we favor meters over feet. There is no reason that India-related articles should be an enclave of Indian conventions. People who are not Indian will struggle with these things, it will weaken Wikipedia's role as an information tool for everyone.
This is not the same thing as currency. It is appropriate to list an Indian movie's box office take in rupees. Providing a US$ conversion is optional, but a good idea since the US dollar is widely used around the world as a reserve currency. But write it as "millions of rupees", not "crores of rupees". Kurzon (talk) 16:38, 16 November 2024 (UTC)
- What's the common usage in english? GoodDay (talk) 16:45, 16 November 2024 (UTC)
- I don't think most people in the US understand what "crore" is, and would not recognize it as part of the English language. The online Merriam-Webster dictionary says it means ten million, specifically, a unit of value equal to ten million rupees or 100 lakhs. I think most people in the US would not even understand that a currency is being mentioned.
- --Jc3s5h (talk) 17:00, 16 November 2024 (UTC)
- Not just people in the US. Nobody outside of India can be expected to know what a crore is. Kurzon (talk) 17:15, 16 November 2024 (UTC)
- We use meters over feet? Where?
Aaron Liu (talk) 17:50, 16 November 2024 (UTC)In non-scientific articles with strong ties to the United States, the primary units are US customary (pounds, miles, feet, inches, etc.)
- You get extra points for saying "US customary" and not "Imperial". 😉 Isaac Rabinovitch (talk) 18:20, 16 November 2024 (UTC)
- You get extra points for saying "US customary" and not "Imperial". 😉 Isaac Rabinovitch (talk) 18:20, 16 November 2024 (UTC)
- I agree with Kurzon, do not use "crore", use "millions". Wikipedia is for a worldwide audience. Avi8tor (talk) 18:03, 16 November 2024 (UTC)
- Kinda like how US units are used for US articles, I don't see the harm in using "crore", and it's way more work to manually convert to millions every time a member of India's vast diaspora in the Global North adds "crore" to an article, not knowing our ManualOfStyle. Aaron Liu (talk) 18:19, 16 November 2024 (UTC)
- Except we don't favor meters over feet — we use both. That's what the Convert template is for.
- Speaking as a non-Indian, who can never remember what how many is a "crore": I'm fine with it, as long as the international unit is also used. Isaac Rabinovitch (talk) 18:18, 16 November 2024 (UTC)
- We already make an exception for feet. I see no good reason for barring a second exception. State in crore and convert to a unit non-Indians can understand (millions of rupees?). Dondervogel 2 (talk) 20:48, 16 November 2024 (UTC)
The article for the French movie Les Visiteurs lists the budget as "9.5 million", using a point as a decimal separator. In France they use commas for this, ie "9,5 million". We don't use the French notation convention for France-related articles. Kurzon (talk) 17:14, 16 November 2024 (UTC)
- Is it the French style to use that notation in English? A different unit elicits way less confusion than a reversed decimal separator meaning anyways. Aaron Liu (talk) 17:50, 16 November 2024 (UTC)
- Bad RFC; see WP:RFCNEUTRAL and the rest of the guidance there too. Unsurprisingly, this has just started out as a disorganized discussion that doesn't resemble a normal RFC...you might want to just remove the tag, get some feedback, and then start a proper one in a bit (separate subsections for discussion and survey are pretty helpful too). 35.139.154.158 (talk) 18:21, 16 November 2024 (UTC)
- @Kurzon: I did advise you not to jump straight for a full-blown thirty-day formal RfC without first exhausting the suggestions at WP:RFCBEFORE. --Redrose64 🌹 (talk) 18:39, 16 November 2024 (UTC)
- This RfC is clearly improperly formatted, Kurzon; thank you to our unregistered friend for pointing this out.
- Oh come now. It seems to be developing nicely, I doubt that any editors are swayed by the wording. it's not perfect but perfect is the enemy of good and its good enough. Herostratus (talk) 04:47, 29 November 2024 (UTC)
- That reply was before the appropriate discussion centers were notified and before discussion started to develop. It's not just formatting; it's that there was no prior discussion. Now we're effectively having both at the same time, especially when an informal discussion could've resulted in consensus without a time-consuming process. Aaron Liu (talk) 16:08, 29 November 2024 (UTC)
- Oh come now. It seems to be developing nicely, I doubt that any editors are swayed by the wording. it's not perfect but perfect is the enemy of good and its good enough. Herostratus (talk) 04:47, 29 November 2024 (UTC)
- Consistency and clarity to our international readership are valid arguments in favor of prohibiting "crore" and "lakh". However, Aaron Liu makes good points about the fact that we allow local variation in articles with local ties, e.g. all of ENGVAR. I am unsure where I sit on this issue. I would like to see some Indian editors weigh in on this. Toadspike [Talk] 19:58, 16 November 2024 (UTC)
- I also agree that crores are too obscure (as are lakhs), with use limited to South Asia. Feet and inches, while retrograde and infinitely useless, were used across most of the world not many generations ago. The major unit in Japanese is 万 (man), which is 10,000, but we do not use that because most people wouldn't know it. Engvar is somewhat different: we cannot avoid choosing between "colour" and "color", for instance, whereas we can easily write the globally recognized "millions" rather than crores. As for User:Aaron Liu's comment: if someone adds crore, it will be there until fixed – it's not pressing enough of a problem to hunt down every instance. Mr.choppers | ✎ 20:03, 16 November 2024 (UTC)
- Good point about 万 – I completely forgot that Chinese has similarly different units. I think that settles it – either we allow crore and lakh alongside the East Asian 万 and 亿 (which I think is ridiculous) and an infinite variety of customary units, or we allow none.
- (Two counterarguments: 1. This is a slippery slope argument, which is a logical fallacy. To which I say no, we can't give only one country special treatment, we ought to be fair. 2. The East Asian units are non-Latin characters and thus more impractical than "crore". This is true.) Toadspike [Talk] 20:15, 16 November 2024 (UTC)
- On the subject of the myriad, I agree with Toads's second counterargument: there is no widely-recognized English translation for the unit in some "East Asian variant" of English; they just convert it to short scale in translations.
Part of my argument is that "crore" vs long scale is basically the same thing as "colour" vs "color": anonymous editors are going to add them. A ton. Expecting people to not use crore is like expecting people to not spell "colour". It's not pressing enough to hunt down, sure, but you're going to see sweet summer children adding crore into crore-free articles again and again and again. Aaron Liu (talk) 01:14, 17 November 2024 (UTC)we cannot avoid choosing between "colour" and "color", for instance, whereas we can easily write the globally recognized "millions" rather than crores.
- By the way, I've left a (neutrally-worded) note about this discussion at the Talk page of WikiProject India. Toadspike [Talk] 20:16, 16 November 2024 (UTC)
- I also agree that crores are too obscure (as are lakhs), with use limited to South Asia. Feet and inches, while retrograde and infinitely useless, were used across most of the world not many generations ago. The major unit in Japanese is 万 (man), which is 10,000, but we do not use that because most people wouldn't know it. Engvar is somewhat different: we cannot avoid choosing between "colour" and "color", for instance, whereas we can easily write the globally recognized "millions" rather than crores. As for User:Aaron Liu's comment: if someone adds crore, it will be there until fixed – it's not pressing enough of a problem to hunt down every instance. Mr.choppers | ✎ 20:03, 16 November 2024 (UTC)
- Don't allow crore. In the interest of making articles understandable to a wider audience, we already do this for the decimal marker (.) and separator for groups of 3 digits (,) as previously mentioned. We also require the use of short-scale even though long-scale hasn't entirely died out in the British Isles. Jc3s5h (talk) 21:16, 16 November 2024 (UTC)
- The decimal marker and long/short scale have a much better reason for their ban: The symbols they use have very different meanings outside of their local context, while crore, lakh, etc. do not. Aaron Liu (talk) 01:04, 17 November 2024 (UTC)
- Don't allow crore Per WP:COMMONALITY. This is not comparable with US v metric units where we report both - that is just a case of which is primarily reported. Furthermore, imperial units have a relatively recent historical usage across English. It is not like other issues of ENGVAR such as colour v color or ise v ize that do not affect understanding.
For an international encyclopedia, using vocabulary common to all varieties of English is preferable
- to the point of being paramount. Cinderella157 (talk) 22:38, 16 November 2024 (UTC) - Allow crore, lakh and Indian numbering system, but always, 1) link it on first use, 2) include what it is a measure of (rupees can not be assumed), 3) also include conventional numbering, and 4) allow it only in articles about the subcontinent. SchreiberBike | ⌨ 23:13, 16 November 2024 (UTC)
- I agree with all of these conditions. While I remain somewhat ambivalent on the use of “crore” in general, we must provide enough context for non-Indian readers to understand them. Toadspike [Talk] 13:56, 17 November 2024 (UTC)
- Allow crore, lakh per SchreiberBike, and with the same caveats. Dondervogel 2 (talk) 00:03, 17 November 2024 (UTC)
- Allow ScreiberBike, per my comments above. Aaron Liu (talk) 01:20, 17 November 2024 (UTC)
- Allow ScreiberBike. But see also Wikipedia:Manual of Style/India-related articles#Basic_India_conventions - "You may use the Indian numbering system of lakhs and crores but should give their equivalents in millions/billions in parentheses" — Preceding unsigned comment added by Asteramellus (talk • contribs) 00:30, 18 November 2024 (UTC)
- Allow crore, lakh and Indian numbering system, but always, 1) link it upon first use in every section where it appears, 2) include what it is a measure of (rupees can not be assumed), 3) also include conventional numbering using template {{convert}}—i.e., don't convert yourself, and 4) allow it only in articles about the subcontinent. Mathglot (talk) 23:11, 18 November 2024 (UTC)
- Hm; was very surprised to notice that the {{convert}} template does not currently support lakhs and crores. I think it should, and started a discussion about that. If you wish to comment, please go to Module talk:Convert#Indian numbering system: lakhs and crores. Thanks, Mathglot (talk) 23:50, 18 November 2024 (UTC)
- The convert template converts units, like feet and metres. Crores and lakhs are not units, but multipliers. It would be like convert being used to convert between hundreds, thousands, millions etc. --Redrose64 🌹 (talk) 22:52, 19 November 2024 (UTC)
- I agree with SchreiberBike and others; "crores" and "lakhs" can always be used to add colour/color to an article as long as those requirements are met. Mr.choppers | ✎ 04:50, 20 November 2024 (UTC)
- Hm; was very surprised to notice that the {{convert}} template does not currently support lakhs and crores. I think it should, and started a discussion about that. If you wish to comment, please go to Module talk:Convert#Indian numbering system: lakhs and crores. Thanks, Mathglot (talk) 23:50, 18 November 2024 (UTC)
- Do not allow. This is not the same as variations of English in wide use where there are multiple widespread usages (color or colour). While SchreiberBike's conditions for use are reasonable, I would say that the standard international measurements should always be primary and subcontinent-specific numbering as a secondary only in articles about the subcontinent. Avgeekamfot (talk) 09:50, 20 November 2024 (UTC)
- What does "widespread" mean? Aaron Liu (talk) 12:17, 20 November 2024 (UTC)
- Allow, but always ... exactly as Mathglot laid out above (other than, per Stepho-wrs and Redrose64,
{{convert}}
isn't actually the right template, or at least isn't presently). I would add a further caveat that these traditional Indic units (technically, multipliers) should be given secondarily not primarily, but I could live without that. — SMcCandlish ☏ ¢ 😼 11:55, 21 November 2024 (UTC) - Allow when appropriate, under conditions set out by ScreiberBike. Also, this RfC does not meet WP:RFCNEUTRAL. ThatIPEditor Talk · Contribs 02:18, 22 November 2024 (UTC)
- Do not allow crore et al. It's not only native English-speakers who haven't a clue what it means when reading India-related articles; it's non-natives too. Tony (talk) 07:32, 22 November 2024 (UTC)
- I don't get what native/non-native speakers have to do with the issue. Aaron Liu (talk) 12:21, 22 November 2024 (UTC)
- Allow per ScreiberBike for South Asian articles. Johnbod (talk) 17:29, 22 November 2024 (UTC)
- Allow All Indian academic/professional textbooks and all Indian reliable sources, with few exceptions for specific conditions, use lakhs/crores when denoting INR and millions/billions when denoting foreign currencies. Not allowing is not an option, unless editors want to disregard Indian readers. Using X million rupees is almost as uncommon in India as using Y lakh dollars. My suggestion -- for articles that use {{Use Indian English}} force editors to 1) link it on first use, 2) include what it is a measure of (rupees can not be assumed) with Indian comma separator at 00 after thousands and for articles that don't use that template force editors to always use millions/billions with 000 comma separator. — hako9 (talk) 03:01, 23 November 2024 (UTC)
- Strongly disallow use of Indian comma separator. That would only serve to confuse. We don't permit a French comma separator on English Wikipedia. The Indian comma would be much worse. Dondervogel 2 (talk) 09:11, 23 November 2024 (UTC)
- I concur entirely with Dongervogel_2 on this side-point; we cannot mix-and-match numeric separator styles. We've repeatedly had debates in the past about permitting "," instead of "." as a decimal point to suit the preference of some subset of readers, and the answer is always firmly "no", so this isn't going to be any different. I'm not a professional researcher in this area, but I have looked into the matter in the course of various style debates, and the evidence clearly shows Indian publications using "Western" number formatting systems (or whatever you want to call them) on a regular basis, though often alongside the Indic krore, etc., system. That is, it's just not plausible that English-using readers in/from India have any difficulty understanding our numeric material, especially after the rise of the Internet has exposed them to content from all over the world since the mid-1990s and pretty much ubiquitously since the early 2010 with the rise of mobile data. — SMcCandlish ☏ ¢ 😼 14:49, 24 November 2024 (UTC)
“it's just not plausible that English-using readers in/from India have any difficulty understanding our numeric material …”
Of course the same could be said of American readers and the spelling of ‘colour’. — HTGS (talk) 17:41, 28 November 2024 (UTC)- What isn't the same is how many editors will add "colour" into articles while most wouldn't add numbers in the Indian system. Aaron Liu (talk) 18:30, 28 November 2024 (UTC)
- I’m genuinely not sure what your point is? Editors are more likely to (erroneously) change spelling to ‘colour’, so that gives them more grounds for the MOS giving them parity with American English? I know we should be realistic about what we can control, but I don’t love that logic. — HTGS (talk) 03:18, 29 November 2024 (UTC)
- Yes, that or add spelling that says "colour" is what I'm saying. Aaron Liu (talk) 04:03, 29 November 2024 (UTC)
- Like I would campaign for navboxes to be placed in the "see also" section if it weren't so widespread and unduly investative to correct. The corrections for disallowing crore are the same thing to me. Aaron Liu (talk) 04:11, 29 November 2024 (UTC)
- I’m genuinely not sure what your point is? Editors are more likely to (erroneously) change spelling to ‘colour’, so that gives them more grounds for the MOS giving them parity with American English? I know we should be realistic about what we can control, but I don’t love that logic. — HTGS (talk) 03:18, 29 November 2024 (UTC)
- What isn't the same is how many editors will add "colour" into articles while most wouldn't add numbers in the Indian system. Aaron Liu (talk) 18:30, 28 November 2024 (UTC)
- Because, the fact is, we aren’t using varieties of English solely to ensure accuracy or intelligibility. They are also being used to avoid recreating the Anglo-American hegemony that exists in published English, and to foster a connection in the community with the most interest in the subject. — HTGS (talk) 18:05, 28 November 2024 (UTC)
- Do not allow crore - I am not convinced that this word is actually English, and this is the English-language wikipedia. It seems that this is a foreign word that is used alongside English in areas that have ties to the language this word is from. Even in these areas, it seems that English speakers there fully understand what "millions", "thousands", etc mean, and there have been attestations linked above where they use both, presumably to help English speaking people understand what number is being referred to. My perspective here is colored by being an American expat living in Japan... in day-to-day speech, I will sometimes mix the languages and say "Oh, this costs 3 man yen." But I am under no circumstances thinking that "man" meaning "ten thousand" is English. I'm using another language's word. That's what it looks like they are doing here. Fieari (talk) 07:01, 28 November 2024 (UTC)
- As an alternative, I would also accept allowing crore only if the "millions" number is included alongside it. Fieari (talk) 07:28, 28 November 2024 (UTC)
- "Gumption" is borrowed from Scots; it is English. "Chutzpah" is borrowed from Yiddish; it is English. "Powwow" is borrowed from East-American indigenous language; it is English. "Crore" is borrowed from Hindustani; it is Indian English. All of the above are attested by dictionaries, while "man" to mean myriads is not. Aaron Liu (talk) 18:28, 28 November 2024 (UTC)
- Allow crore - my gut feeling is to disallow it because it is not English as understood by the majority of English readers (including native speakers from UK/US/Australia/etc and second language speakers from China/S.America/Europe/etc). However, crore and lakh are words that Indians practically think in even when speaking English. We have a similar problem where an article is marked as British English and has 99 occurrences of "litre" - an American will still add new stuff with "liter" because it is so naturally to them. In the same way, we will be pushing it up hill trying to get them to stop. So, we should let them use it in articles related to the Indian region but never on anything outside that region. Each first usage should link to crore and lakh so that the few non-Indian region readers have a clue what's going on. I would not bother with conversion to millions - once you learn that they are just putting 0's at the end it becomes easy enough in a short time and conversions just clutter up the article. But do not allow grouping like 1,00,000 under any circumstances. Stepho talk 02:41, 29 November 2024 (UTC)
- Don't allow crore. If there are people who don't know what "million" is, well some level of literacy is required here, yes. As to "link on first use", no, links are supposed to be "here's some extra/more detailed info about the subject if you want" not "you need to interrupt the flow of your reading and go off the page to understand this word". Herostratus (talk) 04:57, 29 November 2024 (UTC)
- Actually that's exactly what links are for. Readers who know the general topic well can just read an article straight forwardly. But readers new to the general topic are likely to come across words they don't know yet and can follow the links to learn. Eg, in car articles we often talk about the camshaft. If you are new to the detailed study of cars then you can follow that link and then return later. Stepho talk 06:09, 29 November 2024 (UTC)
- And if anybody thinks that a politely worded MOS rule will stop them adding crore and lakh then consider that at https://en.wikipedia.org/w/index.php?title=Nissan&diff=1256595427&oldid=1256557060 somebody added a MDY style date in spite of the article having 186 references in DMY style. I fix these (in both directions) practically daily. People do whatever comes natural and do not consider that any other way even exists.
- But I do feel a little better after my vent :) Stepho talk 11:35, 29 November 2024 (UTC)
- +1 and it’s worth reiterating that most advocates here are suggesting that the Indic value should always be “translated” into a Western value in parentheses, so most naïve readers would still be able to parse the article without following the link. — HTGS (talk) 06:21, 29 November 2024 (UTC)
- Do not allow crore—India-related articles are for international readership. No one outside the subcontinent is familiar with crore. It is a disservice to readers to allow it. Tony (talk) 06:24, 29 November 2024 (UTC)
- If they are not familiar with crore they can read the conversion to millions. And if they also want to learn about crore they can click on the link. I see no disservice. Dondervogel 2 (talk) 12:49, 29 November 2024 (UTC)
- Perhaps some are not aware but English Wikipedia is heavily used in India. The Top 50 Report from 2023 had five items about Indian movies and movie stars. The latest week's most viewed Top 25 had 2024 Maharashtra Legislative Assembly election and Kanguva. According to Indian English there are 128 million English speakers there. If we say to basically never use crore and lakh, we are sending a discouraging, even insulting, message to many of our readers and editors. SchreiberBike | ⌨ 13:51, 29 November 2024 (UTC)
- Allow in articles with strong ties to India, provided that the conversion is shown at first use. Hey, we could even write
In non-scientific articles with strong ties to
. See sauce for the goose. Also, it is very relevant that a huge fraction of en.wiki readers are Indian. "ccording to a 2011 census, 10.2% of the Indian population speaks English. This figure includes all Indians who speak English as a first, second, or third language. 10% of India's population is approximately 145 million people." Twice as many as in the UK, half as many as in the US. --𝕁𝕄𝔽 (talk) 11:49, 29 November 2024 (UTC)the United StatesIndia, the primaryunits are US customary (pounds, miles, feet, inches, etc.)multipliers are Crore and Lakh