Hacker News new | past | comments | ask | show | jobs | submit login

Fun fact - in Polish we have separate forms for 1 (singular), 2-4 (plural but nit many) and everything else. Zero is in “everything else”

0 książek

0.5 książki

1 książka

2,3,4 książki

2.5 książki

5 and above książek

5.5 (any other fraction) książki

>100 and a fraction - depends

Singular is for one.

The first plural is for things kind of treated as individual objects.

The second plural is for things that are treated as a bulk/mass.

The moment you use a fraction, the assumption is that you would need to count all I guess, so it’s treated as individual objects.

From this perspective, zero of something is zero plural-not-easily countable. Kind of “Zero OF books” like “Ten OF books”, with of being implied by the form of the word.




The term in linguistics for a category of 3 or 4 things is "paucal". Most languages with a paucal separate 2 from 3 or 4, resulting in four noun categories/forms by number: singular (1), dual (2), paucal (3 - 4? a few?) and plural (5+). That's quite a common pattern among the world's languages. Polish and the other Slavic languages with this feature are a little unusual in not having the separate dual. A few languages have a trial (3) as a distinct category but it's rare. And some languages distinguish between a greater and lesser paucal, roughly "a few" vs "many", usually with the singular, dual and plural as well, having 5 categories of noun number.

Languages with these features often have lots of irregularities around them, too. In the same way that "pants" are plural for no reason in English, eyes might be plural instead of the obvious-seeming dual, etc. And if that seems all a bit unnecessarily numerical, you may be right; Chinese has gotten by for thousands of years without any plurals at all.


> Chinese has gotten by for thousands of years without any plurals at all.

Chinese has plurals; 们 has no other use.


Counting words and 们 aren't the same, as they aren't declensions. You add on the syllable rather than actually changing the syllable of the noun itself.

Granted, that's still different from saying that Mandarin doesn't have a concept of plural, but I think the underlying point- no conjugation or declension- is very different from the other languages being discussed.


To be more accurate, 们 isn't a plural marker more because of the fact that it's not productive[1], rather than the fact that Chinese doesn't have declension. If 们 were able to be suffixed to any noun to make it plural, then you could consider it to be a plural marker, even though the noun isn't technically declined. That's not the case anyway though, since 们 can only be used with a closed set of pronouns or in a limited way to refer to groups represented by the noun its attached to (in this sense it's more of a metonymic[2] marker rather than a plural marker). For example, 白宮们 can be used to translate "the White House" when it refers to the President and his administration, and cannot be used to mean "white houses".

[1] https://en.wikipedia.org/wiki/Productivity_(linguistics) [2] https://en.wikipedia.org/wiki/Metonymy


It is productive. It can't pluralize anything, but it can pluralize anything that refers to people and it is actively used in novel ways. I've seen someone refer to 美国的妈妈们; metonymy is not involved there. It just means "American mothers", as distinct from a hypothetical 美国的妈妈 "America's mother".

However, the other angle on this is that Mandarin pronouns have singular and plural forms (plurals using 们), and the use of the correct form is obligatory, which suffices to show that plurality exists in the language. Although it isn't the case that 们 is unproductive, even if it was unproductive that still wouldn't show that the language has no plurals.

> in this sense it's more of a metonymic marker rather than a plural marker). For example, 白宮们 can be used to translate "the White House" when it refers to the President and his administration, and cannot be used to mean "white houses"

I should note that this argument doesn't entirely hang together. You can make "the White House" explicitly plural in English by giving it a plural verb:

https://us.iasservices.org.uk/bidens-immigration-bill-propos...

> The White House have announced a comprehensive immigration reform proposal in a bill that has been sent to congress.

How would you say that differs from 白宫们? Does it refer to multiple houses?


When the comment you replied to mentioned "Chinese has gotten by for thousands of years without any plurals at all", I understood it to mean that Chinese has not featured any general system of marking plural by grammatical means[1], which is what is usually understood by the term "plural"[2], not that Chinese has no ability to express a more-than-one count distinction at all (which isn't the case in any language as far as I'm aware).

> It can't pluralize anything, but it can pluralize anything that refers to people and it is actively used in novel ways. I've seen someone refer to 美国的妈妈们; metonymy is not involved there.

It is productive in a limited sense in that way, but not as a general plural marker as you're arguing, and it's limited because 美国的妈妈们 means "American mothers" in that it necessarily refers to them as a collective group (which I argue is an instance of metonymy) rather than a set of more than one "American mother". For instance you cannot say *三个美国的妈妈们 to mean "three American mothers"; you must instead say 三个美国的妈妈 because 美国的妈妈们 can only ever refer to the entire collective group.

> I should note that this argument doesn't entirely hang together. You can make "the White House" explicitly plural in English by giving it a plural verb

This is a feature of UK English where collective nouns agree with plural forms of verbs. US English on the other hand, requires the singular form[3][4]. This has no bearing on how we analyze Chinese.

[1] https://en.wikipedia.org/wiki/Grammatical_number

[2] https://en.wikipedia.org/wiki/Plural

[3] https://victoryediting.com/collective-nouns/

[4] https://en.wikipedia.org/wiki/Collective_noun#Examples_of_me...


Yeah, that was what I meant by including "counting words" which I now remember are better known as measure words or classifiers.


I believe Slovenian is the only living official Slavic language that still retains this feature.

EDIT: I know that some dialects of coastal croatian still retain this feature. But there may be more


Not certain about Polish any more (it’s 15 years or so since I studied) but certainly Russian uses the genitive singular after numbers ending in 2, 3, and 4 (e.g. 02, 23, 34, but not the “teens”) and genitive plural for numbers ending in 5, 6, 7, 8, 9, 0). The endings here look the same.

Does Polish also use the same form for 20 as for 0 (i.e. 20 książek)? As I remember it does. If I remember correctly, Polish also differs from Russian in that it only uses the singular for 1, not for all numbers ending in 1 (except 11).

(Note for linguists, it’s not actually quite the genitive, but it’s close enough not to warrant its own case)


It's the same or similar in many if not all Slavic languages. Just goes to show true internationalization in software is almost impossible because you don't know all the rules in all the languages of the world. E.g. if you treat numbers as singular like English does you will have difficulty with Polish because you were unaware it has a special case for 2-4. And then you can have a third language that handles 2-7 differently.


You just do (p)ngettext. It works if care for it to work and proofread the interface instead of raw files.


> The moment you use a fraction, the assumption is that you would need to count all I guess, so it’s treated as individual objects.

I always explained it to myself that, when you use a fraction, you're focusing on that one incomplete object, also calling more attention to the individuality of objects in the set. So in case of say, "100.5 książki", I like to imagine shoving ~95 books into the box very quickly, then slowing down for the last 6 books, counting them off one by one, to know exactly when to stop and saw the 106th book in half.

IDK what the official justification is.


Interesting.

It's the opposite in Russian — it'd be "100½ książek" there (сто c половиной книжек), even though it's "½ książki" (половина книжки).

"½ książek" (половина книжек) is also valid, and it means "half of (all the) books".

But in "100½", it's the whole number that determines the ending (i.e. case) of the word.


FWIW, both "½ książek" and "½ książki" are valid Polish, but they mean different things:

- "½ książek" == "połowa książek" == "half of the books", and refers to half of some set, e.g. "połowa książek spadła z półki", "half of the books fell off the shelf". It gets tricky when the set turns out to contain only one book, but that's not spelling/grammar issue anymore.

- "½ książki" == "pół książki" or "połowa książki" == "half of the book", as in "pierwszap połowa tej książki jest nudna", "the first half of this book is boring". There's some difference between "pół" and "połowa" that makes them not interchangeable in most cases, but I don't feel confident enough to articulate a rule here.


Same in Russian and Ukrainian in that regard.


And it gets weirder. If you have 5 or more of something, you use the neuter singular form of verbs for it.

4 books were sitting on the shelf = Stały 5 książek na półce [feminine plural verb, which makes sense]

5 books were sitting on the shelf = Stało 5 książek na półce [neuter singular???]

As someone learning Polish this is quite confusing.


You have an error there (bad copy-paste?).

Should be "Stały 4 książki na półce".

Anyway, to make sense of the second one: treat "5 książek" as "5 of books", or "a 5-set of books"

— Four books were sitting on a shelf ("books" are plural, feminine)

— A five-set of books was sitting on a shelf ("five-set" is singular, neutral)

Now, why 5 becomes a set and 4 doesn't is not something I have a clue about.

But hope it helps grok how it affects the form of the noun being enumerated :)


We also have a separate case for missing something.

So it's "Mam 3 książki" (I have 3 books) but "Nie mam 3 książek" (I don't have 3 books) or "Brakuje 3 książek" (3 books are missing).




Consider applying for YC's Spring batch! Applications are open till Feb 11.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: