Wikidata:Property proposal/unicode range
Unicode range
Originally proposed at Wikidata:Property proposal/Creative work
Represents | Unicode plane (Q10853148) |
---|---|
Data type | String |
Template parameter | unicode in en:Template:Infobox writing system |
Domain | item |
Allowed values | Syntax specified by the W3C's CSS Fonts Module Level 3, section 4.5. |
Example 1 | Braille (Q79894) → U+28?? |
Example 2 | ASCII (Q8815) → U+0000-U+007F |
Example 3 | Unicode (Q8819) → U+0-10FFFF |
Example 4 | Cyrillic (Q2493952) → U+0400-04FF |
Example 5 | A (Q9659) → U+41,U+61,U+249C,U+24B6,U+24D0,U+FF21,U+FF41,U+1F110,U+1F130,U+1F150,U+1F170 |
Example 6 | Latin script in Unicode (Q1046233) → U+0000-U+007F, U+0080-U+00FF, U+0100-U+017F, U+0180-U+024F, U+0250-U+02AF, U+02B0-U+02FF, U+1D00-U+1D7F, U+1D80-U+1DBF, U+1E00-U+1EFF, U+2070-U+209F, U+2100-U+214F, U+2150-U+218F, U+2C60-U+2C7F, U+A720-U+A7FF, U+AB30-U+AB6F, U+FB00-U+FB4F, U+FF00-U+FFEF |
Example 7 | Papyrus (Q4816790) → U+20-7E,U+A1-17F,U+189,U+192,U+1A0,U+1A1,U+1AF,U+1B0,U+1C0-1ED,U+1F0-1F5,U+1F8-21B,U+21E,U+21F,U+226-233,U+237,U+251,U+252,U+254,U+258,U+259,U+2BB-2BD,U+2C6-2CB,U+2D8-2DD,U+2EE,U+300-304,U+306-30F,U+311-314,U+316,U+317,U+31B,U+320,U+323-329,U+32C-331,U+338,U+340,U+341,U+343,U+344,U+34F,U+357,U+37E,U+385,U+387,U+3A9,U+3C0,U+4C0,U+4D4,U+4D5,U+1E00-1E9B,U+1EA0-1EF9,U+200B-200D,U+2010-2014,U+2018-2022,U+2026,U+2027,U+2030,U+2039,U+203A,U+203D,U+2044,U+204B-204F,U+2070,U+2071,U+2074-208E,U+2090-2094,U+20AB,U+20AC,U+2103,U+2109,U+2122,U+2126,U+212A,U+212B,U+2160-217F,U+2202,U+2206,U+2207,U+220F,U+2211-2213,U+2215,U+2216,U+2219,U+221A,U+221E,U+222B,U+2248,U+2260,U+2264,U+2265,U+25CA,U+F8FF |
Example 8 | U+20,U+2E,U+46,U+49,U+4D,U+4F,U+54,U+55,U+59,U+61-69,U+6B-70,U+72-77,U+79,U+7A |
Example 9 | arrow (Q1139519) → U+2190-21FF,U+2794,U+2798-27AF,U+27B1-27BE,U+27F0-27FF,U+2900-297F,U+2B00-2B11,U+2B30-2B4C,U+2B90,U+2B91,U+1F800-1F80B,U+1F810-1F847,U+1F850-1F859,U+1F860-1F887,U+1F890-1F8AD |
Source | unicode parameter in en:Template:Infobox writing system |
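The "Allowed values" row above points to the CSS unicode-range syntax. As a rough illustration of how such a value could be read by a program, here is a minimal Python sketch. The function name parse_unicode_range is hypothetical and not part of any Wikidata or CSS tooling. It assumes the three token forms used in the examples: a single code point (U+41), a wildcard (U+28??) and an interval (U+0400-04FF).

```python
import re

# One token: single code point, wildcard with trailing "?", or interval.
# The pattern is this sketch's own reading of the syntax, not an official grammar.
_TOKEN = re.compile(
    r"U\+(?:"
    r"(?P<single>[0-9A-F]{1,6})"
    r"|(?P<wild>[0-9A-F]{0,5}\?{1,6})"
    r"|(?P<start>[0-9A-F]{1,6})-(?P<end>[0-9A-F]{1,6})"
    r")",
    re.IGNORECASE,
)


def parse_unicode_range(value):
    """Return a list of inclusive (first, last) code point pairs."""
    ranges = []
    for token in value.split(","):
        token = token.strip()
        m = _TOKEN.fullmatch(token)
        if m is None:
            raise ValueError(f"not a unicode-range token: {token!r}")
        if m.group("wild") is not None:
            wild = m.group("wild")
            if len(wild) > 6:
                raise ValueError(f"wildcard too long: {token!r}")
            # "?" stands for any hex digit, so 0 gives the low end and F the high end.
            first = int(wild.replace("?", "0"), 16)
            last = int(wild.replace("?", "F"), 16)
        elif m.group("single") is not None:
            first = last = int(m.group("single"), 16)
        else:
            first = int(m.group("start"), 16)
            last = int(m.group("end"), 16)
        if first > last or last > 0x10FFFF:
            raise ValueError(f"range out of order or beyond U+10FFFF: {token!r}")
        ranges.append((first, last))
    return ranges
```

Tokens that repeat the prefix on the end point (U+0080-U+00FF) are rejected here, on the assumption that the CSS grammar writes intervals as U+0080-00FF.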
Motivation
For typesetters and programmers, it regularly comes in handy to know the Unicode range of a language, script, font, Unicode block, text or specific set of symbols. Luckily, there is already a standard for defining ranges in CSS. I propose to use exactly this standard, since it can represent a variety of things: single code points, ranges, wildcard ranges, and any combination of these. For most scripts it tends to be human-readable (see en:Hiragana Infobox), and it is machine-readable. There are tools to convert lists of characters to a unicode-range definition. --Shisma (talk)
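The conversion mentioned above, from a list of characters to a unicode-range definition, can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not one of the existing tools: the name to_unicode_range is hypothetical, and the choice to merge any run of two or more consecutive code points into an interval is this sketch's own.

```python
def to_unicode_range(text):
    """Build a unicode-range style value covering every character in text."""
    # Deduplicate and sort the code points that occur in the text.
    points = sorted({ord(ch) for ch in text})

    # Collapse consecutive code points into [first, last] runs.
    runs = []
    for cp in points:
        if runs and cp == runs[-1][1] + 1:
            runs[-1][1] = cp
        else:
            runs.append([cp, cp])

    # Emit single code points as U+X and runs as U+X-Y.
    parts = []
    for first, last in runs:
        if first == last:
            parts.append(f"U+{first:X}")
        else:
            parts.append(f"U+{first:X}-{last:X}")
    return ",".join(parts)
```

For instance, calling to_unicode_range on the string "Aa" gives two single code points, while a string of consecutive letters such as "abc" collapses into one interval.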
Discussion
- Support John Samuel (talk) 11:34, 15 September 2018 (UTC)
- Support --Giovanni Alfredo Garciliano Díaz ★ diskutujo 02:39, 16 September 2018 (UTC)
- Support Cwf97 (talk) 11:22, 17 September 2018 (UTC)
- Comment why is datatype monolingual text here? Shouldn't it be string datatype (this is an encoding)? ArthurPSmith (talk) 15:51, 17 September 2018 (UTC)
- Fair enough (fixed) --Shisma (talk) 17:22, 17 September 2018 (UTC)
- Oppose I think the scope should be limited. --- Jura 16:46, 25 September 2018 (UTC)
- @Shisma: I tend to agree with Jura, I am not sure if this property should really be used on poems for instance. Marking as not ready for now, feel free to add the flag back once this issue is resolved. − Pintoch (talk) 06:59, 26 September 2018 (UTC)
- I agree with you. I don't know why I put that in there. I propose to limit the scope to writing systems, encodings, fonts and symbols. What do you think? @Pintoch: @Jura: --Shisma (talk) 17:52, 26 September 2018 (UTC)
- Sounds better. --- Jura 16:34, 30 September 2018 (UTC)
- Support. YULdigitalpreservation (talk) 14:11, 2 October 2018 (UTC)
- Support Robin van der Vliet (talk) (contribs) 21:43, 3 October 2018 (UTC)
@YULdigitalpreservation, Jura1, Shisma, Robin van der Vliet: @Jsamwrites, ArthurPSmith: Done: Unicode range (P5949) − Pintoch (talk) 12:33, 5 October 2018 (UTC)