Wikidata:Property proposal/unicode range

Unicode range

edit

Originally proposed at Wikidata:Property proposal/Creative work

   Done: Unicode range (P5949) (Talk and documentation)
RepresentsUnicode plane (Q10853148)
Data typeString
Template parameterunicode in en:Template:Infobox writing system
Domainitem
Allowed valuesSyntax specified by W3Cs CSS Fonts Module Level 3 Chapter 4.5.
Example 1Braille (Q79894)U+28??
Example 2ASCII (Q8815)U+0000-U+062
Example 3Unicode (Q8819)U+0-10FFFF
Example 4Cyrillic (Q2493952)U+0400-04FF
Example 5A (Q9659)U+41,U+61,U+249C,U+24B6,U+24D0,U+FF21,U+FF41,U+1F110,U+1F130,U+1F150,U+1F170
Example 6Latin script in Unicode (Q1046233)0000-U+007F, U+0080-U+00FF, U+0100-U+017F, U+0180-U+024F, U+0250-U+02AF, U+02B0-U+02FF,U+1D00-U+1D7F, U+1D80-U+1DBF, U+1E00-U+1EFF, U+2070-U+209F, U+2100-U+214F, U+2150-U+218F, U+2C60-U+2C7F, U+A720-U+A7FF, U+AB30-U+AB6F, U+FB00-U+FB4F, U+FF00-U+FFEF
Example 7Papyrus (Q4816790)U+20-7E,U+A1-17F,U+189,U+192,U+1A0,U+1A1,U+1AF,U+1B0,U+1C0-1ED,U+1F0-1F5,U+1F8-21B,U+21E,U+21F,U+226-233,U+237,U+251,U+252,U+254,U+258,U+259,U+2BB-2BD,U+2C6-2CB,U+2D8-2DD,U+2EE,U+300-304,U+306-30F,U+311-314,U+316,U+317,U+31B,U+320,U+323-329,U+32C-331,U+338,U+340,U+341,U+343,U+344,U+34F,U+357,U+37E,U+385,U+387,U+3A9,U+3C0,U+4C0,U+4D4,U+4D5,U+1E00-1E9B,U+1EA0-1EF9,U+200B-200D,U+2010-2014,U+2018-2022,U+2026,U+2027,U+2030,U+2039,U+203A,U+203D,U+2044,U+204B-204F,U+2070,U+2071,U+2074-208E,U+2090-2094,U+20AB,U+20AC,U+2103,U+2109,U+2122,U+2126,U+212A,U+212B,U+2160-217F,U+2202,U+2206,U+2207,U+220F,U+2211-2213,U+2215,U+2216,U+2219,U+221A,U+221E,U+222B,U+2248,U+2260,U+2264,U+2265,U+25CA,U+F8FF
Example 8Fever (Q47575674)U+20,U+2E,U+46,U+49,U+4D,U+4F,U+54,U+55,U+59,U+61-69,U+6B-70,U+72-77,U+79,U+7A
Example 9arrow (Q1139519)U+2190-21FF,U+2794,U+2798-27AF,U+27B1-27BE,U+27F0-27FF,U+2900-297F,U+2B00-2B11,U+2B30-2B4C,U+2B90,U+2B91,U+1F800-1F80B,U+1F810-1F847,U+1F850-1F859,U+1F860-1F887,U+1F890-1F8AD
Sourceunicode parameter in en:Template:Infobox writing system

Motivation

edit

For typesetters and programmers it regulary comes in handy to know the unicode range of a language, script, font, unicode block, text or specific set of symbols. Luckily there already is a standard to define ranges in CSS. I propose to use exactly this standard since it can represent a variety of things: single codepoints, ranges, wildcard ranges and all of the above combined. For most scripts it tends to be human readable (see en:Hiragana Infobox) and it is machine readable. There are tools to convert lists of characters to a unicode-range definition. --Shisma (talk)

Discussion

edit

@YULdigitalpreservation, Jura1, Shisma, Robin van der Vliet: @Jsamwrites, ArthurPSmith:   Done: Unicode range (P5949)Pintoch (talk) 12:33, 5 October 2018 (UTC)[reply]