Property talk:P2091


Documentation

World Rowing ID
identifier for a rower in the database of World Rowing (FISA)
Associated item: World Rowing (Q684885)
Applicable "stated in" value: World Rowing athlete database (Q21008628)
Data type: External identifier
Corresponding template: not applicable
Allowed values: [1-9]\d{0,4}| (numeric string with 1 to 5 digits; or novalue)
Example:
Hamish Bond (Q1321253): 20096
Kim Brennan (Q576663): 24665
Kristof Wilke (Q64501): 20438
Miroslava Knapková (Q435707): 13214
Source: https://www.worldrowing.com/
External links: Use in sister projects: [ar][de][en][es][fr][he][it][ja][ko][nl][pl][pt][ru][sv][vi][zh][commons][species][wd][en.wikt][fr.wikt].
Formatter URL: https://worldrowing.com/athlete/$1
Tracking: same: Category:World Rowing ID same as Wikidata (Q25350932)
Tracking: differences: Category:World Rowing ID different from Wikidata (Q25350933)
Tracking: usage: Category:World Rowing template using Wikidata (Q24689919)
Tracking: local yes, WD no: Category:World Rowing ID not in Wikidata (Q25350930)
See also: World Rowing UUID (P8990)
Proposal discussion
Current uses:
Total: 17,741
Main statement: 14,778 out of 51,775 (29% complete); 83.3% of uses
Qualifier: 4; <0.1% of uses
Reference: 2,959; 16.7% of uses
Explanations

World Rowing ID (P2091) is a simple authority control property to link Wikidata items to entries in the World Rowing athlete database (Q21008628). This database is operated by World Rowing (Q684885) (formerly known as FISA), the international governing body of rowing (Q159354), and contains the most complete set of data about international rowing persons and rowing events which is publicly available.

On 17 December 2020, World Rowing (Q684885) relaunched their website and issued a new Universally Unique Identifier (Q195284) for all rowers; these may be found in the Wikidata property World Rowing UUID (P8990).

On 9 January 2021, World Rowing (Q684885) returned to using the previous numeric identifier (Q93868746) for all rowers; these may be found in the Wikidata property World Rowing ID (P2091).

Add this property

To use this property, you need to determine the identifier in the World Rowing database: let’s consider the example of the Olympic champion Kim Brennan (Q576663); the profile about her can be found at:

http://web.archive.org/web/20201004003834/http://www.worldrowing.com/athletes/athlete/24665/
https://www.worldrowing.com/athlete/24665/

The trailing number in the URL path (24665; highlighted in yellow on the original page) is the identifier to use within this property, while the remaining part of the URL is defined on the property page World Rowing ID (P2091) using the formatter URL (P1630) property.
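As a rough illustration of the step above, the following sketch pulls the numeric identifier out of a profile URL. It covers only the two URL layouts shown in the example; the helper name extract_world_rowing_id and the regular expression are assumptions made for this sketch, not part of any Wikidata tool.

```python
import re
from typing import Optional

# Matches the old layout (.../athletes/athlete/<id>/) and the current
# layout (.../athlete/<id>/). The pattern is an assumption for this sketch;
# the digit part follows the property's format constraint [1-9]\d{0,4}.
PROFILE_URL = re.compile(
    r"worldrowing\.com/(?:athletes/)?athlete/([1-9]\d{0,4})/?$"
)


def extract_world_rowing_id(url: str) -> Optional[str]:
    """Return the numeric identifier at the end of a profile URL, or None."""
    match = PROFILE_URL.search(url.strip())
    return match.group(1) if match else None
```

URLs that end in something other than a 1 to 5 digit number (for example a UUID-style profile address) yield None, so they are not mistaken for a P2091 value.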

FISA maintains a well-kept database with data on around 50 000 persons (as of February 2019); however, in an estimated 1–2% of the cases there are multiple entries for the same person. This means that one needs to add multiple values for this property within an item, which in turn violates the single-value constraint (Q19474404) defined by a property constraint (P2302) statement on Property:P2091. In case of multiple IDs for one person, qualify each identifier with a subject named as (P1810) qualifier using the database entry title (do not use the all-uppercase family names from the database). We want to avoid adding exception to constraint (P2303) qualifiers to the single-value constraint (Q19474404).

In case of multiple values it is also important to set proper ranks for the individual claims. If all important information about a person is contained within a single profile out of the available profiles, then mark the claim for that profile with preferred rank. There are cases, however, in which this is not possible because the information is distributed across multiple important profiles; in that case, all of them should keep normal rank. Deprecated rank should not be used at all for any value that has been valid in the past.
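The qualifier and rank rules above can be summarised in a small sketch. The claim dictionaries below are a simplified, hypothetical structure (not the Wikibase JSON format), the function name build_claims is invented for illustration, and the identifiers in the usage example are made up.

```python
from typing import Dict, List, Optional


def build_claims(profiles: Dict[str, str], primary: Optional[str] = None) -> List[dict]:
    """Sketch of the rules above.

    profiles maps each identifier to its database entry title.
    primary is the one profile holding all important information, if any.
    """
    if primary is not None and primary not in profiles:
        raise ValueError("primary must be one of the given identifiers")
    multiple = len(profiles) > 1
    claims = []
    for identifier, title in profiles.items():
        # Preferred rank only makes sense when there are several values;
        # deprecated rank is never assigned here.
        is_preferred = multiple and identifier == primary
        claim = {"value": identifier, "rank": "preferred" if is_preferred else "normal"}
        if multiple:
            # Qualify each value with "subject named as" (P1810).
            claim["qualifiers"] = {"P1810": title}
        claims.append(claim)
    return claims
```

For instance, build_claims({"11111": "Jane Doe", "22222": "Jane Doe-Smith"}, primary="11111") marks the first value as preferred and leaves the second at normal rank; omitting primary leaves both at normal rank.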

Please also add empty profiles which often do not contain much more than a person’s name. We do not know whether FISA will start to add content to database entries which are still (almost) empty.

Use this property

Within Wikidata this property is used to link against FISA’s database and to identify multiple items per person (via the distinct-values constraint (Q21502410), defined by a property constraint (P2302) statement on Property:P2091). Identifiers can also be queried using the common query tools for applications outside Wikimedia.
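As a hedged example of querying the identifiers from outside Wikimedia, the sketch below builds a SPARQL query that lists P2091 values used on more than one item (the situation the distinct-values constraint flags) and the corresponding request URL for the Wikidata Query Service. The function names are assumptions for this sketch, and nothing is sent over the network here.

```python
from urllib.parse import urlencode

# Public endpoint of the Wikidata Query Service.
WDQS_ENDPOINT = "https://query.wikidata.org/sparql"


def duplicate_values_query(limit: int = 100) -> str:
    """SPARQL listing World Rowing IDs that appear on more than one item."""
    return (
        "SELECT ?value (COUNT(?item) AS ?items) WHERE {\n"
        "  ?item wdt:P2091 ?value .\n"
        "}\n"
        "GROUP BY ?value\n"
        "HAVING (COUNT(?item) > 1)\n"
        f"LIMIT {int(limit)}"
    )


def request_url(query: str) -> str:
    """Build a GET URL asking the endpoint for JSON results."""
    return WDQS_ENDPOINT + "?" + urlencode({"query": query, "format": "json"})
```

The resulting URL can be fetched with any HTTP client; a descriptive User-Agent header is advisable when doing so.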

On Wikipedias, there are typically templates available (such as Template:World Rowing (Q14334032) and others, see {{ExternalUse}} template on Property talk:P2091) which either store the same identifier locally within the Wikipedia, or take the data from Wikidata by using the {{#property:P2091}} or {{#statements:P2091}} parser function or more sophisticated functions of Module:Wikidata (Q12069631) and others. In case of the existence of a local value, one can also compare it to the Wikidata value and make use of maintenance categories, which are linked from the {{Property documentation}} template on Property talk:P2091. However, be aware that in case of an item with multiple claims, of which none is the primary one marked with preferred rank, you also might get multiple values if you pull this information from Wikidata.
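The comparison described above might look roughly like the following sketch. It first picks the "best" Wikidata values (preferred-rank claims if any exist, otherwise normal-rank ones) and then chooses a maintenance category. The category names are taken from the documentation box on this page, but the decision logic and the function names are assumptions, since each local template implements this in its own way.

```python
from typing import List, Optional, Tuple

Claim = Tuple[str, str]  # (identifier value, rank)


def best_values(claims: List[Claim]) -> List[str]:
    """Preferred-rank values if there are any, else all normal-rank values."""
    preferred = [value for value, rank in claims if rank == "preferred"]
    if preferred:
        return preferred
    return [value for value, rank in claims if rank == "normal"]


def tracking_category(local: Optional[str], wikidata_values: List[str]) -> Optional[str]:
    """Pick a maintenance category from a local value and the Wikidata values."""
    if local is None:
        # No local value: the template relies on Wikidata, if it has data.
        return "World Rowing template using Wikidata" if wikidata_values else None
    if not wikidata_values:
        return "World Rowing ID not in Wikidata"
    if local in wikidata_values:
        return "World Rowing ID same as Wikidata"
    return "World Rowing ID different from Wikidata"
```

Note that best_values returns several identifiers when an item has multiple normal-rank claims and no preferred one, which is exactly the multiple-values case the paragraph above warns about.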

See also

This property is maintained by WikiProject Rowing.
Distinct values: this property likely contains a value that is different from all other items. (Help)
List of violations of this constraint: Database reports/Constraint violations/P2091#Unique value, hourly updated report, SPARQL (every item), SPARQL (by value)
Single best value: this property generally contains a single value. If there are several, one would have preferred rank (Help)
Exceptions are possible as rare values may exist. Exceptions can be specified using exception to constraint (P2303).
List of violations of this constraint: Database reports/Constraint violations/P2091#single best value, SPARQL
Format “[1-9]\d{0,4}|”: value must be formatted using this pattern (PCRE syntax). (Help)
List of violations of this constraint: Database reports/Constraint violations/P2091#Format, hourly updated report, SPARQL
Scope is as main value (Q54828448), as reference (Q54828450): the property must be used in the specified way only (Help)
List of violations of this constraint: Database reports/Constraint violations/P2091#Scope, hourly updated report, SPARQL
Allowed entity types are Wikibase item (Q29934200): the property may only be used on a certain entity type (Help)
Exceptions are possible as rare values may exist. Exceptions can be specified using exception to constraint (P2303).
List of violations of this constraint: Database reports/Constraint violations/P2091#Entity types
Qualifiers “subject named as (P1810), reason for deprecated rank (P2241), reason for preferred rank (P7452)”: this property should be used only with the listed qualifiers. (Help)
List of violations of this constraint: Database reports/Constraint violations/P2091#allowed qualifiers, hourly updated report, SPARQL
Item “instance of (P31): human (Q5)”: Items with this property should also have “instance of (P31): human (Q5)”. (Help)
List of violations of this constraint: Database reports/Constraint violations/P2091#Item P31, hourly updated report, search, SPARQL
Item “sport (P641): rowing (Q159354), para rowing (Q1575010)”: Items with this property should also have “sport (P641): rowing (Q159354), para rowing (Q1575010)”. (Help)
Exceptions are possible as rare values may exist. Exceptions can be specified using exception to constraint (P2303).
List of violations of this constraint: Database reports/Constraint violations/P2091#Item P641, search
Item “country of citizenship (P27)”: Items with this property should also have “country of citizenship (P27)”. (Help)
List of violations of this constraint: Database reports/Constraint violations/P2091#Item P27, hourly updated report, search, SPARQL
Item “World Rowing UUID (P8990)”: Items with this property should also have “World Rowing UUID (P8990)”. (Help)
Exceptions are possible as rare values may exist. Exceptions can be specified using exception to constraint (P2303).
List of violations of this constraint: Database reports/Constraint violations/P2091#Item P8990, search, SPARQL
Label required in languages: en: Entities using this property should have labels in one of the following languages: en (Help)
Exceptions are possible as rare values may exist. Exceptions can be specified using exception to constraint (P2303).
List of violations of this constraint: Database reports/Constraint violations/P2091#Label in 'en' language, search, SPARQL

Please notify projects that use this property before big changes (renaming, deletion, merge with another property, etc.)

External reference error reports

Error reports
Contact:

World Rowing (Q684885), results and athletes database manager P. Gumbar, (contact details at FISA)

Ideally this information should be moved to the item for the reference and be transcluded from there.
Modeling of errors in Wikidata:

use of ranks to deal with multiple P2091 values; FISA does not leave redirects on merged profiles, thus fixed values are deleted from Wikidata

Reports:
Error type: duplicate
Item(s) affected: 72 items with multiple identifiers: Matthew Pinsent (Q52680), Xeno Müller (Q74523), Sébastien Vieilledent (Q260993), Florian Eichner (Q370040), Marnie McBean (Q445381), Ute Wild (Q526551), Ramona Balthasar (Q526623), Chira Apostol (Q607373), Karl-Heinz Bußert (Q827332), Wyatt Allen (Q924726), Tranquilo Cappozzo (Q978765), Claudia Waldi (Q1097862), Gereon Schürmann (Q1472789), Matthias Ungemach (Q1910368), Marleen van Rij (Q2117239), Liesbeth Pascal-de Graaff (Q2136299), Myriam van Rooyen-Steenman (Q2435986), Jan van der Horst (Q2487957), Roline Repelaer van Driel (Q2539102), Camelia Macoviciuc-Mihalcea (Q2934732), Heather McDermid (Q3129088), Shane O'Brien (Q3481372), Stoyanka Kurbatova (Q3657235), Piero Carletto (Q3903196), Venjamin But (Q4100132), Olena Pukhaieva (Q4384499), Yelena Khloptseva (Q4498176), Ann Redgrave (Q4766637), Anton Chermashentsev (Q4775706), Antonina Makhina (Q4776350), David Weightman (Q5240919), Elena Horvat (Q5359034), Hrihoriy Dmytrenko (Q5923673), Ihor Yemchuk (Q5994809), Jeanne Flanagan (Q6172080), Maricica Țăran (Q6762573), Mărioara Trașcă (Q6765036), Mykola Dovhan (Q6947571), Nikolay Kuznetsov (Q7036273), Olga Pivovarova (Q7086659), Peter Dignan (Q7173671), Vasil Yakusha (Q7916580), Yelena Matiyevskaya (Q8051488), Viorica Ioja (Q8273593), Fabrizio Güttinger (Q9257536), Grażyna Różańska (Q9276092), Hakan Özcan (Q9285028), Ryszard Giło (Q9324697), Sándor Baka (Q9352799), Urszula Niebrzydowska (Q9367194), Henrique Baixinho (Q10294712), Ronaldo Vargas (Q10364379), Kazushige Ura (Q11558181), Jolmer van der Sluis (Q11728132), Leonid Shaposhnykov (Q12171103), Mette Bloch (Q12326816), Nikolay Aksyonov (Q15715592), Jutta Abromeit (Q16001456), Svend Ove Pedersen (Q16018555), Sverre Norberg (Q16185502), Hans Lycklama (Q16186769), Mykola Chupryna (Q16194478), Robyn Grey-Gardner (Q16195555), Huang Cheng (Q16221786), Céline Garcia (Q16635805), Sigitas Kučinskas (Q16669299), Carolina Schiffmacher (Q16940769), Valeriy Dosenko (Q18238914), Reet Palm-Killing (Q19682448), Bendegúz Pétervári-Molnár (Q24331148), Claire Akossiwa (Q26252407), Zhang Min (Q26267551)
Description/Duplicates: possible duplicates in external database; update process stalled at the moment
Exception:
Reported: via email on 2016/12/29 by en:User:Schwede66
Resolved (also unpublished): 3 items: Vasil Yakusha (Q7916580), Venjamin But (Q4100132), Claire Akossiwa (Q26252407)
Resolved and published: (see resolved)
Wikidata updated: (see resolved; Zhang Min (Q26267551) was fixed at WD without update at FISA)

Discussion

Constraint:Format

@Ivan A. Krestinin, Pasleim, Sjoerddebruin: The three of you changed the pattern of Template:Constraint:Format from \d{1,5} to [0-9]\d{0,4} [1], [2] and removed the mandatory=true parameter [3]. This was obviously done because something went wrong with the associated covi page update last night. Just out of curiosity: what’s the problem with \d{1,5}? For instance, Template:Constraint:Format gives an example which is almost the same… (I’m kind of a beginner with regular expressions) Thanks in advance, MisterSynergy (talk) 08:03, 29 September 2015 (UTC)

I only removed the mandatory=true part to make Wikidata:Database reports/Constraint violations/Mandatory constraints/Violations accessible for everyone again. Sjoerd de Bruin (talk) 08:06, 29 September 2015 (UTC)
Oops! Okay, can I make it mandatory again once the number of violations on the property covi page (Wikidata:Database_reports/Constraint_violations/P2091#Format, currently it shows “Violations count: 5232”) is small or zero after a future update? —MisterSynergy (talk) 08:14, 29 September 2015 (UTC)
The pattern was /d{1,5} with a slash instead of a backslash. That's why so many violations popped up. [0-9]\d{0,4} is similar to \d{1,5} but it prevents a leading 0. The mandatory=true can be added as soon as the violation count is down to 0.--Pasleim (talk) 08:34, 29 September 2015 (UTC)
Oh I see. It’s [1-9]… and not [0-9]…. Apart from that, I’ve already noticed the silly slash/backslash mistake. Thanks for your answers, this is resolved for me now. —MisterSynergy (talk) 08:42, 29 September 2015 (UTC)
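
For readers following this thread, a small sketch of how the three patterns mentioned above behave. It uses Python's re module rather than the PCRE engine behind the constraint reports, which should make no difference for patterns this simple; the helper name is_valid is an assumption.

```python
import re

SLASH_TYPO = re.compile(r"/d{1,5}")    # the accidental pattern with a slash
ANY_DIGITS = re.compile(r"\d{1,5}")    # 1 to 5 digits, leading zero allowed
NO_LEADING_ZERO = re.compile(r"[1-9]\d{0,4}")  # 1 to 5 digits, first digit 1-9


def is_valid(pattern: "re.Pattern[str]", value: str) -> bool:
    """True if the whole value matches the pattern."""
    return pattern.fullmatch(value) is not None
```

With these definitions, is_valid(SLASH_TYPO, "20096") is False, because "/d" is a literal slash followed by the letter d, which is why every real identifier was reported as a violation. is_valid(ANY_DIGITS, "02466") is True, while is_valid(NO_LEADING_ZERO, "02466") is False.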

Contact with FISA regarding duplicates & database errors

Info: In December 2016 User:Schwede66 from English Wikipedia was in contact with FISA via email (database engineer P. Gumbar) regarding duplicates in their database (related conversation 1, related conversation 2). They appear to be willing to receive corrections, but no changes have taken effect as of today. —MisterSynergy (talk) 14:32, 30 May 2017 (UTC)

Info: en:Wikipedia:WikiProject Rowing/FISA database errors lists lots of further apparent database errors. Most of them have not yet been reported to FISA. —MisterSynergy (talk) 10:26, 12 February 2019 (UTC)

Three days ago, WorldRowing launched a new website. Unfortunately the old numeric athlete identifiers seem to be invalid now, and have been replaced with pseudo-random UUID identifiers. Since the new website still seems to have some problems, we are going to wait a few more days before deciding what to do. —MisterSynergy (talk) 20:56, 20 December 2020 (UTC)

There is now a proposal for a new identifier property at Wikidata:Property proposal/World Rowing ID. The new one should take the new identifiers which I can import in bulk, while P2091 can remain unchanged. Please add some support to the property proposal. —MisterSynergy (talk) 14:05, 21 December 2020 (UTC)
The new identifier property is now available at World Rowing UUID (P8990). I am going to import lots of new identifiers with that property later today. —MisterSynergy (talk) 13:46, 29 December 2020 (UTC)

new formatter URL

Now that the old links no longer work, there are several options for setting the preferred-rank formatter URL (P1630) value of this property:

  • keep the original http://www.worldrowing.com/athletes/athlete/$1/ – results in dead links
  • direct to the latest web archive snapshot with https://web.archive.org/web/20201217000000/http://www.worldrowing.com/athletes/athlete/$1/ – not sure whether all profiles were saved there
  • use https://msbits.toolforge.org/worldrowing/index.php?numericId=$1 – a redirect service to load the current profile at worldrowing.com using the old numeric identifier; does not work for ~2000 entries which are missing in the new database
  • do not use any formatter URL, so that the identifiers would be displayed as text only with no link

Any preferences? The selection determines which sort of links are being displayed in the Wikidata web UI, and potentially also by templates in the projects if they use this information.
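
To make the options above easier to compare, here is a sketch that expands each candidate formatter URL for a given identifier by substituting $1. The dictionary keys and the function name are invented for this illustration; the URLs are the ones listed above.

```python
from typing import Dict, Optional

# Candidate formatter URLs from the list above; None means "no formatter URL".
CANDIDATES: Dict[str, Optional[str]] = {
    "original": "http://www.worldrowing.com/athletes/athlete/$1/",
    "web_archive": (
        "https://web.archive.org/web/20201217000000/"
        "http://www.worldrowing.com/athletes/athlete/$1/"
    ),
    "redirect_service": "https://msbits.toolforge.org/worldrowing/index.php?numericId=$1",
    "no_formatter": None,
}


def expand(formatter: Optional[str], identifier: str) -> Optional[str]:
    """Replace $1 with the identifier; None yields no link (plain text only)."""
    if formatter is None:
        return None
    return formatter.replace("$1", identifier)


def links_for(identifier: str) -> Dict[str, Optional[str]]:
    """Resulting link for every candidate, keyed by option name."""
    return {name: expand(fmt, identifier) for name, fmt in CANDIDATES.items()}
```

Calling links_for("24665") shows what the Wikidata UI and consuming templates would link to under each option.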

User:MisterSynergy User:Kamelox User:Vanbasten_23 User:Schwede66

Notified participants of WikiProject Rowing. MisterSynergy (talk) 20:17, 30 December 2020 (UTC)