Cleaning via bot: Naming convention has inadequacies (field separator)
Open, HighPublic
Actions

Assigned To

Authored By

	Yug
	Dec 13 2021, 7:38 PM

Description

Current state : LinguaLibre's naming convention is largely based on - as the main field separator.
( and ) are also field informers.

Review

LinguaLibre Queries Services let you see the current situation.

List of speaker by name and presense of - as separator :

sparql
SELECT *
WHERE {
  ?id prop:P2 entity:Q3 .
  SERVICE wikibase:label {
    bd:serviceParam wikibase:language "[AUTO_LANGUAGE],en" .
    ?id rdfs:label ?name .
  }
  BIND (regex(STR(?name),"-") AS ?has_separator)
}
ORDER BY DESC (?has_separator)

How many:

sparql
SELECT ?has_separator (COUNT(?has_separator) AS ?found)
WHERE {
  ?id prop:P2 entity:Q3 .
  SERVICE wikibase:label {
    bd:serviceParam wikibase:language "[AUTO_LANGUAGE],en" .
    ?id rdfs:label ?name .
  }
  
  BIND (regex(STR(?name),"-") AS ?has_separator)
  # filter( regex(?name, "-" ))
}
#ORDER BY DESC (?has_separator)
GROUP BY (?has_separator)

51/1000 (5%) of the speakers' username contain -, which makes regex on their files more unpredictable. A better field separator would be welcome.

Suggestion

On peut/doit prendre un plus rare, a minima qui n'est pas un des caractères présents sur nos claviers.
U+FF0D － FULLWIDTH HYPHEN-MINUS LL－Q150 (fra)－Roll-Morton－vert.wav
Ou doubler le séparateur, pour créer une pratique unique. LL--Q150 (fra)--Roll-Morton--vert.wav

Related Objects
Search...

		Status	Subtype	Assigned	Task
		Open		None	T298412 Cleaning via bot: Vast bot-lead clean up of files storage system (Lili + Commons)
		Open		Yug	T297635 Cleaning via bot: Naming convention has inadequacies (field separator)

Event Timeline

Yug created this task.Dec 13 2021, 7:38 PM

Yug updated the task description. (Show Details)Dec 13 2021, 7:59 PM

Yug updated the task description. (Show Details)

Yug updated the task description. (Show Details)Dec 13 2021, 9:59 PM

Yug mentioned this in T298412: Cleaning via bot: Vast bot-lead clean up of files storage system (Lili + Commons).Dec 31 2021, 5:28 PM

Yug updated the task description. (Show Details)Dec 31 2021, 7:00 PM

Yug updated the task description. (Show Details)

Yug triaged this task as High priority.Jul 6 2022, 11:20 AM

Yug renamed this task from Naming convention has inadequacies (field separator) to Cleaning via bot: Naming convention has inadequacies (field separator).Jul 7 2022, 8:22 PM

Yug claimed this task.Fri, Nov 15, 2:03 PM

Yug added a parent task: T298412: Cleaning via bot: Vast bot-lead clean up of files storage system (Lili + Commons).

Cleaning via bot: Naming convention has inadequacies (field separator)Open, HighPublicActions