Hi, could you please update the formatter URL for mix-n-match catalog #4744? It is currently https://www.wtatennis.com/players/$1 but should be https://www.wtatennis.com/players/$1/-. Thanks.
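For anyone following along, here is a rough sketch of what the change means in practice, assuming the formatter URL is expanded by simply substituting the entry's external ID for `$1`. The function name `expand_formatter` and the ID `12345` are made up for illustration; this is not Mix'n'match's actual code.

```python
def expand_formatter(formatter_url: str, external_id: str) -> str:
    """Substitute the external ID for the $1 placeholder (assumed behaviour)."""
    return formatter_url.replace("$1", external_id)


# Hypothetical ID, only to show the difference between the two formatters.
current = expand_formatter("https://www.wtatennis.com/players/$1", "12345")
requested = expand_formatter("https://www.wtatennis.com/players/$1/-", "12345")

print(current)    # https://www.wtatennis.com/players/12345
print(requested)  # https://www.wtatennis.com/players/12345/-
```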
Topic on User talk:Magnus Manske/Structured Discussions Archive 1
An update is also needed for https://mix-n-match.toolforge.org/#/catalog/652 (completely matched as of now). The URL should be https://www.hoc.gr/el/node/$1, and that is indeed what I see when I hover over the link, but when I click it, it becomes https://www.hoc.grel/node/$1 (very strange). Thanks!
The link in the catalog is actually http, not https. They have a redirect on their site, but it's missing a /. So nothing strange, just a bogus untested setup at the HOC :)
So the present URL in MnM is http://www.hoc.gr/el/node/$1, and it should become https://www.hoc.gr/el/node/$1 to avoid the faulty redirect on the HOC site. Thanks @Nono314 for helping me understand this!
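To make the missing-slash explanation concrete, here is a minimal sketch of one way such a redirect could go wrong. How the HOC server is really configured is unknown; `naive_https_redirect` is a hypothetical function and `123` a made-up node ID. It only shows that joining the host and a path without its leading slash gives the URL seen after clicking.

```python
from urllib.parse import urlsplit


def naive_https_redirect(url: str) -> str:
    """Build an https redirect target, but drop the slash between host and path.

    This reproduces the suspected bug on purpose; it is an assumption about
    the cause, not the site's actual redirect rule.
    """
    parts = urlsplit(url)
    return "https://" + parts.netloc + parts.path.lstrip("/")


print(naive_https_redirect("http://www.hoc.gr/el/node/123"))
# https://www.hoc.grel/node/123
```

Linking to the https URL directly means the redirect is never triggered, which is why changing the scheme in the catalog is enough.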
Having thought about this a bit more, I think the simplest solution is to give catalog creators (and admins) the ability to purge all items from a catalog so they can re-scrape/re-upload them. Also, in the "Create a new web scraper/catalog" menu, if an existing catalog ID is entered, the menu would ideally auto-fill with the scraper settings last used, so any tweaks could be made easily when re-scraping rather than needing to start from scratch.