Free Software Directory talk:Free Software Directory (Savannah)
See User talk:Habs#Meta-data_import for the beginning of this discussion. --Adfeno (talk) 09:20, 16 February 2018 (EST)
One thing that might help the importer script avoid duplicates is the new Property:Package names, you currently have to set it manually for a given entry, like what is done in Sandbox revision 52983, then the importer script can query/ask for all the packages which have that property set, like what is done in FSD:Sandbox revision 53017. This property is supposed to hold a comma-and-a-space separated list of possible names that describe the package as generally as possible (for example: in a page/entry for GCC, then the name "gcc" would be a good candidate for insertion in Property:Package names, but "cpp" and "g++" despite also coming from GCC are too specific to a given part of it, and don't describe GCC as widely as possible. In the future, it would be a good idea to make the importer script work only for the pages which have that property set. Once Property:Package names is properly understood by the importer script, then it can do all sorts of things to determine which free/libre distro has the most updated version of a package. --Adfeno (talk) 09:20, 16 February 2018 (EST)
Another thing that the importer script will find easier to do once Property:Package names is correctly interpreted is to use some sort of regular expression in the raw MediaWiki text to remove or update the version and software prerequisite information, or simply forget about getting the prerequisite part from the FSD and insert it based on what the free/libre distro has. --Adfeno (talk) 09:20, 16 February 2018 (EST)
Finally, I noticed from a conversation I overheard/oversaw in IRC in the directory meeting for 2019-02-09, that checking for duplicate entries involves, among other things, checking if a page title is similar to another one, this is why I created Property:Package names. The current workflow without this property seems to have the caveat that page titles have to keep a name that isn't understandable in most cases, for example 7kaa (imported) has a more understandable name: Seven Kingdoms: Ancient Adversaries. --Adfeno (talk) 09:20, 16 February 2018 (EST)
Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.3 or any later version published by the Free Software Foundation; with no Invariant Sections, no Front-Cover Texts, and no Back-Cover Texts. A copy of the license is included in the page “GNU Free Documentation License”.
The copyright and license notices on this page only apply to the text on this page. Any software or copyright-licenses or other similar notices described in this text has its own copyright notice and license, which can usually be found in the distribution or license text itself.