From Free Software Directory
Jump to: navigation, search



Gender Detection from the Name

A toolkit to measure gender gap in mailing lists, csv files, articles in newspapers, git repositories, ... Giving a new solution to detect gender from first names. So, we have developed the maths related to this task in python code, collected a lot of datasets with free licenses released from statistical institutions and we are doing experiments with machine learning to guess nicknames, new names, diminutives, ...



Verified by

Verified on


Verified by

David Arroyo Menéndez

Verified on

29 November 2021

Leaders and contributors

David Arroyo Menéndez (Davidam)Developer

Resources and communication

AudienceResource typeURI
DevelopersVCS Repository Webviewhttps://github.com/davidam/damegender

Software prerequisites

Required to usepython interpreter
Required to usehttps://pypi.org/project/json2html
Required to usehttps://pypi.org/project/unidecode
Weak prerequisitehttps://pypi.org/project/perceval
Required to usehttps://pypi.org/project/scikit-learn
Required to usehttps://pypi.org/project/matplotlib
Required to usehttps://pypi.org/project/requests/
Required to usehttps://pypi.org/project/Markdown/
Required to usehttps://pypi.org/project/genderize
Required to usehttps://pypi.org/project/newspaper3k
Required to usehttps://pypi.org/project/nltk/
Required to usehttps://pypi.org/project/pandas
Required to usehttps://pypi.org/project/scipy/
Required to usehttps://pypi.org/project/lxml
Required to usehttps://pypi.org/project/numpy/


"GPL-3.0-or-later" is not in the list (ACEL, AFL-3.0, AGPL-1.0, AGPL-1.0-or-later, AGPL-3.0, AGPL-3.0-or-later, AGPL-3.0-or-later-with-exception, AGPL-3.0-with-exception, AGPLv1orlater, AGPLv3, ...) of allowed values for the "License" property.

"All" is not in the list (General, Help, Bug Tracking, Support, Developer) of allowed values for the "Resource audience" property.

"Developers" is not in the list (General, Help, Bug Tracking, Support, Developer) of allowed values for the "Resource audience" property.

Property "Submitted by" (as page type) with input value "{{{Submitted by}}}" contains invalid characters or is incomplete and therefore can cause unexpected results during a query or annotation process. "{{{Submitted date}}}" contains an extrinsic dash or other characters that are invalid for a date interpretation.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.3 or any later version published by the Free Software Foundation; with no Invariant Sections, no Front-Cover Texts, and no Back-Cover Texts. A copy of the license is included in the page “GNU Free Documentation License”.

The copyright and license notices on this page only apply to the text on this page. Any software or copyright-licenses or other similar notices described in this text has its own copyright notice and license, which can usually be found in the distribution or license text itself.