Open source ecosystems need equitable credit across contributions

Casari, Amanda; McLaughlin, Katie; Trujillo, Milo Z.; Young, Jean-Gabriel; Bagrow, James P.; Hébert-Dufresne, Laurent

doi:10.1038/s43588-020-00011-w

Correspondence
Published: 14 January 2021

Nature Computational Science volume 1, page 2 (2021)

To the Editor — Collaborative and creative communities are more equitable when all contributions to a project are acknowledged. Equitable communities are, in turn, more transparent, more accessible to newcomers, and more encouraging of innovation — hence we should foster these communities, starting with proper attribution of credit. However, to date, no standard and comprehensive contribution acknowledgement system exists in open source, not just for software development but for the broader ecosystems of conferences, organization and outreach efforts, and technical knowledge. Furthermore, both closed and open source projects are built on a complex web of open source dependencies, and we lack a nuanced understanding of who creates and maintains these projects¹. As a result, large sums and efforts go to open source software projects without knowing whom the investments support and where they have impact².

Academia faces a similar recognition problem. Attribution is often collapsed to ‘authorship’, yet increases in the size and complexity of scientific teams are colliding with this narrow definition of contribution³. Focusing only on authorship hides much of the work necessary to publish research⁴. Since this hidden work is performed disproportionately by people from underrepresented communities, the full picture of who is doing work is not accurately represented⁵. Fortunately, new models of recognition are gaining widespread adoption. One example is the CRediT framework, a taxonomy created for ‘contributorship, not authorship’, to more fully represent the roles that people play in creating research outputs³. Contributor roles are categorized by tasks and stages of the research process⁶, allowing multiple people to perform the same role or the same person to perform multiple roles. Since its inception, the CRediT taxonomy has been widely adopted (by 33 major publishers so far) in part because it can not only contribute to equity within a project, but also potentially provide the major benefit of standardizing credit across projects and communities⁷.
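The many-to-many relationship between people and roles described above can be illustrated with a small data structure. The following is only a minimal sketch: the role names are a subset of the CRediT taxonomy, while the contributor names and the helper functions `roles_by_person` and `people_by_role` are hypothetical and introduced purely for illustration.

```python
from collections import defaultdict
from enum import Enum


class Role(Enum):
    # A small subset of the CRediT contributor roles.
    CONCEPTUALIZATION = "Conceptualization"
    SOFTWARE = "Software"
    VISUALIZATION = "Visualization"
    WRITING_ORIGINAL_DRAFT = "Writing – original draft"


def roles_by_person(credits):
    """Group (person, role) pairs into a person -> set of roles mapping."""
    grouped = defaultdict(set)
    for person, role in credits:
        grouped[person].add(role)
    return dict(grouped)


def people_by_role(credits):
    """Group (person, role) pairs into a role -> set of people mapping."""
    grouped = defaultdict(set)
    for person, role in credits:
        grouped[role].add(person)
    return dict(grouped)


# Hypothetical contributors, for illustration only.
CREDITS = [
    ("Contributor A", Role.CONCEPTUALIZATION),
    ("Contributor A", Role.SOFTWARE),
    ("Contributor B", Role.SOFTWARE),
    ("Contributor C", Role.WRITING_ORIGINAL_DRAFT),
]
```

Storing credit as a flat list of (person, role) pairs, rather than a single ‘author’ field, is what lets one person hold several roles and several people share one role.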

Open source ecosystems need to follow suit and adopt a standard taxonomy of contributor roles. As in academic research, modern open source is a highly collaborative and complex task environment, where overly broad or poorly defined roles easily obfuscate the work of many². For example, a broadly defined role of ‘code contributor’ fails to distinguish between specific tasks such as adding features, fixing bugs or taking other actions that directly edit the source code. Likewise, other substantial contributions, such as organizing meetings, providing outreach or performing other activities that leave no visible trace within the code, are often neglected. Indeed, some important contributions occur entirely outside of common open source development platforms such as GitHub and often go unrecognized⁸. To succeed, a taxonomy of roles should be simple and comprehensive, use clearly defined non-overlapping categories, represent different types of contributions equally, and avoid favoring specific platforms.

Existing efforts to recognize contributions to open source are laudable, but gaps remain. For instance, many open source projects include ad hoc attribution lists like ‘credit files’, without consistent attribution categories. Some approaches propose taxonomies (for example, All Contributors), but do not include clear guidelines around how to apply the proposed taxonomy to various contributions or may miss entire categories altogether. Any confusion around the interpretation of the taxonomy, where different communities could interpret the categories as they wish, negates the benefit of standardizing credit across projects and communities.

Meanwhile, platform-specific metrics like GitHub’s ‘contributor count’ are some of the most visible contribution indicators but are limited in scope and do not generalize to contributions outside the platform. Data-driven efforts that extract contributions automatically from source code, version control records, or other platform data are further limited to only those activities explicitly recorded within the data (see, for example, octohatrack⁸ and name-your-contributors). Despite their importance for open source, efforts to recognize contributions more broadly have yet to be widely adopted.

What obstacles have prevented the adoption of a broader, standard recognition model? If CRediT can teach us anything, it is that standards should emerge from the community, undergo many iterations and rounds of feedback, and receive buy-in from major relevant institutions and involved parties. The CRediT taxonomy resulted from a long categorization effort and is a prime example of a working contributor taxonomy. Research leveraging CRediT data is only ramping up⁷, yet adoption of the framework continues to rise through community support. A successful taxonomy for open source should develop through a similar community peer review. Just as academic institutions and publishers are embracing the CRediT model, open source contributions need the same attention.

A standard taxonomy of recognized contributions will benefit all levels of open source. Contributors will gain credits beyond the code, providing a clearer signaling of their work for the community. Projects will be able to measure their growth and evaluate their culture. Structural biases will be brought to light, helping to foster more equitable open source communities. Everyone will better understand the interconnected structure of skills, projects and contributions across the broader ecosystems of open source.

References

1. Eghbal, N. Roads and Bridges: The Unseen Labor Behind Our Digital Infrastructure (Ford Foundation, 2016).
2. Eghbal, N. Working in Public: The Making and Maintenance of Open Source Software (Stripe Press, 2019).
3. Brand, A., Allen, L., Altman, M., Hlava, M. & Scott, J. Learned Publ. 28, 151–155 (2015).
4. Holcombe, A. O. Publications 7, 48 (2019).
5. Larivière, V. et al. Soc. Stud. Sci. 46, 417–435 (2016).
6. Holcombe, A. O. Nature 571, 147–148 (2019).
7. Allen, L., O’Connell, A. & Kiermer, V. Learned Publ. 32, 71–74 (2019).
8. McLaughlin, K. Model View Culture https://modelviewculture.com/pieces/acknowledging-non-coding-contributions (2016).

Author information

Authors and Affiliations

Open Source Programs Office, Google, Kirkland, WA, USA
Amanda Casari
Open Source Programs Office, Google, Sydney, New South Wales, Australia
Katie McLaughlin
Vermont Complex Systems Center, University of Vermont, Burlington, VT, USA
Milo Z. Trujillo, Jean-Gabriel Young, James P. Bagrow & Laurent Hébert-Dufresne
Department of Computer Science, University of Vermont, Burlington, VT, USA
Jean-Gabriel Young & Laurent Hébert-Dufresne
Department of Mathematics and Statistics, University of Vermont, Burlington, VT, USA
James P. Bagrow

Corresponding author

Correspondence to Laurent Hébert-Dufresne.

About this article

Cite this article

Casari, A., McLaughlin, K., Trujillo, M.Z. et al. Open source ecosystems need equitable credit across contributions. Nat Comput Sci 1, 2 (2021). https://doi.org/10.1038/s43588-020-00011-w

Issue Date: January 2021
