A curated list of open technology projects to sustain a stable climate, energy supply, biodiversity and natural resources.

EML

An open source, community oriented project dedicated to providing a high-quality metadata specification for describing data relevant to diverse disciplines that involve observational research like ecology, earth, and environmental science.
https://github.com/nceas/eml

Category: Sustainable Development
Sub Category: Taxonomy and Ontology

Keywords

ecological-metadata-language eml metadata

Keywords from Contributors

ecology ecoinformatics archiving measur transforms semantic-annotations science-metadata metadata-editor metacat knb

Last synced: about 8 hours ago

Repository metadata

Ecological Metadata Language (EML)

README.md

EML: Ecological Metadata Language

Cite as:

Matthew B. Jones, Margaret O'Brien, Bryce Mecum, Carl Boettiger, Mark Schildhauer, Mitchell Maier, Timothy Whiteaker, Stevan Earl, Steven Chong. 2019. Ecological Metadata Language version 2.2.0. KNB Data Repository. doi:10.5063/F11834T2

The Ecological Metadata Language (EML) defines a comprehensive vocabulary and a readable XML markup syntax for documenting research data. It is in widespread use in the earth and environmental sciences, and increasingly in other research disciplines as well. EML is a community-maintained specification, and evolves to meet the data documentation needs of researchers who want to openly document, preserve, and share data and outputs. EML includes modules for identifying and citing data packages, for describing the spatial, temporal, taxonomic, and thematic extent of data, for describing research methods and protocols, for describing the structure and content of data within sometimes complex packages of data, and for precisely annotating data with semantic vocabularies. EML includes metadata fields to fully detail data papers that are published in journals specializing in scientific data sharing and preservation.

Getting Started

Composing an EML document can be done in a simple text editor (e.g., Atom),
via scripting languages like R and Python (e.g., the R eml package),
in general-purpose XML authoring tools (e.g., Oxygen), and in custom web-based metadata editing tools
(e.g., MetacatUI). While these tools expand and shift over time, the core metadata language
has been consistent and backwards compatible, allowing for decades of seamless
interoperability of data sets in many repositories.

EML documents can be started simply, and then additional detail added over time.
On the simple end, an EML document that provides basic bibliographic information
would be sufficient for citing a data set and for simple discovery in catalogs:

<?xml version="1.0"?>
<eml:eml
    packageId="doi:10.xxxx/eml.1.1" system="https://doi.org"
    xmlns:eml="https://eml.ecoinformatics.org/eml-2.2.0"
    xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
    xmlns:stmml="http://www.xml-cml.org/schema/stmml-1.1"
    xsi:schemaLocation="https://eml.ecoinformatics.org/eml-2.2.0 xsd/eml.xsd">
    
    <dataset>
        <title>Primary production of algal species from Southeast Alaska, 1990-2002</title>
        <creator id="https://orcid.org/0000-0003-0077-4738">
            <individualName>
                <givenName>Matthew</givenName>
                <givenName>B.</givenName>
                <surName>Jones</surName>
            </individualName>
            <electronicMailAddress>[email protected]</electronicMailAddress>
            <userId directory="https://orcid.org">https://orcid.org/0000-0003-0077-4738</userId>
        </creator>
        <keywordSet>
            <keyword>biomass</keyword>
            <keyword>productivity</keyword>
        </keywordSet>
        <contact>
            <references>https://orcid.org/0000-0003-0077-4738</references>
        </contact>
    </dataset>
</eml:eml>

This document can then be supplemented with additional metadata describing research
projects and methods, structural information about the data, and much more.
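
As a rough illustration of the scripting route mentioned above, the minimal document can also be generated programmatically. The sketch below uses only Python's standard library `xml.etree.ElementTree`; the function name `build_minimal_eml` and its parameters are hypothetical and not part of any EML tooling. It leaves out the e-mail address and the unused `stmml` namespace declaration, and it does not validate the result against the EML schema.

```python
import xml.etree.ElementTree as ET

EML_NS = "https://eml.ecoinformatics.org/eml-2.2.0"
XSI_NS = "http://www.w3.org/2001/XMLSchema-instance"


def build_minimal_eml(package_id, title, given_names, surname, orcid, keywords):
    """Return a minimal EML 2.2.0 document as a string (hypothetical helper)."""
    # Keep the conventional prefixes instead of auto-generated ns0/ns1.
    ET.register_namespace("eml", EML_NS)
    ET.register_namespace("xsi", XSI_NS)

    # Only the root element is namespace-qualified; children are unqualified,
    # matching the hand-written example above.
    root = ET.Element(
        f"{{{EML_NS}}}eml",
        {
            "packageId": package_id,
            "system": "https://doi.org",
            f"{{{XSI_NS}}}schemaLocation": f"{EML_NS} xsd/eml.xsd",
        },
    )
    dataset = ET.SubElement(root, "dataset")
    ET.SubElement(dataset, "title").text = title

    # The creator carries an id so that other elements can refer back to it.
    creator = ET.SubElement(dataset, "creator", {"id": orcid})
    name = ET.SubElement(creator, "individualName")
    for given in given_names:
        ET.SubElement(name, "givenName").text = given
    ET.SubElement(name, "surName").text = surname
    user_id = ET.SubElement(creator, "userId", {"directory": "https://orcid.org"})
    user_id.text = orcid

    keyword_set = ET.SubElement(dataset, "keywordSet")
    for keyword in keywords:
        ET.SubElement(keyword_set, "keyword").text = keyword

    # The contact reuses the creator by id rather than repeating the details.
    contact = ET.SubElement(dataset, "contact")
    ET.SubElement(contact, "references").text = orcid

    ET.indent(root, space="    ")  # requires Python 3.9 or later
    return '<?xml version="1.0"?>\n' + ET.tostring(root, encoding="unicode")


doc = build_minimal_eml(
    package_id="doi:10.xxxx/eml.1.1",
    title="Primary production of algal species from Southeast Alaska, 1990-2002",
    given_names=["Matthew", "B."],
    surname="Jones",
    orcid="https://orcid.org/0000-0003-0077-4738",
    keywords=["biomass", "productivity"],
)
print(doc)
```

Building the tree element by element keeps the output well-formed, and the `contact` entry shows how `references` points at the `id` of the `creator` instead of duplicating it.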

About the EML Project

The EML project is an open source, community oriented project dedicated to providing a high-quality metadata specification for describing data relevant to diverse disciplines that involve observational research like ecology, earth, and environmental science. The specification is maintained by voluntary project members who donate their time and experience in order to advance information management for ecology. Project decisions are made by consensus of the current maintainers on the project.

We welcome contributions to this work in any form. Individuals who invest substantial amounts of time and make valuable contributions to the development and maintenance of EML (in the opinion of current project maintainers) will be invited to become EML project maintainers. Contributions can take many forms, including the development of the EML schemas, writing documentation, and helping with maintenance, among others.

Contributing

Developers may be interested in browsing the source code repository that we use in developing EML. Starting with EML 2.1.1, the master branch reflects the current stable release of EML. Development occurs in development branches (e.g., BRANCH_EML_2_2), which allow experimental additions as they are proposed by the community. A development branch always contains the most recent development version of EML, and therefore may be in flux or broken, and it is unlikely to contain the same files as the current release. Development branches are virtually guaranteed to change before they are released, so they should not be used in production environments. Use them for testing at your own risk.

Write access to this repository is reserved for current project maintainers. Non-project members can contribute feedback, revisions, fixes, code, or any other contribution through pull requests on GitHub. Contributions can take many forms, including the development of the EML schema, writing documentation, and helping with maintenance, among others.

Discussion of issues occurs on the Slack channel or through the EML issue tracking system. The preferred way to report problems with EML or to request features is the issue tracking system.

History

EML was originally developed by Matthew Jones at NCEAS based on a report by the ESA Committee on the Future of Long-Term Ecological Data (FLED) and on a related paper on ecological metadata by Michener et al. (see Michener, William K., et al., 1997. Ecological Applications, "Nongeospatial metadata for the ecological sciences" Vol 7(1). pp. 330-342.). Version 1.0 was released at NCEAS in 1997 and used internally, with further internal releases of versions 1.2, 1.3, and 1.4, all of which followed the FLED recommendations closely in their content implementation. Version 2 became a community-maintained, open specification. Substantial modifications for EML 2.x came from experience using the earlier specification at NCEAS and from feedback from the ecological community, particularly information managers from the Long Term Ecological Research Network. Versions 2.1 and 2.2 introduced significant new features such as internationalization, semantic annotations, and support for data papers.

Older versions (deprecated)

The following versions are still available for reference purposes, although they have been superseded by the current version (2.2.0). Please make every effort to use the current version.

Copyright and License

Copyright: 1997-2019 Regents of the University of California

This program is free software; you can redistribute it and/or modify
it under the terms of the GNU General Public License as published by
the Free Software Foundation; either version 2 of the License, or
(at your option) any later version.

This program is distributed in the hope that it will be useful,
but WITHOUT ANY WARRANTY; without even the implied warranty of
MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the
GNU General Public License for more details.

You should have received a copy of the GNU General Public License
along with this program; if not, write to the Free Software
Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA

Funding and Acknowledgements

EML was developed and is maintained with support from the National Center for
Ecological Analysis and Synthesis (NCEAS), a Center funded by the University of
California Santa Barbara and the state of California.

This material is based upon work supported by the US National Science Foundation
under Grant Nos. DEB-9980154, DBI-9904777, 0225676, DEB-0072909, DBI-9983132,
and DEB-9634135. Any opinions, findings and conclusions or recommendations
expressed in this material are those of the author(s) and do not necessarily
reflect the views of the National Science Foundation (NSF).

This product includes software developed by the Apache Software
Foundation (http://www.apache.org/). See the LICENSE file in lib/apache
for details.

The source code, object code, and documentation in the com.oreilly.servlet
package is copyright and owned by Jason Hunter. See the cos-license.html file
for details of the license. Licensor retains title to and ownership of the
Software and all enhancements, modifications, and updates to the Software.

This product includes software developed by the JDOM Project
(http://www.jdom.org/). See jdom-LICENSE.txt for details.


Committers metadata

Last synced: 7 days ago

Total Commits: 2,616
Total Committers: 33
Avg Commits per committer: 79.273
Development Distribution Score (DDS): 0.854

Commits in past year: 0
Committers in past year: 0
Avg Commits per committer in past year: 0.0
Development Distribution Score (DDS) in past year: 0.0

Name Email Commits
Chad Berkley b****y@n****u 382
Jing Tao t****o@n****u 335
Matt Jones j****s@n****u 319
Duane Costa d****a@l****u 243
Matt Jones g****e@m****g 234
Margaret O'Brien m****n@u****u 232
Ben Leinfelder l****r@n****u 215
Chris Jones c****s@n****u 149
Steven Chong s****g@g****m 127
Dan Higgins h****s@n****u 84
Peter McCartney p****y@l****u 56
Mark s****d@n****u 50
Matthew Brooke b****e@n****u 47
David Blankman b****n@e****g 34
John Harris h****s@n****u 22
Bryce Mecum p****h@g****m 19
Jivka Bojilova b****a@n****u 14
maier-m m****r@u****u 7
Rudolf Nottrott n****t@n****u 5
StevanEarl s****l@a****u 5
Tim Whiteaker w****r@u****u 5
dependabot[bot] 4****] 5
Lauren Palmer w****r@n****u 4
Saurabh Garg g****g@n****u 4
James Brunt j****t@l****u 3
Scott Chapal c****l@e****g 3
mobb m****b@m****u 3
Christopher Jones c****j@c****o 2
Corinna Gries c****s@w****u 2
Shawn Walbridge w****e@n****u 2
and 3 more...
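
The summary figures in this section can be reproduced from the commit counts. The sketch below assumes that the Development Distribution Score is one minus the top committer's share of all commits. That definition is not stated on this page, so treat it as an assumption, although it does reproduce the reported value. The function names are illustrative only.

```python
def average_commits(total_commits: int, total_committers: int) -> float:
    """Mean number of commits per committer (0.0 when there are no committers)."""
    if total_committers == 0:
        return 0.0
    return total_commits / total_committers


def development_distribution_score(total_commits: int, top_committer_commits: int) -> float:
    """Assumed definition: 1 - (top committer's commits / total commits)."""
    if total_commits == 0:
        return 0.0
    return 1 - top_committer_commits / total_commits


# Figures taken from the listing above: 2,616 commits, 33 committers,
# and 382 commits for the top committer row.
print(round(average_commits(2616, 33), 3))                 # 79.273
print(round(development_distribution_score(2616, 382), 3))  # 0.854
```

Under this definition, a score near 0 means one person wrote almost every commit, while a score near 1 means the work is spread across many committers. With no commits in the past year, both past-year figures fall back to 0.0.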


Issue and Pull Request metadata

Last synced: about 8 hours ago

Total issues: 72
Total pull requests: 32
Average time to close issues: 9 months
Average time to close pull requests: 2 months
Total issue authors: 18
Total pull request authors: 8
Average comments per issue: 4.35
Average comments per pull request: 0.97
Merged pull requests: 21
Bot issues: 0
Bot pull requests: 22

Past year issues: 2
Past year pull requests: 0
Past year average time to close issues: 2 minutes
Past year average time to close pull requests: N/A
Past year issue authors: 2
Past year pull request authors: 0
Past year average comments per issue: 1.0
Past year average comments per pull request: 0
Past year merged pull requests: 0
Past year bot issues: 0
Past year bot pull requests: 0

More stats: https://issues.ecosyste.ms/repositories/lookup?url=https://github.com/nceas/eml

Top Issue Authors

  • mbjones (33)
  • mobb (10)
  • amoeba (8)
  • twhiteaker (3)
  • srearl (3)
  • taojing2002 (2)
  • csjx (2)
  • stevenchong (1)
  • jeanetteclark (1)
  • zer0mode (1)
  • atn38 (1)
  • RobLBaker (1)
  • kmexter (1)
  • mpsaloha (1)
  • antibozo (1)

Top Pull Request Authors

  • dependabot[bot] (22)
  • srearl (4)
  • yvanlebras (1)
  • twhiteaker (1)
  • kf8a (1)
  • amoeba (1)
  • pieterprovoost (1)
  • laurenwalker (1)

Top Issue Labels

  • documentation (28)
  • bug (19)
  • enhancement (18)
  • eml-parser (4)
  • breaking-change (3)
  • utilities (2)
  • question (1)
  • help wanted (1)

Top Pull Request Labels

  • dependencies (22)
  • documentation (1)

Score: 8.27563105457801