Recent Releases of GNparser

GNparser - v1.14.2

CHANGELOG

[v1.14.2] - 2026-01-14 Wed

  • Fix: typo in REST API (change flatOutput to flattenOutput).
  • Improve hovering info for checkboxes in web ui.

Biosphere - Biodiversity Data Cleaning and Standardization - Go
Published by dimus 5 months ago

GNparser - v1.14.1

CHANGELOG

[v1.14.1] - 2026-01-14 Wed

  • Add[#301]: rename 'initials' flag to 'compact-authors'.
    WARNING this creates backward incompatibility (only for v1.14.0).
  • Add: normalize 'flatten' flag.
  • Add: remove useless web-log flag.

Biosphere - Biodiversity Data Cleaning and Standardization - Go
Published by dimus 5 months ago

GNparser - v1.14.0

CHANGLOG

  • Add[#292]: add flattened option for JSON output, add more (optional) details
    to CSV/TSV output.
  • Fix[#227]: Cultivars words show correctly in "words" section.
    (PR #300 by @tobymarsden)

Biosphere - Biodiversity Data Cleaning and Standardization - Go
Published by dimus 5 months ago

GNparser - v1.13.0

CHANGELOG

  • Add[#298]: update CONTRIBUTING.md to reflect changes in gnparser's code.
  • Add[#297]: rename NomCode field to NomCodeSetting to clarify its intention.
    WARNING this creates some backward incompatibility.
  • Add[#294]: option to remove spaces between authors' names initials (PR
    #295 by @juillerat)
  • Fix[#296]: old virus names are marked as 'Virus' when nomenclatural code
    is set to 'icvcn'.

Biosphere - Biodiversity Data Cleaning and Standardization - Go
Published by dimus 5 months ago

GNparser - v1.12.1

CHANGELOG

[v1.12.1] - 2025-12-19 Fri

  • Add: virus code for web UI.
  • Add: docs about viral code to README.
  • Add: update flake data.
  • Add: update nix build.
  • Add: update TOC for tests.

[v1.12.0] - 2025-12-19 Fri

  • Add[#290]: support modern virus nomenclature (ICTV/ICVCN)

Biosphere - Biodiversity Data Cleaning and Standardization - Go
Published by dimus 5 months ago

GNparser - v1.11.10

CHANGELOG

[v1.11.10] - 2025-12-16 Tue

  • Fix [#291]: prefix los of an author is not parsed as part of canonical form.

Biosphere - Biodiversity Data Cleaning and Standardization - Go
Published by dimus 6 months ago

GNparser - v1.11.8

CHANGELOG

[v1.11.8] - 2025-07-21 Mon

  • Add [#288] - add more authors' rules.

[v1.11.7] - 2025-04-30 Wed

  • Add: move nomenclatural code functionality to gnlib.

Biosphere - Biodiversity Data Cleaning and Standardization - Go
Published by dimus 10 months ago

GNparser - v1.11.6

CHANGELOG

[v1.11.6] - 2025-04-01 Tue

  • Add [#287]: parse names with genus and species hybrid.

Biosphere - Biodiversity Data Cleaning and Standardization - Go
Published by dimus about 1 year ago

GNparser - v1.11.5

[v1.11.5] - 2025-03-30 Sun

  • Fix [#286]: parse names with named hybrid and comparison.

CHANGELOG

[v1.11.4] - 2025-03-10 Mon

  • Add: improve flatten output.

[v1.11.3] - 2025-03-07 Fri

  • Add [#283]: parsed.Flatten function to generate 'flat' version of parsed data.
  • Add: update upload-artifacts in Actions to v4.
  • Add [#279]: add unr. rank.

Biosphere - Biodiversity Data Cleaning and Standardization - Go
Published by dimus about 1 year ago

GNparser - v1.11.2

CHANGELOG

[v1.11.2] - 2025-02-21 Fri

  • Add: update modules.
  • Add [#282] remove varning for botanical names with ex authors.
  • Add [#280] prol. botanical rank.

Full Changelog: https://github.com/gnames/gnparser/compare/v1.11.1...v1.11.2

Biosphere - Biodiversity Data Cleaning and Standardization - Go
Published by dimus over 1 year ago

GNparser - v1.11.1

CHANGELOG

[v1.11.1] - 2024-11-11 Mon

  • Add: minor improvements in web-interface.
  • Add: make web-interface remember nomenclatural code picked in the previous
    GET query.

Biosphere - Biodiversity Data Cleaning and Standardization - Go
Published by dimus over 1 year ago

GNparser - v1.11.0

CHANGELOG

[v1.11.0] - 2024-11-11 Mon

  • Add [#265]: add optional nomenclatural code option to parse names with
    an ambiguity according to a particular code.

Biosphere - Biodiversity Data Cleaning and Standardization - Go
Published by dimus over 1 year ago

GNparser - v1.10.4

CHANGELOG

[v1.10.4] - 2024-11-07 Thu

  • Add [#269]: switch to slog from zerologs.
  • Fix [#271]: distinguish between ex and in.
  • Fix [#270]: missing verbatim authorship for names that look similar to
    combination uninomial in ICZN.
  • Fix [#268]: if botanical author looks like a combination uninomial,
    make sure its characters are not normalized.

Biosphere - Biodiversity Data Cleaning and Standardization - Go
Published by dimus over 1 year ago

GNparser - v1.10.3

CHANGELOG

[v1.10.3] - 2024-10-11 Fri

  • Fix [#266]: remove author from species detail for named hybrids.

[v1.10.2] - 2024-07-30 Tue

  • Add [#264]: more exceptions.

Biosphere - Biodiversity Data Cleaning and Standardization - Go
Published by dimus over 1 year ago

GNparser - v1.10.1

Changelog

[v1.10.1] - 2024-06-05 Wed

  • Add [#263]: add more exceptions with speciific epithets like
    "complex", "do", "spec".

Biosphere - Biodiversity Data Cleaning and Standardization - Go
Published by dimus almost 2 years ago

GNparser - v1.10.0

Changelog

[v1.10.0] - 2024-06-04 Tue

  • Add [#260]: add candidatus field for parsed data.
  • Add [#232]: parse names like subgen. Psammophrynopsis Koch, 1953.

Biosphere - Biodiversity Data Cleaning and Standardization - Go
Published by dimus almost 2 years ago

GNparser - v1.9.1

CHANGELOG

[v1.9.1] - 2023-10-13 Fri

  • Add: update modules.
  • Fix [#259]: allow diacritics in any UTF-8 normalization form.
  • Fix [#258]: allow authors with 2 dashes in the name.
  • Fix [#256]: fix normalization where a misplacced year changes
    the year of original authors.

Biosphere - Biodiversity Data Cleaning and Standardization - Go
Published by dimus about 2 years ago

GNparser - v1.9.0

CHANGELOG

[v1.9.0] - 2023-10-12 Thu

  • Add: restore backward compatibility by creating a new flag
    --species-group-cut.

Biosphere - Biodiversity Data Cleaning and Standardization - Go
Published by dimus over 2 years ago

GNparser - v1.8.0

CHANGELOG

[v1.8.0] - 2023-10-11 Wed

  • Add [#255]: normalize stemmed canonical of Aus bus bus to Aus bus.
    WARNING this creates some backward incompatibility.
  • Add: sorting uses slices package.

Biosphere - Biodiversity Data Cleaning and Standardization - Go
Published by dimus over 2 years ago

GNparser - v1.7.5

CHANGELOG

[v1.7.5] - 2023-09-26 Tue

  • Add: CSV and TSV files provide now verbatim authorship instead of normalized
    one.
  • Add: a few more "termination words"
  • Fix [#254]: treat fa as forma.
  • Fix [#253]: process dem as an author word for Von dem Bush and like.
  • Fix [#251]: do not process y as and for Rafael Arango y Molina.
  • Fix [#249]: allow cf at the end of the strings, cf for infraspecies.
  • Fix [#248]: do not escape double quotes for TSV output.
  • Fix [#246]: ignore ms at the end of the strings.

Biosphere - Biodiversity Data Cleaning and Standardization - Go
Published by dimus over 2 years ago

GNparser - v1.7.4

CHANGELOG

[v1.7.4] - 2023-08-22 Tue

  • Fix [#243]: parse correctly Nassa pagoda var. acuta P. P. Carpenter, 1857.

Biosphere - Biodiversity Data Cleaning and Standardization - Go
Published by dimus almost 3 years ago

GNparser - v1.7.3

CHANGELOG

[v1.7.3] - 2023-06-17 Sat

  • Add [#241]: allow comma before ex authors.

Biosphere - Biodiversity Data Cleaning and Standardization - Go
Published by dimus almost 3 years ago

GNparser - v1.7.2

CHANGELOG

[v1.7.2] - 2023-03-09 Thu

  • Add [#240]: add tr. subtr. as ranks for combo-uninomials.

Biosphere - Biodiversity Data Cleaning and Standardization - Go
Published by dimus about 3 years ago

GNparser - v1.7.1

CHANGELOG

[v1.7.1] - 2023-03-07 Tue

  • Add: upgrade all modules.

Biosphere - Biodiversity Data Cleaning and Standardization - Go
Published by dimus about 3 years ago

GNparser - v1.7.0

CHANGELOG

[v1.7.0] - 2023-03-07 Tue

  • Add [#238]: stem takes in account -ii suffix, macdonaldii -> macdonald.

Biosphere - Biodiversity Data Cleaning and Standardization - Go
Published by dimus about 3 years ago

GNparser - v1.6.9

CHANGELOG

[v1.6.9] - 2022-11-10 Thu

  • Add [#237]: detect and normalize non-breaking hyphens. In case if other
    non-typical hythens will appear, they will be dealt the same way.

[v1.6.8] - 2022-10-01 Sat

  • Add: update all modules.

Biosphere - Biodiversity Data Cleaning and Standardization - Go
Published by dimus over 3 years ago

GNparser - v1.6.7

[v1.6.7] - 2022-08-22 Mon

  • Add [#231]: more edge cases.
  • Add [#230]: Take into account mihi annotation.

Biosphere - Biodiversity Data Cleaning and Standardization - Go
Published by dimus almost 4 years ago

GNparser - v1.6.5

CHANGELOG

[v1.6.5] - 2022-03-21 Mon

  • Add [#223]: Use PEG parser for preprocess instead of RegEx.
    This approach gives 15-17% speed increase.

Biosphere - Biodiversity Data Cleaning and Standardization - Go
Published by dimus about 4 years ago

GNparser - v1.6.4

CHANGELOG

[v1.6.4] - 2022-03-19 Sat

  • Add [#224]: Parse correctly italian authors with degli.

Biosphere - Biodiversity Data Cleaning and Standardization - Go
Published by dimus about 4 years ago

GNparser - v1.6.3

CHANGELOG

[v1.6.3] - 2022-02-08

  • Add [#222]: Improve logs for NSQ, switch to zerologs library.

Biosphere - Biodiversity Data Cleaning and Standardization - Go
Published by dimus over 4 years ago

GNparser - v1.6.2

CHANGELOG

[v1.6.2]

  • Fix [#221]: No parsing for names with cyanobacterium.
  • Fix [#220]: Crenarchaeote enrichment culture clone should stop parsing
    at enrichment.
  • Fix [#219]: filter out complex word during preprocessing for names like
    Aegla uruguayana complex.

Biosphere - Biodiversity Data Cleaning and Standardization - Go
Published by dimus over 4 years ago

GNparser - v1.6.1

CHANGELOG

[v1.6.1]

  • Add: use NSQ logger from sfgrp/lognsq

Biosphere - Biodiversity Data Cleaning and Standardization - Go
Published by dimus over 4 years ago

GNparser - v1.6.0

CHANGELOG

  • Add [#218]: enable/disable logs for web-services, allow logs aggregation with NSQd.

Biosphere - Biodiversity Data Cleaning and Standardization - Go
Published by dimus over 4 years ago

GNparser - v1.5.6

CHANGELOG

  • Add [#212]: Set year from 'ex' authorship as a year of a name.
    Add 'ex' authors to list of all authors.

  • Add [#211]: PR [#214] by @tobymarsden, general approach for non-...
    specific epithets.

  • Add [#208]: PR [#210] by @tobymarsden, option to preserve diaereses.

  • Fix [#213]: Stop generating space between Mc, Mac and the rest of an
    an author name.

Biosphere - Biodiversity Data Cleaning and Standardization - Go
Published by dimus over 4 years ago

GNparser - v1.5.5

CHANGELOG

  • Add [#207]: PR [#209] by @tobymarsden, fix parsing of names with nudum specific epithet.

Biosphere - Biodiversity Data Cleaning and Standardization - Go
Published by dimus over 4 years ago

GNparser - v1.5.4

CHANGELOG

  • Add: different approach for normalize-by-type for words.
  • Add [#205]: allow genera starting with De-, Eu-, Le-, Ne- (by @tobymarsden).
  • Add [#203]: allow up to 2 dashes in genera (by @tobymarsden).

Biosphere - Biodiversity Data Cleaning and Standardization - Go
Published by dimus over 4 years ago

GNparser - v1.5.2

CHANGELOG

  • Add [#200]: support for 'div.' rank in uninomial combinations.
  • Add [#199]: fixes for several names that were not parsed correctly.
  • Add [#198]: parse "Solanum tuberosum wila-k`oyu".
  • Add [#97]: do not parse "Cyanophage".
  • Add [#85]: parse names with a dagger character.
  • Add [#84]: parse "Muscicapa randi Amadon & duPont, 1970".
  • Add [#83]: parse authors like 'Laverde-R.'.

Biosphere - Biodiversity Data Cleaning and Standardization - Go
Published by dimus over 4 years ago

GNparser - v1.5.1

CHANGELOG

  • Add [#191]: support for ambiguous specific epithets

Biosphere - Biodiversity Data Cleaning and Standardization - Go
Published by dimus over 4 years ago

GNparser - v1.5.0

CHANGELOG

  • Add [#194]: support for cultivars' graft-chymeras (courtesy of @tobymarsden)

Biosphere - Biodiversity Data Cleaning and Standardization - Go
Published by dimus over 4 years ago

GNparser - v1.4.2

CHANGELOG

[v1.4.2]

  • Add [#196]: parse authors with prefix 'ver'

[v1.4.1]

  • Fix [#195]: parse multinomials where authorshp is not separated by space.

Biosphere - Biodiversity Data Cleaning and Standardization - Go
Published by dimus over 4 years ago

GNparser - v1.4.0

CHANGELOG

  • Add [#193]: add TSV format for output.
  • Add [#190]: support prefixes do and de los for authors.
  • Add [#187]: support ter suffix for authors.
  • Add [#186]: support non-ASCII apostrophe in authors.

Biosphere - Biodiversity Data Cleaning and Standardization - Go
Published by dimus over 4 years ago

GNparser - v1.3.3

CHANGELOG

  • Add [#176]: refactoring of hybrid sign treatment (use PEG instead of
    RegEx for normalizing x, X, and ×.
  • Add [#183]: stop parsing after nec, non, fide, vide, treat
    ms in as in or ex for exAuthors.
  • Add [#182]: support for authors with prefixes ten, delle, dos.

Biosphere - Biodiversity Data Cleaning and Standardization - Go
Published by dimus almost 5 years ago

GNparser - v1.3.2

CHANGELOG

  • Add [#182]: support Do, Oo, Nu 2-letter genera.
  • Add [#53]: exceptions to annotations (Bottaria nudum for example).

Biosphere - Biodiversity Data Cleaning and Standardization - Go
Published by dimus almost 5 years ago

GNparser - v1.3.1

CHANGELOG

  • Add [#180]: Zenodo DOI.

Biosphere - Biodiversity Data Cleaning and Standardization - Go
Published by dimus almost 5 years ago

GNparser - Nightly build

GNparser development build

version: nightly-1-ge635f91

build:   2021-07-17_14:30:07UTC

Install gnparser

Linux and Mac (x64)

  1. Download gnparser tar file
  2. Run tar xvf gnparser-linux.tar.gz
  3. Copy gnparser to your path cp gnparser /usr/local/bin

Windows (x64)

  1. Download gnparser zip file
  2. Unzip it
  3. Copy gnparser.exe somewhere in your path (see README for details)

Biosphere - Biodiversity Data Cleaning and Standardization - Go
Published by github-actions[bot] almost 5 years ago

GNparser - v1.3.0

CHANGELOG

  • Add [#179]: add cultivars info to README.

  • Add [#178]: parse cultivars via REST API.

  • Add [#177]: parse botanical cultivars via web.

  • Add [#173]: cultivars parsing @tobymarsden.

  • Add [#172]: authors initials with a dash like "B.-E.van Wyk".

  • Add: tests for cultivars (Toby Marsden)

  • Fix [#174]: Hybrid character is missed or wrong in details'
    Words section.

Biosphere - Biodiversity Data Cleaning and Standardization - Go
Published by dimus almost 5 years ago

GNparser - v1.2.0

CHANGELOG

  • Add [#169]: option to capitalize first letter of name-strings.
  • Add [#166]: support 'fm.' as 'forma'.

Biosphere - Biodiversity Data Cleaning and Standardization - Go
Published by dimus about 5 years ago

GNparser - v1.1.0

CHANGELOG

  • Add [#163]: support bacterial Candidatus names.
  • Add [#162]: show PEG AST tree for debugging.
  • Add [#161]: add automatic tools dependency.
  • Add [#160]: use embed feature of Go v1.16.

Biosphere - Biodiversity Data Cleaning and Standardization - Go
Published by dimus about 5 years ago

GNparser -

CHANGELOG

  • Add: limit nightly builds to master only.
  • Fix [#159]: missing '/' in POST route .

Biosphere - Biodiversity Data Cleaning and Standardization - Go
Published by dimus over 5 years ago

GNparser - v1.0.12

CHANGELOG

  • Add [#154]: parse names with ambiguous f. as forma if there
    is a space between authr and f.. If there is
    no space, parse as filius. Give ambiguity
    warning in both cases.
  • Add: PHP example from @barotto about using pipes with gnparser.

Biosphere - Biodiversity Data Cleaning and Standardization - Go
Published by dimus over 5 years ago

GNparser - v1.0.11

CHANGELOG

  • Fix [#153]: flags csv=false and with_details=false trigger opposite
    behavior.

Biosphere - Biodiversity Data Cleaning and Standardization - Go
Published by dimus over 5 years ago

GNparser -

CHANGELOG

  • Add [#152]: change auto-prereleases from nightly to on master submit.
  • Add [#151]: do not parse names with (endo|ecto)?symbiont.
  • Add [#150]: ignore serovar/serotype in bacerital names.
  • Add [#149]: support abbreviated subgenus (Aus (B.) cus).

Biosphere - Biodiversity Data Cleaning and Standardization - Go
Published by dimus over 5 years ago

GNparser - v1.0.9

CHANGELOG

  • Add [#146]: unordered flag.
  • Add [#145]: better CI/build actions, add nightly binaries.
  • Fix [#144]: remove configuration file as it creates more problems than solves.

Biosphere - Biodiversity Data Cleaning and Standardization - Go
Published by dimus over 5 years ago

GNparser - v1.0.8

CHANGELOG

  • Add: remove config message for CLI app.
  • Add: ldflags -s -w to decrease binary size.
  • Fix: header does not show in CSV format for stream.

Biosphere - Biodiversity Data Cleaning and Standardization - Go
Published by dimus over 5 years ago

GNparser - v1.0.7

CHANGELOG

  • Add [#143]: quiet flag to suppress showing progress output.

  • Fix [#142]: stream waits until certain names number is equal the batch size.

  • Fix [#141]: config file is not created.

Biosphere - Biodiversity Data Cleaning and Standardization - Go
Published by dimus over 5 years ago

GNparser - v1.0.6

CHANGELOG

  • Add: update version handling, readme.
  • Add: remove gnlib package.
  • Add [#140]: remove config package.
  • Add: cleanup constructor methods names.
  • Add [#139]: make package names less abstract.
  • Fix [#137]: add correct VerbatimID for HTML-containing names.
  • Add [#136]: Man page
  • Add [#100]: Switch continuous integration to use GitHub Actions.
  • Add [#129]: Make c-binding usable for biodiversity parser.
  • Fix [#135]: Changes: SubGenus->Subgenus, InfraSpecies->Infraspecies

Biosphere - Biodiversity Data Cleaning and Standardization - Go
Published by dimus over 5 years ago

GNparser - v1.0.0

CHANGELOG

  • Add [#127]: Update documentation to v1.0.0.
  • Add [#122]: Implement parsing as a stream in addition to batch parsing.
  • Add [#126]: Update c-binging to v1.0.0.
  • Add [#131]: Add parameters "with_details" and "csv" to REST API.
  • Add [#134]: Transofrm "positions" section to "words" section.
  • Add [#128]: Add more examples to OpenAPI specification.
  • Add [#125]: Describe changes from v0.x to 1.x.
  • Add [#132]: Add context.Context to control lifespan of go routines.
  • Add [#115]: Migrate tests from ginkgo to plain tests.
  • Add [#109]: Move web packag to io.
  • Add [#124]: Document warnings for each quality category.
  • Add [#121]: Convert package parser to use interfaces.
  • Add [#120]: CLI app for newly created functionality.
  • Add [#119]: Formatted output for output.Parsed.
  • Add [#117]: Convert failed parsing results to output.Parsed.
  • Add [#114]: Convert parsing result to output.Parsed.
  • Add [#118]: Add Verbatim and Year fields to the root of Authorship.
  • Add [#107]: Move grammar package to entity and rename to parser.
  • Add [#110]: Move stemmer to entity.
  • Add [#113]: Move str package to entity.
  • Add [#112]: Move preprocess package to entity.
  • Add [#105]: Move fs package to io.
  • Add [#111]: Move dict package to io.
  • Add [#106]: Describe main use-case via interface.
  • Add [#104]: Add configuration package.
  • Add [#103]: Create an output.Parsed object that can be used in Go and as JSON.
  • Add [#101]: Start using gnlib where it makes sense.
  • Add [#99]: Move code to GitHub and change links accordingly.
  • Add [#95]: Remove dependency on gRPC and protobuf.

Biosphere - Biodiversity Data Cleaning and Standardization - Go
Published by dimus over 5 years ago

GNparser -

CHANGELOG

  • Add [#96]: Do not parse names starting with "Candidatus".
  • Add [#93]: Parse 'y' (Spanish '&') as an author separator.

Biosphere - Biodiversity Data Cleaning and Standardization - Go
Published by dimus over 5 years ago