Recent Releases of GNparser
GNparser - v1.14.1
CHANGELOG
[v1.14.1] - 2026-01-14 Wed
- Add[#301]: rename 'initials' flag to 'compact-authors'.
WARNING this creates backward incompatibility (only for v1.14.0). - Add: normalize 'flatten' flag.
- Add: remove useless web-log flag.
Biosphere - Biodiversity Data Cleaning and Standardization
- Go
Published by dimus 5 months ago
GNparser - v1.14.0
CHANGLOG
- Add[#292]: add flattened option for JSON output, add more (optional) details
to CSV/TSV output. - Fix[#227]: Cultivars words show correctly in "words" section.
(PR #300 by @tobymarsden)
Biosphere - Biodiversity Data Cleaning and Standardization
- Go
Published by dimus 5 months ago
GNparser - v1.13.0
CHANGELOG
- Add[#298]: update CONTRIBUTING.md to reflect changes in gnparser's code.
- Add[#297]: rename NomCode field to NomCodeSetting to clarify its intention.
WARNING this creates some backward incompatibility. - Add[#294]: option to remove spaces between authors' names initials (PR
#295 by @juillerat) - Fix[#296]: old virus names are marked as 'Virus' when nomenclatural code
is set to 'icvcn'.
Biosphere - Biodiversity Data Cleaning and Standardization
- Go
Published by dimus 5 months ago
GNparser - v1.12.1
CHANGELOG
[v1.12.1] - 2025-12-19 Fri
- Add: virus code for web UI.
- Add: docs about viral code to README.
- Add: update flake data.
- Add: update nix build.
- Add: update TOC for tests.
[v1.12.0] - 2025-12-19 Fri
- Add[#290]: support modern virus nomenclature (ICTV/ICVCN)
Biosphere - Biodiversity Data Cleaning and Standardization
- Go
Published by dimus 5 months ago
GNparser - v1.11.5
[v1.11.5] - 2025-03-30 Sun
- Fix [#286]: parse names with named hybrid and comparison.
CHANGELOG
[v1.11.4] - 2025-03-10 Mon
- Add: improve flatten output.
[v1.11.3] - 2025-03-07 Fri
- Add [#283]: parsed.Flatten function to generate 'flat' version of parsed data.
- Add: update upload-artifacts in Actions to v4.
- Add [#279]: add
unr.rank.
Biosphere - Biodiversity Data Cleaning and Standardization
- Go
Published by dimus about 1 year ago
GNparser - v1.11.2
CHANGELOG
[v1.11.2] - 2025-02-21 Fri
- Add: update modules.
- Add [#282] remove varning for botanical names with
exauthors. - Add [#280] prol. botanical rank.
Full Changelog: https://github.com/gnames/gnparser/compare/v1.11.1...v1.11.2
Biosphere - Biodiversity Data Cleaning and Standardization
- Go
Published by dimus over 1 year ago
GNparser - v1.10.4
CHANGELOG
[v1.10.4] - 2024-11-07 Thu
- Add [#269]: switch to slog from zerologs.
- Fix [#271]: distinguish between
exandin. - Fix [#270]: missing verbatim authorship for names that look similar to
combination uninomial in ICZN. - Fix [#268]: if botanical author looks like a combination uninomial,
make sure its characters are not normalized.
Biosphere - Biodiversity Data Cleaning and Standardization
- Go
Published by dimus over 1 year ago
GNparser - v1.9.1
CHANGELOG
[v1.9.1] - 2023-10-13 Fri
- Add: update modules.
- Fix [#259]: allow diacritics in any UTF-8 normalization form.
- Fix [#258]: allow authors with 2 dashes in the name.
- Fix [#256]: fix normalization where a misplacced year changes
the year of original authors.
Biosphere - Biodiversity Data Cleaning and Standardization
- Go
Published by dimus about 2 years ago
GNparser - v1.8.0
CHANGELOG
[v1.8.0] - 2023-10-11 Wed
- Add [#255]: normalize stemmed canonical of
Aus bus bustoAus bus.
WARNING this creates some backward incompatibility. - Add: sorting uses
slicespackage.
Biosphere - Biodiversity Data Cleaning and Standardization
- Go
Published by dimus over 2 years ago
GNparser - v1.7.5
CHANGELOG
[v1.7.5] - 2023-09-26 Tue
- Add: CSV and TSV files provide now verbatim authorship instead of normalized
one. - Add: a few more "termination words"
- Fix [#254]: treat
faas forma. - Fix [#253]: process
demas an author word forVon dem Bushand like. - Fix [#251]: do not process
yasandforRafael Arango y Molina. - Fix [#249]: allow
cfat the end of the strings, cf for infraspecies. - Fix [#248]: do not escape double quotes for TSV output.
- Fix [#246]: ignore
msat the end of the strings.
Biosphere - Biodiversity Data Cleaning and Standardization
- Go
Published by dimus over 2 years ago
GNparser - v1.6.9
CHANGELOG
[v1.6.9] - 2022-11-10 Thu
- Add [#237]: detect and normalize non-breaking hyphens. In case if other
non-typical hythens will appear, they will be dealt the same way.
[v1.6.8] - 2022-10-01 Sat
- Add: update all modules.
Biosphere - Biodiversity Data Cleaning and Standardization
- Go
Published by dimus over 3 years ago
GNparser - v1.6.2
CHANGELOG
[v1.6.2]
- Fix [#221]: No parsing for names with
cyanobacterium. - Fix [#220]:
Crenarchaeote enrichment culture cloneshould stop parsing
atenrichment. - Fix [#219]: filter out
complexword during preprocessing for names like
Aegla uruguayana complex.
Biosphere - Biodiversity Data Cleaning and Standardization
- Go
Published by dimus over 4 years ago
GNparser - v1.5.6
CHANGELOG
-
Add [#212]: Set year from 'ex' authorship as a year of a name.
Add 'ex' authors to list of all authors. -
Add [#211]: PR [#214] by @tobymarsden, general approach for
non-...
specific epithets. -
Add [#208]: PR [#210] by @tobymarsden, option to preserve diaereses.
-
Fix [#213]: Stop generating space between
Mc,Macand the rest of an
an author name.
Biosphere - Biodiversity Data Cleaning and Standardization
- Go
Published by dimus over 4 years ago
GNparser - v1.5.4
CHANGELOG
- Add: different approach for normalize-by-type for words.
- Add [#205]: allow genera starting with De-, Eu-, Le-, Ne- (by @tobymarsden).
- Add [#203]: allow up to 2 dashes in genera (by @tobymarsden).
Biosphere - Biodiversity Data Cleaning and Standardization
- Go
Published by dimus over 4 years ago
GNparser - v1.5.2
CHANGELOG
- Add [#200]: support for 'div.' rank in uninomial combinations.
- Add [#199]: fixes for several names that were not parsed correctly.
- Add [#198]: parse "Solanum tuberosum wila-k`oyu".
- Add [#97]: do not parse "Cyanophage".
- Add [#85]: parse names with a dagger character.
- Add [#84]: parse "Muscicapa randi Amadon & duPont, 1970".
- Add [#83]: parse authors like 'Laverde-R.'.
Biosphere - Biodiversity Data Cleaning and Standardization
- Go
Published by dimus over 4 years ago
GNparser - v1.4.0
CHANGELOG
- Add [#193]: add TSV format for output.
- Add [#190]: support prefixes
doandde losfor authors. - Add [#187]: support
tersuffix for authors. - Add [#186]: support non-ASCII apostrophe in authors.
Biosphere - Biodiversity Data Cleaning and Standardization
- Go
Published by dimus over 4 years ago
GNparser - v1.3.3
CHANGELOG
- Add [#176]: refactoring of hybrid sign treatment (use PEG instead of
RegEx for normalizingx,X, and×. - Add [#183]: stop parsing after
nec,non,fide,vide, treat
ms inasinorexfor exAuthors. - Add [#182]: support for authors with prefixes
ten,delle,dos.
Biosphere - Biodiversity Data Cleaning and Standardization
- Go
Published by dimus almost 5 years ago
GNparser - Nightly build
GNparser development build
version: nightly-1-ge635f91
build: 2021-07-17_14:30:07UTC
Install gnparser
Linux and Mac (x64)
- Download gnparser tar file
- Run
tar xvf gnparser-linux.tar.gz - Copy gnparser to your path
cp gnparser /usr/local/bin
Windows (x64)
- Download gnparser zip file
- Unzip it
- Copy gnparser.exe somewhere in your path (see README for details)
Biosphere - Biodiversity Data Cleaning and Standardization
- Go
Published by github-actions[bot] almost 5 years ago
GNparser - v1.3.0
CHANGELOG
-
Add [#179]: add cultivars info to README.
-
Add [#178]: parse cultivars via REST API.
-
Add [#177]: parse botanical cultivars via web.
-
Add [#173]: cultivars parsing @tobymarsden.
-
Add [#172]: authors initials with a dash like "B.-E.van Wyk".
-
Add: tests for cultivars (Toby Marsden)
-
Fix [#174]: Hybrid character is missed or wrong in details'
Wordssection.
Biosphere - Biodiversity Data Cleaning and Standardization
- Go
Published by dimus almost 5 years ago
GNparser - v1.1.0
CHANGELOG
- Add [#163]: support bacterial
Candidatusnames. - Add [#162]: show PEG AST tree for debugging.
- Add [#161]: add automatic tools dependency.
- Add [#160]: use embed feature of Go v1.16.
Biosphere - Biodiversity Data Cleaning and Standardization
- Go
Published by dimus about 5 years ago
GNparser -
CHANGELOG
- Add: limit nightly builds to master only.
- Fix [#159]: missing '/' in POST route .
Biosphere - Biodiversity Data Cleaning and Standardization
- Go
Published by dimus over 5 years ago
GNparser - v1.0.12
CHANGELOG
- Add [#154]: parse names with ambiguous
f.as forma if there
is a space between authr andf.. If there is
no space, parse asfilius. Give ambiguity
warning in both cases. - Add: PHP example from @barotto about using pipes with gnparser.
Biosphere - Biodiversity Data Cleaning and Standardization
- Go
Published by dimus over 5 years ago
GNparser -
CHANGELOG
- Add [#152]: change auto-prereleases from nightly to on master submit.
- Add [#151]: do not parse names with
(endo|ecto)?symbiont. - Add [#150]: ignore serovar/serotype in bacerital names.
- Add [#149]: support abbreviated subgenus (
Aus (B.) cus).
Biosphere - Biodiversity Data Cleaning and Standardization
- Go
Published by dimus over 5 years ago
GNparser - v1.0.7
CHANGELOG
-
Add [#143]:
quietflag to suppress showing progress output. -
Fix [#142]: stream waits until certain names number is equal the batch size.
-
Fix [#141]: config file is not created.
Biosphere - Biodiversity Data Cleaning and Standardization
- Go
Published by dimus over 5 years ago
GNparser - v1.0.6
CHANGELOG
- Add: update version handling, readme.
- Add: remove gnlib package.
- Add [#140]: remove config package.
- Add: cleanup constructor methods names.
- Add [#139]: make package names less abstract.
- Fix [#137]: add correct VerbatimID for HTML-containing names.
- Add [#136]: Man page
- Add [#100]: Switch continuous integration to use GitHub Actions.
- Add [#129]: Make c-binding usable for biodiversity parser.
- Fix [#135]: Changes: SubGenus->Subgenus, InfraSpecies->Infraspecies
Biosphere - Biodiversity Data Cleaning and Standardization
- Go
Published by dimus over 5 years ago
GNparser - v1.0.0
CHANGELOG
- Add [#127]: Update documentation to v1.0.0.
- Add [#122]: Implement parsing as a stream in addition to batch parsing.
- Add [#126]: Update c-binging to v1.0.0.
- Add [#131]: Add parameters "with_details" and "csv" to REST API.
- Add [#134]: Transofrm "positions" section to "words" section.
- Add [#128]: Add more examples to OpenAPI specification.
- Add [#125]: Describe changes from v0.x to 1.x.
- Add [#132]: Add context.Context to control lifespan of
go routines. - Add [#115]: Migrate tests from ginkgo to plain tests.
- Add [#109]: Move
webpackag toio. - Add [#124]: Document warnings for each quality category.
- Add [#121]: Convert
packageparser to use interfaces. - Add [#120]: CLI app for newly created functionality.
- Add [#119]: Formatted output for
output.Parsed. - Add [#117]: Convert failed parsing results to
output.Parsed. - Add [#114]: Convert parsing result to
output.Parsed. - Add [#118]: Add
VerbatimandYearfields to the root ofAuthorship. - Add [#107]: Move
grammarpackage toentityand rename toparser. - Add [#110]: Move
stemmertoentity. - Add [#113]: Move
strpackage toentity. - Add [#112]: Move
preprocesspackage toentity. - Add [#105]: Move
fspackage toio. - Add [#111]: Move
dictpackage toio. - Add [#106]: Describe main use-case via interface.
- Add [#104]: Add configuration package.
- Add [#103]: Create an output.Parsed object that can be used in Go and as JSON.
- Add [#101]: Start using gnlib where it makes sense.
- Add [#99]: Move code to GitHub and change links accordingly.
- Add [#95]: Remove dependency on gRPC and protobuf.
Biosphere - Biodiversity Data Cleaning and Standardization
- Go
Published by dimus over 5 years ago
GNparser -
CHANGELOG
- Add [#96]: Do not parse names starting with "Candidatus".
- Add [#93]: Parse 'y' (Spanish '&') as an author separator.
Biosphere - Biodiversity Data Cleaning and Standardization
- Go
Published by dimus over 5 years ago