https://github.com/alirezatheh/perke

A keyphrase extractor for Persian
data-mining data-processing information-retrieval keyphrase keyphrase-extraction keyphrase-extractor keyword keyword-extraction keyword-extractor machine-learning ml natural-language-processing nlp persian persian-language python text-mining text-processing unsupervised-learning
Added: over 1 year ago - Last Synced: 11 months ago - Created: February 03, 2020

  • Relevant topics? true
  • External users? true
  • Open source license? true
  • Active? true
  • Fork? false
  • Main Language: Python
  • Commits: 87
  • Committers: 4
  • Issues: 0
  • Pull Requests: 4
  • Owner: AlirezaTheH
  • Stars: 68
  • Forks: 7
  • Packages: 1
  • Downloads: 77
https://github.com/asyml/forte

Forte is a flexible and powerful ML workflow builder. This is part of the CASL project: http://casl-project.ai/
data-processing deep-learning information-retrieval machine-learning natural-language natural-language-processing pipeline python text-data
Added: over 1 year ago - Last Synced: 11 months ago - Created: August 09, 2019

  • Relevant topics? true
  • External users? true
  • Open source license? true
  • Active? true
  • Fork? false
  • Main Language: Python
  • Commits: 1028
  • Committers: 53
  • Issues: 55
  • Pull Requests: 49
  • Owner: asyml
  • Stars: 236
  • Forks: 60
  • Packages: 1
  • Downloads: 173
https://github.com/nl4dv/nl4dv

A Python toolkit to create visualizations (Vis) using natural language (NL) or to add an NL interface to existing Vis.
conversational conversational-interaction conversational-interactions data-visualization datascience jupyter-notebook natural-language natural-language-interface natural-language-processing nl-interface opensource python toolkit vega-lite visualization
Added: over 1 year ago - Last Synced: 11 months ago - Created: April 30, 2020

  • Relevant topics? true
  • External users? true
  • Open source license? true
  • Active? true
  • Fork? false
  • Main Language: Python
  • Commits: 192
  • Committers: 6
  • Issues: 12
  • Pull Requests: 5
  • Owner: nl4dv
  • Stars: 127
  • Forks: 23
  • Packages: 1
  • Downloads: 35
https://github.com/asyml/texar

Toolkit for Machine Learning, Natural Language Processing, and Text Generation, in TensorFlow. This is part of the CASL project: http://casl-project.ai/
bert casl-project data-processing deep-learning dialog-systems gpt-2 machine-learning machine-translation natural-language-processing python tensorflow texar text-data text-generation xlnet
Added: over 1 year ago - Last Synced: 11 months ago - Created: July 22, 2017

  • Relevant topics? true
  • External users? true
  • Open source license? true
  • Active? true
  • Fork? false
  • Main Language: Python
  • Commits: 1384
  • Committers: 40
  • Issues: 53
  • Pull Requests: 48
  • Owner: asyml
  • Stars: 2381
  • Forks: 371
  • Packages: 2
  • Downloads: 52
https://github.com/kensho-technologies/sequence_align

Efficient implementations of Needleman-Wunsch and other sequence alignment algorithms written in Rust with Python bindings via PyO3.
bioinformatics hirschberg natural-language-processing needleman-wunsch nlp pyo3 python rust sequence-alignment
Added: over 1 year ago - Last Synced: 11 months ago - Created: April 05, 2023

  • Relevant topics? true
  • External users? true
  • Open source license? true
  • Active? true
  • Fork? false
  • Main Language: Python
  • Commits: 13
  • Committers: 2
  • Issues: 5
  • Pull Requests: 10
https://github.com/konbraphat51/animatedwordcloud

Animate a timelapse of a word cloud
animation datascience natural-language-processing nlp video visualization wordcloud
Added: over 1 year ago - Last Synced: 11 months ago - Created: November 15, 2023

  • Relevant topics? true
  • External users? true
  • Open source license? true
  • Active? true
  • Fork? false
  • Main Language: Python
  • Commits: 675
  • Committers: 4
  • Issues: 45
  • Pull Requests: 87
  • Owner: konbraphat51
  • Stars: 9
  • Forks: 0
  • Packages: 1
  • Downloads: 271
https://github.com/asyml/fortehealth

The project is in the incubation stage and still under development. ForteHealth is a flexible and powerful ML workflow builder for biomedical and clinical scenarios. This is part of the CASL project: http://casl-project.ai/
biomedical-named-entity-recognition clinical-nlp clinical-text-processing data-processing deep-learning information-retrieval machine-learning natural-language natural-language-processing python
Added: over 1 year ago - Last Synced: 11 months ago - Created: February 04, 2022

  • Relevant topics? true
  • External users? true
  • Open source license? true
  • Active? true
  • Fork? false
  • Main Language: Python
  • Commits: 383
  • Committers: 10
  • Issues: 48
  • Pull Requests: 43
  • Owner: asyml
  • Stars: 10
  • Forks: 5
  • Packages: 1
  • Downloads: 13
https://github.com/dongrixinyu/jionlp

A Chinese NLP preprocessing and parsing package: accurate, efficient, and easy to use. www.jionlp.com
apache2 chinese natural-language-processing ner nlp nlp-parse preprocessing python time-parse time-parsing
Added: over 1 year ago - Last Synced: 11 months ago - Created: March 13, 2020

  • Relevant topics? true
  • External users? true
  • Open source license? true
  • Active? true
  • Fork? false
  • Main Language: Python
  • Commits: 489
  • Committers: 14
  • Issues: 261
  • Pull Requests: 32
  • Owner: dongrixinyu
  • Stars: 3054
  • Forks: 370
  • Packages: 4
  • Downloads: 2,815
https://github.com/liaad/pt-pump-up

Hub for Portuguese-language NLP resources
natural-language-processing nlp nlp-datasets nlp-resources portuguese-language resources
Added: over 1 year ago - Last Synced: 11 months ago - Created: October 25, 2023

  • Relevant topics? true
  • External users? true
  • Open source license? true
  • Active? true
  • Fork? false
  • Main Language: PHP
  • Commits: 97
  • Committers: 2
  • Issues: 13
  • Pull Requests: 13
  • Owner: LIAAD
  • Stars: 4
  • Forks: 0
  • Packages: 0
https://github.com/allenai/smashed

SMASHED is a toolkit designed to apply transformations to samples in datasets, such as field extraction, tokenization, prompting, batching, and more. Supports datasets from Hugging Face, torchdata iterables, or simple lists of dictionaries.
dataset datasets dict huggingface in-context-learning mappers natural-language-processing nlp pipeline prefix prefix-tuning preprocessing prompting pytorch text torchdata transformer transformers
Added: over 1 year ago - Last Synced: 11 months ago - Created: July 21, 2022

  • Relevant topics? true
  • External users? true
  • Open source license? true
  • Active? true
  • Fork? false
  • Main Language: Python
  • Commits: 145
  • Committers: 6
  • Issues: 1
  • Pull Requests: 65
  • Owner: allenai
  • Stars: 30
  • Forks: 3
  • Packages: 1
  • Downloads: 11,380