🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
💫 Industrial-strength Natural Language Processing (NLP) in Python
A comprehensive list of pytorch related content on github,such as different models,implementations,helper libraries,tutorials etc.
fastNLP: A Modularized and Extensible NLP Framework. Currently still in incubation.
:house_with_garden: Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering.
Awesome-pytorch-list 翻译工作进行中......
Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.
A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coherent topics. Published at EACL and ACL 2021.
skweak: A software toolkit for weak supervision applied to NLP tasks
Kuromoji is a self-contained and very easy to use Japanese morphological analyzer designed for search
Self-contained Japanese Morphological Analyzer written in pure Go
💁 Awesome Treasure of Transformers Models for Natural Language processing contains papers, videos, blogs, official repo along with colab Notebooks. 🛫☑️
Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenization, word normalization, word segmentation (for splitting hashtags) and spell correction, using word statistics from 2 big corpora (english Wikipedia, twitter - 330mil english tweets).
Get updates on the fastest growing repos and cool stats about GitHub right in your inbox
Once per month. No spam.