🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
60 Implementations/tutorials of deep learning papers with side-by-side notes, including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), GANs (cyclegan, stylegan2, ...), reinforcement learning (ppo, dqn), capsnet, distillation, ...
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in PyTorch
LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.
Easy-to-use and powerful NLP and LLM library with an awesome model zoo, supporting a wide range of NLP tasks from research to industrial applications, including Text Classification, Neural Search, Question Answering, Information Extraction, Document Intelligence, Sentiment Analysis, etc.
RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). It combines the best of RNNs and transformers: great performance, fast inference, low VRAM use, fast training, "infinite" ctx_len, and free sentence embedding.
The next-generation platform to monitor and optimize your AI costs in one place
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.
An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.
Fast State-of-the-Art Tokenizers optimized for Research and Production
Implementation of RLHF (Reinforcement Learning from Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM
A PyTorch-based Speech Toolkit
An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.
C# functional language extensions - a base class library for functional programming
BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
This repository contains demos I made with the Transformers library by Hugging Face.
Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in PyTorch