analyzemyrepo.com

Discover the Best Open-Source Multimodal Projects Sorted by Stars

Get insights into star growth, contributions, diversity, bus factor, and community governance. Optimize your project's performance and collaborate with a thriving community.

jina-ai/jina

🔮 Multimodal AI services & pipelines with cloud-native stack: gRPC, Kubernetes, Docker, OpenTelemetry, Prometheus, Jaeger, etc.

19,034

facebookresearch/mmf

A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)

5,301

haotian-liu/LLaVA

Visual Instruction Tuning: Large Language-and-Vision Assistant built towards multimodal GPT-4 level capabilities.

4,870

clovaai/donut

Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022

4,284

swyxio/ai-notes

Notes for software engineers getting up to speed on new AI developments. Serves as a datastore for https://latent.space writing and product brainstorming, with cleaned-up canonical references under the /Resources folder.

3,715

SkalskiP/courses

This repository is a curated collection of links to various courses and resources about Artificial Intelligence (AI)

3,646

kyegomez/tree-of-thoughts

Plug-and-Play Implementation of Tree of Thoughts: Deliberate Problem Solving with Large Language Models that Elevates Model Reasoning by at least 70%

3,432

IDEA-CCNL/Fengshenbang-LM

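Fengshenbang-LM (封神榜大模型) is an open-source system of large models led by the Cognitive Computing and Natural Language Research Center at the IDEA Research Institute, intended to serve as infrastructure for Chinese AIGC and cognitive intelligence.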

3,334

OpenGVLab/InternGPT

InternGPT (iGPT) is an open-source demo platform where you can easily showcase your AI models. It now supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editing, etc. Try it at igpt.opengvlab.com (an online demo system supporting DragGAN, ChatGPT, ImageBind, and SAM)

2,851

rom1504/img2dataset

Easily turn large sets of image URLs into an image dataset. Can download, resize, and package 100M URLs in 20 hours on one machine.

2,567

lucidrains/big-sleep

A simple command-line tool for text-to-image generation, using OpenAI's CLIP and a BigGAN. The technique was originally created by https://twitter.com/advadnoun

2,534
