📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.
Always know what to expect from your data.
🧙 The modern replacement for Airflow. Build, run, and manage data pipelines for integrating and transforming data.
Kestra is an infinitely scalable orchestration and scheduling platform for creating, running, scheduling, and monitoring millions of complex pipelines.
lakeFS - Data version control for your data lake | Git for data
pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptune, OpenSearch, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).