Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
StarRocks, a Linux Foundation project, is a next-generation sub-second MPP OLAP database for full analytics scenarios, including multi-dimensional analytics, real-time analytics, and ad-hoc queries.
Create full-fledged APIs for slowly moving datasets without writing a single line of code.
DeepDiff: Deep Difference and search of any Python object/data. DeepHash: Hash of any object based on its contents. Delta: Use deltas to reconstruct objects by adding deltas together.
a C\C++ library and command-line tools for Diff & Patch between binary files or directories(folder); cross-platform; runs fast; create small delta/differential; support large files and limit memory requires when diff & patch.
This is the github repo for Learning Spark: Lightning-Fast Data Analytics [2nd Edition]
The most complete tool for modding, decompiling and unpacking Undertale (and other Game Maker: Studio games!)
Email-based instant messaging for Desktop.
Get updates on the fastest growing repos and cool stats about GitHub right in your inbox
Once per month. No spam.