Hence, it tames the complexity that arises from the "Cambrian explosion" of novel data processing platforms that we currently witness.Ĭhristofer Dutz, Lars George, Bernd Fondermann, Jean-Baptiste OnofréĪn implementation of the Atom Syndication Format and Atom Publishing Protocol.Īccumulo is a distributed key/value store that provides expressive, cell-level access labels.īenson Margulies, Alan Cabrera, Bernd FondermannĪ software distribution framework based on OSGi that allows you to manage and distribute artifacts. Wayang is a cross-platform data processing system that aims at decoupling the business logic of data analytics applications from concrete data processing platforms, such as Apache Flink or Apache Spark. Uniffle is an unified Remote Shuffle Serviceįelix Cheung, Junping Du, Liu Xun, Weiwei Yang, Zhankun Tang The Training project aims to develop resources which can be used for training purposes in various media formats, languages and for various Apache and non-Apache target projects.Ĭraig Russell, Christofer Dutz, Justin Mclean, Lars Francke Toree provides applications with a mechanism to interactively and remotely access Apache Spark. Teaclave is a universal secure computing platform.įelix Cheung, Furkan Kamaci, Jianyong Dai, Matt Sicker, Zhijie Shen, Gordon King Tison, Willem Ning Jiang, Stephan Ewen, Thomas Weise, Duo Zhang StreamPark is a streaming application development platform. SDAP is an integrated data analytic center for Big Science problems. Pony Mail is a mail-archiving, archive viewing, and interaction service, that can be integrated with many email platforms. PJ Fanning, Justin McLean, Roman Shaposhnik, Wu Sheng, Ryan Skraba, JB Onofré, Claude Warren Pekko is a toolkit and an ecosystem for building highly concurrent, distributed, reactive and resilient applications for Java and Scala. Pegasus is a distributed key-value storage system which is designed to be simple, horizontally scalable, strongly consistent and high-performance.ĭuo zhang, Liang Chen, Von Gosling, Liu Xun Paimon is a unified lake storage to build dynamic tables for both stream and batch processing with big data compute engines, supporting high-speed data ingestion and real-time data query.īecket Qin, Robert Metzger, Stephan Ewen, Yu Li Tison, Willem Ning Jiang, Sheng Wu, Ted Liu, Xiaoqiao He Open Data Access Layer: Access data freely, painlessly, and efficiently. Roman Shaposhnik, Furkan Kamaci, Evans Ye, Paul King, Konstantin I Boudnik, Dave Fisher Hyunsik Choi, Byung-Gon Chun, Jean-Baptiste Onofré, Markus Weimer Nemo is a data processing system to flexibly control the runtime behaviors of a job to adapt to varying deployment characteristics. Milagro is core security infrastructure and crypto libraries for decentralized networks and distributed systems. With Livy, new applications can be built on top of Apache Spark that require fine grained interaction with many Spark contexts.īikas Saha, Luciano Resende, Jean-Baptiste Onofré, Madhawa Kasun Gunasekara Livy is web service that exposes a REST interface for managing long running Apache Spark contexts in your cluster. Jean-Baptiste Onofré, Henry Saputra, Uma Maheswara Rao G, Davor Bonaci, Liang Chen KIE (Knowledge is Everything) is a community of solutions and supporting tooling for knowledge engineering and process automation, focusing on events, rules, and workflows.īrian Proffitt, Claus Ibsen, Andrea CosentinoĪpache Liminal is an end-to-end platform for data engineers and scientists, allowing them to build, train and deploy machine learning models in a robust and agile way. Lidong Dai, Trista Pan, Xiangdong Huang, Yu Li, Willem Ning Jiang Taylor Goetz, Henry Saputra, Furkan KamaciĭevLake is a development data platform, providing the data infrastructure for developer teams to analyze and improve their engineering productivity.įelix Cheung, Liang Zhang, Lidong Dai, Sijie Guo, Jean-Baptiste Onofré, Willem Ning JiangĪ large-scale and easy-to-use graph database Nick Kew, Tommaso Teofili, Benjamin YoungĪpache Baremaps is a toolkit and a set of infrastructure components for creating, publishing, and operating online maps.īertrand Delacretaz, Martin Desruisseaux, Julian Hyde, Calvin Kirs, George Percivall, Martin DesruisseauxĬeleborn is an intermediate data service for big data computing engines to boost performance, stability, and flexibility.īecket Qin, Lidong Dai, Willem Ning Jiang, Duo Zhang, Yu LiĭataLab is a platform for creating self-service, exploratory data science environments in the cloud using best-of-breed data science tools. Annotator provides annotation enabling code for browsers, servers, and humans.
0 Comments
Leave a Reply. |
Details
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |