COLLECTED BY
Organization:
Internet Archive
Focused crawls are collections of frequently-updated webcrawl data from narrow (as opposed to broad or wide) web crawls, often focused on a single domain or subdomain.
The Wayback Machine - https://web.archive.org/web/20200727171136/https://github.com/topics/data-stream
Here are
72 public repositories
matching this topic...
A curated list of awesome big data frameworks, ressources and other awesomeness.
Apache Kafka running on Kubernetes
Updated
Jul 27, 2020
Java
Probabilistic data structures for processing continuous, unbounded streams.
An open-source framework for real-time anomaly detection using Python, ElasticSearch and Kibana
Updated
Mar 31, 2020
Python
NIST Certified SCAP 1.2 toolkit
Updated
Jul 23, 2020
XSLT
A stream processing API for Go (alpha)
Go stream processing library
Series and Panels for Real-time and Exploratory Analysis of Data Streams
Simple yet powerful live data computation framework
Updated
Jul 27, 2020
JavaScript
Probabilistic deep learning for data streams.
Updated
Oct 6, 2019
Scala
The Open Source Time-Series Data Historian
Tideland GoCells (Event Based Applications)
The Tornado 🌪️ framework, designed and implemented for adaptive online learning and data stream mining in Python.
Updated
May 4, 2020
Python
Appbase.io streaming client lib for Javascript
Updated
Jul 20, 2020
JavaScript
A framework for data stream modeling and associated data mining tasks such as clustering and classification. - R Package
Learn how to use Kinesis Firehose, AWS Glue, S3, and Amazon Athena by streaming and analyzing reddit comments in realtime. 100-200 level tutorial.
Updated
Jun 26, 2020
Python
MIST: High-performance IoT Stream Processing
Updated
Mar 19, 2019
Java
HyperLogLog and other probabilistic data structures for mining in data streams
Updated
Dec 3, 2014
Python
A Node.js and JavaScript synchronous data pipeline processing, data sharing and stream processing library. Actionable & Transformable Pipeline data processing.
Updated
Aug 8, 2017
JavaScript
unsupervised concept drift detection
Updated
Dec 16, 2019
Python
A library for performing Content-Defined Chunking (CDC) on data streams.
Simple cloud based logger for microcontrollers.
Updated
Jan 23, 2016
Python
RPJiOS: RPJ's RPi OS, a sensor data platform for the Raspberry Pi built with python2.7 and redis.
Updated
May 9, 2020
Python
Kafka-ML: connecting the data stream with ML/AI frameworks (now TensorFlow)
Updated
Jul 20, 2020
Python
非结构化课程作业,包括社交网络、链路预测、数据流、文本分析
Updated
Mar 18, 2019
Jupyter Notebook
🗂 List all repositories on Github (separated by language)
Updated
Jun 16, 2020
Shell
Real-time data stream classification and knowledge generation engine with no dependencies
Updated
May 10, 2017
Pony
Software library for stream-based recommender systems
Updated
Jul 5, 2020
Python
Library to connect to the Azure Event Hub via AMQP 1.0 for the Go programming language (Golang) based on Apache Qpid Proton (an AMQP 1.0 C library)
Offline and online (i.e., real-time) annotated clustering methods for text data.
Updated
Sep 15, 2018
Jupyter Notebook
Improve this page
Add a description, image, and links to the
data-stream
topic page so that developers can more easily learn about it.
Curate this topic
Add this topic to your repo
To associate your repository with the
data-stream
topic, visit your repo's landing page and select "manage topics."
Learn more
You can’t perform that action at this time.
You signed in with another tab or window. Reload to refresh your session.
You signed out in another tab or window. Reload to refresh your session.