MIT - The Human Data Interaction Project
- Cambridge, MA
- https://hdi-dai.lids.mit.edu/
- [email protected]
Grow your team on GitHub
GitHub is home to over 50 million developers working together. Join them to grow your own development teams, manage permissions, and collaborate on projects.
Sign upPinned repositories
Repositories
-
BTB
A simple, extensible library for developing AutoML systems
-
MLBlocks
A library for composing end-to-end tunable machine learning pipelines.
-
MLPrimitives
Primitives for machine learning and data science.
-
-
AutoBazaar
AutoBazaar: An AutoML System from the Machine Learning Bazaar
-
ballet-predict-census-income
A feature engineering pipeline for income prediction using the Ballet framework
-
ballet-predict-house-prices
A feature engineering pipeline for house price prediction using the Ballet framework
-
ballet
☀️ 🦶 A lightweight framework for collaborative, open-source, data science -
ATMSeer
Visual Exploration of Automated Machine Learning with ATMSeer
-
-
mit-d3m
MIT tools to work with datasets in the D3M format.
-
github-oauth-gateway
Gateway for authenticating with GitHub using OAuth
-
mit-d3m-ta2
MIT's TA2 System
-
ATM
Auto Tune Models - A multi-tenant, multi-data system for automated machine learning (model selection and tuning).
-
hdi-project.github.io
hdi-project.github.io website
-
d3m-dataset-manager
Tool to generate and manage datasets in custom formats.
-
Trane
Automatically generating machine learning tasks from multi entity, temporal datasets.
-
mlbazaar-demos
MLPrimitives demos
-
piex
Pipeline Explorer - Explore and analyze millions of pipelines learned using MLBlocks and MLPrimitives.
-
FeatureHub
A collaborative feature engineering system built on JupyterHub
-
DataAudit
DataAudit
-
MLBlocks-Demos
Some example MLPipelines and code to test them on sample datasets
-
ATM-Demos
This repo has tutorials, demos and experimental code for the ATM repository
-
model-provenance-json
A specification for the model provenance file that keeps track of the journey from raw data to deployed model in Machine Learning 2.0 projects.

