Skip to content
@PKU-DAIR

DAIR Lab

Data and Intelligence Research (DAIR) Lab @ Peking University

Pinned Loading

  1. Hetu Hetu Public

    Forked from Hsword/Hetu

    A high-performance distributed deep learning system targeting large-scale and automated distributed training.

    Python 331 40

  2. open-box open-box Public

    Forked from thomas-young-2013/open-box

    Towards Generalized and Efficient Blackbox Optimization System/Package (KDD 2021 & JMLR 2024)

    Python 431 58

  3. Hetu-Galvatron Hetu-Galvatron Public

    Forked from AFDWang/Hetu-Galvatron

    Galvatron is an automatic distributed training system designed for Transformer models, including Large Language Models (LLMs).

    Python 173 15

  4. mindware mindware Public

    Forked from thomas-young-2013/mindware

    An efficient open-source AutoML system for automating machine learning lifecycle, including feature engineering, neural architecture search, and hyper-parameter tuning.

    Python 60 9

  5. DataFlow DataFlow Public

    Forked from OpenDCAI/DataFlow

    Easy Data Preparation with latest LLMs-based Operators and Pipelines.

    Python 5

  6. Starter-Guide Starter-Guide Public

    A comprehensive guide for beginners in the field of data management and artificial intelligence.

    522 22

Repositories

Showing 10 of 43 repositories
  • open-box Public Forked from thomas-young-2013/open-box

    Towards Generalized and Efficient Blackbox Optimization System/Package (KDD 2021 & JMLR 2024)

    PKU-DAIR/open-box’s past year of commit activity
    Python 431 85 17 1 Updated Dec 24, 2025
  • Hetu-Galvatron Public Forked from AFDWang/Hetu-Galvatron

    Galvatron is an automatic distributed training system designed for Transformer models, including Large Language Models (LLMs).

    PKU-DAIR/Hetu-Galvatron’s past year of commit activity
    Python 173 Apache-2.0 18 3 0 Updated Dec 16, 2025
  • Hetu Public Forked from Hsword/Hetu

    A high-performance distributed deep learning system targeting large-scale and automated distributed training.

    PKU-DAIR/Hetu’s past year of commit activity
    Python 331 Apache-2.0 60 1 0 Updated Dec 13, 2025
  • mindware Public Forked from thomas-young-2013/mindware

    An efficient open-source AutoML system for automating machine learning lifecycle, including feature engineering, neural architecture search, and hyper-parameter tuning.

    PKU-DAIR/mindware’s past year of commit activity
    Python 60 MIT 28 0 0 Updated Nov 11, 2025
  • Hetu-Galvatron-Ascend Public

    A hardware plugin for Galvatron on Ascend.

    PKU-DAIR/Hetu-Galvatron-Ascend’s past year of commit activity
    Python 5 Apache-2.0 2 0 0 Updated Oct 17, 2025
  • Hetu-DiT Public
    PKU-DAIR/Hetu-DiT’s past year of commit activity
    Python 34 Apache-2.0 0 0 0 Updated Oct 16, 2025
  • DAIR_Portal_FE Public

    The website serves as an integrated portal platform for PKU-DAIR teams, designed to support presentation, communication, and management.

    PKU-DAIR/DAIR_Portal_FE’s past year of commit activity
    Vue 2 Apache-2.0 0 0 0 Updated Oct 13, 2025
  • DataFlow Public Forked from OpenDCAI/DataFlow

    Easy Data Preparation with latest LLMs-based Operators and Pipelines.

    PKU-DAIR/DataFlow’s past year of commit activity
    Python 5 Apache-2.0 150 0 0 Updated Jun 29, 2025
  • SAS-Bench Public

    Benchmarking large language models for short answer grading in a fine-grained, multi-subject, and human-aligned setting.

    PKU-DAIR/SAS-Bench’s past year of commit activity
    Python 68 Apache-2.0 3 0 0 Updated May 15, 2025
  • Starter-Guide Public

    A comprehensive guide for beginners in the field of data management and artificial intelligence.

    PKU-DAIR/Starter-Guide’s past year of commit activity
    522 22 1 (1 issue needs help) 0 Updated Apr 8, 2025