Skip to content

Latest commit

 

History

History
62 lines (46 loc) · 2.19 KB

File metadata and controls

62 lines (46 loc) · 2.19 KB

Build Status Documentation Status GitHub license

Portable, scalable and reliable distributed machine learning.

Wormhole is a place where DMLC projects works together to provide scalable and reliable machine learning toolkits that can run on various platforms

Features

  • Portable:
    • Supported platforms: local machine, Apache YARN, MPI and Sungrid Engine
  • Rich support of Data Source
    • All projects can read data from HDFS, S3 or local filesystem
  • Scalable and Reliable

List of Tools

Build & Run

  • Requires a C++11 compiler (e.g.~g++ >=4.8) and git. Install them on Ubuntu

    = 13.10

sudo apt-get update && sudo apt-get install -y build-essential git libgoogle-glog-dev
  • Type make to build all deps and tools

  • All tools can run both in a laptop and in a cluster. For example, train logisitic regression using 2 workers and one servers in local machine

tracker/dmlc_local.py -n 2 -s 1 bin/linear.dmlc learn/linear/guide/demo.conf

Support

If you are having issues, please let us know.

Contribute

  • We are actively building new tools. The source codes of all tools are available under learn/.
  • Wormhole depends on other DMLC projects, which are also under active developing
    • dmlc-core provides I/O modules and job trackers
    • rabit provides reliable BSP Allreduce communication.
    • ps-lite provides the asynchronous key-value push and pull for the parameter server framework.