IndexFS - 0.4.x

IndexFS is designed as file system middleware layered on top of an existing cluster file system deployment to improve the metadata performance and small-file efficiency of the original file system. IndexFS reuses the data path of the underlying file system and packs directory entries, file attributes, and small file data into a set of large, immutable, log-structured, indexed data structures (SSTables) that are stored in the underlying file system. Our experiments show that IndexFS can outperform existing solutions such as PVFS, Lustre, and HDFS, in some cases by orders of magnitude.

This guide describes how to install and run IndexFS on a local Linux machine. For more information, please visit our project home at http://www.pdl.cmu.edu/indexfs. Please also note that the current implementation of IndexFS is not of production quality and is recommended for research purposes only.

INDEXFS INSTALLATION GUIDE

  1. System Prerequisites
  2. Build from Source
  3. IndexFS in Standalone Mode

SYSTEM PREREQUISITES

IndexFS depends on gflags-2.1.2+, glog-0.3.4+, and thrift-0.9.3+. They can be downloaded from https://www.assembla.com/spaces/deltafs/documents. To build IndexFS from source, you will also need a C++ build toolchain such as the GNU one, including gcc, g++, make, autoconf, automake, and libtool.

In addition, some benchmarks that IndexFS uses to evaluate system performance are built with MPI; at least one MPI implementation (such as MPICH) must be present for these benchmarks to run.
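
To see which of the tools named above are already present before installing anything, a small check along the following lines may help. This is only a sketch: the helper name `missing_tools` is made up for illustration, `libtoolize` is checked in place of `libtool` on the assumption that your distribution may package the `libtool` binary separately, and the MPI command names `mpicc` and `mpirun` assume an MPICH-style or Open MPI-style installation.

```shell
#!/bin/sh
# Hypothetical helper: print each named command that is NOT found on PATH,
# one per line. Prints nothing if every command is available.
missing_tools() {
    for tool in "$@"; do
        command -v "$tool" >/dev/null 2>&1 || printf '%s\n' "$tool"
    done
}

# Build toolchain listed in the prerequisites. libtoolize stands in for
# libtool (assumption: the libtool binary may live in a separate package).
BUILD_TOOLS="gcc g++ make autoconf automake libtoolize"
# MPI compiler wrapper and launcher (assumed names, as shipped by MPICH).
MPI_TOOLS="mpicc mpirun"

# Word splitting of the two lists is intentional here.
missing=$(missing_tools $BUILD_TOOLS $MPI_TOOLS)
if [ -n "$missing" ]; then
    echo "Missing tools:" $missing
else
    echo "All build and MPI tools found."
fi
```

The check only looks for commands on the PATH; it does not verify library versions such as gflags or glog.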

STEP-BY-STEP INSTRUCTIONS

INSTALL SYSTEM PACKAGES
  • Ubuntu

      sudo apt -y update
      sudo apt -y install cmake clang libboost-all-dev librados-dev
      sudo apt -y install google-perftools libgoogle-perftools-dev cmake build-essential pkgconf
      sudo apt -y install gdb libssl-dev tmux liblua5.3-dev

      sudo apt -y install gcc g++ make flex bison
      sudo apt -y install autoconf automake libtool pkg-config
      sudo apt -y install zlib1g-dev libsnappy-dev
      sudo apt -y install libboost-all-dev libevent-dev libssl1.0-dev
      sudo apt -y install libfuse-dev libmpich-dev mpich pdsh
      sudo apt -y install default-jdk
    
Build & Install Dependencies

Install gflags and glog first, then build and install thrift from source using the standard GNU build process (configure, make, make install).

  • To install gflags and glog (from Ubuntu packages):

      sudo apt -y install libgoogle-glog-dev libgflags-dev
    
  • To build thrift:

      wget https://archive.apache.org/dist/thrift/0.10.0/thrift-0.10.0.tar.gz
      tar -xvf thrift-0.10.0.tar.gz
      cd thrift-0.10.0
      ./configure --without-qt4 --without-qt5 --without-c_glib \
                  --without-csharp --without-java --without-erlang \
                  --without-nodejs --without-lua --without-python \
                  --without-perl --without-php --without-php_extension  \
                  --without-ruby --without-haskell --without-go \
                  --without-haxe --without-d --enable-tests=no --enable-tutorial=no
      make && sudo make install && sudo ldconfig
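
After the thrift build finishes, it can be worth confirming that the installed compiler meets the thrift-0.9.3+ requirement stated in the prerequisites. The sketch below assumes that `thrift --version` prints a line ending in the version number (for example "Thrift version 0.10.0") and that GNU `sort -V` is available; the helper name `version_at_least` is made up for illustration.

```shell
#!/bin/sh
# Hypothetical helper: succeed if dotted version $1 is at least version $2.
# Relies on GNU sort -V (version sort): the smaller version sorts first.
version_at_least() {
    [ "$(printf '%s\n%s\n' "$2" "$1" | sort -V | head -n 1)" = "$2" ]
}

if command -v thrift >/dev/null 2>&1; then
    # Assumption: the version number is the last field of the output.
    ver=$(thrift --version | awk '{print $NF}')
    if version_at_least "$ver" 0.9.3; then
        echo "thrift $ver satisfies the 0.9.3+ requirement"
    else
        echo "thrift $ver is older than 0.9.3"
    fi
else
    echo "thrift compiler not found on PATH"
fi
```

A plain string comparison would rank 0.10.0 below 0.9.3, which is why the sketch uses a version-aware sort.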
    

BUILD INDEXFS FROM SOURCE

Build IndexFS

IndexFS also follows the standard GNU build process. For your convenience, IndexFS provides bootstrap.sh, which does this automatically for you.

  • To build IndexFS:

      autoreconf -ifv && bash bootstrap.sh
    

NB: you don't have to install IndexFS into your system. Our scripts do not assume that the IndexFS binaries are accessible from your system path.

INDEXFS IN STANDALONE MODE

Run standalone IndexFS

Running IndexFS in standalone mode is a quick way to test if IndexFS has been successfully built.

By standalone, we mean running a single IndexFS (metadata) server instance and multiple client processes on a single machine, so everything is in one box.

  • To start IndexFS server:

      $INDEXFS_HOME/sbin/start-idxfs.sh
    
  • To start IndexFS clients (processes) and run tests:

      $INDEXFS_HOME/sbin/tree-test.sh
    
  • To stop IndexFS server:

      $INDEXFS_HOME/sbin/stop-idxfs.sh
    

The scripts above start the IndexFS server as a daemon running in the background. Its PID is recorded at /tmp/indexfs/run/s0/indexfs_server.pid.0.
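
The recorded PID can be used to check whether the server is still alive. The following is a minimal sketch, not part of the IndexFS scripts: the function name `server_status` is made up for illustration, and it assumes the pid file contains just the process ID and that the server runs as the same user who performs the check (otherwise `kill -0` may fail even though the process exists).

```shell
#!/bin/sh
# Hypothetical helper: report the state of the process named in a pid file.
# Prints "running <pid>", "stale <pid>", or "absent".
server_status() {
    pid_file=$1
    [ -r "$pid_file" ] || { echo "absent"; return 0; }
    pid=$(cat "$pid_file")
    # kill -0 delivers no signal; it only tests that the process exists
    # and that we are allowed to signal it.
    if kill -0 "$pid" 2>/dev/null; then
        echo "running $pid"
    else
        echo "stale $pid"
    fi
}

# Default to the pid file location given in this guide.
server_status "${1:-/tmp/indexfs/run/s0/indexfs_server.pid.0}"
```

A "stale" result means the pid file exists but no matching process was found, for example after a crash.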

A simple MPI-based test is then run against the IndexFS metadata path. The test forks 2 client processes that collectively create and stat 8000 files under a single shared directory. It is expected to finish within 1 second.
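
The start, test, and stop steps can be chained so that the server is stopped even when the test fails. This is a sketch under stated assumptions: the wrapper name `run_standalone_test` is made up for illustration, it relies only on the three script paths given in this guide, and it assumes those scripts signal failure through a non-zero exit status.

```shell
#!/bin/sh
# Hypothetical wrapper: start the server, run the tree test, then stop the
# server regardless of the test result. $1 is the IndexFS directory
# (the $INDEXFS_HOME used in this guide).
run_standalone_test() {
    home=$1
    for s in start-idxfs.sh tree-test.sh stop-idxfs.sh; do
        [ -x "$home/sbin/$s" ] || { echo "missing $home/sbin/$s" >&2; return 2; }
    done
    "$home/sbin/start-idxfs.sh" || return 1
    "$home/sbin/tree-test.sh"
    status=$?                      # remember the test result
    "$home/sbin/stop-idxfs.sh"     # always stop the server
    return "$status"
}

# Only run when INDEXFS_HOME is set, so the file is safe to source.
if [ -n "${INDEXFS_HOME:-}" ]; then
    run_standalone_test "$INDEXFS_HOME"
fi
```

The wrapper returns the exit status of the test script, so it can be used directly in a larger build or CI script.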
