Cometa is easy to self-host thanks to its Docker image. First, install Docker by following the instructions on the official site. Now run Cometa in automatic mode by running the following command in your terminal:
This runs Docker in detached mode so you will not see a log or a terminal prompt.
From now on, the web repository will be served at port 8080 (visit localhost:8080 in your browser to see the repository). Cometa will listen for new datasets in the folder public/pending in mldr (.rds) format. Any new incoming dataset will be partitioned and its metadata will be extracted in order to display it on the repository. Partitions will be created inside public/partitions, and the full dataset will be converted to several formats in public/full.
Running Cometa in interactive mode
If you do not want Cometa to run in automatic mode, you can launch it interactively by changing -dp for -itp in the previous docker run command. Use --rm as well in order to throw away your container when you've finished using it:
Wait for the download to complete, and you should see a welcome message and some options to choose:
Automatic: default behavior in non-interactive mode.
Only partitioning: partitions all datasets in public/pending and exits the program.
Only serve website: launches a web server for the included dataset repository.
Drop to a terminal: advanced mode.
The second option will run different partitioning and cross-validation strategies through your data. Bear in mind this could take from minutes to several hours depending on the size of the datasets.