Application Name
Wordcount MN
Summary
- Name: Wordcount MN
- Contact Person: support-compss@bsc.es
- Access Level: public
- License Agreement: GPL
- Platform: COMPSs
- Repository: Wordcount MN
Description
Wordcount application
Versions
There are two versions of Wordcount, depending on how the input data is given
Version 1
Single input file, where all the text is given in the same file and the chunks are calculated with a BLOCK_SIZE parameter
Version 2
Multiple input files, where the text fragments are already in different files under the same directory
Execution instructions
Usage: runcompss wordcount.multipleFiles.Wordcount <DATA_FOLDER> runcompss wordcount.uniqueFile.Wordcount <DATA_FILE> <BLOCK_SIZE>
where:
- - DATA_FOLDER: Absolute path to the base folder of the dataset files
- - DATA_FILE: Absolute path to the dabase file
- - BLOCK_SIZE: Number of bytes of each block
Execution Example
runcompss wordcount.multipleFiles.Wordcount ~/multipleDataset/ runcompss wordcount.uniqueFile.Wordcount ~/singleDataset/data.txt 100
Build
Option 1: Native java
cd ~/tutorial_apps/java/wordcount/; javac src/main/java/wordcount/multipleFiles/*.java; javac src/main/java/wordcount/uniqueFile/*.java; cd src/main/java/; jar cf wordcount.jar wordcount/ cd ../../../; mv src/main/java/wordcount.jar jar/
Option 2: Maven
cd ~/tutorial_apps/java/wordcount/ mvn clean package
Attachments (2)
- wordcount.tar.gz (4.3 MB) - added by trac 8 years ago.
- wordcount.tar.2.gz (4.3 MB) - added by trac 7 years ago.