wiki:application_pages/tutorial_apps/java/wordcount_sequential

Application Name

Wordcount MN

Summary

  • Name: Wordcount MN
  • Contact Person: support-compss@bsc.es
  • Access Level: public
  • License Agreement: GPL
  • Platform: COMPSs
  • Repository: Wordcount MN

Description

Wordcount application

Versions

There are two versions of Wordcount, depending on how the input data is given

Version 1

Single input file, where all the text is given in the same file and the chunks are calculated with a BLOCK_SIZE parameter

Version 2

Multiple input files, where the text fragments are already in different files under the same directory

Execution instructions

Usage: runcompss wordcount.multipleFiles.Wordcount <DATA_FOLDER> runcompss wordcount.uniqueFile.Wordcount <DATA_FILE> <BLOCK_SIZE>

where:

  • - DATA_FOLDER: Absolute path to the base folder of the dataset files
  • - DATA_FILE: Absolute path to the dabase file
  • - BLOCK_SIZE: Number of bytes of each block

Execution Example

runcompss wordcount.multipleFiles.Wordcount ~/multipleDataset/ runcompss wordcount.uniqueFile.Wordcount ~/singleDataset/data.txt 100

Build

Option 1: Native java

cd ~/tutorial_apps/java/wordcount_sequential/; javac src/main/java/wordcount/multipleFiles/*.java; javac src/main/java/wordcount/uniqueFile/*.java; cd src/main/java/; jar cf wordcount.jar wordcount/ cd ../../../; mv src/main/java/wordcount.jar jar/

Option 2: Maven

cd ~/tutorial_apps/java/wordcount/ mvn clean package

Last modified 7 years ago Last modified on 11/08/16 15:21:56