A benchmarking tool for gpu.js

NOTE: gpu.js needs to be installed separately.

#### Using yarn
`sh
yarn add @gpujs/benchmark
`
#### Using npm
`sh
npm install @gpujs/benchmark
`
##### NOTE: If it asks for a GPU.js version, you can choose any version (>= v2.0.0), but the provided dist files are built with the GPU.js version that was the latest when that version of benchmark was released.
#### Building
NOTE: The latest dist files have not been included since v2.1.0 due to problems with browserify (https://github.com/gpujs/benchmark/issues/7). This issue will be addressed as soon as possible.
NOTE: The dist files are also included in the npm module and GitHub repository; skip this step if you are not running a modified script locally.
We use browserify and minify to build the distributable files dist/benchmark.js and dist/benchmark.min.js.
After running the setup script, run the following command:
`sh
yarn build
`
#### Using
Include the benchmark dist file in the HTML file.
`html
`
or, from the npm module
`html
`
The exported function is benchmark.
`js
const out = benchmark(options)
`
##### NOTE: options is an Object. See the Options section.
#### Javascript
1. Import Benchmark.
`js
const benchmark = require('@gpujs/benchmark')
`
##### OR using ES6 syntax
`js
import benchmark from '@gpujs/benchmark'
`
2. Run it.
`js
const benchmarks = benchmark.benchmark(options)
`
OR run Multiple Benchmarks
`js
const benchmarks = benchmark.multipleBenchmark(options)
`
This returns the benchmarks in an Object. See the Output section.
##### NOTE: options is an Object. See the Options section.
#### CLI
1. Clone the repository and open the directory.
`sh
git clone https://github.com/gpujs/benchmark
cd benchmark
`
2. Install yarn
We use yarn as our package manager, so it has to be installed as well. (If you already have yarn installed, skip this step.)
`sh
npm install -g yarn
`
3. Install the dependencies
`sh
yarn setup
`
##### NOTE: If it asks for a GPU.js version, you can choose any version (>= v2.0.0), but the provided dist files are built with the GPU.js version that was the latest when the latest version of benchmark was released.
4. Run the tool in the CLI
`sh
yarn start
`
##### OR using node
`sh
node ./index.js
`
This will prompt you to enter the options, all of which are optional (see the Options section).
#### Using CLI with JSON Options as Input
`sh
yarn start options
`
options is a stringified JSON object passed as an argument.
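
When the options already live in a script, the argument can be generated instead of typed by hand. The following is a minimal sketch, not part of the tool itself: the variable names are illustrative, the option keys are taken from the Options section, and the single-quote wrapping assumes a POSIX-style shell.

```javascript
// Illustrative options object; the keys come from the Options section.
const options = { num_iterations: 4, matrix_size: 256 };

// JSON.stringify produces the stringified JSON object the CLI takes as its argument.
const arg = JSON.stringify(options);

// Assumption: a POSIX-style shell, where single quotes protect the double quotes.
const command = `yarn start '${arg}'`;

console.log(command);
```

The printed line can be pasted into a terminal as is.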
##### OR using node
`sh
node ./index.js options
`
Here, options is a stringified JSON object.
##### Example
`sh
yarn start '{"num_iterations": 4}'
`
#### Options
The following options can be passed on to the benchmark or multipleBenchmark method.
1. benchmark options:
- cpu(Object) \*: A custom GPU({mode: 'cpu'}) Object to benchmark specific versions of GPU.js (>= v2.0.0). Mandatory everywhere except the CLI.
- gpu(Object) \*: A custom GPU() Object to benchmark specific versions of GPU.js (>= v2.0.0). (default: the version shipped with benchmark) Mandatory everywhere except the CLI.
- matrix_size(Integer): The size of the uniform matrix used for benchmarking. (default: 512)
- num_iterations(Integer): The number of iterations of run time calculation. (default: 1)
- logs(Boolean): Toggles console logs by the library.
- cpu_benchmark(Boolean): Toggles the benchmarking of the CPU. false is recommended for big matrix sizes. (default: true)
2. multipleBenchmark options:
Multiple Benchmark options have the following structure.
`js
{
  common_options: { // options common to all benchmarks; can be overridden in range or in full_options, preference given to range
    cpu_benchmark: false,
    cpu: new GPU({mode: 'cpu'}),
    gpu: new GPU()
  },
  range: { // only one of this and full_options works
    option_name: 'matrix_size',
    interval: [128, 1024],
    step: 100 // (default 10) (A.P.: 128, 228, 328, ...) one of step or common_ratio can be used, preference given to step
    // common_ratio: 2 (G.P.: 128, 256, 512, 1024)
  },
  full_options: [
    {
      // array of options objects, one for each benchmark (only one of this and range works, preference given to range)
    }
  ]
}
`
- common_options(Object): Options common to all the benchmarks that are run. (Same as benchmark options.)
- range(Object): Used to create a set of options for each benchmark from a set of rules. (Only one of range or full_options can be used.)
  - option_name(String): The option to which the range is applied. The option has to be of type Number. It can be any one of the benchmark options.
  - interval(Array): The lower and upper limits for the option.
  - step(Number): The common difference between consecutive option values. All the values will be in an A.P. (Only one of step or common_ratio can be used; preference is given to step.)
  - common_ratio(Number): The common ratio between consecutive option values. All the values will be in a G.P. (Only one of step or common_ratio can be used; preference is given to step.)
- full_options(Array): An array of options objects, each one corresponding to one benchmark. Each object is the same as benchmark options. (Only one of range or full_options can be used.)
#### Multiple Benchmarks in CLI
`sh
yarn start --multiple [options?]
`
options to the CLI are stored in a stringified JSON object passed as an argument.
See the Multiple Benchmarks section for more details.
#### Saving Graphs as JSON
1. Plotly Style JSON
`sh
yarn start --multiple --returnPlotlyJSON
`
This will log plotly.js style JSON to the console, which stores the graph data for GPU score v/s matrix size of each benchmark.
`sh
yarn start --multiple --savePlotlyJSONToFile=path/to/file.json
`
This saves the plotly.js style JSON data for:
- GPU score v/s matrix size
- GPU matrix multiplication run time v/s matrix size
- CPU score v/s matrix size
- CPU matrix multiplication run time v/s matrix size
##### NOTE: If the CPU is not benchmarked, the CPU score and run time will have meaningless negative values, which are to be ignored.
##### NOTE: The filename need not have a .json extension.
2. Chartist Style JSON
`sh
yarn start --multiple --returnChartistJSON
`
This will log chartist.js style JSON to the console, which stores the graph data for GPU score v/s matrix size of each benchmark.
`sh
yarn start --multiple --saveChartistJSONToFile=path/to/file.json
`
This saves the chartist.js style JSON data for:
- GPU score v/s matrix size
- GPU matrix multiplication run time v/s matrix size
- CPU score v/s matrix size
- CPU matrix multiplication run time v/s matrix size
##### NOTE: If the CPU is not benchmarked, the CPU score and run time will have meaningless negative values, which are to be ignored.
##### NOTE: The filename need not have a .json extension.
##### NOTE: One or more of the above arguments for JSON output can be used with --multiple.
#### Multiple Benchmarks
Benchmark allows you to run a sequence of benchmarks, either each with its own custom options or each with a numeric option such as matrix size changed by a fixed amount.
`js
benchmark.multipleBenchmark(options);
`
Where options is an object with the following properties:
- common_options(Object): Options common to every benchmark in a sequence. (Default: {cpu_benchmark: false})
- range(Object): Defines a range of values for a numeric option, one value for each benchmark in the sequence, e.g. matrix_size: 512, 1024, 1536, ... or matrix_size: 512, 1024, 2048, ...
  Here, the specified option can either be incremented by a fixed number (common difference) or multiplied by a fixed number (common ratio).
  - option_name(String): The name of the option for which the range is to be set, e.g. matrix_size (Default: matrix_size)
  - interval(Array): An array with the lower and upper limits for the range, e.g. [512, 2048] (Default: [128, 1024])
  - step(Number): The fixed number to be added (common difference). (Default: 100)
  - common_ratio(Number): The fixed number to be multiplied by (common ratio). (Default: none)
###### NOTE: Only one of step and common_ratio can be used.
- full_options(Array): An array of objects specifying a separate set of options for each benchmark in the sequence (common_options properties can be overridden here). (Default: none)
###### NOTE: Only one of range and full_options can be used.
##### Examples
1. Range:
`js
benchmark.multipleBenchmark({
  common_options: {
    cpu_benchmark: false,
    logs: false
  },
  range: {
    option_name: 'matrix_size',
    interval: [128, 2048],
    common_ratio: 2
  }
})
`
The above code runs a separate benchmark for each of the matrix sizes 128, 256, 512, 1024 and 2048, which are in G.P.
2. full_options:
`js
benchmark.multipleBenchmark({
  common_options: {
    logs: false,
    cpu_benchmark: false
  },
  full_options: [
    {
      logs: true, // override
      matrix_size: 2048
    },
    {
      cpu_benchmark: true, // override
      matrix_size: 128
    }
  ]
})
`
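
The two examples above can be read as rules for building one options object per benchmark. The sketch below illustrates those rules in plain JavaScript. It is not the library's implementation: expandRange and mergeOptions are hypothetical helper names, and the inclusive upper limit is inferred from the range example, where 2048 is part of the sequence.

```javascript
// Hypothetical helper: lists the values a range object would produce.
// step takes precedence over common_ratio, as stated in the option notes.
function expandRange({ interval, step, common_ratio }) {
  const [low, high] = interval;
  const values = [];
  if (step !== undefined) {
    if (step <= 0) throw new RangeError('step must be positive');
    // Arithmetic progression: add the common difference each time.
    for (let v = low; v <= high; v += step) values.push(v);
  } else if (common_ratio !== undefined) {
    if (common_ratio <= 1 || low <= 0) {
      throw new RangeError('common_ratio must be > 1 and the lower limit > 0');
    }
    // Geometric progression: multiply by the common ratio each time.
    for (let v = low; v <= high; v *= common_ratio) values.push(v);
  } else {
    throw new TypeError('either step or common_ratio is required');
  }
  return values;
}

// Hypothetical helper: each full_options entry overrides common_options.
function mergeOptions(common_options, full_options) {
  return full_options.map((own) => ({ ...common_options, ...own }));
}

console.log(expandRange({ interval: [128, 2048], common_ratio: 2 }));
console.log(mergeOptions(
  { logs: false, cpu_benchmark: false },
  [{ logs: true, matrix_size: 2048 }, { cpu_benchmark: true, matrix_size: 128 }]
));
```

The first call lists the matrix sizes from the range example; the second shows the effective options of the two benchmarks in the full_options example.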
#### Output
The output of any benchmark (multiple or single) is a BenchmarkOut Object.
#### Stats
The output contains a stats property which shows the overall stats of the benchmark:
- run_time: The run time stats
  - mat_mult, mat_conv, pipe(Object): These three objects contain the stats for each type of benchmark.
    - diff: Has a single property that contains performance comparison scores between CPU and GPU.
      - cpu_gpu:
        - min, max, avg: The minimum, maximum and average time taken stats
          - winner(gpu | cpu): The better performer among the two.
          - percentage(Number): By what percentage it is better.
- build_time: The build time stats
  - mat_mult, mat_conv: Build time stats for each benchmark.
    - diff: Same as the diff object in run_time except that it compares GPU v/s GPU (pipeline mode) in the property gpu_pipe. (P.S. Best Performer and Worst Performer are not included)
- overall: The overall stats
  - mat_mult, mat_conv: Overall stats for each benchmark
    - best_performer(gpu | cpu): The best overall performer.
    - worst_performer(gpu | cpu): The worst overall performer.
    - diff: Same as the diff object in run_time
- score: The score object is a property of the main output object.
  - gpu, cpu(Number): A score is a number representing the overall normalized average performance of the GPU or CPU. This score can be directly compared to other benchmarks or hardware.

TECHNICAL: The score is the floor of one-hundredth of the ratio of the total number of operations in matrix multiplication to the time taken for the operations.
- In the case of matrix multiplication, one single operation is taken to be the product of two array elements, and the total number of operations is taken to be the cube of one of the dimensions [for a square matrix].
#### BenchmarkOut
This object stores the output of Benchmark.
##### Properties
- mat_gen, mat_pad(Number): Matrix generation and matrix padding times in ms.
- build_time(Object):
  - mat_mult, mat_conv(Object)
    - gpu, pipe(Number): Compile times for GPU and GPU (pipeline mode) in ms for each benchmark.
- run_time(Object): Run times for each benchmark.
  - mat_mult, mat_conv, pipe(Object): Run times for each benchmark.
    - gpu, cpu(Object): GPU and CPU run times.
      - min, max, avg(Number): The minimum, maximum and average run times in ms.
      - deviation(Number): Percentage deviation of results from the average value.
- stats(Object): The statistics.
##### Methods
- getDataField(field, index = 0) (returns: *): Gets any one of the output fields (properties).
  - field(String): The name of the field.
  - index(Number): The index of the benchmark if multiple benchmarks are run.
- getPlotlyJSON(compare_fields), getChartistJSON(compare_fields) (returns: Array): Returns a Plotly or Chartist style JSON Object for charts. (Only for multiple benchmarks.)
  - compare_fields: An array of objects having two properties, x and y, representing the data to be plotted on their respective axes.
    - x, y(String): Can be one of:
      - matrix_size
      - gpu_score
      - cpu_score
      - gpu_run_time_mat_mult: GPU matrix multiplication run time
      - cpu_run_time_mat_mult: CPU matrix multiplication run time
      - gpu_run_time_mat_conv: GPU matrix convolution run time
      - cpu_run_time_mat_conv: CPU matrix convolution run time
      - pipe_run_time: GPU pipelining run time

Default value of the compare_fields argument for the getPlotlyJSON and getChartistJSON methods:
`js
[
  {
    x: 'matrix_size',
    y: 'gpu_run_time_mat_mult'
  },
  {
    x: 'matrix_size',
    y: 'pipe_run_time'
  },
  {
    x: 'matrix_size',
    y: 'gpu_score'
  }
]
`
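
To make the role of the x and y names concrete, here is a rough sketch of how one {x, y} pair could be turned into one chart series. It does not reproduce the actual output of getPlotlyJSON: the flat runs records, their made-up numbers and the toTraces helper are assumptions for illustration only. The x, y and name properties follow the usual shape of a plotly.js trace.

```javascript
// Hypothetical flat records, one per benchmark run. The field names reuse the
// compare_fields values; the numbers are placeholders, not measurements.
const runs = [
  { matrix_size: 128, gpu_run_time_mat_mult: 1.5, gpu_score: 140 },
  { matrix_size: 256, gpu_run_time_mat_mult: 4.0, gpu_score: 420 },
];

const compareFields = [
  { x: 'matrix_size', y: 'gpu_run_time_mat_mult' },
  { x: 'matrix_size', y: 'gpu_score' },
];

// Hypothetical helper: one trace per {x, y} pair, collecting the named
// field from every run in order.
function toTraces(records, fields) {
  return fields.map(({ x, y }) => ({
    x: records.map((r) => r[x]),
    y: records.map((r) => r[y]),
    name: `${y} vs ${x}`,
  }));
}

console.log(JSON.stringify(toTraces(runs, compareFields), null, 2));
```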
#### Benchmarks
##### Matrix Multiplication
This benchmark multiplies two randomly generated uniform-sized matrices and benchmarks the GPU and CPU against the time taken by each.
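
The score described in the Stats section is derived from this benchmark's matrix size and run time. A minimal sketch of that definition follows, assuming the time is measured in milliseconds like the reported run times; the function name score is illustrative and not part of the library.

```javascript
// Score per the Stats section: floor of one-hundredth of
// (number of operations / time taken).
// For an n x n multiplication the operation count is taken to be n^3.
// Assumption: runTimeMs is in milliseconds, like the reported run times.
function score(matrixSize, runTimeMs) {
  if (!(runTimeMs > 0)) throw new RangeError('run time must be positive');
  const operations = matrixSize ** 3;
  return Math.floor(operations / runTimeMs / 100);
}

console.log(score(512, 100)); // 13421
```

Under this reading, halving the run time at a fixed matrix size roughly doubles the score.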
GPU.js Kernel:
`js
function(a, b) {
  let sum = 0;
  for (let i = 0; i < this.output.x; i++) {
    sum += a[this.thread.y][i] * b[i][this.thread.x];
  }
  return sum;
}
`
##### Matrix Convolution
This benchmark convolves a 3x3 kernel over a randomly generated uniform-sized matrix.
The convolution kernel is
`
1 2 1
2 1 2
1 2 1
`
GPU.js Kernel:
`js
function (array, kernel) {
  let sum = 0;
  for (let i = 0; i < ${kernelX}; i++){
    for (let j = 0; j < ${kernelY}; j++){
      sum += kernel[j][i] * array[this.thread.y + j][this.thread.x + i];
    }
  }
  return sum;
}
`
Where kernelX and kernelY are the dimensions of the kernel.
##### Pipelining
GPU.js supports a feature called pipelining, and this benchmark measures it.
It runs four matrix multiplications in sequence, passing the output of each one as an input to the next. The benchmark is run on both the GPU and the CPU (without pipelining), and the times taken are compared.
When it is run on the GPU, the output of the previous multiplication is passed to the next call as a texture (a storage unit on the GPU) that stays on the GPU. This drastically reduces the time taken because the output need not be converted and transferred to the CPU and back.
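
For reference, the shape of the computation on the CPU side can be sketched in plain JavaScript. This is an illustration only: multiply and chain are hypothetical names, and reusing the same second matrix in every round is an assumption, since the text does not say which operands the benchmark feeds into each step.

```javascript
// Plain CPU multiplication of two square matrices, mirroring the loop
// structure of the matrix multiplication kernel.
function multiply(a, b) {
  const n = a.length;
  const out = [];
  for (let y = 0; y < n; y++) {
    const row = new Array(n);
    for (let x = 0; x < n; x++) {
      let sum = 0;
      for (let i = 0; i < n; i++) {
        sum += a[y][i] * b[i][x];
      }
      row[x] = sum;
    }
    out.push(row);
  }
  return out;
}

// Feed each product into the next multiplication, four rounds by default.
// Assumption: the same matrix b is the second operand in every round.
function chain(a, b, rounds = 4) {
  let result = a;
  for (let r = 0; r < rounds; r++) {
    result = multiply(result, b);
  }
  return result;
}

const identity = [[1, 0], [0, 1]];
const shear = [[1, 1], [0, 1]];
console.log(chain(identity, shear)); // [ [ 1, 4 ], [ 0, 1 ] ]
```

On the CPU every intermediate result is an ordinary array, so there is nothing to transfer between rounds. The GPU version gains its speed by keeping those intermediates on the device.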