A benchmarking and validation framework for the X-engine of FX telescope arrays.
Early development. Only targeting TCC and mwax-xGPU so far.
- `git clone --recursive` to obtain xGPU and TCC
- follow instructions to build xGPU and TCC
- ensure `LD_LIBRARY_PATH` points to the shared libraries
  - `tensor-core-correlator/libtcc/libtcc.so`
  - `tensor-core-correlator/external/cuda-wrappers/libcu.so`
  - `mwax-xGPU/src/libxgpu.so`
- `make`

Software | Minimum version |
---|---|
CUDA | 10.0 or later |
CMake | 3.17 or later |
gcc | 9.3 or later |
OS | Linux distro (amd64) |

- complex normally distributed input values
- xGPU
  - input reorder to xGPU
  - execution of xGPU
  - output reorder xGPU to MWAX
- TCC
  - float to half conversion
  - input reorder to TCC
  - execution of TCC
  - output reorder xGPU to MWAX
- Serial
- benchmarking analysis
- output validation
- result precision comparison (TCC requires downconversion to FP16)
- change tcc reorder kernel to support > 1024 channels

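
The reference path listed above (complex normally distributed input, a serial correlator, output validation) could look roughly like the sketch below. It is not the framework's code: `make_input`, `correlate_serial`, and `max_abs_diff` are hypothetical names, and the input layout, triangular output ordering, and conjugation convention are assumptions made for illustration.

```cpp
#include <algorithm>
#include <cassert>
#include <complex>
#include <cstddef>
#include <random>
#include <vector>

using cfloat = std::complex<float>;

// Hypothetical helper: fill a buffer with complex samples whose real and
// imaginary parts are drawn independently from N(0, 1).
std::vector<cfloat> make_input(std::size_t n, unsigned seed) {
    std::mt19937 rng(seed);
    std::normal_distribution<float> dist(0.0f, 1.0f);
    std::vector<cfloat> x(n);
    for (auto& v : x) {
        float re = dist(rng);
        float im = dist(rng);
        v = cfloat(re, im);
    }
    return x;
}

// Hypothetical serial X-engine for a single frequency channel.
// Assumed input layout: x[t * n_ant + a].
// Assumed output layout: lower triangle including autocorrelations,
// vis[i * (i + 1) / 2 + j] for j <= i, accumulated over all time samples.
std::vector<cfloat> correlate_serial(const std::vector<cfloat>& x,
                                     std::size_t n_ant, std::size_t n_time) {
    assert(x.size() == n_ant * n_time);
    std::vector<cfloat> vis(n_ant * (n_ant + 1) / 2, cfloat(0.0f, 0.0f));
    for (std::size_t t = 0; t < n_time; ++t)
        for (std::size_t i = 0; i < n_ant; ++i)
            for (std::size_t j = 0; j <= i; ++j)
                vis[i * (i + 1) / 2 + j] +=
                    x[t * n_ant + i] * std::conj(x[t * n_ant + j]);
    return vis;
}

// Largest absolute difference between two results of equal length,
// usable as a simple validation metric.
float max_abs_diff(const std::vector<cfloat>& a, const std::vector<cfloat>& b) {
    assert(a.size() == b.size());
    float worst = 0.0f;
    for (std::size_t k = 0; k < a.size(); ++k)
        worst = std::max(worst, std::abs(a[k] - b[k]));
    return worst;
}
```

A GPU result reordered into the same layout could then be checked with `max_abs_diff(gpu_vis, correlate_serial(x, n_ant, n_time))`. Because TCC works on FP16 input, its tolerance would need to be looser than the one used for xGPU.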
- use `-Wl` and `-rpath` in Makefile so user doesn't have to manually change `LD_LIBRARY_PATH`
- generalise interface
- change main.cu to main.c/main.cpp
- pass arrays as void* and let user typecast to custom complex format
- store size of input/output type in parameters
- compile flags for AMD/CUDA
- save output to disk to compare accuracy and performance across AMD/CUDA
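
The interface items above (arrays passed as `void*`, element sizes stored in the parameters) might be sketched as follows. Everything here is hypothetical: `CorrelatorParams`, `ComplexF32`, and `passthrough_backend` are illustrative names, not part of the current code, and the backend is a stand-in that only copies bytes.

```cpp
#include <cassert>
#include <cstddef>
#include <cstring>

// Hypothetical parameter block; field names are illustrative only.
struct CorrelatorParams {
    std::size_t n_input;           // number of input elements
    std::size_t n_output;          // number of output elements
    std::size_t input_elem_size;   // bytes per input element
    std::size_t output_elem_size;  // bytes per output element
};

// Buffer sizes follow from the element counts and the stored type sizes.
inline std::size_t input_bytes(const CorrelatorParams& p) {
    return p.n_input * p.input_elem_size;
}
inline std::size_t output_bytes(const CorrelatorParams& p) {
    return p.n_output * p.output_elem_size;
}

// Example of a user-defined complex format (a pair of FP32 values).
struct ComplexF32 {
    float re;
    float im;
};

// Stand-in for a backend call (assumption: a real backend would launch
// xGPU or TCC here). It treats both buffers as opaque bytes and copies
// as many bytes as fit, leaving the typecast to the caller.
void passthrough_backend(const CorrelatorParams& p, const void* in, void* out) {
    assert(in != nullptr && out != nullptr);
    std::size_t n = input_bytes(p) < output_bytes(p) ? input_bytes(p)
                                                     : output_bytes(p);
    std::memcpy(out, in, n);
}
```

The caller keeps ownership of the typed buffers and casts the output back to its own complex format after the call, so the framework itself never needs to know the element type.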