fftw3: Usage of Multi-threaded FFTW
5.2 Usage of Multi-threaded FFTW
================================
Here, it is assumed that the reader is already familiar with the usage
of the uniprocessor FFTW routines, described elsewhere in this manual.
We only describe what one has to change in order to use the
multi-threaded routines.
First, programs using the parallel complex transforms should be
linked with '-lfftw3_threads -lfftw3 -lm' on Unix, or '-lfftw3_omp
-lfftw3 -lm' if you compiled with OpenMP. You will also need to link
with whatever library is responsible for threads on your system (e.g.
'-lpthread' on GNU/Linux) or include whatever compiler flag enables
OpenMP (e.g. '-fopenmp' with gcc).
Second, before calling _any_ FFTW routines, you should call the
function:
int fftw_init_threads(void);
This function, which need only be called once, performs any one-time
initialization required to use threads on your system. It returns zero
if there was some error (which should not happen under normal
circumstances) and a non-zero value otherwise.
Third, before creating a plan that you want to parallelize, you
should call:
void fftw_plan_with_nthreads(int nthreads);
The 'nthreads' argument indicates the number of threads you want FFTW
to use (or actually, the maximum number). All plans subsequently
created with any planner routine will use that many threads. You can
call 'fftw_plan_with_nthreads', create some plans, call
'fftw_plan_with_nthreads' again with a different argument, and create
some more plans for a new number of threads. Plans already created
before a call to 'fftw_plan_with_nthreads' are unaffected. If you pass
an 'nthreads' argument of '1' (the default), threads are disabled for
subsequent plans.
With OpenMP, to configure FFTW to use all of the currently running
OpenMP threads (set by 'omp_set_num_threads(nthreads)' or by the
'OMP_NUM_THREADS' environment variable), you can do:
'fftw_plan_with_nthreads(omp_get_max_threads())'. (The 'omp_' OpenMP
functions are declared via '#include <omp.h>'.)
Given a plan, you then execute it as usual with 'fftw_execute(plan)',
and the execution will use the number of threads specified when the plan
was created. When done, you destroy it as usual with
'fftw_destroy_plan'. As described in Thread safety, plan
_execution_ is thread-safe, but plan creation and destruction are _not_:
you should create/destroy plans only from a single thread, but can
safely execute multiple plans in parallel.
There is one additional routine: if you want to get rid of all memory
and other resources allocated internally by FFTW, you can call:
void fftw_cleanup_threads(void);
which is much like the 'fftw_cleanup()' function except that it also
gets rid of threads-related data. You must _not_ execute any previously
created plans after calling this function.
We should also mention one other restriction: if you save wisdom from
a program using the multi-threaded FFTW, that wisdom _cannot be used_ by
a program using only the single-threaded FFTW (i.e. not calling
'fftw_init_threads'). Words of Wisdom-Saving Plans.