···11-KISS FFT - A mixed-radix Fast Fourier Transform based up on the principle,
22-"Keep It Simple, Stupid."
33-44- There are many great fft libraries already around. Kiss FFT is not trying
55-to be better than any of them. It only attempts to be a reasonably efficient,
66-moderately useful FFT that can use fixed or floating data types and can be
77-incorporated into someone's C program in a few minutes with trivial licensing.
88-99-USAGE:
1010-1111- The basic usage for 1-d complex FFT is:
1212-1313- #include "kiss_fft.h"
1414-1515- kiss_fft_cfg cfg = kiss_fft_alloc( nfft ,is_inverse_fft ,0,0 );
1616-1717- while ...
1818-1919- ... // put kth sample in cx_in[k].r and cx_in[k].i
2020-2121- kiss_fft( cfg , cx_in , cx_out );
2222-2323- ... // transformed. DC is in cx_out[0].r and cx_out[0].i
2424-2525- kiss_fft_free(cfg);
2626-2727- Note: frequency-domain data is stored from dc up to 2pi.
2828- so cx_out[0] is the dc bin of the FFT
2929- and cx_out[nfft/2] is the Nyquist bin (if exists)
3030-3131- Declarations are in "kiss_fft.h", along with a brief description of the
3232-functions you'll need to use.
3333-3434-Code definitions for 1d complex FFTs are in kiss_fft.c.
3535-3636-You can do other cool stuff with the extras you'll find in tools/
3737-3838- * multi-dimensional FFTs
3939- * real-optimized FFTs (returns the positive half-spectrum: (nfft/2+1) complex frequency bins)
4040- * fast convolution FIR filtering (not available for fixed point)
4141- * spectrum image creation
4242-4343-The core fft and most tools/ code can be compiled to use float, double,
4444- Q15 short or Q31 samples. The default is float.
4545-4646-BUILDING:
4747-4848-There are two functionally-equivalent build systems supported by kissfft:
4949-5050- * Make (traditional Makefiles for Unix / Linux systems)
5151- * CMake (more modern and feature-rich build system developed by Kitware)
5252-5353-To build kissfft, the following build environment can be used:
5454-5555- * GNU build environment with GCC, Clang and GNU Make or CMake (>= 3.6)
5656- * Microsoft Visual C++ (MSVC) with CMake (>= 3.6)
5757-5858-Additional libraries required to build and test kissfft include:
5959-6060- * libpng for psdpng tool,
6161- * libfftw3 to validate kissfft results against it,
6262- * python 2/3 with Numpy to validate kissfft results against it.
6363- * OpenMP supported by GCC, Clang or MSVC for multi-core FFT transformations
6464-6565-Environments like Cygwin and MinGW can be highly likely used to build kissfft
6666-targeting Windows platform, but no tests were performed to the date.
6767-6868-Both Make and CMake builds are easily configurable:
6969-7070- * 'KISSFFT_DATATYPE=<datatype>' (for Make) or '-DKISSFFT_DATATYPE=<datatype>'
7171- (for CMake) denote the principal datatype used by kissfft. It can be one
7272- of the following:
7373-7474- * float (default)
7575- * double
7676- * int16_t
7777- * int32_t
7878- * SIMD (requires SSE instruction set support on target CPU)
7979-8080- * 'KISSFFT_OPENMP=1' (for Make) or '-DKISSFFT_OPENMP=ON' (for CMake) builds kissfft
8181- with OpenMP support. Please note that a supported compiler is required and this
8282- option is turned off by default.
8383-8484- * 'KISSFFT_STATIC=1' (for Make) or '-DKISSFFT_STATIC=ON' (for CMake) instructs
8585- the builder to create static library ('.lib' for Windows / '.a' for Unix or Linux).
8686- By default, this option is turned off and the shared library is created
8787- ('.dll' for Windows, '.so' for Linux or Unix, '.dylib' for Mac OSX)
8888-8989- * '-DKISSFFT_TEST=OFF' (for CMake) disables building tests for kissfft. On Make,
9090- building tests is done separately by 'make testall' or 'make testsingle', so
9191- no specific setting is required.
9292-9393- * 'KISSFFT_TOOLS=0' (for Make) or '-DKISSFFT_TOOLS=OFF' (for CMake) builds kissfft
9494- without command-line tools like 'fastconv'. By default the tools are built.
9595-9696- * 'KISSFFT_USE_ALLOCA=1' (for Make) or '-DKISSFFT_USE_ALLOCA=ON' (for CMake)
9797- build kissfft with 'alloca' usage instead of 'malloc' / 'free'.
9898-9999- * "PREFIX=/full/path/to/installation/prefix/directory" (for Make) or
100100- "-DCMAKE_INSTALL_PREFIX=/full/path/to/installation/prefix/directory' (for CMake)
101101- specifies the prefix directory to install kissfft into.
102102-103103-For example, to build kissfft as a static library with 'int16_t' datatype and
104104-OpenMP support using Make, run the command from kissfft source tree:
105105-106106- make KISSFFT_DATATYPE=int16_t KISSFFT_STATIC=1 KISSFFT_OPENMP=1 all
107107-108108-The same configuration for CMake is:
109109-110110- mkdir build && cd build
111111- cmake -DKISSFFT_DATATYPE=int16_t -DKISSFFT_STATIC=ON -DKISSFFT_OPENMP=ON ..
112112- make all
113113-114114-To specify '/tmp/1234' as installation prefix directory, run:
115115-116116- make PREFIX=/tmp/1234 KISSFFT_DATATYPE=int16_t KISSFFT_STATIC=1 KISSFFT_OPENMP=1 install
117117-118118-or
119119-120120- mkdir build && cd build
121121- cmake -DCMAKE_INSTALL_PREFIX=/tmp/1234 -DKISSFFT_DATATYPE=int16_t -DKISSFFT_STATIC=ON -DKISSFFT_OPENMP=ON ..
122122- make all
123123- make install
124124-125125-TESTING:
126126-127127- To validate the build configured as an example above, run the following command from
128128-kissfft source tree:
129129-130130- make KISSFFT_DATATYPE=int16_t KISSFFT_STATIC=1 KISSFFT_OPENMP=1 testsingle
131131-132132-if using Make, or:
133133-134134- make test
135135-136136-if using CMake.
137137-138138-To test all possible build configurations, please run an extended testsuite from
139139-kissfft source tree:
140140-141141- sh test/kissfft-testsuite.sh
142142-143143-Please note that the extended testsuite takes around 20-40 minutes depending on device
144144-it runs on. This testsuite is useful for reporting bugs or testing the pull requests.
145145-146146-BACKGROUND:
147147-148148- I started coding this because I couldn't find a fixed point FFT that didn't
149149-use assembly code. I started with floating point numbers so I could get the
150150-theory straight before working on fixed point issues. In the end, I had a
151151-little bit of code that could be recompiled easily to do ffts with short, float
152152-or double (other types should be easy too).
153153-154154- Once I got my FFT working, I was curious about the speed compared to
155155-a well respected and highly optimized fft library. I don't want to criticize
156156-this great library, so let's call it FFT_BRANDX.
157157-During this process, I learned:
158158-159159- 1. FFT_BRANDX has more than 100K lines of code. The core of kiss_fft is about 500 lines (cpx 1-d).
160160- 2. It took me an embarrassingly long time to get FFT_BRANDX working.
161161- 3. A simple program using FFT_BRANDX is 522KB. A similar program using kiss_fft is 18KB (without optimizing for size).
162162- 4. FFT_BRANDX is roughly twice as fast as KISS FFT in default mode.
163163-164164- It is wonderful that free, highly optimized libraries like FFT_BRANDX exist.
165165-But such libraries carry a huge burden of complexity necessary to extract every
166166-last bit of performance.
167167-168168- Sometimes simpler is better, even if it's not better.
169169-170170-FREQUENTLY ASKED QUESTIONS:
171171- Q: Can I use kissfft in a project with a ___ license?
172172- A: Yes. See LICENSE below.
173173-174174- Q: Why don't I get the output I expect?
175175- A: The two most common causes of this are
176176- 1) scaling : is there a constant multiplier between what you got and what you want?
177177- 2) mixed build environment -- all code must be compiled with same preprocessor
178178- definitions for FIXED_POINT and kiss_fft_scalar
179179-180180- Q: Will you write/debug my code for me?
181181- A: Probably not unless you pay me. I am happy to answer pointed and topical questions, but
182182- I may refer you to a book, a forum, or some other resource.
183183-184184-185185-PERFORMANCE:
186186- (on Athlon XP 2100+, with gcc 2.96, float data type)
187187-188188- Kiss performed 10000 1024-pt cpx ffts in .63 s of cpu time.
189189- For comparison, it took md5sum twice as long to process the same amount of data.
190190-191191- Transforming 5 minutes of CD quality audio takes less than a second (nfft=1024).
192192-193193-DO NOT:
194194- ... use Kiss if you need the Fastest Fourier Transform in the World
195195- ... ask me to add features that will bloat the code
196196-197197-UNDER THE HOOD:
198198-199199- Kiss FFT uses a time decimation, mixed-radix, out-of-place FFT. If you give it an input buffer
200200- and output buffer that are the same, a temporary buffer will be created to hold the data.
201201-202202- No static data is used. The core routines of kiss_fft are thread-safe (but not all of the tools directory).
203203-204204- No scaling is done for the floating point version (for speed).
205205- Scaling is done both ways for the fixed-point version (for overflow prevention).
206206-207207- Optimized butterflies are used for factors 2,3,4, and 5.
208208-209209- The real (i.e. not complex) optimization code only works for even length ffts. It does two half-length
210210- FFTs in parallel (packed into real&imag), and then combines them via twiddling. The result is
211211- nfft/2+1 complex frequency bins from DC to Nyquist. If you don't know what this means, search the web.
212212-213213- The fast convolution filtering uses the overlap-scrap method, slightly
214214- modified to put the scrap at the tail.
215215-216216-LICENSE:
217217- Revised BSD License, see COPYING for verbiage.
218218- Basically, "free to use&change, give credit where due, no guarantees"
219219- Note this license is compatible with GPL at one end of the spectrum and closed, commercial software at
220220- the other end. See http://www.fsf.org/licensing/licenses
221221-222222-TODO:
223223- *) Add real optimization for odd length FFTs
224224- *) Document/revisit the input/output fft scaling
225225- *) Make doc describing the overlap (tail) scrap fast convolution filtering in kiss_fastfir.c
226226- *) Test all the ./tools/ code with fixed point (kiss_fastfir.c doesn't work, maybe others)
227227-228228-AUTHOR:
229229- Mark Borgerding
230230- Mark@Borgerding.net