This floating point unit is conform to IEEE 754-2008 standards. Supported operations are compare, min-max, conversions, addition, subtruction, multiplication, fused multiply add, square root and division in single precisions. Except square root and division all operations are pipelined.
This unit uses canonical nan (not a number) form, if it generates any nan as output. E.g. 0x7FC00000. Therefore extra conditions are added in order compare outputs with testfloat data correctly by nan generation.
Square root and division calculations are using same subunit but different path (algorithm). This subunit has a generic variable performance. For functional iterations which are fast please use 1 and fixed point iterations which are slow use 0. This unit has also own multiplier.
This floating point unit transform the single precision to the pseudo extended precision. The main benefit of this implementation is that the design needs few resources because we do not implement extension in pipeline to handle subnormal numbers. It means that all floating numbers are normalized thanks to pseudo extended precision.
sign | exponent | mantissa | |
---|---|---|---|
single | 1 | 8 | 23 |
pseudo | 1 | 9 | 23 |
comp | max | conv | add | sub | mul | fma |
---|---|---|---|---|---|---|
1 | 1 | 1 | 3 | 3 | 3 | 3 |
performance | division | square root |
---|---|---|
0 | 29 | 28 |
1 | 12 | 14 |
op | type |
---|---|
fmadd | |
fmsub | |
fnmsub | |
fnmadd | |
fadd | |
fsub | |
fmul | |
fdiv | |
fsqrt |
op | rm | type |
---|---|---|
fsgnj | "000" | J |
fsgnj | "001" | JN |
fsgnj | "010" | JX |
op | rm | type |
---|---|---|
fcmp | "000" | EQ |
fcmp | "001" | LT |
fcmp | "010" | LE |
op | rm | type |
---|---|---|
fmax | "000" | MIN |
fmax | "001" | MAX |
op | op.fcvt_op | fmt | type |
---|---|---|---|
fcvt_f2i | "00" | "00" | CONVERSION FROM FLOAT TO INT32 |
fcvt_f2i | "01" | "00" | CONVERSION FROM FLOAT TO UINT32 |
op | op.fcvt_op | fmt | type |
---|---|---|---|
fcvt_i2f | "00" | "00" | CONVERSION FROM INT32 TO FLOAT |
fcvt_i2f | "01" | "00" | CONVERSION FROM UINT32 TO FLOAT |
fmt | type |
---|---|
"00" | FLOAT |
rm | type |
---|---|
"000" | RNE |
"001" | RTZ |
"010" | RDN |
"011" | RUP |
"100" | RMM |
flags | type |
---|---|
"00001" | NX |
"00010" | UF |
"00100" | OF |
"01000" | DZ |
"10000" | NV |
The installation scripts of necessary tools are located in directory tools. These scripts need root permission in order to install packages and tools for simulation and testcase generation.
make tool
To generate test cases you could use following command:
make generate
To simulate the design together with generated test cases you could run following command:
make simulate
This command requires two options for hardware description languages VERILOG and VHDL. The possible settings of these options can be found in the makefile.
Some example executions of this command look like as follows:
make simulate VERILOG=1
make simulate VHDL=1