11 000 100 011 101 111 110 000 000 000 045 Converted to 32 Bit Single Precision IEEE 754 Binary Floating Point Representation Standard

Convert decimal 11 000 100 011 101 111 110 000 000 000 045(10) to 32 bit single precision IEEE 754 binary floating point representation standard (1 bit for sign, 8 bits for exponent, 23 bits for mantissa)

What are the steps to convert decimal number
11 000 100 011 101 111 110 000 000 000 045(10) to 32 bit single precision IEEE 754 binary floating point representation (1 bit for sign, 8 bits for exponent, 23 bits for mantissa)

1. Divide the number repeatedly by 2.

Keep track of each remainder.

We stop when we get a quotient that is equal to zero.


  • division = quotient + remainder;
  • 11 000 100 011 101 111 110 000 000 000 045 ÷ 2 = 5 500 050 005 550 555 555 000 000 000 022 + 1;
  • 5 500 050 005 550 555 555 000 000 000 022 ÷ 2 = 2 750 025 002 775 277 777 500 000 000 011 + 0;
  • 2 750 025 002 775 277 777 500 000 000 011 ÷ 2 = 1 375 012 501 387 638 888 750 000 000 005 + 1;
  • 1 375 012 501 387 638 888 750 000 000 005 ÷ 2 = 687 506 250 693 819 444 375 000 000 002 + 1;
  • 687 506 250 693 819 444 375 000 000 002 ÷ 2 = 343 753 125 346 909 722 187 500 000 001 + 0;
  • 343 753 125 346 909 722 187 500 000 001 ÷ 2 = 171 876 562 673 454 861 093 750 000 000 + 1;
  • 171 876 562 673 454 861 093 750 000 000 ÷ 2 = 85 938 281 336 727 430 546 875 000 000 + 0;
  • 85 938 281 336 727 430 546 875 000 000 ÷ 2 = 42 969 140 668 363 715 273 437 500 000 + 0;
  • 42 969 140 668 363 715 273 437 500 000 ÷ 2 = 21 484 570 334 181 857 636 718 750 000 + 0;
  • 21 484 570 334 181 857 636 718 750 000 ÷ 2 = 10 742 285 167 090 928 818 359 375 000 + 0;
  • 10 742 285 167 090 928 818 359 375 000 ÷ 2 = 5 371 142 583 545 464 409 179 687 500 + 0;
  • 5 371 142 583 545 464 409 179 687 500 ÷ 2 = 2 685 571 291 772 732 204 589 843 750 + 0;
  • 2 685 571 291 772 732 204 589 843 750 ÷ 2 = 1 342 785 645 886 366 102 294 921 875 + 0;
  • 1 342 785 645 886 366 102 294 921 875 ÷ 2 = 671 392 822 943 183 051 147 460 937 + 1;
  • 671 392 822 943 183 051 147 460 937 ÷ 2 = 335 696 411 471 591 525 573 730 468 + 1;
  • 335 696 411 471 591 525 573 730 468 ÷ 2 = 167 848 205 735 795 762 786 865 234 + 0;
  • 167 848 205 735 795 762 786 865 234 ÷ 2 = 83 924 102 867 897 881 393 432 617 + 0;
  • 83 924 102 867 897 881 393 432 617 ÷ 2 = 41 962 051 433 948 940 696 716 308 + 1;
  • 41 962 051 433 948 940 696 716 308 ÷ 2 = 20 981 025 716 974 470 348 358 154 + 0;
  • 20 981 025 716 974 470 348 358 154 ÷ 2 = 10 490 512 858 487 235 174 179 077 + 0;
  • 10 490 512 858 487 235 174 179 077 ÷ 2 = 5 245 256 429 243 617 587 089 538 + 1;
  • 5 245 256 429 243 617 587 089 538 ÷ 2 = 2 622 628 214 621 808 793 544 769 + 0;
  • 2 622 628 214 621 808 793 544 769 ÷ 2 = 1 311 314 107 310 904 396 772 384 + 1;
  • 1 311 314 107 310 904 396 772 384 ÷ 2 = 655 657 053 655 452 198 386 192 + 0;
  • 655 657 053 655 452 198 386 192 ÷ 2 = 327 828 526 827 726 099 193 096 + 0;
  • 327 828 526 827 726 099 193 096 ÷ 2 = 163 914 263 413 863 049 596 548 + 0;
  • 163 914 263 413 863 049 596 548 ÷ 2 = 81 957 131 706 931 524 798 274 + 0;
  • 81 957 131 706 931 524 798 274 ÷ 2 = 40 978 565 853 465 762 399 137 + 0;
  • 40 978 565 853 465 762 399 137 ÷ 2 = 20 489 282 926 732 881 199 568 + 1;
  • 20 489 282 926 732 881 199 568 ÷ 2 = 10 244 641 463 366 440 599 784 + 0;
  • 10 244 641 463 366 440 599 784 ÷ 2 = 5 122 320 731 683 220 299 892 + 0;
  • 5 122 320 731 683 220 299 892 ÷ 2 = 2 561 160 365 841 610 149 946 + 0;
  • 2 561 160 365 841 610 149 946 ÷ 2 = 1 280 580 182 920 805 074 973 + 0;
  • 1 280 580 182 920 805 074 973 ÷ 2 = 640 290 091 460 402 537 486 + 1;
  • 640 290 091 460 402 537 486 ÷ 2 = 320 145 045 730 201 268 743 + 0;
  • 320 145 045 730 201 268 743 ÷ 2 = 160 072 522 865 100 634 371 + 1;
  • 160 072 522 865 100 634 371 ÷ 2 = 80 036 261 432 550 317 185 + 1;
  • 80 036 261 432 550 317 185 ÷ 2 = 40 018 130 716 275 158 592 + 1;
  • 40 018 130 716 275 158 592 ÷ 2 = 20 009 065 358 137 579 296 + 0;
  • 20 009 065 358 137 579 296 ÷ 2 = 10 004 532 679 068 789 648 + 0;
  • 10 004 532 679 068 789 648 ÷ 2 = 5 002 266 339 534 394 824 + 0;
  • 5 002 266 339 534 394 824 ÷ 2 = 2 501 133 169 767 197 412 + 0;
  • 2 501 133 169 767 197 412 ÷ 2 = 1 250 566 584 883 598 706 + 0;
  • 1 250 566 584 883 598 706 ÷ 2 = 625 283 292 441 799 353 + 0;
  • 625 283 292 441 799 353 ÷ 2 = 312 641 646 220 899 676 + 1;
  • 312 641 646 220 899 676 ÷ 2 = 156 320 823 110 449 838 + 0;
  • 156 320 823 110 449 838 ÷ 2 = 78 160 411 555 224 919 + 0;
  • 78 160 411 555 224 919 ÷ 2 = 39 080 205 777 612 459 + 1;
  • 39 080 205 777 612 459 ÷ 2 = 19 540 102 888 806 229 + 1;
  • 19 540 102 888 806 229 ÷ 2 = 9 770 051 444 403 114 + 1;
  • 9 770 051 444 403 114 ÷ 2 = 4 885 025 722 201 557 + 0;
  • 4 885 025 722 201 557 ÷ 2 = 2 442 512 861 100 778 + 1;
  • 2 442 512 861 100 778 ÷ 2 = 1 221 256 430 550 389 + 0;
  • 1 221 256 430 550 389 ÷ 2 = 610 628 215 275 194 + 1;
  • 610 628 215 275 194 ÷ 2 = 305 314 107 637 597 + 0;
  • 305 314 107 637 597 ÷ 2 = 152 657 053 818 798 + 1;
  • 152 657 053 818 798 ÷ 2 = 76 328 526 909 399 + 0;
  • 76 328 526 909 399 ÷ 2 = 38 164 263 454 699 + 1;
  • 38 164 263 454 699 ÷ 2 = 19 082 131 727 349 + 1;
  • 19 082 131 727 349 ÷ 2 = 9 541 065 863 674 + 1;
  • 9 541 065 863 674 ÷ 2 = 4 770 532 931 837 + 0;
  • 4 770 532 931 837 ÷ 2 = 2 385 266 465 918 + 1;
  • 2 385 266 465 918 ÷ 2 = 1 192 633 232 959 + 0;
  • 1 192 633 232 959 ÷ 2 = 596 316 616 479 + 1;
  • 596 316 616 479 ÷ 2 = 298 158 308 239 + 1;
  • 298 158 308 239 ÷ 2 = 149 079 154 119 + 1;
  • 149 079 154 119 ÷ 2 = 74 539 577 059 + 1;
  • 74 539 577 059 ÷ 2 = 37 269 788 529 + 1;
  • 37 269 788 529 ÷ 2 = 18 634 894 264 + 1;
  • 18 634 894 264 ÷ 2 = 9 317 447 132 + 0;
  • 9 317 447 132 ÷ 2 = 4 658 723 566 + 0;
  • 4 658 723 566 ÷ 2 = 2 329 361 783 + 0;
  • 2 329 361 783 ÷ 2 = 1 164 680 891 + 1;
  • 1 164 680 891 ÷ 2 = 582 340 445 + 1;
  • 582 340 445 ÷ 2 = 291 170 222 + 1;
  • 291 170 222 ÷ 2 = 145 585 111 + 0;
  • 145 585 111 ÷ 2 = 72 792 555 + 1;
  • 72 792 555 ÷ 2 = 36 396 277 + 1;
  • 36 396 277 ÷ 2 = 18 198 138 + 1;
  • 18 198 138 ÷ 2 = 9 099 069 + 0;
  • 9 099 069 ÷ 2 = 4 549 534 + 1;
  • 4 549 534 ÷ 2 = 2 274 767 + 0;
  • 2 274 767 ÷ 2 = 1 137 383 + 1;
  • 1 137 383 ÷ 2 = 568 691 + 1;
  • 568 691 ÷ 2 = 284 345 + 1;
  • 284 345 ÷ 2 = 142 172 + 1;
  • 142 172 ÷ 2 = 71 086 + 0;
  • 71 086 ÷ 2 = 35 543 + 0;
  • 35 543 ÷ 2 = 17 771 + 1;
  • 17 771 ÷ 2 = 8 885 + 1;
  • 8 885 ÷ 2 = 4 442 + 1;
  • 4 442 ÷ 2 = 2 221 + 0;
  • 2 221 ÷ 2 = 1 110 + 1;
  • 1 110 ÷ 2 = 555 + 0;
  • 555 ÷ 2 = 277 + 1;
  • 277 ÷ 2 = 138 + 1;
  • 138 ÷ 2 = 69 + 0;
  • 69 ÷ 2 = 34 + 1;
  • 34 ÷ 2 = 17 + 0;
  • 17 ÷ 2 = 8 + 1;
  • 8 ÷ 2 = 4 + 0;
  • 4 ÷ 2 = 2 + 0;
  • 2 ÷ 2 = 1 + 0;
  • 1 ÷ 2 = 0 + 1;

2. Construct the base 2 representation of the positive number.

Take all the remainders starting from the bottom of the list constructed above.

11 000 100 011 101 111 110 000 000 000 045(10) =


1000 1010 1101 0111 0011 1101 0111 0111 0001 1111 1010 1110 1010 1011 1001 0000 0011 1010 0001 0000 0101 0010 0110 0000 0010 1101(2)


3. Normalize the binary representation of the number.

Shift the decimal mark 103 positions to the left, so that only one non zero digit remains to the left of it:


11 000 100 011 101 111 110 000 000 000 045(10) =


1000 1010 1101 0111 0011 1101 0111 0111 0001 1111 1010 1110 1010 1011 1001 0000 0011 1010 0001 0000 0101 0010 0110 0000 0010 1101(2) =


1000 1010 1101 0111 0011 1101 0111 0111 0001 1111 1010 1110 1010 1011 1001 0000 0011 1010 0001 0000 0101 0010 0110 0000 0010 1101(2) × 20 =


1.0001 0101 1010 1110 0111 1010 1110 1110 0011 1111 0101 1101 0101 0111 0010 0000 0111 0100 0010 0000 1010 0100 1100 0000 0101 101(2) × 2103


4. Up to this moment, there are the following elements that would feed into the 32 bit single precision IEEE 754 binary floating point representation:

Sign 0 (a positive number)


Exponent (unadjusted): 103


Mantissa (not normalized):
1.0001 0101 1010 1110 0111 1010 1110 1110 0011 1111 0101 1101 0101 0111 0010 0000 0111 0100 0010 0000 1010 0100 1100 0000 0101 101


5. Adjust the exponent.

Use the 8 bit excess/bias notation:


Exponent (adjusted) =


Exponent (unadjusted) + 2(8-1) - 1 =


103 + 2(8-1) - 1 =


(103 + 127)(10) =


230(10)


6. Convert the adjusted exponent from the decimal (base 10) to 8 bit binary.

Use the same technique of repeatedly dividing by 2:


  • division = quotient + remainder;
  • 230 ÷ 2 = 115 + 0;
  • 115 ÷ 2 = 57 + 1;
  • 57 ÷ 2 = 28 + 1;
  • 28 ÷ 2 = 14 + 0;
  • 14 ÷ 2 = 7 + 0;
  • 7 ÷ 2 = 3 + 1;
  • 3 ÷ 2 = 1 + 1;
  • 1 ÷ 2 = 0 + 1;

7. Construct the base 2 representation of the adjusted exponent.

Take all the remainders starting from the bottom of the list constructed above.


Exponent (adjusted) =


230(10) =


1110 0110(2)


8. Normalize the mantissa.

a) Remove the leading (the leftmost) bit, since it's allways 1, and the decimal point, if the case.


b) Adjust its length to 23 bits, by removing the excess bits, from the right (if any of the excess bits is set on 1, we are losing precision...).


Mantissa (normalized) =


1. 000 1010 1101 0111 0011 1101 0111 0111 0001 1111 1010 1110 1010 1011 1001 0000 0011 1010 0001 0000 0101 0010 0110 0000 0010 1101 =


000 1010 1101 0111 0011 1101


9. The three elements that make up the number's 32 bit single precision IEEE 754 binary floating point representation:

Sign (1 bit) =
0 (a positive number)


Exponent (8 bits) =
1110 0110


Mantissa (23 bits) =
000 1010 1101 0111 0011 1101


Decimal number 11 000 100 011 101 111 110 000 000 000 045 converted to 32 bit single precision IEEE 754 binary floating point representation:

0 - 1110 0110 - 000 1010 1101 0111 0011 1101


How to convert decimal numbers from base ten to 32 bit single precision IEEE 754 binary floating point standard

Follow the steps below to convert a base 10 decimal number to 32 bit single precision IEEE 754 binary floating point:

  • 1. If the number to be converted is negative, start with its the positive version.
  • 2. First convert the integer part. Divide repeatedly by 2 the base ten positive representation of the integer number that is to be converted to binary, until we get a quotient that is equal to zero, keeping track of each remainder.
  • 3. Construct the base 2 representation of the positive integer part of the number, by taking all the remainders of the previous dividing operations, starting from the bottom of the list constructed above. Thus, the last remainder of the divisions becomes the first symbol (the leftmost) of the base two number, while the first remainder becomes the last symbol (the rightmost).
  • 4. Then convert the fractional part. Multiply the number repeatedly by 2, until we get a fractional part that is equal to zero, keeping track of each integer part of the results.
  • 5. Construct the base 2 representation of the fractional part of the number by taking all the integer parts of the previous multiplying operations, starting from the top of the constructed list above (they should appear in the binary representation, from left to right, in the order they have been calculated).
  • 6. Normalize the binary representation of the number, by shifting the decimal point (or if you prefer, the decimal mark) "n" positions either to the left or to the right, so that only one non zero digit remains to the left of the decimal point.
  • 7. Adjust the exponent in 8 bit excess/bias notation and then convert it from decimal (base 10) to 8 bit binary, by using the same technique of repeatedly dividing by 2, as shown above:
    Exponent (adjusted) = Exponent (unadjusted) + 2(8-1) - 1
  • 8. Normalize mantissa, remove the leading (leftmost) bit, since it's allways '1' (and the decimal sign if the case) and adjust its length to 23 bits, either by removing the excess bits from the right (losing precision...) or by adding extra '0' bits to the right.
  • 9. Sign (it takes 1 bit) is either 1 for a negative or 0 for a positive number.

Example: convert the negative number -25.347 from decimal system (base ten) to 32 bit single precision IEEE 754 binary floating point:

  • 1. Start with the positive version of the number:

    |-25.347| = 25.347

  • 2. First convert the integer part, 25. Divide it repeatedly by 2, keeping track of each remainder, until we get a quotient that is equal to zero:
    • division = quotient + remainder;
    • 25 ÷ 2 = 12 + 1;
    • 12 ÷ 2 = 6 + 0;
    • 6 ÷ 2 = 3 + 0;
    • 3 ÷ 2 = 1 + 1;
    • 1 ÷ 2 = 0 + 1;
    • We have encountered a quotient that is ZERO => FULL STOP
  • 3. Construct the base 2 representation of the integer part of the number by taking all the remainders of the previous dividing operations, starting from the bottom of the list constructed above:

    25(10) = 1 1001(2)

  • 4. Then convert the fractional part, 0.347. Multiply repeatedly by 2, keeping track of each integer part of the results, until we get a fractional part that is equal to zero:
    • #) multiplying = integer + fractional part;
    • 1) 0.347 × 2 = 0 + 0.694;
    • 2) 0.694 × 2 = 1 + 0.388;
    • 3) 0.388 × 2 = 0 + 0.776;
    • 4) 0.776 × 2 = 1 + 0.552;
    • 5) 0.552 × 2 = 1 + 0.104;
    • 6) 0.104 × 2 = 0 + 0.208;
    • 7) 0.208 × 2 = 0 + 0.416;
    • 8) 0.416 × 2 = 0 + 0.832;
    • 9) 0.832 × 2 = 1 + 0.664;
    • 10) 0.664 × 2 = 1 + 0.328;
    • 11) 0.328 × 2 = 0 + 0.656;
    • 12) 0.656 × 2 = 1 + 0.312;
    • 13) 0.312 × 2 = 0 + 0.624;
    • 14) 0.624 × 2 = 1 + 0.248;
    • 15) 0.248 × 2 = 0 + 0.496;
    • 16) 0.496 × 2 = 0 + 0.992;
    • 17) 0.992 × 2 = 1 + 0.984;
    • 18) 0.984 × 2 = 1 + 0.968;
    • 19) 0.968 × 2 = 1 + 0.936;
    • 20) 0.936 × 2 = 1 + 0.872;
    • 21) 0.872 × 2 = 1 + 0.744;
    • 22) 0.744 × 2 = 1 + 0.488;
    • 23) 0.488 × 2 = 0 + 0.976;
    • 24) 0.976 × 2 = 1 + 0.952;
    • We didn't get any fractional part that was equal to zero. But we had enough iterations (over Mantissa limit = 23) and at least one integer part that was different from zero => FULL STOP (losing precision...).
  • 5. Construct the base 2 representation of the fractional part of the number, by taking all the integer parts of the previous multiplying operations, starting from the top of the constructed list above:

    0.347(10) = 0.0101 1000 1101 0100 1111 1101(2)

  • 6. Summarizing - the positive number before normalization:

    25.347(10) = 1 1001.0101 1000 1101 0100 1111 1101(2)

  • 7. Normalize the binary representation of the number, shifting the decimal point 4 positions to the left so that only one non-zero digit stays to the left of the decimal point:

    25.347(10) =
    1 1001.0101 1000 1101 0100 1111 1101(2) =
    1 1001.0101 1000 1101 0100 1111 1101(2) × 20 =
    1.1001 0101 1000 1101 0100 1111 1101(2) × 24

  • 8. Up to this moment, there are the following elements that would feed into the 32 bit single precision IEEE 754 binary floating point:

    Sign: 1 (a negative number)

    Exponent (unadjusted): 4

    Mantissa (not-normalized): 1.1001 0101 1000 1101 0100 1111 1101

  • 9. Adjust the exponent in 8 bit excess/bias notation and then convert it from decimal (base 10) to 8 bit binary (base 2), by using the same technique of repeatedly dividing it by 2, as already demonstrated above:

    Exponent (adjusted) = Exponent (unadjusted) + 2(8-1) - 1 = (4 + 127)(10) = 131(10) =
    1000 0011(2)

  • 10. Normalize the mantissa, remove the leading (leftmost) bit, since it's allways '1' (and the decimal point) and adjust its length to 23 bits, by removing the excess bits from the right (losing precision...):

    Mantissa (not-normalized): 1.1001 0101 1000 1101 0100 1111 1101

    Mantissa (normalized): 100 1010 1100 0110 1010 0111

  • Conclusion:

    Sign (1 bit) = 1 (a negative number)

    Exponent (8 bits) = 1000 0011

    Mantissa (23 bits) = 100 1010 1100 0110 1010 0111

  • Number -25.347, converted from the decimal system (base 10) to 32 bit single precision IEEE 754 binary floating point =
    1 - 1000 0011 - 100 1010 1100 0110 1010 0111