11 001 110 000 000 000 000 000 000 001 264 Converted to 32 Bit Single Precision IEEE 754 Binary Floating Point Representation Standard

Convert decimal 11 001 110 000 000 000 000 000 000 001 264(10) to 32 bit single precision IEEE 754 binary floating point representation standard (1 bit for sign, 8 bits for exponent, 23 bits for mantissa)

What are the steps to convert decimal number
11 001 110 000 000 000 000 000 000 001 264(10) to 32 bit single precision IEEE 754 binary floating point representation (1 bit for sign, 8 bits for exponent, 23 bits for mantissa)

1. Divide the number repeatedly by 2.

Keep track of each remainder.

We stop when we get a quotient that is equal to zero.


  • division = quotient + remainder;
  • 11 001 110 000 000 000 000 000 000 001 264 ÷ 2 = 5 500 555 000 000 000 000 000 000 000 632 + 0;
  • 5 500 555 000 000 000 000 000 000 000 632 ÷ 2 = 2 750 277 500 000 000 000 000 000 000 316 + 0;
  • 2 750 277 500 000 000 000 000 000 000 316 ÷ 2 = 1 375 138 750 000 000 000 000 000 000 158 + 0;
  • 1 375 138 750 000 000 000 000 000 000 158 ÷ 2 = 687 569 375 000 000 000 000 000 000 079 + 0;
  • 687 569 375 000 000 000 000 000 000 079 ÷ 2 = 343 784 687 500 000 000 000 000 000 039 + 1;
  • 343 784 687 500 000 000 000 000 000 039 ÷ 2 = 171 892 343 750 000 000 000 000 000 019 + 1;
  • 171 892 343 750 000 000 000 000 000 019 ÷ 2 = 85 946 171 875 000 000 000 000 000 009 + 1;
  • 85 946 171 875 000 000 000 000 000 009 ÷ 2 = 42 973 085 937 500 000 000 000 000 004 + 1;
  • 42 973 085 937 500 000 000 000 000 004 ÷ 2 = 21 486 542 968 750 000 000 000 000 002 + 0;
  • 21 486 542 968 750 000 000 000 000 002 ÷ 2 = 10 743 271 484 375 000 000 000 000 001 + 0;
  • 10 743 271 484 375 000 000 000 000 001 ÷ 2 = 5 371 635 742 187 500 000 000 000 000 + 1;
  • 5 371 635 742 187 500 000 000 000 000 ÷ 2 = 2 685 817 871 093 750 000 000 000 000 + 0;
  • 2 685 817 871 093 750 000 000 000 000 ÷ 2 = 1 342 908 935 546 875 000 000 000 000 + 0;
  • 1 342 908 935 546 875 000 000 000 000 ÷ 2 = 671 454 467 773 437 500 000 000 000 + 0;
  • 671 454 467 773 437 500 000 000 000 ÷ 2 = 335 727 233 886 718 750 000 000 000 + 0;
  • 335 727 233 886 718 750 000 000 000 ÷ 2 = 167 863 616 943 359 375 000 000 000 + 0;
  • 167 863 616 943 359 375 000 000 000 ÷ 2 = 83 931 808 471 679 687 500 000 000 + 0;
  • 83 931 808 471 679 687 500 000 000 ÷ 2 = 41 965 904 235 839 843 750 000 000 + 0;
  • 41 965 904 235 839 843 750 000 000 ÷ 2 = 20 982 952 117 919 921 875 000 000 + 0;
  • 20 982 952 117 919 921 875 000 000 ÷ 2 = 10 491 476 058 959 960 937 500 000 + 0;
  • 10 491 476 058 959 960 937 500 000 ÷ 2 = 5 245 738 029 479 980 468 750 000 + 0;
  • 5 245 738 029 479 980 468 750 000 ÷ 2 = 2 622 869 014 739 990 234 375 000 + 0;
  • 2 622 869 014 739 990 234 375 000 ÷ 2 = 1 311 434 507 369 995 117 187 500 + 0;
  • 1 311 434 507 369 995 117 187 500 ÷ 2 = 655 717 253 684 997 558 593 750 + 0;
  • 655 717 253 684 997 558 593 750 ÷ 2 = 327 858 626 842 498 779 296 875 + 0;
  • 327 858 626 842 498 779 296 875 ÷ 2 = 163 929 313 421 249 389 648 437 + 1;
  • 163 929 313 421 249 389 648 437 ÷ 2 = 81 964 656 710 624 694 824 218 + 1;
  • 81 964 656 710 624 694 824 218 ÷ 2 = 40 982 328 355 312 347 412 109 + 0;
  • 40 982 328 355 312 347 412 109 ÷ 2 = 20 491 164 177 656 173 706 054 + 1;
  • 20 491 164 177 656 173 706 054 ÷ 2 = 10 245 582 088 828 086 853 027 + 0;
  • 10 245 582 088 828 086 853 027 ÷ 2 = 5 122 791 044 414 043 426 513 + 1;
  • 5 122 791 044 414 043 426 513 ÷ 2 = 2 561 395 522 207 021 713 256 + 1;
  • 2 561 395 522 207 021 713 256 ÷ 2 = 1 280 697 761 103 510 856 628 + 0;
  • 1 280 697 761 103 510 856 628 ÷ 2 = 640 348 880 551 755 428 314 + 0;
  • 640 348 880 551 755 428 314 ÷ 2 = 320 174 440 275 877 714 157 + 0;
  • 320 174 440 275 877 714 157 ÷ 2 = 160 087 220 137 938 857 078 + 1;
  • 160 087 220 137 938 857 078 ÷ 2 = 80 043 610 068 969 428 539 + 0;
  • 80 043 610 068 969 428 539 ÷ 2 = 40 021 805 034 484 714 269 + 1;
  • 40 021 805 034 484 714 269 ÷ 2 = 20 010 902 517 242 357 134 + 1;
  • 20 010 902 517 242 357 134 ÷ 2 = 10 005 451 258 621 178 567 + 0;
  • 10 005 451 258 621 178 567 ÷ 2 = 5 002 725 629 310 589 283 + 1;
  • 5 002 725 629 310 589 283 ÷ 2 = 2 501 362 814 655 294 641 + 1;
  • 2 501 362 814 655 294 641 ÷ 2 = 1 250 681 407 327 647 320 + 1;
  • 1 250 681 407 327 647 320 ÷ 2 = 625 340 703 663 823 660 + 0;
  • 625 340 703 663 823 660 ÷ 2 = 312 670 351 831 911 830 + 0;
  • 312 670 351 831 911 830 ÷ 2 = 156 335 175 915 955 915 + 0;
  • 156 335 175 915 955 915 ÷ 2 = 78 167 587 957 977 957 + 1;
  • 78 167 587 957 977 957 ÷ 2 = 39 083 793 978 988 978 + 1;
  • 39 083 793 978 988 978 ÷ 2 = 19 541 896 989 494 489 + 0;
  • 19 541 896 989 494 489 ÷ 2 = 9 770 948 494 747 244 + 1;
  • 9 770 948 494 747 244 ÷ 2 = 4 885 474 247 373 622 + 0;
  • 4 885 474 247 373 622 ÷ 2 = 2 442 737 123 686 811 + 0;
  • 2 442 737 123 686 811 ÷ 2 = 1 221 368 561 843 405 + 1;
  • 1 221 368 561 843 405 ÷ 2 = 610 684 280 921 702 + 1;
  • 610 684 280 921 702 ÷ 2 = 305 342 140 460 851 + 0;
  • 305 342 140 460 851 ÷ 2 = 152 671 070 230 425 + 1;
  • 152 671 070 230 425 ÷ 2 = 76 335 535 115 212 + 1;
  • 76 335 535 115 212 ÷ 2 = 38 167 767 557 606 + 0;
  • 38 167 767 557 606 ÷ 2 = 19 083 883 778 803 + 0;
  • 19 083 883 778 803 ÷ 2 = 9 541 941 889 401 + 1;
  • 9 541 941 889 401 ÷ 2 = 4 770 970 944 700 + 1;
  • 4 770 970 944 700 ÷ 2 = 2 385 485 472 350 + 0;
  • 2 385 485 472 350 ÷ 2 = 1 192 742 736 175 + 0;
  • 1 192 742 736 175 ÷ 2 = 596 371 368 087 + 1;
  • 596 371 368 087 ÷ 2 = 298 185 684 043 + 1;
  • 298 185 684 043 ÷ 2 = 149 092 842 021 + 1;
  • 149 092 842 021 ÷ 2 = 74 546 421 010 + 1;
  • 74 546 421 010 ÷ 2 = 37 273 210 505 + 0;
  • 37 273 210 505 ÷ 2 = 18 636 605 252 + 1;
  • 18 636 605 252 ÷ 2 = 9 318 302 626 + 0;
  • 9 318 302 626 ÷ 2 = 4 659 151 313 + 0;
  • 4 659 151 313 ÷ 2 = 2 329 575 656 + 1;
  • 2 329 575 656 ÷ 2 = 1 164 787 828 + 0;
  • 1 164 787 828 ÷ 2 = 582 393 914 + 0;
  • 582 393 914 ÷ 2 = 291 196 957 + 0;
  • 291 196 957 ÷ 2 = 145 598 478 + 1;
  • 145 598 478 ÷ 2 = 72 799 239 + 0;
  • 72 799 239 ÷ 2 = 36 399 619 + 1;
  • 36 399 619 ÷ 2 = 18 199 809 + 1;
  • 18 199 809 ÷ 2 = 9 099 904 + 1;
  • 9 099 904 ÷ 2 = 4 549 952 + 0;
  • 4 549 952 ÷ 2 = 2 274 976 + 0;
  • 2 274 976 ÷ 2 = 1 137 488 + 0;
  • 1 137 488 ÷ 2 = 568 744 + 0;
  • 568 744 ÷ 2 = 284 372 + 0;
  • 284 372 ÷ 2 = 142 186 + 0;
  • 142 186 ÷ 2 = 71 093 + 0;
  • 71 093 ÷ 2 = 35 546 + 1;
  • 35 546 ÷ 2 = 17 773 + 0;
  • 17 773 ÷ 2 = 8 886 + 1;
  • 8 886 ÷ 2 = 4 443 + 0;
  • 4 443 ÷ 2 = 2 221 + 1;
  • 2 221 ÷ 2 = 1 110 + 1;
  • 1 110 ÷ 2 = 555 + 0;
  • 555 ÷ 2 = 277 + 1;
  • 277 ÷ 2 = 138 + 1;
  • 138 ÷ 2 = 69 + 0;
  • 69 ÷ 2 = 34 + 1;
  • 34 ÷ 2 = 17 + 0;
  • 17 ÷ 2 = 8 + 1;
  • 8 ÷ 2 = 4 + 0;
  • 4 ÷ 2 = 2 + 0;
  • 2 ÷ 2 = 1 + 0;
  • 1 ÷ 2 = 0 + 1;

2. Construct the base 2 representation of the positive number.

Take all the remainders starting from the bottom of the list constructed above.

11 001 110 000 000 000 000 000 000 001 264(10) =


1000 1010 1101 1010 1000 0000 1110 1000 1001 0111 1001 1001 1011 0010 1100 0111 0110 1000 1101 0110 0000 0000 0000 0100 1111 0000(2)


3. Normalize the binary representation of the number.

Shift the decimal mark 103 positions to the left, so that only one non zero digit remains to the left of it:


11 001 110 000 000 000 000 000 000 001 264(10) =


1000 1010 1101 1010 1000 0000 1110 1000 1001 0111 1001 1001 1011 0010 1100 0111 0110 1000 1101 0110 0000 0000 0000 0100 1111 0000(2) =


1000 1010 1101 1010 1000 0000 1110 1000 1001 0111 1001 1001 1011 0010 1100 0111 0110 1000 1101 0110 0000 0000 0000 0100 1111 0000(2) × 20 =


1.0001 0101 1011 0101 0000 0001 1101 0001 0010 1111 0011 0011 0110 0101 1000 1110 1101 0001 1010 1100 0000 0000 0000 1001 1110 000(2) × 2103


4. Up to this moment, there are the following elements that would feed into the 32 bit single precision IEEE 754 binary floating point representation:

Sign 0 (a positive number)


Exponent (unadjusted): 103


Mantissa (not normalized):
1.0001 0101 1011 0101 0000 0001 1101 0001 0010 1111 0011 0011 0110 0101 1000 1110 1101 0001 1010 1100 0000 0000 0000 1001 1110 000


5. Adjust the exponent.

Use the 8 bit excess/bias notation:


Exponent (adjusted) =


Exponent (unadjusted) + 2(8-1) - 1 =


103 + 2(8-1) - 1 =


(103 + 127)(10) =


230(10)


6. Convert the adjusted exponent from the decimal (base 10) to 8 bit binary.

Use the same technique of repeatedly dividing by 2:


  • division = quotient + remainder;
  • 230 ÷ 2 = 115 + 0;
  • 115 ÷ 2 = 57 + 1;
  • 57 ÷ 2 = 28 + 1;
  • 28 ÷ 2 = 14 + 0;
  • 14 ÷ 2 = 7 + 0;
  • 7 ÷ 2 = 3 + 1;
  • 3 ÷ 2 = 1 + 1;
  • 1 ÷ 2 = 0 + 1;

7. Construct the base 2 representation of the adjusted exponent.

Take all the remainders starting from the bottom of the list constructed above.


Exponent (adjusted) =


230(10) =


1110 0110(2)


8. Normalize the mantissa.

a) Remove the leading (the leftmost) bit, since it's allways 1, and the decimal point, if the case.


b) Adjust its length to 23 bits, by removing the excess bits, from the right (if any of the excess bits is set on 1, we are losing precision...).


Mantissa (normalized) =


1. 000 1010 1101 1010 1000 0000 1110 1000 1001 0111 1001 1001 1011 0010 1100 0111 0110 1000 1101 0110 0000 0000 0000 0100 1111 0000 =


000 1010 1101 1010 1000 0000


9. The three elements that make up the number's 32 bit single precision IEEE 754 binary floating point representation:

Sign (1 bit) =
0 (a positive number)


Exponent (8 bits) =
1110 0110


Mantissa (23 bits) =
000 1010 1101 1010 1000 0000


Decimal number 11 001 110 000 000 000 000 000 000 001 264 converted to 32 bit single precision IEEE 754 binary floating point representation:

0 - 1110 0110 - 000 1010 1101 1010 1000 0000


How to convert decimal numbers from base ten to 32 bit single precision IEEE 754 binary floating point standard

Follow the steps below to convert a base 10 decimal number to 32 bit single precision IEEE 754 binary floating point:

  • 1. If the number to be converted is negative, start with its the positive version.
  • 2. First convert the integer part. Divide repeatedly by 2 the base ten positive representation of the integer number that is to be converted to binary, until we get a quotient that is equal to zero, keeping track of each remainder.
  • 3. Construct the base 2 representation of the positive integer part of the number, by taking all the remainders of the previous dividing operations, starting from the bottom of the list constructed above. Thus, the last remainder of the divisions becomes the first symbol (the leftmost) of the base two number, while the first remainder becomes the last symbol (the rightmost).
  • 4. Then convert the fractional part. Multiply the number repeatedly by 2, until we get a fractional part that is equal to zero, keeping track of each integer part of the results.
  • 5. Construct the base 2 representation of the fractional part of the number by taking all the integer parts of the previous multiplying operations, starting from the top of the constructed list above (they should appear in the binary representation, from left to right, in the order they have been calculated).
  • 6. Normalize the binary representation of the number, by shifting the decimal point (or if you prefer, the decimal mark) "n" positions either to the left or to the right, so that only one non zero digit remains to the left of the decimal point.
  • 7. Adjust the exponent in 8 bit excess/bias notation and then convert it from decimal (base 10) to 8 bit binary, by using the same technique of repeatedly dividing by 2, as shown above:
    Exponent (adjusted) = Exponent (unadjusted) + 2(8-1) - 1
  • 8. Normalize mantissa, remove the leading (leftmost) bit, since it's allways '1' (and the decimal sign if the case) and adjust its length to 23 bits, either by removing the excess bits from the right (losing precision...) or by adding extra '0' bits to the right.
  • 9. Sign (it takes 1 bit) is either 1 for a negative or 0 for a positive number.

Example: convert the negative number -25.347 from decimal system (base ten) to 32 bit single precision IEEE 754 binary floating point:

  • 1. Start with the positive version of the number:

    |-25.347| = 25.347

  • 2. First convert the integer part, 25. Divide it repeatedly by 2, keeping track of each remainder, until we get a quotient that is equal to zero:
    • division = quotient + remainder;
    • 25 ÷ 2 = 12 + 1;
    • 12 ÷ 2 = 6 + 0;
    • 6 ÷ 2 = 3 + 0;
    • 3 ÷ 2 = 1 + 1;
    • 1 ÷ 2 = 0 + 1;
    • We have encountered a quotient that is ZERO => FULL STOP
  • 3. Construct the base 2 representation of the integer part of the number by taking all the remainders of the previous dividing operations, starting from the bottom of the list constructed above:

    25(10) = 1 1001(2)

  • 4. Then convert the fractional part, 0.347. Multiply repeatedly by 2, keeping track of each integer part of the results, until we get a fractional part that is equal to zero:
    • #) multiplying = integer + fractional part;
    • 1) 0.347 × 2 = 0 + 0.694;
    • 2) 0.694 × 2 = 1 + 0.388;
    • 3) 0.388 × 2 = 0 + 0.776;
    • 4) 0.776 × 2 = 1 + 0.552;
    • 5) 0.552 × 2 = 1 + 0.104;
    • 6) 0.104 × 2 = 0 + 0.208;
    • 7) 0.208 × 2 = 0 + 0.416;
    • 8) 0.416 × 2 = 0 + 0.832;
    • 9) 0.832 × 2 = 1 + 0.664;
    • 10) 0.664 × 2 = 1 + 0.328;
    • 11) 0.328 × 2 = 0 + 0.656;
    • 12) 0.656 × 2 = 1 + 0.312;
    • 13) 0.312 × 2 = 0 + 0.624;
    • 14) 0.624 × 2 = 1 + 0.248;
    • 15) 0.248 × 2 = 0 + 0.496;
    • 16) 0.496 × 2 = 0 + 0.992;
    • 17) 0.992 × 2 = 1 + 0.984;
    • 18) 0.984 × 2 = 1 + 0.968;
    • 19) 0.968 × 2 = 1 + 0.936;
    • 20) 0.936 × 2 = 1 + 0.872;
    • 21) 0.872 × 2 = 1 + 0.744;
    • 22) 0.744 × 2 = 1 + 0.488;
    • 23) 0.488 × 2 = 0 + 0.976;
    • 24) 0.976 × 2 = 1 + 0.952;
    • We didn't get any fractional part that was equal to zero. But we had enough iterations (over Mantissa limit = 23) and at least one integer part that was different from zero => FULL STOP (losing precision...).
  • 5. Construct the base 2 representation of the fractional part of the number, by taking all the integer parts of the previous multiplying operations, starting from the top of the constructed list above:

    0.347(10) = 0.0101 1000 1101 0100 1111 1101(2)

  • 6. Summarizing - the positive number before normalization:

    25.347(10) = 1 1001.0101 1000 1101 0100 1111 1101(2)

  • 7. Normalize the binary representation of the number, shifting the decimal point 4 positions to the left so that only one non-zero digit stays to the left of the decimal point:

    25.347(10) =
    1 1001.0101 1000 1101 0100 1111 1101(2) =
    1 1001.0101 1000 1101 0100 1111 1101(2) × 20 =
    1.1001 0101 1000 1101 0100 1111 1101(2) × 24

  • 8. Up to this moment, there are the following elements that would feed into the 32 bit single precision IEEE 754 binary floating point:

    Sign: 1 (a negative number)

    Exponent (unadjusted): 4

    Mantissa (not-normalized): 1.1001 0101 1000 1101 0100 1111 1101

  • 9. Adjust the exponent in 8 bit excess/bias notation and then convert it from decimal (base 10) to 8 bit binary (base 2), by using the same technique of repeatedly dividing it by 2, as already demonstrated above:

    Exponent (adjusted) = Exponent (unadjusted) + 2(8-1) - 1 = (4 + 127)(10) = 131(10) =
    1000 0011(2)

  • 10. Normalize the mantissa, remove the leading (leftmost) bit, since it's allways '1' (and the decimal point) and adjust its length to 23 bits, by removing the excess bits from the right (losing precision...):

    Mantissa (not-normalized): 1.1001 0101 1000 1101 0100 1111 1101

    Mantissa (normalized): 100 1010 1100 0110 1010 0111

  • Conclusion:

    Sign (1 bit) = 1 (a negative number)

    Exponent (8 bits) = 1000 0011

    Mantissa (23 bits) = 100 1010 1100 0110 1010 0111

  • Number -25.347, converted from the decimal system (base 10) to 32 bit single precision IEEE 754 binary floating point =
    1 - 1000 0011 - 100 1010 1100 0110 1010 0111