1 000 010 001 010 010 000 110 101 009 100 Converted to 32 Bit Single Precision IEEE 754 Binary Floating Point Representation Standard

Convert decimal 1 000 010 001 010 010 000 110 101 009 100(10) to 32 bit single precision IEEE 754 binary floating point representation standard (1 bit for sign, 8 bits for exponent, 23 bits for mantissa)

What are the steps to convert decimal number
1 000 010 001 010 010 000 110 101 009 100(10) to 32 bit single precision IEEE 754 binary floating point representation (1 bit for sign, 8 bits for exponent, 23 bits for mantissa)

1. Divide the number repeatedly by 2.

Keep track of each remainder.

We stop when we get a quotient that is equal to zero.


  • division = quotient + remainder;
  • 1 000 010 001 010 010 000 110 101 009 100 ÷ 2 = 500 005 000 505 005 000 055 050 504 550 + 0;
  • 500 005 000 505 005 000 055 050 504 550 ÷ 2 = 250 002 500 252 502 500 027 525 252 275 + 0;
  • 250 002 500 252 502 500 027 525 252 275 ÷ 2 = 125 001 250 126 251 250 013 762 626 137 + 1;
  • 125 001 250 126 251 250 013 762 626 137 ÷ 2 = 62 500 625 063 125 625 006 881 313 068 + 1;
  • 62 500 625 063 125 625 006 881 313 068 ÷ 2 = 31 250 312 531 562 812 503 440 656 534 + 0;
  • 31 250 312 531 562 812 503 440 656 534 ÷ 2 = 15 625 156 265 781 406 251 720 328 267 + 0;
  • 15 625 156 265 781 406 251 720 328 267 ÷ 2 = 7 812 578 132 890 703 125 860 164 133 + 1;
  • 7 812 578 132 890 703 125 860 164 133 ÷ 2 = 3 906 289 066 445 351 562 930 082 066 + 1;
  • 3 906 289 066 445 351 562 930 082 066 ÷ 2 = 1 953 144 533 222 675 781 465 041 033 + 0;
  • 1 953 144 533 222 675 781 465 041 033 ÷ 2 = 976 572 266 611 337 890 732 520 516 + 1;
  • 976 572 266 611 337 890 732 520 516 ÷ 2 = 488 286 133 305 668 945 366 260 258 + 0;
  • 488 286 133 305 668 945 366 260 258 ÷ 2 = 244 143 066 652 834 472 683 130 129 + 0;
  • 244 143 066 652 834 472 683 130 129 ÷ 2 = 122 071 533 326 417 236 341 565 064 + 1;
  • 122 071 533 326 417 236 341 565 064 ÷ 2 = 61 035 766 663 208 618 170 782 532 + 0;
  • 61 035 766 663 208 618 170 782 532 ÷ 2 = 30 517 883 331 604 309 085 391 266 + 0;
  • 30 517 883 331 604 309 085 391 266 ÷ 2 = 15 258 941 665 802 154 542 695 633 + 0;
  • 15 258 941 665 802 154 542 695 633 ÷ 2 = 7 629 470 832 901 077 271 347 816 + 1;
  • 7 629 470 832 901 077 271 347 816 ÷ 2 = 3 814 735 416 450 538 635 673 908 + 0;
  • 3 814 735 416 450 538 635 673 908 ÷ 2 = 1 907 367 708 225 269 317 836 954 + 0;
  • 1 907 367 708 225 269 317 836 954 ÷ 2 = 953 683 854 112 634 658 918 477 + 0;
  • 953 683 854 112 634 658 918 477 ÷ 2 = 476 841 927 056 317 329 459 238 + 1;
  • 476 841 927 056 317 329 459 238 ÷ 2 = 238 420 963 528 158 664 729 619 + 0;
  • 238 420 963 528 158 664 729 619 ÷ 2 = 119 210 481 764 079 332 364 809 + 1;
  • 119 210 481 764 079 332 364 809 ÷ 2 = 59 605 240 882 039 666 182 404 + 1;
  • 59 605 240 882 039 666 182 404 ÷ 2 = 29 802 620 441 019 833 091 202 + 0;
  • 29 802 620 441 019 833 091 202 ÷ 2 = 14 901 310 220 509 916 545 601 + 0;
  • 14 901 310 220 509 916 545 601 ÷ 2 = 7 450 655 110 254 958 272 800 + 1;
  • 7 450 655 110 254 958 272 800 ÷ 2 = 3 725 327 555 127 479 136 400 + 0;
  • 3 725 327 555 127 479 136 400 ÷ 2 = 1 862 663 777 563 739 568 200 + 0;
  • 1 862 663 777 563 739 568 200 ÷ 2 = 931 331 888 781 869 784 100 + 0;
  • 931 331 888 781 869 784 100 ÷ 2 = 465 665 944 390 934 892 050 + 0;
  • 465 665 944 390 934 892 050 ÷ 2 = 232 832 972 195 467 446 025 + 0;
  • 232 832 972 195 467 446 025 ÷ 2 = 116 416 486 097 733 723 012 + 1;
  • 116 416 486 097 733 723 012 ÷ 2 = 58 208 243 048 866 861 506 + 0;
  • 58 208 243 048 866 861 506 ÷ 2 = 29 104 121 524 433 430 753 + 0;
  • 29 104 121 524 433 430 753 ÷ 2 = 14 552 060 762 216 715 376 + 1;
  • 14 552 060 762 216 715 376 ÷ 2 = 7 276 030 381 108 357 688 + 0;
  • 7 276 030 381 108 357 688 ÷ 2 = 3 638 015 190 554 178 844 + 0;
  • 3 638 015 190 554 178 844 ÷ 2 = 1 819 007 595 277 089 422 + 0;
  • 1 819 007 595 277 089 422 ÷ 2 = 909 503 797 638 544 711 + 0;
  • 909 503 797 638 544 711 ÷ 2 = 454 751 898 819 272 355 + 1;
  • 454 751 898 819 272 355 ÷ 2 = 227 375 949 409 636 177 + 1;
  • 227 375 949 409 636 177 ÷ 2 = 113 687 974 704 818 088 + 1;
  • 113 687 974 704 818 088 ÷ 2 = 56 843 987 352 409 044 + 0;
  • 56 843 987 352 409 044 ÷ 2 = 28 421 993 676 204 522 + 0;
  • 28 421 993 676 204 522 ÷ 2 = 14 210 996 838 102 261 + 0;
  • 14 210 996 838 102 261 ÷ 2 = 7 105 498 419 051 130 + 1;
  • 7 105 498 419 051 130 ÷ 2 = 3 552 749 209 525 565 + 0;
  • 3 552 749 209 525 565 ÷ 2 = 1 776 374 604 762 782 + 1;
  • 1 776 374 604 762 782 ÷ 2 = 888 187 302 381 391 + 0;
  • 888 187 302 381 391 ÷ 2 = 444 093 651 190 695 + 1;
  • 444 093 651 190 695 ÷ 2 = 222 046 825 595 347 + 1;
  • 222 046 825 595 347 ÷ 2 = 111 023 412 797 673 + 1;
  • 111 023 412 797 673 ÷ 2 = 55 511 706 398 836 + 1;
  • 55 511 706 398 836 ÷ 2 = 27 755 853 199 418 + 0;
  • 27 755 853 199 418 ÷ 2 = 13 877 926 599 709 + 0;
  • 13 877 926 599 709 ÷ 2 = 6 938 963 299 854 + 1;
  • 6 938 963 299 854 ÷ 2 = 3 469 481 649 927 + 0;
  • 3 469 481 649 927 ÷ 2 = 1 734 740 824 963 + 1;
  • 1 734 740 824 963 ÷ 2 = 867 370 412 481 + 1;
  • 867 370 412 481 ÷ 2 = 433 685 206 240 + 1;
  • 433 685 206 240 ÷ 2 = 216 842 603 120 + 0;
  • 216 842 603 120 ÷ 2 = 108 421 301 560 + 0;
  • 108 421 301 560 ÷ 2 = 54 210 650 780 + 0;
  • 54 210 650 780 ÷ 2 = 27 105 325 390 + 0;
  • 27 105 325 390 ÷ 2 = 13 552 662 695 + 0;
  • 13 552 662 695 ÷ 2 = 6 776 331 347 + 1;
  • 6 776 331 347 ÷ 2 = 3 388 165 673 + 1;
  • 3 388 165 673 ÷ 2 = 1 694 082 836 + 1;
  • 1 694 082 836 ÷ 2 = 847 041 418 + 0;
  • 847 041 418 ÷ 2 = 423 520 709 + 0;
  • 423 520 709 ÷ 2 = 211 760 354 + 1;
  • 211 760 354 ÷ 2 = 105 880 177 + 0;
  • 105 880 177 ÷ 2 = 52 940 088 + 1;
  • 52 940 088 ÷ 2 = 26 470 044 + 0;
  • 26 470 044 ÷ 2 = 13 235 022 + 0;
  • 13 235 022 ÷ 2 = 6 617 511 + 0;
  • 6 617 511 ÷ 2 = 3 308 755 + 1;
  • 3 308 755 ÷ 2 = 1 654 377 + 1;
  • 1 654 377 ÷ 2 = 827 188 + 1;
  • 827 188 ÷ 2 = 413 594 + 0;
  • 413 594 ÷ 2 = 206 797 + 0;
  • 206 797 ÷ 2 = 103 398 + 1;
  • 103 398 ÷ 2 = 51 699 + 0;
  • 51 699 ÷ 2 = 25 849 + 1;
  • 25 849 ÷ 2 = 12 924 + 1;
  • 12 924 ÷ 2 = 6 462 + 0;
  • 6 462 ÷ 2 = 3 231 + 0;
  • 3 231 ÷ 2 = 1 615 + 1;
  • 1 615 ÷ 2 = 807 + 1;
  • 807 ÷ 2 = 403 + 1;
  • 403 ÷ 2 = 201 + 1;
  • 201 ÷ 2 = 100 + 1;
  • 100 ÷ 2 = 50 + 0;
  • 50 ÷ 2 = 25 + 0;
  • 25 ÷ 2 = 12 + 1;
  • 12 ÷ 2 = 6 + 0;
  • 6 ÷ 2 = 3 + 0;
  • 3 ÷ 2 = 1 + 1;
  • 1 ÷ 2 = 0 + 1;

2. Construct the base 2 representation of the positive number.

Take all the remainders starting from the bottom of the list constructed above.

1 000 010 001 010 010 000 110 101 009 100(10) =


1100 1001 1111 0011 0100 1110 0010 1001 1100 0001 1101 0011 1101 0100 0111 0000 1001 0000 0100 1101 0001 0001 0010 1100 1100(2)


3. Normalize the binary representation of the number.

Shift the decimal mark 99 positions to the left, so that only one non zero digit remains to the left of it:


1 000 010 001 010 010 000 110 101 009 100(10) =


1100 1001 1111 0011 0100 1110 0010 1001 1100 0001 1101 0011 1101 0100 0111 0000 1001 0000 0100 1101 0001 0001 0010 1100 1100(2) =


1100 1001 1111 0011 0100 1110 0010 1001 1100 0001 1101 0011 1101 0100 0111 0000 1001 0000 0100 1101 0001 0001 0010 1100 1100(2) × 20 =


1.1001 0011 1110 0110 1001 1100 0101 0011 1000 0011 1010 0111 1010 1000 1110 0001 0010 0000 1001 1010 0010 0010 0101 1001 100(2) × 299


4. Up to this moment, there are the following elements that would feed into the 32 bit single precision IEEE 754 binary floating point representation:

Sign 0 (a positive number)


Exponent (unadjusted): 99


Mantissa (not normalized):
1.1001 0011 1110 0110 1001 1100 0101 0011 1000 0011 1010 0111 1010 1000 1110 0001 0010 0000 1001 1010 0010 0010 0101 1001 100


5. Adjust the exponent.

Use the 8 bit excess/bias notation:


Exponent (adjusted) =


Exponent (unadjusted) + 2(8-1) - 1 =


99 + 2(8-1) - 1 =


(99 + 127)(10) =


226(10)


6. Convert the adjusted exponent from the decimal (base 10) to 8 bit binary.

Use the same technique of repeatedly dividing by 2:


  • division = quotient + remainder;
  • 226 ÷ 2 = 113 + 0;
  • 113 ÷ 2 = 56 + 1;
  • 56 ÷ 2 = 28 + 0;
  • 28 ÷ 2 = 14 + 0;
  • 14 ÷ 2 = 7 + 0;
  • 7 ÷ 2 = 3 + 1;
  • 3 ÷ 2 = 1 + 1;
  • 1 ÷ 2 = 0 + 1;

7. Construct the base 2 representation of the adjusted exponent.

Take all the remainders starting from the bottom of the list constructed above.


Exponent (adjusted) =


226(10) =


1110 0010(2)


8. Normalize the mantissa.

a) Remove the leading (the leftmost) bit, since it's allways 1, and the decimal point, if the case.


b) Adjust its length to 23 bits, by removing the excess bits, from the right (if any of the excess bits is set on 1, we are losing precision...).


Mantissa (normalized) =


1. 100 1001 1111 0011 0100 1110 0010 1001 1100 0001 1101 0011 1101 0100 0111 0000 1001 0000 0100 1101 0001 0001 0010 1100 1100 =


100 1001 1111 0011 0100 1110


9. The three elements that make up the number's 32 bit single precision IEEE 754 binary floating point representation:

Sign (1 bit) =
0 (a positive number)


Exponent (8 bits) =
1110 0010


Mantissa (23 bits) =
100 1001 1111 0011 0100 1110


Decimal number 1 000 010 001 010 010 000 110 101 009 100 converted to 32 bit single precision IEEE 754 binary floating point representation:

0 - 1110 0010 - 100 1001 1111 0011 0100 1110


How to convert decimal numbers from base ten to 32 bit single precision IEEE 754 binary floating point standard

Follow the steps below to convert a base 10 decimal number to 32 bit single precision IEEE 754 binary floating point:

  • 1. If the number to be converted is negative, start with its the positive version.
  • 2. First convert the integer part. Divide repeatedly by 2 the base ten positive representation of the integer number that is to be converted to binary, until we get a quotient that is equal to zero, keeping track of each remainder.
  • 3. Construct the base 2 representation of the positive integer part of the number, by taking all the remainders of the previous dividing operations, starting from the bottom of the list constructed above. Thus, the last remainder of the divisions becomes the first symbol (the leftmost) of the base two number, while the first remainder becomes the last symbol (the rightmost).
  • 4. Then convert the fractional part. Multiply the number repeatedly by 2, until we get a fractional part that is equal to zero, keeping track of each integer part of the results.
  • 5. Construct the base 2 representation of the fractional part of the number by taking all the integer parts of the previous multiplying operations, starting from the top of the constructed list above (they should appear in the binary representation, from left to right, in the order they have been calculated).
  • 6. Normalize the binary representation of the number, by shifting the decimal point (or if you prefer, the decimal mark) "n" positions either to the left or to the right, so that only one non zero digit remains to the left of the decimal point.
  • 7. Adjust the exponent in 8 bit excess/bias notation and then convert it from decimal (base 10) to 8 bit binary, by using the same technique of repeatedly dividing by 2, as shown above:
    Exponent (adjusted) = Exponent (unadjusted) + 2(8-1) - 1
  • 8. Normalize mantissa, remove the leading (leftmost) bit, since it's allways '1' (and the decimal sign if the case) and adjust its length to 23 bits, either by removing the excess bits from the right (losing precision...) or by adding extra '0' bits to the right.
  • 9. Sign (it takes 1 bit) is either 1 for a negative or 0 for a positive number.

Example: convert the negative number -25.347 from decimal system (base ten) to 32 bit single precision IEEE 754 binary floating point:

  • 1. Start with the positive version of the number:

    |-25.347| = 25.347

  • 2. First convert the integer part, 25. Divide it repeatedly by 2, keeping track of each remainder, until we get a quotient that is equal to zero:
    • division = quotient + remainder;
    • 25 ÷ 2 = 12 + 1;
    • 12 ÷ 2 = 6 + 0;
    • 6 ÷ 2 = 3 + 0;
    • 3 ÷ 2 = 1 + 1;
    • 1 ÷ 2 = 0 + 1;
    • We have encountered a quotient that is ZERO => FULL STOP
  • 3. Construct the base 2 representation of the integer part of the number by taking all the remainders of the previous dividing operations, starting from the bottom of the list constructed above:

    25(10) = 1 1001(2)

  • 4. Then convert the fractional part, 0.347. Multiply repeatedly by 2, keeping track of each integer part of the results, until we get a fractional part that is equal to zero:
    • #) multiplying = integer + fractional part;
    • 1) 0.347 × 2 = 0 + 0.694;
    • 2) 0.694 × 2 = 1 + 0.388;
    • 3) 0.388 × 2 = 0 + 0.776;
    • 4) 0.776 × 2 = 1 + 0.552;
    • 5) 0.552 × 2 = 1 + 0.104;
    • 6) 0.104 × 2 = 0 + 0.208;
    • 7) 0.208 × 2 = 0 + 0.416;
    • 8) 0.416 × 2 = 0 + 0.832;
    • 9) 0.832 × 2 = 1 + 0.664;
    • 10) 0.664 × 2 = 1 + 0.328;
    • 11) 0.328 × 2 = 0 + 0.656;
    • 12) 0.656 × 2 = 1 + 0.312;
    • 13) 0.312 × 2 = 0 + 0.624;
    • 14) 0.624 × 2 = 1 + 0.248;
    • 15) 0.248 × 2 = 0 + 0.496;
    • 16) 0.496 × 2 = 0 + 0.992;
    • 17) 0.992 × 2 = 1 + 0.984;
    • 18) 0.984 × 2 = 1 + 0.968;
    • 19) 0.968 × 2 = 1 + 0.936;
    • 20) 0.936 × 2 = 1 + 0.872;
    • 21) 0.872 × 2 = 1 + 0.744;
    • 22) 0.744 × 2 = 1 + 0.488;
    • 23) 0.488 × 2 = 0 + 0.976;
    • 24) 0.976 × 2 = 1 + 0.952;
    • We didn't get any fractional part that was equal to zero. But we had enough iterations (over Mantissa limit = 23) and at least one integer part that was different from zero => FULL STOP (losing precision...).
  • 5. Construct the base 2 representation of the fractional part of the number, by taking all the integer parts of the previous multiplying operations, starting from the top of the constructed list above:

    0.347(10) = 0.0101 1000 1101 0100 1111 1101(2)

  • 6. Summarizing - the positive number before normalization:

    25.347(10) = 1 1001.0101 1000 1101 0100 1111 1101(2)

  • 7. Normalize the binary representation of the number, shifting the decimal point 4 positions to the left so that only one non-zero digit stays to the left of the decimal point:

    25.347(10) =
    1 1001.0101 1000 1101 0100 1111 1101(2) =
    1 1001.0101 1000 1101 0100 1111 1101(2) × 20 =
    1.1001 0101 1000 1101 0100 1111 1101(2) × 24

  • 8. Up to this moment, there are the following elements that would feed into the 32 bit single precision IEEE 754 binary floating point:

    Sign: 1 (a negative number)

    Exponent (unadjusted): 4

    Mantissa (not-normalized): 1.1001 0101 1000 1101 0100 1111 1101

  • 9. Adjust the exponent in 8 bit excess/bias notation and then convert it from decimal (base 10) to 8 bit binary (base 2), by using the same technique of repeatedly dividing it by 2, as already demonstrated above:

    Exponent (adjusted) = Exponent (unadjusted) + 2(8-1) - 1 = (4 + 127)(10) = 131(10) =
    1000 0011(2)

  • 10. Normalize the mantissa, remove the leading (leftmost) bit, since it's allways '1' (and the decimal point) and adjust its length to 23 bits, by removing the excess bits from the right (losing precision...):

    Mantissa (not-normalized): 1.1001 0101 1000 1101 0100 1111 1101

    Mantissa (normalized): 100 1010 1100 0110 1010 0111

  • Conclusion:

    Sign (1 bit) = 1 (a negative number)

    Exponent (8 bits) = 1000 0011

    Mantissa (23 bits) = 100 1010 1100 0110 1010 0111

  • Number -25.347, converted from the decimal system (base 10) to 32 bit single precision IEEE 754 binary floating point =
    1 - 1000 0011 - 100 1010 1100 0110 1010 0111