1 000 110 000 000 011 110 010 000 000 210 Converted to 32 Bit Single Precision IEEE 754 Binary Floating Point Representation Standard

Convert decimal 1 000 110 000 000 011 110 010 000 000 210(10) to 32 bit single precision IEEE 754 binary floating point representation standard (1 bit for sign, 8 bits for exponent, 23 bits for mantissa)

What are the steps to convert decimal number
1 000 110 000 000 011 110 010 000 000 210(10) to 32 bit single precision IEEE 754 binary floating point representation (1 bit for sign, 8 bits for exponent, 23 bits for mantissa)

1. Divide the number repeatedly by 2.

Keep track of each remainder.

We stop when we get a quotient that is equal to zero.


  • division = quotient + remainder;
  • 1 000 110 000 000 011 110 010 000 000 210 ÷ 2 = 500 055 000 000 005 555 005 000 000 105 + 0;
  • 500 055 000 000 005 555 005 000 000 105 ÷ 2 = 250 027 500 000 002 777 502 500 000 052 + 1;
  • 250 027 500 000 002 777 502 500 000 052 ÷ 2 = 125 013 750 000 001 388 751 250 000 026 + 0;
  • 125 013 750 000 001 388 751 250 000 026 ÷ 2 = 62 506 875 000 000 694 375 625 000 013 + 0;
  • 62 506 875 000 000 694 375 625 000 013 ÷ 2 = 31 253 437 500 000 347 187 812 500 006 + 1;
  • 31 253 437 500 000 347 187 812 500 006 ÷ 2 = 15 626 718 750 000 173 593 906 250 003 + 0;
  • 15 626 718 750 000 173 593 906 250 003 ÷ 2 = 7 813 359 375 000 086 796 953 125 001 + 1;
  • 7 813 359 375 000 086 796 953 125 001 ÷ 2 = 3 906 679 687 500 043 398 476 562 500 + 1;
  • 3 906 679 687 500 043 398 476 562 500 ÷ 2 = 1 953 339 843 750 021 699 238 281 250 + 0;
  • 1 953 339 843 750 021 699 238 281 250 ÷ 2 = 976 669 921 875 010 849 619 140 625 + 0;
  • 976 669 921 875 010 849 619 140 625 ÷ 2 = 488 334 960 937 505 424 809 570 312 + 1;
  • 488 334 960 937 505 424 809 570 312 ÷ 2 = 244 167 480 468 752 712 404 785 156 + 0;
  • 244 167 480 468 752 712 404 785 156 ÷ 2 = 122 083 740 234 376 356 202 392 578 + 0;
  • 122 083 740 234 376 356 202 392 578 ÷ 2 = 61 041 870 117 188 178 101 196 289 + 0;
  • 61 041 870 117 188 178 101 196 289 ÷ 2 = 30 520 935 058 594 089 050 598 144 + 1;
  • 30 520 935 058 594 089 050 598 144 ÷ 2 = 15 260 467 529 297 044 525 299 072 + 0;
  • 15 260 467 529 297 044 525 299 072 ÷ 2 = 7 630 233 764 648 522 262 649 536 + 0;
  • 7 630 233 764 648 522 262 649 536 ÷ 2 = 3 815 116 882 324 261 131 324 768 + 0;
  • 3 815 116 882 324 261 131 324 768 ÷ 2 = 1 907 558 441 162 130 565 662 384 + 0;
  • 1 907 558 441 162 130 565 662 384 ÷ 2 = 953 779 220 581 065 282 831 192 + 0;
  • 953 779 220 581 065 282 831 192 ÷ 2 = 476 889 610 290 532 641 415 596 + 0;
  • 476 889 610 290 532 641 415 596 ÷ 2 = 238 444 805 145 266 320 707 798 + 0;
  • 238 444 805 145 266 320 707 798 ÷ 2 = 119 222 402 572 633 160 353 899 + 0;
  • 119 222 402 572 633 160 353 899 ÷ 2 = 59 611 201 286 316 580 176 949 + 1;
  • 59 611 201 286 316 580 176 949 ÷ 2 = 29 805 600 643 158 290 088 474 + 1;
  • 29 805 600 643 158 290 088 474 ÷ 2 = 14 902 800 321 579 145 044 237 + 0;
  • 14 902 800 321 579 145 044 237 ÷ 2 = 7 451 400 160 789 572 522 118 + 1;
  • 7 451 400 160 789 572 522 118 ÷ 2 = 3 725 700 080 394 786 261 059 + 0;
  • 3 725 700 080 394 786 261 059 ÷ 2 = 1 862 850 040 197 393 130 529 + 1;
  • 1 862 850 040 197 393 130 529 ÷ 2 = 931 425 020 098 696 565 264 + 1;
  • 931 425 020 098 696 565 264 ÷ 2 = 465 712 510 049 348 282 632 + 0;
  • 465 712 510 049 348 282 632 ÷ 2 = 232 856 255 024 674 141 316 + 0;
  • 232 856 255 024 674 141 316 ÷ 2 = 116 428 127 512 337 070 658 + 0;
  • 116 428 127 512 337 070 658 ÷ 2 = 58 214 063 756 168 535 329 + 0;
  • 58 214 063 756 168 535 329 ÷ 2 = 29 107 031 878 084 267 664 + 1;
  • 29 107 031 878 084 267 664 ÷ 2 = 14 553 515 939 042 133 832 + 0;
  • 14 553 515 939 042 133 832 ÷ 2 = 7 276 757 969 521 066 916 + 0;
  • 7 276 757 969 521 066 916 ÷ 2 = 3 638 378 984 760 533 458 + 0;
  • 3 638 378 984 760 533 458 ÷ 2 = 1 819 189 492 380 266 729 + 0;
  • 1 819 189 492 380 266 729 ÷ 2 = 909 594 746 190 133 364 + 1;
  • 909 594 746 190 133 364 ÷ 2 = 454 797 373 095 066 682 + 0;
  • 454 797 373 095 066 682 ÷ 2 = 227 398 686 547 533 341 + 0;
  • 227 398 686 547 533 341 ÷ 2 = 113 699 343 273 766 670 + 1;
  • 113 699 343 273 766 670 ÷ 2 = 56 849 671 636 883 335 + 0;
  • 56 849 671 636 883 335 ÷ 2 = 28 424 835 818 441 667 + 1;
  • 28 424 835 818 441 667 ÷ 2 = 14 212 417 909 220 833 + 1;
  • 14 212 417 909 220 833 ÷ 2 = 7 106 208 954 610 416 + 1;
  • 7 106 208 954 610 416 ÷ 2 = 3 553 104 477 305 208 + 0;
  • 3 553 104 477 305 208 ÷ 2 = 1 776 552 238 652 604 + 0;
  • 1 776 552 238 652 604 ÷ 2 = 888 276 119 326 302 + 0;
  • 888 276 119 326 302 ÷ 2 = 444 138 059 663 151 + 0;
  • 444 138 059 663 151 ÷ 2 = 222 069 029 831 575 + 1;
  • 222 069 029 831 575 ÷ 2 = 111 034 514 915 787 + 1;
  • 111 034 514 915 787 ÷ 2 = 55 517 257 457 893 + 1;
  • 55 517 257 457 893 ÷ 2 = 27 758 628 728 946 + 1;
  • 27 758 628 728 946 ÷ 2 = 13 879 314 364 473 + 0;
  • 13 879 314 364 473 ÷ 2 = 6 939 657 182 236 + 1;
  • 6 939 657 182 236 ÷ 2 = 3 469 828 591 118 + 0;
  • 3 469 828 591 118 ÷ 2 = 1 734 914 295 559 + 0;
  • 1 734 914 295 559 ÷ 2 = 867 457 147 779 + 1;
  • 867 457 147 779 ÷ 2 = 433 728 573 889 + 1;
  • 433 728 573 889 ÷ 2 = 216 864 286 944 + 1;
  • 216 864 286 944 ÷ 2 = 108 432 143 472 + 0;
  • 108 432 143 472 ÷ 2 = 54 216 071 736 + 0;
  • 54 216 071 736 ÷ 2 = 27 108 035 868 + 0;
  • 27 108 035 868 ÷ 2 = 13 554 017 934 + 0;
  • 13 554 017 934 ÷ 2 = 6 777 008 967 + 0;
  • 6 777 008 967 ÷ 2 = 3 388 504 483 + 1;
  • 3 388 504 483 ÷ 2 = 1 694 252 241 + 1;
  • 1 694 252 241 ÷ 2 = 847 126 120 + 1;
  • 847 126 120 ÷ 2 = 423 563 060 + 0;
  • 423 563 060 ÷ 2 = 211 781 530 + 0;
  • 211 781 530 ÷ 2 = 105 890 765 + 0;
  • 105 890 765 ÷ 2 = 52 945 382 + 1;
  • 52 945 382 ÷ 2 = 26 472 691 + 0;
  • 26 472 691 ÷ 2 = 13 236 345 + 1;
  • 13 236 345 ÷ 2 = 6 618 172 + 1;
  • 6 618 172 ÷ 2 = 3 309 086 + 0;
  • 3 309 086 ÷ 2 = 1 654 543 + 0;
  • 1 654 543 ÷ 2 = 827 271 + 1;
  • 827 271 ÷ 2 = 413 635 + 1;
  • 413 635 ÷ 2 = 206 817 + 1;
  • 206 817 ÷ 2 = 103 408 + 1;
  • 103 408 ÷ 2 = 51 704 + 0;
  • 51 704 ÷ 2 = 25 852 + 0;
  • 25 852 ÷ 2 = 12 926 + 0;
  • 12 926 ÷ 2 = 6 463 + 0;
  • 6 463 ÷ 2 = 3 231 + 1;
  • 3 231 ÷ 2 = 1 615 + 1;
  • 1 615 ÷ 2 = 807 + 1;
  • 807 ÷ 2 = 403 + 1;
  • 403 ÷ 2 = 201 + 1;
  • 201 ÷ 2 = 100 + 1;
  • 100 ÷ 2 = 50 + 0;
  • 50 ÷ 2 = 25 + 0;
  • 25 ÷ 2 = 12 + 1;
  • 12 ÷ 2 = 6 + 0;
  • 6 ÷ 2 = 3 + 0;
  • 3 ÷ 2 = 1 + 1;
  • 1 ÷ 2 = 0 + 1;

2. Construct the base 2 representation of the positive number.

Take all the remainders starting from the bottom of the list constructed above.

1 000 110 000 000 011 110 010 000 000 210(10) =


1100 1001 1111 1000 0111 1001 1010 0011 1000 0011 1001 0111 1000 0111 0100 1000 0100 0011 0101 1000 0000 0100 0100 1101 0010(2)


3. Normalize the binary representation of the number.

Shift the decimal mark 99 positions to the left, so that only one non zero digit remains to the left of it:


1 000 110 000 000 011 110 010 000 000 210(10) =


1100 1001 1111 1000 0111 1001 1010 0011 1000 0011 1001 0111 1000 0111 0100 1000 0100 0011 0101 1000 0000 0100 0100 1101 0010(2) =


1100 1001 1111 1000 0111 1001 1010 0011 1000 0011 1001 0111 1000 0111 0100 1000 0100 0011 0101 1000 0000 0100 0100 1101 0010(2) × 20 =


1.1001 0011 1111 0000 1111 0011 0100 0111 0000 0111 0010 1111 0000 1110 1001 0000 1000 0110 1011 0000 0000 1000 1001 1010 010(2) × 299


4. Up to this moment, there are the following elements that would feed into the 32 bit single precision IEEE 754 binary floating point representation:

Sign 0 (a positive number)


Exponent (unadjusted): 99


Mantissa (not normalized):
1.1001 0011 1111 0000 1111 0011 0100 0111 0000 0111 0010 1111 0000 1110 1001 0000 1000 0110 1011 0000 0000 1000 1001 1010 010


5. Adjust the exponent.

Use the 8 bit excess/bias notation:


Exponent (adjusted) =


Exponent (unadjusted) + 2(8-1) - 1 =


99 + 2(8-1) - 1 =


(99 + 127)(10) =


226(10)


6. Convert the adjusted exponent from the decimal (base 10) to 8 bit binary.

Use the same technique of repeatedly dividing by 2:


  • division = quotient + remainder;
  • 226 ÷ 2 = 113 + 0;
  • 113 ÷ 2 = 56 + 1;
  • 56 ÷ 2 = 28 + 0;
  • 28 ÷ 2 = 14 + 0;
  • 14 ÷ 2 = 7 + 0;
  • 7 ÷ 2 = 3 + 1;
  • 3 ÷ 2 = 1 + 1;
  • 1 ÷ 2 = 0 + 1;

7. Construct the base 2 representation of the adjusted exponent.

Take all the remainders starting from the bottom of the list constructed above.


Exponent (adjusted) =


226(10) =


1110 0010(2)


8. Normalize the mantissa.

a) Remove the leading (the leftmost) bit, since it's allways 1, and the decimal point, if the case.


b) Adjust its length to 23 bits, by removing the excess bits, from the right (if any of the excess bits is set on 1, we are losing precision...).


Mantissa (normalized) =


1. 100 1001 1111 1000 0111 1001 1010 0011 1000 0011 1001 0111 1000 0111 0100 1000 0100 0011 0101 1000 0000 0100 0100 1101 0010 =


100 1001 1111 1000 0111 1001


9. The three elements that make up the number's 32 bit single precision IEEE 754 binary floating point representation:

Sign (1 bit) =
0 (a positive number)


Exponent (8 bits) =
1110 0010


Mantissa (23 bits) =
100 1001 1111 1000 0111 1001


Decimal number 1 000 110 000 000 011 110 010 000 000 210 converted to 32 bit single precision IEEE 754 binary floating point representation:

0 - 1110 0010 - 100 1001 1111 1000 0111 1001


How to convert decimal numbers from base ten to 32 bit single precision IEEE 754 binary floating point standard

Follow the steps below to convert a base 10 decimal number to 32 bit single precision IEEE 754 binary floating point:

  • 1. If the number to be converted is negative, start with its the positive version.
  • 2. First convert the integer part. Divide repeatedly by 2 the base ten positive representation of the integer number that is to be converted to binary, until we get a quotient that is equal to zero, keeping track of each remainder.
  • 3. Construct the base 2 representation of the positive integer part of the number, by taking all the remainders of the previous dividing operations, starting from the bottom of the list constructed above. Thus, the last remainder of the divisions becomes the first symbol (the leftmost) of the base two number, while the first remainder becomes the last symbol (the rightmost).
  • 4. Then convert the fractional part. Multiply the number repeatedly by 2, until we get a fractional part that is equal to zero, keeping track of each integer part of the results.
  • 5. Construct the base 2 representation of the fractional part of the number by taking all the integer parts of the previous multiplying operations, starting from the top of the constructed list above (they should appear in the binary representation, from left to right, in the order they have been calculated).
  • 6. Normalize the binary representation of the number, by shifting the decimal point (or if you prefer, the decimal mark) "n" positions either to the left or to the right, so that only one non zero digit remains to the left of the decimal point.
  • 7. Adjust the exponent in 8 bit excess/bias notation and then convert it from decimal (base 10) to 8 bit binary, by using the same technique of repeatedly dividing by 2, as shown above:
    Exponent (adjusted) = Exponent (unadjusted) + 2(8-1) - 1
  • 8. Normalize mantissa, remove the leading (leftmost) bit, since it's allways '1' (and the decimal sign if the case) and adjust its length to 23 bits, either by removing the excess bits from the right (losing precision...) or by adding extra '0' bits to the right.
  • 9. Sign (it takes 1 bit) is either 1 for a negative or 0 for a positive number.

Example: convert the negative number -25.347 from decimal system (base ten) to 32 bit single precision IEEE 754 binary floating point:

  • 1. Start with the positive version of the number:

    |-25.347| = 25.347

  • 2. First convert the integer part, 25. Divide it repeatedly by 2, keeping track of each remainder, until we get a quotient that is equal to zero:
    • division = quotient + remainder;
    • 25 ÷ 2 = 12 + 1;
    • 12 ÷ 2 = 6 + 0;
    • 6 ÷ 2 = 3 + 0;
    • 3 ÷ 2 = 1 + 1;
    • 1 ÷ 2 = 0 + 1;
    • We have encountered a quotient that is ZERO => FULL STOP
  • 3. Construct the base 2 representation of the integer part of the number by taking all the remainders of the previous dividing operations, starting from the bottom of the list constructed above:

    25(10) = 1 1001(2)

  • 4. Then convert the fractional part, 0.347. Multiply repeatedly by 2, keeping track of each integer part of the results, until we get a fractional part that is equal to zero:
    • #) multiplying = integer + fractional part;
    • 1) 0.347 × 2 = 0 + 0.694;
    • 2) 0.694 × 2 = 1 + 0.388;
    • 3) 0.388 × 2 = 0 + 0.776;
    • 4) 0.776 × 2 = 1 + 0.552;
    • 5) 0.552 × 2 = 1 + 0.104;
    • 6) 0.104 × 2 = 0 + 0.208;
    • 7) 0.208 × 2 = 0 + 0.416;
    • 8) 0.416 × 2 = 0 + 0.832;
    • 9) 0.832 × 2 = 1 + 0.664;
    • 10) 0.664 × 2 = 1 + 0.328;
    • 11) 0.328 × 2 = 0 + 0.656;
    • 12) 0.656 × 2 = 1 + 0.312;
    • 13) 0.312 × 2 = 0 + 0.624;
    • 14) 0.624 × 2 = 1 + 0.248;
    • 15) 0.248 × 2 = 0 + 0.496;
    • 16) 0.496 × 2 = 0 + 0.992;
    • 17) 0.992 × 2 = 1 + 0.984;
    • 18) 0.984 × 2 = 1 + 0.968;
    • 19) 0.968 × 2 = 1 + 0.936;
    • 20) 0.936 × 2 = 1 + 0.872;
    • 21) 0.872 × 2 = 1 + 0.744;
    • 22) 0.744 × 2 = 1 + 0.488;
    • 23) 0.488 × 2 = 0 + 0.976;
    • 24) 0.976 × 2 = 1 + 0.952;
    • We didn't get any fractional part that was equal to zero. But we had enough iterations (over Mantissa limit = 23) and at least one integer part that was different from zero => FULL STOP (losing precision...).
  • 5. Construct the base 2 representation of the fractional part of the number, by taking all the integer parts of the previous multiplying operations, starting from the top of the constructed list above:

    0.347(10) = 0.0101 1000 1101 0100 1111 1101(2)

  • 6. Summarizing - the positive number before normalization:

    25.347(10) = 1 1001.0101 1000 1101 0100 1111 1101(2)

  • 7. Normalize the binary representation of the number, shifting the decimal point 4 positions to the left so that only one non-zero digit stays to the left of the decimal point:

    25.347(10) =
    1 1001.0101 1000 1101 0100 1111 1101(2) =
    1 1001.0101 1000 1101 0100 1111 1101(2) × 20 =
    1.1001 0101 1000 1101 0100 1111 1101(2) × 24

  • 8. Up to this moment, there are the following elements that would feed into the 32 bit single precision IEEE 754 binary floating point:

    Sign: 1 (a negative number)

    Exponent (unadjusted): 4

    Mantissa (not-normalized): 1.1001 0101 1000 1101 0100 1111 1101

  • 9. Adjust the exponent in 8 bit excess/bias notation and then convert it from decimal (base 10) to 8 bit binary (base 2), by using the same technique of repeatedly dividing it by 2, as already demonstrated above:

    Exponent (adjusted) = Exponent (unadjusted) + 2(8-1) - 1 = (4 + 127)(10) = 131(10) =
    1000 0011(2)

  • 10. Normalize the mantissa, remove the leading (leftmost) bit, since it's allways '1' (and the decimal point) and adjust its length to 23 bits, by removing the excess bits from the right (losing precision...):

    Mantissa (not-normalized): 1.1001 0101 1000 1101 0100 1111 1101

    Mantissa (normalized): 100 1010 1100 0110 1010 0111

  • Conclusion:

    Sign (1 bit) = 1 (a negative number)

    Exponent (8 bits) = 1000 0011

    Mantissa (23 bits) = 100 1010 1100 0110 1010 0111

  • Number -25.347, converted from the decimal system (base 10) to 32 bit single precision IEEE 754 binary floating point =
    1 - 1000 0011 - 100 1010 1100 0110 1010 0111