Question

Assume we are using the simple model for the floating-point representation given in this book (the...

Assume we are using the simple model for the floating-point representation given in this book (the representation uses a 14-bit format, 5 bits for the exponent with a bias of 15, a zero-normalized mantissa of 8 bits, and a single sign bit). What is the real number represented by 01110010000101? Show how the computer would add this number with 238.5 using the floating-point addition algorithm. Verify your results by adding them in decimal, what is the relative error in the representation of the output sum. Note from student: Attempting to convert 01110010000101 to decimal resulted in 12,448. I'm not sure if this is correct and I cannot find a converter online. Any help would be appreciated!

Homework Answers

Answer #1

Solution for the problem is provided below, please comment if any doubts:

Here the 14-bit floating point representation is used with:

            5 bit exponent, 8 bit zero-normalized mantissa and a sign bit.

Part I:

The floating point number given is: 01110010000101

The sign bit = 0, Positive

The 5 bit exponent = 11100 = 28, but it is a biased value by 15, thus the original exponent value =28-15=13

The mantissa part is: 10000101, since zero normalized, the mantissa will be =

= 0.10000101

Now shift the point to 13 bits right and place 0’s if place is not there

0.10000101*213 = 1000010100000 = 4256

Thus the real number equivalent of 01110010000101 = 4256

Part II:

To add with 238.5

First convert 238.5 to floating point format

First convert to binary, 238.5 = 11101110.1

Now convert to zero normalized mantissa and compute the exponent as the shifted bits of point.

11101110.1=0.111011101*2^8

Now perform the addition of , 0.10000101*213 and 0.111011101*28

For that first make both the exponents same

0.111011101*28 = 0.00000111011101*213

Now perform the addition of equal exponent numbers

0.1000010100000*213 + 0.00000111011101*213

=0.10001001111101*213

Now convert back to floating point, eliminate bits after 8 bits in result

Add 15 to exponent, 13+15=28= 11100

Mantissa: 10001001

Sign: 0

Result= 01110010001001

Convert back to decimal

= 0. 10001001*213

=1000100100000

=4384

The actual result has to be = 4256+238.5 =4494.5

The relative error = (4494.5-4384)/ 4494.5 = 0.02 or 2%

Know the answer?
Your Answer:

Post as a guest

Your Name:

What's your source?

Earn Coins

Coins can be redeemed for fabulous gifts.

Not the answer you're looking for?
Ask your own homework help question
Similar Questions
Given a 12-bit IEEE floating point format with 5 exponent bits: Give the hexadecimal representation for...
Given a 12-bit IEEE floating point format with 5 exponent bits: Give the hexadecimal representation for the bit-pattern representing −∞−∞. Give the hexadecimal representation for the bit-patterns representing +0 and -1. Give the decimal value for the floating point number represented by the bit-pattern 0xcb0. Give the decimal value for largest finite positive number which can be represented? Give the decimal value for the non-zero negative floating point number having the smallest magnitude. What are the smallest and largest magnitudes...
Concern the following 16-bit floating point representation: The first bit is the sign of the number...
Concern the following 16-bit floating point representation: The first bit is the sign of the number (0 = +, 1 = -), the next nine bits are the mantissa, the next bit is the sign of the exponent, and the last five bits are the magnitude of the exponent. All numbers are normalized, i.e. the first bit of the mantissa is one, except for zero which is all zeros. 1. How many significant binary digits do numbers in this representation...
Matlab uses IEEE double precision numbers: 64-bit floating point representation 1 bit : sign 11 bits:...
Matlab uses IEEE double precision numbers: 64-bit floating point representation 1 bit : sign 11 bits: exponent 52 bits: mantissa. Calculate largest number (less than inf) that can be stored accurately Calculate smallest number (x>0) that can be stored accurately Calculate the machine epsilon Show all work step by step and repeat for 10 bit floating point (bit sign, 4 bits exponent and 5 bits mantissa)
Matlab uses IEEE double precision numbers: 64-bit floating point representation 1 bit : sign 11 bits:...
Matlab uses IEEE double precision numbers: 64-bit floating point representation 1 bit : sign 11 bits: exponent 52 bits: mantissa. Calculate largest number that can be stored accurately Calculate smallest number (x>0) that can be stored accurately Calculate the machine epsilon Show all work step by step and explain calculations Now calculate the largest number and smallest number for a 10 bit floating point (1 bit for the sign, 4 bits exponent and 5 bits mantissa)
.A floating-point number representation on a certain system has a sign bit, a 4-bit exponent and...
.A floating-point number representation on a certain system has a sign bit, a 4-bit exponent and a 5-bit significand. Please provide steps taken to get the answer a) What is the largest positive and the smallest positive number that can be stored on this system if the storage is normalized? (Assume no bits are implied, there is no biasing, exponents use two's complement notation, and exponents of all zeros and all ones are allowed.) b) What bias should be used...
Question 9.1 Half-precision Floating-point Format (50 marks) Do some research and find out how real (floating...
Question 9.1 Half-precision Floating-point Format Do some research and find out how real (floating point) numbers are represented in Binary. (a) (10 =6+4 marks) Devise your own 16-bit representation for floating point numbers. Draw a diagram of your representation and explain what the various bits are used for. Explain in detail: (i) How many bits are allocated to the mantissa and the exponent, respectively? (ii) What defines the range and the precision (or accuracy) of the numbers stored in floating...
urgent: Consider a 5-bit floating point representation based on the IEEE floating point format with one...
urgent: Consider a 5-bit floating point representation based on the IEEE floating point format with one sign bit, the next two bits of the exponent (exponent bias is 1), and the last two bits of the significand. Fill in the table below. For column M and Value, your answer must be expressed as a fraction of the form x/4. Bits M E Value 0 01 00 0 01 01 0 01 10 0 01 11 0 10 00 1 4/4...
Assuming a 5-bit IEEE (754 standard) floating-point format where 1 bit is used for the sign,...
Assuming a 5-bit IEEE (754 standard) floating-point format where 1 bit is used for the sign, 3 bits for the exponent, and 1 bit for the fraction, write the formulas for the exponent E, the significand M, the fraction f, and the value V for the quantities that follow and also describe the bit representation. Please show all steps to receive full credit. The number 5.0 The largest odd integer that can be represented exactly The reciprocal of the smallest...
What is the 16-bit binary representation (in hexadecimal using lower-case letters, e.g., 0x39ab) of -13 1/4...
What is the 16-bit binary representation (in hexadecimal using lower-case letters, e.g., 0x39ab) of -13 1/4 (base 10) when represented as an IEEE 16-bit floating-point number? The IEEE 16-bit floating-point representation uses formulae consistent with those for the 32bit single-precision representation, except for using 5 bits for the exponent (instead of 8 in the case of the 32-bit representation) and a bias of 15.
Find the internal representation of the following decimal number in the Single Precision Floating Point format...
Find the internal representation of the following decimal number in the Single Precision Floating Point format of the value: -17.6 Non-terminating fractions should be carried out 6 places. You will show the different steps involved in this transformation by filling out the fields below. The value 17 in binary is ___ 2 (no leading or trailing zeroes). The value .6 in binary is ____ 2 (complete to 6 places) Normalized fraction: 1.____ 2 x 2Exponent. Exponent=_____. Biased Exponent in Binary:...
ADVERTISEMENT
Need Online Homework Help?

Get Answers For Free
Most questions answered within 1 hours.

Ask a Question
ADVERTISEMENT