Docsity
Docsity

Prepare for your exams
Prepare for your exams

Study with the several resources on Docsity


Earn points to download
Earn points to download

Earn points by helping other students or get them with a premium plan


Guidelines and tips
Guidelines and tips

Understanding Floating-Point Numbers & Computer Arithmetic in MATH 441/541 - Prof. Erin K., Study notes of Mathematical Methods for Numerical Analysis and Optimization

This document from math 441/541, numerical analysis course, introduces the first meeting topic about computer number representation and arithmetic. It covers internal number representation, focusing on binary, decimal, and hexidecimal bases, and computer (floating-point) representation of numbers using the ieee standard. Determining the value of shift, the number of numbers that can be represented exactly, the range of numbers, and plotting computer numbers. It also covers normalized k-digit decimal floating-point representation, rounding vs. Chopping, absolute and relative error, and computer arithmetic operations. The document also mentions stable operations and a handout on computations to be wary of.

Typology: Study notes

Pre 2010

Uploaded on 08/18/2009

koofers-user-k48
koofers-user-k48 🇺🇸

5

(1)

10 documents

1 / 2

Toggle sidebar

This page cannot be seen from the preview

Don't miss anything!

bg1
MATH 441/541 - Numerical Analysis
First Meeting: Computer Number Representation and Arithmetic
Thursday, August 23, 2007
Outline
1. Internal Number Representation
(a) Bases (Binary, Decimal, Hexidecimal, Converting)
(b) Computer (Floating-Point) Representation of Numbers (IEEE Standard) BINARY
y= (1)s2(cshift)(1 + f)
where
s= sign bit
cshift = shifted exponent (characteristic)
f= fractional part (mantissa)
with double precision-floating point numbers using 64 bit representation:
1 bit for sign, s
11 bits for exponent, c
52 bits for fraction, f
sexponent, cfraction, f
· · ·
i. Determining the value of shift.
ii. The number of numbers we can represent exactly.
iii. The range of the numbers we can represent.
iv. Plotting computer numbers (to see the meshing pattern).
v. Determining the pattern:
A. Increasing/decreasing number of bits for c=?
B. Increasing/decreasing number of bits for f=?
vi. Definitions:
A. Underflow
B. Overflow
C. Machine Epsilon,
D. Unit Roundoff, u
(c) Normalized k-Digit DECIMAL Floating-Point Representation of Numbers
±0.d1d2d3· · · dk×10n, d16= 0, di {0,1,· · · ,9}
i. Rounding vs. Chopping
ii. Absolute and Relative Error
pf2

Partial preview of the text

Download Understanding Floating-Point Numbers & Computer Arithmetic in MATH 441/541 - Prof. Erin K. and more Study notes Mathematical Methods for Numerical Analysis and Optimization in PDF only on Docsity!

MATH 441/541 - Numerical Analysis First Meeting: Computer Number Representation and Arithmetic Thursday, August 23, 2007

Outline

  1. Internal Number Representation (a) Bases (Binary, Decimal, Hexidecimal, Converting) (b) Computer (Floating-Point) Representation of Numbers (IEEE Standard) – BINARY

y = (−1)s^2 (c−shif t)^ (1 + f ) where s = sign bit c − shif t = shifted exponent (characteristic) f = fractional part (mantissa)

with double precision-floating point numbers using 64 bit representation: 1 bit for sign, s 11 bits for exponent, c 52 bits for fraction, f s exponent, c fraction, f · · ·

i. Determining the value of shif t. ii. The number of numbers we can represent exactly. iii. The range of the numbers we can represent. iv. Plotting computer numbers (to see the meshing pattern). v. Determining the pattern: A. Increasing/decreasing number of bits for c =⇒? B. Increasing/decreasing number of bits for f =⇒? vi. Definitions: A. Underflow B. Overflow C. Machine Epsilon,  D. Unit Roundoff, u (c) Normalized k-Digit DECIMAL Floating-Point Representation of Numbers ± 0 .d 1 d 2 d 3 · · · dk × 10 n, d 1 6 = 0, di ∈ { 0 , 1 , · · · , 9 }

i. Rounding vs. Chopping ii. Absolute and Relative Error

  1. Computer Arithmetic

(a) Definition of f l(x), floating-point representation of x:

f l(x) = x (1 + δ), where |δ| < u (unit roundoff)

(b) Operations: ⊕, , ⊗, ÷© (c) Stable Operations (proving or disproving) (d) Handout on Computations to Be Wary Of

  1. Rates of Convergence and Terminology

(a) {αn} → α with a rate of convergence O(βn)