Part I: The Basics
Programming languages have distinctive features that determine the kinds of applications for which they are well suited. They also share many fundamental attributes. Essentially all languages provide:
Built-in data types such as integers, characters, and so forth
Expressions and statements to manipulate values of these types
Variables, which let us give names to the objects we use
Control structures, such as if or while, that allow us to conditionally execute or repeat a set of actions
Functions that let us abstract actions into callable units of computation
Most modern programming languages supplement this basic set of features in two ways: They let programmers extend the language by defining their own data types, and they provide a set of library routines that define useful functions and data types not otherwise built into the language.
In C++, as in most programming languages, the type of an object determines what operations can be performed on it.
In contrast, C++ is a statically typed language; type-checking is done at compile time. As a consequence, the compiler must be told the type of every name used in the program before that name can be used.
One of the primary design goals of C++ is to let programmers define their own types that are as easy to use as the built-in types. The Standard C++ library uses these features to implement a rich library of class types and associated functions.
Chapter 2. Variables and Basic Types
In C++ the support for types is extensive: The language itself defines a set of primitive types and ways in which we can modify existing types. It also provides a set of features that allow us to define our own types.
2.1. Primitive Built-in Types
- The void type has no associated values and can be used in only a limited set of circumstances. The void type is most often used as the return type for a function that has no return value
- The size of the arithmetic types varies across machines. By size, we mean the number of bits used to represent the type. The standard guarantees a minimum size for each of the arithmetic types, but it does not prevent compilers from using larger sizes. Indeed, almost all compilers use a larger size for int than is strictly required.
- Table 2.1. C++: Arithmetic Types
- Because the number of bits varies, the maximum (or minimum) values that these types can represent also vary by machine.
2.1.1. Integral Types
- The arithmetic types that represent integers, characters, and boolean values are collectively referred to as the integral types.
- There are two character types: char and wchar_t. The char type is guaranteed to be big enough to hold numeric values that correspond to any character in the machine's basic character set. As a result, chars are usually a single machine byte. The wchar_t type is used for extended character sets, such as those used for Chinese and Japanese, in which some characters cannot be represented within a single char.
- The types short, int, and long represent integer values of potentially different sizes. Typically, shorts are represented in half a machine word, ints in a machine word, and longs in either one or two machine words (on 32-bit machines, ints and longs are usually the same size)
- Machine-Level Representation of The Built-in Types
- The C++ built-in types are closely tied to their representation in the computer's memory. Computers store data as a sequence of bits, each of which holds either 0 or 1.
- At the bit level, memory has no structure and no meaning.
- The most primitive way we impose structure on memory is by processing it in chunks. Most computers deal with memory as chunks of bits of particular sizes, usually powers of 2.
- Although the exact sizes can vary from one machine to another, we usually refer to a chunk of 8 bits as a "byte" and 32 bits, or 4 bytes, as a "word."
- Most computers associate a numbercalled an addresswith each byte in memory.
- To give meaning to the byte at address 736425, we must know the type of the value stored there. Once we know the type, we know how many bits are needed to represent a value of that type and how to interpret those bits.
- Signed and Unsigned Types
- The integral types, except the boolean type, may be either signed or unsigned. As its name suggests, a signed type can represent both negative and positive numbers (including zero), whereas an unsigned type represents only values greater than or equal to zero.
- The integers, int, short, and long, are all signed by default. To get an unsigned type, the type must be specified as unsigned, such as unsigned long. The unsigned int type may be abbreviated as unsigned. That is, unsigned with no other type implies unsigned int.
- Unlike the other integral types, there are three distinct types for char: plain char, signed char, and unsigned char. Although there are three distinct types, there are only two ways a char can be represented. The char type is respresented using either the signed char or unsigned char version. Which representation is used for char varies by compiler.
- How Integral Values Are Represented
- In an unsigned type, all the bits represent the value.
- The C++ standard does not define how signed types are represented at the bit level. Instead, each compiler is free to decide how it will represent signed types. These representations can affect the range of values that a signed type can hold. We are guaranteed that an 8-bit signed type will hold at least the values from 127 through 127; many implementations allow values from 128 through 127.
- Under the most common strategy for representing signed integral types, we can view one of the bits as a sign bit. Whenever the sign bit is 1, the value is negative; when it is 0, the value is either 0 or a positive number.
- Assignment to Integral Types
- The type of an object determines the values that the object can hold. This fact raises the question of what happens when one tries to assign a value outside the allowable range to an object of a given type. The answer depends on whether the type is signed or unsigned.
- For unsigned types, the compiler must adjust the out-of-range value so that it will fit. The compiler does so by taking the remainder of the value modulo the number of distinct values the unsigned target type can hold.
- For the unsigned types, a negative value is always out of range. An object of unsigned type may never hold a negative value. Some languages make it illegal to assign a negative value to an unsigned type, but C++ does not.
- Beware: In C++ it is perfectly legal to assign a negative number to an object with unsigned type. The result is the negative value modulo the size of the type. So, if we assign 1 to an 8-bit unsigned char, the resulting value will be 255, which is 1 modulo 256.
- When assigning an out-of-range value to a signed type, it is up to the compiler to decide what value to assign. In practice, many compilers treat signed types similarly to how they are required to treat unsigned types. That is, they do the assignment as the remainder modulo the size of the type. However, we are not guaranteed that the compiler will do so for the signed types.
2.1.2. Floating-Point Types
- The types float, double, and long double represent floating-point single-, double-, and extended-precision values.
- Typically, floats are represented in one word (32 bits), doubles in two words (64 bits), and long double in either three or four words (96 or 128 bits). The size of the type determines the number of significant digits a floating-point value might contain.
- Beware:The float type is usually not precise enough for real programsfloat is guaranteed to offer only 6 significant digits. The double type guarantees at least 10 significant digits, which is sufficient for most calculations.
2.2. Literal Constants
- a literal constant: literal because we can speak of it only in terms of its value; constant because its value cannot be changed. Every literal has an associated type.
- Literals exist only for the built-in types. There are no literals of class types. Hence, there are no literals of any of the library types.
- Advice: Using the Built-in Arithmetic Types
- C++, like C, is designed to let programs get close to the hardware when necessary, and the integral types are defined to cater to the peculiarities of various kinds of hardware. Most programmers can (and should) ignore these complexities by restricting the types they actually use.
- In practice, many uses of integers involve counting.When counting in other circumstances, it is usually right to use an unsigned value. Doing so avoids the possibility that a value that is too large to fit results in a (seemingly) negative result.
- When performing integer arithmetic, it is rarely right to use shorts. In most programs, using shorts leads to mysterious bugs when a value is assigned to a short that is bigger than the largest number it can hold. What happens depends on the machine, but typically the value "wraps around" so that a number too large to fit turns into a large negative number. For the same reason, even though char is an integral type, the char type should be used to hold characters and not for computation. The fact that char is signed on some implementations and unsigned on others makes it problematic to use it as a computational type.
- On most machines, integer calculations can safely use int. Technically speaking, an int can be as small as 16 bitstoo small for most purposes. In practice, almost all general-purpose machines use 32-bits for ints, which is often the same size used for long. The difficulty in deciding whether to use int or long occurs on machines that have 32-bit ints and 64-bit longs. On such machines, the run-time cost of doing arithmetic with longs can be considerably greater than doing the same calculation using a 32-bit int. Deciding whether to use int or long requires detailed understanding of the program and the actual run-time performance cost of using long versus int.
- Determining which floating-point type to use is easier: It is almost always right to use double. The loss of precision implicit in float is significant, whereas the cost of double precision calculations versus single precision is negligible. In fact, on some machines, double precision is faster than single. The precision offered by long double usually is unnecessary and often entails considerable extra run-time cost.
- Rules for Integer Literals
- We can write a literal integer constant using one of three notations: decimal, octal, or hexadecimal. These notations, of course, do not change the bit representation of the value, which is always binary.
- Literal integer constants that begin with a leading 0 (zero) are interpreted as octal; those that begin with either 0x or 0X are interpreted as hexadecimal.
- By default, the type of a literal integer constant is either int or long. The precise type depends on the value of the literalvalues that fit in an int are type int and larger values are type long. By adding a suffix, we can force the type of a literal integer constant to be type long or unsigned or unsigned long. We specify that a constant is a long by immediately following the value with either L or l (the letter "ell" in either uppercase or lowercase).
- TIPS: When specifying a long, use the uppercase L: the lowercase letter l is too easily mistaken for the digit 1.
- In a similar manner, we can specify unsigned by following the literal with either U or u. We can obtain an unsigned long literal constant by following the value by both L and U.
- There are no literals of type short.
- Rules for Floating-Point Literals : 2010.8.23 learning mark
- We can use either common decimal notation or scientific notation to write floating-point literal constants.
- Using scientific notation, the exponent is indicated either by E or e.
- By default, floating-point literals are type double.
- We indicate single precision by following the value with either F or f. Similarly, we specify extended precision by following the value with either L or l
- 3.14159F .001f 12.345L 0. 3.14159E0f 1E-3F 1.2345E1L 0e0
- Boolean and Character Literals
- The words true and false are literals of type bool
- Printable character literals are written by enclosing the character within single quotation marks:'a' '2' ',' ' ' // blank
- We can obtain a wide-character literal of type wchar_t by immediately preceding the character literal with an L, as in L'a'
- Escape Sequences for Nonprintable Characters
- Some characters are nonprintable.
- Nonprintable characters and special characters are written using an escape sequence. An escape sequence begins with a backslash. The language defines the following escape sequences:
- An escape sequence begins with a backslash. The language defines the following escape sequences:
newline
\n
horizontal tab
\t
vertical tab
\v
backspace
\b
carriage return
\r
formfeed
\f
alert (bell)
\a
backslash
\\
question mark
\?
single quote
\'
double quote
\"
- We can write any character as a generalized escape sequence of the form \ooo where ooo represents a sequence of as many as three octal digits. The value of the octal digits represents the numerical value of the character.
- The following examples are representations of literal constants using the ASCII character set: \7 (bell) \12 (newline) \40 (blank) \0 (null) \062 ('2') \115 ('M')
- The character represented by '\0' is often called a "null character," and has special significance,
- We can also write a character using a hexadecimal escape sequence \xddd consisting of a backslash, an x, and one or more hexadecimal digits.
- Character String Literals
- String literals are arrays of constant characters,
- String literal constants are written as zero or more characters enclosed in double quotation marks. Nonprintable characters are represented by their underlying escape sequence.
- For compatibility with C, string literals in C++ have one character in addition to those typed in by the programmer. Every string literal ends with a null character added by the compiler.
- A character literal 'A' represents the single character A, whereas “A" represents an array of two characters: the letter A and the null character.
- Just as there is a wide character literal, such as L'a' there is a wide string literal, again preceded by L, such as L"a wide string literal" The type of a wide string literal is an array of constant wide characters. It is also terminated by a wide null character.
- Concatenated String Literals
- Two string literals (or two wide string literals) that appear adjacent to one another and separated only by spaces, tabs, or newlines are concatenated into a single new string literal.
- What happens if you attempt to concatenate a string literal and a wide string literal? For example: std::cout << "multi-line " L"literal " << std::endl; The result is undefinedthat is, there is no standard behavior defined for concatenating the two different types. The program might appear to work, but it also might crash or produce garbage values. Moreover, the program might behave differently under one compiler than under another.
- Advice: Don't Rely on Undefined Behavior
- Programs that use undefined behavior are in error. If they work, it is only by coincidence. Undefined behavior results from a program error that the compiler cannot detect or from an error that would be too much trouble to detect.
- Unfortunately, programs that contain undefined behavior can appear to execute correctly in some circumstances and/or on one compiler. There is no guarantee that the same program, compiled under a different compiler or even a subsequent release of the current compiler, will continue to run correctly. Nor is there any guarantee that what works with one set of inputs will work with another.
- Programs should not (knowingly) rely on undefined behavior. Similarly, programs usually should not rely on machine-dependent behavior, such as assuming that the size of an int is a fixed and known value. Such programs are said to be nonportable. When the program is moved to another machine, any code that relies on machine-dependent behavior may have to be found and corrected. Tracking down these sorts of problems in previously working programs is, mildly put, a profoundly unpleasant task.
- Multi-Line Literals
- There is a more primitive (and less useful) way to handle long strings that depends on an infrequently used program formatting feature: Putting a backslash as the last character on a line causes that line and the next to be treated as a single line.
- // ok: A \ before a newline ignores the line break
std::cou\
t << "Hi" << st\
d::endl; - is equivalent to std::cout << "Hi" << std::endl;
- Note that the backslash must be the last thing on the line no comments or trailing blanks are allowed.
- Also, any leading spaces or tabs on the subsequent lines are part of the literal.
- For this reason, the continuation lines of the long literal do not have the normal indentation.
Section 2.3. Variables
-
Key Concept: Strong Static Typing
- C++ is a statically typed language, which means that types are checked at compile time. The process by which types are checked is referred to as type-checking.
- In most languages, the type of an object constrains the operations that the object can perform. If the type does not support a given operation, then an object of that type cannot perform that operation.
- In C++, whether an operation is legal or not is checked at compile time. When we write an expression, the compiler checks that the objects used in the expression are used in ways that are defined by the type of the objects. If not, the compiler generates an error message; an executable file is not produced.
- As our programs, and the types we use, get more complicated, we'll see that static type checking helps find bugs in our programs earlier. A consequence of static checking is that the type of every entity used in our programs must be known to the compiler. Hence, we must define the type of a variable before we can use that variable in our programs.
-
2.3.1. What Is a Variable?
- A variable provides us with named storage that our programs can manipulate.
- Each variable in C++ has a specific type, which determines the size and layout of the variable's memory; the range of values that can be stored within that memory; and the set of operations that can be applied to the variable.
- C++ programmers tend to refer to variables as "variables" or as "objects" interchangeably.
- Lvalues and Rvalues
- there are two kinds of expressions in C++:
- lvalue (pronounced "ell-value"): An expression that is an lvalue may appear as either the left-hand or right-hand side of an assignment.
- rvalue (pronounced "are-value"): An expression that is an rvalue may appear on the right- but not left-hand side of an assignment.
- Variables are lvalues and so may appear on the left-hand side of an assignment. Numeric literals are rvalues and so may not be assigned.
- Note: In the course of the text, we'll see a number of situations in which the use of an rvalue or lvalue impacts the behavior and/or the performance of our programsin particular when passing and returning values from a function.
- Terminology: What Is an object?
- C++ programmers tend to be cavalier in their use of the term object. Most generally, an object is a region of memory that has a type. More specifically, evaluating an expression that is an lvalue yields an object.
- Strictly speaking, some might reserve the term object to describe only variables or values of class types. Others might distinguish between named and unnamed objects, always referring to variables when discussing named objects. Still others distinguish between objects and values, using the term object for data that can be changed by the program and using the term value for those that are read-only.
- In this book, we'll follow the more colloquial usage that an object is a region of memory that has a type. We will freely use object to refer to most of the data manipulated by our programs regardless of whether those data have built-in or class type, are named or unnamed, or are data that can be read or written.
-
2.3.2. The Name of a Variable
- The name of a variable, its identifier, can be composed of letters, digits, and the underscore character. It must begin with either a letter or an underscore. Upper- and lowercase letters are distinct: Identifiers in C++ are case-sensitive.
- Best Pratices: There is no language-imposed limit on the permissible length of a name, but out of consideration for others that will read and/or modify our code, it should not be too long.
- C++ Keywords: C++ reserves a set of words for use within the language as keywords. Keywords may not be used as program identifiers.
- C++ also reserves a number of words that can be used as alternative names for various operators. These alternative names are provided to support character sets that do not support the standard set of C++ operator symbols.
- In addition to the keywords, the standard also reserves a set of identifiers for use in the library. Identifiers cannot contain two consecutive underscores, nor can an identifier begin with an underscore followed immediately by an upper-case letter. Certain identifiers those that are defined outside a function may not begin with an underscore.
- Conventions for Variable Names
- There are a number of generally accepted conventions for naming variables. Following these conventions can improve the readability of a program.
- A variable name is normally written in lowercase letters.
- An identifier is given a mnemonic name that is, a name that gives some indication of its use in a program, such as on_loan or salary.
- An identifier containing multiple words is written either with an underscore between each word or by capitalizing the first letter of each embedded word.
- Best Practices: The most important aspect of a naming convention is that it be applied consistently
-
-
2.3.3. Defining Objects
- Each definition starts with a type specifier, followed by a comma-separated list of one or more names. A semicolon terminates the definition.
- The type determines the amount of storage that is allocated for the variable and the set of operations that can be performed on it.
- Multiple variables may be defined in a single statement:
- Initialization
- A definition specifies a variable's type and identifier. A definition may also provide an initial value for the object. An object defined with a specified first value is spoken of as initialized.
- C++ supports two forms of variable initialization: copy-initialization and direct-initialization. The copy-initialization syntax uses the equal (=) symbol; direct-initialization places the initializer in parentheses: int ival(1024); // direct-initialization int ival = 1024; // copy-initialization
- Beware:Although, at this point in the book, it may seem obscure to the reader, in C++ it is essential to understand that initialization is not assignment. Initialization happens when a variable is created and gives that variable its initial value. Assignment involves obliterating an object's current value and replacing that value with a new one.
- Many new C++ programmers are confused by the use of the = symbol to initialize a variable. It is tempting to think of initialization as a form of assignment. But initialization and assignment are different operations in C++. This concept is particularly confusing because in many other languages the distinction is irrelevant and can be ignored. Moreover, even in C++ the distinction rarely matters until one attempts to write fairly complex classes. Nonetheless, it is a crucial concept and one that we will reiterate throughout the text.
- Note:There are subtle differences between copy- and direct-initialization when initializing objects of a class type. We won't completely explain these differences until Chapter 13. For now, it's worth knowing that the direct syntax is more flexible and can be slightly more efficient.
- Using Multiple Initializers
- When we initialize an object of a built-in type, there is only one way to do so: We supply a value, and that value is copied into the newly defined object. For built-in types, there is little difference between the direct and the copy forms of initialization.
- For objects of a class type, there are initializations that can be done only using direct-initialization.
- Initializing Multiple Variables
- When a definition defines two or more variables, each variable may have its own initializer. The name of an object becomes visible immediately, and so it is possible to initialize a subsequent variable to the value of one defined earlier in the same definition. Initialized and uninitialized variables may be defined in the same definition. Both forms of initialization syntax may be intermixed:
-
2.3.4. Variable Initialization Rules
- When we define a variable without an initializer, the system sometimes initializes the variable for us. What value, if any, is supplied depends on the type of the variable and may depend on where it is defined.
- Initialization of Variables of Built-in Type
- Whether a variable of built-in type is automatically initialized depends on where it is defined.
- Variables defined outside any function body are initialized to zero.
- Variables of built-in type defined inside the body of a function are uninitialized.
- Using an uninitialized variable for anything other than as the left-hand operand of an assignment is undefined.
- Bugs due to uninitialized variables can be hard to find. As we cautioned on page 42, you should never rely on undefined behavior.
- Best Practices: We recommend that every object of built-in type be initialized. It is not always necessary to initialize such variables, but it is easier and safer to do so until you can be certain it is safe to omit an initializer.
- Caution: Uninitialized Variables Cause Run-Time Problems
- Using an uninitialized object is a common program error, and one that is often difficult to uncover. The compiler is not required to detect a use of an uninitialized variable, although many will warn about at least some uses of uninitialized variables. However, no compiler can detect all uses of uninitialized variables.
- Sometimes, we're lucky and using an uninitialized variable results in an immediate crash at run time. Once we track down the location of the crash, it is usually pretty easy to see that the variable was not properly initialized.
- Other times, the program completes but produces erroneous results. Even worse, the results can appear correct when we run our program on one machine but fail on another. Adding code to the program in an unrelated location can cause what we thought was a correct program to suddenly start to produce incorrect results.
- The problem is that uninitialized variables actually do have a value. The compiler puts the variable somewhere in memory and treats whatever bit pattern was in that memory as the variable's initial state. When interpreted as an integral value, any bit pattern is a legitimate valuealthough the value is unlikely to be one that the programmer intended. Because the value is legal, using it is unlikely to lead to a crash. What it is likely to do is lead to incorrect execution and/or incorrect calculation.
- Initialization of Variables of Class Type
- Each class defines how objects of its type can be initialized. Classes control object initialization by defining one or more constructors
- Each class may also define what happens if a variable of the type is defined but an initializer is not provided. A class does so by defining a special constructor, known as the default constructor. This constructor is called the default constructor because it is run "by default;" if there is no initializer, then this constructor is used. The default constructor is used regardless of where a variable is defined.
- Most classes provide a default constructor. If the class has a default constructor, then we can define variables of that class without explicitly initializing them.
- Some class types do not have a default constructor. For these types, every definition must provide explicit initializer(s). It is not possible to define variables of such types without giving an initial value.
-
2.3.5. Declarations and Definitions
- In order for multiple files to access the same variable, C++ distinguishes between declarations and definitions.
- A definition of a variable allocates storage for the variable and may also specify an initial value for the variable. There must be one and only one definition of a variable in a program.
- A declaration makes known the type and name of the variable to the program. A definition is also a declaration: When we define a variable, we declare its name and type.
- We can declare a name without defining it by using the extern keyword. A declaration that is not also a definition consists of the object's name and its type preceded by the keyword extern:extern int i; // declares but does not define i int i; // declares and defines i
- An extern declaration is not a definition and does not allocate storage. In effect, it claims that a definition of the variable exists elsewhere in the program. A variable can be declared multiple times in a program, but it must be defined only once.
- A declaration may have an initializer only if it is also a definition because only a definition allocates storage. The initializer must have storage to initialize. If an initializer is present, the declaration is treated as a definition even if the declaration is labeled extern:extern double pi = 3.1416; // definition
- An extern declaration may include an initializer only if it appears outside a function.
- Because an extern that is initialized is treated as a definition, any subseqent definition of that variable is an error:
- Similarly, a subsequent extern declaration that has an initializer is also an error:
- The distinction between a declaration and a definition may seem pedantic but in fact is quite important.
- NOTE:In C++ a variable must be defined exactly once and must be defined or declared before it is used.
- Any variable that is used in more than one file requires declarations that are separate from the variable's definition. In such cases, one file will contain the definition for the variable. Other files that use that same variable will contain declarations for but not a definition of that same variable.
-
2.3.6. Scope of a Name
- The context used to distinguish the meanings of names is a scope. A scope is a region of the program. A name can refer to different entities in different scopes.
- Most scopes in C++ are delimited by curly braces. Generally, names are visible from their point of declaration until the end the scope in which the declaration appears.
- Names defined outside any function have global scope; they are accessible from anywhere in the program.
- local scope.
- It can be used in that statement but not elsewhere in main. It has statement scope.
- Scopes in C++ Nest
- Scopes in C++ Nest
- Beware:Programs such as the preceeding are likely to be confusing. It is almost always a bad idea to define a local variable with the same name as a global variable that the function uses or might use. It is much better to use a distinct name for the local.
- We'll have more to say about local and global scope in Chapter 7 and about statement scope in Chapter 6. C++ has two other levels of scope: class scope, which we'll cover in Chapter 12 and namespace scope, which we'll see in Section 17.2.
- Scopes in C++ Nest
-
2.3.7. Define Variables Where They Are Used
- In general, variable definitions or declarations can be placed anywhere within the program that a statement is allowed. A variable must be declared or defined before it is used.
- Best Practices:It is usually a good idea to define an object near the point at which the object is first used.
- Defining an object where the object is first used improves readability. The reader does not have to go back to the beginning of a section of code to find the definition of a particular variable. Moreover, it is often easier to give the variable a useful initial value when the variable is defined close to where it is first used.
- One constraint on placing declarations is that variables are accessible from the point of their definition until the end of the enclosing block. A variable must be defined in or before the outermost scope in which the variable will be used.
-
Section 2.4. const Qualifier
- magic number, one whose significance is not evident within the context of its use. It is as if the number had been plucked by magic from thin air.)
- The second problem is maintainability.
- Defining a const Object
- The const type qualifier provides a solution: It transforms an object into a constant.
- NOTE:Because we cannot subsequently change the value of an object declared to be const, we must initialize it when it is defined:
- const Objects Are Local to a File By Default
- Unlike other variables, unless otherwise specified, const variables declared at global scope are local to the file in which the object is defined. The variable exists in that file only and cannot be accessed by other files.
- We can make a const object accessible throughout the program by specifying that it is extern:
- NOTE:Nonconst variables are extern by default. To make a const variable accessible to other files we must explicitly specify that it is extern
Section 2.5. References
- A reference serves as an alternative name for an object. In real-world programs, references are primarily used as formal parameters to functions.
- In this section we introduce and illustrate the use of references as independent objects.
- A reference is a compound type that is defined by preceding a variable name by the & symbol. A compound type is a type that is defined in terms of another type. In the case of references, each reference type "refers to" some other type. We cannot define a reference to a reference type, but can make a reference to any other data type.
- A reference must be initialized using an object of the same type as the reference:
- A Reference Is an Alias
- Because a reference is just another name for the object to which it is bound, all operations on a reference are actually operations on the underlying object to which the reference is bound:
- NOTE: When a reference is initialized, it remains bound to that object as long as the reference exists. There is no way to rebind a reference to a different object.
- The important concept to understand is that a reference is just another name for an object.
- A consequence of this rule is that you must initialize a reference when you define it; initialization is the only way to say to which object a reference refers.
- Defining Multiple References
- We can define multiple references in a single type definition. Each identifier that is a reference must be preceded by the & symbol:
- const References
- A const reference is a reference that may refer to a const object:
- Terminology: const Reference is a Reference to const: C++ programmers tend to be cavalier in their use of the term const reference. Strictly speaking, what is meant by "const reference" is "reference to const." Similarly, programmers use the term "nonconst reference" when speaking of reference to a nonconst type. This usage is so common that we will follow it in this book as well.
- A const reference can be initialized to an object of a different type or to an rvalue (Section 2.3.1, p. 45), such as a literal constant: (The same initializations are not legal for nonconst references. Rather, they result in compile-time errors. The reason is subtle and warrants an explanation.)
- int i = 42; // legal for const references only const
- int &r = 42;
- const int &r2 = r + i;
- Allowing only const references to be bound to values requiring temporaries avoids the problem entirely because a const reference is read-only.
- NOTE:A nonconst reference may be attached only to an object of the same type as the reference itself.
- A const reference may be bound to an object of a different but related type or to an rvalue.
Section 2.6. Typedef Names
- A typedef lets us define a synonym for a type:
- A typedef name can be used as a type specifier:
- A typedef definition begins with the keyword typedef, followed by the data type and identifier. The identifier, or typedef name, does not introduce a new type but rather a synonym for the existing data type. A typedef name can appear anywhere in a program that a type name can appear.
- Typedefs are commonly used for one of three purposes:
- To hide the implementation of a given type and emphasize instead the purpose for which the type is used
-
To streamline complex type definitions, making them easier to understand
-
To allow a single type to be used for more than one purpose while making the purpose clear each time the type is used
Section 2.7. Enumerations
- Enumerations provide an alternative method of not only defining but also grouping sets of integral constants.
- Defining and Initializing Enumerations
- An enumeration is defined using the enum keyword, followed by an optional enumeration name, and a comma-separated list of enumerators enclosed in braces.
- enum open_modes {input, output, append}; // input is 0, output is 1, and append is 2
- Enumerators Are const Values
- We may supply an initial value for one or more enumerators. The value used to initialize an enumerator must be a constant expression. A constant expression is an expression of integral type that the compiler can evaluate at compile time. An integral literal constant is a constant expression, as is a const object (Section 2.4, p. 56) that is itself initialized from a constant expression.
- An enumerator value need not be unique.
- It is not possible to change the value of an enumerator. As a consequence an enumerator is itself a constant expression and so can be used where a constant expression is required.
- Each enum Defines a Unique Type
- Each enum defines a new type. An object of enumeration type may be initialized or assigned only by one of its enumerators or by another object of the same enumeration type:
Section 2.8. Class Types
- In C++ we define our own data types by defining a class. A class defines the data that an object of its type contains and the operations that can be executed by objects of that type.
- Class Design Starts with the Operations
- Each class defines an interface and implementation. The interface consists of the operations that we expect code that uses the class to execute. The implementation typically includes the data needed by the class. The implementation also includes any functions needed to define the class but that are not intended for general use.
- When we define a class, we usually begin by defining its interfacethe operations that the class will provide. From those operations we can then determine what data the class will require to accomplish its tasks and whether it will need to define any functions to support the implementation.
- Defining the Sales_item Class
- A class definition starts with the keyword class followed by an identifier that names the class. The body of the class appears inside curly braces. The close curly must be followed by a semicolon.
- Beware:It is a common mistake among new programmers to forget the semicolon at the end of a class definition.
- The class body, which can be empty, defines the data and operations that make up the type. The operations and data that are part of a class are referred to as its members.The operations are referred to as the member functions (Section 1.5.2, p. 24) and the data as data members.
- The class also may contain zero or more public or private access labels. An access label controls whether a member is accessible outside the class. Code that uses the class may access only the public members.
- When we define a class, we define a new type. The class name is the name of that type.
- Each class defines its own scope (Section 2.3.6, p. 54). That is, the names given to the data and operations inside the class body must be unique within the class but can reuse names defined outside the class.
- Class Data Members
- The data members of a class are defined in somewhat the same way that normal variables are defined.
- The data members of a class define the contents of the objects of that class type.
- There is one crucially important difference between how we define variables and class data members: We ordinarily cannot initialize the members of a class as part of their definition. When we define the data members, we can only name them and say what types they have. Rather than initializing data members when they are defined inside the class definition, classes control initialization through special member functions called constructors
- Access Labels
- Access labels control whether code that uses the class may use a given member. Member functions of the class may use any member of their own class, regardless of the access level. The access labels, public and private, may appear multiple times in a class definition. A given label applies until the next access label is seen.
- The public section of a class defines members that can be accessed by any part of the program. Ordinarily we put the operations in the public section so that any code in the program may execute these operations.
- Code that is not part of the class does not have access to the private members.
- Using the struct Keyword
- C++ supports a second keyword, struct, that can be used to define class types. The struct keyword is inherited from C.
- If we define a class using the class keyword, then any members defined before the first access label are implicitly private; ifwe usethe struct keyword, then those members are public.
- Whether we define a class using the class keyword or the struct keyword affects only the default initial access level.
- There are only two differences between this class definition and our initial class definition: Here we use the struct keyword, and we eliminate the use of public keyword immediately following the opening curly brace. Members of a struct are public, unless otherwise specified, so there is no need for the public label.
- NOTE:The only difference between a class defined with the class keyword or the struct keyword is the default access level: By default, members in a struct are public; those in a class are private.
Section 2.9. Writing Our Own Header Files
- ordinarily class definitions go into a header file.
- In fact, C++ programs use headers to contain more than class definitions.
- Programs made up of multiple files need a way to link the use of a name and its declaration. In C++ that is done through header files.
- To allow programs to be broken up into logical parts, C++ supports what is commonly known as separate compilation. Separate compilation lets us compose a program from several files.
- 2.9.1. Designing Our Own Headers
- A header provides a centralized location for related declarations. Headers normally contain class definitions, extern variable declarations, and function declarations,
- Proper use of header files can provide two benefits: All files are guaranteed to use the same declaration for a given entity; and should a declaration require change, only the header needs to be updated.
- Some care should be taken in designing headers. The declarations in a header should logically belong together. A header takes time to compile. If it is too large programmers may be reluctant to incur the compile-time cost of including it.
- TIPS:To reduce the compile time needed to process headers, some C++ implementations support precompiled header files. For more details, consult the reference manual of your C++ implementation.
- Headers Are for Declarations, Not Definitions
- When designing a header it is essential to remember the difference between definitions, which may only occur once, and declarations, which may occur multiple times
- Compiling and Linking Multiple Source Files
- To produce an executable file, we must tell the compiler not only where to find our main function but also where to find the definition of the member functions defined by the Sales_item class. Let's assume that we have two files: main.cc, which contains the definition of main, and Sales_item.cc, which contains the Sales_item member functions. We might compile these files as follows:
- $ CC -c main.cc Sales_item.cc # by default generates a.exe # some compilers generate a.out
- $ CC -c main.cc Sales_item.cc -o main # puts the executable in main.exe
- where $ is our system prompt and # begins a command-line comment. We can now run the executable file, which will run our main program.
- If we have only changed one of our .cc source files, it is more efficient to recompile only the file that actually changed. Most compilers provide a way to separately compile each file. This process usually yields a .o file, where the .o extension implies that the file contains object code.
- The compiler lets us link object files together to form an executable. On the system we use, in which the compiler is invoked by a command named CC, we would compile our program as follows:
$ CC -c main.cc # generates main.o
$ CC -c Sales_item.cc # generates Sales_item.o
$ CC main.o Sales_item.o # by default generates a.exe;
# some compilers generate a.out# puts the executable in main.exe
$ CC main.o Sales_item.o -o main - You'll need to check with your compiler's user's guide to understand how to compile and execute programs made up of multiple source files.
- Many compilers offer an option to enhance the error detection of the compiler. Check your compiler's user's guide to see what additional checks are available.
- To produce an executable file, we must tell the compiler not only where to find our main function but also where to find the definition of the member functions defined by the Sales_item class. Let's assume that we have two files: main.cc, which contains the definition of main, and Sales_item.cc, which contains the Sales_item member functions. We might compile these files as follows:
- Beware:Because headers are included in multiple source files, they should not contain definitions of variables or functions.
- There are three exceptions to the rule that headers should not contain definitions: classes, const objects whose value is known at compile time, and inline functions (Section 7.6 (p. 256) covers inline functions) are all defined in headers. These entities may be defined in more than one source file as long as the definitions in each file are exactly the same.
- These entities are defined in headers because the compiler needs their definitions (not just declarations) to generate code.
- That const objects are defined in a header may require a bit more explanation.
- Some const Objects Are Defined in Headers
- Recall that by default a const variable (Section 2.4, p. 57) is local to the file in which it is defined. As we shall now see, the reason for this default is to allow const variables to be defined in header files.
- Generally speaking, a constant expression is an expression that the compiler can evaluate at compile-time. A const variable of integral type may be a constant expression when it is itself initialized from a constant expression. However, for the const to be a constant expression, the initializer must be visible to the compiler. To allow multiple files to use the same constant value, the const and its initializer must be visible in each file. To make the initializer visible, we normally define such consts inside a header file. That way the compiler can see the initializer whenever the const is used.
- There is one important implication of this behavior. When we define a const in a header file, every source file that includes that header has its own const variable with the same name and value.
- When the const is initialized by a constant expression, then we are guaranteed that all the variables will have the same value. Moreover, in practice, most compilers will replace any use of such const variables by their corresponding constant expression at compile time. So, in practice, there won't be any storage used to hold const variables that are initialized by constant expressions.
- When a const is initialized by a value that is not a constant expression, then it should not be defined in header file. Instead, as with any other variable, the const should be defined and initialized in a source file. An extern declaration for that const should be made in the header, enabling multiple files to share that variable.
- 2.9.2. A Brief Introduction to the Preprocessor
- The #include facility is a part of the C++ preprocessor. The preprocessor manipulates the source text of our programs and runs before the compiler. C++ inherits a fairly elaborate preprocessor from C. Modern C++ programs use the preprocessor in a very restricted fashion.
- A #include directive takes a single argument: the name of a header. The pre-processor replaces each #include by the contents of the specified header. Our own headers are stored in files. System headers may be stored in a compiler-specific format that is more efficient. Regardless of the form in which a header is stored, it ordinarily contains class definitions and declarations of the variables and functions needed to support separate compilation.
- Headers Often Need Other Headers
- Headers often #include other headers. The entities that a header defines often use facilities from other headers.
- Including other headers is so common that it is not unusual for a header to be included more than once in the same source file.
- Accordingly, it is important to design header files so that they can be included more than once in a single source file. We must ensure that including a header file more than once does not cause multiple definitions of the classes and objects that the header file defines. A common way to make headers safe uses the preprocessor to define a header guard. The guard is used to avoid reprocessing the contents of a header file if the header has already been seen.
- Avoiding Multiple Inclusions
- Beware:Names used for preprocessor variables must be unique within the program. Any uses of a name that matches a preprocessor variable is assumed to refer to the preprocessor variable.
- To help avoid name clashes, preprocessor variables usually are written in all uppercase letters.
- A preprocessor variable has two states: defined or not yet defined. Various preprocessor directives define and test the state of preprocessor variables. The #define directive takes a name and defines that name as a preprocessor variable. The #ifndef directive tests whether the specified preprocessor variable has not yet been defined. If it hasn't, then everything following the #ifndef is processed up to the next #endif.
- We can use these facilities to guard against including a header more than once:
#ifndef SALESITEM_H
#define SALESITEM_H
// Definition of Sales_itemclass and related functions goes here
#endif - Best Practices: Headers should have guards, even if they aren't included by another header. Header guards are trivial to write and can avoid mysterious compiler errors if the header subsequently is included more than once.
- This strategy works well provided that no two headers define and use a pre-processor constant with the same name. We can avoid problems with duplicate preprocessor variables by naming them for an entity, such as a class, that is defined inside the header. A program can have only one class named Sales_item. By using the class name to compose the name of the header file and the preprocessor variable, we make it pretty likely that only one file will use this preprocessor variable.
- Using Our Own Headers
- The #include directive takes one of two forms:
#include <standard_header>
#include "my_file.h" - If the header name is enclosed by angle brackets (< >), it is presumed to be a standard header. The compiler will look for it in a predefined set of locations, which can be modified by setting a search path environment variable or through a command line option. The search methods used vary significantly across compilers. We recommend you ask a colleague or consult your compiler's user's guide for further information. If the header name is enclosed by a pair of quotation marks, the header is presumed to be a nonsystem header. The search for nonsystem headers usually begins in the directory in which the source file is located.
- The #include directive takes one of two forms:
Exercise 2.3~Exercise 2.6::
// EXE-2.1.cpp : Defines the entry point for the console application.
#include "stdafx.h"
#include <iostream>
using namespace std;
int main()
{
unsigned char a(-1);
//So, if we assign -1 to an 8-bit unsigned char, the resulting value will be 255, which is 1 modulo 256.
cout << static_cast<int>(a) << endl;
//Exercise 2.3:
//If a short on a given machine has 16 bits then what is the largest number that can be assigned to a short? To an unsigned short?
short lowerBound = 0x8000;
short upperBound = 0x7FFF;
unsigned short upperBoundOfUnsigned = 0xFFFF;
cout << "The upper bound of signed short is " << upperBound << endl;
cout << "The lower bound of signed short is " << lowerBound << endl;
cout << "The upper bound of unsigned short is " << upperBoundOfUnsigned << endl;
//Exercise 2.4:
//What value is assigned if we assign 100,000 to a 16-bit unsigned short? What value is assigned if we assign 100,000 to a plain 16-bit short?
//When the value to be signed is out range, the PC will get the lower bits with the variable type's length;
//Here the PC will cut the value as 2bytes, 16bits;
//1000000 in HEX is 0xF4240, and the lower 16bits is 0x4240. The 0x4240 in dec is 16960.
//So the signed and unsigned short will both get the 16960 value for the sign bit of the 0x4240 is zero;
signed short val1 = 1000000; // expected 16990, but actually is 16960; why?
unsigned short val2 = 1000000; // expected 16975 why it is 16960
cout << "The value of signed short with the 1000000 is " << val1 << endl;
cout << "The value of unsigned short with the 1000000 is " << val2 << endl;
// Here try assign the 1032768 ( 0xFC240) assign to signed and unsigned short;
// The lower 16bits is 0xC240. For the sign bit is 1, so it will get different value;
val1 = 1032768;
val2 = 1032768;
cout << "The value of signed short with the 1032768 is " << val1 << endl;
cout << "The value of unsigned short with the 1032768 is " << val2 << endl;
return 0;
}
Exercise 2.9:
1024f is ilegal, 1024.0f works;
3.14UL, the UL is not defined, 3.14L works;
Exercise 2.10:
#include "stdafx.h"
#include <iostream>
using namespace std;
int main()
{
//Using escape sequences, write a program to print 2M followed by a newline. Modify the program to print 2, then a tab, then an M, followed by a newline.
cout << "2M\n" ;
cout << "2\tM\n";
return 0;
}
Exercise 2.11:
#include "stdafx.h"
#include <iostream>
using namespace std;
int main()
{
//Write a program that prompts the user to input two numbers, the base and exponent. Print the result of raising the base to the power of the exponent.
cout << "Please input the base and exponent(be care the sequence):" << endl;
int base, exponent;
cin >> base >> exponent;
int product(1);
for (int i = 0; i < exponent ; ++i)
{
product*=base;
}
cout << "The result is :" << product << endl;
return 0;
}
Define the data members of classes to represent the following types:
(a) a phone number (b) an address
(c) an employee or a company (d) a student at a university
Exercise 2.33:
Determine what options your compiler offers for increasing the warning level. Recompile selected earlier programs using this option to see whether additional problems are reported.
?
No comments:
Post a Comment