Short Assignment 1 is due Friday.
Last time, we looked at the instance variables and some of the methods in the Counter
class in Counter.java. Today, we continue.
The next method in the Counter
class is toString
:
/**
* Return a String representation with at least 2 digits, padding with a
* leading 0 if necessary.
*
* @return a String representation of the counter with at least 2 digits.
*/
public String toString() {
DecimalFormat fmt = new DecimalFormat(FORMAT); // use at least 2 digits
return fmt.format(myValue);
The toString
method is called automatically whenever we need the string representation of an object. Notice that it returns a String
. If we don't supply the toString
method in a class definition and we try to convert the object to a string, we just get the object's address in memory, in hexadecimal (base 16), which is rarely useful. Here, the toString
method converts the counter's value to a character string of at least two digits. It uses the class DecimalFormat
that we imported up at the top of the file. The toString
method first creates a DecimalFormat
object with the Java keyword new
. The only way to create a new object in Java is via the keyword new
. The class variable FORMAT
is passed to the constructor for DecimalFormat
.
The new
operator does three things:
Let's take a look at the variable fmt
. It is a local variable, which means it exists only during the call to the toString
method. The variable is created when toString
executes, and it ceases to exist when toString
completes.
Now that you know that the new
operator returns a reference to the object created, you know what is assigned to fmt
: a reference to the DecimalFormat
object that new
created and ran a constructor on. Not surprisingly, then, fmt
is declared as a DecimalFormat
object. Except that it's not. It's actually declared as a reference to a DecimalFormat
object. If you've seen aliasing in C or Python, where two different variables hold the address of the same object, you can appreciate the difference between an object and a reference to an object.
Having created a DecimalFormat
object and assigning its address to the local variable fmt
, the toString
method then calls the format
method on this DecimalFormat
object, with myValue
as a parameter (the only parameter) to the format
method. We don't know how the format
method works, but that's abstraction acting in our favor. Of course, we haven't even said what the format
method does, but we can worry about that later.
What's important here is how we call a method on an object. We called fmt.format(myValue)
. Here, fmt
is a reference to a DecimalFormat
object, format
is a method of the DecimalFormat
class, and myValue
is the parameter to the call. The general form of a method call is
objectReference.methodName(parameters)
Python programmers have seen this syntax before, but C programmers have not.
I like to think of objects as "things" and methods as "actions." Indeed, most of the time, the name of a class should be a noun or a noun phrase. And the name of a method should include a verb. (With the obvious exception for constructors, which are required to have the same name as the class.)
Now, what does the format
method return? You can tell from how it's used. The toString
method just takes what format
returns and immediately returns it to its caller. And we know that toString
returns a String
or, more precisely, a reference to a String
. Therefore, the format
method must return a reference to a String
.
The last method in the Counter
class has the special name main
:
/**
* A main program to test the counter.
* (Including such testing programs is a good idea.)
*/
public static void main(String args[]) {
// Create variables that can reference two Counters.
Counter c1, c2;
c1 = new Counter(5); // wraps at 5
c2 = new Counter(); // wraps at 12
final int TIMES = 50;
System.out.println("c1\tc2\tsum");
// Show lots of Counter values.
for (int i = 0; i < TIMES; i++) {
System.out.println(c1 + "\t" + c2 + "\t" + (c1.getValue() + c2.getValue()));
// Tick both Counters.
c1.tick();
c2.tick();
}
c1.reset();
c2.reset();
System.out.println("After reset:\t" + c1 + "\t" + c2);
c1.set(4);
c2.set(10);
System.out.println("After set:\t" + c1 + "\t" + c2);
c1.set(5);
c2.set(-1);
System.out.println("After invalid:\t" + c1 + "\t" + c2);
}
The main
method is where program execution starts. In Eclipse, we always have to say which class contains the main
method in which we want to start executing the program. So we can have lots of main
methods in an Eclipse project, but only one will be the main
method.
The header of the main
method is boilerplate: public static void main(String args[])
. As we'll see, a static method is not called on any object. The main
method must be static, because it's the first thing executed, and so no objects can possibly have been created at the time main
starts executing. The parameter String args[]
allows for command-line arguments. The []
part indicates that args
is actually an array of references to String
. C programmers have seen arrays. So have Python programmers, but they're called "lists" in Python.
What does the main
method do? It starts by declaring local variables c1
and c2
as references to Counter
objects. Then it creates two Counter
objects. The first one has a limit of 5; the new
operator calls the one-parameter constructor. The second object has a limit of 12; the new
operator calls the parameterless constructor (which is also called a default constructor). Remembering that new
returns a reference to the object it creates, we see that c1
is assigned a reference to the Counter
whose limit is 5 and c2
is assigned a reference to the Counter
whose limit is 12.
Next, main
declares a final local variable TIMES
with the value 50. Then, main
prints some output to the console by calling the method println
on a built-in object referenced by System.out
. The println
method takes a string and prints it on the console. As in C and Python, we can include escaped characters in a string literal; here, \t
denotes the tab character.
Then comes a for-loop. The syntax is exactly the same as in C, but this syntax will be new for Python programmers. Java for-loops are quite a bit more general than in Python, and they do not require lists (or arrays) to be involved. A for-loop has the form
for (initial; test; update)
body
and it is equivalent to a while-loop with the form
initial
while (test) {
body
update
}
Therefore, we could have written the for-loop as the equivalent while-loop
int i = 0;
while (i < TIMES) {
System.out.println(c1 + "\t" + c2 + "\t" + (c1.getValue() + c2.getValue()));
// Tick both Counters.
c1.tick();
c2.tick();
i++;
}
Although you might find the Python loop header for i in range(TIMES):
to be easier, the more general form of Java for-loops has advantages that we'll see later on.
The body of the for-loop calls the tick
method on each of the two Counter
objects, using the objectReference.methodName(parameters) syntax.
After the for-loop terminates, the main
method calls reset
on each Counter
and then prints the values of the two objects. There's a lot going on in the parameter to println
, so let's take a look. The parameter to println
must be a String
(more precisely a reference to a String
). But we're "adding" the string literal "After reset:\t"
and the reference c1
. To whatever that addition gives back, we're adding the string literal "\t"
, and then we add the reference c2
. What is going on?
Java assumes that if you apply the +
operator and one of the operands is a String
(again, really a reference to a String
…do you see how a string literal's type is really "reference to a String
object"?), then the other operand is converted to a String
if necessary and concatenated with the String
operand. And so we convert the Counter
object that c1
references to a String
and concatenate that String
with the string literal "After reset:\t"
. Just how is that Counter
object converted to a String
? By an implicit call to its toString
method! In other words, Java will automatically call toString
on this Counter
object and use the reference to a String
that toString
returned. We concatenate the two strings, and then we concatenate the string literal "\t"
to that result, and then call toString
implicitly on the Counter
that c2
references, concatenating the string returned by toString
what we've built up so far. Then we concatenate another tab character, followed by concatenating an int value that is implicitly converted to a String
. That int value is the sum of the values of the two Counter
objects, using values returned by calls to getValue
on each of the two objects. Notice how the parentheses around that last sum makes it so that last sum is not concatenation, but plain old addition. That ultimate string is what is passed to println
and printed on the console. Whew!
A word about the toString
method. Remember that it calls the format
method of the DecimalFormat
class on an object that was initialized with a string of 00
. The format
method in this situation will take its parameter (the myValue
instance variable of a Counter
) and return its string representation, but padded with leading zeros if necessary. For example, if myValue
is 7, then format
returns 07
, but if myValue
is 107 then format
returns 107
.
From here on, the main
method is easy. It sets the value of the Counter
referenced by c1
to 4 and the value of the Counter
referenced by c2
to 10, and then it prints out the values again. Then, main
sets the value of the Counter
referenced by c1
to 5, and it attempts to set the value of the Counter
referenced by c2
to $-$1. But $-$1 is not a legal value for any counter, and so the value 0 will be used instead. Finally, main
prints the new values of the two Counter
objects.
I said that abstraction is an important concept. How does it play into this example? Because all the instance variables of a Counter
object are private, no code from outside the Counter
class can access them. The only way to interact with a Counter
is by calling its methods. The caller knows what the methods do, but not how it does them. Granted, the implementations of the tick
, set
, reset
, and getValue
methods are pretty obvious, but the toString
method could have been implemented in several different ways. The caller cares not. As long as the methods do what they're supposed to do, the caller should be happy.
One of the biggest changes for Python programmers in moving to Java is that you have to declare your variables and say what type each will hold. It isn't that Python didn't have types. It was that Python checked them at run time. Java checks them statically at compile time rather than at run time. By saving all of those run time-checks, Java code is able to run significantly faster than Python code.
Let's remember what a type is and why we have them. At the basic level, computer memory is a long sequence of 1's and 0's, each called a bit (BInary digiT). Memory is broken into eight-bit chunks, called bytes, each with its own address. There are may things that you might want to store in a computer: numbers, characters, colors, sounds, wildebeest objects, etc. Even computer instructions are stored in memory. Each "type" of thing has its own encoding as bits.
When programming in machine language the programmer has to remember what each byte is supposed to represent. If he or she starts treating computer instructions as if they were integers, or integers as if they were computer instructions, bad things happen. (That is what many computer viruses do. They store a bit pattern as data, and then trick the program into running that pattern as code.)
The two things that a type tells us are
How to interpret the bits as a representation of the particular data.
Which operations are valid on that data.
Smalltalk, an early OO languge, has a very elegant solution to the type problem. Everything is an object, and the operations that are valid on the object are precisely those defined by its methods. The methods understand how to interpret the bit patterns. It is elegant, but having to ask a number to add itself to another number via a method call is a lot slower than using the hardware "add" instruction. As a result, Smalltalk is s-l-o-w.
Java compromises. Most things are objects whose types are defined by the methods that they implement and the data that they store. The object data is stored somewhere in memory, and variables hold references to the object that tell how to find it. The variables c1
and c2
in the Counter
class are examples. In practice the reference is a memory address, but Java treats the reference abstractly and uses it to communicate with the object. (Unlike C and C++, Java has no way to manipulate the memory addresses.) All references in Java are the same size, no matter how many instance variables and methods the object has.
For speed's sake, Java has eight primitive types that are not objects:
Four types for storing integers of in different numbers of bytes: byte
(1), short
(2), int
(4), and long
(8). We will almost always use int
, which has a range between just under negative 2 billion and to just over 2 billion. (That's because an int
has 32 bits. In 32 bits, we can store all integers in the range - 231 to 231 - 1, and $2^{31} = $ 2,147,483,648.) byte
is -128 to 127. short
is basically plus or minus 32,000 ($2^{15} = $ 32,768). long
is huge: - 263 to 263 - 1, and $2^{63} = $ 9,223,372,036,854,775,808.
Two types for storing numbers with fractions in scientific notation: float
(4 bytes) and double
(8 bytes). float
has 7 or 8 significant decimal digits, double
has about 15 significant decimal digits.
char
, a type for storing characters in a system called Unicode (2 bytes).
boolean
, which stores true
or false
.
Values of a primitive type are actually stored in the variable itself, and so these primitives have different lengths in memory. Therefore, Java has two kinds of variables. If a variable is of a primitive type, the variable contains the actual data itself (the bit pattern representing the data). If a variable is of type reference to an object, then it contains a reference, and the data itself is stored elsewhere.
Why is this distinction important? My wife and I have a joint checking account. We each have an ATM card. The cards are different and have different names on them, but the refer to the same checking account. If I withdraw money with my ATM card, there is less money in the account, and if my wife then looks at the balance it will be smaller even though she did nothing with her ATM card.
In this analogy, the account is the object (and bank account objects are a common example in textbooks). The ATM cards are the variables, each holding a reference to the same account. Any changes made by either of us to the account via our ATM card are seen by both. On the other hand, if my wife has her ATM card re-programmed so that it refers to her personal account (changes the reference stored in the variable), that won't affect my card or the account. She just will no longer be able to use that ATM card to withdraw money from our joint account, because it no longer refers to our joint account.
As in Python and C, when you divide two integer types, you get integer division, which truncates the result.
Integer literals (e.g., 123
) are assumed to be of type int
, unless followed by a L
or l
, in which case they are long
.
Floating-point literals (e.g., 3.14159) contain a decimal point and are assumed to be double
, unless followed by an F
or f
, in which case they are float
. The scientific notation form that you've seen in Python or C works in Java, too, e.g., 6.02e23
. E
or e
means "10 to the".
As in C, character literals are written between single quotes, e.g., 'a'
. Pythonistas: Python does not have single-character variables or literals. In Python, 'a'
is a string consisting of one character, but in Java it's a character literal, which is a different type from a string. As in C and Python, '2'
means the character 2, not the integer 2.
As in C and Python, use the backslash as an escape character. For a character literal that's a single quote: '\''
. For a backslash: '\\'
. And '\n'
is the newline character, with '\t'
being the tab character.
In Java, a String
is an object rather than a primitive type. String literals are surrounded by double quotes: "This is a string"
. Strings can be concatenated using the +
operator. To include a double-quote in a string, escape it with backslash: "Strings are enclosed by \" characters"
. String literals cannot extend over multiple lines. If your String literal is that long, break it into pieces on separate lines and use the +
operator to concatenate them.
Java has type conversions. Some happen automatically. Others must be specifically requested via a process called casting. The distinction is whether the conversion can result in the loss of significant information. If an int
is used where a double
is expected, it is safe to do the conversion and it happens automatically. (Every int
can also be represented as a double
.) If a double
is used where an int
is expected, we have a problem. There is no good integer representation for 3.3, for instance. So we have to make a specific cast, which trucates the fractional part. You cast by putting the desired type name in parentheses to the left of the expression being cast:
double d = 3.3;
int n = (int) d;
Going in the "safe" direction is called widening; going the other way is called narrowing. The order of primitive types from narrowest to widest is
byte => short => int => long => float => double
It is also widening to go from a char
to an int
or a double
, because the Unicode in a char
is stored as an integer value.
As I said before, think of a class like a blueprint, and of an object like a house. Defining a class does not create an object of that class, just as drawing a blueprint does not cause a house to be built. A blueprint says that when we build a house, here's what it will be like. And a class says that when we create an object, here are the instance variables and methods it will have.
Although Java does not explicitly require us to, we observe various conventions for capitalization styles of various types of identifiers:
Class names conventionally use title case: the first letter of each word in the name is capitalized. Example: BarrelOfMonkeys
Variables and method names conventionally start with a lowercase letter and then capitalize successive words. Example: poundsOfBarbecue
Constants (final variables) are all uppercase, with words separated by underscores. Example: AVOGADROS_NUMBER
.
Java has three distinct kinds of variables:
Instance variables. The way that an object stores its "personal" data. One copy of each instance variable is created each time an object is created. The object's instance variables continue to exist as long as the object exists.
Class variables (also known as static variables). These belong to the class, not to any particular object. So there is exactly one per class, no matter how many objects of the class there are. These variables exist before any objects are created. They last as long as the program is running. If any object changes the value of a class variable the value is changed for all objects of that class, because there is only one, and it's shared among all the objects in the class.
Local variables. These are "scratch paper" created within a method. A local variable is created when execution reaches the variable's declaration, and it goes away no later than when the method returns. (Actually, when the program reaches the end of the "block" where the variable is declared. Blocks are delimited by curly braces. If you re-enter a block, a new copy of the variable gets created. Any previous value is gone.)
Any variable declared within a method is a local variable. Parameters are always local variables for their methods.
If a variable is declared outside a method, then it's either an instance variable (if the declaration does not include static
) or a class variable (if the declaration includes static
).
When you declare an instance variable, static variable, or method, you should declare it as either public
or private
.
If a variable is public, then it can be read or written from anywhere in the program. If a variable is private, then only methods in its class can read or write it. Most of the time, instance and class variables should be private. That way, only the methods in the class can see them. That's important from the point of view of abstraction: code from outside the class should interact with objects in the class only through the methods.
If a method is public, then it can be called from anywhere in the program. If a method is private, then only methods in its class can call it. Most of the time, methods are public. Private methods can be useful, however, especially as "helper" methods within a class that code from outside the class need not know about.
Later in the course, we'll see another way to declare variables and methods: protected
.
Just as in C and Python, when you call a method, parameters get their values via call by value. Some terminology:
A parameter declared in a method header is a formal parameter. Remember that a formal parameter is also a local variable.
The value of a parameter supplied at the point of call is an actual parameter, or an argument.
In call by value, the value of each actual parameter is copied into the corresponding formal parameter. Actual parameters match up with formal parameters one by one, left to right. A formal parameters must be a single variable. An actual parameter can be any expression of the correct type.
Suppose that an object needs a reference to itself. Java supplies the keyword this
for just such a purpose. In Python, self
accomplishes the same purpose. (Note that whereas Python requires the first parameter of a method to be self
, you don't make a parameter for this
in Java.)
Here's one simple reason you'd want to use this
. Suppose that we wrote our Counter
class as follows:
public class Counter {
private int limit; // upper limit on the counter
private int value; // current value
...
public Counter(int limit) {
// How to assign the parameter limit to the instance variable limit?
}
...
}
So how could we assign the parameter limit
to the instance variable limit
? We can't say limit = limit;
because that would just assign a variable's value to the same variable. The problem here is that we have an instance variable and a local variable (the formal parameter) with the same name. Java's rule is that when such a conflict occurs, the local variable wins. So the line limit = limit;
assigns the value of the formal parameter limit
to the formal parameter limit
.
But we can use this
to say that we mean the instance variable:
public Counter(int limit) {
this.limit = limit;
}
When we write this.limit
, we're saying "the instance variable limit
in the object that the constructor is running on." Similarly, we could write the set
method as
public void set(int value) {
if (value >= 0 && value < limit)
this.value = value;
else
this.value = 0;
}
The tests use the formal parameter, and the assignment is to the instance variable of the object that set
is called on.
Now, you might wonder whether we really need this
. After all, if we had the sequence
c = Counter();
c.set(4);
couldn't we just refer to c
in the set
method? No way! First, c
is probably local to some other piece of code, and it's not known within the set
method. Second, what if we had two Counter
objects:
c1 = Counter();
c2 = Counter();
c1.set(4);
c2.set(7);
Now how do you know which Counter
you want to use in set
? Sometimes it's the one that c1
references, and sometimes it's the one that c2
references. The pronoun this
always references the object that the method was called on. Problem solved.
As you've seen, you call System.out.println
to print to the console. The String
that you supply as a parameter is what's printed. If you don't supply a parameter, then a blank line prints to the console.
If you want to print to the console but not end with a newline, then call System.out.print
. For example, if you were printing a prompt and wanted the typed input to appear on the same line as the prompt, you could write
System.out.print("Enter a prime number: ");
C programmers are familiar with the ++
and --
operators for incrementing and decrementing. (It might be hard to see but that's two minus signs in a row, with no space between them.) When you use these operators, they have to apply to variables.
int x = 7;
x++; // legal; x gets the value 8
7++; // not legal; 7 is not a variable
int y = 10;
(x+y)++; // not legal; (x+y) is not a variable
y--; // legal; y gets the value 9
You can use the ++
and --
operators in expressions. If you put the ++
or --
before the variable, then the increment or decrement happens before the variable's value is used in the expression. If you put the ++
or --
after the variable, then the increment or decrement happens after the variable's value is used in the expression.
int x = 7, y = 10;
int z = ++x * y--;
Here, x
is incremented before it's used in the expression, and so x
gets the value 8, and the value of x
used in the expression is 8. But y
is decremented after it's used in the expression, and so although y
will get the value 9, the value of y
used in the expression is 10. Hence, z
gets the value 80.
As you can see, you can make some complicated expressions using the ++
and --
operators. Just because you can doesn't mean you should!