Python Lesson 3
Lesson outline
Python scalars
Native Python lists
Python control structures: conditionals
Python control structures: loops
Application to NumPy data
Exercises
Standard Python scalar types
Python has several built-in types for handling numerical data, strings, Boolean (logical True
or False
) values, and dates and times; they are called scalars.
Numeric scalars
There are two numeric scalars, float and int. The float type corresponds to a double-precision (64-bit) floating-point number. The int type is an arbitrary precision signed integer.
Floats are double precision by default (no single precision type) and they can be expressed in scientific notation
Strings
Python is considered a very powerful programming language for string manipulation. Strings are defined surrounding the text by single or double quotes
You can use one or the other, but always start and end with the same. Depending the one you are using, you can include the other in the string, as in string_3
. If you need to include the same quote than the marker in the string you have to escape it with a backslash, as in string_4
.
Usin triple quotes you can define a multiline string
In strings you can use the count
method
As in the previous example, the backslash is an escape character used to specify special characters.
You can access string characters by its index
Strings are immutable objects, you cannot alter them by index, try
However, you can append or prepend characters to a string with the + operator, that concatenates them
You can also use the replace method
You can join two strings using the sum symbol. This is very used when making graphs. To add a numerical or other scalar to the string you can use the str
function.
You can further control strings using the format()
method as in the following examples
For more info check https://www.python.org/dev/peps/pep-3101/#format-specifiers.
Booleans
The Booleans values in Python are True
and False
and can be combined with the keywords or
, and
, and not
.
The None
type
None
typeThe Python null type is None
, which is returned by any function that does not explicitly return a value.
When functions are defined, None
is a common default value for function arguments (check Lesson 4).
Type casting
You can transform among Python objects with the str
, float
, int
, and bool
commands
Native Python Lists
The native Python data structures are lists, dicts (AKA hashes), tuples, and sets. In this lesson we cover only lists and the rest will be covered in the following lesson.
Defining Python Lists
A list is a sequence of Python objects (maybe scalars or not, and may have also different types) that has variable lenght and is mutable. They can be defined using square brackets or the list
command.
You can slice Python lists
As it happens with Numpy, when dealing with native Python structures, one needs to be always aware that Python uses the so called pass by reference and not the pass by value strategy of other programming languages. Thus, an assignment implies a reference to data in the righthand side. If we execute
the data are not copied and we do not have two data instances. Instead, both lists, list_example_2
and list_example_3
point to the same data. Having this in mind, we can understand what follows
You can use the list
function if you want to make a copy of the list
We will revisit this important aspect once we arrive to the subject of function definition in Python, where we should be careful to avoid unwanted side effects.
You can also create nested lists
Modifying Python lists
And you can add elements to the end of a list using the append
method.
The insert
method (computationally more expensive than the append
method) allows for the insertion of elements at any place in the list
The opposite method to insert
is pop
, which removes an element at a given place of the list
You can also remove elements by value using the remove
method, which removes the first appearance of a given list element.
The in
keyword allow to check if there exists an element of the list
And you can obtain the index of a given element with the index()
method that finds the given element and returns its position (remember, indeces start in zero). If the same element is present multiple times, the index of the first occurrence of the element is returned.
Adding two lists concatenates them
You can use the method extend
also to join two lists, which is more efficient than directly adding them
When doing calculations, one should always have in mind that a native Python list is a sequence of objects that can be of different types. This flexibility comes upon a cost on efficiency when compared to Numpy ndarrays, that store data of a single type in contiguous memory blocks.
Making choices in Python: conditionals
Conditional alter the flow of a program depending on statements True
or False
value.
Possible conditionals
a == b
:True
ifa
equalsb
a != b
:True
ifa
is not equal tob
a < b
,a <= b
:True
ifa
is less than (less than or equal) tob
a > b
,a >= b
:True
ifa
is greater than (greater than or equal) tob
a is b
:True
ifa
andb
reference the same Python objecta is not b
:True
ifa
andb
reference different Python objects
The if ... elif ... else
control structure
if ... elif ... else
control structureThe simplest case is the single if
conditional with syntax
The indented code block only is run if the conditional
statement evaluates to True
.
You can test several alternatives with elif
blocks and add a final default clause if none of the previous are true using else
We can use this conditional structure to calculate the body mass index from the height and weight of a person (in m and kg units)
The ternary expression
This expression combines an if...else..
statement with single blocks formed by a single command into a single line. It's useful and terse but it does not help increasing the readability of the code.
The syntax is
and it is equivalent to
An example is
Control Structures in Python: loops
Control structures alter the program flow. We explain the for
and while
control structures.
The for
loop
for
loopThe for
control structure defines a loop over an iterator with a syntax
A string is a possible iterator
You can control the loop flow using the keywords continue
and break
. The continue
statement skip the rest of the block and continues with the next iteration
The break
statement finish the loop and continues with the ensuing program statements.
The Python statement range
is specially useful for working with loops. The command range(start, stop, step)
provides and iterator starting in start
, ending in stop-1
and with differences of step
(default value one).
In this way we can sum all multiples of 7 or 13 from zero to ten thousand
If we want to sum all even integers from 2 to 100 we proceed as follows
Note that statements as total = total + number
are so common that they can be expressed more succintly as total += number
. There are equivalent statements -=
, *=
and /=
.
The list
statement allows to materialize an iterator
A very common situation is that in a loop we want to access the indexes of the elements that are being iterated. This can be done as follows
As this is a common need, Python has a built-in sequence method called enumerate
that returns a sequence of index, values
(a tuple, explained in next lesson).
The while
loop
while
loopThe while
control structure defines a loop depending on a condition. The loop block is run until the conditional becomes False
with a syntax
For example
You can also use the keywords continue
and break
to control a while
loop flow.
Sometimes you can fall into an infinite loop as this one
In this case it is important to know how to interrupt the loop: click on menu entry Kernel -> Interrupt
. You can find an application of an infinite loop in the next lesson.
Sometimes loops become infinite unexpectedly. The following example illustrates the point mentioned above about the possible problems arising when using conditionals with floats. The while loop, seemingly inoffensive, depending on the value of the i2
variable can end up as an infinite loop (try e.g. i2 = 0.3,0.7,0.8,0.9
).
Testing floats equality (*)
Be always aware that due to the finite precision in their internal representation and rounding/numerical errors conditionals can be tricky when applied to floats. As a rule, testing the equality of two floats x
and y
as x==y
is highly discouraged and may have unpredictable side effects.
The way to avoid such problems is to replace float equality comparisons for inequalities, in this case while i1 <= i2:
or, even better, compare integer quantities. When you need to compare to floats, define a tolerance -depending on the nature of the problem- and compare the difference of the two values versus this tolerance. You can also check the useful NumPy function np.isclose
.
Application to NumPy data
We can apply the control structures to NumPy data. If we load again a set of data
If, for example, we want to check how many years have attained a monthly average temperature larger than a given threshold we can do this as follows
We can also, using a second loop, print which months have temperatures larger than the stablished threshold.
Of course, you can skip loops using NumPy Boolean matrices indexing. Last commands can be replaced to a very concise form as
The np.any
NumPy command tests whether any array element along a given axis evaluates to True
. Note that Boolean values are treated as 1 and 0 in arithmetic operations.
To find elements in an array verifying a given condition it is of interest the function np.where
. In can be used in two ways. The first one just specifying the condition provides the positions where the condition is fulfilled:
This is equivalent to the use of:
which provides the position or the content
The other option is to specify the condition and to provide two matrices that should be broadcastable. This use resembles the ternary expression
if true you obtain the corresponding element of x
and -1
otherwise.
In this example we use two matrices
Exercises
Exercise 3.1: Meking use of conditionals compute the equivalence between the age of a dog and a human taking into account the rules that follow: (1) A one-year-old dog is equivalent to a 14-year-old human being. (2) A dog that is two years old is considered equivalent to a 22 year old person. (3) Each additional dog year is equivalent to five human years.
Exercise 3.2: Prepare a loop such that given when it is given as an input a string variable it produces as output another string variable equal to the first string in reversed order, e.g. a = "abcd" and reverseda = "dcba".
Exercise 3.3: Use a loop to convert the string "Testing loops and strings…" into another string, changing spaces into "_" (underscore).
Exercise 3.4: Given the list
AA = [1.0, -2.0, 3.0, 5.5, 0.3]
and considering that these are the values of the coefficients for a polynomialPx = AA[0] + AA[1]*x + ... + AA[n]*x**n
, prepare a loop that computes the polynomial value for a given independent variablex
value. For example, in the given casePx = 7.8
forx = 1
.Exercise 3.5: Compute for the loaded temperature dataset the average seasonal temperatures and gives as a results a Python list with these temperatures.
Exercise 3.6: Build a code that, for a given month and considering the loaded temperature dataset, finds the mean and minimum temperature values for the selected month and then it builds a Python list with the years that have an average temperature for the selected month that is less than the average value of the minimum and mean temperature.
Last updated
Was this helpful?