Algorithm Complexity
Written and composed by Laurent Haan
http://www.progressive-coding.com
Introduction to Algorithm Complexity
[...] an algorithm is a procedure (a finite set of well-defined
instructions) for accomplishing some task which, given an initial state,
will terminate in a defined end-state. The computational complexity and
efficient implementation of the algorithm are important in computing,
and this depends on suitable data structures.
Wikipedia: http://en.wikipedia.org/wiki/Algorithm
In Computer Science, it is important to measure the quality of algorithms, especially the
specific amount of a certain resource an algorithm needs. Examples of such resources
would be time or memory storage. Nowadays, memory storage is almost a non-essential
factor when designing algorithms but be aware that several systems still have memory
constraints, such as Digital Signal Processors in embedded systems.
Different algorithms may complete the same task with a different set of instructions in
less or more time, space or effort than other. The analysis and study of algorithms is a
discipline in Computer Science which has a strong mathematical background. It often
relies on theoretical analysis of pseudo-code.
To compare the efficiency of algorithms, we don't rely on abstract measures such as the
time difference in running speed, since it too heavily relies on the processor power and
other tasks running in parallel. The most common way of qualifying an algorithm is the
Asymptotic Notation, also called Big O.
Asymptotic Notation
The symbol O is used to describe an asymptotic upper bound for the magnitude of a
function in terms of another, simpler function.
This means that for x > k, when x tends to infinity, the value f(x) will always be inferior
to C *g(x) (with C a constant).
The idea behind this notation is that it allows us to qualify to efficiency of an algorithm by
telling us how often a certain algorithm will execute operations before terminating. Let's
start with a simple example:
void f ( int a[], int n )
{
int i;
printf ( "N = %d\n", n );
for ( i = 0; i < i =" 0;" i =" 0;" mid =" (" n =" mid;"> x )
i = mid + 1;
else
return mid;
}
return 0;
}
We can call this an O(N) algorithm and not be wrong because the time complexity will
never exceed O(N). But because the array is split in half each time, the number of steps
is always going to be equal to the base-2 logarithm of N, which is considerably less than
O(N). So an even better choice would be to set the upper bound to log N, which is the
upper limit that we know we're guaranteed never to cross. Therefore, a more accurate
claim is that binary search is a logarithmic, or O(log2 N), algorithm.
Bubble Sort
A sorting algorithm is an algorithm that puts elements of a list in a certain order. The
most used orders are numerical or lexicographical order. There exist numerous sort
algorithms and there efficiency vary greatly!
Bubble sort is a straightforward and simplistic method of sorting data that is used in
computer science education. The algorithm starts at the beginning of the data set. It
compares the first two elements, and if the first is greater than the second, it swaps
them. It continues doing this for each pair of adjacent elements to the end of the data
set. It then starts again with the first two elements, repeating until no swaps have
occurred on the last pass. Although simple, this algorithm is highly inefficient and is
rarely used except in education. For sorting small numbers of data (e.g. 20) it is better
than Quicksort.
void bubbleSort(int *array, int size)
{
int swapped = 0;
int x;
do
{
swapped = 0;
for (x = 0; x <>array[x+1])
{
swap(&array[x], &array[x+1]);
swapped = 1;
}
}
} while (swapped);
}
1. What is the worst case performance ?
2. What is the best case performance ?
3. Can you think of a way to improve the performance ?
Selection Sort
Selection sort is a simple sorting algorithm that improves on the performance of bubble
sort. It works by first finding the smallest element using a linear search and swapping it
into the first position in the list, then finding the second smallest element by scanning
the remaining elements, and so on. Selection sort is unique compared to almost any
other algorithm in that its running time is not affected by the prior ordering of the list, it
performs the same number of operations because of its simple structure.
void selectionSort(int *array, int size)
{
int x,y,min;
for (x = 0; x < min =" x;" y="x+1;" if="">< min =" y;" x =" 0;">= 0 && array[pos] > value)
{
array[pos+1] = array[pos];
pos--;
}
array[pos+1] = value;
}
1. What is the worst case performance ?
2. What is the best case performance ?
3. Can you think of a way to improve the performance ?
Heap Sort
Heapsort is a much more efficient version of Selection Sort. It also works by determining
the largest (or smallest) element of the list, placing that at the end (or beginning) of the
list, then continuing with the rest of the list, but accomplishes this task efficiently by
using a data structure called a heap, a special type of binary tree. Once the data list has
been made into a heap, the root node is guaranteed to be the largest element. It is
removed and placed at the end of the list, then the heap is rearranged so the largest
element remaining moves to the root . Using the heap, finding the next largest element
takes O(log n) time, instead of O(n) for a linear scan as in simple selection sort. This
allows Heapsort to run in O(n log n) time.
Merge Sort
Merge sort takes advantage of the ease of merging already sorted lists into a new sorted
list. It starts by dividing the list into two sublists which are then sorted again with Merge
Sort. This continues until the size of the list equals one (which is sorted). Then it merges
those sublists again into bigger lists and only a few simple steps are required to merge
two sorted lists.
function mergesort(m, start, end)
if length(m) ≤ 1
return m
else
middle = start+end / 2
left = mergesort(m, start, middle)
right = mergesort(m, middle,end)
result = merge(left, right)
return result
Since Merge Sort is a recursive implementation working on two independent sublists, it
can be efficiently implemented in parallel using threads. The speed gain of a parallel
execution is doubled on dual core processors.
Quick Sort
Quicksort is a divide and conquer algorithm, which relies on a partition operation: to
partition an array, we choose an element, called a pivot, move all smaller elements
before the pivot, and move all greater elements after it. This can be done efficiently in
linear time and in-place. We then recursively sort the lesser and greater sublists. Efficient
implementations of quicksort (with in-place partitioning) are somewhat complex, but are
among the fastest sorting algorithms in practice. Together with its modest O(log n) space
usage, this makes quicksort one of the most popular sorting algorithms, available in
many standard libraries. The most complex issue in quicksort is choosing a good pivot
element; consistently poor choices of pivots can result in drastically slower (O(n2))
performance, but if at each step we choose the median as the pivot then it works in O(n
log n).
Exercise: Stone Age boulders
Two stone age men have gathered an impressive collection of boulders in their cave, all
of different size and weight, standing neatly one after the other, in the order they have
been collected. To restore some order in the room, they want to arrange the boulders
from the smallest to the largest, with the smallest at the entrance of the cave and the
largest close to the back wall.
Each boulder is only represented by its weight, so the heavier it is, the largest it is (we
assume that they are all made of the same material). As there are only 2 stone age men,
and the space inside the cave is limited, they are only allowed to swap two boulders at a
time. Additionally, to save their energy, they want to use a method that allows them to
move the minimum necessary weight only.
Write an algorithm that takes an array of boulders and orders it from the smallest to the
largest, by only swapping two boulders at a time but with the least effort in terms of kilos
moved.
Example:
{5, 3, 1}
-> {1, 3, 5} and necessary effort: 1+5 = 6
{6, 4, 1, 2}
-> {6, 1, 4, 2}, effort 5
-> {6, 2, 4, 1}, effort 3
-> {1, 2, 4, 6}, effort 7
total effort: 15
No comments:
Post a Comment