\documentclass[12pt]{article}

\author{Melany Diaz}
\title{Programming Assignment 2: Insertion Sort Analysis}
\date{\today}

% math
\usepackage{amsmath,amssymb}
%\usepackage{amsthm} % uncomment to enable theorem environment

\usepackage{listings}%used to present classes and java code

\usepackage{indentfirst}

\usepackage{graphicx}

% 1 inch margins
\usepackage[margin=1in]{geometry}

\usepackage{fancyhdr}
\pagestyle{fancy}
\lhead{CS 215}
\chead{Programming Assignment 2: Insertion Sort Analysis}
\rhead{Melany Diaz}


\begin{document}

\maketitle

\begin{abstract}

This report validates and analyzes the run-time complexity of Insertion Sort, Merge Sort, Heap Sort, and Quick Sort. By measuring their best, worst, and random case run-times for multiple input sizes, we hope to discover the conditions, if any exist, under which one sort is more efficient than another.

\end{abstract}

\section*{MOTIVATION AND BACKGROUND}

Sorting data is a problem that arises often in computer applications, for example when ordering the entries of a file system or preparing data so that a value can be searched for quickly. Different factors affect the time required to sort, so many algorithms exist for organizing comparable values. Because the exact speed of an algorithm depends on the details of the data to be sorted, the run-time of an algorithm is typically discussed in terms of the size of its input. For example, if an algorithm must process $n$ objects, its run-time might be linearly proportional to $n$, written $O(n)$. Other common growth rates are logarithmic, polynomial, and exponential in $n$. Yet the run-time of an algorithm is not solely dependent on the size of the input; the execution time of many sorting algorithms also varies with the pre-existing order of the elements. For example, a sorting algorithm may take much less time on a set of objects that is already in sorted order than on a set in random order. Consequently, when analyzing the time complexity of an algorithm, one must keep in mind its best case, worst case, and random case run-times. Typically, the best case is a collection that is already in sorted order, the worst case is a collection sorted in the opposite order, and the random case is a collection in no particular order, although the exact inputs differ from algorithm to algorithm.\\

Some sorting algorithms include Insertion Sort, Merge Sort, Heap Sort, and Quick Sort. Each of these has its own benefits and disadvantages, as well as particular situations in which it is more appropriate than the others. For example, Insertion Sort works fastest when there isn't much data, yet for larger amounts of input, the other sorting algorithms surpass the speed of Insertion Sort. Because of this, it is important for computer scientists to familiarize themselves with the different run-time behaviors and complexities of these sorting algorithms.\\


The purpose of this report is to study and analyze the complexity of Insertion Sort, Merge Sort, Heap Sort, and Quick Sort. By validating and comparing the run-time complexities of the best, worst, and random cases of each, we hope to gain a better understanding of these algorithms' run-times. Through analyzing each case, we will discover the conditions, if any exist, that would make one sort algorithm more efficient than the others. We also hope to discover at what input size $n_0$ the run-time behavior of each implementation begins to match that algorithm's asymptotic run-time. \\

As a final goal, we aim to discover which algorithm has the biggest ``leading constant.'' Because Insertion Sort has a reputation for being surpassed in speed by other algorithms when $n$ gets large, we predict, \textit{a priori}, that it will have the biggest leading constant.


\section*{PROCEDURE}
In order to implement Insert Sort, Merge Sort, Heap Sort, and Quick Sort, it is important to understand some details about each. Those details include their preconditions and postconditions, their invariants, and those invariants' properties. \\

\subsection*{Insert Sort}

Our first algorithm, Insert Sort, is best known for being efficient with small input sizes. Insertion Sort moves from the beginning of the list to the end of it exactly once. During the process of sorting, if a value is out of place, it is moved back to where it belongs. Note its following conditions.\\

\textbf{Input:} A sequence of $n$ elements $\langle a_1, a_2, \ldots, a_n \rangle$.\\

\textbf{Output:} A permutation $\langle a_1', a_2', \ldots, a_n' \rangle$ of the input sequence such that \\
\indent $a_1' \le a_2' \le \cdots \le a_n'$.\\

The pseudo code for Insert Sort is as follows.

\lstinputlisting[language=java, breaklines = true]{InsertionSortPseudo.txt}

It is important to note the loop invariant for Insertion Sort and its correctness. \\

\textbf{Loop invariant:} At the start of each iteration of the \textbf{for} loop of lines 1--8, the subarray $A[1..j-1]$ consists of the elements originally in $A[1..j-1]$, but in sorted order.\\

To show that the loop invariant implies the algorithm is correct, we must show its initialization, maintenance, and termination. \\

\textbf{Initialization: } We start by showing that the loop invariant holds before the first loop iteration, when $j = 2$. The subarray $A[1..j - 1]$, therefore, consists of just the single element $A[1]$, which is in fact the original element in $A[1]$. Moreover, this subarray is sorted (trivially, of course), which shows that the loop invariant holds prior to the first iteration of the loop.\\

\textbf{Maintenance: } Next, we tackle the second property: showing that each iteration maintains the loop invariant. Informally, the body of the for loop works by moving $A[j-1], A[j-2], A[j-3]$, and so on by one position to the right until it finds the proper position for $A[j]$ (lines 4--7), at which point it inserts the value of $A[j]$ (line 8). The subarray $A[1..j]$ then consists of the elements originally in $A[1..j]$, but in sorted order. Incrementing $j$ for the next iteration of the for loop then preserves the loop invariant.\\

\textbf{Termination:} Finally, we examine what happens when the loop terminates. The condition causing the for loop to terminate is that $ j  > A.length = n$. Because each loop iteration increases $j$ by 1, we must have $j = n + 1$ at that time. Substituting $n + 1$ for $j$ in the wording of loop invariant, we have that the subarray $A[1..n]$ consists of the elements originally in $A[1..n]$, but in sorted order. Observing that the subarray $A[1..n]$ is the entire array, we conclude that the entire array is sorted. Hence, the algorithm is correct.\\

Now that we know the algorithm's implementation details, we conclude by writing an Insertion Sort class that we can use to measure the run-times for Insert Sort. One possible final implementation is shown below.\\

\includegraphics[]{InsertSortPhoto}
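
For readers who cannot view the image, the following is a minimal sketch of a generic Insertion Sort in Java. It is not necessarily identical to the class pictured above: the class name \texttt{InsertionSortSketch} and the method name \texttt{sort} are assumptions made for illustration, and the indices are 0-based rather than the 1-based indices of the pseudocode.

```java
import java.util.Arrays;

/** Hypothetical sketch; names are illustrative, not the report's actual class. */
final class InsertionSortSketch {

    /** Sorts the array in place into nondecreasing order. */
    static <T extends Comparable<? super T>> void sort(T[] a) {
        // j marks the first element of the still-unsorted suffix (0-based).
        for (int j = 1; j < a.length; j++) {
            T key = a[j];
            int i = j - 1;
            // Shift every element larger than key one slot to the right.
            while (i >= 0 && a[i].compareTo(key) > 0) {
                a[i + 1] = a[i];
                i--;
            }
            // Loop invariant restored: a[0..j] holds its original elements, sorted.
            a[i + 1] = key;
        }
    }

    public static void main(String[] args) {
        Integer[] data = {5, 2, 4, 6, 1, 3};
        sort(data);
        System.out.println(Arrays.toString(data)); // prints [1, 2, 3, 4, 5, 6]
    }
}
```

Because the type parameter only requires \texttt{Comparable}, the same method sorts strings or any other comparable element, not just integers.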


\subsection*{Merge Sort}
Our next algorithm, Merge Sort, is best known for doing its comparisons during a merging step. A merging step combines two sorted lists into a single sorted list. During this step, the first available element of each list is compared and the lower value is appended to the output list. When either list runs out of values, the remaining elements of the opposing list are appended to the output list. \\

The key operation of Merge Sort is the merging of two sorted sequences. This is done by calling an auxiliary procedure called MERGE. Another notable component of MERGE is the use of a sentinel, a special value used to simplify the code. Note the following conditions necessary to implement Merge Sort.\\

\textbf{Input:} A sequence of $n$ elements $\langle a_1, a_2, \ldots, a_n \rangle$.\\

\textbf{Output:} A permutation $\langle a_1', a_2', \ldots, a_n' \rangle$ of the input sequence such that \\
\indent $a_1' \le a_2' \le \cdots \le a_n'$.\\
The pseudo code for Merge Sort is as follows.

\lstinputlisting[language=java, breaklines = true]{MergeSortPseudo.txt}

It is important to note the loop invariant for the MERGE procedure of Merge Sort and its correctness. \\

\textbf{Loop invariant:} At the start of each iteration of the \textbf{for} loop of lines 12--17, the subarray $A[p..k-1]$ contains the $k - p$ smallest elements of $L[1..n_1+1]$ and $R[1..n_2+1]$, in sorted order. Moreover, $L[i]$ and $R[j]$ are the smallest elements of their arrays that have not been copied back into $A$.  \\

To show that the loop invariant implies the algorithm is correct, we must show its initialization, maintenance, and termination. \\

\textbf{Initialization:} Prior to the first iteration of the loop, we have $k=p$, so that the subarray $A[p..k-1]$ is empty. This empty subarray contains the $k-p=0$ smallest elements of $L$ and $R$, and since $i=j=1$, both $L[i]$ and $R[j]$ are the smallest elements of their arrays that have not been copied back into $A$. \\

\textbf{Maintenance:} To see that each iteration maintains the loop invariant, let us first suppose that $L[i] \le R[j]$. Then $L[i]$ is the smallest element not yet copied back into $A$. Because $A[p..k-1]$ contains the $k-p$ smallest elements, after line 14 copies $L[i]$ into $A[k]$, the subarray $A[p..k]$ will contain the $k-p+1$ smallest elements. Incrementing $k$ (in the for loop update) and $i$ (in line 15) reestablishes the loop invariant for the next iteration. If instead $L[i] > R[j]$, then lines 16--17 perform the appropriate action to maintain the loop invariant.\\

\textbf{Termination:} At termination, $k=r+1$. By the loop invariant, the subarray $A[p..k-1]$, which is $A[p..r]$, contains the $k-p=r-p+1$ smallest
elements of $L[1..n_1+1]$ and $R[1..n_2+1]$, in sorted order. The arrays $L$
and $R$ together contain $n_1+n_2+2 = r-p+3$ elements. All but the two
largest have been copied back into $A$, and these two largest elements are the sentinels. \\

Now that we know the algorithm's implementation details, we conclude by writing a Merge Sort class that we can use to measure the run-times for Merge Sort. One possible final implementation is shown below.\\

\includegraphics[]{MergeSortPhoto1}
\includegraphics[]{MergeSortPhoto2}
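
As a text alternative to the images, here is a minimal sketch of a generic Merge Sort in Java. It deliberately departs from the pseudocode in one respect: instead of sentinels, which are awkward for arbitrary comparable types, the merge uses explicit bounds checks. The class name \texttt{MergeSortSketch} and its method names are assumptions, not the report's actual code.

```java
import java.util.Arrays;

/** Hypothetical sketch; uses bounds checks in place of the pseudocode's sentinels. */
final class MergeSortSketch {

    /** Sorts the array in place (with O(n) auxiliary storage) into nondecreasing order. */
    static <T extends Comparable<? super T>> void sort(T[] a) {
        sort(a, 0, a.length - 1);
    }

    private static <T extends Comparable<? super T>> void sort(T[] a, int p, int r) {
        if (p < r) { // ranges of length 0 or 1 are already sorted
            int q = p + (r - p) / 2;
            sort(a, p, q);
            sort(a, q + 1, r);
            merge(a, p, q, r);
        }
    }

    /** Merges the sorted ranges a[p..q] and a[q+1..r] back into a[p..r]. */
    private static <T extends Comparable<? super T>> void merge(T[] a, int p, int q, int r) {
        T[] left = Arrays.copyOfRange(a, p, q + 1);
        T[] right = Arrays.copyOfRange(a, q + 1, r + 1);
        int i = 0;
        int j = 0;
        for (int k = p; k <= r; k++) {
            // Take from left when right is exhausted, or when left's head is not larger.
            boolean takeLeft = j >= right.length
                    || (i < left.length && left[i].compareTo(right[j]) <= 0);
            if (takeLeft) {
                a[k] = left[i++];
            } else {
                a[k] = right[j++];
            }
        }
    }
}
```

Because the recursion only runs when \texttt{p < r}, an array of length zero is returned unchanged rather than causing an out-of-bounds access.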


\subsection*{Heap Sort}
Our next algorithm, Heap Sort, is best known for using the data structure called a ``heap.'' In a max-heap, the largest value is at the root, and every subtree of a heap is itself a heap. Like Insert Sort, but unlike Merge Sort, Heap Sort sorts in place. However, it is much faster than Insert Sort when working with large amounts of data. Thus, Heap Sort combines the better attributes of the two sorting algorithms we have already discussed [1]. Note its following conditions.\\

\textbf{Input:} A sequence of $n$ elements $\langle a_1, a_2, \ldots, a_n \rangle$.\\

\textbf{Output:} A permutation $\langle a_1', a_2', \ldots, a_n' \rangle$ of the input sequence such that \\
\indent $a_1' \le a_2' \le \cdots \le a_n'$.\\

The pseudo code for Heap Sort is as follows.

\lstinputlisting[language=java, breaklines = true]{HeapSortPseudo.txt}

It is important to note the loop invariant for Heap Sort and its correctness. In this case, we will be studying the loop invariant for BUILD-MAX-HEAP. \\

\textbf{Loop invariant:} At the start of each iteration of the \textbf{for} loop of lines 2--3, each node $i+1, i+2, \ldots, n$ is the root of a max-heap. \\

To show that the loop invariant implies the algorithm is correct, we must show its initialization, maintenance, and termination.  \\

\textbf{Initialization:} Prior to the first iteration of the loop, $i = \lfloor n/2 \rfloor$. Each node $\lfloor n/2 \rfloor + 1, \lfloor n/2 \rfloor + 2, \ldots, n$ is a leaf and is thus the root of a trivial max-heap.\\

\textbf{Maintenance:} To see that each iteration maintains the loop invariant, observe that the children of node $i$ are numbered higher than $i$. By the loop invariant, therefore, they are both roots of max-heaps. This is precisely the condition required for the call MAX-HEAPIFY($A,i$) to make node $i$ a max-heap root. Moreover, the MAX-HEAPIFY call preserves the property that nodes $i+1, i+2, \ldots, n$ are all roots of max-heaps. Decrementing $i$ in the for loop update reestablishes the loop invariant for the next iteration.\\

\textbf{Termination:} At termination, $i=0$. By the loop invariant, each node $1, 2, \ldots, n$ is the root of a max-heap. In particular, node 1 is.\\

Now that we know the algorithm's implementation details, we conclude by writing a Heap Sort class that we can use to measure the run-times for Heap Sort. One possible final implementation is shown below.\\

\includegraphics[]{HeapSortPhoto1}
\includegraphics[]{HeapSortPhoto2}
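
As a text alternative to the images, the following is a minimal sketch of a generic Heap Sort in Java. It assumes 0-based indexing (children of node $i$ at $2i+1$ and $2i+2$) and an iterative rather than recursive MAX-HEAPIFY; the class name \texttt{HeapSortSketch} and its method names are illustrative assumptions, not the report's actual code.

```java
/** Hypothetical sketch; 0-based indices and an iterative sift-down are assumptions. */
final class HeapSortSketch {

    /** Sorts the array in place into nondecreasing order. */
    static <T extends Comparable<? super T>> void sort(T[] a) {
        int n = a.length;
        // BUILD-MAX-HEAP: sift down every non-leaf node, last parent first.
        for (int i = n / 2 - 1; i >= 0; i--) {
            maxHeapify(a, i, n);
        }
        // Repeatedly move the current maximum to the end and shrink the heap.
        for (int end = n - 1; end > 0; end--) {
            swap(a, 0, end);
            maxHeapify(a, 0, end);
        }
    }

    /** Restores the max-heap property at node i, assuming both subtrees are max-heaps. */
    private static <T extends Comparable<? super T>> void maxHeapify(T[] a, int i, int heapSize) {
        while (true) {
            int left = 2 * i + 1;
            int right = 2 * i + 2;
            int largest = i;
            if (left < heapSize && a[left].compareTo(a[largest]) > 0) {
                largest = left;
            }
            if (right < heapSize && a[right].compareTo(a[largest]) > 0) {
                largest = right;
            }
            if (largest == i) {
                return; // node i already dominates its children
            }
            swap(a, i, largest);
            i = largest; // continue sifting down from the child that was swapped
        }
    }

    private static <T> void swap(T[] a, int i, int j) {
        T tmp = a[i];
        a[i] = a[j];
        a[j] = tmp;
    }
}
```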


\subsection*{Quick Sort}
Our next algorithm, Quick Sort, is best known for its reputation as the best practical choice for sorting. Quick Sort has a remarkably efficient random case. Like Insert Sort and Heap Sort, it also has the advantage of working in place. Note its following conditions.\\

\textbf{Input:} A sequence of $n$ elements $\langle a_1, a_2, \ldots, a_n \rangle$.\\

\textbf{Output:} A permutation $\langle a_1', a_2', \ldots, a_n' \rangle$ of the input sequence such that \\
\indent $a_1' \le a_2' \le \cdots \le a_n'$.\\

The pseudo code for Quick Sort is as follows.

\lstinputlisting[language=java, breaklines = true]{QuickSortPseudo.txt}

It is important to note the loop invariant for Quick Sort and its correctness. In this case, we will be studying the loop invariant of the PARTITION method. \\

\textbf{Loop invariant:} At the beginning of each iteration of the loop of lines 3--6, for any array index $k$, \\
\begin{enumerate}
	\item if $p \le k \le i$, then $A[k] \le x$.
	\item if $i+1 \le k \le j-1$, then $A[k] > x$.
	\item if $k  = r$, then $A[k]= x$.
\end{enumerate}

To show that the loop invariant implies the algorithm is correct, we must show its initialization, maintenance, and termination. \\

\textbf{Initialization:} Prior to the first iteration of the loop, $i = p-1$ and $j = p$. Because no values lie between $p$ and $i$ and no values lie between $i+1$ and $j-1$, the first two conditions of the loop invariant are trivially satisfied. The assignment in line 1 satisfies the third condition.\\

\textbf{Maintenance:} We consider two cases, depending on the outcome of the test in line 4. When $A[j]>x$, the only action in the loop is to increment $j$. After $j$ is incremented, condition 2 holds for $A[j-1]$ and all other entries remain unchanged. When $A[j]\le x$, the loop increments $i$, swaps $A[i]$ and $A[j]$, and then increments $j$. Because of the swap, we now have that $A[i]\le x$, and condition 1 is satisfied. Similarly, we also have that $A[j-1]>x$, since the item that was swapped into $A[j-1]$ is, by the loop invariant, greater than $x$.\\

\textbf{Termination:} At termination, $j = r$. Therefore, every entry in the array is in one of the three sets described by the invariant, and we have partitioned the values in the array into three sets: those less than or equal to $x$, those greater than $x$, and a singleton set containing $x$. \\

Now that we know the algorithm's implementation details, we conclude by writing a Quick Sort class that we can use to measure the run-times for Quick Sort. One possible final implementation is shown below.\\

\includegraphics[]{QuickSortPhoto1}
\includegraphics[]{QuickSortPhoto2}
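
As a text alternative to the images, the following is a minimal sketch of a generic Quick Sort in Java. It assumes the partitioning scheme analyzed above, in which the last element of the range is the pivot $x$, and it uses 0-based indices. The class name \texttt{QuickSortSketch} and its method names are illustrative assumptions, not the report's actual code.

```java
/** Hypothetical sketch; last-element pivot and 0-based indices are assumptions. */
final class QuickSortSketch {

    /** Sorts the array in place into nondecreasing order. */
    static <T extends Comparable<? super T>> void sort(T[] a) {
        sort(a, 0, a.length - 1);
    }

    private static <T extends Comparable<? super T>> void sort(T[] a, int p, int r) {
        if (p < r) {
            int q = partition(a, p, r);
            sort(a, p, q - 1);
            sort(a, q + 1, r);
        }
    }

    /**
     * Rearranges a[p..r] around the pivot x = a[r] and returns the pivot's final index.
     * During the loop, a[p..i] <= x and a[i+1..j-1] > x, matching the invariant.
     */
    static <T extends Comparable<? super T>> int partition(T[] a, int p, int r) {
        T x = a[r];
        int i = p - 1;
        for (int j = p; j < r; j++) {
            if (a[j].compareTo(x) <= 0) {
                i++;
                swap(a, i, j);
            }
        }
        swap(a, i + 1, r); // place the pivot between the two regions
        return i + 1;
    }

    private static <T> void swap(T[] a, int i, int j) {
        T tmp = a[i];
        a[i] = a[j];
        a[j] = tmp;
    }
}
```

On an already sorted input this sketch recurses to depth $n$, which is the worst case discussed later in the report.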

\pagebreak

\section*{TESTING}
Program testing is the process used to help identify the correctness, completeness, and quality of a class. The process of testing involves executing a program with the intent of finding errors and bugs. One of our goals for this project was to create a test driver that exercised the program. The following matrix shows the tests the program went through, the expected results, the actual results, and the remedy applied. \\


	\begin{tabular}{|p{3.5cm}|p{3.5cm}|p{3.5cm}|p{3.5cm}|}
	\hline
	Test & Expected Result & Actual Result & Remedy\\
	\hline
	Array of length zero  & Program continues with $n = 0$ & For some sorts (like Insert Sort) this caused no problem. Other sorts, like Merge Sort, threw an out-of-bounds exception. & The exception arose because these sorts split the array in half, and with length 0 the resulting indices fall outside the array. The remedy was an assert statement guaranteeing that the sorts never receive an array of length 0.\\
	\hline
	Array of negative length & an exception & As expected & Implemented the program so that an array of negative length would never be made\\
	\hline
	Array with just one element  & Since an array of size one is already sorted, I expect that the times for Best, Worst, and Average will be the same & After multiple trials, the results were either the same time, or a time differing by (at most) 2000 nanoseconds & Since the times were never more than 2000 nanoseconds apart, I think it is safe to assume that the difference comes from the \textit{TimeToSort()} method rather than from the \textit{InsertSort} method itself. \\
	\hline
	Array with null items  & an exception is thrown due to asserts & As Expected & N/A \\
	\hline
	\end{tabular}
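
The small timing differences noted in the table are typical of nanosecond-level measurements. One way to damp them, sketched below under our own assumptions, is to time several runs on fresh copies of the input and report the median. The class \texttt{TimingSketch} and the method \texttt{medianNanos} are hypothetical names and are not the report's \textit{TimeToSort()} method; only \texttt{System.nanoTime()} is a standard library call.

```java
import java.util.Arrays;
import java.util.function.Consumer;

/** Hypothetical timing helper; not the report's TimeToSort() method. */
final class TimingSketch {

    /**
     * Runs the sorter on a fresh copy of the input for each trial and
     * returns the median elapsed time in nanoseconds.
     */
    static <T> long medianNanos(T[] input, Consumer<T[]> sorter, int trials) {
        if (trials <= 0) {
            throw new IllegalArgumentException("trials must be positive");
        }
        long[] samples = new long[trials];
        for (int t = 0; t < trials; t++) {
            // Copy outside the timed region so every trial sees the same input order.
            T[] copy = Arrays.copyOf(input, input.length);
            long start = System.nanoTime();
            sorter.accept(copy);
            samples[t] = System.nanoTime() - start;
        }
        Arrays.sort(samples);
        return samples[trials / 2];
    }
}
```

Copying the input before each trial matters because an in-place sort would otherwise leave the array sorted, so every trial after the first would measure the already-sorted case.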

\subsection*{PROBLEMS ENCOUNTERED}

As with any programming project, a programmer must be able to keep track of any problems encountered and of the solutions found to counter them. The following is a description of the problems I found while programming my classes and how I chose to tackle them. \\

An issue that consistently came up while implementing my program was making sure that the methods sorted any comparable element, and not just integers. The methods were originally implemented to analyze integers, however, when changing them to analyze other elements, a lot of errors on overriding, raw types, and casting kept showing up. After a long time of debugging, these issues were resolved. \\

Since each sorting algorithm has different conditions for their best, worst, and random cases, it was difficult at first to modify the original code (used for Programming Assignment 1: Insertion Sort) to work well with the new requirements. After some trial and error, the best  solution was found to be refactoring most of the code in the main class to each of the sorting classes. That way, each class would determine its own best, worst, and average cases for each algorithm.\\

On the same note, finding how to construct the best, worst, and average input arrays proved to be quite a challenge. Deducing how each array would need to be preordered for each case was as difficult as finding a way to implement that in the code.\\
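
As an illustration of the simplest of these constructions, the sketch below builds ascending, descending, and shuffled arrays. These correspond to the best, worst, and random cases for Insertion Sort only; the best and worst inputs for the other algorithms need different constructions that are not shown here. The class \texttt{InputSketch} and its method names are hypothetical.

```java
import java.util.Arrays;
import java.util.Collections;
import java.util.Random;

/** Hypothetical input generators; names are illustrative only. */
final class InputSketch {

    /** Returns {0, 1, ..., n-1}: already sorted. */
    static Integer[] ascending(int n) {
        Integer[] a = new Integer[n];
        for (int i = 0; i < n; i++) {
            a[i] = i;
        }
        return a;
    }

    /** Returns {n-1, ..., 1, 0}: sorted in the opposite order. */
    static Integer[] descending(int n) {
        Integer[] a = new Integer[n];
        for (int i = 0; i < n; i++) {
            a[i] = n - 1 - i;
        }
        return a;
    }

    /** Returns a pseudo-random permutation of {0, ..., n-1}, reproducible via the seed. */
    static Integer[] shuffled(int n, long seed) {
        Integer[] a = ascending(n);
        // Arrays.asList is a view backed by the array, so the shuffle writes through.
        Collections.shuffle(Arrays.asList(a), new Random(seed));
        return a;
    }
}
```

Using a fixed seed makes a random-case measurement repeatable across runs.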

\pagebreak

\section*{EXPERIMENTAL ANALYSIS AND ASYMPTOTIC RUN-TIME COMPARISON}
After confirming that the program meets the conditions of each sort, and that it produces the required results, we now must compare the run-time complexities of the four sorting algorithms with their asymptotic run-times.\\

By running the program  for different input sizes $n$, pre-ordered accordingly to find the best, worst, and random cases, we found the different run-times for each sorting algorithm. We predict that our results will act according to each algorithm's asymptotic growth-rates, as described by the table in figure 1. To best compare if our results align with this prediction, we plotted the time behavior for each case as a function of $n$. Our results are described in the following sections.\\

\begin{figure}
	\includegraphics[]{growthRates}
	\caption{Growth Rates for Selected Sorting Algorithms}
\end{figure}

\subsection*{Insert Sort}

\subsubsection*{Best Case}
The best case input for Insertion Sort is an array whose elements are already sorted in increasing order. In this case, Insert Sort has linear running time (i.e. $O(n)$).\\

After plotting the best case data points found using the program, we found the results shown in the following chart. \\

\includegraphics[]{InsertBest}\\

Note that even with the outliers, the graph still follows linear behavior, defined by the trendline $y = 14.501x + 46676$. A good estimation from the graph would indicate that $n_0$ would lie around $n_0 = 1500$.

\subsubsection*{Worst  Case}

Asymptotically, the worst case input for Insertion Sort is an array that is sorted in the opposite order, that is, in decreasing order. In this case, every iteration of the method will scan and shift the entire sorted subarray before continuing, thus producing a quadratic running time (i.e. $O(n^2)$).\\
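
As a quick check of the quadratic bound, and assuming one shift is counted per element moved, consider a reversed array of $n$ elements. Inserting the $j$-th element shifts all $j-1$ elements before it, so the total number of shifts is
\[
\sum_{j=2}^{n} (j-1) = \frac{n(n-1)}{2} = \Theta(n^2).
\]
For example, $n = 4$ gives $1 + 2 + 3 = 6$ shifts.\\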

After plotting the worst case data points found using the program, we found the results shown in the following chart.\\

\includegraphics[]{InsertWorst}\\

Note that this time we have fewer outliers than with the best case. Even so, the graph follows the predicted behavior of $O(n^2)$, defined by the trendline $y = 2.4014x^2 + 1531.5x + 8 \times 10^{6}$. As the graph shows, $n_0$ is very near the beginning; we estimate that it falls around $n_0 = 50$.

\subsubsection*{Random Case}
The asymptotic random case for Insertion Sort is also quadratic, because on average each element must still be shifted past about half of the sorted subarray. Thus, like the worst-case behavior, the asymptotic running time of the random case is $O(n^2)$.\\

After plotting the random case data points found using the program, we found the results shown in the following chart. \\


\includegraphics[]{InsertRandom}\\

It is interesting to see how similar the random case is to the worst case with Insertion Sort. Again, our data shows $O(n^2)$ behavior with a trendline around $y = 0.9062x^2 + 3694.9x - 1 \times 10^{6}$. Again, $n_0$ is very near the beginning, so we similarly estimate that it falls around $n_0 = 50$.\\

After analyzing our findings, we can conclude that the best conditions for using Insertion Sort are when there are few elements, or when the elements are already in sorted order, since the run-time is then linearly proportional to the number of elements to be sorted. \\

\subsection*{Merge Sort}

\subsubsection*{Best Case}

Merge Sort's best case occurs when, for every merge step, the largest element of one sorted sublist is smaller than the first element of the opposing sublist. Only one element from the opposing list is then compared, which reduces the number of comparisons in each merge step to $n/2$. Even so, the best case of Merge Sort has $O(n \lg n)$ running time.\\
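
The $O(n \lg n)$ bound for this case can be sketched with a recurrence. Assume that $n$ is a power of two and that the merge stops comparing as soon as one list is exhausted (a simplification; a sentinel-based merge performs a comparison on every iteration). Then the best-case number of comparisons $C(n)$ satisfies
\[
C(1) = 0, \qquad C(n) = 2\,C(n/2) + \frac{n}{2},
\]
which solves to
\[
C(n) = \frac{n}{2}\lg n.
\]
For example, $C(4) = 2\,C(2) + 2 = 2 \cdot 1 + 2 = 4 = \frac{4}{2}\lg 4$.\\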

After plotting the best case data points found using the program, we found the results shown in the following chart. \\

\includegraphics[]{MergeBest}\\

Note that even with the outliers, the graph still follows logarithmic behavior, defined by the trendline $y = 1 \times 10^{6}\ln(x) - 2 \times 10^{6}$. A good estimation from the graph would indicate that $n_0$ would lie around $n_0 = 1000$.

\subsubsection*{Worst  Case}

The worst case scenario for Merge Sort is when, during every merge step, exactly one value remains in the opposing list; in other words, no comparisons are skipped. This situation occurs when the two largest values in a merge step are contained in opposing lists. When this happens, Merge Sort must continue comparing list elements in each of the opposing lists until the two largest values are compared [6]. In this case, Merge Sort has $O(n \lg n)$ running time.\\

After plotting the worst case data points found using the program, we found the results shown in the following chart. \\

\includegraphics[]{MergeWorst}\\

Note that even with the outliers, the graph still follows logarithmic behavior, defined by the trendline $y = 1 \times 10^{6}\ln(x) - 2 \times 10^{6}$. A good estimation from the graph would indicate that $n_0$ would lie around $n_0 = 1000$.

\subsubsection*{Random Case}

A notable quality of Merge Sort is that its best, worst, and random cases all have the same asymptotic running time, $O(n \lg n)$. We can see this quality after plotting the random case data points found using the program.\\

\includegraphics[]{MergeRandom}\\

Note that even with the outliers, the graph still follows logarithmic behavior, defined by the trendline $y = 983328\ln(x) - 2 \times 10^{6}$. A good estimation from the graph would indicate that $n_0$ would lie around $n_0 = 1000$.

\subsection*{Heap Sort}

When it comes to finding the inputs that produce the best, worst, and random cases, Heap Sort is an interesting algorithm to study. Because the first step of Heap Sort is to build a heap from the initial elements, the input is rearranged into heap order before any sorted output is produced. The time required to build that heap is spent in a separate procedure (BUILD-MAX-HEAP) rather than in the main sorting loop. This is partly responsible for the fact that Heap Sort has the same running time for its best, worst, and random cases: $O(n \lg n)$.
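
A rough accounting of this bound, under the usual assumptions that building the heap costs $O(n)$ and that each MAX-HEAPIFY call on a heap of $k$ elements costs $O(\lg k)$, is
\[
T(n) = O(n) + \sum_{k=2}^{n} O(\lg k) = O(n \lg n),
\]
since each of the $n-1$ terms in the sum is at most $O(\lg n)$. The bound does not depend on the initial order of the input.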

\subsubsection*{Best Case}

As mentioned before, the asymptotic best case for Heap Sort is $O(n \lg n)$. After plotting the best case data points found using the program, we found the results shown in the following chart. \\

\includegraphics[]{HeapBest}\\

Note that even with the outliers, the graph still follows logarithmic behavior, defined by the trendline $y = -2922\ln(n) + 52973$. A good estimation from the graph would indicate that $n_0$ would lie around $n_0 = 50$, since right from the beginning our data aligned with the asymptotic trendline.

\subsubsection*{Worst  Case}
The asymptotic worst case for Heap Sort is $O(n \lg n)$.\\

After plotting the worst case data points found using the program, we found the results shown in the following chart. \\

\includegraphics[]{HeapWorst}\\

Note that even with the outliers, the graph still follows logarithmic behavior, defined by the trendline $y = 11556\ln(n) - 15087$. A good estimation from the graph would indicate that $n_0$ would lie around $n_0 = 50$, since right from the beginning our data aligned with the asymptotic trendline.

\subsubsection*{Random Case}

The asymptotic random case for Heap Sort is also $O(n \lg n)$.\\

After plotting the random case data points found using the program, we found the results shown in the following chart. \\

\includegraphics[]{HeapRandom}\\

Note that even with the outliers, the graph still follows logarithmic behavior, defined by the trendline $y = 8060\ln(n) - 1856.6$. A good estimation from the graph would indicate that $n_0$ would lie around $n_0 = 50$, since right from the beginning our data aligned with the asymptotic trendline.

\subsection*{Quick Sort}

\subsubsection*{Best Case}

The asymptotic best case for Quick Sort is $O(n \lg n)$. This happens when PARTITION produces the most even possible split: two subproblems, each of size no more than $n/2$, since one is of size $\lfloor n/2 \rfloor$ and the other of size $\lceil n/2 \rceil - 1$ [1].\\

After plotting the best case data points found using the program, we found the results shown in the following chart. \\

\includegraphics[]{QuickBest}\\

Note that even with the two outliers in the beginning, the graph still follows logarithmic behavior, defined by the trendline $y = 915005\ln(n) - 3 \times 10^{6}$. A good estimation from the graph would indicate that $n_0$ would lie around $n_0 = 50$, since right from the beginning our data aligned with the asymptotic trendline.

\subsubsection*{Worst  Case}

The worst case behavior for Quick Sort occurs when the partitioning routine produces one subproblem with $n-1$ elements and one with 0 elements [6]. The asymptotic run-time for Quick Sort's worst case is $O(n^2)$, the same as Insert Sort. Moreover, this occurs when the input array is already completely sorted, a situation in which Insert Sort runs in $O(n)$ time.\\
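
The quadratic bound follows from the recurrence for this maximally unbalanced split. Assuming that partitioning $n$ elements costs $\Theta(n)$ and that $T(0) = \Theta(1)$,
\[
T(n) = T(n-1) + T(0) + \Theta(n) = T(n-1) + \Theta(n),
\]
and unrolling the recurrence gives
\[
T(n) = \sum_{k=1}^{n} \Theta(k) = \Theta(n^2).
\]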

After plotting the worst case data points found using the program, we found the results shown in the following chart. \\

\includegraphics[]{QuickWorst}\\

Note that even with the outliers, the graph follows the predicted behavior of $O(n^2)$, defined by the trendline $y = 1.6867n^2 + 6013.9n + 21736$. A good estimation from the graph would indicate that $n_0$ would lie around $n_0 = 50$, since right from the beginning our data aligned with the asymptotic trendline.

\subsubsection*{Random Case}

The random case running time for Quick Sort is much closer to its best case than to its worst case [1]. Its running time is therefore $O(n \lg n)$.\\

After plotting the random case data points found using the program, we found the results shown in the following chart. \\

\includegraphics[]{QuickRandom}\\

Note that even with the outliers, the graph still follows logarithmic behavior, defined by the trendline $y = 945043\ln(n) - 3 \times 10^{6}$. A good estimation from the graph would indicate that $n_0$ would lie around $n_0 = 50$, since right from the beginning our data aligned with the asymptotic trendline.

\section*{CONCLUSIONS}

After constructing a sorting program that implemented the Insert Sort, Merge Sort, Heap Sort, and Quick Sort algorithms, we were able to analyze and compare their run-time behaviors. We compared the time complexities we found to the asymptotic run-time complexities for the best, worst, and average cases of each algorithm. We have also found the conditions that make one algorithm better than the others and have discovered that the algorithm with the largest leading constant is Insert Sort. With a better understanding of each sorting algorithm, we have learned how to generate inputs of different complexities and how to implement and scrutinize a class so that it passes an intensive testing phase, and we have gained an extensive understanding of the Insertion Sort, Merge Sort, Heap Sort, and Quick Sort algorithms and their corresponding run-time complexities. \\

We predicted, \textit{a priori}, that Insert Sort would have the ``largest constant''. After finding the functions for each algorithm's best, worst, and random case, we found that the example with the largest leading constant was Insert Sort's best case with $y = 14.501x + 46676$.
\pagebreak

\section*{APPENDIX A: Main Class}
The following is the Java class written for the main class.

\hrulefill
\lstinputlisting[language=java, breaklines = true]{SortingAnalysis.txt}
\pagebreak

\section*{Appendix B: InsertionSort Class}

The following is the Java class written for Insertion Sort. This class provides a method that uses the Insertion Sort algorithm. The method produces a permutation of the array to be sorted with all of the elements sorted in increasing order. \\

Written in accord with the corresponding pseudocode, this class takes an array to be sorted and produces another array with the same elements as the original, but in sorted order, from lowest to highest. \\

Since the assignment requires timing how long an Insertion Sort call takes to execute, this class provides a second method that times the execution.
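
As an illustration of the two methods described above, the following is a minimal sketch and not the submitted class. The names \texttt{InsertionSortSketch}, \texttt{sort}, and \texttt{timeSort} are hypothetical, and the use of \texttt{System.nanoTime()} for timing is an assumption.

```java
import java.util.Arrays;

/** Hypothetical sketch of an insertion sort class with a timing method. */
final class InsertionSortSketch {

    /** Returns a copy of the input sorted in increasing order; the input is unchanged. */
    static int[] sort(int[] input) {
        int[] a = Arrays.copyOf(input, input.length);
        for (int j = 1; j < a.length; j++) {
            int key = a[j];
            int i = j - 1;
            // Shift elements larger than key one slot to the right.
            while (i >= 0 && a[i] > key) {
                a[i + 1] = a[i];
                i--;
            }
            a[i + 1] = key;
        }
        return a;
    }

    /** Returns the elapsed time of one call to sort, in nanoseconds. */
    static long timeSort(int[] input) {
        long start = System.nanoTime();
        sort(input);
        return System.nanoTime() - start;
    }
}
```

A single timing of a small array is noisy, so repeated runs over a range of array sizes are needed before fitting a trendline.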

\hrulefill
\lstinputlisting[language=java, breaklines = true]{insertionSortCode.txt}
\pagebreak

\section*{Appendix C: InsertionSortDecreasing Class}

The following is the Java class written for Insertion Sort in Decreasing order. This class provides a method that uses the InsertionSortDescending algorithm. The method produces a permutation of the array to be sorted with all of the elements sorted in decreasing order. \\

Written in accord with the corresponding pseudocode, this class takes an array to be sorted and produces another array with the same elements as the original, but in sorted order, from highest to lowest. \\
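
The decreasing variant differs from the increasing one only in the direction of the comparison in the inner loop. The following minimal sketch shows this; the class name \texttt{InsertionSortDecreasingSketch} is hypothetical and the code is an illustration, not the submitted class.

```java
import java.util.Arrays;

/** Hypothetical sketch of insertion sort producing decreasing order. */
final class InsertionSortDecreasingSketch {

    /** Returns a copy of the input sorted from highest to lowest; the input is unchanged. */
    static int[] sort(int[] input) {
        int[] a = Arrays.copyOf(input, input.length);
        for (int j = 1; j < a.length; j++) {
            int key = a[j];
            int i = j - 1;
            // Shift elements smaller than key to the right (note the reversed comparison).
            while (i >= 0 && a[i] < key) {
                a[i + 1] = a[i];
                i--;
            }
            a[i + 1] = key;
        }
        return a;
    }
}
```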


\hrulefill
\lstinputlisting[language=java, breaklines = true]{insertionSortDecreasing.txt}
\pagebreak

\section*{Appendix D: MergeSort Class}

The following is the Java class written for Merge Sort. This class provides a method that uses the Merge Sort algorithm. The method produces a permutation of the array to be sorted with all of the elements sorted in increasing order. \\

Written in accord with the corresponding pseudocode, this class takes an array to be sorted and produces another array with the same elements as the original, but in sorted order, from lowest to highest. \\

Since the assignment requires timing how long a Merge Sort call takes to execute, this class provides a second method that times the execution.
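
As an illustration of the sorting method described above, the following is a minimal top-down merge sort sketch. It is not the submitted class, and the names \texttt{MergeSortSketch}, \texttt{sort}, and \texttt{merge} are hypothetical.

```java
import java.util.Arrays;

/** Hypothetical sketch of a top-down merge sort. */
final class MergeSortSketch {

    /** Returns a copy of the input sorted in increasing order; the input is unchanged. */
    static int[] sort(int[] input) {
        if (input.length <= 1) {
            return Arrays.copyOf(input, input.length);
        }
        int mid = input.length / 2;
        int[] left = sort(Arrays.copyOfRange(input, 0, mid));
        int[] right = sort(Arrays.copyOfRange(input, mid, input.length));
        return merge(left, right);
    }

    /** Merges two sorted arrays into one sorted array. */
    private static int[] merge(int[] left, int[] right) {
        int[] out = new int[left.length + right.length];
        int i = 0, j = 0, k = 0;
        while (i < left.length && j < right.length) {
            // Taking from the left on ties keeps the sort stable.
            out[k++] = (left[i] <= right[j]) ? left[i++] : right[j++];
        }
        while (i < left.length) out[k++] = left[i++];
        while (j < right.length) out[k++] = right[j++];
        return out;
    }
}
```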

\hrulefill
\lstinputlisting[language=java, breaklines = true]{mergeSortCode.txt}
\pagebreak

\section*{Appendix E: HeapSort Class}

The following is the Java class written for Heap Sort. This class provides a method that uses the Heap Sort algorithm. The method produces a permutation of the array to be sorted with all of the elements sorted in increasing order. \\

Written in accord with the corresponding pseudocode, this class takes an array to be sorted and produces another array with the same elements as the original, but in sorted order, from lowest to highest. \\

Since the assignment requires timing how long a Heap Sort call takes to execute, this class provides a second method that times the execution.
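
As an illustration of the sorting method described above, the following is a minimal heap sort sketch using a max-heap stored in the array itself. It is not the submitted class, and the names \texttt{HeapSortSketch}, \texttt{sort}, and \texttt{siftDown} are hypothetical.

```java
import java.util.Arrays;

/** Hypothetical sketch of heap sort using an array-based max-heap. */
final class HeapSortSketch {

    /** Returns a copy of the input sorted in increasing order; the input is unchanged. */
    static int[] sort(int[] input) {
        int[] a = Arrays.copyOf(input, input.length);
        int n = a.length;
        // Build a max-heap bottom-up, starting from the last internal node.
        for (int i = n / 2 - 1; i >= 0; i--) {
            siftDown(a, i, n);
        }
        // Repeatedly move the maximum to the end and shrink the heap.
        for (int end = n - 1; end > 0; end--) {
            int tmp = a[0];
            a[0] = a[end];
            a[end] = tmp;
            siftDown(a, 0, end);
        }
        return a;
    }

    /** Restores the max-heap property for the subtree rooted at root within a[0..size-1]. */
    private static void siftDown(int[] a, int root, int size) {
        while (true) {
            int largest = root;
            int left = 2 * root + 1;
            int right = 2 * root + 2;
            if (left < size && a[left] > a[largest]) largest = left;
            if (right < size && a[right] > a[largest]) largest = right;
            if (largest == root) return;
            int tmp = a[root];
            a[root] = a[largest];
            a[largest] = tmp;
            root = largest;
        }
    }
}
```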

\hrulefill
\lstinputlisting[language=java, breaklines = true]{heapSortCode.txt}
\pagebreak

\section*{Appendix F: QuickSort Class}

The following is the Java class written for Quick Sort. This class provides a method that uses the Quick Sort algorithm. The method produces a permutation of the array to be sorted with all of the elements sorted in increasing order. \\

Written in accord with the corresponding pseudocode, this class takes an array to be sorted and produces another array with the same elements as the original, but in sorted order, from lowest to highest. \\

Since the assignment requires timing how long a Quick Sort call takes to execute, this class provides a second method that times the execution.
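
As an illustration of the sorting method described above, the following is a minimal quick sort sketch. It assumes the last element of each subarray is chosen as the pivot; the submitted class may choose its pivot differently. The names \texttt{QuickSortSketch}, \texttt{sort}, \texttt{quickSort}, and \texttt{partition} are hypothetical.

```java
import java.util.Arrays;

/** Hypothetical sketch of quick sort with the last element as pivot. */
final class QuickSortSketch {

    /** Returns a copy of the input sorted in increasing order; the input is unchanged. */
    static int[] sort(int[] input) {
        int[] a = Arrays.copyOf(input, input.length);
        quickSort(a, 0, a.length - 1);
        return a;
    }

    private static void quickSort(int[] a, int lo, int hi) {
        if (lo >= hi) return; // zero or one element: already sorted
        int p = partition(a, lo, hi);
        quickSort(a, lo, p - 1);
        quickSort(a, p + 1, hi);
    }

    /** Partitions a[lo..hi] around a[hi] and returns the pivot's final index. */
    private static int partition(int[] a, int lo, int hi) {
        int pivot = a[hi];
        int i = lo - 1; // end of the region holding elements <= pivot
        for (int j = lo; j < hi; j++) {
            if (a[j] <= pivot) {
                i++;
                int tmp = a[i];
                a[i] = a[j];
                a[j] = tmp;
            }
        }
        int tmp = a[i + 1];
        a[i + 1] = a[hi];
        a[hi] = tmp;
        return i + 1;
    }
}
```

With this pivot choice an already sorted input produces maximally unbalanced partitions, which is the quadratic worst case.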

\hrulefill
\lstinputlisting[language=java, breaklines = true]{quickSortCode.txt}
\pagebreak
\section*{REFERENCES}


[1] Cormen, Thomas H. Introduction to Algorithms. MIT Press, 2009.\\

[2] Heap sort. (n.d.). Retrieved February 14, 2016, from http://www.slideshare.net/saraeida/heap-sort-use-jing\\

[3] Lbackstrom. "The Importance of Algorithms – Topcoder." The Importance of Algorithms – Topcoder. Accessed January 31, 2016. https://www.topcoder.com/community/data-science/data-science-tutorials/the-importance-of-algorithms/.\\

[4] Merge.java. (n.d.). Retrieved February 14, 2016, from http://algs4.cs.princeton.edu/22mergesort/Merge.java.html\\

[5] "Nairaland Forum." The Importance Of Software Testing And Not Just Software Programming. April 01, 2008. Accessed January 31, 2016. http://www.nairaland.com/124053/importance-software-testing-not-just.\\

[6] Quiles, Luis. Merge Sort. Florida Institute of Technology.

\end{document}