programming_under_the_hood/CountingCh.xml at master · johnnyb/programming_under_the_hood · GitHub

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
596
597
598
599
600
601
602
603
604
605
606
607
608
609
610
611
612
613
614
615
616
617
618
619
620
621
622
623
624
625
626
627
628
629
630
631
632
633
634
635
636
637
638
639
640
641
642
643
644
645
646
647
648
649
650
651
652
653
654
655
656
657
658
659
660
661
662
663
664
665
666
667
668
669
670
671
672
673
674
675
676
677
678
679
680
681
682
683
684
685
686
687
688
689
690
691
692
693
694
695
696
697
698
699
700
701
702
703
704
705
706
707
708
709
710
711
712
713
714
715
716
717
718
719
720
721
722
723
724
725
726
727
728
729
730
731
732
733
734
735
736
737
738
739
740
741
742
743
744
745
746
747
748
749
750
751
752
753
754
755
756
757
758
759
760
761
762
763
764
765
766
767
768
769
770
771
772
773
774
775
776
777
778
779
780
781
782
783
784
785
786
787
788
789
790
791
792
793
794
795
796
797
798
799
800
801
802
803
804
805
806
807
808
809
810
811
812
813
814
815
816
817
818
819
820
821
822
823
824
825
826
827
828
829
830
831
832
833
834
835
836
837
838
839
840
841
842
843
844
845
846
847
848
849
850
851
852
853
854
855
856
857
858
859
860
861
862
863
864
865
866
867
868
869
870
871
872
873
874
875
876
877
878
879
880
881
882
883
884
885
886
887
888
889
890
891
892
893
894
895
896
897
898
899
900
901
902
903
904
905
906
907
908
909
910
911
912
913
914
915
916
917
918
919
920
921
922
923
924
925
926
927
928
929
930
931
932
933
934
935
936
937
938
939
940
941
942
943
944
945
946
947
948
949
950
951
952
953
954
955
956
957
958
959
960
961
962
963
964
965
966
967
968
969
970
971
972
973
974
975
976
977
978
979
980
981
982
983
984
985
986
987
988
989
990
991
992
993
994
995
996
997
998
999
1000
<chapter id="countingchapter">
<title>Counting Like a Computer</title>
<!-- FIXME - See Dominique note -->
<!--

Copyright 2002 Jonathan Bartlett

Permission is granted to copy, distribute and/or modify this
document under the terms of the GNU Free Documentation License,
Version 1.1 or any later version published by the Free Software
Foundation; with no Invariant Sections, with no Front-Cover Texts,
and with no Back-Cover Texts.  A copy of the license is included in fdl.xml

-->

<sect1>
<title>Counting</title>

<sect2>
<title>Counting Like a Human</title>

<para>
In many ways, computers count just like humans.  So, before we
start learning how computers count, let's take a deeper look at
how we count.
</para>

<para>
How many fingers do you have?  No, it's not a trick question.
Humans (normally) have ten fingers.  Why is that significant?
Look at our numbering system.  At what point does a one-digit
number become a two-digit number?  That's right, at ten.  Humans
count and do math using a base ten numbering system.  Base ten
means that we group everything in tens.  Let's say we're counting
sheep.  1, 2, 3, 4, 5, 6, 7, 8, 9, 10.  Why did we all of a sudden
now have two digits, and re-use the 1?  That's because we're grouping
our numbers by ten, and we have 1 group of ten sheep.  Okay, let's
go to the next number 11.  That means we have 1 group of ten sheep,
and 1 sheep left ungrouped.  So we continue - 12, 13, 14, 15, 16, 17,
18, 19, 20.  Now we have 2 groups of ten.  21 - 2 groups of ten, and
1 sheep ungrouped.  22 - 2 groups of ten, and 2 sheep ungrouped.   So,
let's say we keep counting, and get to 97, 98, 99, and 100.  Look, it
happened again!  What happens at 100?  We now have ten groups of ten.
At 101 we have ten groups of ten, and 1 ungrouped sheep.  So we can
look at any number like this.  If we counted 60879 sheep, that would
mean that we had 6 groups of ten groups of ten groups of ten groups of ten,
0 groups of ten groups of ten groups of ten, 8 groups of ten groups of ten,
7 groups of ten, and 9 sheep left ungrouped.
</para>

<para>
So, is there anything significant about grouping things by ten?  No!  It's
just that grouping by ten is how we've always done it, because we have
ten fingers.  We could have grouped at nine or at eleven (in which case
we would have had to make up a new symbol).  The only difference between
the different groupings of numbers is that we have to re-learn our
multiplication, addition, subtraction, and division tables for each
grouping.  The rules
haven't changed, just the way we represent them.  Also, some of our
tricks that we learned don't always apply, either.  For example, let's
say we grouped by nine instead of ten.  Moving the decimal point one
digit to the right no longer multiplies by ten, it now multiplies by nine.
In base nine, 500 is only nine times as large as 50.
</para>
</sect2>

<sect2>
<title>Counting Like a Computer</title>

<para>
The question is, how many fingers does the computer have to count with?
The computer only has two fingers.  So that means all of the groups are
groups of two.  So, let's count in binary - 0 (zero), 1 (one), 10 (two -
one group of two), 11 (three - one group of two and one left over),
100 (four - two groups of two), 101 (five - two groups of two and one left
over), 110 (six - two groups of two and one group of two), and so on.
In base two, moving the decimal one digit to the right multiplies by two, and
moving it to the left divides by two.  Base two<indexterm><primary>base two</primary></indexterm> is also referred to
as binary.
</para>

<para>
The nice thing about base two is that the basic math tables are very
short.  In base ten, the multiplication tables are ten columns wide,
and ten columns tall.  In base two, it is very simple:

<programlisting>
<!-- NOTE - these need to be converted to tables -->

Table of binary addition

+ |  0  |  1
--+-----+-----
0 |  0  |  0
--+-----+-----
1 |  1  | 10

Table of binary multiplication

* |  0  |  1
--+-----+-----
0 |  0  |  0
--+-----+-----
1 |  0  |  1

</programlisting>

So, let's add the numbers 10010101 with 1100101:

<programlisting>
   10010101
+   1100101
-----------
   11111010
</programlisting>

Now, let's multiply them:

<programlisting>

        10010101
     *   1100101
     -----------
        10010101
       00000000
      10010101
     00000000
    00000000
   10010101
  10010101
 ---------------
  11101011001001
</programlisting>

</para>

</sect2>

<sect2>
<title>Conversions Between Binary and Decimal</title>

<para>
Let's learn how to convert numbers from binary<indexterm><primary>binary</primary></indexterm> (base two<indexterm><primary>base two</primary></indexterm>) to
decimal<indexterm><primary>decimal</primary></indexterm> (base ten<indexterm><primary>base ten</primary></indexterm>).  This is actually a rather simple process.
If you remember, each digit<indexterm><primary>digit</primary></indexterm> stands for some grouping of two.  So,
we just need to add up what each digit represents, and we will have
a decimal number.  Take the binary number 10010101.  To find out
what it is in decimal, we take it apart like this:

<programlisting>
     1    0    0    1    0    1    0    1
     |    |    |    |    |    |    |    |
     |    |    |    |    |    |    |    Individual units (2^0)
     |    |    |    |    |    |    0 groups of 2 (2^1)
     |    |    |    |    |    1 group of 4 (2^2)
     |    |    |    |    0 groups of 8 (2^3)
     |    |    |    1 group of 16 (2^4)
     |    |    0 groups of 32 (2^5)
     |    0 groups of 64 (2^6)
     1 group of 128 (2^7)
</programlisting>

and then we add all of the pieces together, like this:

<programlisting>
1*128 + 0*64 + 0*32 + 1*16 + 0*8 + 1*4 + 0*2 + 1*1 =
128 + 16 + 4 + 1 =
149
</programlisting>

So 10010101 in binary is 149 in decimal.  Let's look at 1100101.  It
can be written as

<programlisting>
1*64 + 1*32 + 0 * 16 + 0*8 + 1*4 + 0*2 + 1*1 =
64 + 32 + 4 + 1 =
101
</programlisting>

So we see that 1100101 in binary is 101 in decimal.  Let's look at
one more number, 11101011001001.  You can convert it to decimal by doing

<programlisting>
1*8192 + 1*4096 + 1*2048 + 0*1024 + 1*512 + 0*256
       + 1*128 + 1*64 + 0*32 + 0*16 + 1*8 + 0*4
       + 0*2 + 1*1 =

8192 + 4096 + 2048 + 512 + 128 + 64 + 8 + 1 =

15049
</programlisting>

Now, if you've been paying attention, you have noticed that the numbers
we just converted are the same ones we used to multiply with earlier.
So, let's check our results: 101 * 149 = 15049.  It worked!
</para>

<para>
Now let's look at going from decimal<indexterm><primary>decimal</primary></indexterm> back to binary<indexterm><primary>binary</primary></indexterm>.  In order to do
the conversion, you have to <emphasis>divide</emphasis> the number
into groups of two.  So, let's say you had the number 17.  If you
divide it by two, you get 8 with 1 left over.  So that means there are
8 groups of two, and 1 ungrouped.  That means that the rightmost digit
will be 1.  Now, we have the rigtmost digit figured out, and 8 groups
of 2 left over.  Now, let's see how many groups of two groups of two we
have, by dividing 8 by 2.  We get 4, with nothing left over.  That
means that all groups two can be further divided into more groups of
two.  So, we have 0 groups of only two.  So the next digit to the
left is 0.  So, we divide 4 by 2 and get two, with 0 left over, so
the next digit is 0.  Then, we divide 2 by 2 and get 1, with 0 left over.
So the next digit is 0.  Finally, we divide 1 by 2 and get 0 with 1
left over, so the next digit to the left is 1.  Now, there's nothing
left, so we're done.  So, the number we wound up with is 10001.
</para>

<para>
Previously, we converted to binary 11101011001001 to decimal 15049.  Let's
do the reverse to make sure that we did it right:

<programlisting>
15049 / 2 = 7524    Remaining 1
7524 / 2 = 3762     Remaining 0
3762 / 2 = 1881     Remaining 0
1881 / 2 = 940      Remaining 1
940 / 2 = 470       Remaining 0
470 / 2 = 235       Remaining 0
235 / 2 = 117       Remaining 1
117 / 2 = 58        Remaining 1
58 / 2 = 29         Remaining 0
29 / 2 = 14         Remaining 1
14 / 2 = 7          Remaining 0
7 / 2 = 3           Remaining 1
3 / 2 = 1           Remaining 1
1 / 2 = 0           Remaining 1
</programlisting>

Then, we put the remaining numbers back together, and we have the original
number!  Remember the first division remainder goes to the far right, so
from the bottom up you have 11101011001001.
</para>

<para>
Each digit in a binary number is called a <emphasis>bit<indexterm><primary>bits</primary></indexterm></emphasis>, which
stands for <emphasis>binary digit<indexterm><primary>binary digit</primary></indexterm></emphasis>.  Remember, computers divide up
their memory into storage locations called bytes<indexterm><primary>bytes</primary></indexterm>.  Each storage location on an x86 processor (and most others) is 8 bits long.  Earlier we said
that a byte can hold any number between 0 and 255.  The reason for this
is that the largest number you can fit into 8 bits is 255.  You can
see this for yourself if you convert binary 11111111 into decimal:

<programlisting>
11111111 =

(1 * 2^7) + (1 * 2^6) + (1 * 2^5) + (1 * 2^4) + (1 * 2^3)
          + (1 * 2^2) + (1 * 2^1) + (1 * 2^0) =

128 + 64 + 32 + 16 + 8 + 4 + 2 + 1 =

255
</programlisting>

The largest number that you can hold in 16 bits is 65535.   The largest
number you can hold in 32 bits is 4294967295 (4 billion).  The largest
number you can hold in 64 bits is 18,446,744,073,709,551,615.  The
largest number you can hold in 128 bits is
340,282,366,920,938,463,463,374,607,431,768,211,456.  Anyway, you
see the picture.  For x86 processors, most of the time you will deal with 4-byte
numbers (32 bits), because that's the size of the registers<indexterm><primary>registers</primary></indexterm>.
</para>

</sect2>

</sect1>

<sect1 id="truthbinarynumbers">
<title>Truth, Falsehood, and Binary Numbers</title>

<para>
Now we've seen that the computer stores everything as sequences of
1's and 0's.  Let's look at some other uses of this.  What if, instead
of looking at a sequence of bits<indexterm><primary>bits</primary></indexterm> as a number, we instead looked at
it as a set of switches<indexterm><primary>switches</primary></indexterm>.  For example, let's say there are four switches
that control lighting in the house.  We have a switch for outside lights,
a switch for the hallway lights, a switch for the living room lights,
and a switch for the bedroom lights.  We could make a little table
showing which of these were on and off, like so:

<programlisting>
Outside  Hallway  Living Room  Bedroom
  On       Off        On         On
</programlisting>

It's obvious from looking at this that all of the lights are on except
the hallway ones.  Now, instead of using the words "On" and "Off",
let's use the numbers 1 and 0.  1 will represent on, and 0 will represent
off.  So, we could represent the same information as

<programlisting>
Outside  Hallway  Living Room  Bedroom
   1        0           1         1
</programlisting>

Now, instead of having labels on the light switches, let's say we just
memorized which position went with which switch.  Then, the same
information could be represented as

<programlisting>
1           0           1         1
</programlisting>

or as

<programlisting>
1011
</programlisting>

This is just one of many ways you can use the computers storage locations
to represent more than just numbers.  The computers memory just sees numbers,
but programmers can use these numbers to represent anything their imaginations
can come up with.  They just sometimes have to be creative when figuring out
the best representation.
</para>

<para>
Not only can you do regular arithmetic with binary numbers, they also have
a few operations of their own, called binary
<indexterm zone="truthbinarynumbers"><primary>binary operations</primary></indexterm>
or logical operations
<indexterm><primary>logical operations</primary></indexterm>.
The standard binary operations are
</para>

<itemizedlist>
<listitem><para>AND<indexterm><primary>AND</primary></indexterm></para></listitem>
<listitem><para>OR<indexterm><primary>OR</primary></indexterm></para></listitem>
<listitem><para>NOT<indexterm><primary>NOT</primary></indexterm></para></listitem>
<listitem><para>XOR<indexterm><primary>XOR</primary></indexterm></para></listitem>
</itemizedlist>

<para>
Before we look at examples, I'll describe them for you.
AND takes two bits and returns one bit.  AND will return a 1 only if
both bits are 1, and a 0 otherwise.  For example, 1 AND 1 is 1, but
1 AND 0 is 0, 0 AND 1 is 0, and 0 AND 0 is 0.
</para>

<para>
OR takes two bits
and returns one bit.  It will return 1 if either of the original bits
is 1.  For example, 1 OR 1 is 1, 1 OR 0 is one, 0 OR 1 is 1, but 0 OR 0
is 0.
</para>

<para>
NOT only takes one bit and returns its opposite.  NOT 1 is 0 and
NOT 0 is 1.
</para>

<para>
Finally, XOR is like OR, except it returns 0 if both bits
are 1.
</para>

<para>
Computers can do these operations on whole registers at a time.
For example, if a register has 10100010101010010101101100101010 and
another one has 10001000010101010101010101111010, you can run any of
these operations on the whole registers.  For example, if we were to
AND them, the computer will run from the first bit to the 32nd and run
the AND<indexterm><primary>AND</primary></indexterm> operation on that bit in both registers.  In this case:

<programlisting>
10100010101010010101101100101010 AND
10001000010101010101010101111010
--------------------------------
10000000000000010101000100101010
</programlisting>

You'll see that the resulting set of bits only has a one where
<emphasis>both</emphasis> numbers had a one, and in every other position
it has a zero.  Let's look at what an OR<indexterm><primary>OR</primary></indexterm> looks like:

<programlisting>
10100010101010010101101100101010 OR
10001000010101010101010101111010
--------------------------------
10101010111111010101111101111010
</programlisting>

In this case, the resulting number has a 1 where either number has
a 1 in the given position.  Let's look at the NOT<indexterm><primary>NOT</primary></indexterm> operation:

<programlisting>
NOT 10100010101010010101101100101010
------------------------------------
    01011101010101101010010011010101
</programlisting>

This just reverses each digit.  Finally, we have XOR<indexterm><primary>XOR</primary></indexterm>, which is
like an OR, except if <emphasis>both</emphasis> digits are 1, it
returns 0.

<programlisting>
10100010101010010101101100101010 XOR
10001000010101010101010101111010
--------------------------------
00101010111111000000111001010000
</programlisting>

This is the same two numbers used in the OR operation, so you can
compare how they work.  Also, if you XOR a number with itself, you
will always get 0, like this:

<programlisting>
10100010101010010101101100101010 XOR
10100010101010010101101100101010
--------------------------------
00000000000000000000000000000000
</programlisting>

</para>

<para>
These operations are useful for two reasons:

<itemizedlist>
<listitem><para>The computer can do them extremely fast</para></listitem>
<listitem><para>You can use them to compare many truth values at the same time</para></listitem>
</itemizedlist>

You may not have known that different instructions execute at different
speeds.  It's true, they do.  And these operations are the
fastest on most processors.  For example, you saw that XORing a number with
itself produces 0.  Well, the XOR<indexterm><primary>XOR</primary></indexterm> operation is faster than the loading operation, so
many programmers use it to load a register<indexterm><primary>register</primary></indexterm> with zero.  For example, the
code

<programlisting>
	movl  $0, %eax
</programlisting>

is often replaced by

<programlisting>
	xorl  %eax, %eax
</programlisting>

We'll discuss speed more in <xref linkend="optimizationch" />, but I want you
to see how programmers often do tricky things, especially with these
binary operators, to make things fast.  Now let's look at how we
can use these operators to manipulate true<indexterm><primary>true</primary></indexterm>/false<indexterm><primary>false</primary></indexterm> values.  Earlier
we discussed how binary numbers can be used to represent any number
of things.  Let's use binary numbers to represent what things my
Dad and I like.  First, let's look at the things I like:

<programlisting>
Food: yes
Heavy Metal Music: yes
Wearing Dressy Clothes: no
Football: yes
</programlisting>

Now, let's look at what my Dad likes:

<programlisting>
Food: yes
Heavy Metal Music: no
Wearing Dressy Clothes: yes
Football: yes
</programlisting>

Now, let's use a 1 to say yes we like something, and a 0 to say no we don't.
Now we have:

<programlisting>
Me
Food: 1
Heavy Metal Music: 1
Wearing Dressy Clothes: 0
Football: 1

Dad
Food: 1
Heavy Metal Music: 0
Wearing Dressy Clothes: 1
Football: 1
</programlisting>

Now, if we just memorize which position each of these are in, we have

<programlisting>
Me
1101

Dad
1011
</programlisting>

Now, let's see we want to get a list of things both my Dad and I like.
You would use the AND<indexterm><primary>AND</primary></indexterm> operation.  So

<programlisting>
1101 AND
1011
--------
1001
</programlisting>

Which translates to

<programlisting>
Things we both like
Food: yes
Heavy Metal Music: no
Wearing Dressy Clothes: no
Football: yes
</programlisting>

Remember, the computer has no idea what the ones and zeroes represent.
That's your job and your program's job.  If you wrote a program
around this representation your program would at some point examine
each bit and have code to tell the user what it's for (if you asked a computer
what two people agreed on and it answered 1001, it wouldn't be very
useful).  Anyway, let's say we want to know the things that we disagree
on.  For that we would use XOR<indexterm><primary>XOR</primary></indexterm>, because it will return 1 only if one
or the other is 1, but not both.  So

<programlisting>
1101 XOR
1011
--------
0110
</programlisting>

And I'll let you translate that back out.
</para>

<para>
The previous operations: AND, OR, NOT, and XOR are called
<emphasis>boolean operators<indexterm><primary>boolean operators</primary></indexterm></emphasis>
because they were first studied by George Boole.
So, if someone mentiones boolean operators<indexterm><primary>boolean operators</primary></indexterm> or
boolean algebra<indexterm><primary>boolean algebra</primary></indexterm>,
you now know what they are talking about.
</para>

<para>
In addition to the boolean operations, there
are also two binary operators<indexterm><primary>binary operators</primary></indexterm> that aren't boolean, shift<indexterm><primary>shift</primary></indexterm> and rotate<indexterm><primary>rotate</primary></indexterm>.
Shifts and rotates each do what their name implies, and can do so to
the right or the left.  A left shift moves each digit of a binary number
one space to the left, puts a zero in the ones spot, and chops off
the furthest digit to the left.  A left rotate does the same thing, but
takes the furthest digit to the left and puts it in the ones spot.  For
example,

<programlisting>
Shift left  10010111 = 00101110
Rotate left 10010111 = 00101111
</programlisting>

Notice that if you rotate a number for every digit it has (i.e. - rotating a 32-bit number 32 times), you wind up
with the same number you started with.  However, if you <emphasis>shift</emphasis> a number for every digit
you have, you wind up with 0.  So, what are these shifts useful for?
Well, if you have binary numbers representing things, you use shifts
to peek at each individual value.  Let's say, for instance, that
we had my Dad's likes stored in a register (32 bits).  It
would look like this:

<programlisting>
00000000000000000000000000001011
</programlisting>

Now, as we said previously, this doesn't work as program output.  So,
in order to do output, we would need to do shifting<indexterm><primary>shifting</primary></indexterm> and
<emphasis>masking<indexterm><primary>masking</primary></indexterm></emphasis>.  Masking is the process of eliminating
everything you don't want.  In this case, for every value we are looking
for, we will shift the number so that value is in the ones place, and
then mask that digit so that it is all we see.  Masking is accomplished
by doing an AND<indexterm><primary>AND</primary></indexterm>
with a number that has the bits we are interested in set to 1.
For example, let's say
we wanted to print out whether my Dad likes dressy clothes or not.  That
data is the second value from the right.  So, we have to shift the
number right 1 digit so it looks like this:

<programlisting>
00000000000000000000000000000101
</programlisting>

and then, we just want to look at that digit, so we mask it by ANDing
it with 00000000000000000000000000000001.

<programlisting>
00000000000000000000000000000101 AND
00000000000000000000000000000001
-----------------------------------
00000000000000000000000000000001
</programlisting>

This will make the value of the register 1 if my Dad likes dressy
clothes, and 0 if he doesn't.  Then we can do a comparison to 1
and print the results.  The code would look like this:

<programlisting>
	#NOTE - assume that the register %ebx holds
	#       my Dad's preferences

	movl  %ebx, %eax #This copies the information into %eax so
	                 #we don't lose the original data

	shrl  $1, %eax   #This is the shift operator.  It stands
	                 #for Shift Right Long.  This first number
	                 #is the number of positions to shift,
	                 #and the second is the register to shift

	#This does the masking
	andl  $0b00000000000000000000000000000001, %eax

	#Check to see if the result is 1 or 0
	cmpl  $0b00000000000000000000000000000001, %eax

	je    yes_he_likes_dressy_clothes

	jmp   no_he_doesnt_like_dressy_clothes
</programlisting>

And then we would have two labels which printed something about
whether or not he likes dressy clothes and then exits.
The <literal>0b</literal> notation means that what follows is a binary number<indexterm><primary>binary number</primary></indexterm>.
In this case it wasn't needed, because 1 is the same in any
numbering system, but I put it there for clarity.
We also didn't need the 31 zeroes, but I put them in
to make a point that the number you are using is 32 bits.
</para>

<para>
When a number represents a set of options for a function<indexterm><primary>functions</primary></indexterm> or system call<indexterm><primary>system call</primary></indexterm>,
the individual true/false elements are called <emphasis>flags<indexterm><primary>flags</primary></indexterm></emphasis>.
Many system calls have numerous options that are all
set in the same register using a mechanism like we've described.
The <literal>open<indexterm><primary>open</primary></indexterm></literal> system call, for example, has as its
second parameter a list of flags to tell the operating system how
to open the file.  Some of the flags include:
</para>

<variablelist>


<varlistentry>
<term><literal>O_WRONLY<indexterm><primary>O_WRONLY</primary></indexterm></literal></term>
<listitem><para>
This flag is <literal>0b00000000000000000000000000000001</literal> in binary, or <literal>01</literal> in octal (or any number system for that matter).  This says to open the file in write-only mode.
</para></listitem>
</varlistentry>

<varlistentry>
<term><literal>O_RDWR<indexterm><primary>O_RDWR</primary></indexterm></literal></term>
<listitem><para>
This flag is <literal>0b00000000000000000000000000000010</literal> in binary, or <literal>02</literal> in octal.  This says to open the file for both reading and writing.
</para></listitem>
</varlistentry>

<varlistentry>
<term><literal>O_CREAT<indexterm><primary>O_CREAT</primary></indexterm></literal></term>
<listitem><para>
This flag is <literal>0b00000000000000000000000001000000</literal> in binary, or <literal>0100</literal> in octal.  It means to create the file if it doesn't already exist.
</para></listitem>
</varlistentry>


<varlistentry>
<term><literal>O_TRUNC<indexterm><primary>O_TRUNC</primary></indexterm></literal></term>
<listitem><para>
This flag is <literal>0b00000000000000000000001000000000</literal> in binary, or <literal>01000</literal> in octal.  It means to erase the contents of the file if the file already exists.
</para></listitem>
</varlistentry>


<varlistentry>
<term><literal>O_APPEND<indexterm><primary>O_APPEND</primary></indexterm></literal></term>
<listitem><para>
This flag is <literal>0b00000000000000000000010000000000</literal> in binary, or <literal>02000</literal> in octal.  It means to start writing at the end of the file rather than at the beginning.
</para></listitem>
</varlistentry>

</variablelist>

<para>
To use these flags, you simply OR<indexterm><primary>OR</primary></indexterm> them together in the combination that you
want.  For example, to open a file in write-only mode, and have it create the
file if it doesn't exist, I would use <literal>O_WRONLY</literal> (01) and
<literal>O_CREAT</literal> (0100).  OR'd together, I would have 0101.
</para>

<para>
Note that if you don't set either <literal>O_WRONLY</literal> or <literal>O_RDWR</literal>, then the file is automatically opened in read-only mode (<literal>O_RDONLY</literal>, except that it isn't really a flag since it's zero).
</para>

<para>
Many functions and system calls use flags for options, as it allows a single
word to hold up to 32 possible options if each option is represented by a
single bit.
</para>

</sect1>

<sect1>
<title>The Program Status Register</title>

<para>
We've seen how bits on a register can be used to give the answers
of yes/no and true/false statements.  On your computer, there
is a register called the <emphasis>program status register<indexterm><primary>program status register</primary></indexterm></emphasis>.
This register holds a lot of information about what happens in a computation.
For example, have you ever wondered what would happen if you added
two numbers and the result was larger than would fit in a register?
The program status register has a flag called the carry flag<indexterm><primary>carry flag</primary></indexterm>.
You can test it to see if the last computation overflowed the register.
There are flags for a number of different statuses.  In fact, when
you do a compare (<literal>cmpl<indexterm><primary>cmpl</primary></indexterm></literal>) instruction, the result
is stored in this register.  The conditional jump<indexterm><primary>conditional jump</primary></indexterm> instructions (<literal>jge</literal>,
<literal>jne</literal>, etc) use these results to tell whether or not
they should jump.  <literal>jmp<indexterm><primary>jmp</primary></indexterm></literal>, the
unconditional jump<indexterm><primary>unconditional jump</primary></indexterm>, doesn't care what is in the status register, since
it is unconditional.
</para>

<para>
Let's say you needed to store a number larger than 32 bits.  So, let's
say the number is 2 registers wide, or 64 bits.  How could you handle
this?  If you wanted to add two 64 bit numbers, you would add
the least significant registers first.  Then, if you detected an
carry, you could add 1 to the most significant register.
In fact, this is probably the way you learned to
do decimal addition.  If the result in one column is more than 9, you
simply carried the number to the next most significant column.  If you
added 65 and 37, first you add 7 and 4 to get 12.  You keep the 2 in
the right column, and carry the one to the next column.   There you add
6, 3, and the 1 you carried.  This results in 10.  So, you keep the
zero in that column and carry the one to the next most significant column,
which is empty, so you just put the one there.  Luckily, 32 bits is usually
big enough to hold the numbers we use regularly.
</para>

<para>
Additional program status register flags are examined in <xref linkend="instructionsappendix" />.
</para>

</sect1>


<sect1>
<title>Other Numbering Systems</title>

<para>
What we have studied so far only applies to positive integers.  However,
real-world numbers are not always positive integers.  Negative numbers
and numbers with decimals are also used.
</para>

<sect2 id="floatingpoint">
<title>Floating-point Numbers</title>

<para>
So far, the only numbers we've dealt with are integers - numbers
with no decimal point.  Computers have a general problem with
numbers with decimal points, because computers can only store
fixed-size, finite values.  Decimal numbers can be any length, including
infinite length (think of a repeating decimal, like the result of
1 / 3).
</para>

<para>
The way a computer handles decimals is by storing them
at a fixed precision<indexterm><primary>precision</primary></indexterm> (number of significant bits).
A computer stores decimal numbers in two
parts - the <emphasis>exponent<indexterm><primary>exponent</primary></indexterm></emphasis> and the <emphasis>mantissa<indexterm><primary>mantissa</primary></indexterm></emphasis>.  The mantissa contains the actual
digits that will be used, and the exponent is what magnitude the number
is.  For example, 12345.2 can be represented as 1.23452 * 10^4.  The mantissa
is 1.23452 and the exponent is 4 with a base of 10.  Computers, however,
use a base of 2.  All numbers are stored
as X.XXXXX * 2^XXXX.  The number 1, for example, is stored as 1.00000 * 2^0.
Separating the mantissa and the exponent into two different values is called
a <emphasis>floating point</emphasis><indexterm><primary>floating point</primary></indexterm> representation, because the position of the significant digits with respect to the decimal point can vary based on the exponent.
</para>

<para>
Now, the mantissa and the exponent are only so long, which leads to some
interesting problems.  For example, when a computer
stores an integer, if you add 1 to it, the resulting number is one larger.
This does not necessarily happen with floating point numbers.  If the
number is sufficiently big, adding 1 to it might
not even register in the mantissa (remember, both parts are only so long).
This affects several things, especially order of operations.  If you
add 1.0 to a given floating point number, it might not even affect the number
if it is large enough.  For example, on x86 platforms, a four-byte
floating-point number, although it can represent very large numbers, cannot
have 1.0 added to it past 16777216.0, because it is no longer significant.
The number no longer changes when 1.0 is added to it.  So, if there is
a multiplication followed by an addition it may give a different result than
if the addition is performed first.
</para>

<para>
You should note that it takes most computers a lot longer
to do floating-point arithmetic than it does integer arithmetic.  So,
for programs that really need speed, integers are mostly used.
</para>

<!-- FIXME - need floating point reference
<para>
For more information on using floating point numbers in assembly
language, see
</para>
-->

</sect2>

<sect2>
<title>Negative Numbers</title>

<para>
How would you think that negative numbers<indexterm><primary>negative numbers</primary></indexterm> on a computer might be represented?
One thought might be to use the first digit of a number
as the sign<indexterm><primary>sign</primary></indexterm>, so <literal>00000000000000000000000000000001</literal> would
represent the number 1, and <literal>10000000000000000000000000000001</literal>
would represent -1.  This makes a lot of sense, and in fact some old processors
work this way.  However, it has some problems.  First of all, it takes a lot
more circuitry to add and subtract signed numbers represented this way.
Even more problematic, this
representation has a problem with the number 0.  In
this system, you could have both a negative and a positive 0.  This leads
to a lot of questions, like "should negative zero be equal to positive zero?",
and "What should the sign of zero be in various circumstances?".
</para>

<para>
These problems were overcome by using a representation of negative
numbers called
<emphasis>two's complement<indexterm><primary>two's complement</primary></indexterm></emphasis> representation.  To get the negative
representation of a number in two's complement form, you must perform the
following steps:
</para>

<orderedlist>
<listitem><para>Perform a NOT<indexterm><primary>NOT</primary></indexterm> operation on the number</para></listitem>
<listitem><para>Add one to the resulting number</para></listitem>
</orderedlist>

<para>
So, to get the negative of <literal>00000000000000000000000000000001</literal>,
you would first do a NOT operation, which gives
<literal>11111111111111111111111111111110</literal>, and then add one, giving
<literal>11111111111111111111111111111111</literal>.  To get negative two,
first take <literal>00000000000000000000000000000010</literal>.  The NOT
of that number is <literal>11111111111111111111111111111101</literal>.  Adding
one gives <literal>11111111111111111111111111111110</literal>.  With this
representation, you can add numbers just as if they were positive, and
come out with the right answers.  For example, if you add one plus negative
one in binary, you will notice that all of the numbers flip to zero.  Also,
the first digit still carries the sign<indexterm><primary>sign</primary></indexterm> bit, making it simple to determine
whether or not the number is positive or negative.
Negative numbers will always have a <literal>1</literal> in the leftmost
bit.  This also changes which numbers are valid for a given number of bits.
With signed numbers, the possible magnitude of the values is split to allow
for both positive and negative numbers.  For example, a byte<indexterm><primary>bytes</primary></indexterm> can normally have
values up to 255.  A signed byte, however, can store values from -128 to 127.
</para>

<para>
One thing to note about the two's complement<indexterm><primary>two's complement</primary></indexterm> representation of signed numbers<indexterm><primary>signed numbers</primary></indexterm> is that, unlike unsigned quantities, if you increase the number
of bits, you can't just add zeroes to the left of the number.  For example,
let's say we are dealing with four-bit quantities and we had the number
-3, <literal>1101</literal>.  If we were to extend this into an eight-bit
register, we could not represent it as <literal>00001101</literal> as this
would represent 13, not -3.  When you increase the size of a signed quantity
in two's complement representation, you have to perform
<emphasis>sign extension<indexterm><primary>sign extension</primary></indexterm></emphasis>.  Sign extension means that you have
to pad the left-hand side of the quantity with whatever digit is in the
sign digit when you add bits.  So, if we extend a negative number by 4 digits,
we should fill the new digits with a 1.  If we extend a positive number by 4 digits, we
should fill the new digits with a 0.  So, the extension of -3 from four to
eight bits will yield <literal>11111101</literal>.
</para>

<para>
The x86 processor has different forms of several instructions depending on
whether they expect the quantities they operate on to be signed<indexterm><primary>signed</primary></indexterm> or unsigned<indexterm><primary>unsigned</primary></indexterm>.
These are listed in <xref linkend="instructionsappendix" />.  For example,
the x86 processor has both a sign-preserving shift-right, <literal>sarl<indexterm><primary>sarl</primary></indexterm></literal>, and a shift-right which does not preserve the sign bit, <literal>shrl<indexterm><primary>shrl</primary></indexterm></literal>.
</para>

</sect2>

</sect1>

<sect1 id="octalhexadecimal">
<title>Octal and Hexadecimal Numbers</title>

<para>
The numbering systems discussed so far have been decimal<indexterm><primary>decimal</primary></indexterm> and binary<indexterm><primary>binary</primary></indexterm>.
However, two others are used common in computing - octal<indexterm><primary>octal</primary></indexterm> and hexadecimal<indexterm><primary>hexadecimal</primary></indexterm>.
In fact, they are probably written more often than binary.  Octal is
a representation that only uses the numbers 0 through 7.  So the octal
number 10 is actually 8 in decimal because it is one group of eight.
Octal 121 is decimal 81 (one group of
64 (8^2), two groups of 8, and one left over).  What makes octal nice
is that every 3 binary digits make one octal digit (there is no such grouping
of binary digits into decimal).  So 0 is 000, 1 is 001, 2 is 010, 3 is
011, 4 is 100, 5 is 101, 6 is 110, and 7 is 111.
</para>

<para id="octalhexadecimalpermissions">
<indexterm zone="octalhexadecimalpermissions"><primary>permissions</primary></indexterm>
Permissions in Linux
are done using octal.  This is because Linux permissions are based on the
ability to read, write and execute.  The first bit is the read permission,
the second bit is the write permission, and the third bit is the execute
permission.  So, 0 (000) gives no permissions, 6 (110) gives read and
write permission, and 5 (101) gives read and execute permissions.
These numbers are then used for the three different sets of permissions - the
owner, the group, and everyone else.
The number 0644 means read and write
for the first permission set, and read-only for the second and third set.
The first permission set is for the owner of the file.  The
third permission set is for the group owner of the file.  The last
permission set is for everyone else.  So, <literal>0751</literal>
means that the owner of the file can read, write, and execute the
file, the group members can read and execute the file, and everyone
else can only execute the file.
</para>

<para>
Anyway, as you can see, octal is used to group bits (binary digits)
into threes.  The way the assembler knows that a number is octal is
because octal<indexterm><primary>octal</primary></indexterm> numbers are prefixed with a zero.
For example 010 means 10 in octal, which
is 8 in decimal. If you just write 10 that means 10 in decimal.  The
beginning zero is what differentiates the two.
So, <emphasis>be careful not to put any leading zeroes in front of decimal
numbers, or they will be interepreted as octal numbers</emphasis>!
</para>

<para>
Hexadecimal numbers (also called just "hex")
use the numbers 1-15 for each digit.  however,
since 10-15 don't have their own numbers, hexadecimal uses the
letters <literal>a</literal> through <literal>f</literal> to represent
them.  For example, the letter <literal>a</literal> represents 10, the
letter <literal>b</literal> represents 11, and so on.  10 in hexadecimal
is 16 in decimal.  In octal, each digit represented three bits.  In
hexadecimal<indexterm><primary>hexadecimal</primary></indexterm>, each digit represents four bits.  Every two digits is
a full byte<indexterm><primary>bytes</primary></indexterm>, and eight digits is a 32-bit word<indexterm><primary>word</primary></indexterm>.  So you see, it
is considerably easier to write a hexadecimal number than it is
to write a binary number, because it's only a quarter as many digits.
The most important number to remember in
hexadecimal is <literal>f</literal>, which means that all bits are
set.  So, if I want to set all of the bits of a register to 1, I
can just do

<programlisting>
	movl  $0xFFFFFFFF, %eax
</programlisting>

Which is considerably easier and less error-prone than writing

<programlisting>
	movl  $0b11111111111111111111111111111111, %eax
</programlisting>

Note also that hexadecimal<indexterm><primary>hexadecimal</primary></indexterm> numbers are prefixed with <literal>0x</literal>.
So, when we do

<programlisting>
	int<indexterm><primary>int</primary></indexterm>   $0x80
</programlisting>

We are calling interrupt number 128 (8 groups of 16), or interrupt
number <literal>0b00000000000000000000000010000000</literal>.
</para>

<para>
Hexadecimal and octal numbers take some getting used to, but they
are heavily used in computer programming.  It might be worthwhile
to make up some numbers in hex and try to convert them back and forth
to binary, decimal, and octal.
</para>
</sect1>

<sect1>
<title>Order of Bytes in a Word</title>

<para>
One thing that confuses many people when dealing with bits<indexterm><primary>bits</primary></indexterm> and bytes<indexterm><primary>bytes</primary></indexterm> on a low
level is that, when bytes are written from registers<indexterm><primary>registers</primary></indexterm> to memory<indexterm><primary>memory</primary></indexterm>, their bytes
are written out least-significant-portion-first.<footnote><para><emphasis>Significance</emphasis> in this context is referring to which digit they represent.  For example, in the number 294, the digit 2 is the most significant because it represents the hundreds place, 9 is the next most significant, and 4 is the least significant.</para></footnote>  What most people expect
is that if they have a word in a register, say <literal>0x5d 23 ef ee</literal> (the spacing is so you can see where the bytes are), the bytes will be
written to memory in that order.  However, on x86 processors, the bytes are actually
written in reverse order.  In memory the bytes would be
<literal>0xee ef 23 5d</literal> on x86 processors.  The bytes are written