Susam's C Pages

Pointers in K&R

Sat, 05 Sep 2020 00:00:00 +0000

I learnt C from the book The C Programming Language, 2nd ed. (K&R) written by Brian Kernighan and Dennis Ritchie about 18 years ago during my engineering studies. The subject of pointers was generally believed to be scary among fellow students and many of them bought pretty fat books that were dedicated solely to the topic of pointers. However, when I reached Chapter 5 of the book , I found that it did a wonderful job at teaching pointers in merely 34 pages. The chapter opens with this sentence:

A pointer is a variable that contains the address of a variable.

The exact point at which the whole topic of pointers became crystal clear was when I encountered this sentence in § 5.3 Pointers and Arrays:

Rather more surprising, at first sight, is the fact that a reference to a[i] can also be written as *(a+i).

Indeed, it was easy to confirm that by compiling and running the following program:

#include 

int main() {
    int a[] = {2, 3, 5, 7, 11};
    printf("%d\n", *(a + 2));
    printf("%d\n", a[2]);
    printf("%d\n", 2[a]);
    return 0;
}

The output is:

5
5
5

C was the first serious programming language I was learning back then and at that time, I don't think I could have come across a better book than K&R to learn this subject. Like many others, I too feel that this book is a model for technical writing. I wish more technical books were written like this with clear presentation and concise treatment.

Read on website | #c | #programming | #technology | #book

Leap Year Test in K&R

Sat, 29 Feb 2020 00:00:00 +0000

About 18 years ago, while learning to program a computer using C, I learnt the following test for leap year from the book The C Programming Language, 2nd ed. (K&R) written by Brian Kernighan and Dennis Ritchie. Section 2.5 (Arithmetic Operators) of the book uses the following test:

(year % 4 == 0 && year % 100 != 0) || year % 400 == 0

It came as a surprise to me. Prior to reading this, I did not know that centurial years are not leap years except for those centurial years that are also divisible by 400. Until then, I always incorrectly thought that all years divisible by 4 are leap years. I have witnessed only one centurial year, namely the year 2000, which happens to be divisible by 400. As a result, the year 2000 proved to be a leap year and my misconception remained unchallenged for another few years until I finally came across the above test in K&R.

Now that I understand that centurial years are not leap years unless divisible by 400, it is easy to confirm this with the Unix cal command. Enter cal 1800 or cal 1900 and we see calendars of non-leap years. But enter cal 2000 and we see the calendar of a leap year.

By the way, the following leap year test is equally effective:

year % 4 == 0 && (year % 100 != 0 || year % 400 == 0)

Update: In the comments section, Thaumasiotes explains why both tests work. Let me take the liberty of elaborating that comment further with a truth table. We use the notation A, B and C respectively, for the three comparisons in the above expressions. Then the two tests above can be expressed as the following boolean expressions:

(A && B) || C
A && (B || C)

Now normally these two boolean expressions are not equivalent. The truth table below shows this:

A B C (A && B) || C A && (B || C)

F F F F F

F F T T F

F T F F F

F T T T F

T F F F F

T F T T T

T T F T T

T T T T T

`A`	`B`	`C`	`(A && B) \|\| C`	`A && (B \|\| C)`
F	F	F	F	F
F	F	T	T	F
F	T	F	F	F
F	T	T	T	F
T	F	F	F	F
T	F	T	T	T
T	T	F	T	T
T	T	T	T	T

We see that there are two cases where the last two columns differ. This confirms that the two boolean expressions are not equivalent. The two cases where the boolean expressions yield different results occur when A is false and C is true. But these cases are impossible! If A is false and C is true, it means we have year % 4 != 0 and year % 400 == 0 which is impossible.

If year % 400 == 0 is true, then year % 4 == 0 must also hold true. In other words, if C is true, A must also be true. Therefore, the two cases where the last two columns differ cannot occur and may be ignored. The last two columns are equal in all other cases and that is why the two tests we have are equivalent.

C Standard Terms for Behaviour

Thu, 31 May 2018 00:00:00 +0000

Here are some excerpts from the final drafts of the C99 and C11 standards n1256.pdf and n1570.pdf respectively.

§3.4.0: behavior: external appearance or action
§3.4.1: implementation-defined behavior: unspecified behavior where each implementation documents how the choice is made.

EXAMPLE: An example of implementation-defined behavior is the propagation of the high-order bit when a signed integer is shifted right.
§3.4.2: locale-specific behavior: behavior that depends on local conventions of nationality, culture, and language that each implementation documents.

EXAMPLE: An example of locale-specific behavior is whether the islower function returns true for characters other than the 26 lowercase Latin letters.
§3.4.3: undefined behavior: behavior, upon use of a nonportable or erroneous program construct or of erroneous data, for which this International Standard imposes no requirements.

NOTE: Possible undefined behavior ranges from ignoring the situation completely with unpredictable results, to behaving during translation or program execution in a documented manner characteristic of the environment (with or without the issuance of a diagnostic message), to terminating a translation or execution (with the issuance of a diagnostic message).

EXAMPLE: An example of undefined behavior is the behavior on integer overflow.
§3.4.4: unspecified behavior: use of an unspecified value, or other behavior where this International Standard provides two or more possibilities and imposes no further requirements on which is chosen in any instance.

EXAMPLE: An example of unspecified behavior is the order in which the arguments to a function are evaluated.

Read on website | #c | #programming | #technology

Loopy C Puzzle

Sat, 01 Oct 2011 00:00:00 +0000

Integer Underflow

Let us talk a little bit about integer underflow and undefined behaviour in C before we discuss the puzzle I want to share in this post.

#include 

int main()
{
    int i;
    for (i = 0; i < 6; i--) {
        printf(".");
    }
    return 0;
}

This code invokes undefined behaviour. The value in variable i decrements to INT_MIN after |INT_MIN| iterations. In the next iteration, there is a negative overflow which is undefined for signed integers in C. On many implementations though, INT_MIN - 1 wraps around to INT_MAX. Since INT_MAX is not less than 6, the loop terminates. With such implementations, this code prints print |INT_MIN| + 1 dots. With 32-bit integers, that amounts to 2147483649 dots. Here is one such example output:

$ gcc -std=c89 -Wall -Wextra -pedantic foo.c && ./a.out | wc -c
2147483649

It is worth noting that the above behaviour is only one of the many possible ones. The code invokes undefined behaviour and the ISO standard imposes no requirements on a specific implementation of the compiler regarding what the behaviour of such code should be. For example, an implementation could also exploit the undefined behaviour to turn the loop into an infinite loop. In fact, GCC does optimise it to an infinite loop if we compile the code with the -O2 option.

# This never terminates!
$ gcc -O2 -std=c89 -Wall -Wextra -pedantic foo.c && ./a.out

Puzzle

Let us take a look at the puzzle now.

Add or modify exactly one operator in the following code such that it prints exactly 6 dots.

for (i = 0; i < 6; i--) {
    printf(".");
}

An obvious solution is to change i-- to i++.

for (i = 0; i < 6; i++) {
    printf(".");
}

There are a few more solutions to this puzzle. One of the solutions is very interesting. We will discuss the interesting solution in detail below.

Solutions

Update on 02 Oct 2011: The puzzle has been solved in the comments section. We will discuss the solutions now. If you want to think about the problem before you see the solutions, this is a good time to pause and think about it. There are spoilers ahead.

Here is a list of some solutions:

for (i = 0; i < 6; i++)
for (i = 0; i < 6; ++i)
for (i = 0; -i < 6; i--)
for (i = 0; i + 6; i--)
for (i = 0; i ^= 6; i--)

The last solution involving the bitwise XOR operation is not immediately obvious. A little analysis is required to understand why it works.

Generalisation

Let us generalise the puzzle by replacing $ 6 $ in the loop with an arbitrary positive integer $ n. $ The loop in the last solution now becomes:

for (i = 0; i ^= n; i--) {
    printf(".");
}

If we denote the value of the variable i set by the execution of i ^= n after $ k $ dots are printed as $ f(k), $ then \[ f(k) = \begin{cases} 0 & \text{if } k = 0, \\ n \oplus (f(k - 1) - 1) & \text{if } k \gt 1 \end{cases} \] where $ k $ is a nonnegative integer, $ n $ is a positive integer and the symbol $ \oplus $ denotes bitwise XOR operation on two nonnegative integers.

Note that $ f(0) $ represents the value of i set by the execution of i ^= n when no dots have been printed yet.

If we can show that $ n $ is the least value of $ k $ for which $ f(k) = 0, $ it would prove that the loop terminates after printing $ n $ dots.

We will see in the next section that for odd values of $ n, $ \[ f(k) = \begin{cases} n & \text{if } k \text{ is even}, \\ 1 & \text{if } k \text{ is odd}. \end{cases} \] Therefore there is no value of $ k $ for which $ f(k) = 0 $ when $ n $ is odd. As a result, the loop never terminates when $ n $ is odd.

We will then see that for even values of $ n $ and $ 0 \leq k \leq n, $ \[ f(k) = 0 \iff k = n. \] Therefore the loop terminates after printing $ n $ dots when $ n $ is even.

Lemmas

We will first prove a few lemmas about some interesting properties of the bitwise XOR operation. We will then use it to prove the claims made in the previous section.

Lemma 1. For an odd positive integer $ n, $ \[ n \oplus (n - 1) = 1 \] where the symbol $ \oplus $ denotes bitwise XOR operation on two nonnegative integers.

Proof. Let the binary representation of $ n $ be $ b_m \dots b_1 b_0 $ where $ m $ is a nonnegative integer and $ b_m $ represents the most significant nonzero bit of $ n. $ Since $ n $ is an odd number, $ b_0 = 1. $ Thus $ n $ may be written as \[ b_m \dots b_1 1. \] As a result $ n - 1 $ may be written as \[ b_m \dots b_1 0. \] The bitwise XOR of both binary representations is $ 1. $

Lemma 2. For a nonnegative integer $ n, $ \[ n \oplus 1 = \begin{cases} n + 1 & \text{if } n \text{ is even}, \\ n - 1 & \text{if } n \text{ is odd}. \end{cases} \] where the symbol $ \oplus $ denotes bitwise XOR operation on two nonnegative integers.

Proof. Let the binary representation of $ n $ be $ b_m \dots b_1 b_0 $ where $ m $ is a nonnegative integer and $ b_m $ represents the most significant nonzero bit of $ n. $

If $ n $ is even, $ b_0 = 0. $ In this case, $ n $ may be written as $ b_m \dots b_1 0. $ Thus $ n \oplus 1 $ may be written as $ b_m \dots b_1 1. $ Therefore $ n \oplus 1 = n + 1. $

If $ n $ is odd, $ b_0 = 1. $ In this case, $ n $ may be written as $ b_m \dots b_1 1. $ Thus $ n \oplus 1 $ may be written as $ b_m \dots b_1 0. $ Therefore $ n \oplus 1 = n - 1. $

Note that for odd $ n, $ lemma 1 can also be derived as a corollary of lemma 2 in this manner: \[ k \oplus (k - 1) = k \oplus (k \oplus 1) = (k \oplus k) \oplus 1 = 0 \oplus 1 = 1. \]

Lemma 3. If $ x $ is an even nonnegative integer and $ y $ is an odd positive integer, then $ x \oplus y $ is odd, where the symbol $ \oplus $ denotes bitwise XOR operation on two nonnegative integers.

Proof. Let the binary representation of $ x $ be $ b_{xm_x} \dots b_{x1} b_{x0} $ and that of $ y $ be $ b_{ym_y} \dots b_{y1} b_{y0} $ where $ m_x $ and $ m_y $ are nonnegative integers and $ b_{xm_x} $ and $ b_{xm_y} $ represent the most significant nonzero bits of $ x $ and $ y $ respectively.

Since $ x $ is even, $ b_{x0} = 0. $ Since $ y $ is odd, $ b_{y0} = 1. $

Let $ z = x \oplus y $ with a binary representation of $ b_{zm_z} \dots b_{z1} b_{z0} $ where $ m_{zm_z} $ is a nonnegative integer and $ b_{zm_z} $ is the most significant nonzero bit of $ z. $

We get $ b_{z0} = b_{x0} \oplus b_{y0} = 0 \oplus 1 = 1. $ Therefore $ z $ is odd.

Theorems

Theorem 1. Let $ \oplus $ denote bitwise XOR operation on two nonnegative integers and \[ f(k) = \begin{cases} n & \text{if } n = 0, \\ n \oplus (f(n - 1) - 1) & \text{if } n \gt 1. \end{cases} \] where $ k $ is a nonnegative integer and $ n $ is an odd positive integer. Then \[ f(k) = \begin{cases} n & \text{if } k \text{ is even}, \\ 1 & \text{if } k \text{ is odd}. \end{cases} \]

Proof. This is a proof by mathematical induction. We have $ f(0) = n $ by definition. Therefore the base case holds good.

Let us assume that $ f(k) = n $ for any even $ k $ (induction hypothesis). Let $ k' = k + 1 $ and $ k'' = k + 2. $

If $ k $ is even, we get \begin{align*} f(k') & = n \oplus (f(k) - 1) && \text{(by definition)} \\ & = n \oplus (n - 1) && \text{(by induction hypothesis)} \\ & = 1 && \text{(by lemma 1)},\\ f(k'') & = n \oplus (f(k') - 1) && \text{(by definition)} \\ & = n \oplus (1 - 1) && \text{(since $ f(k') = 1 $)} \\ & = n \oplus 0 \\ & = n. \end{align*}

Since $ f(k'') = n $ and $ k'' $ is the next even number after $ k, $ the induction step is complete. The induction step shows that for every even $ k, $ $ f(k) = n $ holds good. It also shows that as a result of $ f(k) = n $ for every even $ k, $ we get $ f(k') = 1 $ for every odd $ k'. $

Theorem 2. Let $ \oplus $ denote bitwise XOR operation on two nonnegative integers and \[ f(k) = \begin{cases} n & \text{if } n = 0, \\ n \oplus (f(n - 1) - 1) & \text{if } n \gt 1. \end{cases} \] where $ k $ is a nonnegative integer, $ n $ is an even positive integer and $ 0 \leq k \leq n. $ Then \[ f(k) = 0 \iff k = n. \]

Proof. We will first show by the principle of mathematical induction that for even $ k, $ $ f(k) = n - k. $ We have $ f(0) = n $ by definition, so the base case holds good. Now let us assume that $ f(k) = n - k $ holds good for any even $ k $ where $ 0 \leq k \leq n $ (induction hypothesis).

Since $ n $ is even (by definition) and $ k $ is even (by induction hypothesis), $ f(k) = n - k $ is even. As a result, $ f(k) - 1 $ is odd. By lemma 3, we conclude that $ f(k + 1) = n \oplus (f(k) - 1) $ is odd.

Now we perform the induction step as follows: \begin{align*} f(k + 2) & = n \oplus (f(k + 1) - 1) && \text{(by definition)} \\ & = n \oplus (f(k + 1) \oplus 1) && \text{(by lemma 2 for odd $ n $)} \\ & = n \oplus ((n \oplus (f(k) - 1)) \oplus 1) && \text{(by definition)} \\ & = (n \oplus n ) \oplus ((f(k) - 1) \oplus 1) && \text{(by associativity of XOR)} \\ & = 0 \oplus ((f(k) - 1) \oplus 1) \\ & = (f(k) - 1) \oplus 1 \\ & = (f(k) - 1) - 1 && \text{(from lemma 2 for odd $ n $)} \\ & = f(k) - 2 \\ & = n - k - 2 && \text{(by induction hypothesis).} \end{align*} This completes the induction step and proves that $ f(k) = n - k $ for even $ k $ where $ 0 \leq k \leq n. $

We have shown above that $ f(k) $ is even for every even $ k $ where $ 0 \leq k \leq n $ which results in $ f(k + 1) $ as odd for every odd $ k + 1. $ This means that $ f(k) $ cannot be $ 0 $ for any odd $ k. $ Therefore $ f(k) = 0 $ is possible only even $ k. $ Solving $ f(k) = n - k = 0, $ we conclude that $ f(k) = 0 $ if and only if $ k = n. $

URL in C

Fri, 03 Jun 2011 00:00:00 +0000

Here is an interesting C puzzle I created recently. It is a silly one but you might find it amusing.

#include 

int main()
{
    https://susam.net/
    printf("hello, world\n");
    return 0;
}

This code compiles and runs successfully.

$ c99 hello.c && ./a.out
hello, world

However, the C99 standard does not mention anywhere that a URL is a valid syntactic element in C. How does this code work then?

Update on 04 Jun 2011: The puzzle has been solved in the comments section. If you want to think about the problem before you see the solutions, this is a good time to pause and think about it. There are spoilers ahead.

The code works fine because https: is a label and // following it begins a comment. In case, you are wondering if // is indeed a valid comment in C, yes, it is, since C99. Download the C99 standard, go to section 6.4.9 (Comments) and read the second point which mentions this:

Except within a character constant, a string literal, or a comment, the characters // introduce a comment that includes all multibyte characters up to, but not including, the next new-line character. The contents of such a comment are examined only to identify multibyte characters and to find the terminating new-line character.

Read on website | #c | #programming | #technology | #puzzle

Ternary Operator Puzzle

Wed, 06 Apr 2011 00:00:00 +0000

What is the shortest statement you can write in the C or C++ programming language to express the following statement?

a = (a == 0 ? 0 : 1);

See the comments page for the solution.

Read on website | #c | #programming | #technology | #puzzle

Clumsy Pointers

Mon, 29 Nov 2010 00:00:00 +0000

Pointer Declarator

Here is a fun puzzle that involves complex type declarations in C:

Without using typedef, declare x as a pointer to a function that takes one argument which is an array of 10 pointers to functions which in turn take int * as their only argument and returns a pointer to a function which has int * argument and void return type.

Here is a simpler way to state this puzzle:

Without using typedef, declare x as a pointer that is equivalent to the following declaration of x:

typedef void (*func_t)(int *);
func_t (*x)(func_t [10]);

If you want to think about this puzzle, this is a good time to pause and think about it. There are spoilers ahead.

Let me describe how I solve such problems. Let us start from the right end of the problem and work our way to the left end defining each part one by one.

void x(int *)
A function that has int * argument and void return type.

void (*x)(int *)
A pointer to a function that has int * argument and void return type.

void (*x())(int *)
A function that returns a pointer to a function that has int * argument and void return type.

void (*x(void (*)(int *)))(int *)
A function that has a pointer to a function that has int * argument and void return type as argument and returns a pointer to a function which has int * argument and void return type.

void (*x(void (*[10])(int *)))(int *)
A function that has an array of 10 pointers to functions that has int * argument and void return type as argument and returns a pointer to a function which has int * argument and void return type.

void (*(*x)(void (*[10])(int *)))(int *)
A pointer to a function that has an array of 10 pointers to functions that has int * argument and void return type as argument and returns a pointer to a function which has int * argument and void return type.

Example Code

Here is an example that uses the above pointer declaration in a program in order to verify that it works as expected:

#include 

/* A function which has int * argument and void return type.  */
void g(int *a)
{
    printf("g(): a = %d\n", *a);
}

/* A function which has an array of 10 pointers to g()-like functions
   and returns a pointer to a g()-like funciton.  */
void (*f(void (*a[10])(int *)))(int *)
{
    int i;
    for (i = 0; i < 10; i++)
        a[i](&i);
    return g;
}

int main()
{
    /* An array of 10 pointers to g().  */
    void (*a[10])(int *) = {g, g, g, g, g, g, g, g, g, g};

    /* A pointer to function f().  */
    void (*(*x)(void (*[10])(int *)))(int *) = f;

    /* A pointer to function g() returned by f().  */
    void (*y)(int *a) = x(a);

    int i = 10;
    y(&i);
    return 0;
}

Here is the output of this program:

$ gcc -Wall -Wextra -pedantic -std=c99 foo.c && ./a.out
g(): a = 0
g(): a = 1
g(): a = 2
g(): a = 3
g(): a = 4
g(): a = 5
g(): a = 6
g(): a = 7
g(): a = 8
g(): a = 9
g(): a = 10

Stack Overwriting Function

Wed, 28 Jul 2010 00:00:00 +0000

Skipping Over a Function Call

Here is a C puzzle that involves some analysis of the machine code generated from it followed by manipulation of the runtime stack. The solution to this puzzle is implementation-dependent. Here is the puzzle:

Consider this C code:

#include 

void f()
{
}

int main()
{
    printf("1\n");
    f();
    printf("2\n");
    printf("3\n");
    return 0;
}

Define the function f() such that the output of the above code is:

1
3

Printing 3 in f() and exiting is not allowed as a solution.

If you want to think about this problem, this is a good time to pause and think about it. There are spoilers ahead.

The solution essentially involves figuring out what code we can place in the body of f() such that it causes the program to skip over the machine code generated for the printf("2\n") operation. I'll share two solutions for two different implementations:

gcc 4.3.2 on 64-bit Debian 5.0.3 running on 64-bit Intel Core 2 Duo.
Microsoft Visual Studio 2005 on 32-bit Windows XP running on 64-bit Intel Core 2 Duo.

Solution for GCC

Let us first see step by step how I approached this problem for GCC. We add a statement char a = 7; to the function f(). The code looks like this:

#include 

void f()
{
    char a = 7;
}

int main()
{
    printf("1\n");
    f();
    printf("2\n");
    printf("3\n");
    return 0;
}

There is nothing special about the number 7 here. We just want to define a variable in f() and assign some value to it.

Then we compile the code and analyse the machine code generated for f() and main() functions.

$ gcc -c overwrite.c && objdump -d overwrite.o

overwrite.o:     file format elf64-x86-64


Disassembly of section .text:

0000000000000000 :
   0:   55                      push   %rbp
   1:   48 89 e5                mov    %rsp,%rbp
   4:   c6 45 ff 07             movb   $0x7,-0x1(%rbp)
   8:   c9                      leaveq
   9:   c3                      retq

000000000000000a :
   a:   55                      push   %rbp
   b:   48 89 e5                mov    %rsp,%rbp
   e:   bf 00 00 00 00          mov    $0x0,%edi
  13:   e8 00 00 00 00          callq  18 
  18:   b8 00 00 00 00          mov    $0x0,%eax
  1d:   e8 00 00 00 00          callq  22 
  22:   bf 00 00 00 00          mov    $0x0,%edi
  27:   e8 00 00 00 00          callq  2c 
  2c:   bf 00 00 00 00          mov    $0x0,%edi
  31:   e8 00 00 00 00          callq  36 
  36:   b8 00 00 00 00          mov    $0x0,%eax
  3b:   c9                      leaveq
  3c:   c3                      retq

When main() calls f(), the microprocessor saves the return address (where the control must return to after f() is executed) in stack. The line at offset 1d in the listing above for main() is the call to f(). After f() is executed, the instruction at offset 22 is executed. Therefore the return address that is saved on stack is the address at which the instruction at offset 22 would be present at runtime.

The instructions at offsets 22 and 27 are the instructions for the printf("2\n") call. These are the instructions we want to skip over. In other words, we want to modify the return address in the stack from the address of the instruction at offset 22 to that of the instruction at offset 2c. This is equivalent to skipping 10 bytes (0x2c - 0x22 = 10) of machine code or adding 10 to the return address saved in the stack.

Now how do we get hold of the return address saved in the stack when f() is being executed? This is where the variable a we defined in f() helps. The instruction at offset 4 is the instruction generated for assigning 7 to the variable a.

From the knowledge of how microprocessor works and from the machine code generated for f(), we find that the following sequence of steps are performed during the call to f():

The microprocessor saves the return address by pushing the content of RIP (instruction pointer) register into the stack.
The function f() pushes the content of the RBP (base pointer) register into the stack.
The function f() copies the content of the RSP (stack pointer) register to the RBP register.
The function f() stores the byte value 7 at the memory address specified by the content of RBP minus 1. This achieves the assignment of the value 7 to the variable a.

After 7 is assigned to the variable a, the stack is in the following state:

Address Content Size (in bytes)

&a + 5 Return address (old RIP) 8

&a + 1 Old base pointer (old RBP) 8

&a Variable a 1

Address	Content	Size (in bytes)
`&a + 5`	Return address (old RIP)	8
`&a + 1`	Old base pointer (old RBP)	8
`&a`	Variable `a`	1

If we add 9 to the address of the variable a, i.e. &a, we get the address where the return address is stored. We saw earlier that if we increment this return address by 10 bytes, it solves the problem. Therefore here is the solution code:

#include 

void f()
{
    char a;
    (&a)[9] += 10;
}

int main()
{
    printf("1\n");
    f();
    printf("2\n");
    printf("3\n");
    return 0;
}

Finally, we compile and run this code and confirm that the solution works fine:

$ gcc overwrite.c && ./a.out
1
3

Solution for Visual Studio

Now we will see another example solution, this time for Visual Studio 2005.

Like before we define a variable a in f(). The code now looks like this:

#include 

void f()
{
    char a = 7;
}

int main()
{
    printf("1\n");
    f();
    printf("2\n");
    printf("3\n");
    return 0;
}

Then we compile the code and analyse the machine code generated from it.

C:\>cl overwrite.c
Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.42
for 80x86
Copyright (C) Microsoft Corporation.  All rights reserved.

overwrite.c
Microsoft (R) Incremental Linker Version 8.00.50727.42
Copyright (C) Microsoft Corporation.  All rights reserved.

/out:overwrite.exe
overwrite.obj

C:\>dumpbin /disasm overwrite.obj
Microsoft (R) COFF/PE Dumper Version 8.00.50727.42
Copyright (C) Microsoft Corporation.  All rights reserved.


Dump of file overwrite.obj

File Type: COFF OBJECT

_f:
  00000000: 55                 push        ebp
  00000001: 8B EC              mov         ebp,esp
  00000003: 51                 push        ecx
  00000004: C6 45 FF 07        mov         byte ptr [ebp-1],7
  00000008: 8B E5              mov         esp,ebp
  0000000A: 5D                 pop         ebp
  0000000B: C3                 ret
  0000000C: CC                 int         3
  0000000D: CC                 int         3
  0000000E: CC                 int         3
  0000000F: CC                 int         3
_main:
  00000010: 55                 push        ebp
  00000011: 8B EC              mov         ebp,esp
  00000013: 68 00 00 00 00     push        offset $SG2224
  00000018: E8 00 00 00 00     call        _printf
  0000001D: 83 C4 04           add         esp,4
  00000020: E8 00 00 00 00     call        _f
  00000025: 68 00 00 00 00     push        offset $SG2225
  0000002A: E8 00 00 00 00     call        _printf
  0000002F: 83 C4 04           add         esp,4
  00000032: 68 00 00 00 00     push        offset $SG2226
  00000037: E8 00 00 00 00     call        _printf
  0000003C: 83 C4 04           add         esp,4
  0000003F: 33 C0              xor         eax,eax
  00000041: 5D                 pop         ebp
  00000042: C3                 ret

  Summary

           B .data
          57 .debug$S
          2F .drectve
          43 .text

Just like in the previous objdump listing, in this listing too, the instruction at offset 4 shows where the variable a is allocated and the instructions at offsets 25, 2A and 2F show the instructions we want to skip, i.e. instead of returning to the instruction at offset 25, we want the microprocessor to return to the instruction at offset 32. This involves skipping 13 bytes (0x32 - 0x25 = 13) of machine code.

Unlike the previous objdump listing, in this listing we see that the Visual Studio I am using is a 32-bit on, so it generates machine code to use 32-bit registers like EBP, ESP, etc. Thus the stack looks like this after 7 is assigned to the variable a:

Address Content Size (in bytes)

&a + 5 Return address (old EIP) 4

&a + 1 Old base pointer (old EBP) 4

&a Variable a 1

Address	Content	Size (in bytes)
`&a + 5`	Return address (old EIP)	4
`&a + 1`	Old base pointer (old EBP)	4
`&a`	Variable `a`	1

If we add 5 to the address of the variable a, i.e. &a, we get the address where the return address is stored. Here is the solution code:

#include 

void f()
{
    char a;
    (&a)[5] += 13;
}

int main()
{
    printf("1\n");
    f();
    printf("2\n");
    printf("3\n");
    return 0;
}

Finally, we compile and run this code and confirm that the solution works fine:

C:\>cl /w overwrite.c
Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.42
for 80x86
Copyright (C) Microsoft Corporation.  All rights reserved.

overwrite.c
Microsoft (R) Incremental Linker Version 8.00.50727.42
Copyright (C) Microsoft Corporation.  All rights reserved.

/out:overwrite.exe
overwrite.obj

C:\>overwrite.exe
1
3

Conclusion

The machine code that the compiler generates for a given C code is highly dependent on the implementation of the compiler. In the two examples above, we have two different solutions for two different compilers.

Even with the same brand of compiler, the way it generates machine code for a given code may change from one version of the compiler to another. Therefore, it is very likely that the above solution would not work on another system (such as your system) even if you use the same compiler that I am using in the examples above.

However, we can arrive at the solution for an implementation of the compiler by determining what number to add to &a to get the address where the return address is saved on stack and what number to add to this return address to make it point to the instruction we want to skip to after f() returns.

Read on website | #c | #programming | #technology | #puzzle

Big-Endian on Little-Endian

Sun, 20 Jun 2010 00:00:00 +0000

In this post, I will share how I set up big-endian emulation on my little-endian Intel machine to tets a program for byte order related issues. I used the QEMU PowerPC emulator to set up the big-endian emulation. The steps to do so are documented in the list below.

Install QEMU.
```
apt-get update && apt-get install qemu
```
Download mol-0.9.72.1.tar.bz2 from http://sourceforge.net/projects/mac-on-linux/files/ and copy the file named video.x from the downloaded tarball to /usr/share/qemu/. This is necessary to prevent qemu-system-ppc from complaining about it.
```
wget https://sourceforge.net/projects/mac-on-linux/files/mac-on-linux/mol-0.9.72.1/mol-0.9.72.1.tar.bz2
tar -xjf mol-0.9.72.1.tar.bz2
sudo cp mol-0.9.72.1/mollib/drivers/video.x /usr/share/qemu/
```
Create a QEMU hard disk image.
```
qemu-img create powerpc.img 2G
```

Download Debian for PowerPC and install it on the QEMU hard disk image.

wget http://cdimage.debian.org/debian-cd/5.0.4/powerpc/iso-cd/debian-504-powerpc-CD-1.iso
qemu-system-ppc -m 512 -boot d -hda powerpc.img -cdrom debian-504-powerpc-CD-1.iso

Boot the QEMU PowerPC emulator with the new hard disk image.
```
qemu-system-ppc -m 512 -hda powerpc.img
```

Write a small program inside the new Debian system, say, endian.c like this:

#include 

int main()
{
    int n = 1;
    printf(*((char *) &n) ? "little-endian\n" : "big-endian\n");
    return 0;
}

Compile and execute the C program.
```
$ gcc endian.c && ./a.out
big-endian
```

Read on website | #c | #programming | #technology

Sequence Points

Wed, 26 May 2010 00:00:00 +0000

Code Examples

A particular type of question comes up often in C programming forums. Here is an example of such a question:

#include 

int main()
{
    int i = 5;
    printf("%d %d %d\n", i, i--, ++i);
    return 0;
}

The output is 5 6 5 when compiled with GCC and 6 6 6 when compiled with the C compiler that comes with Microsoft Visual Studio. The versions of the compilers with which I got these results are:

gcc (Debian 4.3.2-1.1) 4.3.2
Microsoft Visual Studio 2005 32-Bit C/C++ Optimizing Compiler Version 14.00.50727.42 for 80x86

Here is another example of such a question:

#include 

int main()
{
    int a = 5;
    a += a++ + a++;
    printf("%d\n", a);
    return 0;
}

In this case, I got the output 17 with both the compilers.

The behaviour of such C programs is undefined. Consider the following two statements:

printf("%d %d %d\n", i, i--, ++i);
a += a++ + a++;

We will see below that in both the statements, the variable is modified twice between two consecutive sequence points. If the value of a variable is modified more than once between two consecutive sequence points, the behaviour is undefined. Such code may behave differently when compiled with different compilers.

K&R

Before looking at the relevant sections of the C99 standard, let us see what the book The C Programming Language, Second Edition says about such C statements. In Section 2.12 (Precedence and Order of Evaluation) of the book, the authors write:

C, like most languages, does not specify the order in which the operands of an operator are evaluated. (The exceptions are &&, ||, ?:, and ','.) For example, in a statement like
x = f() + g();
f may be evaluated before g or vice versa; thus if either f or g alters a variable on which the other depends, x can depend on the order of evaluation. Intermediate results can be stored in temporary variables to ensure a particular sequence.

In the next paragraph, they write,

Similarly, the order in which function arguments are evaluated is not specified, so the statement
printf("%d %d\n", ++n, power(2, n));    /* WRONG */
can produce different results with different compilers, depending on whether n is incremented before power is called. The solution, of course, is to write
++n;
printf("%d %d\n", n, power(2, n));

They provide one more example in this section:

One unhappy situation is typified by the statement
a[i] = i++;
The question is whether the subscript is the old value of i or the new. Compilers can interpret this in different ways and generate different answers depending on their interpretation.

C99

To read more about this, download the C99 standard, go to section 5.1.2.3 (Program execution) and see the second point which mentions:

Accessing a volatile object, modifying an object, modifying a file, or calling a function that does any of those operations are all side effects,¹¹⁾ which are changes in the state of the execution environment. Evaluation of an expression may produce side effects. At certain specified points in the execution sequence called sequence points, all side effects of previous evaluations shall be complete and no side effects of subsequent evaluations shall have taken place. (A summary of the sequence points is given in annex C.)

Then go to section 6.5 and see the second point which mentions:

Between the previous and next sequence point an object shall have its stored value modified at most once by the evaluation of an expression.⁷²⁾ Furthermore, the prior value shall be read only to determine the value to be stored.⁷³⁾

Finally go to Annex C (Sequence Points). It lists all the sequence points. For example, the following is mentioned as a sequence point:

The call to a function, after the arguments have been evaluated (6.5.2.2).

This means that in the statement

printf("%d %d %d\n", i, i--, ++i);

there is a sequence point after the evaluation of the three arguments (i, i-- and ++i) and before the printf() function is called. But none of the items specified in Annex C implies that there is a sequence point between the evaluation of the arguments. Yet the value of i is modified more than once during the evaluation of these arguments. This makes the behaviour of this statement undefined. Further, the value of i is being read not only for determining what it must be updated to but also for using as arguments to the printf() call. This also makes the behaviour of this code undefined.

Let us see another example of a sequence point from Annex C.

The end of a full expression: an initializer (6.7.8); the expression in an expression statement (6.8.3); the controlling expression of a selection statement (if or switch) (6.8.4); the controlling expression of a while or do statement (6.8.5); each of the expressions of a for statement (6.8.5.3); the expression in a return statement (6.8.6.4).

Therefore in the statement

a += a++ + a++;

there is a sequence point at the end of the complete expression (marked with a semicolon) but there is no other sequence point before it. Yet the value of a is modified twice before the sequence point. Thus the behaviour of this statement is undefined.

Read on website | #c | #programming | #technology

Obfuscated Main

Sun, 02 Nov 2003 00:00:00 +0000

I have been running a mailing list called ncoders on Yahoo Groups for the past few months. I created it to host discussions on computers, programming and network protocols among university students. There are currently about 150 students from various universities across the world on the list. A few weeks ago, someone posted a C programming puzzle to the group. The puzzle asked whether it was possible to write a C program such that the main() function does not seem to appear in the code. Here's a solution I came up with, which involves obfuscating the identifier main using preprocessor macros and the ## token-pasting operator.

#include 

#define decode(s,t,u,m,p,e,d) m ## s ## u ## t
#define begin decode(a,n,i,m,a,t,e)

int begin()
{
    printf("Stumped?\n");
}

This program compiles and runs successfully. Here is the output:

Stumped?

Let me explain how this code works. When the C preprocessor runs on this code, the following preprocessing steps occur:

begin is replaced with decode(a,n,i,m,a,t,e),
decode(a,n,i,m,a,t,e) is replaced with m ## a ## i ## n and
m ## a ## i ## n is replaced with main.

Thus begin() is replaced with main().

Update on 31 Jul 2007: Although the mailing list referred to in this post no longer exists, this tiny piece of code seems to have survived on the web. A quick search shows so many occurrences of this code on the web. It is quite surprising to me that a rather silly piece of code written during a Sunday afternoon to solve an equally silly puzzle has been the subject of much discussion!

Read on website | #c | #programming | #technology | #puzzle

C Quine

Sun, 19 Oct 2003 00:00:00 +0000

A quine is a computer program that produces an exact copy of its own source code as its output. It must not consume any input, so tricks involving reading its own source code and printing it are not permitted.

The Classic Quine

Here is a classic quine I came across a few days ago in a mailing list:

main(){char*s="main(){char*s=%c%s%c;printf(s,34,s,34);}";printf(s,34,s,34);}

This program is written in K&R C. The current version of GCC compiles it fine. It is a valid quine on ASCII machines because this program uses the integer code 34 to print the quotation mark (") character. This will be explained further in the next section. On another implementation of the C compiler which does not use ASCII code for the quotation mark character, the program needs to be modified to the use the correct code.

Here are some commands that demonstrate the quine:

$ printf '%s' 'main(){char*s="main(){char*s=%c%s%c;printf(s,34,s,34);}";printf(s,34,s,34);}' > quine.c
$ cc quine.c && ./a.out > out.txt && diff quine.c out.txt
$ cat quine.c; echo
main(){char*s="main(){char*s=%c%s%c;printf(s,34,s,34);}";printf(s,34,s,34);}
$ ./a.out
main(){char*s="main(){char*s=%c%s%c;printf(s,34,s,34);}";printf(s,34,s,34);}

The source code of this quine does not end with a newline. The -n option of GNU echo ensures that the source code file is created without a terminating newline.

Close Look at the Classic Quine

Let us take a close look at how the quine introduced in the previous section works. Let us add some newlines in the source code of this quine for the sake of clarity.

main()
{
    char*s="main(){char*s=%c%s%c;printf(s,34,s,34);}";
    printf(s,34,s,34);
}

This is almost the same program presented in the previous section. Only a few newlines have been added to it to make the program easier to read.

We can see that the printf call uses the string s as the format string. The format string contains three conversion specifications: %c, %s and %c. The arguments for these conversions are: 34, the string s itself and 34 once again. Note that 34 is the ASCII code for the quotation mark character ("). With that in mind, let us now construct the output of the printf call in a step-by-step manner.

The initial portion of the output consists of the format string from the beginning up to, but not including, the first conversion specification copied unchanged to the output stream. Here it is:

main(){char*s=

Then the first conversion specification %c is processed, the corresponnding argument 34 is taken and a quotation mark is printed like this:

Then the second conversion specification %s is processed. The corresponding argument is the string s itself, so the entire string is printed like this:

main(){char*s=%c%s%c;printf(s,34,s,34);}

Then the third conversion specification %c is processed. The corresponding argument is 34 again, so once again a quotation mark is printed like this:

Finally, the rest of the format string is copied unchanged to produce the following output:

;printf(s,34,s,34);}

Here are all the five parts of the output presented next to each other:

main(){char*s=

main(){char*s=%c%s%c;printf(s,34,s,34);}

;printf(s,34,s,34);}

Writing them all out in a single line, we get this:

main(){char*s="main(){char*s=%c%s%c;printf(s,34,s,34);}";printf(s,34,s,34);}

This output matches the source code of the program thus confirming that our program is a quine.

Classic Quine With Terminating Newline

The source code of the classic quine presented above does not terminate with a newline. I found that a little bothersome because I am used to always terminating my source code with a single trailing newline at the end. So I decided to modify that quine a little to ensure that it always ends with a newline. This is the quine I arrived at:

main(){char*s="main(){char*s=%c%s%c;printf(s,34,s,34,10);}%c";printf(s,34,s,34,10);}

Compared to the quine in the previous sections, this one has an additional %c at the end of the formal string and the integer 10 as the corresponding argument to ensure that the output ends with a newline. Here is a demonstration of this quine:

$ echo 'main(){char*s="main(){char*s=%c%s%c;printf(s,34,s,34,10);}%c";printf(s,34,s,34,10);}' > quine.c
$ cc quine.c && ./a.out > out.txt && diff quine.c out.txt
$ cat quine.c
main(){char*s="main(){char*s=%c%s%c;printf(s,34,s,34,10);}%c";printf(s,34,s,34,10);}
$ ./a.out
main(){char*s="main(){char*s=%c%s%c;printf(s,34,s,34,10);}%c";printf(s,34,s,34,10);}

C89 Quine

The classic C quines presented above are written in K&C. They do not conform to the C standard. However, with some modifications to the quines presented above, we can get a quine that conforms to the C89 standard:

#include 
int main(){char*s="#include %cint main(){char*s=%c%s%c;printf(s,10,34,s,34,10);return 0;}%c";printf(s,10,34,s,34,10);return 0;}

Here is a demonstration of this quine:

$ echo '#include 
int main(){char*s="#include %cint main(){char*s=%c%s%c;printf(s,10,34,s,34,10);return 0;}%c";printf(s,10,34,s,34,10);return 0;}' > quine.c
$ cc -std=c89 -Wall -Wextra -pedantic quine.c && ./a.out > out.txt && diff quine.c out.txt
$ cat quine.c
#include 
int main(){char*s="#include %cint main(){char*s=%c%s%c;printf(s,10,34,s,34,10);return 0;}%c";printf(s,10,34,s,34,10);return 0;}
$ ./a.out
#include 
int main(){char*s="#include %cint main(){char*s=%c%s%c;printf(s,10,34,s,34,10);return 0;}%c";printf(s,10,34,s,34,10);return 0;}

Read on website | #c | #programming | #technology | #puzzle

Susam's C Pages

Pointers in K&R

Leap Year Test in K&R

C Standard Terms for Behaviour

Loopy C Puzzle

Integer Underflow

Puzzle

Solutions

Generalisation

Lemmas

Theorems

URL in C

Ternary Operator Puzzle

Clumsy Pointers

Pointer Declarator

Example Code

Further Reading

Stack Overwriting Function

Skipping Over a Function Call

Solution for GCC

Solution for Visual Studio

Conclusion

Big-Endian on Little-Endian

Sequence Points

Code Examples

K&R

C99

Obfuscated Main

C Quine

The Classic Quine

Close Look at the Classic Quine

Classic Quine With Terminating Newline

C89 Quine