Outpost Universe Forums

Projects & Development => Outpost 2 Programming & Development => Topic started by: Hooman on October 20, 2005, 08:50:55 PM

Title: Coding Challenges
Post by: Hooman on October 20, 2005, 08:50:55 PM
I just thought I'd post a few things for people to think about. Most of these will be C++ related, but not all.


1) Output a program that outputs all of it's source code. The compiled program must output the source code with no input files. This source code must then be capable of being compiled back into the program you just ran.

Basically, if you send someone the exe, they can run it, get the source, compile it, and send the exe back on to someone else who can then do the same thing.

As an added rule to make sure people do this as is intended, you are also not allowed to tell the compiler to insert your source code into the the exe. (Kinda like an incbin "filename" for asm, which just dumps the contents of that file into the exe.)


2) In C++, the result of:
i = ++i + 1;
is ...?


3) In C++, the result of writing the following code:
cout << "Hello World"[2] << endl;
will be ...?


4) Getting the compiler to do work for you
Let's say you have some constant c, and you want to output the value of sin( c ). How can you get the compiler to output this value without calling runtime functions, and without having to precompute this value yourself. (In other words, don't get out a calculator, figure out the answer, and just cout the answer). The compiler must do the calculation for you. (Yes, the compiler, not your compiled program). You may make use of whatever features the compiler supports if needed. The ultimate goal is to have the compiler output code along the lines of:
Code: [Select]
MOV x, d
where d = sin( c ), given input code along the lines of:
Code: [Select]
x = sin( c );


More to come later....
Title: Coding Challenges
Post by: HaXtOr on October 20, 2005, 10:10:08 PM
asm questions would be more fun
Title: Coding Challenges
Post by: Hooman on October 21, 2005, 07:36:48 PM
5) In intel assembly, how would you get the instruction pointer into the EAX register?
 
Title: Coding Challenges
Post by: Eddy-B on October 27, 2005, 05:57:09 PM
Quote
5) In intel assembly, how would you get the instruction pointer into the EAX register?
come'on coders .. don't let ME solve everything!!

it got something to do with a CALL and a POP .. thats all i'm saying .. now get going and solve this!!
Title: Coding Challenges
Post by: Hooman on October 27, 2005, 06:08:28 PM
Looks like I have a few challenges to repost.

6) How do you compute the absolute value without using a jump/branch? For assembly, assume the input is in EAX, and the output is also in EAX. No jumps (branches/calls, etc.) are allowed.

7) Reset the least significant 1 bit in a 32 bit value to 0. If the value is 0 (all 32 bits are 0) then the output should be 0. The must be done brance free. (Again, no jumps/branches/calls). If you answer this in a high level language, it must be done in a way that's guaranteed not to compile to jumps/branches/calls.
Ex. 1: (for 8 bits)
Input: 01001101
Output: 01001100

Ex. 2: (for 8 bits)
Input: 01011000
Output: 01010000

Edit: This last challenge is meant to be read as: The least significant bit that is set to a 1 needs to be reset to 0. As in, the first 1 bit from the right. Not the LSB.
 
Title: Coding Challenges
Post by: Eddy-B on October 29, 2005, 08:17:08 PM
Quote
7) Reset the least significant 1 bit in a 32 bit value to 0. If the value is 0 (all 32 bits are 0) then the output should be 0. The must be done brance free. (Again, no jumps/branches/calls). If you answer this in a high level language, it must be done in a way that's guaranteed not to compile to jumps/branches/calls.
Ex. 1: (for 8 bits)
Input: 01001101
Output: 01001100

Ex. 2: (for 8 bits)
Input: 01011000
Output: 01010000

Edit: This last challenge is meant to be read as: The least significant bit that is set to a 1 needs to be reset to 0. As in, the first 1 bit from the right. Not the LSB.
I'm not sure this is the most efficient code, but then again, you dont want me to use jumps or branches, so how efficient can you get ?
input and output are memory addresses.

Code: [Select]
; initialization
MOV CX,0
MOV EAX,input

; bit processing - repeat these lines 32 times  
SHR EAX,1
RCL BL,1
MOV CH,BL
AND BL,CL
SHR BL,1
RCR output,1
OR CL,CH

A little explaination:
SHR - shift out a single bit -> will be stored in Carry Flag
RCL - put the Carry Flag into BL
MOV - store this bit value into CH, we'll need it later on
AND - depends on the bit from the previous process; if it was 1, nothing changes, if it was 0, BL is set to 0
SHR - again, shift out the bit, store it in Carry
RCR - and insert it into the output field
OR - if bit from the previous process was already 1 nothing happens; otherwise it stores the bit from this last operation into CL
 
Title: Coding Challenges
Post by: Hooman on October 29, 2005, 10:48:16 PM
Yes, I do believe that will work. Nice answer.

Quote
I'm not sure this is the most efficient code, but then again, you dont want me to use jumps or branches, so how efficient can you get ?
It's not. :P
You can do this in one line of C++ or 2 lines of assembly (maybe 3 depending on the instructions you choose to use, or if you have a different architecture). I doubt it can be done better than that though.

So, a question now would be, how is this done better?  :)
 
Title: Coding Challenges
Post by: Eddy-B on October 30, 2005, 05:50:41 AM
Quote
So, a question now would be, how is this done better?
and i leave that up to you.
The question was to DO it, and not to do it efficiently, or fast, or small code or whatever.
You want small, efficient code, use branches! :P


as for your other coding thing:
Code: [Select]
MOV	EAX,input
MOV ECX,0

PUSH EAX
RCL EAX,1
RCL ECX,1
POP EAX
PUSH ECX
NEG ECX
XOR EAX,ECX
POP ECX
ADD EAX,ECX

MOV output,EAX
Title: Coding Challenges
Post by: Eddy-B on October 30, 2005, 05:59:07 AM
Code: [Select]
char *s[]={
"#include <stdio.h>",
"#include <string.h>",
"char *translate(char *s)",
"{",
" int x=0;",
" while (s[x])",
" {",
"  switch (s[x])",
"  {",
"   case 38: s[x]=34; break;",
"   case 47: s[x]=92; break;",
"   case 36: s[x]=37; break;",
"  }",
"  ++x;",
" }",
" return s;",
"}",
"void main()",
"{",
" int i=-1,j=-1;",
" char ss[80];",
" printf(&char *s[]={/n&);",
" while (s[++i]!=0) printf(&/&$s/&,/n&,s[i]);",
" printf(&0/n};/n&);",
" while (++j<i) { strcpy(ss,s[j]); printf(&$s/n&,translate(ss)); }",
"}",
0
};
#include <stdio.h>
#include <string.h>
char *translate(char *s)
{
 int x=0;
 while (s[x])
 {
  switch (s[x])
  {
   case 38: s[x]=34; break;
   case 47: s[x]=92; break;
   case 36: s[x]=37; break;
  }
  ++x;
 }
 return s;
}
void main()
{
 int i=-1,j=-1;
 char ss[80];
 printf("char *s[]={\n");
 while (s[++i]!=0) printf("\"%s\",\n",s[i]);
 printf("0\n};\n");
 while (++j<i) { strcpy(ss,s[j]); printf("%s\n",translate(ss)); }
}
Reposted this, cause it was removed when the server went down. Also edited it to accomodate Hooman's complaint about the Microsoft Compiler crashing on my code :P
Title: Coding Challenges
Post by: Eddy-B on October 30, 2005, 06:01:13 AM
Quote
5) In intel assembly, how would you get the instruction pointer into the EAX register?
Doesn't anybody know the answer to this simple one ?

CALL this subroutine to get EIP:
Quote
POP EAX
PUSH EAX
RET
Title: Coding Challenges
Post by: Hooman on October 30, 2005, 05:02:46 PM
Well, looks like you've answered 1, 5, 6, 7.

Quote
QUOTE 
So, a question now would be, how is this done better? 

and i leave that up to you.
The question was to DO it, and not to do it efficiently, or fast, or small code or whatever.
You want small, efficient code, use branches!

About the question, yes true, but....
The point of branch free code is sorta to be efficient. The idea is that branches are terrible for pipelined processors (which all modern CPUs are). So if you can code something equivalently, but avoid branches, you can often get a gain in efficiency.



So, since those problems are solved, I'd like to up the ante a little.  :ph34r:

8) Solve challenge 1, but this time without overwriting any data that you've defined or allocated space for (you being the important part here). Relating this to the solution of problem 1, the string array that is modified and reprinted will be in violation of the rules for challenge #8. Also, the loop index which is defined and modified by that solution will also be in violation of the rules for challenge #8. Now, that's not to say your program can't make use of variables. You just can't *modify* any that YOU have defined (this includes both data you've defined, and the data pointed to by pointers you've defined). Calling a library function like printf will use some of it's own local variables, but this should be of no concern and will not be considered against the rules. You must not call a library function that writes to a buffer you've defined. Ex:
Code: [Select]
char buff[64];
snprintf(buff, 64, "Writing data to a buffer you've defined!"); // Against the rules!

printf("Hello World!"); // Nothing wrong with this
char string1 = "This is a string";
printf("%s", string1); // Nothing wrong with this either
As an added difficulty (maybe?), I'd like to add a rule against using using library functions that allocate a buffer and fill it before returning the pointer to you. That is, something along the lines of:
Code: [Select]
char *string2 = CopyString(string1); // Against the rules
string2[1] = "\n"; // Clearly against the above rules about writing to data through a pointer you've defined.

char *string3 = CopyAndTranslate(string1); // Also against the rules

Basically, all this is to say any memory being written to at run time MUST be done by standard C++ library functions, and you must gain no direct access to such memory. (All such memory has it's use confined to the standard C++ library function that uses it, and no pointers to such memory are returned).

Note: This challenge is likely much harder than the original, and will definately require some rethinking for a solution.



9) Solve challenge #6 with at most 5 assembly instructions. (You *should* need less than this).

10) Solve challegne #7 with at most 5 assembly instructions. (You *should* need less than this).



*Following is a repost of a challenge that was lost when the forums went down*
(Let's see if anyone remembers the answer :))
11) In C++, the result of writing the following code:
cout << 2["Hello World"] << endl;
will be ...?
 
Title: Coding Challenges
Post by: Eddy-B on October 31, 2005, 02:39:49 AM
ok - then let someone else solve these.
8) simple: just change the one i've written up there^ adding the new rules hooman created
9) takes a little bit of work on the code
10) i'd like to see YOU do that hooman, with just 5 opcodes, you cannot process 32 bits. therefor i think it be impossible without using jumps.
11) well .. i DO remember this C 'quirk'  It looks outright stupid the way you put it, but i guess ur right about it .. lol


oh, branches are inefficient ?  i need to find the pentium opcode list that includes processorcycles, to believe that.  I never got into that stuff since after 1990, but i do remember from earlier assembly coding with the 6502 a branch took 2 or 3 cycles (depending in the jump was taken or not), as compared to a Load Accu Immediate, taking also 2 cycles, or a Load Accu from memory (16bit) taking 4 cycles. This just as an example.
Title: Coding Challenges
Post by: Hooman on October 31, 2005, 02:56:06 AM
Pipelined processors don't handle branches well. It wasn't the same back in the day of dinosaurs!  :P  J/k. But serious, once the pentium was introduced, that's when this becomes valid. You'll also notice that the pentium manuals was about where they stopped including the number of clock cycles each instruction took. You can't really include these things when they vary greatly based on pipeline stalls/flushes. The idea is, the next few instructions would have already been fetched from memory by the time the processor realised the instruction was a jump, and where it's supposed to jump to. (Lag is especially bad if it needs to calculate the address using offset constants, etc.) Thus, those extra instruction that have been read in must be cleared, and so you essential get bubbles of NOPs flowing through the pipeline.

As for working on 32 bits at a time....
ADD EAX, EAX
AND EAX, EDX
NOT EAX
etc.
All those certainly work on 32 bits at a time. You just have to be a little more creative.  :o

And don't worry, I'm not posting impossible challenges. I haven't posted anything I don't already know the answer to.


As for #8. I think it needs a fair bit of rethinking. Probably not so easy to mod the code you already have. But hey, maybe you know of some ingenious way I haven't thought of.


So, do you really want me to answer #10? Or should I leave it an open question for now?
 
Title: Coding Challenges
Post by: Eddy-B on October 31, 2005, 12:04:14 PM
Quote
As for #8. I think it needs a fair bit of rethinking. Probably not so easy to mod the code you already have. But hey, maybe you know of some ingenious way I haven't thought of.
as i said: modify it just slightly.. i should've done this earlier. I was thinking about it, but i thought it was too much work, for just a simple coding test, that you designed. But since you put so much weight on it - i did it anyway: change all text into bytes :P  It saves the 'translate' function altogether, and sticks to your rules. Plus it circumvents the MS compiler "bug" as i call it :P
Code: [Select]
char *s="\x23\x69\x6E\x63\x6C\x75\x64\x65\x20\x3C\x73\x74\x64\x69\x6F\x2E\x68\x3E\x0A\x76\x6F\x69\x64\x20\x6D\x61\x69\x6E\x28\x29\x0A\x7B\x0A\x20\x69\x6E\x74\x20\x69\x3D\x2D\x31\x3B\x0A\x20\x70\x72\x69\x6E\x74\x66\x28\x22\x63\x68\x61\x72\x20\x2A\x73\x3D\x5C\x22\x22\x29\x3B\x0A\x20\x77\x68\x69\x6C\x65\x20\x28\x73\x5B\x2B\x2B\x69\x5D\x21\x3D\x30\x29\x20\x70\x72\x69\x6E\x74\x66\x28\x22\x5C\x5C\x78\x25\x30\x32\x58\x22\x2C\x73\x5B\x69\x5D\x29\x3B\x0A\x20\x70\x72\x69\x6E\x74\x66\x28\x22\x5C\x22\x3B\x5C\x6E\x25\x73\x22\x2C\x73\x29\x3B\x0A\x7D";
#include <stdio.h>
void main()
{
 int i=-1;
 printf("char *s=\"");
 while (s[++i]!=0) printf("\\x%02X",s[i]);
 printf("\";\n%s",s);
}
oh, that first line char *s ... until just before the #include is ONE line, do not put linebreaks in it! Oh, and i wouldn't call it "ingenious" :lol:

Quote
So, do you really want me to answer #10? Or should I leave it an open question for now?
(...)
well, if you can do better: show me .. or let someone else do it. I'm going to continue with more important stuff - like my Renegades project, and my AI project.
Title: Coding Challenges
Post by: Eddy-B on October 31, 2005, 12:24:07 PM
Quote
9) Solve challenge #6 with at most 5 assembly instructions. (You *should* need less than this).
Code: [Select]
int absolute(int input)
{
  _asm
  {
    MOV EAX,input
    SHR EAX,31
    SUB input,EAX
    NEG EAX
    XOR input,EAX
  };
  return input;
}
Title: Coding Challenges
Post by: Hooman on October 31, 2005, 10:27:35 PM
For challenge #8, you defined int i = -1, and used i++. So technically, it breaks the rules.  :D

Challenge #9 is solved. That was nice. Here is my solution in case you were wondering:
Code: [Select]
CDQ
SUB EAX, EDX
XOR EAX, EDX

Oh, and also, my solution for #5 was essentially:
Code: [Select]
CALL dummyLabel
dummyLabel:
POP EAX
It's also nice to note that since CALL instruction are relative (to the end of the instruction/start of the next instruction), the constant built into the CALL instruction is 0. So even if you wanted to avoid using a label, you could just force the offset part to be 0.


As for #10, you're probably gonna laugh at how simple it actually is. In C++, you can code the following:
Code: [Select]
i = i & (i-1);
Which of course if written in assembly might have been:
Code: [Select]
LEA EDX, [EAX - 1]
AND EAX, EDX
Or, if you object to the LEA (perhaps adapting code to a different architecture?)
Code: [Select]
MOV EDX, EAX
DEC EDX
AND EAX, EDX
I love that one.
 (thumbsup)
 
Title: Coding Challenges
Post by: RockNavator on November 01, 2005, 03:55:38 AM
The simple ones dont seem to be getting any attention so Im gonna just try to get them outa the way...

2)i=++i+1; == i=i+2; where i is an  int

3)"Hello World" is a char[] so the subscript of 2 would designated the third element of the array... the letter l (el)

11)2["Hello World"]=="Hello World"[2] due to the way in which pointer multiplication works.  
Title: Coding Challenges
Post by: Hooman on November 01, 2005, 04:00:07 AM
Ohh, fresh meat!  B)

Answer 3 is correct. Good job.

Answer 11 is correct. Again, good job. (Note: this one was answered before but the post was lost.)

Answer 2.... at best, maybe. At worst, nope. Try again.  :D
Title: Coding Challenges
Post by: Eddy-B on November 01, 2005, 01:24:59 PM
Quote
Answer 11 is correct. Again, good job. (Note: this one was answered before but the post was lost.)
..and he remembered not 100%, it is addition, not multiplication :P

hooman: i was wondering WHY you wanted that weird thing with the LS-1-B. But it was just a way of telling us, "hey i found something neat, can anyone reproduce it?"  It was fun while it lasted!  (thumbsup)

[edit] oh, about answer 2) , rock is correct, and hooman: USE BRACKETS for crying outloud. It not only makes things more readable, it also prevents the compiler from doing things you don't expect:

it should read: i=(++i)+1;    and not  i=++(i+1) which is invalid, coz you can't increase an expression.

equals: i+=2;  or more to the ground:  i=i+2;
 
Title: Coding Challenges
Post by: Hooman on November 01, 2005, 08:26:59 PM
So far no correct answer for #2 has been given.  :ph34r:

As for why...? Well, let's just say you're gonna hate me when you hear it. Challenge #2 is indeed a trick question. Can anyone see what's wrong with it?  :o

It isn't a question about bracketing. The question was posted as intended. There should be no brackets, and it doesn't matter how you break up whitespace (or remove it completely) it works the same.
 
Title: Coding Challenges
Post by: LupusShearhart on November 05, 2005, 10:40:57 PM
The answer to number 2:

Assuming i is an integer initialized to 0:

i = 2.

Now let's see what happens when we put i into a loop of 6.

i would be 12.

Each time i = ++i + 1; passes, i is incremented by 2.

So saying i = ++i + 1; would be similar to saying i = i + 2.
Title: Coding Challenges
Post by: Hooman on November 05, 2005, 11:18:38 PM
Wow hey. I wasn't expecting an answer from you. How's it going? :)

Unfortuneatly, I'd have to say that's not quite the right answer. (Although, it is what I expected most people to think, and is the logical thing to assume, and likely what your comiler will do). I remind people this one is a trick question.

So far, 3 people have given me that answer, so I feel I should hint at what is wrong. Think about the order in which the updates to the variable i are performed. (*cough* *cough* if you're not sure about this, it might be an idea to check into some C++ specs *cough* *cough*).
 
Title: Coding Challenges
Post by: LupusShearhart on November 06, 2005, 01:25:47 AM
Actually, I plugged the code directly into MSVC++ and got those outputs, then the same outputs from a compiled Java program, and just traced the outputs per iteration. It all came to the same conclusion of it being i = i + 2. And yeah - I've not been around very much compliments of school and work.

PS: The only other logical way of thinking of how that could come out would be how  Fortran would run it:

i = i+i+1. Though that's not how C++ runs it.

So the iterations, where i starts at zero would be:

i = 0 + 0 + 1
i = 1 + 1 + 1
i = 3 + 3 + 1
i = 7 + 7 + 1
i = 15 + 15 + 1
i = 31 + 31 + 1
etc.
etc.


But if you run i = ++i + 1; in C++, it incriments by two per iteration of the loop. Try it. Put this in a main for a console app and put a line break (debug mark) on the output line of code to show the integer value.

Code: [Select]
int i=0;
int z=0;
for(z=0;z<6;z++)
{
i = ++i + 1;
}
printf ("The return is: %1d \n", i);
return 0;

Guaranteed, the output would be 12.

So the following:
Code: [Select]
#include "stdafx.h"

int main(int argc, char* argv[])
{
int i=0;
int z=0;
int w=0;
for(z=0;z<1000;z++)
{
  i = ++i + 1;
}
printf ("The return is: %1d \n", i);
i=0;
for (w=0;w<1000;w++)
{
  i= i + 2;
}
printf ("The return is: %1d \n", i);
return 0;
}
returns the following:
Code: [Select]
The return is: 2000
The return is: 2000
Press any key to continue

This is a coding challenge, after all. Any free C compiler will produce the same result. Even code compiled in Java or under a linux machine will still produce the same result.
(Reason Edited: Adding code.)
Title: Coding Challenges
Post by: Hooman on November 06, 2005, 03:30:28 AM
Well, I guess I should spill this one before people kill me.

Quote
This is a coding challenge, after all. Any free C compiler will produce the same result. Even code compiled in Java or under a linux machine will still produce the same result.
Not necessarily.

So the answer to #2 then....
Quote
2) In C++, the result of:
i = ++i + 1;
is ...?
Undefined!  :P

While looking at some article on the C++ specification, I noticed the following line:
Quote
i = ++i + 1;                    // the behavior is unspecified

Which of course begs some reasoning. I know.
Quote
4 Except  where noted, the order of evaluation of operands of individual
  operators and subexpressions of individual expressions, and the  order
  in  which side effects take place, is unspecified.  Between the previ-
  ous and next sequence point a scalar  object  shall  have  its  stored
  value  modified at most once by the evaluation of an expression.  Fur-
  thermore, the prior value shall be  accessed  only  to  determine  the
  value  to  be stored.  The requirements of this paragraph shall be met
  for each allowable ordering of the subexpressions of  a  full  expres-
  sion; otherwise the behavior is undefined.
Another example which more clearly illustrates the problem that was given is:
Quote
i = v[i++];                     // the behavior is unspecified
Does the i++ change the value of i before or after the assignment?

Similarly, does the ++i change the value of i before or after the assignment in i = ++i + 1; ? If it change it before, the value is overwritten by the assignment and the effect is i = i + 2. But it is apparently perfectly legal for the compiler to have the storage from ++i after the assignment, leading to code that works as i = i + 1 (the assignment result is discarded).


So yeah, I believe you in what you say about MSVC. I know. I've tried it myself. I got the same result you did.


I hope nobody wants to kill me over that one.  :unsure: I don't think any of ther others were trick questions like that.
 
Title: Coding Challenges
Post by: Eddy-B on November 06, 2005, 07:19:37 AM
/me kills Hooman :P

I know i have found problems with the ++ operator LONG before you were even coding (i think). When using the object inspector of Borland's C-Builder, you can move your mouse over a variable.  When moving over this expressing (any expression using ++) the debugger access "++i", and so in inspecting the variable INCREASING its contents. When moving your mouse out, and over it again, you'd see and that the value got increased. An anooyance to say the least.

But to say the result of a line like i = ++i + 1 is undefined is rediculous.

Look at the order of things:
C reads expressions from left-to-right, honouring the precedence of the operators. * and / go before + and -.  Unary operators like *, ++ and -- go before even the binary operators, like + and -. If there's no rule  (if they are equally leveled operators), C will simply execute them from left-to-right.

The first thing C will do is the (++i). It reads in i and increases it, storing the value back into i. To be more specific: INC [DWORD] will be the result: just 1 instruction.
The next step will be to take the result of the previous step, and do the next step:  +1, again increasing the value of i.
The last step is the assignment =, and the final result will be stored back again into i.

Precedence works fine, even a sick line like  ++i++ increases i by 2, and if used as an array value, like you have:  v[++i++] it uses i after the first INC, and before the second INC!


And, Hooman, please: these are NOT "C challanges", these are interpretation issues. If ALL C compilers do the job the exact same way, but the specification by Kernighan and Ritchie might be written in such a way, it could opun a discussion about what they meant, it doesn't mean  that all compilers are wrong, or produce undefined results.

 
Title: Coding Challenges
Post by: Hooman on November 06, 2005, 04:49:32 PM
I told you people would want to kill me for that one.  :P

Consider this.
Code: [Select]
j = i = 5;
Here, i = 5 is an expression that evaluates to 5. Essentially, = is an operator with a side effect of assignment. Same idea with either ++i or i++. They are both operators with a side effect of assignment. Now consider:
Code: [Select]
i = i++ + ++ i;
You have 3 operators here, and according to that paragraph I quoted, when evaluating the expression (remember that the whole line is an expression that evaluates to the result getting assigned to i on the left) the order of the side effects of those 3 operators is undefined.

(There was some discussion in a compiler course I took on what that expression did. People came to the conclusion that it was undefined. Or maybe it was j = i++ + ++i; ?)

Here's another one for you:
Code: [Select]
i = 0;
j = i++ + i++ + i++;
What is the value of j? I just tried this with MSVC. When I tried it, I got 0.
Now try this:
Code: [Select]
i = 0;
j = ++i + ++i + ++i;
What is the value of j? I just tried this with MSVC, and I got 7. How does that work? So I took a look at the compiled code and here is what it did.

It incremented the value of i, and then stored it. (++i + ++i + ++i).
Then it read i from memory and incremented it again. (++i + ++i + ++i).
Then it read i from memory, and added it to i from memory. (++i + ++i + ++i).
Then it read i from memory and incremented it again. (++i + ++i + ++i).
Then it added that value to the sum of the previous 2. (++i + ++i + ++i).
Then it performed the assignment to j.

Note how the effect of the first ++i was felt by the second ++i. Then the effect of both the ++i was felt by the third ++i. Ok, so this seems to follow exactly the method you've described so far. But is that surprising? No. MSVC obviously follows a particular method here. This seems to be the most obvious way of doing things, so no real surprise they've done it this way.


Ok, so now try:
Code: [Select]
i = 0;
j = i++ + ++i + ++i;
The result is 4. Try doing this one in a left to right order with ++ (pre or post) taking prescendence over +. You should get 5. Ok, now don't increment the i++ (post) until after j is assigned. You get 3. Ok, so now evaluate ++i, and then add that to i++, and then add the third term ++i. And then finish the increment from the ++i. You get 4. Go look at the assembly. This was the order in which is was done. It's not quite so obvious how that follows your set of rules now. The ++i on the RIGHT side of the first + took precendence over the i++ on the LEFT side of it. Clearly not left to right ordering here. It's also not right to left ordering, since that would give you 6.

Ok, now we try:
Code: [Select]
i = 0;
j = ++i + i++ + ++i;
The result is again 4. Taking a look at the assembly, the ++i on the LEFT is evaluated first. Then the i++ is evaluated (remember, it evaluates to the value before the increment) and added to that result. Then the final ++i is evaluated. The result is assigned to j. And then finally, the effect of i++ is done. Ok, so in this case, it was left to right ordering. Or was it? It could equally well have been right to left ordering here and you'd get the same result. Not really any way to tell here.

Next up:
Code: [Select]
i = 0;
j = i++ + i++ + ++i;
The result is 1. In this case, the (i++ + i++) was performed first. Simple left to right ordering in this case.


Code: [Select]
j = ++i++;
Produces a compile error. "needs l-value". As does
Code: [Select]
j = ++(i++);
(Not too surprising)
Code: [Select]
j = (++i)++;
Results in 1.


Also, interestingly enough.
Code: [Select]
i = 0;
j = (++i + ++i) + ++i;
Evaluates to 7.
Code: [Select]
i = 0;
j = ++i + (++i + ++i);
Evaluates to 9.
Does that last part bother anyone else here? On closer inspection, the 3 increments are performed before any of the additions on that last example. I guess I can understand that.


As a final set of examples:
Code: [Select]
i = 0;
j = ++i = ++i + ++i;
i = 0;
j = ++i = (++i + ++i);
i = 0;
j = (++i = ++i) + ++i;
The results are 6, 6, and 6. (But the values of i are 6, 6 and 3).


So yeah, MSVC has probably followed a certain standard (even if the C++ specs don't specify one here). It doesn't mean one particular compiler is going to produce undefined results. Of course it won't. Not unless it's using random numbers during the compilation process. (I'm sure that would bother most people). But it does mean that a different compiler (that follows the C++ specs) CAN produce different results for the same code. It doesn't mean it will, especially not if there is an obvious way of doing things. But as for the standard that MSVC uses, the best I can determine is the following. The ++i's take effect on the i in a left to right order, and if an expression can be evaluated in a left to right order following the precendence rules, then it is. Otherwise, the ++i's will continue taking effect and the expressions are evaluated afterwards. The assignments seem to take place after the ++i's, and then the i++'s seem to take effect after the assignments.

Ay any rate, I don't buy your simple left to right ordering theory. Not without extra rules to clear up some issues. Nor do I trust any expressions where a variable is used more than once on the same line involving ++ (pre), ++ (post) or =.
Title: Coding Challenges
Post by: LupusShearhart on November 06, 2005, 08:09:02 PM
Well - in theory, it's undefined, but that's merely theory. I thought coding was s'pposed to be applied/practical applications? *shrug* Oh well, no harm done.

'sides, the only way I'd want to kill ya is if you completely destroyed everything that is Outpost... so... :-P
Title: Coding Challenges
Post by: Hooman on November 06, 2005, 08:45:44 PM
Quote
'sides, the only way I'd want to kill ya is if you completely destroyed everything that is Outpost... so... :-P

Lol. Well, good to know.  :rolleyes:

But as for how every compiler that I've ever seen does it, you 3 people all provided correct answers. Although, I heard from Eddy earlier today that one of my examples from my last post gave a different answer in Tubro C than it did in MSVC.  :lol:  I'd be curious to know a little more about this.


Btw, there are currently only 2 unsolved challenges left. I think they were #4 and #11? (The one about making the compiler calculate constants for you, and the "improved"/harder version of #1). Would people like some more challenges to be posted?  :)  (I'll avoid the "undefined" answers from now on I think).
 
Title: Coding Challenges
Post by: Eddy-B on November 16, 2005, 10:38:20 AM
okay, since you really like code with NO jumps/branches.. try this one:
create a string -> HEX converter routine without using branches

example:
input = "2A"  (i.e. 0x32 0x41)
output = 0x2A

a 2 digit converter is enough, since a 4 digit string simply uses the routine twice etc..

have fun!

PS: as an added 'difficulty' it should be able to accept uppercase and/or lowercase chars, so an inut of "2a" should also give a correct answer.
Title: Coding Challenges
Post by: Hooman on November 16, 2005, 07:04:53 PM
Ohh, I like it. :) I'll try it out after I get some school work done.  <_<

Does it matter what happens if a non hex string is given? Undefined output? Just plain doesn't crash? Zero output?
 
Title: Coding Challenges
Post by: Eddy-B on November 17, 2005, 11:01:09 AM
oh, 1 extra challenge: it should be able to handle 1-digit inputs as well (in that case the 2nd "digit" is the terminating zero).

It is okay if a non-hex string gives undetermined output.
Title: Coding Challenges
Post by: Hooman on November 17, 2005, 08:32:39 PM
Here you go Eddy. Here is some sample code where two hex digits from a string are converted to an int. (I could have used char, but the original code used cout and it displayed the character rather than the value. I also decided not to assume anything about the calling convention and let the compiler handle loading parameters and retruning the return value.)

If a NULL string pointer is passed in, it should crash, but otherwise it shouldn't. At worst, it should return garbage results if the input wasn't in the proper range. I added some checking for both the first and second characters being NULL. I also made sure that if only a single digit is passed in (before the NULL terminator), it'll return the number in the ones-place, and not in the sixteens place. Not sure if that's what you wanted, but it seemed to be the most useful way of coding it. Although, not correcting this would have made for smaller code, as would not caring if the first character was a NULL or not and reading the second one anyways. I checked against this to avoid memory access errors.


Code: [Select]
#include "stdio.h"


// "0" 48 (0x30) 00110000  -  "9" 57  (0x39) 00111001
// "A" 65 (0x41) 01000001  -  "Z" 90  (0x5A) 01011010  -  "F" 70  (0x46) 01000110
// "a" 97 (0x61) 01100001  -  "z" 122 (0x7A) 01111010  -  "f" 102 (0x66) 01100110


int HexConvert(char *hexString)
{
int retValue;

__asm
{
  // Load the string address
  MOV EDX, hexString
  XOR EAX, EAX    // Clear return value to 0

  // Convert the first digit
  MOV AL, [EDX]    // Load the character
  // Convert letters to 10+
  MOV CL, AL
  SHL CL, 1
  SAR CL, 8
  SUB AL, CL
  SHL CL, 3
  SUB AL, CL
  // Cut off non numeric bits
  AND AL, 0x0F
  // At this point in AL, numebrs 0-9 should produce 0-9 and letters A-F and a-f should produce 10-15.


  // Process a second digit (if the first wasn't a null terminator)
  MOV CL, [EDX]    // Load the character
  CMP CL, 0
  SETNE CL
  MOVSX ECX, CL
  ADD EDX, ECX
  SHL EAX, 12     // Store previous result in upper nibble of AH

  // Convert the first digit
  MOV AL, [EDX]    // Load the character
  // Convert letters to 10+
  MOV CL, AL
  SHL CL, 1
  SAR CL, 8
  SUB AL, CL
  SHL CL, 3
  SUB AL, CL
  // Cut off non numeric bits
  AND AL, 0x0F
  // At this point in AL, numebrs 0-9 should produce 0-9 and letters A-F and a-f should produce 10-15.

  // Ignore result if terminating NULL and use a single digit
  MOV CL, [EDX]    // Load the character
  CMP CL, 0
  SETE CL
  SHL CL, 2
  SHR EAX, CL     // (If CL == 0) Move upper nibble of AH to lower nibble, lower nibble of AL is 0


  // Combine nibblein AH with nibble in AL
  OR AL, AH     // Value is now in AL
  XOR AH, AH     // Clear AH
  MOV retValue, EAX
}

return retValue;
}

void main()
{
char *text = "2A";
printf("%X\n", HexConvert(text));
}
Title: Coding Challenges
Post by: Eddy-B on November 18, 2005, 03:04:16 AM
i'll proof-read ur code during my lunch break.
As for the "what i wanted":

input="2A" -> output = 0x2A
input="A\0" -> output = 0x0A  (1 digit with terminating NULL)

the first digit cannot be NULL, since you'd have an empty string that way!
if you DO input "\0A" i think my routine may even still produce 0x0A as an output, but i'd have to check that :)

btw: my routine is 14 lines of asm (not counting the push and pops to save registers, sice you also do not save them, nor the RET). Anyway: see if you can scrape off some bytes.
Oh, and i written it in asm altogether, not in C - but that shouldn't matter..

[edit] It seemed i was too soon with that reply :blush: the check for 1 digit i did in my main, and does use a branch, so that part is excluded from the 14 lines, but i'll work it in :P
Title: Coding Challenges
Post by: Hooman on November 18, 2005, 02:19:24 PM
Ahh yes, I thought that check was a little odd to put into asm without branches. If it was just converting a single byte to it's hex equivalent (1 nibble), the core of what I wrote was only about 8 lines I think. My checks for things like 0 terminators and whatnot infalted it quite a bit. Still, I'm sure there was room for improvement before that anyways. I'll take another look sometime later on.
 
Title: Coding Challenges
Post by: Eddy-B on November 18, 2005, 05:11:28 PM
I worked out the 1-digit check (branch free), adding 10 lines of asm code. I did find some odd compiling from MSVC btw: my asm compiles to a much smaller code using Turbo C..

btw: SAR CL, 8  is illegal by itself, and is translated into 8 time SAR CL,1
---------------

to extend to this challenge (the original idea for this).. i created a com-executable that accepts a 1 or 2-digit HEX input (from the command line) and sets the screen mode to whatever you supply. The code on its own is 78 bytes, and i've added another 80 bytes of "usage" text, to make a total of 160 bytes MODE.COM :D
---------------

my coding challenge #2:
I got a nice little homework assignment a dozen years ago: create a multiplication "program", that accepts two 8bit arguments, and of course a 16bit product. I think we discussed something like this before.  The original question i got was to do it in Z80 language, but it would work best in 8086 for now.
It has to be fast, and short; jumps/branches are allowed, but no advanced opcodes, and do not use MUL. Factors are supplied in AL and DL (default for __fastcall) and are signed. Resulting product should be returned in AX.
Code: [Select]
short Multiply(char factor1,char factor2)

challenge #3:
same thing, but a division: it takes a 8bit divisor in AL and 8bit divider DL. The result: quotient is an 8bit "integer" in AL and an 8bit remainder (uhm.. well, just put it in AH for now). Again all of those are signed.
Code: [Select]
short Divide(char divisor,char divider)
Example: 11/5 = 2, remainder 1  =>  
Code: [Select]
short result=Divide(11,5);
char quotient=result & 0x00FF;
char remainder=result >> 8;

It doesn't matter much - it's just a way of communicating back to c.
Raw code for both challenges to be written as an *.asm source file, using basic 8086 instructions (no DIV and MUL).

Any of the other people who know a little about coding wanna give it a shot ? (coz i know hooman can do this - it's no big deal for hm)
Title: Coding Challenges
Post by: Hooman on November 18, 2005, 07:51:04 PM
Ahh yes. I think I there is a post around here somewhere with the Z80 solution for multiplication. I think I'll leave this one for other people.

I'm gonna go look into that SAR thingy. That's not something I noticed.  :blink:



Edit: I get a 3 byte opcode for SAR CL, 8. No problems there for me.
 
Title: Coding Challenges
Post by: Eddy-B on November 19, 2005, 03:28:20 PM
Must be a new Pentium code then. All i know is shifts and rotates etc are only 1 bit at a time, i.e. SHL AX,1  and when u want to shift more (upto 256 positions) you should use CL as an index:  MOV CL,8   SAR AX,CL

then again, i never got around to learning the new instruction set of the pentium. It seems both win32-based borland and MS compilers compile it to that opcode. It's just that i like to use my DOS-based compiler for small code, as it tends to create much smaller code :)
Title: Coding Challenges
Post by: Hooman on November 19, 2005, 06:15:12 PM
Ahh yes, I remember some comment about not being able to shift by more than one bit on older machines unless the number of bits to shift by was stored in CL. Now though, I just write the code and see if the assembly complains. They've done a fair bit of work since the pentium to make the instruction set more orthogonal. Now, if it seems like you might be able to do it, you usually can. Although, if you check the actual encoding of the opcodes, that might vary more than you expected since you're using new added on instructions.

I'll get back to cleaning up that code as soon as a finish an assignment for school. Gah, I hate compositions, they take me so very long.  :angry:  
Title: Coding Challenges
Post by: Eddy-B on November 25, 2005, 03:40:49 PM
So... ?
anyone want to give it a try?
Title: Coding Challenges
Post by: Hooman on November 27, 2005, 09:55:01 PM
Not too sure what you're referring to.

Anyways, drowning in school work. Will get back to this in... a month?  :( Ok, not quite that long. I've largely forgotten about cleaning up that code, and I don't have time for it at the moment.

There are still 4 unanswered challenges by my last count.


There was that one about outputting your own source code without modifying any variables you defined.

There was one about getting the compiler to calculate constants for you. Feel free to use compiler specific extensions to fine tune this one and get it working right. I've heard some compilers are more fussy about compile time, and other are more fussy about optimizing.

There are two from Eddy. One about multiplying and one about dividing.


Did I miss any? Or do you not feel that hex converted is really answered? (I know it could be better).
Title: Coding Challenges
Post by: Eddy-B on November 29, 2005, 01:21:38 PM
the hex-convert is ok. was just for fun anywayz.
The multiply & divide: use your preschool math abilities :)  (1 word: long-division ,, or are that 2 words?)
Title: Coding Challenges
Post by: Hooman on October 20, 2006, 03:28:01 AM
Well, it seems nobody was planning on answering the most recent question about a program that outputs it's own source code. (Without modifying any variables you have access to. That is, any memory that is written, is done so deep within standard C functions that you can't see and don't have access to, such as printf or cout).


Well, I was cleaning up my harddrive and deleting old C++ projects. I thought I'd post this code before deleting it.

Code: [Select]
#include <stdio.h>

void main()
{
char *escape = "\\"; char *enter = "\n"; char *quote = "\"";
char *string = "#include <stdio.h>%s%svoid main()%s{%schar *escape = %s%s%s%s; char *enter = %s%sn%s; char *quote = %s%s%s%s;%schar *string = %s%s%s;%sprintf(string, enter, enter, enter, enter, quote, escape, escape, quote, quote, escape, quote, quote, escape, quote, quote, enter, quote, string, quote, enter, enter, enter);%s}%s";
printf(string, enter, enter, enter, enter, quote, escape, escape, quote, quote, escape, quote, quote, escape, quote, quote, enter, quote, string, quote, enter, enter, enter);
}

I know it looks a little sick, but it works. Every variables I defined remains constant. The only memory writing should occur deep within printf to variables I have no control ot access to.
 
Title: Coding Challenges
Post by: BlackBox on October 20, 2006, 01:53:49 PM
Quote
The only memory writing should occur deep within printf to variables I have no control ot access to.
Just to be picky since it would be fun :P

You (well, the compiler) still has to write memory on the stack in order to push parameters to printf()!

i.e
Code: [Select]
push ...
push ...
.. etc for all other parameters ..
call _printf

And of course, the call will modify the stack itself (storing the return address).

Not to mention there will be plenty of register modifications (if you wanted to be picky a register is a form of 'memory' albeit inside the CPU). But yes, I assume you meant real RAM; pushing stuff to the stack would write the memory :P

Not to sound harsh but to sound picky for fun's sake :P
Title: Coding Challenges
Post by: Hooman on October 21, 2006, 06:05:25 PM
Lol!

Alright, fine be picky. Run it on a SPARC. :P

(It usues the register wheel to pass parameters).



Besides, the fact that the compiler might insert instructions to puss stuff on the stack, is pretty much equivalent to printf modifying memory deep within. You have no control over it, and can't make it do anything special. Such as a cheap hack way to increment a variable. Besides, if I said function like printf are allowed to modify memory, I guess I'm sorta implying that you can actually call printf. Besides, that memory is only being modified by the call to printf. :P If you dno't call printf, that wouldn't happen.



 
Title: Coding Challenges
Post by: Hooman on October 22, 2006, 01:35:14 AM
Well why not add a few new challenges.  B)


#12.
Suppose b is a fixed odd integer, and suppose c is some multiple of b.
(Treat b as a constant, and c as a variable).
We want to compute c/b.
That is, we want to divide by an odd constant b, where we know beforehand the remainder after division will be 0.
Do this (for arbitrary odd b ) in a way that is likely faster than using a division instruction.


Bonus: Do this so it is correct whether we are using signed or unsigned integers.

Super Bonus: Extend this method to the case where b is even.
(This may require additional instructions to compute. But it should still be done with fast instructions so that it is still likely a gain on many computers.)



Ex:
A simple case: Let b = -1. Then c/b = c*b.
On most computers multiplication will be faster than division. Hence this would be an acceptable solution.
Title: Coding Challenges
Post by: Hooman on October 22, 2006, 02:29:19 AM
#13 Base Converter
Write a converter to express numbers in a different base b.
For each base b, make the following output:
A. A table listing the representations in base b of the numbers from 0 to 15.
B. A table listing the usual representation of numbers for the first 16 encodings of base b numbers. (Sorta like the reverse of table A).

Challenge: Do this for the following bases:
I. b = -2
II. b = -1 + i (where i = sqrt(-1))


Note: Both systems are expressible in binary. (i.e., the with the digits 0 and 1).


Have fun.  :lol:
Title: Coding Challenges
Post by: jwhiteheadcc on October 22, 2006, 04:25:18 AM
Quote
#13 Base Converter
Write a converter to express numbers in a different base b.
For each base b, make the following output:
A. A table listing the representations in base b of the numbers from 0 to 15.
B. A table listing the usual representation of numbers for the first 16 encodings of base b numbers. (Sorta like the reverse of table A).

Challenge: Do this for the following bases:
I. b = -2
II. b = -1 + i (where i = sqrt(-1))


Note: Both systems are expressible in binary. (i.e., the with the digits 0 and 1).


Have fun.  :lol:
#13 isn't so bad...  Hint:  Instead of say 1,10,100,1000... you'de have -1,10,-100,1000... and thus some negative decimal numbers become positive in base -10 and also the other way around.
I won't show the even simpler way for binary numbers to/from -2 base because I want others to find it!  :)


For the complex number, you just do the same but use a coordinate pair such as (3,4) in (x,y) coordinate form.
The radius of the base is 5 and the angle is the arctangent of 4/3rds.
Create an inverse function to go both ways.  :)

Now to win his challenge, you'll have to create an actual program that achieves that for both (-2,0) and (-1,1).  <--- note that the negative can be considered 2 at an angle of 180 degrees.


GL Ya'll!
 (thumbsup)
Title: Coding Challenges
Post by: Hooman on October 22, 2006, 02:26:22 PM
I would just like to repeat myself on something.

Quote
Note: Both systems are expressible in binary. (i.e., the with the digits 0 and 1).

That is, in each system, you can express all the numbers as a bit vector, where each bit corresponds to powers of the base. The base -2 case is easy to see, but not so much for the other one.

Ex: (base -1 + i)
b_(n-1) (-1 + i)^(n-1) + ... + b_1 (-1 + i)^1 + b_0 (-1 + i)^0
where each of the b_i's is either 0 or 1.

So the bit strings would be as follows:
00: 0 * (-1 + i) + 0 * (1) = 0
01: 0 * (-1 + i) + 1 * (1) = 1
10: 1 * (-1 + i) + 0 * (1) = -1 + i
11: 1 * (-1 + i) + 1 * (1) = i
...


Really though, some justification should be provided as to why every number can be expressed where each b_i is either 0 or 1.


Btw, if you know how to solve it, just solve it. Don't wait for other people. :P
 
Title: Coding Challenges
Post by: jwhiteheadcc on October 24, 2006, 05:06:52 AM
<in reply to PM> I didn't see that you meant for 1/b to be a constant.  This should have been obvious even as tired as I was.

If 1/b=p/q then we can set q to a power of a common base.
For example, if 1/b=0.00123456 then p=1.23456 and q=10^-3, if using scientific notation.
Multiplying an integer using q=power of 2 is also possible.
Notice that I said nothing about rather b is odd since being even is just going to shift q to the left.  Normally you don't store q, but instead it's power.  This is called floating point.   :heh:

c/b is then c*p/q=c*p shifted to the left (or right) by the log_2 of q.
If q=2^n, then c/b=(c*p) << n.

Since i and -1 are both integers, then it's realitively easy to just say that if I have a coordinate system (x,y), that I can map it to (x',y').

For example, to display -5 in base i-1:
1)  We first convert -5 to binary and get -101.
2)  We find the mapping for (x,y) <-> (x',y').
(x',y') is i-1 for one's place in -101.
(x,y) is 1 for one's place in -101.
It seems that mapping is (x',y')=x(i-1)=xi-x and (x,y)=(x',y')/(i-1).
3)  -101 becomes (100+00-1,-100+00-1)=(11,-101)=11 - 101i.

Solving this for a base of -2 seems a lot easier.  :)
Title: Coding Challenges
Post by: Hooman on October 26, 2006, 12:05:54 AM
I'm not sure I fully understand what you're trying to say there, so bear with me if I get this wrong.

My first concern with your answer is precision. If you're using floating point, then there is a certain precision required to ensure the resulting answer is correct. It's certainly possible that the floating point registers on the CPU you're using provide enough precision that any integer will end up being correct (after rounding).

Quote
1/b=0.00123456 then p=1.23456 and q=10^-3
Note that this is subject to rounding errors. The value of 1/b might not be expressible in such an easy format. Think about repeating decimals, or ones that are long enough to overflow the precision of the registers. Try expression 1/3 in binary for example.
1/3 = 1/4 + 1/16 + 1/64 + 1/256 + 1/1024 + 1/4096 + ...

It takes quite a few bits actually.

I guess I'd need a bit of insurance that the result will be correct with rounded values. You could show this by giving maximum error bounds for your method and show what precision is needed to ensure correct results. (Can be specific for a certain architecture, such as 32 bit ints need so many bits of floating point precision).

But then I have to question if a floating point multiply is faster than an integer divide. Generally multiply is faster than divide, but integers also tend to be faster than floating point. Of course a lot of modern machines can probably do them in about just as many clock cycles now.


At any rate it's not quite the answer I was thinking of, but the idea is fairly close. The way I know is never slower on any machine I've ever seen (although possibly not faster on some). As a bit of a hint, you don't actually need floating point to do this.  :ph34r:

Psst, computer don't really have a true "integer" data type (integers are infinite, but computer memory is finite). Rather they have equivalence classes of integers modulo some value. That is, mod 2^W, where W is the word size, such as 32 bits. Think wrap around.



I have no idea what you're doing for this second challenge, but it looks like complete nonsense to me at the moment. It sounds like you're saying -5 is 11 - 101i in base -1 + i. But isn't that kinda like saying -5 is not an integer because it has an imaginary part to it?

As for the ordered pairs, I don't think you're using them in a format that is correct for the challenge. If a number x is expressed in base b with n digits, then it can be written as a string (excuse the delimiters for clarity) <x_n-1, ..., x_2, x_1, x_0>. The value of this number is then x = x_n-1 * b^(n-1) + ... + x_2 * b^2 + x_1 * b + x_0.

Remember last post I gave the bit representations of 0, 1, -1 + i, and i, in the base -1 + i format. They are the first four 2 bit strings.

Quote
So the bit strings would be as follows:
00: 0 * (-1 + i) + 0 * (1) = 0
01: 0 * (-1 + i) + 1 * (1) = 1
10: 1 * (-1 + i) + 0 * (1) = -1 + i
11: 1 * (-1 + i) + 1 * (1) = i

Completing this table is fairly easy using the formula above. The inverse table is a little harder however.
 
Title: Coding Challenges
Post by: Hooman on November 17, 2006, 04:57:02 PM
I'm gonna prod this thread a little since it seems to have been forgotten about.

I'm gonna answer the last bonus part to question #12. I figure if the bonus hasn't been answered by now then it's no longer up for grabs.  :o



The main part of question #12 is to find a way to divide by some odd constant b, when we know the number we are dividing is a multiple of b. The bonus is to extend this method to handle the case when b is even.

Suppose b is even. Then b = (2^k) * (b'), where b' is odd, and k >= 1. That is, we take out all the factors of 2 from b, until there are none left.

Now, a/b = a / ((2^k)*(b')).

Note: a/b = (a / (2^k)) / (b') = (a / (b')) / (2^k).
What that last line says, is that we may either divide by 2^k first, and then by b', or we can divide by b' first, and then by 2^k. They are equivalent.

Also note that since b' is odd, and if b divides a where b = (2^k) * (b'), then both b' divides a and 2^k divides a. Thus in either case, the remainder after division is 0. What this is saying, is that the answer to question #12 can be applied to do the division by b'. It's also easy to see that we can divide by 2^k without worrying about inexact division.

I'm sure you've all figured by now, shift 'a' right k places to perform the division by 2^k. Use the method that's the answer to question #12 to perform the division by b'. We can do these in any order by the above. We now have a/b.

The total cost of performing this, is the cost of performing the division by b', which according to the challenge rules is no slower than a divide instruction, plus the cost of performing the shift instruction. It might end up being more than the cost of a divide instruction, but not by much. On older architectures where the divide instruction is much slower than most other instructions, this should be faster.

I should mention that to perform the shift to divide by 2^k, we need to know if we are working with signed or unsigned values (for the value 'a' anyways). The first bonus is showing the other part, division by an odd b can be done correctly without knowing if signed or unsigned values are being used. Note that the bonus was for the original question, and not for the second bonus asked later, where b can be even.



I will provide hints to the answer upon request (but in this thread only, not by PM).



And Whitehead, sure you don't want to answer at least part of question #13? I'm pretty sure you've got the answer to most of it.