[Search for users] [Overall Top Noters] [List of all Conferences] [Download this site]

Conference rusure::math

Title:	Mathematics at DEC

Moderator:	RUSURE::EDP

Created:	Mon Feb 03 1986
Last Modified:	Fri Jun 06 1997
Last Successful Update:	Fri Jun 06 1997
Number of topics:	2083
Total number of notes:	14613

2003.0. "simple combination question" by SNOFS1::JONESCHRIS (Chris Jones) Mon Oct 16 1995 22:18

Hello out there,
	I have a simple combinational problem, that must have a simple and 
elegant solution, but I can't work it out nicely, hence I am asking the question 
here.  If this is not the place to ask, please advise me accordingly.

	The problem is I am trying to work outthe algorithm for a simple voting 
function.  ie givn a population of 'X', how many combinations are available if 
'N' or grrater voted for or against this.
	ie for a popuation of 5 and a vote count of 1, there are 5 combinations. 
If the vote count was raised to 2, there are now 10 combinations possible, etc.

	What is the algorithm to work out this function, or an algorithm to give 
all of the combinations (probably more useful to me)!

	Many thanks

T.R	Title	User	Personal Name	Date	Lines
2003.1		AUSSIE::GARSON	achtentachtig kacheltjes	`Tue Oct 17 1995 07:05`	24
	re .0 C(n,r) represents the number of ways of selecting a set of r objects from a pool of n. Note that I use the word set here to emphasise that the order of the objects is unimportant, as is applicable to your problem. Hence if there are X voters and N of them voted for you then there are C(X,N) ways of selecting the set of voters that voted for you. n! C(n,r) = -------- r!(n-r)! where k! = 1.2.3...k. If avoiding overflow is important then this is not the best way actually to calculate C(n,r). If n is really large so that an approximation is acceptable then there are some simple approximation formulae. I am not familiar with a good algorithm to generate all combinations. What sort of size for X and N are we talking?
2003.2	width of problem	SNOFS1::JONESCHRIS	Chris Jones	`Tue Oct 17 1995 20:47`	4
	The number of voters in this problem are typicaly less than 50, and the vote acceptance variable may be upto 48! In this domain, there may be a computational overflow. IS there any other way of simply solving this?
2003.3		AUSSIE::GARSON	achtentachtig kacheltjes	`Wed Oct 18 1995 06:51`	9
	re .2 If you don't require an exact integer answer and for number of voters less than 50 then you could use G-float double precision to get a reasonable value for the number of combinations without having to be careful about overflow. Actually enumerating the combinations would be infeasible e.g. for 50 voters and 25 supporters.
2003.4		RTL::HANEK		`Wed Oct 18 1995 11:03`	26
	I assume that when your talking about computational overflow, you mean intermediate overflow. That is, in the computation of n! C(n,k) = -------- k!(n-k)! At least one of n!, k! or (n-k)! overflows, but the quotient doesn't. If that is the case, a reasonable accurate result can be obtained using the lgamma(x) funtion. The gamma function is (for discussion purposes) a generalization of factorials to non-integer values. For integer values gamma(n+1) = n! The lgamma function is the natural log of the gamma function. lgamma(x) is frequently used because gamma(x) is such a rapidly increasing function it overflows most finite representations and because usually the real quantity of interest is a ratio of gamma functions, like C(n,k). So one might compute C(n,k) by C(n,k) = exp(log(n!/(k!(n-k)!))) = exp(log(n!) - log(k!) - log((n-k)!)) = exp(lgamma(n+1) - lgamma(k+1) - lgamma(n-k)) There are very good implementations of the lgamma function on all of the DEC Alpha platforms.
2003.5	lisp or extended precision package	RANGER::BRADLEY	Chuck Bradley	`Wed Oct 18 1995 16:37`	8
	re .2 i have not done things like this for a while, so this is not promised to work, but ask lisp to evaluate it. or use one of the extended precision packages mentioned in this conference.
2003.6	some simple integer arithmetic might be enough	HANNAH::OSMAN	see HANNAH::IGLOO$:[OSMAN]ERIC.VT240	`Wed Oct 18 1995 20:15`	51
	Well, consider X=n!/(r!(n-r)!). If n=50, the largest value for x is probably near r=25. For that case we have X=504948...252423...2 / (252423...2 252423...2) The 25! cancels on top and bottom, giving X=504948...26 / 252423...2 The 50 cancels with the 252: X=4948...26 / 2423...3 27 cancels with 3 and 9. 28 cancels with 4 and 8. 30 cancels with 6 and 5. X=494847464544434241403938373635343332312926 / 2423222120191817161514131211107 Useful reductions now are 48/24, 46/23, 44/22, 42/21, 40/20, 38/19, 36/18, 34/17, 32/16 : X=492472452432412392372352332312926 / 1514131211107 Now we apply 39/13, 35/7, 33/11: X=492472452432412323725232312926 / 15141210 Now 35/15, 322/12: X=4924724524324122372312926 / 1410 Take one of the 7's from the 49 combined with a 2, and cancel with the 14, then take a 5 from the 45 with another 2, to cancel with the 10: X=747924324122372312926 Combining the 2's: X=7479434137323129*26 Not quite small enough for a vax (or even my alpha/vms dcl [why not???] ) /Eric
2003.7		AUSSIE::GARSON	achtentachtig kacheltjes	`Wed Oct 18 1995 21:44`	14
	re .6 >Well, consider X=n!/(r!(n-r)!). If n=50, the largest value for x is >probably near r=25. For that case we have For a given even n C(n,r) has a maximum at r=n/2. (Think Pascal's triangle). My calculator gives C(50,25) as around 1.3E14. >Not quite small enough for a vax (or even my alpha/vms dcl [why not???] ) Hence as you observe too large for 32-bit integer arithmetic. Alpha VMS is initially a straight port of VAX VMS hence DCL can't do 64-bit arithmetic even on Alpha. Maybe later... Followups in VMSNOTES. (-:
2003.8	Example using 64-bit integers	WIBBIN::NOYCE	EV5 issues 4 instructions per meter	`Thu Oct 19 1995 13:33`	32
	For small N, it's not too hard to take advantage of the cancellation idea in a program: $ type comb.for integer8 n,k,i,m accept ,n,k if (k.gt.n-k) k=n-k m = 1 do i = 1,k m = m(n-i+1)/i enddo print ,m end $ fort/check=overflow comb $ link comb $ run comb 50,25 126410606437752 $ run comb 60,30 118264581564861424 $ run comb 70,21 385439532530137800 $ run comb 70,22 %SYSTEM-F-HPARITH, high performance arithmetic trap, Imask=00010000, Fmask=00000000, summary=40, PC=8048C798, PS=0000001B -SYSTEM-F-INTOVF, arithmetic trap, integer overflow at PC=8048C798, PS=0000001B %TRACE-F-NOMSG, Message number 0009804C Image Name Module Name Routine Name Line Number rel PC abs PC 0 8048C798 8048C798 0 82FBDA50 82FBDA50
2003.9		HANNAH::OSMAN	see HANNAH::IGLOO$:[OSMAN]ERIC.VT240	`Thu Oct 19 1995 13:55`	5
	That's quite a compression of all the cancelation stuff I discussed ! I'm still trying to figure out how your program works. Thanks. /Eric
2003.10		HANNAH::OSMAN	see HANNAH::IGLOO$:[OSMAN]ERIC.VT240	`Thu Oct 19 1995 14:18`	47
	The heart of the program (there's always one line that's the heart of any computer program right?) is: integer8 n,k,i,m accept ,n,k if (k.gt.n-k) k=n-k m = 1 do i = 1,k ==> m = m(n-i+1)/i enddo print ,m end Let's suppose we're doing (50,25). The generated value is: 150/149/248/347/4...*26/25 This value is computed in order from left to right, so we need to convince ourselves (well, I need to convince myself!) that none of those divisions will ever be non-integer. Certainly the /2 is o.k. since either n or n-1 is even. If I were being sloppy, I could convince myself that /3 is o.k. since n or n-1 or n-2 is divisible by 3. But I'm not quite that sloppy, so I need to worry that perhaps whichever of those 3 was divisible by 3 no longer is, once we did the /2. But on further thought, I realize that the /2 can't "damage" the divisible-by-3-ness of a number. But what about the /4 ? Certainly either n or n-1 or n-2 or n-3 is divisible by 4, but suppose it was the n or n-1 that was divisible by 4. One of those was already divided by 2, so can we be certain that what's left is divisible by 4 ? Then of course there's the whole question of why the entire computation just happens to be equivalent to (n,k), even if all the divisibility concerns are satisfied. I guess this slow mathematician here will need to think about all of this a bit more while the whole thing must be obvious to the rest of you... Thanks. /Eric
2003.11		HANNAH::OSMAN	see HANNAH::IGLOO$:[OSMAN]ERIC.VT240	`Thu Oct 19 1995 14:24`	29
	o.k. I'm not yet convinced that all those immediate operations will be exactly integers, but I think I've figured out why if they are integers that the expression is equivalent to (n,k). We have 504948...1 -------------------------------- 252423...1 * 252423...1 The first set cancels, leaving us with 504948...26 -------------- 252423...1 which we can reverse the bottom of: 504948...26 -------------- 123...25 which can of course be written as 150/149/248/3...*26/25 but this doesn't answer the first question yet about guaranteeing that each intermediate result is an exact integer... /Eric
2003.12	You can't be certain of anything until you prove it, but:	EVMS::HALLYB	Fish have no concept of fire	`Thu Oct 19 1995 18:12`	16
	> But what about the /4 ? Certainly either n or n-1 or n-2 or n-3 is divisible > by 4, but suppose it was the n or n-1 that was divisible by 4. One of those > was already divided by 2, so can we be certain that what's left is divisible > by 4 ? Because no matter which one was divisible by 4 you still have another one divisible by 2. E.g., if n-1 is divisible by 4 then n-3 is divisible by 2 (but not 4). Similarly when trying to divide by 6: n, n-1, n-2, n-3, n-4, n-5; Whichever one is divisible by six (say it's n-4) leaves you another that's divisible by 3 (in this case n-1), one that's divisible by 4 (either n-2 or n) and one that's divisible by 2 (the "4" candidate that wasn't divisible by 4). And so on. John
2003.13		RUSURE::EDP	Always mount a scratch monkey.	`Thu Oct 19 1995 18:14`	12
	Re .9, .10, .11: At step i, the value of m is (n,i). The number of combinations is known to be an integer (particularly because it can be defined inductively by adding previous values, which are also integers). -- edp Public key fingerprint: 8e ad 63 61 ba 0c 26 86 32 0a 7d 28 db e7 6f 75 To find PGP, read note 2688.4 in Humane::IBMPC_Shareware.
2003.14	non-inductive proof	JOBURG::BUCHANAN	dodging lions and wasting time	`Fri Oct 27 1995 13:56`	19
2003.15	nits	AUSSIE::GARSON	achtentachtig kacheltjes	`Sat Oct 28 1995 05:20`	12
	re .14 > depending on whether rem(a,c) + rem(b,c) < or > 1. depending on whether rem(a,c) + rem(b,c) < or >= 1. This is also a somewhat non-obvious use of the name "rem". Given the names floor() and ceil() perhaps the right function name is height(). (-: Equivalently expressed... depending on whether a mod c + b mod c < or >= c.