get all combinations

When implementing my code as native C without using any AEDesc and AEDescList types (and their associate functions) but simple array of int values, it’s done in 15 ms on an old MacBook Pro from 2011. So the delay in my OSAX is really handling and copying AppleEvent descriptors. But I was cheating, I only swapped and printed them (and ignoring the results). In my second attempt I have created an int ** to lookup performance differences.

The results were that it took 30ms to actually find every permutation for a list of 9 items and copy them to an listed list of arrays (or array of pointer pointers). in that case someone could really do something with the results for later processing. I also tried to find the permutations of a list of 11 integers. It comes back with a list containing almost 40 million permutations (39,916,800 to be excactly) just under 4 seconds.

Anyway, if anyone is interested in the code:

Nigel,

Am I right in assuming you’re using a script object because that’s what you need to do with long AS lists? If so, there’s no need in the ASObjC version. I rewrote it without it, and I think it’s a little bit faster, but I’d be interested in your time:

use AppleScript version "2.3.1"
use scripting additions
use framework "Foundation"

on prmt(l, workArray, permutations)
	-- l is the zero-based index of the leftmost item affected by this iteration
	set r to (workArray's |count|()) - 1
	set m to r - 1
	set n to r - l + 1 -- n is the number of list items affected by this iteration (l thru r)
	
	if (n is 3) then
		-- These six permutations are hard-coded to reduce low-level recursion
		permutations's addObject:(workArray's |copy|())
		
		workArray's exchangeObjectAtIndex:r withObjectAtIndex:m
		permutations's addObject:(workArray's |copy|())
		
		workArray's exchangeObjectAtIndex:r withObjectAtIndex:l
		permutations's addObject:(workArray's |copy|())
		
		workArray's exchangeObjectAtIndex:r withObjectAtIndex:m
		permutations's addObject:(workArray's |copy|())
		
		workArray's exchangeObjectAtIndex:r withObjectAtIndex:l
		permutations's addObject:(workArray's |copy|())
		
		workArray's exchangeObjectAtIndex:r withObjectAtIndex:m
		permutations's addObject:(workArray's |copy|())
	else
		-- Precalculate some values for the repeat
		set lPlus1 to l + 1 -- parameter for next-level recursions
		set nIsEven to (n mod 2 = 0) -- true if n is even
		set x to r -- the default index with which to swap if n is odd
		
		-- Get all permutations of items (l +1) thru r with the current item l
		prmt(lPlus1, workArray, permutations)
		-- Repeat with successive values of item l
		repeat with i from r to lPlus1 by -1
			-- If n is even, swap items l and i, otherwise default to swapping items l and r
			if (nIsEven) then set x to i
			(workArray's exchangeObjectAtIndex:x withObjectAtIndex:l)
			prmt(lPlus1, workArray, permutations)
		end repeat
	end if
end prmt

on allPermutations(theList)
	set permutations to current application's NSMutableArray's array() -- the mutable array we will add to
	set workArray to current application's NSMutableArray's arrayWithArray:theList -- the starting array
	set r to (workArray's |count|()) - 1
	if (r < 2) then
		-- Special-case lists of less than three items
		workArray's addObject:theList
		if (r is 1) then permutations's addObject:(workArray's reverseObjectEnumerator()'s allObjects())
	else
		-- Otherwise use the recursive handler
		prmt(0, workArray, permutations)
	end if
	return permutations as list
end allPermutations

That makes sense. If you modify Nigel’s script to remove the coercion of the final array to an AS list, the script runs in less than half the time. Put another way, the single act of coercing the array of arrays to an AS list accounts for more than 50% of the running time. And assuming that you can’t do that coercion to a descriptor at your end any quicker, that means the best overall result you can achieve is roughly half the time of the ASObjC version, even if you build the array in a nanosecond.

The other question is what someone is going to do with such a list. I mean, rather than say repeating through a long list in AS, it may be quicker to leave the main list as an array, and just coerce each item as you use it. That’s just moving the job elsewhere in code, but it may save time by skipping a large AS repeat loop.

Hi Shane.

Your assumption’s party right. As you probably noticed, my first ASObjC script (post #13) is just a revamp of my right-to-left vanilla script from post #10, with ObjC arrays and methods used instead of vanilla where there are more than two items.

In the vanilla scripts, the script-object-within-a-handler idea offers three advantages (to my way of thinking):

  1. The script object’s properties can be “referenced” to allow faster access to the vanilla list items.
  2. The recursive part of the only-partially recursive process can be contained within the main handler.
  3. The lists and the precalculated r and m values can be held in local properties instead of having to be passed or recalculated with each recursion or held in globals or global properties outside the main handler.

Point 1, as you say, doesn’t apply with the ObjC arrays. Point 2 is a largely a matter of my own personal preference. Point 3 is connected with point 2, but is also relevant with regard to the amount of work the script has to do. Your rewrite passes two extra parameters AND recalculates r and m on every call to the recursive handler. I’d therefore expect it to be a little slower than my script. And it is, on my machine. With nine items, it’s taking about six seconds longer than mine this morning. But mine’s taking a fair bit longer than it did yesterday, so the difference between the scripts’ times may actually be less under optimal conditions.

Now that’s interesting! But the difference is far less dramatic for me. 117 seconds as opposed to 134 (this morning). Still, that’s a lot more than I’d have expected until you mentioned it.

Yes, I found it a bit slower here too, although I got a bit excited when I accidentally left a number off the initial list :slight_smile:

That coercion time is the killer. I found when I wrote ASObjC Runner that with even modest lists of lists, virtually all the time was taken converting AS stuff to Cocoa and vice-versa; the work performed was almost irrelevant most of the time. One of the advantages of having ASObjC everywhere is that it makes it easier to do the conversions only when you need to.

Exactly! Nigel’s code took 45 seconds on my MBP when an AppleScript list is returned but when I return a pointer to the NSMutableArray it took only 40 seconds. That means that AppleScriptObjC bridge will take up to 5 seconds to coerce to a NSMutableArray containing 362,880 items into an AppleScript list. That fits right with my OSAX “problem” because when thinking about it, the AEPutParamDesc does an extra copy of the object meaning it will take two times 5 seconds (and probably some more copying when the event is handled by the event manager).

It’s getting nicer and nicer. :smiley:

But . if you make a list of 362,880, I have to assume that you are going to use that list for something other than just generating it, and then, those 10 seconds, won’t take up that much time proportionally. You can for all I know, execute an Osascript in the background, from a do shell script for all I know.

I have another approach, I generate permutations, or combinations on the fly, paying a penalty with each invocation, I live well with that, at least until I have timed it, because I have a practical need for testing if vectors in matrix are orthogonal to each other, by testing them accordingly to generated combinations. If a test fails, then I must start over from scratch. So I pay a penalty in total time, but saves some time per invocation, and some memory too! I have no idea how I would create a list with 362880 items in Applescript however. Have the boundaries changed?

There are however a lot of things lists with permutations and combinations that are generated up front can be used for. Today I have understood how mikerickson’s algorithm for generating permutations with repeats works, it is pretty clever I think.

This is really a great thread. :slight_smile:

AppleScript allows lists up to 2^31 items, well at least in theory. Because the maximum allowed index is the maximum value of a signed 32-bit integer and negative indexes are not allowed. To create a list with 362880 items, there is no problem you just have to wait 20+ seconds.

Hello DJ.

Thanks for enlightening me, I was dead sure anything above 2^14 was futile. :slight_smile:

20 seconds isn’t a long time, except for when in front of a computer! :smiley:

Hello.

For those interested, here is a generator of r-combinations. There is nothing wrong with Daehls, I just wanted something simpler to understand as the problem intrigued me, so this is probably slower. I found the pseudo code in a buggy! :mad: pdf document somwhere. (Daehls delist handler is something that seems to be a great way to translate items into indicies IMO.)

set comboList to {}
combination(comboList, 3, 6)
listprint(comboList)
(* 
"{1, 2, 3}
{1, 2, 4}
{1, 2, 5}
{1, 2, 6}
{1, 3, 4}
{1, 3, 5}
{1, 3, 6}
{1, 4, 5}
{1, 4, 6}
{1, 5, 6}
{2, 3, 4}
{2, 3, 5}
{2, 3, 6}
{2, 4, 5}
{2, 4, 6}
{2, 5, 6}
{3, 4, 5}
{3, 4, 6}
{3, 5, 6}
{4, 5, 6}"
*)
on combination(combinations, r, n)
	# print the first r combination
	set S to {}
	repeat with i from 1 to r
		set end of S to i
	end repeat
	copy S to end of combinations
	
	repeat with i from 2 to C(n, r)
		set m to r
		set max_val to n
		
		repeat while (((item m) of S) = max_val)
			set m to m - 1
			set max_val to max_val - 1
		end repeat
		# increment the above rightmost element
		set item m of S to (item m of S) + 1
		# all others are the successors: 
		
		repeat with j from (m + 1) to r
			set item j of S to (item (j - 1) of S) + 1
		end repeat
		copy S to end of combinations
	end repeat
end combination


on listprint(theL)
	try
		text 0 of theL
	on error e
		set ofs to offset of "{" in e
		set tmp to text (ofs + 1) thru -3 of e
		tell (a reference to text item delimiters)
			set astid to contents of it
			set contents of it to "}, "
			set newl to text items of tmp
			set contents of it to "}" & return
			set newt to newl as text
			set contents of it to astid
		end tell
		return newt
	end try
end listprint


on C(n, r)
	# Counts the number of r-combinations in a set of n elements.
	# it is also called the binomial coeffecient.
	if n = 0 or n < r or r < 0 then error "C: argument error"
	return (factorial(n) div (factorial(r) * (factorial(n - r))))
end C


on factorial(n)
	if n > 170 then error "Factorial: n too large."
	# Result greater than 1.797693E+308!
	if n < 0 then error "Factorial: n not positive."
	set a to 1
	repeat with i from 1 to n
		set a to a * i
	end repeat
	return a
end factorial

Edit
Made a “fancier” listprint handler. :slight_smile:

Hello.

I have made a handler for generating combinations with repetitions, for completion, since we then have all kinds of handlers here. You’ll have to allow for every element being chosen as many times as there are rooms in your subset.

set comboList to {}

set donuts to {"iced", "jam", "plain", "something completely different"}
set chosen to {0, 0, 0, 0}
# the number of items in the list chosen, must coincide with the n_chosen parameter.

choose(comboList, chosen, 1, 4, 1, 3)
--> 15 (combinations

listprint(comboList)
(*
"{\"iced\", \"iced\", \"iced\", \"iced\"}
{\"iced\", \"iced\", \"iced\", \"jam\"}
{\"iced\", \"iced\", \"iced\", \"plain\"}
{\"iced\", \"iced\", \"jam\", \"jam\"}
{\"iced\", \"iced\", \"jam\", \"plain\"}
{\"iced\", \"iced\", \"plain\", \"plain\"}
{\"iced\", \"jam\", \"jam\", \"jam\"}
{\"iced\", \"jam\", \"jam\", \"plain\"}
{\"iced\", \"jam\", \"plain\", \"plain\"}
{\"iced\", \"plain\", \"plain\", \"plain\"}
{\"jam\", \"jam\", \"jam\", \"jam\"}
{\"jam\", \"jam\", \"jam\", \"plain\"}
{\"jam\", \"jam\", \"plain\", \"plain\"}
{\"jam\", \"plain\", \"plain\", \"plain\"}
{\"plain\", \"plain\", \"plain\", \"plain\"}"

*)

(*

The calculation for the number of combinations with repetitions are:

C( maxtypes+n_chosen-1,n_chosen) or C(n+r-1,r) with other variable names.

*)

log C(3 + 4 - 1, 4)

--> 15



on choose(combinations, got, n_chosen, len, atw, maxtypes)
	global donuts
	
	set tcount to 0
	
	if n_chosen = (len + 1) then
		# log "n_chosen = len "
		if got = 0 then return 1
		set lineout to {}
		repeat with i from 1 to len
			set end of lineout to item (item i of got) of donuts
		end repeat
		copy lineout to end of combinations
		return 1
	end if
	
	repeat with i from atw to maxtypes
		if (got ≠ 0) then
			set item n_chosen of got to i
			set tcount to tcount + choose(combinations, got, (n_chosen + 1), len, i, maxtypes)
		end if
	end repeat
	return tcount
	
end choose

on listprint(theL)
	try
		text 0 of theL
	on error e
		set ofs to offset of "{" in e
		set tmp to text (ofs + 1) thru -3 of e
		tell (a reference to text item delimiters)
			set astid to contents of it
			set contents of it to "}, "
			set newl to text items of tmp
			set contents of it to "}" & return
			set newt to newl as text
			set contents of it to astid
		end tell
		return newt
	end try
end listprint

on C(n, r)
	# Counts the number of r-combinations in a set of n elements.
	# it is also called the binomial coeffecient.
	if n = 0 or n < r or r < 0 then error "C: argument error"
	return (factorial(n) div (factorial(r) * (factorial(n - r))))
end C


on factorial(n)
	if n > 170 then error "Factorial: n too large."
	# Result greater than 1.797693E+308!
	if n < 0 then error "Factorial: n not positive."
	set a to 1
	repeat with i from 1 to n
		set a to a * i
	end repeat
	return a
end factorial

Here’s an interesting one.

By means of concatenation, this vanilla version of my script prebuilds a list of the required length to hold the permutations. When used as a Library in exactly the same way as the ASObjC version in post #13, it executes in a mere 5 seconds or so as opposed to the ASObjC version’s 134 seconds! However, with both versions, it then takes nearly two and a half minutes to do anything with the result, such as setting a variable to it or getting its class!

on allPermutations(theList)
	
	script o
		property workList : missing value
		property permutations : {}
		property r : count theList -- index of the rightmost item of workList.
		property m : r - 1 -- index of the middle item of the last three of workList.
		property p : 1 -- index into the permutations list.
		
		on prmt(l)
			-- l is the index of the leftmost item affected by this iteration
			set n to r - l + 1 -- n is the number of list items affected by this iteration (l thru r)
			
			if (n is 3) then
				-- These six permutations are hard-coded to reduce low-level recursion
				copy my workList to item p of my permutations
				
				set {v1, v2, v3} to items l thru r of my workList
				
				set item m of my workList to v3
				set item r of my workList to v2
				copy my workList to item (p + 1) of my permutations
				
				set item l of my workList to v2
				set item r of my workList to v1
				copy my workList to item (p + 2) of my permutations
				
				set item m of my workList to v1
				set item r of my workList to v3
				copy my workList to item (p + 3) of my permutations
				
				set item l of my workList to v3
				set item r of my workList to v2
				copy my workList to item (p + 4) of my permutations
				
				set item m of my workList to v2
				set item r of my workList to v1
				copy my workList to item (p + 5) of my permutations
				
				set my p to p + 6
			else
				-- Precalculate some values for the repeat
				set lPlus1 to l + 1 -- parameter for next-level recursions
				set nIsEven to (n mod 2 = 0) -- true if n is even
				set x to r -- the default index with which to swap if n is odd
				
				-- Get all permutations of items (l +1) thru r with the current item l
				prmt(lPlus1)
				-- Repeat with successive values of item l
				repeat with i from r to lPlus1 by -1
					-- If n is even, swap items l and i, otherwise default to swapping items l and r
					if (nIsEven) then set x to i
					tell item x of my workList
						set item x of my workList to item l of my workList
						set item l of my workList to it
					end tell
					prmt(lPlus1)
				end repeat
			end if
		end prmt
		
	end script
	
	if (o's r < 3) then
		-- Special-case lists of less than three items
		copy theList to the beginning of o's permutations
		if (o's r is 2) then set the end of o's permutations to the reverse of theList
	else
		-- Otherwise use the recursive handler
		copy theList to o's workList
		-- Prebuild a list of the required length (factorial of theList's length) to hold the permutations.
		set mv to missing value
		set mvList to {mv, mv, mv, mv, mv, mv} -- Minimum length is 6 with a 3-item input list.
		set mvList2 to mvList
		repeat with i from 3 to (o's r) - 1
			repeat i times
				set mvList2 to mvList2 & mvList
			end repeat
			set mvList to mvList2
		end repeat
		set o's permutations to mvList
		o's prmt(1)
	end if
	
	return o's permutations
	
end allPermutations

Hello.

Your handlers are as eminent as well always. I guess you just stay with AsobjC, when time is precarious enough.

I figure that the data, the permuations are used for, are maybe less demanding with regards to copying them in, a little at a time from vanilla Applescript.

It is very comforting to know that such handlers exists, DJ’s handler from that other thread is also a nice one, for practical purposes. This is such a great and fun thread! :slight_smile:

I’m not sure what you’re timing, but yes, most stuff would be much quicker if we didn’t have to await results :wink:

It’s the time taken to execute the permutations code with a nine-item input list versus the time taken to execute the code and set a variable to the result. The difference is about two to two-and-a-half minutes! The same’s true for getting information from the result, such as its class or the value of one of its items.

Here’s the timing script I’ve been using in AppleScript Editor. Only the timing result appears in the Result pane, so the time it would take to decompile all the permutations for display isn’t a factor:

use permuter : script "Permuter 4" -- The vanilla script in post #49.
use scripting additions

----------
-- "Timer.scpt" is a script on my own computer. The current script won't work without it (or a similar one).
tell me to set timer to (load script file ((path to scripts folder from user domain as text) & "Libraries:Timer.scpt"))
tell timer to StartTimer()
----------
permuter's allPermutations({1, 2, 3, 4, 5, 6, 7, 8, 9}) -- Timed code.
----------
tell timer to set t to readTimer()
----------
(*
Sample results this morning vary between 4.964 and 6.342 seconds.
But changing the timed code to:
set l to permuter's allPermutations({1, 2, 3, 4, 5, 6, 7, 8, 9})
--> 120.436 to 132.929 seconds.
*)

The situation’s similar when timing the ASObjC version, but with longer times:

use permuter : script "Permuter 3" -- The ASObjC script in post #13.
use scripting additions

----------
-- "Timer.scpt" is a script on my own computer. The current script won't work without it (or a similar one).
tell me to set timer to (load script file ((path to scripts folder from user domain as text) & "Libraries:Timer.scpt"))
tell timer to StartTimer()
----------
permuter's allPermutations({1, 2, 3, 4, 5, 6, 7, 8, 9}) -- Timed code.
----------
tell timer to set t to readTimer()
----------
(*
Sample results this morning vary between 131.031 and 137.428 seconds.
But changing the timed code to:
set l to permuter's allPermutations({1, 2, 3, 4, 5, 6, 7, 8, 9})
--> 257.602 to 260.286 seconds.
*)

Interesting.

Let me add a few comments about ASObjC and this thread. One of the reasons I was initially reluctant to bother with an ASObjC version is that I didn’t think it would offer any advantage. And ultimately it didn’t, except by bending the test in its favor with very large lists – so large, they are probably practically unusable in AS.

Where ASObjC is likely to offer real speed gains, I think, is in cases where it can replace algorithms. Things like using a sort method, or using classes like sets to make lists unique. Although this might look a bit like cheating, and at times be downright dull, I think it’s very much in the spirit of AppleScript and things like the whose clause: real speed gains are generally made by pushing the work out to be done somewhere else. And while we’re all happy to see clever algorithms implemented in Nigel’s hands, less code means less chance for bugs.

Hello.

I felt that an r-permutation with repeats was missing here, that is permutations with r elements, from a set of n elements, with repeating elements.

There isn’t much to say about the implementation, other than that it works like an odometer basically. If we make permutations with a size of r elements, over a set of n elements, with repeats, then we must generate n^r combinations.

I have also left out zero as a valid element, and I intend to use this with daehls delist handler, for translating back and forth.

set thePerms to {}

r_permute_reps(3, 5, thePerms)
listprint(thePerms)
(*
"{1, 1}
{1, 2}
{1, 3}
{1, 4}
{1, 5}
{2, 1}
{2, 2}
{2, 3}
{2, 4}
{2, 5}
{3, 1}
{3, 2}
{3, 3}
{3, 4}
{3, 5}
{4, 1}
{4, 2}
{4, 3}
{4, 4}
{4, 5}
{5, 1}
{5, 2}
{5, 3}
{5, 4}
{5, 5}"
*)
on r_permute_reps(n, r, perm_list)
	
	set oldperm to {}
	repeat with i from 1 to r
		-- initialize oldperm
		set end of oldperm to 1
	end repeat
	copy oldperm to end of perm_list
	repeat with i from 1 to (n ^ r - 1)
		if ((item r) of oldperm < n) then
			set (item r) of oldperm to ((item r) of oldperm) + 1
		else
			set (item r) of oldperm to 1
			repeat with j from (r - 1) to 1 by -1
				if item j of oldperm < n then
					set item j of oldperm to (item j of oldperm) + 1
					exit repeat
				else
					set item j of oldperm to 1
				end if
			end repeat
		end if
		copy oldperm to end of perm_list
		
	end repeat
end r_permute_reps

on listprint(theL)
	try
		text 0 of theL
	on error e
		set ofs to offset of "{" in e
		set tmp to text (ofs + 1) thru -3 of e
		tell (a reference to text item delimiters)
			set astid to contents of it
			set contents of it to "}, "
			set newl to text items of tmp
			set contents of it to "}" & return
			set newt to newl as text
			set contents of it to astid
		end tell
		return newt
	end try
end listprint

Edit

Added some code that was forgotten so it works for r’s >2. :confused:

Hello.

Here is yet another handler, this time one that returns the powerset of a set. A power set, is the combination of all sets that can be made from the original set.

I have used handlers that I have posted earlier in this thread, and included them here, so it works as it is is posted.

 
on factorial(n)
	if n > 170 then error "Factorial: n too large."
	# Result greater than 1.797693E+308!
	if n < 0 then error "Factorial: n not positive."
	set a to 1
	repeat with i from 1 to n
		set a to a * i
	end repeat
	return a
end factorial

on c(n, r)
	# Counts the number of r-combinations in a set of n elements.
	# it is also called the binomial coeffecient.
	if n = 0 or n < r or r < 0 then error "C: argument error"
	return (factorial(n) div (factorial(r) * (factorial(n - r))))
end c

on combination(combinations, r, n)
	# print the first r combination
	set s to {}
	repeat with i from 1 to r
		set end of s to i
	end repeat
	copy s to end of combinations
	
	repeat with i from 2 to my c(n, r)
		set m to r
		set max_val to n
		
		repeat while (((item m) of s) = max_val)
			set m to m - 1
			set max_val to max_val - 1
		end repeat
		# increment the above rightmost element
		set item m of s to (item m of s) + 1
		# all others are the successors: 
		
		repeat with j from (m + 1) to r
			set item j of s to (item (j - 1) of s) + 1
		end repeat
		copy s to end of combinations
	end repeat
end combination

on createPowerSet from origSet
	-- assumes the set is made up of consecutive integers, starting at 1.
	-- i.e. {1,2,3} is a valid set, {2,5,6} is not. You may make your own mapping of sorts, of course.
	-- Number of sets in the powerset is 2 ^(length of origset)
	set powSet to {origSet}
	set origLen to (length of origSet)
	repeat with i from (origLen - 1) to 1 by -1
		combination(powSet, i, origLen)
	end repeat
	set end of powSet to {}
	return powSet
end createPowerSet


createPowerSet from {1, 2, 3}
-- {{1, 2, 3}, {1, 2}, {1, 3}, {2, 3}, {1}, {2}, {3}, {}}

could you decode the above characters?

≥
≤

thanks

Sorry. They’re due to a change in MacScripter’s BBS software a year or two ago.

What gets the same results as daehl is to change ≥ to (greater than or equal to) and ≤ to (less than or equal to).

1 Like