Add document on Solo, Couple, and Merge

author: Marshall Lochbaum <mwlochbaum@gmail.com> 2020-08-16 22:14:53 -0400
committer: Marshall Lochbaum <mwlochbaum@gmail.com> 2020-08-16 22:14:53 -0400
commit: bda30287098ab9002bc4d3e8290ed06c21abec28 (patch)
tree: 17e2c7599c0f2af08accab4363728a4ae2dbbd21 /doc
parent: 144854e542f6f853d10d9efed95de2d1a5025758 (diff)
4 files changed, 58 insertions, 3 deletions
diff --git a/doc/README.md b/doc/README.md
index bc4d2b34..d9b480e1 100644
--- a/doc/README.md
+++ b/doc/README.md
@@ -16,6 +16,7 @@ Primitives:
 - [Join](join.md) (`∾`)
 - [Logical functions](logic.md) (`∧∨¬`)
 - [Prefixes and Suffixes](prefixes.md) (`↑↓`)
+- [Solo, Couple, and Merge](couple.md) (`≍>`)
 - [Transpose](transpose.md) (`⍉`)
 - [Windows](windows.md) (`↕`)
 
diff --git a/doc/couple.md b/doc/couple.md
new file mode 100644
index 00000000..3b11b2b3
--- /dev/null
+++ b/doc/couple.md
@@ -0,0 +1,54 @@
+*View this file with results and syntax highlighting [here](https://mlochbaum.github.io/BQN/doc/couple.html).*
+
+# Couple and Merge
+
+Solo/Couple (`≍`) and Merge (`>`) are functions that create a higher-rank array from lower-rank components. Each takes some number of inner arrays organized in an outer structure, and creates a single array combining all elements of those inner arrays. For example, let's couple two arrays of shape `2‿3`:
+
+        ⊢ p ← 3‿5×⌜↕3
+        ⊢ q ← 2‿3⥊"abcdef"
+        p ≍ q   # p coupled to q
+        ≢ p ≍ q
+
+The result has two inner axes that are shared by `p` and `q`, preceded by an outer axis: length 2 because there are two arguments. Calling `≍` with no left argument does something simpler: because there is one argument, it just adds a length-1 axis to the front. The argument goes solo, becoming the only major cell of the result.
+
+        ≍ q
+        ≢ ≍ q
+
+Merge (`>`) also takes one argument, but a nested one. Its argument is an array of arrays, each with the same shape. The shape of the result is then the outer shape followed by this shared inner shape.
+
+        ⊢ a ← "AB"‿"CD" ∾⌜ "rst"‿"uvw"‿"xyz"
+        > a
+        ≢ > a
+
+Merge is effectively a generalization of Solo and Couple, since Solo is `{>⟨𝕩⟩}` and Couple is `{>⟨𝕨,𝕩⟩}`. Since `≍` works on the "list" of arguments, it can only add one dimension, but `>` can take any number of dimensions as its input.
+
+## Merge and array theory
+
+In all cases what these functions do is more like reinterpreting existing data than creating new information. In fact, if we ignore the shape and look at the ravels of the arrays involved in a call to Merge, we find that it just [joins](join.md) them together. Essentially, Merge is a request to ensure that the inner arrays (which, being independent elements, could be any sort of "ragged" array) can fit together in an array, and then to consider them to be such an array. For this reason, Merge (or a virtual analogue) is used to combine the result cells when calling a function with Rank into a single array.
+
+        ⥊ > a
+        ⥊ ⥊¨ a
+        ∾ ⥊ ⥊¨ a
+
+The way this happens, and the constraint that all inner arrays have the same shape, is closely connected to the concept of an array, and like Table `⌜`, Merge might be considered a fundamental way to build up multidimensional arrays from lists. In both cases scalars are somewhat special. They are the identity element of a function with Table, and can be produced by Merge inverse, `>⁼` **on a list**, which forces either the outer or inner shape to be empty (BQN chooses `>⁼` to be `<`, but only on an array, as `>` cannot produce non-arrays). Merge has another catch as well: it cannot produce arrays with a `0` in the shape, except at the end, without some sort of prototype system.
+
+        ⊢ e ← ⟨⟩¨ ↕3
+        ≢ > e
+        ≢ > > e
+
+Above we start with a list of three empty arrays. After merging once we get a shape `3‿0` array, sure, but what happens next? The shape added by another merge is the shared shape of that array's elements—and there aren't any! If the nested list kept some type information around then we might know, but extra type information is essentially how lists pretend to be arrays. True dynamic lists simply can't represent multidimensional arrays with a `0` in the middle of the shape. In this sense, arrays are a richer model than nested lists.
+
+## Coupling scalars
+
+A note on the topic of Solo and Couple applied to scalars. As always, one axis will be added, so that the result is a list (strangely, J's [laminate](https://code.jsoftware.com/wiki/Vocabulary/commaco#dyadic) differs from Couple in this one case, as it will add an axis to get a shape `2‿1` result). For Solo, this is interchangeable with Deshape (`⥊`), and either primitive might be chosen for stylistic reasons. For Couple, it is equivalent to Join-to (`∾`), but this is an irregular form of Join-to because it is the only case where Join-to adds an axis to both arguments instead of just one. Couple should be preferred in this case.
+
+The pair function, which creates a list from its arguments, can be written `Pair ← ≍○<`, while `≍` in either valence is `>∘Pair`. As an interesting consequence, `≍ ←→ >∘≍○<`, and the same relationship holds for `Pair`.
+
+        ⟨2,3⟩ ≍○< "abc"  # Pair two values
+        ≍○< "abc"        # Pair one(?) value
+
+## Definitions
+
+As discussed above, `≍` is equivalent to `>{⟨𝕩⟩;⟨𝕨,𝕩⟩}`. To complete the picture we should describe Merge fully. Merge is defined on an array argument `𝕩` such that there's some shape `s` satisfying `∧´⥊(s≡≢)¨𝕩`. If `𝕩` is empty then any shape satisfies this expression; `s` should be chosen based on known type information for `𝕩` or otherwise assumed to be `⟨⟩`. If `s` is empty then `𝕩` is allowed to contain non-arrays as well as array scalars, and these will be implicitly promoted to arrays by the `⊑` indexing used later. We construct the result by combining the outer and inner axes of the argument with Table; since the outer axes come first they must correspond to the left argument and the inner axes must correspond to the right argument. `𝕩` is a natural choice of left argument, and because no concrete array can be used, the right argument will be `↕s`, the array of indices into any element of `𝕩`. To get the appropriate element corresponding to a particular choice of index and element of `𝕩` we should select using that index. The result of Merge is `𝕩⊑˜⌜↕s`.
+
+Given this definition we can also describe Rank (`⎉`) in terms of Each (`¨`) and the simpler monadic function Enclose-Rank `<⎉k`. We assume effective ranks `j` for `𝕨` (if present) and `k` for `𝕩` have been computed. Then the correspondence is `𝕨F⎉k𝕩 ←→ >(<⎉j𝕨)F¨(<⎉k𝕩)`.
diff --git a/doc/join.md b/doc/join.md
index 5a58aaac..05171e24 100644
--- a/doc/join.md
+++ b/doc/join.md
@@ -2,7 +2,7 @@
 
 # Join
 
-Join (`∾`) is an extension of the monadic function [Raze](https://aplwiki.com/wiki/Raze) from A+ and J to arbitrary argument ranks. It has the same relationship to Join to, the dyadic function sharing the same glyph, as Merge (`>`) does to Couple (`≍`): `a≍b` is `>a‿b` and `a∾b` is `∾a‿b`. While Merge and Couple combine arrays (the elements of Merge's argument, or the arguments themselves for Couple) along a new leading axis, Join and Join to combine them along the existing leading axis. Both Merge and Join can also be called on a higher-rank array, causing Merge to add multiple leading axes while Join combines elements along multiple existing axes.
+Join (`∾`) is an extension of the monadic function [Raze](https://aplwiki.com/wiki/Raze) from A+ and J to arbitrary argument ranks. It has the same relationship to Join to, the dyadic function sharing the same glyph, as [Merge](couple.md) (`>`) does to Couple (`≍`): `a≍b` is `>a‿b` and `a∾b` is `∾a‿b`. While Merge and Couple combine arrays (the elements of Merge's argument, or the arguments themselves for Couple) along a new leading axis, Join and Join to combine them along the existing leading axis. Both Merge and Join can also be called on a higher-rank array, causing Merge to add multiple leading axes while Join combines elements along multiple existing axes.
 
 Join can be used to combine several strings into a single string, like `array.join()` in Javascript (but it doesn't force the result to be a string).
 
diff --git a/doc/leading.md b/doc/leading.md
index 95fde0fa..35d72295 100644
--- a/doc/leading.md
+++ b/doc/leading.md
@@ -28,7 +28,7 @@ In these three cases above, the results are the same as you would get from trans
         ∾˝ a                  # Join the cells
         ∾˝˘ a                 # Join-insert is a no-op on lists
 
-Solo (`≍`), something of a maverick, manages to act on *zero* leading axes of its argument by creating the first axis of the *result* instead. Because it doesn't need any axis to work, it can go in front of either axis but also past the last one by working with rank 0, a case where most array functions would give an error.
+[Solo](couple.md) (`≍`), something of a maverick, manages to act on *zero* leading axes of its argument by creating the first axis of the *result* instead. Because it doesn't need any axis to work, it can go in front of either axis but also past the last one by working with rank 0, a case where most array functions would give an error.
 
         ≢ ≍ a                 # Solo adds a length-1 axis
         a ≡ ⊏ ≍ a             # First Cell undoes this
@@ -59,7 +59,7 @@ The Each (`¨`) and Table (`⌜`) modifiers return functions which are the same
 
 ## Dyadic functions
 
-For dyadic functions the pattern of working on only one argument axis is not so common. Only two functions can be said to follow it roughly: Join to (`∾`) combines two arrays along one axis, using the first axis of both arguments if they have the same rank and of the higher-rank argument if they differ by one. Couple (`≍`), like Solo, does not manipulate the argument axes but adds a result axis. There are also some functions that can't be limited to leading axes: Reshape (`⥊`) treats the argument as one long list, and Pick (`⊑`) requires each index to be as long as the right argument's rank, because it selects elements and not cells from the right argument.
+For dyadic functions the pattern of working on only one argument axis is not so common. Only two functions can be said to follow it roughly: Join to (`∾`) combines two arrays along one axis, using the first axis of both arguments if they have the same rank and of the higher-rank argument if they differ by one. [Couple](couple.md) (`≍`), like Solo, does not manipulate the argument axes but adds a result axis. There are also some functions that can't be limited to leading axes: Reshape (`⥊`) treats the argument as one long list, and Pick (`⊑`) requires each index to be as long as the right argument's rank, because it selects elements and not cells from the right argument.
 
 ### Multiple axes
author	Marshall Lochbaum <mwlochbaum@gmail.com>	2020-08-16 22:14:53 -0400
committer	Marshall Lochbaum <mwlochbaum@gmail.com>	2020-08-16 22:14:53 -0400
commit	bda30287098ab9002bc4d3e8290ed06c21abec28 (patch)
tree	17e2c7599c0f2af08accab4363728a4ae2dbbd21 /doc
parent	144854e542f6f853d10d9efed95de2d1a5025758 (diff)