From 383298b70274c5ac22eb2100aad7f6cd8eeca02d Mon Sep 17 00:00:00 2001 From: Marshall Lochbaum Date: Sat, 11 Jun 2022 16:55:33 -0400 Subject: Add discussions of the important tacit modifiers to tacit.md --- docs/doc/tacit.html | 30 ++++++++++++++++++++++++------ 1 file changed, 24 insertions(+), 6 deletions(-) (limited to 'docs/doc/tacit.html') diff --git a/docs/doc/tacit.html b/docs/doc/tacit.html index 51d0d82d..52a9eed3 100644 --- a/docs/doc/tacit.html +++ b/docs/doc/tacit.html @@ -1,11 +1,11 @@ - BQN: Tacit (point-free) programming + BQN: Tacit programming -

Tacit (point-free) programming

-

Tacit programming (APL Wiki) is a general term used to refer to ways to define functions that don't refer to arguments directly (say, with identifiers). Instead, tacit programs are built up by combining smaller functions together; we'll discuss the ways BQN offers to combine functions on this page. Since primitive functions like those returning the left () and right () arguments, and selection functions (⊏⊑), are available as building blocks, tacit programming doesn't keep the programmer from pinpointing a specific part of the input, as the description might lead you to believe. Nonetheless, it has its limitations. In larger tacit programs, moving values to the right place is tedious and error-prone because of the lack of a convenient labelling mechanism, and important context tends to disappear in a sea of symbols.

+

Tacit programming

+

Tacit programming (or "point-free" in some other languages) is a term for defining functions without referring to arguments directly, which in BQN means programming without blocks. Instead, tacit programs are built up by combining smaller functions together; we'll discuss the ways BQN offers to combine functions on this page. Since primitive functions like those returning the left () and right () arguments, and selection functions (⊏⊑), are available as building blocks, tacit programming doesn't keep the programmer from pinpointing a specific part of the input, as the description might lead you to believe. Nonetheless, it has its limitations. In larger tacit programs, moving values to the right place is tedious and error-prone because of the lack of a convenient labelling mechanism, and important context tends to disappear in a sea of symbols.

In smaller amounts—portions of a line—tacit programming can be the clearest way to express an idea, particularly when just one or two variables are used a few times. Consider the following three expressions to filter only the positive values from a list:

↗️
    l  05¯21¯3¯4
 
@@ -16,11 +16,11 @@
     0</ l
 ⟨ 5 1 ⟩
 
-

The first of these expressions is the most direct, but with the variable name buried inside, it can't be used on an intermediate value and its input will have to be named. The other two forms stand alone as functions, so they can easily be placed anywhere in a program, even as an operand. But with even the small amount of structure added by a BQN anonymous function, the second method has more organization than action! The third, tacit, version strips away most of the organizing syntax to leave us with the essential pieces 0, <, and / joined by combinators. The explicit function uses 𝕩 as a sort of pronoun ("I want the elements of it where it's greater than zero"), while the tacit one elides it ("give me the elements greater than zero").

-

The ability to easily combine tacit and "explicit" programming such as statements or anonymous functions, far from being only a way to mitigate the disadvantages of these two methods, brings new advantages that no single paradigm could accomplish. Purely tacit programming requires programs to use no local variable names, but partly tacit programming allows them to use fewer names. That means names can be used only for the parts of a program that represent clean, human-understandable concepts. Another possible strategic choice is to use the fact that variables in a tacit expression are expanded as it's formed but those inside a block aren't. So Fa+ can be chosen to "freeze" the value of a in F without having to use an extra variable, while F{a+𝕩} uses the current value of a each time F is called.

+

The first of these expressions is the most direct, but with the variable name buried inside, it can't be used on an intermediate value—its input will have to be named. The other two forms stand alone as functions, so they can easily be placed anywhere in a program, even as an operand. But with even the small amount of structure added by a BQN anonymous function, the second method has more organization than action! The third, tacit, version strips away most of the organizing syntax to leave us with the essential pieces 0, <, and / joined by combinators. The explicit function uses 𝕩 as a sort of pronoun ("I want the elements of it where it's greater than zero"), while the tacit one elides it ("give me the elements greater than zero").

+

The ability to easily combine tacit and "explicit" programming in blocks isn't only a way to mitigate the disadvantages of these two methods, but brings new advantages that no single paradigm could accomplish. Purely tacit programming requires programs to use no named local variables, but partly tacit programming allows them to use fewer names. That means names can be used only for the parts of a program that represent clean, human-understandable concepts. Another possible strategic choice is to use the fact that variables in a tacit expression are expanded as it's formed but those inside a block aren't. So Fa+ can be chosen to "freeze" the value of a in F without having to use an extra variable, while F{a+𝕩} uses the current value of a each time F is called.

The rest of this page describes BQN's tacit programming facilities. Deciding when to use them is a matter of taste, and experience.

Trains

-

In modern APL and its relatives, the backbone of tacit infrastructure is the function train. Trains can take some practice to understand and use well, so they're described in more depth on a dedicated page. The idea of trains is that you can "apply" a function to other functions, forming a composed function where it will actually apply to their results. So a typical use is to pair two functions as shown below: the pair »« is never formed, but the result of applying T is a pair.

+

In modern APL and its relatives, the backbone of tacit infrastructure is the function train. Trains can take some practice to understand and use well, so they're described in more depth on a dedicated page. The idea of trains is that you can "apply" a function to other functions, forming a composed function where it will actually apply to their results. So a typical use is to pair two functions as shown below: the pair »« is never formed, but the result of applying T is a pair.

↗️
    T  »  «    # Pair both shift functions
     T            # Nothing happens yet...
 »⋈«
@@ -41,6 +41,7 @@
 

The functions ⊣⊢ are as simple as they come, but are discussed quite a bit on their own page. A definition is that is {𝕩} and is {𝕩;𝕨}, so that returns its right argument, and returns its left argument but will settle for the right one if there's just one.

Combinators

+

Here's a table of all the typical tacit combinators for reference. The yellow lines show which values are passed to functions in order to evaluate each combinator: we start with values 𝕩 and maybe 𝕨 at the bottom, and the result is the single value that comes out at the top, after applying the functions.

@@ -211,3 +212,20 @@ +

Each of these combinators has its role to play in tacit code. To start with, Constant (˙) is the subtlest one, because it's used everywhere, in a sense, but rarely needs to be made explicit. This is because data values already behave like constant functions. For example, the train 4 might more precisely be written 4˙, but trains conveniently handle this part already. However, it is often used at the end of a train, to force the last part into a function role. ×-1 isn't a train—it just evaluates to ¯1. But ×-1˙ is the train that multiplies its arguments and subtracts 1.

+

Using Self and Swap is pretty straightforward. Swap in particular is often useful if your arguments happen to be the wrong way around. For example, ÷˜⋈| gives you the integer part and remainder when dividing 𝕨 by 𝕩. For compatibility with math, ÷ is "backwards" from a BQN perspective, but ˜ will fix that right up. And while we're at it, let's note that Atop is just another way to write a 2-train, so that ÷˜ is (⌊÷)˜ but avoids the parentheses.

+↗️
    3 (÷˜⋈|) 13  # 13 = 1+3×4
+⟨ 4 1 ⟩
+
+

Before and After are tacit powerhouses. Yes, FG is just F⊣G⊢ and GF is ⊣G F. But the symmetric symbols make these cases a lot easier to read and manipulate as a programmer (just remember, pointy-side function is applied first!). It's common to write big tacit functions with the pattern ()Fn 𝕩, so a lot of processing is applied to 𝕩 and then the result of this is passed along with 𝕩 to Fn. And of course binds like ÷2, dividing by 2, or ¯1», shifting in a ¯1, are very often helpful.

+

Over is a bit weirder, and you'll have to learn when to recognize this pattern (of course, a repeated function is a strong hint). Perhaps you'd compare the first element of two lists with , for example.

+

Conditionals

+

There are two main ways to perform conditional logic in tacit code. Although, a note first: for arithmetic, array operations that use booleans as numbers can often fill in for conditionals, and they're a lot faster. So, to replace every space character with a hyphen, try this function that multiplies the difference between those characters by a mask that's 1 where 𝕩 is a space, then subtracts that back from 𝕩.

+↗️
    " -" (⊢--´⊣×⊑=) "ab cde  f "
+"ab-cde--f-"
+
+

The Repeat () modifier makes a nice "if" conditional. FG, assuming the result of G is a boolean, is equivalent to {𝕨G𝕩?𝕨F𝕩;𝕩}. Note how 𝕨 gets passed in to both functions. Often you'll want 𝔽 to apply to 𝕩 only, and in this case, you need to make this explicit with or similar.

+↗️
    3 (2÷˜)< 7  # halve 𝕩 if greater than 𝕨
+3.5
+
+

For more complicated "if-else" or "select" type conditionals, use Choose (). Watch for ordering here: FG0,G1 puts the two parts in the opposite order to Repeat, and list element 1 comes after element 0 even though it might seem more intuitive for the "true" value to come first.

-- cgit v1.2.3